• Complain

Sheng Li - Robust Representation for Data Analytics: Models and Applications

Here you can read online Sheng Li - Robust Representation for Data Analytics: Models and Applications full text of the book (entire story) in english for free. Download pdf and epub, get meaning, cover and reviews about this ebook. year: 2017, publisher: Springer, genre: Romance novel. Description of the work, (preface) as well as reviews are available. Best literature library LitArk.com created for fans of good reading and offers a wide selection of genres:

Romance novel Science fiction Adventure Detective Science History Home and family Prose Art Politics Computer Non-fiction Religion Business Children Humor

Choose a favorite category and find really read worthwhile books. Enjoy immersion in the world of imagination, feel the emotions of the characters or learn something new for yourself, make an fascinating discovery.

Sheng Li Robust Representation for Data Analytics: Models and Applications
  • Book:
    Robust Representation for Data Analytics: Models and Applications
  • Author:
  • Publisher:
    Springer
  • Genre:
  • Year:
    2017
  • Rating:
    3 / 5
  • Favourites:
    Add to favourites
  • Your mark:
    • 60
    • 1
    • 2
    • 3
    • 4
    • 5

Robust Representation for Data Analytics: Models and Applications: summary, description and annotation

We offer to read an annotation, description, summary or preface (depends on what the author of the book "Robust Representation for Data Analytics: Models and Applications" wrote himself). If you haven't found the necessary information about the book — write in the comments, we will try to find it.

This book introduces the concepts and models of robust representation learning, and provides a set of solutions to deal with real-world data analytics tasks, such as clustering, classification, time series modeling, outlier detection, collaborative filtering, community detection, etc. Three types of robust feature representations are developed, which extend the understanding of graph, subspace, and dictionary.Leveraging the theory of low-rank and sparse modeling, the authors develop robust feature representations under various learning paradigms, including unsupervised learning, supervised learning, semi-supervised learning, multi-view learning, transfer learning, and deep learning.Robust Representations for Data Analytics covers a wide range of applications in the research fields of big data, human-centered computing, pattern recognition, digital marketing, web mining, and computer vision.

Sheng Li: author's other books


Who wrote Robust Representation for Data Analytics: Models and Applications? Find out the surname, the name of the author of the book and a list of all author's works by series.

Robust Representation for Data Analytics: Models and Applications — read online for free the complete book (whole text) full work

Below is the text of the book, divided by pages. System saving the place of the last page read, allows you to conveniently read the book "Robust Representation for Data Analytics: Models and Applications" online for free, without having to search again every time where you left off. Put a bookmark, and you can go to the page where you finished reading at any time.

Light

Font size:

Reset

Interval:

Bookmark:

Make
Springer International Publishing AG 2017
Sheng Li and Yun Fu Robust Representation for Data Analytics Advanced Information and Knowledge Processing
1. Introduction
Sheng Li 1 and Yun Fu 1
(1)
Northeastern University, Boston, MA, USA
Abstract
High-dimensional and large-scale data are everywhere. Nowadays, the wide deployment of low-cost sensors has made it possible to continuously collect measurements for different purposes. Several real-world scenarios are: In Entertainment , the consumer devices such as Kinect could capture and process visual information and motion data in real time; In Social Network , people refine, upload, and comment on images or videos that are captured by mobile devices with high-definition (HD) cameras; In Visual Surveillance , the cameras deployed in public spaces are capturing the pedestrians and objects, in order to identify abnormal behaviors; In Digital Marketing , the online responses from customers could be collected from different channels, and integrated together to provide better recommendations; In Healthcare , the medical sensors for patient monitoring would provide a comprehensive understanding of patients. Learning representations from high-dimensional data collected in these scenarios drives intelligent applications. In this chapter, we introduce some critical issues in representation learning, describe the robust data representations, and present the organization of this book.
Extracting informative features and inferring knowledge from big data is considered as an essential objective of data analytics. Particularly, learning feature representations plays a major role, as it mitigates the gap between low-level observed data and high-level semantic knowledge. To deal with the high-dimensional and large-scale data, intelligent and efficient representation learning models are highly desired. In the past decades, data mining and machine learning researchers have made significant progress toward modeling data for different analytics tasks. Two critical issues should be carefully addressed when designing the models. The first one is scalability . Data collected in real-world applications are keep growing in terms of data size and dimensionality. For instance, according to some statistics on social networks, every minute, over 400h of videos are uploaded to YouTube, and more than 50,000 photos are presented to Instagram. A successful data mining model would be able to deal with a large amount of high-dimensional data efficiently. The second issue is model robustness . Many traditional models, especially the statistical learning based ones, pose strong assumptions on the underlying distribution of data. However, the data captured in real-world might be corrupted or contaminated with severe noise, which violates the underlying assumptions. Therefore, it is of great importance to develop robust data mining models that can learn robust data representations from data with uncertainty. This book presents the concepts, models, and applications of robust data representations.
1.1 What Are Robust Data Representations?
To understand the robustness of data representations, we first discuss the possible types of uncertainty that might be found in real-world data. In particular, we consider the uncertain data observations in a general sense. The uncertainty might be:
  • Gaussian noise;
  • Random corruptions;
  • Missing values in data, due to data loss during transmission, etc.
  • Outliers or anomalies;
  • Uncertainty within one modality;
  • Uncertainty across multiple modalities.
The first four types are well aligned with the traditional interpretations of data uncertainty in the literature, while the last two are considered as special cases of uncertainty in a more general sense. In common settings of data analytics, one object usually has multiple instances, such as multiple face images of the same person. If the multiple instances are from the same modality, variations on appearance may introduce uncertain information. For example, face images from the same person may have expression variations or illumination changes. For the last one, if the object is captured as multiple modalities using different types of sensors, the variations across different modalities would introduce another level of uncertainty.
It has been extensively demonstrated that exploiting the low-dimensional structure from high-dimensional data will greatly benefit the data analytics tasks. Particularly, recent advances on low-rank and sparse modeling have shown promising performance on recovering clean data from noisy observations, by discovering the low-dimensional subspace structures. This observation motivate us to develop new models for extracting robust data representations. The research objectives are twofold: (1) learning robust data representations from data with uncertainty, by exploiting the low-dimensional subspace structures; (2) evaluating the performance of the learned representations on a wide rage of real-world data analytics tasks.
Four categories of data representations are studied in this book, including graph , subspace , dictionary and latent factor . Robust data representations have been developed under each of the four categories. First, two novel graph construction schemes are introduced, by integrating the low-rank modeling with graph sparsification strategies. Each sample is represented in the low-rank coding space. And it is revealed that the similarity measurement in the low-rank coding space is more robust than that in the original sample space. The robust graphs could greatly enhance the performance of graph based clustering and semi-supervised classification. Second, low-dimensional discriminative subspaces are learned in single-view and multi-view scenarios, respectively. A single-view robust subspace discovery model is motivated from low-rank modeling and Fisher criterion, and it is able to accurately classify the noisy images. In addition, a multi-view subspace learning model is designed for extracting compact features from multimodal time series data, which leverages a shared latent space and fuses information from multiple data views. Third, dictionary serves as expressive bases for characterizing visual data. A robust dictionary learning method is designed to transfer knowledge from source domain to a target domain with limited training samples. A cross-view dictionary learning framework is presented to model the view consistency and extract robust features for images from two camera views. Fourth, latent factors, as compact representations of high-dimensional features, are extracted for the tasks of response prediction and collaborative filtering.
From the perspective of machine learning paradigms, this book covers clustering, semi-supervised learning, classification, multi-view learning, time-series modeling, graph mining, subspace learning, dictionary learning, transfer learning, and deep learning. The proposed models have obtained remarkable improvements on many real-world data analytics tasks, including image clustering, object recognition, face recognition, kinship verification, recommender system, outlier detection, person re-identification, and response prediction.
1.2 Organization of the Book
The rest of this book is organized as follows.
Part I focus on developing the robust representation models by learning robust graphs, robust subspaces, and robust dictionary. It consists of the following five chapters. Chapter presents the fundamentals of robust representations, which covers the overviews of existing representation learning and robust representation methods. The advantages and disadvantages of these existing methods are also discussed.
Next page
Light

Font size:

Reset

Interval:

Bookmark:

Make

Similar books «Robust Representation for Data Analytics: Models and Applications»

Look at similar books to Robust Representation for Data Analytics: Models and Applications. We have selected literature similar in name and meaning in the hope of providing readers with more options to find new, interesting, not yet read works.


Reviews about «Robust Representation for Data Analytics: Models and Applications»

Discussion, reviews of the book Robust Representation for Data Analytics: Models and Applications and just readers' own opinions. Leave your comments, write what you think about the work, its meaning or the main characters. Specify what exactly you liked and what you didn't like, and why you think so.