• Complain

Chen Ye - Knowledge Discovery from Multi-Sourced Data

Here you can read online Chen Ye - Knowledge Discovery from Multi-Sourced Data full text of the book (entire story) in english for free. Download pdf and epub, get meaning, cover and reviews about this ebook. City: Singapore, year: 2022, publisher: Springer, genre: Politics. Description of the work, (preface) as well as reviews are available. Best literature library LitArk.com created for fans of good reading and offers a wide selection of genres:

Romance novel Science fiction Adventure Detective Science History Home and family Prose Art Politics Computer Non-fiction Religion Business Children Humor

Choose a favorite category and find really read worthwhile books. Enjoy immersion in the world of imagination, feel the emotions of the characters or learn something new for yourself, make an fascinating discovery.

Chen Ye Knowledge Discovery from Multi-Sourced Data
  • Book:
    Knowledge Discovery from Multi-Sourced Data
  • Author:
  • Publisher:
    Springer
  • Genre:
  • Year:
    2022
  • City:
    Singapore
  • Rating:
    4 / 5
  • Favourites:
    Add to favourites
  • Your mark:
    • 80
    • 1
    • 2
    • 3
    • 4
    • 5

Knowledge Discovery from Multi-Sourced Data: summary, description and annotation

We offer to read an annotation, description, summary or preface (depends on what the author of the book "Knowledge Discovery from Multi-Sourced Data" wrote himself). If you haven't found the necessary information about the book — write in the comments, we will try to find it.

This book addresses several knowledge discovery problems on multi-sourced data where the theories, techniques, and methods in data cleaning, data mining, and natural language processing are synthetically used. This book mainly focuses on three data models: the multi-sourced isomorphic data, the multi-sourced heterogeneous data, and the text data. On the basis of three data models, this book studies the knowledge discovery problems including truth discovery and fact discovery on multi-sourced data from four important properties: relevance, inconsistency, sparseness, and heterogeneity, which is useful for specialists as well as graduate students. Data, even describing the same object or event, can come from a variety of sources such as crowd workers and social media users. However, noisy pieces of data or information are unavoidable. Facing the daunting scale of data, it is unrealistic to expect humans to label or tell which data source is more reliable. Hence, it is crucial to identify trustworthy information from multiple noisy information sources, referring to the task of knowledge discovery. At present, the knowledge discovery research for multi-sourced data mainly faces two challenges. On the structural level, it is essential to consider the different characteristics of data composition and application scenarios and define the knowledge discovery problem on different occasions. On the algorithm level, the knowledge discovery task needs to consider different levels of information conflicts and design efficient algorithms to mine more valuable information using multiple clues. Existing knowledge discovery methods have defects on both the structural level and the algorithm level, making the knowledge discovery problem far from totally solved.

Chen Ye: author's other books


Who wrote Knowledge Discovery from Multi-Sourced Data? Find out the surname, the name of the author of the book and a list of all author's works by series.

Knowledge Discovery from Multi-Sourced Data — read online for free the complete book (whole text) full work

Below is the text of the book, divided by pages. System saving the place of the last page read, allows you to conveniently read the book "Knowledge Discovery from Multi-Sourced Data" online for free, without having to search again every time where you left off. Put a bookmark, and you can go to the page where you finished reading at any time.

Light

Font size:

Reset

Interval:

Bookmark:

Make
Contents
Landmarks
Book cover of Knowledge Discovery from Multi-Sourced Data SpringerBriefs in - photo 1
Book cover of Knowledge Discovery from Multi-Sourced Data
SpringerBriefs in Computer Science
Series Editors
Stan Zdonik
Brown University, Providence, RI, USA
Shashi Shekhar
University of Minnesota, Minneapolis, MN, USA
Xindong Wu
University of Vermont, Burlington, VT, USA
Lakhmi C. Jain
University of South Australia, Adelaide, SA, Australia
David Padua
University of Illinois Urbana-Champaign, Urbana, IL, USA
Xuemin Sherman Shen
University of Waterloo, Waterloo, ON, Canada
Borko Furht
Florida Atlantic University, Boca Raton, FL, USA
V. S. Subrahmanian
University of Maryland, College Park, MD, USA
Martial Hebert
Carnegie Mellon University, Pittsburgh, PA, USA
Katsushi Ikeuchi
University of Tokyo, Tokyo, Japan
Bruno Siciliano
Universit di Napoli Federico II, Napoli, Italy
Sushil Jajodia
George Mason University, Fairfax, VA, USA
Newton Lee
Institute for Education, Research and Scholarships, Los Angeles, CA, USA

SpringerBriefs present concise summaries of cutting-edge research and practical applications across a wide spectrum of fields. Featuring compact volumes of 50 to 125 pages, the series covers a range of content from professional to academic.

Typical topics might include:

  • A timely report of state-of-the art analytical techniques

  • A bridge between new research results, as published in journal articles, and a contextual literature review

  • A snapshot of a hot or emerging topic

  • An in-depth case study or clinical example

  • A presentation of core concepts that students must understand in order to make independent contributions

Briefs allow authors to present their ideas and readers to absorb them with minimal time investment. Briefs will be published as part of Springers eBook collection, with millions of users worldwide. In addition, Briefs will be available for individual print and electronic purchase. Briefs are characterized by fast, global electronic dissemination, standard publishing contracts, easy-to-use manuscript preparation and formatting guidelines, and expedited production schedules. We aim for publication 812 weeks after acceptance. Both solicited and unsolicited manuscripts are considered for publication in this series.

**Indexing: This series is indexed in Scopus, Ei-Compendex, and zbMATH **

Chen Ye , Hongzhi Wang and Guojun Dai
Knowledge Discovery from Multi-Sourced Data
Logo of the publisher Chen Ye Computer and Software Department Hangzhou - photo 2
Logo of the publisher
Chen Ye
Computer and Software Department, Hangzhou Dianzi University, Hangzhou, China
Hongzhi Wang
Computer Science and Technology, Harbin Institute of Technology, Harbin, China
Guojun Dai
Computer and Software Department, Hangzhou Dianzi University, Hangzhou, China
ISSN 2191-5768 e-ISSN 2191-5776
SpringerBriefs in Computer Science
ISBN 978-981-19-1878-0 e-ISBN 978-981-19-1879-7
https://doi.org/10.1007/978-981-19-1879-7
The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd. 2022
This work is subject to copyright. All rights are solely and exclusively licensed by the Publisher, whether the whole or part of the material is concerned, specifically the rights of translation, reprinting, reuse of illustrations, recitation, broadcasting, reproduction on microfilms or in any other physical way, and transmission or information storage and retrieval, electronic adaptation, computer software, or by similar or dissimilar methodology now known or hereafter developed.
The use of general descriptive names, registered names, trademarks, service marks, etc. in this publication does not imply, even in the absence of a specific statement, that such names are exempt from the relevant protective laws and regulations and therefore free for general use.
The publisher, the authors, and the editors are safe to assume that the advice and information in this book are believed to be true and accurate at the date of publication. Neither the publisher nor the authors or the editors give a warranty, expressed or implied, with respect to the material contained herein or for any errors or omissions that may have been made. The publisher remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

This Springer imprint is published by the registered company Springer Nature Singapore Pte Ltd.

The registered company address is: 152 Beach Road, #21-01/04 Gateway East, Singapore 189721, Singapore

This book is dedicated to all contributors in this field.

Preface

With the rapid development of information technology, all areas have ushered in the era of big data. One big challenge in analyzing the overwhelming generated data is the veracity of the data. Data, even describing the same object or event, can come from a variety of sources such as crowd workers and social media users. However, noisy pieces of data or information are unavoidable. Facing the daunting scale of data, it is unrealistic to expect humans to label or tell which data source is more reliable. Hence, it is crucial to identify trustworthy information from multiple noisy information sources, referring to the task of knowledge discovery.

At present, the knowledge discovery research for multi-sourced data mainly faces two challenges. On the structural level, it is essential to consider the different characteristics of data composition and application scenarios and define the knowledge discovery problem on different occasions. On the algorithm level, the knowledge discovery task needs to consider different levels of information conflicts and design efficient algorithms to mine more valuable information using multiple clues. Existing knowledge discovery methods have defects on both the structural level and the algorithm level, making the knowledge discovery problem far from totally solved.

In this book, the theories, techniques, and methods in data cleaning, data mining, and natural language processing are synthetically used to study the knowledge discovery problem on multi-source data. This book mainly focuses on three data models: the first is multi-source isomorphic data, which has a clear and significant entity-attribute-source structure; the second is multi-source heterogeneous data, where the entities and attributes from different sources may have various representations; and the third is text data, which does not intuitively reflect the entity-attribute-source structure and contains a lot of irrelevant words. On the basis of three data models, this book studies the knowledge discovery problems including truth discovery, pattern discovery, and fact discovery on multi-source data from four important properties: relevance, inconsistency, sparseness, and heterogeneity. We hope the proposed ideas in this book can inspire researchers in both academics and industry, and further prompt them to join the field of knowledge discovery.

Acknowledgements

This book was partially supported by the Fundamental Research Funds for the Provincial Universities of Zhejiang (No. GK219909299001-011), the Natural Science Foundation of Zhejiang Province (No. KYZ054122042CZ), and the National Key Research and Development Program of China (No. 2017YFE0118200).

Next page
Light

Font size:

Reset

Interval:

Bookmark:

Make

Similar books «Knowledge Discovery from Multi-Sourced Data»

Look at similar books to Knowledge Discovery from Multi-Sourced Data. We have selected literature similar in name and meaning in the hope of providing readers with more options to find new, interesting, not yet read works.


Reviews about «Knowledge Discovery from Multi-Sourced Data»

Discussion, reviews of the book Knowledge Discovery from Multi-Sourced Data and just readers' own opinions. Leave your comments, write what you think about the work, its meaning or the main characters. Specify what exactly you liked and what you didn't like, and why you think so.