
Chen Ye - Knowledge Discovery from Multi-Sourced Data


Book:
Knowledge Discovery from Multi-Sourced Data
Author:
Chen Ye / Hongzhi Wang / Guojun Dai
Publisher:
Springer
Genre:
Books / Politics
Year:
2022
City:
Singapore

Knowledge Discovery from Multi-Sourced Data: summary, description and annotation


This book addresses several knowledge discovery problems on multi-sourced data, combining theories, techniques, and methods from data cleaning, data mining, and natural language processing. It focuses on three data models: multi-sourced isomorphic data, multi-sourced heterogeneous data, and text data. On the basis of these three data models, the book studies knowledge discovery problems, including truth discovery and fact discovery, on multi-sourced data with respect to four important properties: relevance, inconsistency, sparseness, and heterogeneity. It is intended for specialists as well as graduate students.

Data, even when describing the same object or event, can come from a variety of sources such as crowd workers and social media users. However, noisy pieces of data or information are unavoidable. Given the daunting scale of the data, it is unrealistic to expect humans to label it or to tell which data source is more reliable. Hence, it is crucial to identify trustworthy information from multiple noisy information sources, a task referred to as knowledge discovery.

At present, knowledge discovery research for multi-sourced data faces two main challenges. On the structural level, it is essential to consider the different characteristics of data composition and application scenarios and to define the knowledge discovery problem for each setting. On the algorithm level, the knowledge discovery task needs to consider different levels of information conflict and to design efficient algorithms that mine more valuable information using multiple clues. Existing knowledge discovery methods have shortcomings on both the structural level and the algorithm level, leaving the knowledge discovery problem far from fully solved.


SpringerBriefs in Computer Science

Series Editors

Stan Zdonik

Brown University, Providence, RI, USA

Shashi Shekhar

University of Minnesota, Minneapolis, MN, USA

Xindong Wu

University of Vermont, Burlington, VT, USA

Lakhmi C. Jain

University of South Australia, Adelaide, SA, Australia

David Padua

University of Illinois Urbana-Champaign, Urbana, IL, USA

Xuemin Sherman Shen

University of Waterloo, Waterloo, ON, Canada

Borko Furht

Florida Atlantic University, Boca Raton, FL, USA

V. S. Subrahmanian

University of Maryland, College Park, MD, USA

Martial Hebert

Carnegie Mellon University, Pittsburgh, PA, USA

Katsushi Ikeuchi

University of Tokyo, Tokyo, Japan

Bruno Siciliano

Università di Napoli Federico II, Napoli, Italy

Sushil Jajodia

George Mason University, Fairfax, VA, USA

Newton Lee

Institute for Education, Research and Scholarships, Los Angeles, CA, USA

SpringerBriefs present concise summaries of cutting-edge research and practical applications across a wide spectrum of fields. Featuring compact volumes of 50 to 125 pages, the series covers a range of content from professional to academic.

Typical topics might include:

A timely report of state-of-the-art analytical techniques
A bridge between new research results, as published in journal articles, and a contextual literature review
A snapshot of a hot or emerging topic
An in-depth case study or clinical example
A presentation of core concepts that students must understand in order to make independent contributions

Briefs allow authors to present their ideas and readers to absorb them with minimal time investment. Briefs will be published as part of Springer's eBook collection, with millions of users worldwide. In addition, Briefs will be available for individual print and electronic purchase. Briefs are characterized by fast, global electronic dissemination, standard publishing contracts, easy-to-use manuscript preparation and formatting guidelines, and expedited production schedules. We aim for publication 8–12 weeks after acceptance. Both solicited and unsolicited manuscripts are considered for publication in this series.

**Indexing: This series is indexed in Scopus, Ei-Compendex, and zbMATH**

Chen Ye, Hongzhi Wang and Guojun Dai

Knowledge Discovery from Multi-Sourced Data

Chen Ye

Computer and Software Department, Hangzhou Dianzi University, Hangzhou, China

Hongzhi Wang

Computer Science and Technology, Harbin Institute of Technology, Harbin, China

Guojun Dai

Computer and Software Department, Hangzhou Dianzi University, Hangzhou, China

ISSN 2191-5768 e-ISSN 2191-5776

SpringerBriefs in Computer Science

ISBN 978-981-19-1878-0 e-ISBN 978-981-19-1879-7

https://doi.org/10.1007/978-981-19-1879-7

© The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd. 2022

This work is subject to copyright. All rights are solely and exclusively licensed by the Publisher, whether the whole or part of the material is concerned, specifically the rights of translation, reprinting, reuse of illustrations, recitation, broadcasting, reproduction on microfilms or in any other physical way, and transmission or information storage and retrieval, electronic adaptation, computer software, or by similar or dissimilar methodology now known or hereafter developed.

The use of general descriptive names, registered names, trademarks, service marks, etc. in this publication does not imply, even in the absence of a specific statement, that such names are exempt from the relevant protective laws and regulations and therefore free for general use.

The publisher, the authors, and the editors are safe to assume that the advice and information in this book are believed to be true and accurate at the date of publication. Neither the publisher nor the authors or the editors give a warranty, expressed or implied, with respect to the material contained herein or for any errors or omissions that may have been made. The publisher remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

This Springer imprint is published by the registered company Springer Nature Singapore Pte Ltd.

The registered company address is: 152 Beach Road, #21-01/04 Gateway East, Singapore 189721, Singapore

This book is dedicated to all contributors in this field.

Preface

With the rapid development of information technology, every field has entered the era of big data. One major challenge in analyzing the overwhelming volume of generated data is its veracity. Data, even when describing the same object or event, can come from a variety of sources such as crowd workers and social media users. However, noisy pieces of data or information are unavoidable. Given the daunting scale of the data, it is unrealistic to expect humans to label it or to tell which data source is more reliable. Hence, it is crucial to identify trustworthy information from multiple noisy information sources, a task referred to as knowledge discovery.

At present, knowledge discovery research for multi-sourced data faces two main challenges. On the structural level, it is essential to consider the different characteristics of data composition and application scenarios and to define the knowledge discovery problem for each setting. On the algorithm level, the knowledge discovery task needs to consider different levels of information conflict and to design efficient algorithms that mine more valuable information using multiple clues. Existing knowledge discovery methods have shortcomings on both the structural level and the algorithm level, leaving the knowledge discovery problem far from fully solved.

In this book, theories, techniques, and methods from data cleaning, data mining, and natural language processing are combined to study the knowledge discovery problem on multi-sourced data. The book focuses on three data models: the first is multi-sourced isomorphic data, which has a clear and significant entity-attribute-source structure; the second is multi-sourced heterogeneous data, where the entities and attributes from different sources may have various representations; and the third is text data, which does not intuitively reflect the entity-attribute-source structure and contains many irrelevant words. On the basis of these three data models, the book studies knowledge discovery problems, including truth discovery, pattern discovery, and fact discovery, on multi-sourced data with respect to four important properties: relevance, inconsistency, sparseness, and heterogeneity. We hope the ideas proposed in this book can inspire researchers in both academia and industry and encourage them to join the field of knowledge discovery.

Acknowledgements

This book was partially supported by the Fundamental Research Funds for the Provincial Universities of Zhejiang (No. GK219909299001-011), the Natural Science Foundation of Zhejiang Province (No. KYZ054122042CZ), and the National Key Research and Development Program of China (No. 2017YFE0118200).
