• Complain

Colette Faucher - Innovations in Intelligent Machines-4 Recent Advances in Knowledge Engineering

Here you can read online Colette Faucher - Innovations in Intelligent Machines-4 Recent Advances in Knowledge Engineering full text of the book (entire story) in english for free. Download pdf and epub, get meaning, cover and reviews about this ebook. City: Cham, publisher: Springer International Publishing, genre: Home and family. Description of the work, (preface) as well as reviews are available. Best literature library LitArk.com created for fans of good reading and offers a wide selection of genres:

Romance novel Science fiction Adventure Detective Science History Home and family Prose Art Politics Computer Non-fiction Religion Business Children Humor

Choose a favorite category and find really read worthwhile books. Enjoy immersion in the world of imagination, feel the emotions of the characters or learn something new for yourself, make an fascinating discovery.

Colette Faucher Innovations in Intelligent Machines-4 Recent Advances in Knowledge Engineering
  • Book:
    Innovations in Intelligent Machines-4 Recent Advances in Knowledge Engineering
  • Author:
  • Publisher:
    Springer International Publishing
  • Genre:
  • City:
    Cham
  • Rating:
    3 / 5
  • Favourites:
    Add to favourites
  • Your mark:
    • 60
    • 1
    • 2
    • 3
    • 4
    • 5

Innovations in Intelligent Machines-4 Recent Advances in Knowledge Engineering: summary, description and annotation

We offer to read an annotation, description, summary or preface (depends on what the author of the book "Innovations in Intelligent Machines-4 Recent Advances in Knowledge Engineering" wrote himself). If you haven't found the necessary information about the book — write in the comments, we will try to find it.

Colette Faucher: author's other books


Who wrote Innovations in Intelligent Machines-4 Recent Advances in Knowledge Engineering? Find out the surname, the name of the author of the book and a list of all author's works by series.

Innovations in Intelligent Machines-4 Recent Advances in Knowledge Engineering — read online for free the complete book (whole text) full work

Below is the text of the book, divided by pages. System saving the place of the last page read, allows you to conveniently read the book "Innovations in Intelligent Machines-4 Recent Advances in Knowledge Engineering" online for free, without having to search again every time where you left off. Put a bookmark, and you can go to the page where you finished reading at any time.

Light

Font size:

Reset

Interval:

Bookmark:

Make
Part 1
Big Data and Ontologies
Colette Faucher and Lakhmi C. Jain (eds.) Studies in Computational Intelligence Innovations in Intelligent Machines-4 2014 Recent Advances in Knowledge Engineering 10.1007/978-3-319-01866-9_1
Springer International Publishing Switzerland 2014
1. Large Scale Text Mining Approaches for Information Retrieval and Extraction
Patrice Bellot 1
(1)
CNRS, Aix-Marseille Universit, LSIS UMR 7296, Av. Esc. Normandie-Niemen, 13397 Marseille cedex 20, France
(2)
iSmart, 565 rue M. Berthelot, 13851 Aix-en-Provence cedex 3, France
(3)
LIA, Universit dAvignon et des Pays de Vaucluse, Agroparc, 84911 Avignon cedex 9, France
Patrice Bellot (Corresponding author)
Email:
Ludovic Bonnefoy
Email:
Vincent Bouvier
Email:
Frdric Duvert
Email:
Young-Min Kim
Email:
Abstract
The issues for Natural Language Processing and Information Retrieval have been studied for long time but the recent availability of very large resources (Web pages, digital documents) and the development of statistical machine learning methods exploiting annotated texts (manual encoding by crowdsourcing is a new major way) have transformed these fields. This allows not limiting these approaches to highly specialized domains and reducing the cost of their implementation. For this chapter, our aim is to present some popular text-mining statistical approaches for information retrieval and information extraction and to discuss the practical limits of actual systems that introduce challenges for future.
1.1 Introduction
Real text mining systems have been developed for finding precise and specific information on large collections of texts from keyword-based or natural language questions. The development of Web searching and question-answering systems that aim to retrieve precise answers corresponds to the combination of methods from computational linguistics, graph analysis, machine learning and, of course, information extraction and information retrieval. The most recent systems tend to combine statistical machine learning methods along with symbolic and rule-based approaches. The challenge is to get robust efficient systems that can self-adapt to users needs and profiles, to dynamic collections composed of unstructured or weakly structured and interconnected documents.
Numerical methods based on a statistical study of corpus, have proven their ability to adapt quickly to different themes and languages and are very popular on Web search systems. This was done despite some approximation in the results that would be more accurate by using symbolic rule-based approaches as long as the rules would be fair, complete and unambiguous while natural language is absolutely ambiguous. On the other hand, Natural Language Processing (NLP) tasks such named entity recognition (automatic extraction of the names of things in texts: person or company names, locations) and part-of-speech tagging (automatic labeling of words according to their lexical classes: noun, verbs, adjectives) can be realized both by means of symbolic approach that rely on hand-coded rules, grammar and gazetteers and by following supervised machine learning-based approaches that need large quantities of manually annotated corpus. For Named Entity Recognition, symbolic systems tend to produce better results on well-formed texts [].
This chapter is composed of two main sections dedicated to text mining, information retrieval and information extraction. We begin by a general discussion about symbolic and numerical approaches for Natural Language Processing then we present the most popular tasks that have been evaluated through annual conferences for twenty years along with some resources one can employ to process texts. In the first section, we present some classical information retrieval models for document and Web searching and the principles of semantic information retrieval that can exploit specialized lexicon, thesaurus or ontologies. The second section introduces natural language processing for information extraction. We outline the most effective approaches for question-answering systems and semantic tagging of texts (named entities recognition, pattern extraction): rule and lexicon based approaches and machine learning approaches (Hidden Markov Models and Conditional Random Fields). Then, we present some approaches that aim to find information about entities and to populate knowledge bases. In this section, we describe the approaches we proposed and experimented in the last few years. The last section is dedicated to some industrial applications we work on and that respectively relate to digital libraries, marketing and dialog systems.
1.2 Symbolic and Numerical Approaches for Natural Language Processing
Symbolic and numerical approaches for natural language processing (NLP), have long been opposed, both linked to scientific communities distinguished by differing goals between limited but accurate prototypes and rough but functional systems and by attitudes more or less pragmatic. This opposition joined in some way the one that opposed and opposes always, some linguists and philosophers on the nature of language and its acquisition that is, in simplified terms, the pre-existence of a system (cognitive one) generating rules of possible sentences in a language (or at least the degree of pre-existence). This debate on the nature and role of grammar has its origins in the 17th century between proponents of empiricism (the human being is born empty and is fully shaped by experience) and rationalism (the man can not be reduced to experience). In the 1950s, the behaviorists, empiricist, attempted to define the acquisition of language learning as a form of chain reaction from positive reinforcement or negative one. In contrast, Chomsky proposed the pre-existence of mechanisms in the human brain that are specific to language and that could distinguish humans from other species. That suggests language is something really organic [] even in the very beginning of the life and that language learning does not only rely on association between stimuli and responses.
A direct consequence of the pre-existence of a minimalist program (generative grammar) in language acquisition has been to define both a universal grammar expressing linguistic universals and particular grammars for the specificities of given languages [].
The learning process can be viewed from two points of view, namely statistical learning or analytical (both unconscious and possibly combined in human mind). In the first case, the child has to observe which language productions lead to the goal (s)he fixed and, on a rolling basis, to accumulate a kind of accounting of what succeeds and what fails and allows him/her to achieve a selection of possibilities. Computational neural networks, involving different layers more or less explicit, linking lexicon, concepts and sounds, might simulate this kind of learning. In that sense, any communicational intention (any goal) and any linguistic production might be seen as specific paths within the neural network. The success or failure would result in strengthening or weakening connections and the issue of convergence during learning step arises as well as the reduction of combinatorial.
In the second case, learning is a progressive refinement of the value of the parameters of universal grammar allowing the production and the understanding, of the statements it can generate. Note that in the first case (statistical learning), a rule-based grammar can still be induced once the system is stabilized. It would allow producing explicit structural patterns, which would be necessary to improve consistency over time and mutual understanding between two dialoging people.
Next page
Light

Font size:

Reset

Interval:

Bookmark:

Make

Similar books «Innovations in Intelligent Machines-4 Recent Advances in Knowledge Engineering»

Look at similar books to Innovations in Intelligent Machines-4 Recent Advances in Knowledge Engineering. We have selected literature similar in name and meaning in the hope of providing readers with more options to find new, interesting, not yet read works.


Reviews about «Innovations in Intelligent Machines-4 Recent Advances in Knowledge Engineering»

Discussion, reviews of the book Innovations in Intelligent Machines-4 Recent Advances in Knowledge Engineering and just readers' own opinions. Leave your comments, write what you think about the work, its meaning or the main characters. Specify what exactly you liked and what you didn't like, and why you think so.