Shahi - Apache Solr

Apache Solr, by Dikshant Shahi. Berkeley, CA: Apress, 2016.


© Dikshant Shahi 2015
Dikshant Shahi, Apache Solr, 10.1007/978-1-4842-1070-3_1
1. Apache Solr: An Introduction
Dikshant Shahi (Karnataka, India)
Electronic supplementary material
The online version of this chapter (doi: 10.1007/978-1-4842-1070-3_1) contains supplementary material, which is available to authorized users.
Search is ubiquitous and has become part of our digital life! You open your browser, and it is highly likely that your home page is your preferred search engine. The search engine of your smartphone is just a swipe away. Open an e-commerce web site, and the search box appears right at the center or top of the page. Basically, every portal that has data to expose needs a search engine. This shows how critical a search engine is for an enterprise or for a product that it's building.
Enterprises today have a tremendous amount of information to expose, and this volume will keep growing. It's tedious and nearly impossible to browse through all the data to find relevant information. The only solution to the problem is a search engine.
If you have lots of data but struggle to find the information you want, when you need it and in the form you need it, then that data is almost worthless, and it's time to develop or upgrade your search engine. Because you are reading this chapter, I assume that you are building or planning to build a search engine, and that you understand why you need one for your enterprise. So without much ado, let's proceed.
The goal of this book is to help you develop a search engine by using the open source Apache Solr. First you need to understand that each search engine is different and has its own unique challenges that need to be addressed differently. Over the course of this book, we will explore common problems and their solutions and provide approaches to resolving specific problems.
This chapter starts with an introduction to Apache Solr and provides a high-level view of its features. It touches upon several facets of Solr so that you have a comprehensive understanding before starting on your proof of concept or development.
This chapter covers the following topics:
  • Important features of Solr
  • Important components of Solr
  • Use of Solr beyond its search engine capabilities
  • Comparison of Solr with other solutions
  • Technologies used in Solr's ecosystem
Overview
Solr (pronounced solar) is an enterprise-ready, blazingly fast, and highly scalable search platform built on Apache Lucene. Backed by a vibrant community, Solr has matured over the years to the point where it provides all the much-needed features out of the box, along with provisions to customize them. It is cloud ready and built for robustness, fault tolerance, and reliability.
Solr is written in Java and runs as a stand-alone server. Getting started is easy: all you need to do is run a script with a start or stop command. Because Solr is purely configuration based, it requires only that you register the components with each other. Even without any knowledge of Java, you can build a decent search engine, as long as you don't need customization. Solr also provides a graphical administration interface for easy monitoring, which you can open in a browser by pointing it at the port on which Solr is running.
Note
Apache Lucene is a popular open source search library written entirely in Java. It is widely used for indexing a large collection of documents and supporting full-text search.
Inside Solr
To choose a framework, you evaluate it on aspects such as internals, features, capabilities, usability, performance, and scalability. This section answers some of your questions, including what's inside Solr, how to configure it, and how to use it. The features covered in this section might not be what makes Solr so popular, but they're important to know as part of your basic understanding of Solr. The next section covers the features that make Solr popular; together, the two sections give you a comprehensive view of Solr's features. The following points provide a quick overview:
  • Inverted index: Lucene builds an inverted index of the documents that you add to Solr, and at query time it searches the index for matching documents. You can think of an inverted index as similar to the index at the end of this book.
  • Vector space model: By default, Lucene uses the vector space model (VSM) along with the Boolean model to determine the relevance of a document with respect to a user query. In a nutshell, the Boolean model approves (it decides which documents match), and the VSM ranks the approved documents.
  • Config-based: solrconfig.xml and schema.xml are the two primary configuration files of Solr. The schema.xml file primarily defines the fields of your schema and the behavior of those fields (how the text will be tokenized while indexing and querying). Almost everything else goes in solrconfig.xml. You can also go schemaless and let Solr create fields automatically while indexing the data. The configurations can be edited manually or modified dynamically by calling the respective APIs. As of Solr 5.0, you can even upload JAR files through an API call.
  • Analysis chain: Your search query and the documents being indexed pass through an analysis chain, typically a tokenizer followed by a series of token filters, with the output of each stage fed to the next one in the chain. The output of the last stage is the term that is indexed and matched against.
  • Java: Solr and Lucene are written in Java. Solr 5.0 requires Java 7+ to run. To customize any feature of Solr, you need to extend the appropriate Java class.
  • SolrJ: Solr bundles a Java client library that can be used for indexing documents and querying for results. Libraries are also available for other languages such as Perl and Python.
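To make the first points above more concrete, the following is a minimal Python sketch of how an analysis chain, an inverted index, and the "Boolean model approves, VSM ranks" idea fit together. It is a toy model, not Lucene's implementation: the function names, the stop-word list, the choice of AND semantics for the Boolean step, and the simplified TF-IDF score (an unnormalized dot product) are all assumptions made for illustration.

```python
import math
from collections import Counter, defaultdict

# Assumed toy stop-word list; real analyzers are configured in the schema.
STOP_WORDS = {"a", "an", "the", "is", "of", "and"}


def analyze(text):
    """Toy analysis chain: whitespace tokenizer, then lowercase and stop-word filters."""
    tokens = text.split()                                # tokenizer
    tokens = [t.lower().strip(".,!?") for t in tokens]   # lowercase filter, punctuation trimmed
    return [t for t in tokens if t and t not in STOP_WORDS]  # stop-word filter


def build_index(docs):
    """Inverted index: term -> {doc_id: term frequency}."""
    index = defaultdict(dict)
    for doc_id, text in docs.items():
        for term, tf in Counter(analyze(text)).items():
            index[term][doc_id] = tf
    return index


def search(index, num_docs, query):
    """Boolean AND selects the matching documents; a TF-IDF sum ranks them."""
    terms = analyze(query)
    if not terms:
        return []
    # Boolean step (assumed AND): keep only documents containing every query term.
    approved = set.intersection(*(set(index.get(t, {})) for t in terms))
    scores = {}
    for doc_id in approved:
        score = 0.0
        for t in terms:
            idf = math.log(num_docs / len(index[t])) + 1.0  # rarer terms weigh more
            score += index[t][doc_id] * idf
        scores[doc_id] = score
    # Highest score first; ties broken by document id for determinism.
    return sorted(scores, key=lambda d: (-scores[d], d))


docs = {
    1: "Solr is a search platform built on Lucene.",
    2: "Lucene is a search library. Search is fast with Lucene.",
    3: "The index at the end of a book.",
}
index = build_index(docs)
print(search(index, len(docs), "lucene search"))  # prints [2, 1]
```

Document 2 ranks first because it contains both query terms twice, while document 3 is never scored because the Boolean step rejects it. The same analyze function is applied to documents and to the query, which mirrors the point that both sides pass through the analysis chain before terms are matched.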
What Makes Apache Solr So Popular
Apache Solr is one of the most widely used search solutions, with millions of downloads, tens of thousands of applications in production, and hundreds of commits every month. The following are some of the factors that make it so popular:
  • Lucene: Solr uses the Lucene search library at its core and wraps it to add features and expose it as a RESTful service that can be accessed over HTTP. Development of Solr and Lucene merged in March 2010, and both code bases reside in the same trunk in Apache Subversion (SVN); hence you are sure to get all the latest Lucene features in your latest Solr release.
  • Highly scalable and fault-tolerant: You can add computing capacity to Solr, or remove it, just by adding or removing replicas of your instance as needed. SolrCloud even shields your application from knowing how the data is distributed and saves you from dealing with further nuances such as load balancing and sharding. Data indexed to Solr can be replicated among multiple instances, so even if one instance goes down, the data will still be accessible.
  • Enterprise ready: Solr is well proven and trusted by many leading organizations for their search requirements and for handling extensive loads. It can be deployed in stand-alone mode, in a traditional distributed architecture, or in cloud mode, based on the needs of the organization, big or small.
  • Full-text search: Because Solr is built on top of Lucene, it provides all the matching capabilities needed, including token, phrase, fuzzy, and wildcard matching, as well as spell-check and autocomplete.
  • RESTful XML/JSON over HTTP: Solr is exposed as a RESTful web service and can be accessed over HTTP. Data can be exchanged in XML, JSON, CSV, and binary format.
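As a rough illustration of the HTTP access and matching capabilities listed above, the following Python sketch builds request URLs for Solr's /select search handler using the q, rows, and wt parameters. The host and port (a local instance on Solr's default port 8983), the core name books, and the field name title are assumptions for illustration only; the sketch constructs the URLs and does not contact a server.

```python
from urllib.parse import parse_qs, urlencode, urlsplit

SOLR_BASE = "http://localhost:8983/solr"  # assumption: local Solr on its default port
CORE = "books"                            # hypothetical core name


def select_url(query, rows=10, wt="json"):
    """Build a URL for the /select handler; wt picks the response format (json, xml, csv)."""
    params = {"q": query, "rows": rows, "wt": wt}
    return f"{SOLR_BASE}/{CORE}/select?{urlencode(params)}"


# Lucene query syntax for the matching styles named in the list above,
# against a hypothetical "title" field.
QUERIES = {
    "token": "title:solr",
    "phrase": 'title:"apache solr"',
    "fuzzy": "title:solr~1",     # terms within edit distance 1 of "solr"
    "wildcard": "title:sol*",
}

for kind, query in QUERIES.items():
    url = select_url(query)
    # Decode the query string again to confirm the encoding round-trips.
    decoded = parse_qs(urlsplit(url).query)["q"][0]
    print(f"{kind:9s} {decoded!r} -> {url}")
```

Any HTTP client can then fetch these URLs; switching wt to xml or csv asks for the same results in a different format, which is what lets clients in any language consume Solr without a dedicated library.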