Apache Solr Beginner’s Guide
- Length: 324 pages
- Edition: 1
- Language: English
- Publisher: Packt Publishing
- Publication Date: 2013-12-26
- ISBN-10: 1782162526
- ISBN-13: 9781782162520
- Sales Rank: #1488590
Configure your own search engine experience using real-world data with this practical guide to Apache Solr
Overview
- Learn to use Solr in real-world contexts, even if you are not a programmer, using simple configuration examples
- Define simple configurations for searching data in several ways in your specific context, from suggestions to advanced faceted navigation
- Teaches you in an easy-to-follow style, full of examples, illustrations, and tips to suit the demands of beginners
In Detail
With over 40 billion web pages to search, optimizing a search engine’s performance is essential.
Solr is an open source enterprise search platform from the Apache Lucene project. Full-text search, faceted search, hit highlighting, dynamic clustering, database integration, and rich document handling are just some of its many features. Solr is highly scalable thanks to its distributed search and index replication.
Solr is written in Java and runs as a standalone full-text search server within a servlet container such as Apache Tomcat or Jetty. Solr uses the Lucene Java search library at its core for full-text indexing and search, and has REST-like HTTP/XML and JSON APIs that make it usable with most popular programming languages. Solr’s powerful external configuration allows it to be tailored to many types of application without Java coding, and it has a plugin architecture to support more advanced customization.
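As a rough illustration of the REST-like HTTP API mentioned above, the following Python sketch builds request URLs for Solr's standard `/select` handler and switches the response format with the `wt` parameter. The host, port, and core name (`localhost:8983`, `collection1`) and the helper `build_select_url` are assumptions made for this example, not details taken from the book; adjust them to match your own installation.

```python
from urllib.parse import urlencode

# Assumed values for a local test instance; not taken from the book.
SOLR_BASE = "http://localhost:8983/solr"
CORE = "collection1"


def build_select_url(query, wt="json", rows=10, base=SOLR_BASE, core=CORE):
    """Return a URL for Solr's /select handler (hypothetical helper).

    q    : the query string, e.g. "title:solr"
    wt   : the response writer, e.g. "json" or "xml"
    rows : the maximum number of documents to return
    """
    params = {"q": query, "wt": wt, "rows": rows}
    # urlencode percent-encodes reserved characters such as ":" in the query.
    return f"{base}/{core}/select?{urlencode(params)}"


if __name__ == "__main__":
    # The same query, requested in two different response formats.
    print(build_select_url("title:solr", wt="json"))
    print(build_select_url("title:solr", wt="xml"))
```

The resulting URLs can be opened in a browser or passed to a command-line HTTP client such as cURL; because the API is plain HTTP, any language with an HTTP library can issue the same requests.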
With “Apache Solr Beginner’s Guide” you will learn how to configure your own search engine experience. Using real data as an example, you will have the chance to start writing step-by-step, simple, real-world configurations and understand when and where to adopt this technology.
“Apache Solr Beginner’s Guide” will start by letting you explore a simple search over real data. You will then go through a step-by-step description that gives you the chance to explore several practical features. At the end of the book you will see how Solr is used in different real-world contexts.
Using data from public domains like DBpedia, you will define several different configurations, exploring some of the most interesting Solr features, such as faceted search and navigation, auto-suggestion, and rich document indexing. You will see how to configure different analysers for handling different data types, without programming.
You will learn the basics of Solr, focusing on real-world examples and practical configurations.
What you will learn from this book
- Understand what full-text search and faceted navigation are and when to use them
- Install and use Solr for testing
- Write your own configurations for the Solr index incrementally and test them with the Solr web UI
- Learn how to test a running Solr instance using cURL with different formats, such as XML and JSON
- Construe your data and define the entities to be indexed in Solr
- Examine text and make auto-suggestions
- Index data using various formats and various data sources, and learn how to expose data in various formats
- Start using Solr in contexts like Open Data and Linked Data
- Use Solr for expanding your data with resources from public, well-known knowledge bases
Approach
Written in a friendly, example-driven format, the book includes plenty of step-by-step instructions and examples that are designed to help you get started with Apache Solr.
Table of Contents
Chapter 1: Getting Ready with the Essentials
Chapter 2: Indexing with Local PDF Files
Chapter 3: Indexing Example Data from DBpedia – Paintings
Chapter 4: Searching the Example Data
Chapter 5: Extending Search
Chapter 6: Using Faceted Search – from Searching to Finding
Chapter 7: Working with Multiple Entities, Multicores, and Distributed Search
Chapter 8: Indexing External Data Sources
Chapter 9: Introducing Customizations
Appendix: Solr Clients and Integrations