官术网_书友最值得收藏!

Introduction

In this chapter, you will learn two very important recipes. The first recipe demonstrates how you can index your data, and the second recipe, which is very closely connected to the first recipe, demonstrates how you can search through your indexed data.

For both indexing and searching, we will be using Apache Lucene. Apache Lucene is a free, opensource Java software library used heavily for information retrieval. It is supported by the Apache Software Foundation and is released under the Apache Software License.

Many different modern search platforms, such as Apache Solr and ElasticSearch, or crawling platforms, such as Apache Nutch, use Apache Lucene in the backend for data indexing and searching. Therefore, any data scientist who learns those search platforms will benefit from the two basic recipes in this chapter.

主站蜘蛛池模板: 祁门县| 龙川县| 新化县| 仪陇县| 沈丘县| 巴林右旗| 贵南县| 韶关市| 修文县| 泽州县| 珲春市| 洛川县| 葫芦岛市| 恩施市| 裕民县| 高平市| 宿迁市| 郁南县| 乐至县| 巴里| 泾源县| 遂川县| 五台县| 常德市| 鲁甸县| 剑阁县| 阿瓦提县| 青铜峡市| 阳谷县| 大石桥市| 宜丰县| 松江区| 昌黎县| 密山市| 乐清市| 军事| 黎城县| 合江县| 青阳县| 金秀| 永寿县|