{"id":151,"date":"2011-04-04T20:20:21","date_gmt":"2011-04-04T20:20:21","guid":{"rendered":"http:\/\/theinfosearch.com\/?p=151"},"modified":"2011-04-04T20:20:21","modified_gmt":"2011-04-04T20:20:21","slug":"building-search-applications-lucene-lingpipe-and-gate","status":"publish","type":"post","link":"https:\/\/theinfosearch.com\/2011\/04\/04\/building-search-applications-lucene-lingpipe-and-gate\/","title":{"rendered":"Building Search Applications: Lucene, LingPipe, and Gate"},"content":{"rendered":"
Lucene, LingPipe, <\/b> and Gate <\/b> are popular open source tools to build powerful search applications. Building Search Applications <\/b> describes functions from Lucene <\/b> that include indexing, searching, ranking, and spelling correction to build search engines. Use LingPipe <\/b> and Gate <\/b> to find the meaning of text to make search applications more useful. <\/p>\n
With this book you will learn to:
– Extract tokens <\/b> from text using custom tokenizers and analyzers from Lucene, LingPipe, and Gate.
– Construct a search engine index <\/b> with an optional backend database to manage large document collections.
– Explore the wide range of Lucene queries <\/b> to search an index, understand the ranking algorithm for a query, and suggest spelling corrections.
– Find the names <\/b> of people, places, and other entities in text using LingPipe and Gate.
– Categorize <\/b> documents by topic using classifiers and build groups of self-organized documents using clustering algorithms <\/b> from LingPipe.
– Create a Web crawler to scan the Web, Intranet, or desktop using Nutch. <\/b>
– Track the sentiment <\/b> of articles published on the Web with LingPipe
– Detect <\/b> plagiarism of documents using a registered document collection.<\/p>\n
Price: <\/b>$44.95<\/p>\n