Query parsed to: +search +insid
1 - 9 of 9 results (Page 1 of 1)
1.3.1 : What is indexing, and why is it important?
starts on page 10 under section 1.3 (Indexing and searching) in chapter 1 (Meet Lucene)
...Suppose you needed to
search a large number of files, and you wanted to be able to find files that contained a certain word or a phrase. How would you go about writing a program to do this? A naïve ... files are very large. This is where indexing comes in: To
search large amounts of text quickly, you must first index that text and convert it into a format that will let you
search it rapidly ... access to words stored
inside it. The concept behind it is analogous to an index at the end of a book... [
Full sample chapter]
4.1 : Using analyzers
starts on page 104 in chapter 4 (Analysis)
...Before we get into the gory details of what lurks inside an analyzer, let's look at how an analyzer is used within Lucene. Analysis occurs at two spots: during indexing and when using QueryParser. In the following two sections, we detail how an analyzer is used in these scenarios. Before we begin with any code details, look at listing 4.1 to get a feel for what the analysis process is all about ... during search- ing. When searching with QueryParser, the analysis process happens again in order...
4.7.3 : Hole lot of trouble
starts on page 138 under section 4.7 (Stemming analysis) in chapter 4 (Analysis)
... during searching. PhraseQuery and QueryParser are the two troublemakers. Exact phrase matches now fail ... ", porterAnalyzer); Hits hits = searcher.search(query); assertEquals("exact match not found!", 0, hits.length ... deeper inside PhraseQuery and its current inability to deal with positional gaps. All terms in a PhraseQuery must be side by side, and in our test case, the phrase it's searching for is "over lazi" (stop ... query = parser.parse("\"over the lazy\""); Hits hits = searcher.search(query); assertEquals("hole...
5.6.2 : Multithreaded searching using ParallelMultiSearcher
starts on page 180 under section 5.6 (Searching across multiple Lucene indexes) in chapter 5 (Advanced search techniques)
... 1.4. A search operation spins a thread for each Searchable and waits for them all to finish. The basic search and search with filter options are parallelized, but searching with a HitCollector has not yet been ... . An exam- ple, using ParallelMultiSearcher remotely, is shown in listing 5.9. Searching multiple indexes remotely Lucene includes remote index searching capability through Remote Method Invocation (RMI). There are numerous other alternatives to exposing search remotely, such as through web services...
6.5.1 : Testing the speed of a search
starts on page 213 under section 6.5 (Performance testing) in chapter 6 (Extending search)
... we determine that a searching performance issue is caused by how we index, and find out how we can easily fix ... .addDocument(document); } Being the test-infected coders we are, we even ensure that our search is returning the expected results by searching over a timestamp range that encompasses all documents indexed: public void testSearchByTimestamp() throws Exception { Search s = new Search(); Hits hits ... = new RangeQuery(beginTerm, endTerm, true); return newSearcher( index.byTimestampIndexDirName()).search...
4.2.3 : Visualizing analyzers
starts on page 112 under section 4.2 (Analyzing the analyzer) in chapter 4 (Analysis)
...] Let's now look deeper into what makes up a Token. Looking inside tokens TokenStreams can create Tokens ... in a block of text to begin and end highlighting, allowing words that users search for to stand out in search...
8.2.2 : Luke: the Lucene Index Toolbox
starts on page 271 under section 8.2 (Interacting with an index) in chapter 8 (Tools and extensions)
... browser. This gem provides an intimate view inside a file system- based index from an attractive desktop ... in the same context. The interface is divided into five tabs: Overview, Documents, Search, Files ... with two options: "Show all term docs" opens the Search tab for that term so all 1 The usual issues ... button shifts the view to the Search tab with a search on the selected term, such that all documents ... value isn't available). Still searching over here, boss We've already shown two ways to automatically...
index
starts on page 416
... 362 encoding government intelligence, searching 362 ISO-8859-1 142 use of Lucene 352 storing index inside ... 301 Directory 19 search space 171-178 tag 140 FSDirectory 19 token. See TokenFilter parsing ... 42-47 ISO-8859-1 142 community 10 Lucene (continued) multithreaded searching. versus analysis ... , Dr. Martin 25, matching 363 TheServerSide search 136, 283 mock object 131, 211 results ... ive 339 stripping plurals 334 Python 10 Q overriding for synonym search 68 injection 134 products...
Appendix B : Lucene index format
starts on page 394
... we've fed documents into Lucene and retrieved them during searches. Then, we'll expose the inner structure ... ). When searching, you again use the abstract Directory class to repre- sent the index. You pass that Directory to the IndexSearcher class and then find Documents that match a given query by passing search terms encapsulated in the Query object to one of IndexSearcher's search methods. The results ... that are being searched or indexed simultaneously, it's possible to reach the limit of open files set...