lucene phrase query not working

后端 未结 3 582
走了就别回头了
走了就别回头了 2021-01-15 10:21

I am trying to write a simple program using Lucene 2.9.4 which searches for a phrase query but I am getting 0 hits

public class HelloLucene {

public static          


        
3条回答
  •  刺人心
    刺人心 (楼主)
    2021-01-15 10:49

    There are two issues with the code (and they have nothing to do with your version of Lucene):

    1) the StandardAnalyzer does not index stopwords (like "in"), so the PhraseQuery will never be able to find the phrase "Lucene in"

    2) as mentioned by Xodarap and Shashikant Kore, your call to create a document needs to include Index.ANALYZED, otherwise Lucene does not use the Analyzer on this section of the Document. There's probably a nifty way to do it with Index.NOT_ANALYZED, but I'm not familiar with it.

    For an easy fix, change your addDoc method to:

    public static void addDoc(IndexWriter w, String value)throws IOException{
        Document doc = new Document();
        doc.add(new Field("content", value, Field.Store.YES, Field.Index.ANALYZED));
        w.addDocument(doc);
    }
    

    and modify your creation of the PhraseQuery to:

        PhraseQuery pq = new PhraseQuery();
        pq.add(new Term("content", "computer"),0);
        pq.add(new Term("content", "science"),1);
        pq.setSlop(0);
    

    This will give you the result below since both "computer" and "science" are not stopwords:

        Found 1 hits.
        1.The Art of Computer Science
    

    If you want to find "Lucene in Action", you can increase the slop of this PhraseQuery (increasing the 'gap' between the two words):

        PhraseQuery pq = new PhraseQuery();
        pq.add(new Term("content", "lucene"),0);
        pq.add(new Term("content", "action"),1);
        pq.setSlop(1);
    

    If you really want to search for the sentence "lucene in", you will need to select a different analyzer (like the SimpleAnalyzer). In Lucene 2.9, just replace your call to the StandardAnalyzer with:

        SimpleAnalyzer analyzer = new SimpleAnalyzer();
    

    Or, if you're using version 3.1 or higher, you need to add the version information:

        SimpleAnalyzer analyzer = new SimpleAnalyzer(Version.LUCENE_35);
    

    Here is a helpful post on a similar issue (this will help you get going with PhraseQuery): Exact Phrase search using Lucene? -- see WhiteFang34's answer.

提交回复
热议问题