Lucene.Net: How can I add a date filter to my search results?

前端 未结 2 372
清歌不尽
清歌不尽 2020-12-30 13:06

I\'ve got my searcher working really well, however it does tend to return results that are obsolete. My site is much like NerdDinner whereby events in the past become irrel

相关标签:
2条回答
  • 2020-12-30 14:00

    You're linking to the api documentation of Lucene 1.4.3. Lucene.Net is currently at 2.9.2. I think an upgrade is due.

    First, you're using Store.Yes alot. Stored fields will make your index larger, which may be a performance issue. Your date problem can easily be solved by storing dates as strings in the format of "yyyyMMddHHmmssfff" (that's really high resolution, down to milliseconds). You may want to reduce the resolution to create fewer tokens to reduce your index size.

    var dateValue = DateTools.DateToString(searchableEvent.EventDate, DateTools.Resolution.MILLISECOND);
    doc.Add(new Field("date", dateValue, Field.Store.YES, Field.Index.NOT_ANALYZED));
    

    Then you apply a filter to your search (the second parameter, where you currently pass in Nothing/null).

    var dateValue = DateTools.DateToString(DateTime.Now, DateTools.Resolution.MILLISECOND);
    var filter = FieldCacheRangeFilter.NewStringRange("date", 
                     lowerVal: dateValue, includeLower: true, 
                     upperVal: null, includeUpper: false);
    var topDocs = searcher.Search(query, filter, 10000);
    

    You can do this using a BooleanQuery combining your normal query with a RangeQuery, but that would also affect scoring (which is calculated on the query, not the filter). You may also want to avoid modifying the query for simplicity, so you know what query is executed.

    0 讨论(0)
  • 2020-12-30 14:05

    You can combine multiple queries with a BooleanQuery. Since Lucene only searches text note that the date field in your index must be ordered by the most significant to the least significant part of the date, i.e. in IS8601 format ("2010-11-02T20:49:16.000000+00:00")

    Example:

    Lucene.Net.Index.Term searchTerm = new Lucene.Net.Index.Term("fullText", searchTerms);
    Lucene.Net.Index.Term dateRange = new Lucene.Net.Index.Term("date", "2010*");
    
    Lucene.Net.Search.Query termQuery = new Lucene.Net.Search.TermQuery(searchTerm);
    Lucene.Net.Search.Query dateRangeQuery = new Lucene.Net.Search.WildcardQuery(dateRange);
    
    Lucene.Net.Search.BooleanQuery query = new Lucene.Net.Search.BooleanQuery();
    query.Add(termQuery, BooleanClause.Occur.MUST);
    query.Add(dateRangeQuery, BooleanClause.Occur.MUST);
    

    Alternatively if a wildcard is not precise enough you can add a RangeQuery instead:

    Lucene.Net.Search.Query termQuery = new Lucene.Net.Search.TermQuery(searchTerm);
    Lucene.Net.Index.Term date1 = new Lucene.Net.Index.Term("date", "2010-11-02*");
    Lucene.Net.Index.Term date2 = new Lucene.Net.Index.Term("date", "2010-11-03*");
    Lucene.Net.Search.Query dateRangeQuery = new Lucene.Net.Search.RangeQuery(date1, date2, true);
    
    Lucene.Net.Search.BooleanQuery query = new Lucene.Net.Search.BooleanQuery();
    query.Add(termQuery, BooleanClause.Occur.MUST);
    query.Add(dateRangeQuery, BooleanClause.Occur.MUST);
    
    0 讨论(0)
提交回复
热议问题