simple in memory full text search solution

后端 未结 3 2002
一向
一向 2021-01-18 03:13

I have a small website running on Java with probably a dozen of markdown files. I want to provide a full text search for user to quickly access those markdown files. Since i

3条回答
  •  温柔的废话
    2021-01-18 03:24

    As a side project I have implemented a simple in memory text search solution for java.

    https://github.com/bradforj287/SimpleTextSearch

    Key Features:

    • Inverted Index
    • Cosine Similarity algorithm w/ TFIDF ranking
    • MultiThreadded index creation and searching
    • Word Stemming (snowball stemmer)
    • Strips HTML tags automatically
    • Stop words
    • String tokenizer (Stanford NLP)

    Might want to take a look.

提交回复
热议问题