whoosh

Whoosh index viewer

半世苍凉 submitted on 2019-12-02 20:51:22
I'm using Haystack with Whoosh as the backend for a Django app. Is there any way to view the content of the indexes generated by Whoosh in an easy-to-read format? I'd like to see what data was indexed and how, so I can better understand how it works.

zeekay

You can do this pretty easily from Python's interactive console:

>>> from whoosh.index import open_dir
>>> ix = open_dir('whoosh_index')
>>> ix.schema
<<< <Schema: ['author', 'author_exact', 'content', 'django_ct', 'django_id', 'id', 'lexer', 'lexer_exact', 'published', 'published_exact']>

You can perform search queries directly on your index

Exact match of multiple words in whoosh

筅森魡賤 submitted on 2019-12-02 03:14:17
Question: By default, a multi-word search returns files that contain each of the words separately. How can I override this default and have Whoosh search for an exact match? Even though it's most likely supported, I can't find it on Google or in the Whoosh documentation. In addition, would searching for an exact match have better or worse performance than the same multi-word search?

Answer 1: Use phrase queries (that is, put double quotes around the words you want matched in your query), e.g.: "to be or not

Exact match of multiple words in whoosh

假装没事ソ submitted on 2019-12-01 23:12:38
By default, a multi-word search returns files that contain each of the words separately. How can I override this default and have Whoosh search for an exact match? Even though it's most likely supported, I can't find it on Google or in the Whoosh documentation. In addition, would searching for an exact match have better or worse performance than the same multi-word search?

Use phrase queries (that is, put double quotes around the words you want matched in your query), e.g.: "to be or not to be"

However, this only works if the field you're searching in is of type whoosh.fields.TEXT. As for the
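To make the difference between the default multi-word search and a phrase query concrete, here is a minimal pure-Python sketch of a positional index. It is not Whoosh's implementation; the function names and sample documents are invented for illustration. It only shows why a phrase match needs word positions in addition to the word-to-document mapping.

```python
from collections import defaultdict


def build_positional_index(docs):
    """Map word -> doc_id -> list of positions where the word occurs."""
    index = defaultdict(lambda: defaultdict(list))
    for doc_id, text in docs.items():
        for pos, word in enumerate(text.lower().split()):
            index[word][doc_id].append(pos)
    return index


def match_all_words(index, query):
    """Default-style search: documents containing every word, in any order."""
    words = query.lower().split()
    if not words:
        return set()
    return set.intersection(*(set(index.get(w, {})) for w in words))


def match_phrase(index, query):
    """Phrase search: the words must appear consecutively, in order."""
    words = query.lower().split()
    hits = set()
    for doc_id in match_all_words(index, query):
        for start in index[words[0]][doc_id]:
            # Word i of the phrase must sit exactly i positions after the start.
            if all(start + i in index[w][doc_id] for i, w in enumerate(words)):
                hits.add(doc_id)
                break
    return hits


sample_docs = {
    1: "to be or not to be",
    2: "not to be late or to be early",
    3: "be quiet",
}
sample_index = build_positional_index(sample_docs)
print(match_all_words(sample_index, "to be or not"))
print(match_phrase(sample_index, "to be or not"))
```

In this sketch the phrase search first narrows to documents containing all the words and then checks positions, so it does strictly more work per candidate than the plain multi-word search; how that plays out in a real engine depends on its index layout.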

Django Haystack & Whoosh Search Working, But SearchQuerySet Return 0 Results

自古美人都是妖i submitted on 2019-12-01 08:54:45
Question: Edit: More info at bottom of post...

Original Question: I seem to be having the same problem as in this (unresolved) question: django-haystack + Whoosh SearchQuerySet().all() always None

I've set up Haystack with Whoosh on my Django project and all was working fine at first (SearchQuerySet used to return results). After an aborted attempt to create a new custom search form (rolled back from git), it appears that indexing and the original search page still work fine, but now

Document search on partial words

你说的曾经没有我的故事 submitted on 2019-11-30 13:03:52
I am looking for a document search engine (like Xapian, Whoosh, Lucene, Solr, Sphinx or others) which is capable of searching partial terms. For example when searching for the term "brit" the search engine should return documents containing either "britney" or "britain" or in general any document containing a word matching r *brit* Tangentially, I noticed most engines use TF-IDF (Term frequency-Inverse document frequency) or its derivatives which are based on full terms and not partial terms. Are there any other techniques that have been successfully implemented besides TF-IDF for document
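One common way to support partial-term lookups such as "brit" is to index character n-grams of each word instead of (or in addition to) whole terms. The sketch below is a small pure-Python illustration under that assumption; it is not how any of the engines named above implement it, and the function names, trigram size, and sample documents are hypothetical.

```python
from collections import defaultdict


def ngrams(word, n=3):
    """Return the set of character n-grams of a word."""
    return {word[i:i + n] for i in range(len(word) - n + 1)}


def build_ngram_index(docs, n=3):
    """Map each n-gram to the (doc_id, word) pairs in which it occurs."""
    index = defaultdict(set)
    for doc_id, text in docs.items():
        for word in text.lower().split():
            for gram in ngrams(word, n):
                index[gram].add((doc_id, word))
    return index


def search_partial(index, docs, fragment, n=3):
    """Return ids of documents with a word containing `fragment`."""
    fragment = fragment.lower()
    grams = ngrams(fragment, n)
    if not grams:
        # Fragment is shorter than n: fall back to a linear scan.
        return {
            doc_id
            for doc_id, text in docs.items()
            if any(fragment in word for word in text.lower().split())
        }
    # Candidate words must contain every n-gram of the fragment.
    candidates = set.intersection(*(index.get(g, set()) for g in grams))
    # Verify, since sharing all n-grams does not guarantee a substring match.
    return {doc_id for doc_id, word in candidates if fragment in word}


partial_docs = {
    1: "Britney released an album",
    2: "Great Britain is an island",
    3: "A bright idea",
}
partial_index = build_ngram_index(partial_docs)
print(search_partial(partial_index, partial_docs, "brit"))
```

The n-gram index trades extra storage for fast candidate lookup; the final substring check keeps the results exact.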