lucene | 易学教程

Lucene custom scoring for numeric fields

阅读更多关于 Lucene custom scoring for numeric fields

问题 I would like to have, in addition to standard term search with tf-idf similarity over text content field, scoring based on "similarity" of numeric fields. This similarity will be depending on distance between the value in query and in document (e.g. gaussian with m= [user input], s= 0.5) I.e. let's say documents represent people, and person document have two fields: description (full text) age (numeric). I want to find documents like description:(x y z) age:30 but age to be not the filter ,

How do you run Lucene on .net?

阅读更多关于 How do you run Lucene on .net?

问题 Lucene is an excellent search engine, but the .NET version is behind the official Java release (latest stable .NET release is 2.0, but the latest Java Lucene version is 2.4, which has more features). How do you get around this? 回答1: One way I found, which was surprised could work: Create a .NET DLL from a Java .jar file! Using IKVM you can download Lucene, get the .jar file, and run: ikvmc -target:library <path-to-lucene.jar> which generates a .NET dll like this: lucene-core-2.4.0.dll You can

Is there a good indexing / search engine for Node.js? [closed]

阅读更多关于 Is there a good indexing / search engine for Node.js? [closed]

问题 Closed. This question is off-topic. It is not currently accepting answers. Want to improve this question? Update the question so it's on-topic for Stack Overflow. Closed 3 years ago . I'm looking for a good open source (with LGPL or a permissive license) indexing engine for a node.js application, something like Lucene. I'm looking for in-process indexing and search and am not interested in indexing servers like Sphinx or Solr. I am not afraid to create bindings for a C/C++ library either so I

Is there a good indexing / search engine for Node.js? [closed]

阅读更多关于 Is there a good indexing / search engine for Node.js? [closed]

Class 'ZendSearch\Lucene\Lucene' not found ZendFramework2

阅读更多关于 Class 'ZendSearch\Lucene\Lucene' not found ZendFramework2

问题 I've installed ZendSearch with composer using these commands: $ cd /var/www/CommunicationApp/vendor/ $ git clone https://github.com/zendframework/ZendSearch.git ZendSearch $ cd ZendSearch/ $ curl -s https://getcomposer.org/installer | php $ php composer.phar install And I've installed Zendskeleton according to GITHUB So, I don't know What I'm missing here. Than, in the same book, it teaches how to use ZendSearch, but I'm not getting the same results, instead I'm getting a Fatal error: Fatal

mapping in create index in elasticsearch through mongodb river is not taking effect

阅读更多关于 mapping in create index in elasticsearch through mongodb river is not taking effect

问题 I am trying to index mongodb in elasticsearch using mongodb-river using the following command but the document mapping is not taking effect. It is still using the default analyzer(standard) for field text Mongodb-river The document specifies the creation of index but there is no documentation on how to provide custom mapping. This is what I tried. Is there any other documentation where I can find how to specify custom analyzers etc in using mongodb-river. curl -XPUT "localhost:9200/_river

mapping in create index in elasticsearch through mongodb river is not taking effect

阅读更多关于 mapping in create index in elasticsearch through mongodb river is not taking effect

Change dynamically elasticsearch synonyms

阅读更多关于 Change dynamically elasticsearch synonyms

问题 Is it possible to store the synonyms for elasticsearch in the index? Or is it possible to get the synonym list from a database like couchdb? I'd like to add synonyms dynamically to elasticsearch via the REST-API. 回答1: There are two approaches when working with synonyms : expanding them at indexing time, expanding them at query time. Expanding synonyms at query time is not recommended since it raises issues with : scoring, since synonyms have different document frequencies, multi-token

Field having multiple distinct values

阅读更多关于 Field having multiple distinct values

问题 Am building a "Book search" API using Lucene. I need to index Book Name,Author, and Book category fields in Lucene index. A single book can fall under multiple distinct book categories...for example: BookName1 --fiction,humour,philosophy. BookName1 --fiction,science. BookName1 --humour,business. BookName4-humour and so on..... User should be able to search all the books under a particular category say "homour". Given this situation, how do i index above fields and build the query in lucene?

ES入门

阅读更多关于 ES入门

1. ElasticSearch ES即为了解决原生Lucene使用的不足，优化Lucene的调用方式，并实现了高可用的分布式集群的搜索方案，其第一个版本于2010年2月出现在GitHub上并迅速成为最受欢迎的项目之一。首先，ES的索引库管理支持依然是基于Apache Lucene™的开源搜索引擎。 ES也使用Java开发并使用Lucene作为其核心来实现所有索引和搜索的功能，但是它的目的是通过简单的 RESTful API来隐藏Lucene的复杂性，从而让全文搜索变得简单。不过，ES的核心不仅仅在于Lucene，其特点更多的体现为：分布式的实时文件存储，每个字段都被索引并可被搜索分布式的实时分析搜索引擎 KB-MB-GB-TB-PB 可以扩展到上百台服务器，处理PB级结构化或非结构化数据高度集成化的服务，你的应用可以通过简单的 RESTful API、各种语言的客户端甚至命令行与之交互。上手Elasticsearch非常容易。它提供了许多合理的缺省值，并对初学者隐藏了复杂的搜索引擎理论。它拥有开瓶即饮的效果（安装即可使用），只需很少的学习既可在生产环境中使用。 1.1 ES的使用者及类似框架典型使用案例全球程序员github – 中国码云 ①Github(美国)使用Elasticsearch搜索20TB的数据，包括13亿的文件和1300亿行的代码.