solr

Solr and Jetty deployment issue

隐身守侯 posted on 2019-12-24 11:47:59
Question: I am having trouble getting Solr + Jetty to work. I am following the instructions at http://wiki.apache.org/solr/SolrJetty to the letter, and at first it works well. But when I restart Jetty multiple times, it starts hanging after three or four restarts: the admin pages don't load and my app fails to acquire a connection to Solr. I also created a work folder, /opt/solr/work, and I am setting tmpdir to a new path in /etc/default/jetty. I can confirm the tmpdir is set to the new path from

Is there a way to get Hit Counts in solr

丶灬走出姿态 posted on 2019-12-24 11:47:59
Question: I have a simple question: is it possible to get hit counts within a document? For example, if we search for "risk", could we get the number of occurrences of "risk" in each matched document? Answer 1: You can use highlighting and count the snippets. Make sure you have termVectors enabled for the affected fields and that you understand the highlighting options. You could also use the TermVectorComponent, although it is currently not directly addressable via SolrNet. Source: https://stackoverflow.com/questions/4143896/is-there-a-way-to-get-hit
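The "highlight and count" idea from the answer can be sketched roughly as follows. This is a minimal illustration, not the answerer's code: the field name `text`, the sample response, and the helper names are hypothetical, and the `hl.simple.pre`/`hl.simple.post` markers only apply to highlighters that honor them. The count is also capped by `hl.snippets` and the fragment size, so it is a lower bound rather than an exact term frequency.

```python
import re
from urllib.parse import urlencode

# Markers wrapped around each highlighted match (values chosen for this sketch).
PRE, POST = "<em>", "</em>"


def highlight_params(term, field):
    """Build query-string parameters that ask Solr to highlight `term` in `field`."""
    return urlencode({
        "q": f"{field}:{term}",
        "hl": "true",
        "hl.fl": field,
        "hl.snippets": 100,  # assumption: large enough to cover every match
        "hl.simple.pre": PRE,
        "hl.simple.post": POST,
    })


def count_hits(highlighting, field):
    """Count highlighted matches per document in a parsed 'highlighting' section."""
    pattern = re.compile(re.escape(PRE) + r".*?" + re.escape(POST))
    return {
        doc_id: sum(len(pattern.findall(snippet)) for snippet in fields.get(field, []))
        for doc_id, fields in highlighting.items()
    }


# Hypothetical 'highlighting' section, shaped like Solr's JSON response.
sample = {
    "doc1": {"text": ["the <em>risk</em> of delay",
                      "credit <em>risk</em> and market <em>risk</em>"]},
    "doc2": {"text": ["no <em>risk</em> here"]},
}

print(highlight_params("risk", "text"))
print(count_hits(sample, "text"))
```

Here `count_hits` only post-processes a response that has already been fetched and parsed; sending the request is left to whichever client library is in use.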

Search document by id very slow

混江龙づ霸主 posted on 2019-12-24 11:42:12
Question: My documents have an id field called "doc_id", declared with the following attributes: <field name="doc_id" type="long" indexed="true" stored="true" required="true" /> This id is marked as a unique key with the tag. I try to load many documents at once by giving their ids in the request, e.g. doc_id:(235, 163, 256, ..., 10473). The number of identifiers in the query is never greater than 100. However, the more documents there are in the storage, the longer this query takes to execute. I thought

How to search documents in within date range but based on text format info in Solr?

删除回忆录丶 posted on 2019-12-24 11:30:10
Question: All, I wonder if there is a way to solve this problem: I have a lot of Solr documents with a pub_date field, but unfortunately it is stored as text, like "20180901". If I want to search by pub_date within a range, how should I write the Solr query? Thanks. Answer 1: Assuming that all your fields have the same format, i.e. 20180901, mapping to YYYYMMDD, you can use a regular range search. The lexicographic ordering of the strings will be the same as the chronological ordering of the dates. pub_date:
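The ordering argument in the answer can be checked with a short sketch. It assumes pub_date is indexed as a single untokenized string in YYYYMMDD form; the helper names and the sample dates are made up for illustration, and the range uses Solr's standard `[start TO end]` syntax with inclusive bounds.

```python
from datetime import date


def to_key(d):
    """Format a date as the YYYYMMDD string used in the pub_date field."""
    return d.strftime("%Y%m%d")


def range_query(field, start, end):
    """Build an inclusive Solr range query over YYYYMMDD strings."""
    return f"{field}:[{to_key(start)} TO {to_key(end)}]"


# Hypothetical dates: sorting the strings gives the same order as sorting the dates.
dates = [date(2018, 9, 1), date(2017, 12, 31), date(2018, 10, 15), date(2018, 1, 2)]
keys = [to_key(d) for d in dates]
assert sorted(keys) == [to_key(d) for d in sorted(dates)]

query = range_query("pub_date", date(2018, 9, 1), date(2018, 9, 30))
print(query)  # pub_date:[20180901 TO 20180930]
```

The equivalence holds only because every value has the same fixed width with zero padding; a value such as "2018901" would break it.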

solr facet search truncate words

雨燕双飞 posted on 2019-12-24 11:24:34
Question: I have a Solr instance configured for French content. Search is fine, but when I activate facet search, words are truncated in a peculiar way. The final e disappears, e.g. automobil instead of automobile, montagn instead of montagne, styl instead of style, homm instead of homme, etc. <lst name="keywords"> <int name="automobil">1</int> <int name="citroen">1</int> <int name="minist">0</int> <int name="polit">0</int> <int name="pric">0</int> <int name="shinawatr">0</int> <int name="thailand">0</int> </lst> Here is the

solr query: x out of n words matched

狂风中的少年 posted on 2019-12-24 11:18:03
Question: Suppose I have 4 words (let's call them a, b, c and d for the sake of example) and I want to search in Solr for documents that match any 3 of them. I could write a query with all the combinations as follows: (a AND b AND c) OR (a AND b AND d) OR (a AND c AND d) OR (b AND c AND d) This works and matches documents with 3 of these 4 words, but things get messy when I have 10 words and need to match 3 of them. The number of combinations required grows rapidly. Obviously, thanks to the complexity of the query, Solr goes bust (it
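To see how quickly the brute-force query described in the question grows, here is a small sketch that generates it. The function name and the placeholder words `w0`..`w9` are hypothetical; the code only builds the query string and makes no claim about how Solr executes it.

```python
from itertools import combinations
from math import comb


def brute_force_query(words, k):
    """OR together one AND-clause for every k-word combination of `words`."""
    clauses = ["(" + " AND ".join(c) + ")" for c in combinations(words, k)]
    return " OR ".join(clauses)


four = brute_force_query(["a", "b", "c", "d"], 3)
print(four)

# Ten placeholder words, any three of which must match.
ten_words = [f"w{i}" for i in range(10)]
ten = brute_force_query(ten_words, 3)
print(ten.count(" OR ") + 1, comb(10, 3))  # number of clauses vs. C(10, 3)
```

For 4 words and k = 3 this yields 4 clauses; for 10 words it is already C(10, 3) = 120 clauses with 360 terms in total, which matches the asker's observation that the approach does not scale.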

What is the difference between a Join Query and Embedded Entities in Solr DIH?

蹲街弑〆低调 posted on 2019-12-24 10:21:43
Question: I am trying to index data across multiple tables using Solr's Data Import Handler. The official wiki on the DIH suggests using embedded entities to link multiple tables, like so: <document> <entity name="item" pk="id" query="SELECT * FROM item"> <entity name="member" pk="memberid" query="SELECT * FROM member WHERE memberid='${item.memberid}'"> </entity> </entity> </document> Another way that works is: <document> <entity name="item" pk="id" query="SELECT * FROM item INNER JOIN member ON item
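One way to think about the two configurations is in terms of how many SQL statements reach the database. The sketch below imitates both patterns with an in-memory SQLite database. It is not DIH itself: the table columns, the sample rows, and the join condition `item.memberid = member.memberid` are assumptions for illustration, and it assumes the embedded entity is evaluated once per parent row with no caching configured.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.row_factory = sqlite3.Row
conn.executescript("""
CREATE TABLE member (memberid INTEGER PRIMARY KEY, name TEXT);
CREATE TABLE item (id INTEGER PRIMARY KEY, title TEXT, memberid INTEGER);
INSERT INTO member VALUES (1, 'alice'), (2, 'bob');
INSERT INTO item VALUES (10, 'lamp', 1), (11, 'desk', 2), (12, 'chair', 1);
""")


def embedded_entities():
    """Parent query plus one child query per parent row (the nested-entity pattern)."""
    docs, queries = [], 1
    for item in conn.execute("SELECT * FROM item ORDER BY id").fetchall():
        member = conn.execute(
            "SELECT * FROM member WHERE memberid = ?", (item["memberid"],)
        ).fetchone()
        queries += 1
        docs.append((item["id"], item["title"], member["name"]))
    return docs, queries


def single_join():
    """One statement that lets the database combine both tables."""
    rows = conn.execute(
        "SELECT item.id, item.title, member.name FROM item "
        "INNER JOIN member ON item.memberid = member.memberid ORDER BY item.id"
    ).fetchall()
    return [tuple(r) for r in rows], 1


nested_docs, nested_queries = embedded_entities()
joined_docs, joined_queries = single_join()
print(nested_docs == joined_docs, nested_queries, joined_queries)
```

With three items, both variants produce the same flattened rows, but the nested pattern issues four statements against one for the join. A join also returns one row per matching pair, so a parent with several children appears several times, whereas nested entities keep one document per parent row.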

Solr - count documents in the range of two date fields

*爱你&永不变心* posted on 2019-12-24 09:32:52
Question: Here are some example Solr documents I have: { "id": "1", "openDate": "2017-12-01T00:00:00.000Z", "closeDate": "2017-12-04T00:00:00.000Z" }, { "id": "2", "openDate": "2017-12-02T00:00:00.000Z", "closeDate": "2017-12-04T00:00:00.000Z" }, { "id": "3", "openDate": "2017-12-02T00:00:00.000Z", "closeDate": "2017-12-06T00:00:00.000Z" } The dates on which a document is "active" are the dates between the openDate (inclusive) and the closeDate (exclusive). I want to count the number of documents that are
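The "active" definition above (openDate inclusive, closeDate exclusive) can be written down as a predicate and checked against the three sample documents. The question is cut off before it states exactly what should be counted, so this sketch only assumes the simplest reading: how many documents are active at a given instant. The helper names are hypothetical; the query string relies on Solr's mixed range brackets, where `[` is inclusive and `{` is exclusive.

```python
from datetime import datetime, timezone

DOCS = [
    {"id": "1", "openDate": "2017-12-01T00:00:00.000Z", "closeDate": "2017-12-04T00:00:00.000Z"},
    {"id": "2", "openDate": "2017-12-02T00:00:00.000Z", "closeDate": "2017-12-04T00:00:00.000Z"},
    {"id": "3", "openDate": "2017-12-02T00:00:00.000Z", "closeDate": "2017-12-06T00:00:00.000Z"},
]


def parse(ts):
    """Parse a Solr-style UTC timestamp such as 2017-12-01T00:00:00.000Z."""
    return datetime.strptime(ts, "%Y-%m-%dT%H:%M:%S.%fZ").replace(tzinfo=timezone.utc)


def is_active(doc, instant):
    """openDate is inclusive, closeDate is exclusive."""
    return parse(doc["openDate"]) <= instant < parse(doc["closeDate"])


def count_active(docs, ts):
    """Count the documents that are active at timestamp `ts`."""
    instant = parse(ts)
    return sum(is_active(d, instant) for d in docs)


def active_query(ts):
    """Equivalent filter in Solr range syntax: open on or before ts, closing after ts."""
    return f"openDate:[* TO {ts}] AND closeDate:{{{ts} TO *]"


print(count_active(DOCS, "2017-12-02T00:00:00.000Z"))  # 3
print(count_active(DOCS, "2017-12-04T00:00:00.000Z"))  # 1
print(active_query("2017-12-04T00:00:00.000Z"))
```

On 2017-12-04 only document 3 is still counted, because the closeDate of documents 1 and 2 is exclusive.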

Error while indexing .xml files in solr

久未见 posted on 2019-12-24 09:27:02
Question: I am trying to index XML files in the Solr search engine using the following command: java -Durl=http://10.1.11.143:8080/solr/#/ -jar post.jar solr.xml But I am getting the following error: SimplePostTool version 1.5 Posting files to base url http://10.1.11.143:8080/solr/#/ using content-type application/xml.. POSTing file solr.xml SimplePostTool: WARNING: Solr returned an error #500 Internal Server Error SimplePostTool: WARNING: IOException while reading response: java.io.IOException: Server returned

Store Images to display in SOLR search results

早过忘川 posted on 2019-12-24 09:04:21
Question: I have built a Solr index that holds image thumbnail URLs, and I want to render an image alongside each search result. The problem is that there can be millions of images, and I think storing them in the index as binary data would make the index enormous. I am seeking guidance on how to store those images efficiently after fetching them from the URLs: should I use the plain file system and have them served by Tomcat, or should I use a JCR repository like Apache Jackrabbit? Any