solr

DataImportHandler and partial updates

為{幸葍}努か 提交于 2020-01-05 01:41:53
问题 Is that possible to use DataImportHandler with partial updates in Solr 4? Should I be able to use a data-config.xml like the one below, and import both entities in separate moments and get full documents with both data? <document name="item"> <entity name="pricing" query="select * from prc"> <field column="ID" name="itemId" /> <field column="NM" name="itemName" /> <field column="default" name="defaultPrice" /> <field column="sale" name="salesPrice" /> </entity> <entity name="tag" query=

Remote streaming with Solr

回眸只為那壹抹淺笑 提交于 2020-01-04 15:31:14
问题 I'm having trouble using remote streaming with Apache Solr. We previously had Solr running on the same server where the files to be indexed are located so all we had to to was pass it the path of the file we wanted to index. We used something like this: stream.file=/path/to/file.pdf This worked fine. We have now moved Solr so that it runs on a different server to the website that uses it. This was because it was using up too many resources. I'm now using the following to point Solr in the

Specifying multivalued term frequency in Solr update?

六眼飞鱼酱① 提交于 2020-01-04 14:29:22
问题 I have a Solr schema containing a multivalued field. I'm parsing documents externally to Solr and updating the index using http://wiki.apache.org/solr/UpdateJSON (see also http://wiki.apache.org/solr/UpdateXmlMessages). Below is a toy example that demonstrates the problem I'm trying to solve. { "add": { "doc": { "id": "MyDocumentID", "user": "MyUserID", "meals": ["pizza", "pizza", "pizza", "burger"] } } } I'm hoping to find some sort of syntax that will allow me to indicate that "pizza"

Working with hyphenated words in SOLR

左心房为你撑大大i 提交于 2020-01-04 13:42:48
问题 I have a hyphenated word. In my case it is "re-use". I want to be able to match it for "re-use", "reuse" and "re use". If I use a WordDelimiterFilterFactory with catenateAll=1 then it will transform "re-use" into "reuse". This doesn't cover the case of a search for "re use". In addition to this, the word 're-use' is being used as as a synonym with SynonymFilterFactory so the solution would have to work with that too. If my synonym file says "re-use => other thing", then I want to be able to

Working with hyphenated words in SOLR

三世轮回 提交于 2020-01-04 13:41:09
问题 I have a hyphenated word. In my case it is "re-use". I want to be able to match it for "re-use", "reuse" and "re use". If I use a WordDelimiterFilterFactory with catenateAll=1 then it will transform "re-use" into "reuse". This doesn't cover the case of a search for "re use". In addition to this, the word 're-use' is being used as as a synonym with SynonymFilterFactory so the solution would have to work with that too. If my synonym file says "re-use => other thing", then I want to be able to

solrj : expicit commit required after add?

拥有回忆 提交于 2020-01-04 09:28:09
问题 I am using the solrClient.add(SolrInputDocument doc) method to add documents, one by one, to my solr . after this i explicitly call solrClient.commit() Is it required? , I have seen some add methods, which specify a delay for commit . What does this mean, does the simple add method does not commit, or if it does, after how long? 回答1: In Solr we have mainly two different types of commit: Hard commit : This is governed by the autoCommit option in solrconfig.xml or explicit calls from a client

Hibernate Search Integration with Apache Solr unable to index data

为君一笑 提交于 2020-01-04 07:35:29
问题 In my current application I use hibernate search to index and searching data. It works fine. But when building a cluster of server instances I do not need to use Master Slave clusters using JMS or JGroups. So I am trying to integrate hibernate search with apache solr. I had follow this example. And did some minor changes to be compatible with new apache.lucene.core version. public class HibernateSearchSolrWorkerBackend implements BackendQueueProcessor { private static final String ID_FIELD

Hibernate Search Integration with Apache Solr unable to index data

泪湿孤枕 提交于 2020-01-04 07:35:02
问题 In my current application I use hibernate search to index and searching data. It works fine. But when building a cluster of server instances I do not need to use Master Slave clusters using JMS or JGroups. So I am trying to integrate hibernate search with apache solr. I had follow this example. And did some minor changes to be compatible with new apache.lucene.core version. public class HibernateSearchSolrWorkerBackend implements BackendQueueProcessor { private static final String ID_FIELD

Solr: Multi Word Synonyms : Only first word is highlighting

那年仲夏 提交于 2020-01-04 06:48:26
问题 I am doing index time synonym expansion for multi word synonyms. In my synonyms.txt I have synonyms like (barc=>barc,bhabha atomic research center). But when I am searching for "barc" it is not highlighting "bhabha atomic research center". Only 1st word i.e "bhabha" is highlighting... Can any one help me??? thanks... 回答1: For now easiest way to fix this is to set LUCENE_33 instead of LUCENE_40 in solrconfig.xml and reindex. Highlighting will work then as expected. A patch is now available:

Timestamp compatibility while performing delta import in solr

旧时模样 提交于 2020-01-04 06:35:26
问题 Im new to solr.I have successfully indexed oracle 10g xe database. Im trying to perform delta import on the same. The delta query requires a comparison of last_modified column of the table with ${dih.last_index_time} . However in my application I do not have such a column . Also, i cannot add this column. Therefore i used ' scn_to_timestamp(ora_rowscn) ' to give the value of the required timestamps. This query returns the value of type timestamp in the following format 24-JUL-13 12.42.32