solr | 易学教程

DataImportHandler and partial updates

阅读更多关于 DataImportHandler and partial updates

问题 Is that possible to use DataImportHandler with partial updates in Solr 4? Should I be able to use a data-config.xml like the one below, and import both entities in separate moments and get full documents with both data? <document name="item"> <entity name="pricing" query="select * from prc"> <field column="ID" name="itemId" /> <field column="NM" name="itemName" /> <field column="default" name="defaultPrice" /> <field column="sale" name="salesPrice" /> </entity> <entity name="tag" query=

Remote streaming with Solr

阅读更多关于 Remote streaming with Solr

问题 I'm having trouble using remote streaming with Apache Solr. We previously had Solr running on the same server where the files to be indexed are located so all we had to to was pass it the path of the file we wanted to index. We used something like this: stream.file=/path/to/file.pdf This worked fine. We have now moved Solr so that it runs on a different server to the website that uses it. This was because it was using up too many resources. I'm now using the following to point Solr in the

Specifying multivalued term frequency in Solr update?

阅读更多关于 Specifying multivalued term frequency in Solr update?

问题 I have a Solr schema containing a multivalued field. I'm parsing documents externally to Solr and updating the index using http://wiki.apache.org/solr/UpdateJSON (see also http://wiki.apache.org/solr/UpdateXmlMessages). Below is a toy example that demonstrates the problem I'm trying to solve. { "add": { "doc": { "id": "MyDocumentID", "user": "MyUserID", "meals": ["pizza", "pizza", "pizza", "burger"] } } } I'm hoping to find some sort of syntax that will allow me to indicate that "pizza"

Working with hyphenated words in SOLR

阅读更多关于 Working with hyphenated words in SOLR

问题 I have a hyphenated word. In my case it is "re-use". I want to be able to match it for "re-use", "reuse" and "re use". If I use a WordDelimiterFilterFactory with catenateAll=1 then it will transform "re-use" into "reuse". This doesn't cover the case of a search for "re use". In addition to this, the word 're-use' is being used as as a synonym with SynonymFilterFactory so the solution would have to work with that too. If my synonym file says "re-use => other thing", then I want to be able to

Working with hyphenated words in SOLR

阅读更多关于 Working with hyphenated words in SOLR

solrj : expicit commit required after add?

阅读更多关于 solrj : expicit commit required after add?

问题 I am using the solrClient.add(SolrInputDocument doc) method to add documents, one by one, to my solr . after this i explicitly call solrClient.commit() Is it required? , I have seen some add methods, which specify a delay for commit . What does this mean, does the simple add method does not commit, or if it does, after how long? 回答1: In Solr we have mainly two different types of commit: Hard commit : This is governed by the autoCommit option in solrconfig.xml or explicit calls from a client

Hibernate Search Integration with Apache Solr unable to index data

阅读更多关于 Hibernate Search Integration with Apache Solr unable to index data

问题 In my current application I use hibernate search to index and searching data. It works fine. But when building a cluster of server instances I do not need to use Master Slave clusters using JMS or JGroups. So I am trying to integrate hibernate search with apache solr. I had follow this example. And did some minor changes to be compatible with new apache.lucene.core version. public class HibernateSearchSolrWorkerBackend implements BackendQueueProcessor { private static final String ID_FIELD

Hibernate Search Integration with Apache Solr unable to index data

阅读更多关于 Hibernate Search Integration with Apache Solr unable to index data

Solr: Multi Word Synonyms : Only first word is highlighting

阅读更多关于 Solr: Multi Word Synonyms : Only first word is highlighting

问题 I am doing index time synonym expansion for multi word synonyms. In my synonyms.txt I have synonyms like (barc=>barc,bhabha atomic research center). But when I am searching for "barc" it is not highlighting "bhabha atomic research center". Only 1st word i.e "bhabha" is highlighting... Can any one help me??? thanks... 回答1: For now easiest way to fix this is to set LUCENE_33 instead of LUCENE_40 in solrconfig.xml and reindex. Highlighting will work then as expected. A patch is now available:

Timestamp compatibility while performing delta import in solr

阅读更多关于 Timestamp compatibility while performing delta import in solr

问题 Im new to solr.I have successfully indexed oracle 10g xe database. Im trying to perform delta import on the same. The delta query requires a comparison of last_modified column of the table with ${dih.last_index_time} . However in my application I do not have such a column . Also, i cannot add this column. Therefore i used ' scn_to_timestamp(ora_rowscn) ' to give the value of the required timestamps. This query returns the value of type timestamp in the following format 24-JUL-13 12.42.32