solrcloud

solr ReplicationHandler - SnapPull failed to download files

吃可爱长大的小学妹 提交于 2019-12-11 09:29:07
问题 we are continuously getting this exception during replication from master to slave. our index size is 9.7 G and we are trying to replicate a slave from scratch. 30 Oct 2013 18:22:16,996 [explicit-fetchindex-cmd] ERROR ReplicationHandler - SnapPull failed :org.apache.solr.common.SolrException: Unable to download _41c_Lucene41_0.doc completely. Downloaded 0!=107464871 at org.apache.solr.handler.SnapPuller$DirectoryFileFetcher.cleanup(SnapPuller.java:1266) at org.apache.solr.handler.SnapPuller

Loading stopwords from Postgresql to Solr6

牧云@^-^@ 提交于 2019-12-11 04:42:18
问题 I am new to solr. I want to load synonyms or stopwords from DB instead of txt file to solr at analyzing phase. How can I acheive it in solr 6. I tried porting Solr-JDBC(https://github.com/shopping24/solr-jdbc), and I configured web.xml with code below: <resource-ref> <description>my datasource</description> <res-ref-name>jdbc/dsTest</res-ref-name> <res-type>javax.sql.DataSource</res-type> <res-auth>Container</res-auth> </resource-ref> and solr-jetty-context.xml with: <New id="dsTest" class=

How to setup Solr Cloud with two search servers?

别来无恙 提交于 2019-12-10 18:50:49
问题 Hi I'm developing rails project with sunspot solr and configuring Solr Cloud. My environment: rails 3.2.1, ruby 2.1.2, sunspot 2.1.0, Solr 4.1.6. Why SolrCloud: I need more stable system - oftentimes search server goes on maintenance and web application stop working on production. So, I think about how to make 2 identical search servers instead of one, to make system more stable: if one server will be down, other will continue working. I cannot find any good turtorial with simple, easy to

Sorting on text field in solr

坚强是说给别人听的谎言 提交于 2019-12-10 17:50:57
问题 I am using solr 3.4 and want to have solr search results to be sorted on text field. How can I achieve sorting on text field as with int natural ordering? Is there any way to convert text field to int during query? My sort field is of string type and I want it to be behave as int field while sorting. I can not change field type as the index size is too large and can not re index. Any idea how can I achieve this? 回答1: Not very sure, but you can check for the Function Queries with functions

SolrJ and Auto Commit

♀尐吖头ヾ 提交于 2019-12-10 14:53:56
问题 I am adding documents to a Solr 4.3 core using SolrJ API , I noticed that I have the autocommit set to 15 seconds in the stock solrconfig that I am using as below. <autoCommit> <maxTime>${solr.autoCommit.maxTime:15000}</maxTime> <openSearcher>false</openSearcher> </autoCommit> My undestanding is that since the auto commit is set to true it means that the Solr instance would be auto commiting anyhow every 15 seconds, so I would not need to commit explicity using the SolrJ API as in below

Solr: Adding cores using a configSet in a SolrCloud setup does not work

混江龙づ霸主 提交于 2019-12-10 12:12:50
问题 I have a SolrCloud (v5.2) setup with 2 different cores [ users, docs ] working properly. Now I need to be able to dynamically add new cores that are configuration-wise identical to the docs core that already exists. I don't want to duplicate the configuration (I tried an it works OK) as I need to actually share the synonyms.txt and some other files that constantly change. I think that I need "configsets" but I cannot get them to work. I uploaded the "template" conf using this command: sh

Solr Custom Similarity - Using a field from the indexed document

佐手、 提交于 2019-12-10 03:59:36
问题 We are currently on a very old version of Lucene V 4.X and are now migrating to Solr V 7.4.0 cloud. We had a custom Similarity Class that we use to influence the score using an indexed field ("RANK") we have in the documents. Here is how the classes looks like - CustomSimilarity.java public class CustomSimilarity extends Similarity { private final Similarity sim; private final double coefficiency; private String popularityRank; static InfoStream infoStream; public CustomSimilarity() { this

SolrCloud: workaround for classic pagination with “start,rows” parameters

依然范特西╮ 提交于 2019-12-08 12:02:01
问题 I have SolrCloud with 3 shards. My purpose : select and process all products from category. Current implementation : Portion selection in cycle. 1st iteration: q=cat:1&start=0&rows=100 2nd iteration: q=cat:1&start=100&rows=100 3th: q=cat:1&start=200&rows=100 ... But growing "start", performance is down. Explanation here: https://wiki.apache.org/solr/DistributedSearch Makes it more inefficient to use a high "start" parameter. For example, if you request start=500000&rows=25 on an index with

Solr server HttpParse error

混江龙づ霸主 提交于 2019-12-07 19:36:46
问题 Hi when i am sending a query to solr server using http client , solr server prints exception log saying HttpParser full. I am running solrcloud 4.0 in jetty server. Is this a problem with jetty ? What could be the reason for this and how can i resolve it ? 来源: https://stackoverflow.com/questions/15973884/solr-server-httpparse-error

Solrcloud multicore configuration

爱⌒轻易说出口 提交于 2019-12-07 13:33:43
问题 I have a standalone Solr instance with 4 different cores working fine using the embedded Jetty server. I configured the cores for v4.10.3 but since I moved to v5.1 and all seems to work fine without any changes. Before going into production, I need to set it up as a Solrcloud installation, initially with 2 nodes (two different machines) with 1 shard per node (to keep it simple). I have been trying to get it to work but I have not been able to do it. I tried to run it like this (I think using