solrcloud

Error when start Solr cloud, due to location of 3rd party libraries

醉酒当歌 提交于 2019-12-20 15:11:27
问题 I tried to migrate to Solr 3.1 , my project uses Dataimport handler , when I started solr it asked me SolrCoreAwar not found I copied the following file to lib directory apache-solr-dataimporthandler-3.1-SNAPSHOT.jar but again an error appeared when stating solr I need to fix this problem , if you can help Thank You The next message is the exception I get When Starting Solr 3.1 java.lang.NoClassDefFoundError: Could not initialize class org.slf4j.LoggerFactory at org.apache.solr.handler

SolrCloud(集群,基于zookeeper集群)的搭建与使用

好久不见. 提交于 2019-12-20 09:36:23
SolrCloud简介 SolrCloud(solr 云)是Solr提供的分布式搜索方案,当你 需要大规模,容错,分布式索引和检索能力 时使用 SolrCloud。当一个系统的索引数据量少的时候是不需要使用SolrCloud的,当索引量很大,搜索请求并发很高,这时需要使用SolrCloud来满足这些需求。 SolrCloud是基于Solr和Zookeeper的分布式搜索方案,它的主要思想是使用Zookeeper作为集群的配置信息中心。 它有几个特色功能: 1)集中式的配置信息 2)自动容错 3)近实时搜索 4)查询时自动负载均衡 Solr集群的系统架构 1.1. 物理结构 三个 Solr 实例( 每个实例包括两个 Core ),组成一个 SolrCloud 。 1.2. 逻辑结构 索引集合包括两个 Shard ( shard1 和 shard2 ), shard1 和 shard2 分别由三个 Core 组成,其中一个 Leader 两个 Replication , Leader 是由 zookeeper 选举产生, zookeeper 控制每个 shard 上三个 Core 的索引数据一致,解决高可用问题。 用户发起索引请求分别从 shard1 和 shard2 上获取,解决高并发问题。 1.2.1. collection Collection 在 SolrCloud

Solr 5.0.0 is not starting properly in CentOS

a 夏天 提交于 2019-12-13 04:33:20
问题 When I running command bin/solr start -e cloud it is not asking me to collection name and other information like no of replicas and configuration settings. I got following output Welcome to the SolrCloud example! This interactive session will help you launch a SolrCloud cluster on your local workstation. To begin, how many Solr nodes would you like to run in your local cluster? (specify 1-4 nodes) [2] 2 Ok, let's start up 2 Solr nodes for your example SolrCloud cluster. Please enter the port

Zookeeper timeouts without error in zookeeper Solr

一笑奈何 提交于 2019-12-13 04:13:18
问题 We are facing issue with solr/zookeeper where zookeeper timeouts after 10000ms. Error below. SolrException: java.util.concurrent.TimeoutException: Could not connect to ZooKeeper <server1>:9181,<server2>:9182,<server2>:9183 within 10000 ms. at org.apache.solr.common.cloud.SolrZkClient.<init>(SolrZkClient.java:184) at org.apache.solr.common.cloud.SolrZkClient.<init>(SolrZkClient.java:121) We are not getting any error in zookeeper logs.Except below logs 2018-12-19 04:35:22,305 [myid:2] - INFO

Spring Data Solr: HTTP ERROR 404 when updating Document

心已入冬 提交于 2019-12-13 04:12:25
问题 I am doing a simple Search exercise in solr. I followed this tutorial. http://www.baeldung.com/spring-data-solr I did all same. I run the test case I am getting an error like this: <head> <meta http-equiv="Content-Type" content="text/html;charset=utf-8"/> <title>Error 404 Not Found</title> </head> <body><h2>HTTP ERROR 404</h2> <p>Problem accessing /solr/categories/categories/update. Reason: <pre> Not Found</pre></p> </body> </html> ; nested exception is org.apache.solr.client.solrj.impl

How to filter a huge list of ids from Solr at runtime

烂漫一生 提交于 2019-12-13 03:29:06
问题 I have an index for products is Solr. I need to serve a customized list of products for each customer such that I have to exclude some specific products for each customer. Currently I am storing this relationship of customer & excluded products in a SQL database and then filtering them in Solr using a terms query. Is there a way I can store this relationship in Solr itself so that I dont have to calculate the exclude list every time from SQL first. Something very similar to what we can do in

Zookeeper multiple collection different schema

懵懂的女人 提交于 2019-12-12 20:17:42
问题 I have a zookeeper instance (Solr integrated version) with 3 collections. Each collection has a different schema.xml in it's conf directory. Zookeeper starts sucessfully - the goal is to do a distributed search across the different collections. My problem is that when I look at the schema of collection2 and collection3 via the admin tool they both appear to be using collection1 schema. So I cannot update collections 2 & 3 as they are missing their correct fields. I have tried each of the

Solr - Collections API timeouts

怎甘沉沦 提交于 2019-12-12 16:01:19
问题 I've got a setup with 3x zoo keeper's and 4x solrcloud node's. This is all working, all nodes are seeing each other and I initially had a default collection. From there, I used the collections API to create a new collection which successfully completed and all it's successfully sharded across 2 nodes, with the other 2 being used for replica's. I can also successfully save documents to that collection. Browsing the solr web GUI on any of the boxes all works, no speed issues. However, anytime I

Solrcloud- does it matter if I have even or odd number of shards?

你离开我真会死。 提交于 2019-12-12 05:47:40
问题 I had a few queries on choosing exact number of shards for collection and nodes in cloud- is there any impact on search/ingestion, if I choose even or odd number of shards? is there any thumb rule or guidelines for deciding number of shards and nodes in cloud? It would be really helpful if you could provide suggestions to plan solrcloud, collection(#shards) for below requirement: Data type: structured Expected data load: 3 TB Ingestion Strategy: 2 MM records( INSERT/UPDATE/DELETE requests) in

Solr : existing index

懵懂的女人 提交于 2019-12-12 05:41:36
问题 I work with solr to index some data. I need to check existing index before start another index. Is there any command to check existing index with solr? Thanks! 回答1: If you want to explore content of the index, you can use Luke which you can download in github : https://github.com/DmitryKey/luke/releases 来源: https://stackoverflow.com/questions/28882522/solr-existing-index