apache-zookeeper | 易学教程

Kafka on Kubernetes multi-node

Question: My objective is to set up a cluster of several Kafka brokers in a distributed fashion, but I can't see how to make the brokers aware of each other. As far as I understand, every broker needs a distinct ID in its config, which I cannot guarantee or configure if I launch the containers from Kubernetes. Do they also need to have the same advertised_host? Are there any parameters I'm missing that would need to be changed for the nodes to discover each other? Would it be viable to do
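One possible way to get a distinct ID per broker, sketched here under the assumption that the brokers run as a Kubernetes StatefulSet: StatefulSet pods get stable names of the form `<name>-<ordinal>`, so the ordinal can serve as the broker ID. The function name and the pod name `kafka-2` are hypothetical, and this is only an illustration, not the approach from the original thread.

```python
import re


def broker_id_from_hostname(hostname):
    """Return the trailing ordinal of a pod name such as 'kafka-2'."""
    # Drop any domain suffix first (assumption: the pod name is the first label).
    short_name = hostname.split(".")[0]
    match = re.search(r"-(\d+)$", short_name)
    if match is None:
        raise ValueError(f"no trailing ordinal in hostname: {hostname!r}")
    return int(match.group(1))


# Hypothetical pod name; inside a real pod, socket.gethostname() would supply it.
print(broker_id_from_hostname("kafka-2"))
```

A container entrypoint could write the returned number into the broker's config before starting Kafka.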

Kafka Cluster - Producer

Question: I have several questions about Kafka. If someone can help by answering any of them, I will be very thankful. Thank you in advance :) Q1) I know that partitions are split across Kafka brokers, but what is the split based on? For instance, if I have 3 brokers and 6 partitions, how do I ensure that each broker gets 2 partitions? How is this split currently made in Kafka? Q2) When a producer sends a new message, what is the default format of the message? Avro format? How can I
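For Q1, a simplified sketch of round-robin placement may help show why 6 partitions over 3 brokers come out as 2 each. It is not Kafka's actual code: real assignment also places replicas and can take racks into account. The function name and the `start_index` parameter are assumptions for illustration.

```python
def assign_partitions(num_partitions, broker_ids, start_index=0):
    """Spread partitions over brokers round-robin (simplified sketch)."""
    assignment = {broker: [] for broker in broker_ids}
    for partition in range(num_partitions):
        # Walk the broker list cyclically, beginning at start_index.
        broker = broker_ids[(start_index + partition) % len(broker_ids)]
        assignment[broker].append(partition)
    return assignment


layout = assign_partitions(6, [0, 1, 2])
for broker, partitions in layout.items():
    print(f"broker {broker}: partitions {partitions}")
```

Because the walk is cyclic, the per-broker counts differ by at most one whatever the starting broker is.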

How to run the same script on several Linux systems concurrently

Question: I have a question about latency benchmarking. I run Apache ZooKeeper in a cluster of 5 machines (one leader; the rest are followers). Another machine (the client) is used to send a sequence of requests to the protocol. I have a benchmark program that runs for a preselected time and aims to send requests simultaneously and continuously to each ZooKeeper server. When the preselected time elapses, I can see the latency result. However, the above benchmark uses only one client machine
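A minimal sketch of starting the same command on several machines at roughly the same time, assuming passwordless SSH access to each host. The host names, the script name, and both function names are hypothetical. The runner is injectable so that the example below can run without any real SSH connection.

```python
import subprocess
from concurrent.futures import ThreadPoolExecutor


def run_over_ssh(host, command):
    """Run `command` on `host` through the ssh CLI and return its stdout."""
    result = subprocess.run(
        ["ssh", host, command], capture_output=True, text=True, check=True
    )
    return result.stdout


def run_on_all(hosts, command, runner=run_over_ssh):
    """Submit `command` for every host to a thread pool and collect the results."""
    with ThreadPoolExecutor(max_workers=max(1, len(hosts))) as pool:
        futures = {host: pool.submit(runner, host, command) for host in hosts}
        return {host: future.result() for host, future in futures.items()}


def fake_runner(host, command):
    """Stand-in for run_over_ssh so the example needs no network."""
    return f"{host} ran {command}"


print(run_on_all(["client1", "client2"], "./benchmark.sh", runner=fake_runner))
```

The threads only make the launches close together in time; they do not synchronize the clocks of the client machines.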

Learning Zookeeper - Help me with example

Question: I'm trying to wrap my head around ZooKeeper and what it does. So far, my experience with ZooKeeper has been through other software that requires it (Solr and Kafka), so my basic understanding is the very vague "you'd better use ZooKeeper to keep your configuration straight". So help me think through a simple example problem. Let's say that I build my own service that does "stuff". There are two things that I want to protect: I want to have as little downtime as possible (gotta

Multiple shards on single machine performance

Question: Does it make sense to have multiple shards in Elasticsearch if I am going to use only a single machine? Will it improve performance in any way? Same question for Apache Solr: does it make sense to use SolrCloud with ZooKeeper for a single server instance, or should I just create one core without any sharding? Let's assume I am not going to use other machines in the future, so the main point is how sharding on a single machine influences search engine performance. Answer 1: There are certain parts of Lucene that's

java.io.IOException: Connection to kafka VIA port 6667 failed at kafka.utils.NetworkClientBlockingOps$.awaitReady

Question: We have 3 Kafka machines and 3 ZooKeeper machines. The Kafka machines are separate from the ZooKeeper machines, and all machines run Red Hat 7.x. Under /var/log/kafka we can see the following java.io.IOException in server.log: at kafka.utils.ShutdownableThread.run(ShutdownableThread.scala:63) [2018-10-07 07:42:13,342] WARN [ReplicaFetcherThread-0-1007], Error in fetch kafka.server.ReplicaFetcherThread$FetchRequest@407ac4e4 (kafka.server.ReplicaFetcherThread) java.io

Distributed Application using Zookeeper

Question: I need to design a distributed application using ZooKeeper. This is the first time I am using ZooKeeper, so I am a little confused about its usage. I have read that the Zab protocol ensures serializability when it comes to multiple updates, but what I don't understand is this: if that is the case, then it should automatically allow a lock-free implementation. So why are locks needed in the first place? It would be great if someone could help here. Thanks. Answer 1: The Zab protocol is a critical part of ZooKeeper. It ensures

ZooKeeper C binding installation

Question: I am trying to install the C binding for ZooKeeper while ssh'ed into another computer, using a build system: i386-pc-solaris2.11 (I don't know if that is important). Anyway, the computer I ssh'ed into is running a Linux-based OS. I ran the following on the command line: cd ../zookeeper cd src cd c ./configure make The commands above were taken from the Apache website for setting up the C shell client. The error I am getting originates from make: libtool: link: ( cd ".libs"

When should CuratorFramework's start and close methods be called?

Question: According to the documentation, CuratorFramework instances are fully thread-safe, and you should share one CuratorFramework per ZooKeeper cluster in your application. It also states that the start method must be called before any other operations can be performed. Should this method be called before every operation, or is calling it once after initialization enough? Answer 1: Call start once after creating the CuratorFramework instance. So, the startup is: Create the CuratorFramework via one of the

How to force/reproduce FullGC in JVM?

Question: Is there a way to force/reproduce a Full GC in the JVM for x seconds? I need this to verify the root cause of an issue in a heartbeat-based app (a ZooKeeper client). EDIT: Do the Unix commands kill -STOP <pid> and kill -CONT <pid> simulate a Full GC (stop-the-world behaviour)? Answer 1: You can simulate a very long stop-the-world event on HotSpot JVMs which is similar to a Full GC from the user's point of view. HotSpot doesn't put safepoints into counted int loops, because it assumes that they
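The kill -STOP / kill -CONT idea from the EDIT can be sketched as below, assuming a POSIX system. SIGSTOP suspends every thread of the target process at the OS level, so heartbeats stop for the chosen duration. It is not a real Full GC: no collection happens inside the JVM, so treat it only as a rough stand-in for the pause. The function name is hypothetical, and the child here is a plain sleeping Python process rather than a JVM.

```python
import os
import signal
import subprocess
import sys
import time


def freeze_process(pid, seconds):
    """Suspend the process `pid` for `seconds`, then let it continue."""
    os.kill(pid, signal.SIGSTOP)  # same effect as: kill -STOP <pid>
    try:
        time.sleep(seconds)
    finally:
        os.kill(pid, signal.SIGCONT)  # same effect as: kill -CONT <pid>


# Demo target: a child process that just sleeps (stand-in for the real app).
child = subprocess.Popen([sys.executable, "-c", "import time; time.sleep(30)"])
freeze_process(child.pid, 0.2)
print("child still running:", child.poll() is None)
child.terminate()
child.wait()
```

Choosing a pause longer than the ZooKeeper session timeout would be one way to check whether the session expires during such a freeze.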