apache-storm

Storm Cluster Duplicate Tuples

吃可爱长大的小学妹 提交于 2019-12-04 18:51:00
Currently I am working on a project where I have setup a Storm cluster across four Unix hosts. The topology itself is as follows: JMS Spout listens to an MQ for new messages JMS Spout parses and then emits the result to an Esper Bolt The Esper Bolt then processes the event and emits a result to a JMS Bolt The JMS Bolt then publishes the message back onto the MQ on a different topic I realize that Storm is a "at least-once" framework. However, if I receive 5 events and pass these onto the Esper Bolt for counting then for some reason I am receiving 5 count results in the JMS Bolt(all the same

Running Trident Topology in Storm TrackedTopology Unit Test

送分小仙女□ 提交于 2019-12-04 17:33:07
How can I run a JUnit test of a Trident Topology to allow tuples to flow through the topology while testing and verifying the output at each stage? I've tried running within Storm's Testing framework, but it's falling short of allowing verification and consistent execution of Trident. Here's an example topology with some in-line comments where I'm having the most issues. import static org.junit.Assert.assertEquals; import java.util.Arrays; import java.util.List; import org.junit.Test; import storm.trident.TridentState; import storm.trident.TridentTopology; import storm.trident.operation

Storm Topology not submit

风流意气都作罢 提交于 2019-12-04 10:54:14
i have configured my machine zookeeper,nimbus,supervisor are running properly and my topology working in LocalCluster LocalCluster cluster = new LocalCluster(); cluster.submitTopology("SendPost", conf, builder.createTopology()); Utils.sleep(10000000000l); cluster.killTopology("SendPost"); cluster.shutdown(); now i want try submit my topology bt it not working /usr/local/storm/bin$ ./storm jar /home/winoria/Desktop/Storm/storm-starter/target/storm-starter-0.0.1-SNAPSHOT-jar-with-dependencies.jar com.winoria.post.PostTopology Post i getting following error SLF4J: Class path contains multiple

Storm dynamic topology

你离开我真会死。 提交于 2019-12-04 09:59:02
Does Storm support dynamic topology? The functionality I want from this is to dynamically change the topology according to the user requirement while the Storm topology is running. For example, when user want to know the top-10 words of a stream, I use the top-10 bolt to process it, when user want to know something else, I use the other bolt to process the stream and 'unplug' the top-10 bolt. I know it could be done by partition the stream or duplicate the stream and alway running every functionalities and only demo the data we want, or we could shut down the stream and update another topology

storm-starter with intellij idea,maven project could not find class

谁说我不能喝 提交于 2019-12-04 09:42:52
I'm beginner of storm and intellij idea,when I import storm-starter(apache-storm-0.9.5.zip) to intellij idea(14 CE OS),everything is OK,but when I run the "ExclamationTopology" ,a problem appears as follow: Exception in thread "main" java.lang.NoClassDefFoundError: backtype/storm/topology/IRichSpout at java.lang.Class.forName0(Native Method) at java.lang.Class.forName(Class.java:264) at com.intellij.rt.execution.application.AppMain.main(AppMain.java:122) Caused by: java.lang.ClassNotFoundException: backtype.storm.topology.IRichSpout at java.net.URLClassLoader.findClass(URLClassLoader.java:381)

Apache Storm compared to Hadoop

我的梦境 提交于 2019-12-04 07:31:56
问题 How does Storm compare to Hadoop? Hadoop seems to be the defacto standard for open-source large scale batch processing, does Storm has any advantages over hadoop? or Are they completely different? 回答1: Why don't you tell your opinion. http://www.infoq.com/news/2011/09/twitter-storm-real-time-hadoop/ http://engineering.twitter.com/2011/08/storm-is-coming-more-details-and-plans.html Twitter Storm has been touted as real time Hadoop. That is more a marketing take for easy consumption. They are

Apache Kafka vs Apache Storm

﹥>﹥吖頭↗ 提交于 2019-12-04 07:23:10
问题 Apache Kafka: Distributed messaging system Apache Storm: Real Time Message Processing How we can use both technologies in a real-time data pipeline for processing event data? In terms of real time data pipeline both seems to me do the job identical. How can we use both the technologies on a data pipeline? 回答1: You use Apache Kafka as a distributed and robust queue that can handle high volume data and enables you to pass messages from one end-point to another. Storm is not a queue. It is a

Unable to run a storm-starter topology from the Storm tutorial

假装没事ソ 提交于 2019-12-04 06:48:58
When following the guide in the storm-starter repo , I'm unable to actually run any topology- like the ExclamationTopology . mvn clean install -DskipTests=true ran successfully, executed from the top level Storm repo, as did mvn package at the storm-examples level. When I try to run storm jar target/storm-starter-2.0.0-SNAPSHOT.jar org.apache.storm.starter.ExclamationTopology , I get the error: Error: A JNI error has occurred, please check your installation and try again Exception in thread "main" java.lang.NoClassDefFoundError: org/apache/storm/topology/IRichSpout I'm running OS X, Java

Compile Error when killing topology from storm bolt programatically: object and package with same name

旧时模样 提交于 2019-12-04 05:23:17
问题 I was looking for how to kill a topology from a storm bolt in storm 1.2.1 in my scala project. From this answer, I wrote following code to do it: private def shutTopology(){ import org.apache.storm.utils.Utils import org.apache.storm.utils.NimbusClient val conf = Utils.readStormConfig val nimbusClient = NimbusClient.getConfiguredClient(conf).getClient nimbusClient.deactivate(topology_name) } But it given following error in line: val conf = Utils.readStormConfig , How to resolve this? Error:

Storm - Supervisors crashing on reboot

不问归期 提交于 2019-12-04 01:24:52
This is an issue that is simply driving me nuts. I have a one machine Storm instance running on my Local LAN. I am currently running v0.9.1-incubating release version (from the Apache Incubator site . The issue is simply that my storm supervisor process refuses to start after EVERY SINGLE reboot. The hack fix is quite simple, remove the supervisor and workers folders from the storm local directory and re run the process; things run hunky dory then on until next reboot. I'm providing every bit of information I think might be relevant to debug this issue. Please ask for more if needed, but just