cluster-computing | 易学教程

Can MySQL Cluster handle a terabyte database

阅读更多关于 Can MySQL Cluster handle a terabyte database

问题 I have to look into solutions for providing a MySQL database that can handle data volumes in the terabyte range and be highly available (five nines). Each database row is likely to have a timestamp and up to 30 float values. The expected workload is up to 2500 inserts/sec. Queries are likely to be less frequent but could be large (maybe involving 100Gb of data) though probably only involving single tables. I have been looking at MySQL Cluster given that is their HA offering. Due to the volume

Starting remote processes in a Windows network

阅读更多关于 Starting remote processes in a Windows network

问题 I have several slave machines and a master machine which together run a distributed application. Processes on each slave machine have to have a GUI and network access (I think it would be called an interactive process then). For ease of use it would be nice if the master machine could start/stop the processes on those slave machines. My first idea was to use WMI and the Win32_Process class to start a remote process but upon further investigation it was reveiled that processes started this way

Starting Zookeeper Cluster. Error: Could not find or load main class org.apache.zookeeper.server.quorum.QuorumPeerMain

阅读更多关于 Starting Zookeeper Cluster. Error: Could not find or load main class org.apache.zookeeper.server.quorum.QuorumPeerMain

问题 (I'm running on CentOS 5.8). I've been following the direction for a Clustered (Multiserver) Zookeeper Set-up, but getting an error when I try to start up my server. When I run the command as described in the documentation: java -cp zookeeper-3.4.6.jar:lib/log4j-1.2.16.jar:conf \ org.apache.zookeeper.server.quorum.QuorumPeerMain conf/zoo.cfg I get the error: Error: Could not find or load main class org.apache.zookeeper.server.quorum.QuorumPeerMain I have my files location as such and am

Hadoop on windows server

阅读更多关于 Hadoop on windows server

I'm thinking about using hadoop to process large text files on my existing windows 2003 servers (about 10 quad core machines with 16gb of RAM) The questions are: Is there any good tutorial on how to configure an hadoop cluster on windows? What are the requirements? java + cygwin + sshd ? Anything else? HDFS, does it play nice on windows? I'd like to use hadoop in streaming mode. Any advice, tool or trick to develop my own mapper / reducers in c#? What do you use for submitting and monitoring the jobs? Thanks From the Hadoop documentation : Win32 is supported as a development platform .

Is there a way to add nodes to a running Hadoop cluster?

阅读更多关于 Is there a way to add nodes to a running Hadoop cluster?

I have been playing with Cloudera and I define the number of clusters before I start my job then use the cloudera manager to make sure everything is running. I’m working on a new project that instead of using hadoop is using message queues to distribute the work but the results of the work are stored in HBase. I might launch 10 servers to process the job and store to Hbase but I’m wondering if I later decided to add a few more worker nodes can I easily (read: programmable) make them automatically connect to the running cluster so they can locally add to clusters HBase/HDFS? Is this possible

How does weblogic clustering work?

阅读更多关于 How does weblogic clustering work?

I'm new to weblogic. I've read http://download.oracle.com/docs/cd/E11035_01/wls100/cluster/overview.html and searched this topic on the internet but still had a hard time understanding some of weblogic's clustering concepts. Can anybody confirm/correct my understandings below? a cluster contains one or more logical servers which can reside on one or many physical servers when deploying a j2ee app to a cluster, it is tied to one server in that cluster external users of the deployed app aren't aware of clustering the log file of that app is located on the server it's deployed if the server

Slave nodes not in Yarn ResourceManager

阅读更多关于 Slave nodes not in Yarn ResourceManager

问题 I've set up a 3 node Apache Hadoop cluster. On master node, I can see [hadoop-conf]$ jps 16856 DataNode 17051 SecondaryNameNode 16701 NameNode 21601 ResourceManager 21742 NodeManager 18335 JobHistoryServer and on slave nodes, I see [fedora20-template dfs]$ jps 28677 Jps 28510 NodeManager 27449 DataNode I can see three live nodes from master:50070. However, in the ResourceManager Web UI (http://master:8088/cluster/nodes), I can see only master node. Why are the two slave nodes not in the

Spread vs MPI vs zeromq?

阅读更多关于 Spread vs MPI vs zeromq?

In one of the answers to Broadcast like UDP with the Reliability of TCP , a user mentions the Spread messaging API. I've also run across one called ØMQ . I also have some familiarity with MPI . So, my main question is: why would I choose one over the other? More specifically, why would I choose to use Spread or ØMQ when there are mature implementations of MPI to be had? MPI was deisgned tightly-coupled compute clusters with fast, reliable networks. Spread and ØMQ are designed for large distributed systems. If you're designing a parallel scientific application, go with MPI, but if you are

How do I use Node.js clusters with my simple Express app?

阅读更多关于 How do I use Node.js clusters with my simple Express app?

— I built a simple app that pulls in data (50 items) from a Redis DB and throws it up at localhost. I did an ApacheBench (c = 100, n = 50000) and I'm getting a semi-decent 150 requests/sec on a dual-core T2080 @ 1.73GHz (my 6 y.o laptop), but the proc usage is very disappointing as shown: Only one core is used, which is as per design in Node, but I think I can nearly double my requests/sec to ~300, maybe even more, if I can use Node.js clusters. I fiddled around quite a bit but I haven't been able to figure out how to put the code given here for use with my app which is listed below: var

Quartz Scheduler: Trigger some jobs on every cluster node and some only once per cluster

阅读更多关于 Quartz Scheduler: Trigger some jobs on every cluster node and some only once per cluster

问题 I am using Quartz Scheduler as a Spring bean in a clustered environment. I have some jobs annotated with @NotConcurrent and they are running once per cluster (i.e. only in one node, only in one thread). Now I need to run one job on every node of the cluster. I removed the @NotConcurrent annotation, but it only run on every thread on one machine. It does not get fired on other nodes. What should I annotate the job with? Example: Job1 NotConcurrent annotated is scheduled at midnight => It fires