cluster-computing

Condor job using DAG with some jobs needing to run on the same host

こ雲淡風輕ζ posted on 2020-01-15 04:24:06
Question: I have a computation task that is split into several individual program executions, with dependencies. I'm using Condor 7 as the task scheduler (with the Vanilla Universe, due to constraints on the programs beyond my reach, so no checkpointing is involved), so a DAG looks like a natural solution. However, some of the programs need to run on the same host. I could not find a reference on how to do this in the Condor manuals. Example DAG file:
JOB A A.condor
JOB B B.condor
JOB C C.condor
JOB D D.condor
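
The excerpt is cut off before any dependency lines, so as a rough illustration only, here is a minimal Python sketch that renders a DAGMan input file from the four jobs above. The helper `render_dag` and the diamond-shaped dependencies (A before B and C, both before D) are assumptions made for the example, not taken from the question, and the sketch does not address the same-host requirement.

```python
def render_dag(jobs, edges):
    """Render DAGMan input text from a job table and parent -> children edges."""
    # One "JOB <name> <submit file>" line per job, in insertion order.
    lines = [f"JOB {name} {submit_file}" for name, submit_file in jobs.items()]
    # One "PARENT <p> CHILD <c1> <c2> ..." line per parent.
    for parent, children in edges.items():
        lines.append(f"PARENT {parent} CHILD {' '.join(children)}")
    return "\n".join(lines) + "\n"


# Job names and submit files as listed in the question.
jobs = {"A": "A.condor", "B": "B.condor", "C": "C.condor", "D": "D.condor"}

# Hypothetical dependencies, chosen only to illustrate the syntax.
edges = {"A": ["B", "C"], "B": ["D"], "C": ["D"]}

dag_text = render_dag(jobs, edges)
print(dag_text)
```

The resulting text could be saved to a file and handed to `condor_submit_dag`; whether that layout matches the asker's real workflow is unknown.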

R: making cluster in doParallel / snowfall hangs

久未见 posted on 2020-01-13 13:07:02
Question: I've got two servers on a LAN with fresh installs of CentOS 6.4 minimal and R 3.0.1. Both computers have the doParallel, snow, and snowfall packages installed. The servers can ssh to each other fine. When I attempt to make clusters in either direction, I get a prompt for a password, but after entering the password, it just hangs there indefinitely. makePSOCKcluster("192.168.1.1",user="username") How can I troubleshoot this? Edit: I also tried calling makePSOCKcluster on the above-mentioned

R: making cluster in doParallel / snowfall hangs

风格不统一 posted on 2020-01-13 13:06:11
Question: I've got two servers on a LAN with fresh installs of CentOS 6.4 minimal and R 3.0.1. Both computers have the doParallel, snow, and snowfall packages installed. The servers can ssh to each other fine. When I attempt to make clusters in either direction, I get a prompt for a password, but after entering the password, it just hangs there indefinitely. makePSOCKcluster("192.168.1.1",user="username") How can I troubleshoot this? Edit: I also tried calling makePSOCKcluster on the above-mentioned

TSQL - how to tell if SQL is clustered?

孤街醉人 posted on 2020-01-13 09:54:11
Question: Is there a way, using TSQL, a CMD script, or VBScript, to tell: 1) if SQL Server is clustered on the server, and 2) if it is the active node or not? Thanks.
Answer 1: Both of these can be found using SELECT * FROM fn_virtualservernodes() If there are no results, it's not clustered.
Answer 2: For the active node, you can try this to check the name of the node on which the clustered SQL Server instance is running: Select ServerProperty('ComputerNamePhysicalNetBIOS') and for checking whether it's clustered: SELECT

Does Apache Mesos recognize GPU cores?

偶尔善良 posted on 2020-01-13 02:42:10
Question: In slide 25 of this talk by Twitter's Head of Open Source office, the presenter says that Mesos allows one to track and manage even GPU (I assume he meant GPGPU) resources. But I can't find any information on this anywhere else. Can someone please help? Besides Mesos, are there other cluster managers that support GPGPU?
Answer 1: Mesos does not yet provide direct support for (GP)GPUs, but does support custom resource types. If you specify --resources="gpu(*):8" when starting the mesos-slave, then
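
To make the `name(role):value` shape of that flag concrete, here is a small Python sketch that splits such a string into its parts. The function `parse_resources` is a hypothetical helper written for this illustration, not part of Mesos, and it only handles scalar values such as `8`.

```python
import re

# name, optional (role), then a scalar value; e.g. "gpu(*):8"
_RESOURCE = re.compile(r"^(?P<name>\w+)(?:\((?P<role>[^)]*)\))?:(?P<value>.+)$")


def parse_resources(spec):
    """Parse a ';'-separated resource string into (name, role, amount) tuples.

    Illustrative only: scalar values are supported, other value kinds are not.
    """
    resources = []
    for item in spec.split(";"):
        item = item.strip()
        if not item:
            continue
        match = _RESOURCE.match(item)
        if match is None:
            raise ValueError(f"unrecognised resource entry: {item!r}")
        # A missing role is treated here as the wildcard role "*".
        role = match["role"] or "*"
        resources.append((match["name"], role, float(match["value"])))
    return resources


parsed = parse_resources("gpu(*):8")
print(parsed)
```

Here `gpu` is simply a label chosen by the operator; per the answer, Mesos treats it as a custom resource type rather than as a GPU it knows about.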

What are the differences between a node, a cluster and a datacenter in a cassandra nosql database?

久未见 posted on 2020-01-11 14:49:26
Question: I am trying to duplicate data in a Cassandra NoSQL database for a school project using DataStax OpsCenter. From what I have read, there are three keywords: cluster, node, and datacenter. From what I have understood, the data in a node can be duplicated in another node that exists in another cluster, and all the nodes that contain the same (duplicated) data compose a datacenter. Is that right? If not, what is the difference?
Answer 1: The hierarchy of elements in Cassandra is: Cluster
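
The answer text is cut off after "Cluster". As a rough aid, the Python sketch below models the containment commonly described for Cassandra (a cluster holds datacenters, a datacenter holds racks, a rack holds nodes). It reflects general Cassandra terminology rather than the answer's own wording, and every name and address in it is made up for the example.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass
class Node:
    address: str  # hypothetical address, for illustration only


@dataclass
class Rack:
    name: str
    nodes: List[Node] = field(default_factory=list)


@dataclass
class Datacenter:
    name: str
    racks: List[Rack] = field(default_factory=list)


@dataclass
class Cluster:
    name: str
    datacenters: List[Datacenter] = field(default_factory=list)

    def node_count(self) -> int:
        """Count every node in every rack of every datacenter."""
        return sum(len(rack.nodes) for dc in self.datacenters for rack in dc.racks)


# One cluster, two datacenters, three nodes in total (all names hypothetical).
cluster = Cluster(
    name="demo",
    datacenters=[
        Datacenter("dc1", [Rack("rack1", [Node("10.0.0.1"), Node("10.0.0.2")])]),
        Datacenter("dc2", [Rack("rack1", [Node("10.1.0.1")])]),
    ],
)
print(cluster.node_count())
```

In this picture, replicas of the same data live on several nodes inside one cluster, which is the reverse of the nesting the question assumes.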

How to make RabbitMQ scalable?

扶醉桌前 posted on 2020-01-11 05:38:09
Question: I tried to test RabbitMQ, but I found some problems: with a cluster of 3 nodes, I can't publish/deliver more than 6000/s. On the other hand, with a single node, I can publish/deliver up to 25000/s. In other words, the more nodes I add, the more performance deteriorates. But according to this article: https://blog.pivotal.io/pivotal/products/rabbitmq-hits-one-million-messages-per-second-on-google-compute-engine they can publish more than 1 million, so how they can

How to run binary executables in multi-thread HPC cluster?

匆匆过客 posted on 2020-01-08 02:32:23
Question: I have this tool called cgatools from Complete Genomics (http://cgatools.sourceforge.net/docs/1.8.0/). I need to run some genome analyses on a High-Performance Computing cluster. I tried to run the job allocating more than 50 cores and 250 GB of memory, but it only uses one core and limits the memory to less than 2 GB. What would be my best option in this case? Is there a way to run binary executables on an HPC cluster so that they use all the allocated memory?
Answer 1: The scheduler just runs the binary
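
The answer is cut off here. One common way to use more of an allocation when the binary itself is single-threaded is to launch several independent instances side by side, which only helps if the work can be split into independent inputs. The Python sketch below shows that pattern under that assumption; `run_one` and `run_many` are hypothetical helpers, and the commands are stand-ins that call the Python interpreter, not real cgatools invocations.

```python
import subprocess
import sys
from concurrent.futures import ThreadPoolExecutor


def run_one(args):
    """Run one external command and return its stdout, raising on failure."""
    result = subprocess.run(args, capture_output=True, text=True, check=True)
    return result.stdout.strip()


def run_many(commands, max_workers):
    """Run independent commands concurrently, keeping results in input order."""
    # Threads are enough here: each one just waits on a separate OS process.
    with ThreadPoolExecutor(max_workers=max_workers) as pool:
        return list(pool.map(run_one, commands))


# Stand-in commands; in practice each would be the binary plus one input chunk.
commands = [[sys.executable, "-c", f"print({i} * {i})"] for i in range(4)]
outputs = run_many(commands, max_workers=4)
print(outputs)
```

Setting `max_workers` to the number of allocated cores keeps one process per core; memory use still depends on what each instance needs.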

pbs job no output when busy

妖精的绣舞 posted on 2020-01-07 05:30:54
Question: I am experiencing a problem with PBS where, of all the jobs I submit, a fraction tend not to produce any output as they should. I have to resubmit them several times until they have all produced the output. I have also noticed that this is especially bad when other users submit large numbers of jobs. In this case, ALL of my jobs fail to produce the expected output files. I'm only a user of PBS, so I don't understand what is going on. If anyone can give some suggestions that'd be

NodeJS cluster not using round robin (Developing on windows)

为君一笑 posted on 2020-01-07 04:14:33
Question: Hi, I'm currently developing a MEAN stack project and I found the Node.js cluster module, which is recommended before you deploy the project. I've applied it and run some ApacheBench tests; the thing is, only 1 worker is responding, and requests are not rotating around my other 7 workers. I've googled this and, yes, by default Windows can't do round-robin, but is there a way to enable round-robin on Windows? Thank you!
Answer 1: In node v4.x and newer you can set the default scheduling policy by