Setup and configuration of Titan for a Spark cluster and Cassandra
There are already several questions on the Aurelius mailing list, as well as here on Stack Overflow, about specific problems with configuring Titan to work with Spark. What is missing, in my opinion, is a high-level description of a simple setup that uses Titan and Spark. I am looking for a reasonably minimal setup that uses recommended settings. For example, for Cassandra, the replication factor should be 3 and a dedicated datacenter should be used for analytics. From the information I found in the documentation of Spark, Titan, and Cassandra, such a minimal setup could look