TITAN

Reading a large graph from Titan (on HBase) into Spark

浪尽此生 submitted on 2019-12-05 02:04:54
Question: I am researching Titan (on HBase) as a candidate for a large, distributed graph database. We require both OLTP access (fast, multi-hop queries over the graph) and OLAP access (loading all - or at least a large portion - of the graph into Spark for analytics). From what I understand, I can use the Gremlin server to handle OLTP-style queries where my result-set will be small. Since my queries will be generated by a UI I can use an API to interface with the Gremlin server. So far, so good. The

How to delete graph in Titan with Cassandra storage backend?

∥☆過路亽.° submitted on 2019-12-04 19:12:06
Question: I use Titan 0.4.0 All, running Rexster in shared VM mode on Ubuntu 12.04. How can I properly delete a graph in Titan that uses the Cassandra storage backend? I have tried TitanCleanup.clear(graph), but it does not delete everything. The indices are still there. My real issue is that I have an index which I don't want (it crashes every query); however, as I understand Titan's documentation, it is impossible to remove an index once it is created. Answer 1: You can clear all the edges

Gremlin remote command fails with timeout error: Host did not respond in a timely fashion

时光总嘲笑我的痴心妄想 submitted on 2019-12-04 10:48:30
I connected to a remote Gremlin Server from the Gremlin Groovy console. The connection succeeded, but every remote command I try to execute fails with a timeout error, even :> 1+1.
    gremlin> :remote connect tinkerpop.server conf/senthil.yaml
    ==>Connected - 10.40.40.65/10.40.40.65:50080
    gremlin> :> 1+1
    Host did not respond in a timely fashion - check the server status and submit again.
    Display stack trace? [yN]
    org.apache.tinkerpop.gremlin.groovy.plugin.RemoteException: Host did not respond in a timely fashion - check the server status and submit again.
    at org.apache.tinkerpop.gremlin.console

Setup and configuration of JanusGraph for a Spark cluster and Cassandra

北战南征 submitted on 2019-12-04 08:10:35
I am running JanusGraph (0.1.0) with Spark (1.6.1) on a single machine. I did my configuration as described here. When I access the graph from the Gremlin console with the SparkGraphComputer, it is always empty. I cannot find any error in the log files; it is just an empty graph. Is anyone using JanusGraph with Spark who can share their configuration and properties? With a plain JanusGraph instance, I get the expected output:
    gremlin> graph=JanusGraphFactory.open('conf/test.properties')
    ==>standardjanusgraph[cassandrathrift:[127.0.0.1]]
    gremlin> g=graph.traversal()
    ==>graphtraversalsource[standardjanusgraph

Reading a large graph from Titan (on HBase) into Spark

依然范特西╮ submitted on 2019-12-03 17:02:37
I am researching Titan (on HBase) as a candidate for a large, distributed graph database. We require both OLTP access (fast, multi-hop queries over the graph) and OLAP access (loading all - or at least a large portion - of the graph into Spark for analytics). From what I understand, I can use the Gremlin server to handle OLTP-style queries where my result-set will be small. Since my queries will be generated by a UI I can use an API to interface with the Gremlin server. So far, so good. The problem concerns the OLAP use case. Since the data in HBase will be co-located with the Spark executors,

Unable to create a composite index, stuck at INSTALLED

那年仲夏 submitted on 2019-12-03 16:56:00
Question: I'm unable to create an index. My Gremlin code is as follows:
    usernameProperty = mgmt.getPropertyKey('username')
    usernameIndex = mgmt.buildIndex('byUsernameUnique', Vertex.class).addKey(usernameProperty).unique().buildCompositeIndex()
    mgmt.setConsistency(usernameIndex, ConsistencyModifier.LOCK)
    mgmt.commit()
Shortly afterwards I receive two errors:
    18:04:57 ERROR com.thinkaurelius.titan.graphdb.database.management.ManagementLogger - Evicted [1@0a00009d2537-ip-10-0-0-1572] from cache but waiting

Neo4j graph model for a social network

可紊 submitted on 2019-12-03 15:18:02
Question: I've created a graph model for a social network and need some concrete advice on how the design will scale. Pardon the n00bness of these questions, but I'm not finding very many clear examples out there... NOTE: the status updates and activity nodes/relationships are linked lists, with the newest entries constantly being placed at the top of the list. Linked lists allow for news feed generation, but there could be hundreds of records per user - I presume the limit clause isn
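The linked-list layout described in this excerpt (newest entry at the head, feed generated by walking a bounded number of entries) can be illustrated with a minimal, in-memory Python sketch. It is not Neo4j code and says nothing about how Cypher's limit clause behaves; `StatusNode`, `StatusFeed`, `post` and `latest` are hypothetical names chosen for illustration.

```python
from dataclasses import dataclass
from typing import Iterator, Optional


@dataclass
class StatusNode:
    """One status update; `next` points to the next-older update."""
    text: str
    next: Optional["StatusNode"] = None


class StatusFeed:
    """Hypothetical per-user linked list with the newest update at the head."""

    def __init__(self) -> None:
        self.head: Optional[StatusNode] = None

    def post(self, text: str) -> None:
        # O(1): the new node becomes the head and links to the previous head.
        self.head = StatusNode(text, self.head)

    def latest(self, limit: int) -> Iterator[str]:
        # Walk from the head and stop after `limit` nodes, so older
        # entries further down the list are never visited.
        node = self.head
        while node is not None and limit > 0:
            yield node.text
            node = node.next
            limit -= 1


feed = StatusFeed()
for i in range(1, 6):
    feed.post(f"update {i}")

newest_three = list(feed.latest(3))
print(newest_three)
```

Because traversal starts at the head, reading the newest few entries costs the same whether the user has five updates or several hundred.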

Gremlin - Giraph - GraphX ? On TitanDb

让人想犯罪 __ submitted on 2019-12-03 14:40:43
I need some help confirming my choice, and any information you can give me. My storage database is TitanDb with Cassandra. I have a very large graph. My goal is to use MLlib on the graph later. My first idea was to use Titan with GraphX, but I did not find anything available or in development... TinkerPop is not ready yet. So I had a look at Giraph. Titan can communicate with Rexster from TinkerPop. My question is: what is the benefit of using Giraph? Gremlin seems to do the same thing and is distributed. Thank you very much for explaining. I think I don't really

How to delete graph in Titan with Cassandra storage backend?

纵然是瞬间 submitted on 2019-12-03 12:27:41
I use Titan 0.4.0 All, running Rexster in shared VM mode on Ubuntu 12.04. How can I properly delete a graph in Titan that uses the Cassandra storage backend? I have tried TitanCleanup.clear(graph), but it does not delete everything. The indices are still there. My real issue is that I have an index which I don't want (it crashes every query); however, as I understand Titan's documentation, it is impossible to remove an index once it is created. You can clear all the edges/vertices with: g.V.remove() but, as you have found, that won't clear the types/indices previously created. The

Titan vertex centric indices vs Neo4j labels

試著忘記壹切 submitted on 2019-12-03 07:08:05
Question: I was trying to compare these two technologies and was wondering whether any of you already have experience with either or both of them. I am mainly interested in performance numbers for similar use cases. Answer 1: Agreeing with everything Marko said, one could take it further and argue that in the graph database world local indexes can (and even should) substitute global ones. In my opinion, the single greatest advantage of a graph