Tree structures in a nosql database

前端未结

关注

 2  432

故里飘歌

I\'m developing an application for Google App Engine which uses BigTable for its datastore.

It\'s an application about writing a story collaboratively. It\'s a very

相关标签:

2条回答

深忆病人

2020-12-23 23:54
There are a number of well known ways to represent trees in databases; each of them have their pros and cons. Here are the most common:
- Adjacency list, where each node stores the ID of its parent.
- Materialized path, which is the strategy Keyur describes. This is also the approach used by entity groups (eg, parent entities) in App Engine. It's also more or less what you're describing in your update.
- Nested sets, where each node has 'left' and 'right' IDs, such that all child nodes are contained in that range.
- Adjacency lists agumented with a root ID.
Each of these has its own advantages and disadvantages. Adjacency lists are simple, and cheap to update, but require multiple queries to retrieve a subtree (one for each parent node). Augmented adjacency lists make it possible to retrieve an entire tree by storing the ID of the root node in every record.

Materialized paths are easy to implement and cheap to update, and permit querying arbitrary subtrees, but impose increasing overhead for deep trees.

Nested sets are tougher to implement, and require updating, on average, half the nodes each time you make an insertion. They allow you to query arbitrary subtrees, without the increasing key length issue materialized path has.

In your specific case, though, it seems like you don't actually need a tree structure at all: each story, branched off an original though it may be, stands alone. What I would suggest is having a 'Story' model, which contains a list of keys of its paragraphs (Eg, in Python a db.ListProperty(db.Key)). To render a story, you fetch the Story, then do a batch fetch for all the Paragraphs. To branch a story, simply duplicate the story entry - leaving the references to Paragraphs unchanged.
0 讨论(0)
发布评论:

提交评论
- 加载中...
伪装坚强ぢ

2020-12-24 00:01

One solution I can think about is - the path to the node is also the key of that node. So the key of node 11 is "2/7/6/11". You can traverse the path by a simple key lookup of all keys in the path - "2/7/6/11", "2/7/6", "2/7", "2"

0 讨论(0)
发布评论:

提交评论
- 加载中...