cassandra - 超级列族中的嵌套可以有多深？

Question

我试图了解 Cassandra 的 DB 模型，但我运气不佳。几乎所有的文档都在解释 Twissandra，一个 twitter 克隆，但它实际上是一个相当简单的案例，并没有真正帮助学习如何有效地使用 Cassandra；或者它是非常基本的低级别的东西，并没有真正显示您在现实世界情况下通常对列/超级列所做的事情。

所以我认为论坛将是一个足够复杂的应用程序来学习。假设一个网络论坛有点像 phpBB；你可以有多个论坛，每个论坛显示多个主题，每个主题都有多个帖子。

我首先想到有单独的帖子、主题和论坛列，但这似乎与我在 RDBMS 上实现它的方式没有什么不同。

所以现在我想知道超级列族中的嵌套有多深。类似下面的伪模型是一种合适的建模方式吗？

Forums = {
    forum001: {
        name: "General News",
        topics: {
            topic000001: {
                subject: "This is what I think",
                date: "2012-08-24 10:12:13",
                posts: {
                    post20120824.101213: { username: "tom", content: "Blah blah", datetime: "2012-08-24 10:12:13" }
                    post20120824.101513: { username: "dick", content: "Blah blah blah", datetime: "2012-08-24 10:15:13" }
                    post20120824.103213: { username: "harry", content: "Blah blah", datetime: "2012-08-24 10:32:13" }
                }
            },
            topic000002: {
                subject: "OMG Look at this",
                date: "2012-08-24 10:42:13",
                posts: {
                    post20120824.104213: { username: "tom", content: "Blah blah", datetime: "2012-08-24 10:42:13" }
                    post20120824.104523: { username: "dick", content: "Blah blah blah", datetime: "2012-08-24 10:45:23" }
                    post20120824.104821: { username: "harry", content: "Blah blah", datetime: "2012-08-24 10:48:21" }
                }
            }
        }
    },
    forum002: {
        name: "Specific News",
        topics: {
            topic000003: {
                subject: "Whinge whine",
                date: "2012-08-24 10:12:13",
                posts: {
                    post20120824.101213: { username: "tom", content: "Blah blah", datetime: "2012-08-24 10:12:13" }
                    post20120824.101513: { username: "dick", content: "Blah blah blah", datetime: "2012-08-24 10:15:13" }
                }
            }
        }
    }
}

score 2 · Accepted Answer

除非您正在编写不打算直接查询的 JSON 文档，否则您无法拥有该级别的嵌套。
您不应该使用超列，因为它们已被弃用。
克里斯对您的问题的评论是绝对正确的：首先要问自己打算如何查询数据，然后像您打算阅读的那样编写。有时这意味着编写不止一种方式（即规范化不是目标）。
您可以通过使用复合键和/或列来获得额外的“嵌套”。
由于您只能通过键和列范围进行查询，或者使用二级索引，因此您通常需要为不适合二级索引的列（即具有高基数）编写自己的索引。

我知道这些规则不会给你一个模型，但你的模型完全依赖于查询。这是一种与 RDBMS 不同的看待世界的方式。如果您真的需要临时查询支持（许多用例确实不需要），Cassandra 不是正确的选择。

cassandra - 超级列族中的嵌套可以有多深？

1 回答 1

Related

Reference