考虑一下我的图表中将包含以下内容:
100 Million nodes, More than 1 Billion connections/relationships
Node properties: around 10 properties, mix of int, doubles, strings, HashMaps etc.
Relationship properties: around 10 double values and 2-3 string (with avg 50 chars) values
现在,假设我想通过在每个节点上查询一次邻居来更新所有节点和关系属性值。即说,
step1: search a node, say X, with given Id,
step2: get it's neighbours,
step3: update node properties of X and all relationship properties between X and it's neighbors.
对所有节点重复这 3 个步骤一次。给定以下系统配置,一次更新所有节点需要多少时间(大约时间对我来说是可以的,可能以秒/分钟/小时为单位):
Two dual core processors, 3.0 GHz each, 4*4 GB memory, 250 GB Hard disk space.
上述数据大约需要多少存储空间?
请通过提供任何近似的样本性能(时间和存储)分析来帮助我。任何示例性能分析都将帮助我可视化我的需求。谢谢。