Questions tagged [apache-crunch]


0 votes
0 answers
19 views

python - Scalability of the Python lifetimes package beyond 10-20 million transaction rows

I need to build a Java/Python pipeline that calls the `lifetimes` Python package to compute customer lifetime value on a daily basis.

We currently have roughly 10-20 million rows of transaction data. How well does the `lifetimes` package scale when processing this much data?

If it does not scale well enough, how can it be integrated with Apache Crunch to parallelize the computation? Thanks!
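
To make the parallelization question concrete, here is a minimal sketch of the step that seems most likely to benefit from it: collapsing the raw transaction log into one row per customer before any model is fitted. It uses only the standard library and does not call `lifetimes` or Apache Crunch. The function names (`rfm_summary`, `partition_by_customer`, `rfm_summary_partitioned`) are hypothetical. The output fields `frequency`, `recency` and `T` are assumed to mirror the per-customer summary that `lifetimes` models are fitted on, at day granularity. Because each customer's rows are independent, the log can be split by customer ID and each partition summarized separately, which is the same shape as a group-by-key job in a framework such as Crunch.

```python
from collections import defaultdict
from datetime import date


def rfm_summary(transactions, observation_end):
    """Collapse (customer_id, purchase_date) rows into one record per customer.

    Assumed definitions (day granularity):
      frequency = number of distinct purchase days minus one (repeat purchases)
      recency   = days between the first and the last purchase
      T         = days between the first purchase and observation_end
    """
    days_by_customer = defaultdict(set)
    for customer_id, purchase_date in transactions:
        if purchase_date <= observation_end:
            days_by_customer[customer_id].add(purchase_date)

    summary = {}
    for customer_id, days in days_by_customer.items():
        first, last = min(days), max(days)
        summary[customer_id] = {
            "frequency": len(days) - 1,
            "recency": (last - first).days,
            "T": (observation_end - first).days,
        }
    return summary


def partition_by_customer(transactions, n_partitions):
    """Split rows so that all rows of a given customer land in the same partition."""
    parts = [[] for _ in range(n_partitions)]
    for row in transactions:
        parts[hash(row[0]) % n_partitions].append(row)
    return parts


def rfm_summary_partitioned(transactions, observation_end, n_partitions=4):
    """Summarize each partition independently, then merge the results.

    The loop runs sequentially here; since partitions share no customers,
    each iteration could be handed to a separate worker or cluster task.
    """
    merged = {}
    for part in partition_by_customer(transactions, n_partitions):
        merged.update(rfm_summary(part, observation_end))
    return merged
```

With this reduction, the data that reaches the model fitting step has one row per customer rather than one row per transaction, so the relevant size for the fitting stage is the number of distinct customers. Whether that alone is enough at 10-20 million transactions would need to be measured on the real data.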