Questions tagged [apache-spark-1.4]

0 votes
1 answer
112 views

python - How to get a list inside an aggregate function in pyspark 1.4

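I want to collect the values of a column into a list inside an aggregate function in pyspark 1.4. collect_list is not available. Does anyone have a suggestion for how to do this?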

Original columns:

I want output like the following, grouped by (ID, date, hour):

But my pyspark is version 1.4.0, where collect_list is not available, so I cannot do this: df.groupBy("ID","date","hour").agg(collect_list("cell"))
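
One possible workaround, sketched here under assumptions rather than taken from the original post, is to drop down to the RDD API and group there. The sketch assumes the DataFrame has exactly the columns ID, date, hour and cell named in the question. The helper names (to_pair, to_flat, collect_cells) and the output column name "cells" are made up for illustration, and sqlContext is assumed to be an existing SQLContext.

```python
KEY_COLS = ("ID", "date", "hour")  # grouping columns named in the question


def to_pair(row):
    """Turn a row into ((ID, date, hour), cell).

    getattr is used so the same helper works on pyspark Row objects
    and on plain namedtuples.
    """
    return (tuple(getattr(row, c) for c in KEY_COLS), row.cell)


def to_flat(pair):
    """Flatten ((ID, date, hour), cells) into (ID, date, hour, [cells])."""
    key, cells = pair
    return key + (list(cells),)


def collect_cells(df, sqlContext):
    """Group 'cell' values into a list per (ID, date, hour).

    Assumption: df is a pyspark DataFrame with columns ID, date, hour, cell.
    'cells' is a hypothetical name for the new list column.
    """
    grouped = df.rdd.map(to_pair).groupByKey().map(to_flat)
    return sqlContext.createDataFrame(grouped, list(KEY_COLS) + ["cells"])
```

Calling collect_cells(df, sqlContext) should give one row per (ID, date, hour) with the cell values gathered in a list. The order of values inside each list is not guaranteed, and groupByKey shuffles every value for a key to one place, so very large groups can be expensive.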