I have a Python stream-sourced DataFrame, df, that contains all of the data I want to put into a Cassandra table using the spark-cassandra-connector. I have tried two approaches:
df.write \
    .format("org.apache.spark.sql.cassandra") \
    .mode('append') \
    .options(table="myTable", keyspace="myKeySpace") \
    .save()

query = df.writeStream \
    .format("org.apache.spark.sql.cassandra") \
    .outputMode('append') \
    .options(table="myTable", keyspace="myKeySpace") \
    .start()
query.awaitTermination()
However, I keep getting these errors, the first from the batch write and the second from the streaming write:
pyspark.sql.utils.AnalysisException: "'write' can not be called on streaming Dataset/DataFrame;
and
java.lang.UnsupportedOperationException: Data source org.apache.spark.sql.cassandra does not support streamed writing.
Is there any way I can send my streamed DataFrame into my Cassandra table?
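
For reference, one workaround that might apply here is to keep the streaming query but hand each micro-batch to the ordinary batch writer through foreachBatch. This is only a sketch and rests on assumptions not stated above: Spark 2.4 or later (where DataStreamWriter.foreachBatch exists) and the spark-cassandra-connector already on the classpath. The helper names write_batch_to_cassandra and start_cassandra_stream are hypothetical; the table and keyspace are the ones from the snippets above.

```python
# Sketch only: assumes Spark 2.4+ (foreachBatch) and the
# spark-cassandra-connector available to the Spark session.

def write_batch_to_cassandra(batch_df, batch_id):
    """Write a single micro-batch using the regular (non-streaming) writer.

    batch_df is a plain DataFrame, so .write is allowed on it.
    batch_id is supplied by Spark and is unused in this sketch.
    """
    (batch_df.write
        .format("org.apache.spark.sql.cassandra")
        .mode("append")
        .options(table="myTable", keyspace="myKeySpace")
        .save())


def start_cassandra_stream(streaming_df):
    """Start a streaming query that passes every micro-batch to the batch writer."""
    return (streaming_df.writeStream
        .outputMode("append")
        .foreachBatch(write_batch_to_cassandra)
        .start())
```

With the streaming DataFrame from above, this would be used as query = start_cassandra_stream(df) followed by query.awaitTermination(). The idea is that neither error should be triggered: write is only called on the per-batch DataFrame, and the Cassandra source is never asked to act as a streaming sink.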