excel - Spark: Skipping the top rows with spark-excel

Question

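I have an Excel file with corrupted rows at the top (the first 3 rows) that need to be skipped. I am using the spark-excel library to read the file, and its GitHub page does not mention such a feature. Is there a way to achieve this?

Here is my code: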

Dataset<Row> ds = session.read().format("com.crealytics.spark.excel")
                                .option("location", filePath)
                                .option("sheetName", "Feuil1")
                                .option("useHeader", "true")
                                .option("delimiter", "|")
                                .option("treatEmptyValuesAsNulls", "true")
                                .option("inferSchema", "true")
                                .option("addColorColumns", "false")
                                .load(filePath);

score 1 · Accepted Answer

I looked at the source code, and there is no such option:

https://github.com/crealytics/spark-excel/blob/master/src/main/scala/com/crealytics/spark/excel/DefaultSource.scala

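You should fix your Excel file and delete the first 3 rows. Otherwise, you would need to build a patched version of the library that lets you skip them, which would be far more effort than having a correct Excel sheet.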
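
Whichever route is taken, the transformation itself is small: drop the first n rows, then treat the next row as the header. Below is a minimal plain-Java sketch of that logic, independent of Spark and spark-excel. The class name `SkipTopRows` and both method names are hypothetical, and the rows are assumed to be already loaded in memory as lists of strings.

```java
import java.util.ArrayList;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

// Hypothetical helper, not part of spark-excel: illustrates "skip n rows, then read the header".
public class SkipTopRows {

    /** Returns a copy of rows without the first n entries (empty if n >= rows.size()). */
    public static List<List<String>> dropTopRows(List<List<String>> rows, int n) {
        if (n < 0) {
            throw new IllegalArgumentException("n must be non-negative");
        }
        int from = Math.min(n, rows.size());
        return new ArrayList<>(rows.subList(from, rows.size()));
    }

    /** Treats the first remaining row as the header and maps each later row to column name -> value. */
    public static List<Map<String, String>> toRecords(List<List<String>> rows) {
        List<Map<String, String>> records = new ArrayList<>();
        if (rows.isEmpty()) {
            return records;
        }
        List<String> header = rows.get(0);
        for (List<String> row : rows.subList(1, rows.size())) {
            Map<String, String> record = new LinkedHashMap<>();
            for (int i = 0; i < header.size(); i++) {
                // Rows shorter than the header get null for the missing columns.
                record.put(header.get(i), i < row.size() ? row.get(i) : null);
            }
            records.add(record);
        }
        return records;
    }

    public static void main(String[] args) {
        // Made-up sample sheet: three junk rows, a header row, two data rows.
        List<List<String>> sheet = List.of(
            List.of("corrupted"),
            List.of("###"),
            List.of(""),
            List.of("id", "name"),
            List.of("1", "alice"),
            List.of("2", "bob"));
        System.out.println(toRecords(dropTopRows(sheet, 3)));
    }
}
```

The same two steps apply if the cleanup is done in a preprocessing script before the file reaches Spark. A patched reader would need to apply the skip before header detection, since the header is only found once the junk rows are gone.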

score 0 · Accepted Answer

This issue has been fixed in spark-excel 0.9.16; see the linked issue on GitHub.

answered 2018-07-27T13:37:16.140
