java - Java读取具有多个标题的csv文件

Question

我正在使用apache commons csv从 CSV 文件中读取内容，我从谷歌趋势中下载为相关查询部分右下角的 csv。文件的一小部分：

Category: All categories
"bluetooth speakers: (1/1/04 - 8/15/16, Worldwide)"

TOP
speaker,100
bluetooth speaker,100

RISING
portable speakers bluetooth,Breakout
portable speakers,Breakout

我从文件中读取的代码：

private void readCsv(String inputFilePath) {
    try {
        Reader in = new FileReader(inputFilePath);
        Iterable<CSVRecord> records = CSVFormat.RFC4180.withFirstRecordAsHeader().parse(in);
        for (CSVRecord record : records) {
            String topic = record.get(0);
            if (topic != null && !topic.isEmpty()) {
                System.out.println(topic);
            }
        }
    } catch (FileNotFoundException e) {
        e.printStackTrace();
    } catch (IOException e) {
        e.printStackTrace();
    }

}

输出：

bluetooth speakers: (1/1/04 - 8/15/16, Worldwide)
TOP
speaker
bluetooth speaker
RISING
portable speakers bluetooth
portable speakers

期望的输出：

speaker
bluetooth speaker
portable speakers bluetooth
portable speakers

根据来自谷歌的数据（没有标题）和两个标题TOP和RISING我无法提取所需的值。是否有任何过滤配置我可以应用以获得所需的值？

score 0 · Accepted Answer

虽然严格来说不是一个好的解决方案，但对于我而言，忽略具有单个元素的记录会消除标题。我仍在寻找/研究解决方案，例如配置或扩展某些类以获得更清洁的解决方案。

private void readCsv(String inputFilePath) {
    try {
        Reader in = new FileReader(inputFilePath);
//            Iterable<CSVRecord> records = CSVFormat.RFC4180.withFirstRecordAsHeader().parse(in);
        Iterable<CSVRecord> records = CSVFormat.RFC4180.parse(in);
        for (CSVRecord record : records) {
            if (record.size() <= 1){
                continue;
            }
            String topic = record.get(0);
            if (topic != null && !topic.isEmpty()) {
                System.out.println(topic);
            }
        }
    } catch (FileNotFoundException e) {
        e.printStackTrace();
    } catch (IOException e) {
        e.printStackTrace();
    }
}

这不是一个好的解决方案的原因是因为可能存在许多其他 csv 文件，而该解决方案可能会出现错误。仍然可能对某人有用。

java - Java读取具有多个标题的csv文件

1 回答 1

Related

Reference