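I'm using Twint to create a .csv file containing ten results. However, whenever I try to load it into a pandas DataFrame, I get an error. Can someone help me understand what is going on?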
Traceback (most recent call last):
File "k:\Documents\Visual Studio Code\Twitter Project\exploratory stage.py", line 4, in <module>
scrapedData = pd.read_csv('demo.csv')
File "K:\Programs\Python\lib\site-packages\pandas\util\_decorators.py", line 311, in wrapper
return func(*args, **kwargs)
File "K:\Programs\Python\lib\site-packages\pandas\io\parsers\readers.py", line 586, in read_csv
return _read(filepath_or_buffer, kwds)
File "K:\Programs\Python\lib\site-packages\pandas\io\parsers\readers.py", line 488, in _read
return parser.read(nrows)
File "K:\Programs\Python\lib\site-packages\pandas\io\parsers\readers.py", line 1047, in read
index, columns, col_dict = self._engine.read(nrows)
File "K:\Programs\Python\lib\site-packages\pandas\io\parsers\c_parser_wrapper.py", line 223, in read
chunks = self._reader.read_low_memory(nrows)
File "pandas\_libs\parsers.pyx", line 801, in pandas._libs.parsers.TextReader.read_low_memory
File "pandas\_libs\parsers.pyx", line 857, in pandas._libs.parsers.TextReader._read_rows
File "pandas\_libs\parsers.pyx", line 843, in pandas._libs.parsers.TextReader._tokenize_rows
File "pandas\_libs\parsers.pyx", line 1925, in pandas._libs.parsers.raise_parser_error
pandas.errors.ParserError: Error tokenizing data. C error: Expected 1 fields in line 3, saw 3
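As far as I can tell, the message means the parser found only one comma-separated field on the earlier lines (so it expected one column) and then found three fields on line 3. A minimal sketch of how the field count per line could be checked, using only the standard `csv` module. The `sample` text and the `count_fields` helper are hypothetical stand-ins, not the real contents of demo.csv:

```python
import csv
import io


def count_fields(lines):
    """Return the number of comma-separated fields found on each line."""
    return [len(row) for row in csv.reader(lines)]


# Hypothetical sample text, NOT the actual demo.csv: the first two lines
# contain no commas, while the third contains two commas inside the tweet.
sample = (
    "username date tweet\n"
    "alice 2021-01-01 hello\n"
    "bob 2021-01-02 one, two, three\n"
)

field_counts = count_fields(io.StringIO(sample))

# Report every line whose field count differs from the first line's count.
mismatched_lines = [
    line_number
    for line_number, count in enumerate(field_counts, start=1)
    if count != field_counts[0]
]

print(field_counts)
print(mismatched_lines)
```

For the real file, the same helper could be called on `open('demo.csv', newline='', encoding='utf-8')`; the encoding is an assumption.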
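-Edit-
I looked at my CSV file and found that the data is formatted strangely. An entire row of information, including the username, date/time, and tweet, is crammed into a single cell.
For some other rows, the tweet breaks off and continues in the cell next to it. It looks like this.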