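I'm new to Python and pandas. I'm trying to read a CSV file with pandas, but I get a CSV parsing error. Below is the top of my CSV file; see the screenshot (the image can't be displayed inline, so please follow the link).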
https://i.stack.imgur.com/HIBoj.jpg
--------------------------------------------------------------------------------------------------------------------------------------------------,,,,,,,
Data Records,,,,,,,
--------------------------------------------------------------------------------------------------------------------------------------------------,,,,,,,
ABC : - xxxxxxxxxxx,,,,,,,
Type :- xxxxxxxxxxx,,,,,,,
Date :- xxxxxxxxxx,,,,,,,
Till Date :- xxxxxxxxxx,,,,,,,
Report Index :- xxxxxxxxxx,,,,,,,
Report Date :- 01-Jul-2017 11:18:41 AM,,,,,,,
--------------------------------------------------------------------------------------------------------------------------------------------------,,,,,,,
A PARTY, B PARTY, DATE, TIME, DURATION, ID, ID_A, TYPE
--------------------------------------------------------------------------------------------------------------------------------------------------,,,,,,,
XXXXXXXX,XXXXXXXX, 26-JAN-2017, 11:51:54,1,123456788889999, -, ZXC
XXXXXXXX,XXXXXXXX, 26-JAN-2017, 11:52:06,1,123456788889999, -, QWE
XXXXXXXX,XXXXXXXX, 26-JAN-2017, 11:52:11,1,123456788889999, -, RRR
XXXXXXXX,XXXXXXXX, 26-JAN-2017, 11:52:12,1,123456788889999, -, BGF
XXXXXXXX,XXXXXXXX, 26-JAN-2017, 11:52:25,1,123456788889999, -, OOO
XXXXXXXX,XXXXXXXX, 26-JAN-2017, 11:53:23,1,123456788889999, -, BGF
XXXXXXXX,XXXXXXXX, 26-JAN-2017, 11:54:00,1,123456788889999, -, NBG
XXXXXXXX,XXXXXXXX, 26-JAN-2017, 11:54:38,1,123456788889999, -, BGFD
XXXXXXXX,XXXXXXXX, 26-JAN-2017, 11:54:39,1,123456788889999, -, OIU
XXXXXXXX,XXXXXXXX, 26-JAN-2017, 12:03:14,1,123456788889999, -, BGF
XXXXXXXX,XXXXXXXX, 26-JAN-2017, 12:07:43,1,123456788889999, -, GGG
XXXXXXXX,XXXXXXXX, 26-JAN-2017, 12:11:53,1,123456788889555, -, VVVV
XXXXXXXX,XXXXXXXX, 26-JAN-2017, 12:13:12,1,123456788889555, -, VVVV
XXXXXXXX,XXXXXXXX, 26-JAN-2017, 12:13:12,1,123456788889555, -, VVVV
XXXXXXXX,XXXXXXXX, 26-JAN-2017, 12:13:44,1,123456788889555, -, VVVV
XXXXXXXX,XXXXXXXX, 26-JAN-2017, 12:13:44,1,123456788889555, -, VVVV
,,,,,,,
,,,,,,,
,,,,,,,
Note :- This is a System generated Report.,,,,,,,
The text shared above has been edited; my original file has more than 1000 lines.
The error is:
Traceback (most recent call last):
  File "<stdin>", line 1, in <module>
  File "/home/xxxxx/.local/lib/python2.7/site-packages/pandas/io/parsers.py", line 678, in parser_f
    return _read(filepath_or_buffer, kwds)
  File "/home/xxxxx/.local/lib/python2.7/site-packages/pandas/io/parsers.py", line 446, in _read
    data = parser.read(nrows)
  File "/home/xxxxx/.local/lib/python2.7/site-packages/pandas/io/parsers.py", line 1036, in read
    ret = self._engine.read(nrows)
  File "/home/xxxxx/.local/lib/python2.7/site-packages/pandas/io/parsers.py", line 1848, in read
    data = self._reader.read(nrows)
  File "pandas/_libs/parsers.pyx", line 876, in pandas._libs.parsers.TextReader.read
  File "pandas/_libs/parsers.pyx", line 891, in pandas._libs.parsers.TextReader._read_low_memory
  File "pandas/_libs/parsers.pyx", line 945, in pandas._libs.parsers.TextReader._read_rows
  File "pandas/_libs/parsers.pyx", line 932, in pandas._libs.parsers.TextReader._tokenize_rows
  File "pandas/_libs/parsers.pyx", line 2112, in pandas._libs.parsers.raise_parser_error
pandas.errors.ParserError: Error tokenizing data. C error: Expected 1 fields in line 11, saw 13
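When I skip line 12, it works perfectly. So before reading the file, I would like to search for the "---" pattern and get the matching row numbers, so that I can skip those rows when reading with the "read_csv" function.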
import pandas as pd
ff = pd.read_csv("test.csv")
ff
This is my code.
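For illustration, the idea described above (find the "---" lines first, then skip them in "read_csv") could be sketched roughly as follows. This is only a minimal sketch, not a confirmed fix for the real file: the helper names `separator_rows` and `read_report` are hypothetical, and it assumes every separator line starts with "---" and that the file is small enough to scan twice. `skiprows` (a list of 0-based line numbers) and `skipinitialspace` are existing `read_csv` parameters.

```python
import io


def separator_rows(handle, marker="---"):
    """Return the 0-based indices of lines that start with `marker`."""
    return [i for i, line in enumerate(handle)
            if line.lstrip().startswith(marker)]


def read_report(path, marker="---"):
    """Hypothetical wrapper: skip every separator line, then parse the rest."""
    # Imported here so the scanning helper above has no pandas dependency.
    import pandas as pd

    # First pass: collect the line numbers of the dashed separator lines.
    with open(path) as handle:
        skip = separator_rows(handle, marker)

    # Second pass: read_csv accepts a list of 0-based line numbers to skip;
    # skipinitialspace drops the blank that follows each comma in the data.
    return pd.read_csv(path, skiprows=skip, skipinitialspace=True)


# Tiny in-memory demo of the scanning step (content made up for illustration).
demo = io.StringIO(u"-----,,\nData Records,,\n-----,,\nA PARTY, B PARTY\n")
print(separator_rows(demo))  # prints [0, 2]
```

Something like `ff = read_report("test.csv")` would then replace the plain `pd.read_csv("test.csv")` call. Note that this only removes the dashed lines: the preamble lines such as "Data Records" remain, and the first of them would likely be taken as the header, so the skip list may need to be extended to cover everything above the "A PARTY, B PARTY, ..." line.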
Thanks in advance.