我正在尝试使用 pandas 库中的 .read_html() 函数,并在我在 shell 中运行代码时不断收到此错误。我看到您需要安装 lxml,所以我使用 apt-get 进行了安装。但后来当我再次尝试运行它时,我得到了同样的错误。
(trusty)mdz5032@localhost:~$ sudo apt-get -y install python-lxml
[sudo] password for mdz5032:
Reading package lists... Done
.
.
.
python-lxml is already the newest version.
0 upgraded, 0 newly installed, 0 to remove and 1 not upgraded.
这是我正在使用的代码
import pandas as pd
import pandas_datareader.data as web
import quandl
df = quandl.get("FMAC/HPI_PA", authtoken="")
fiddy_states = pd.read_html('https://simple.wikipedia.org/wiki/List_of_U.S._states')
我取出了 api 密钥,但如果需要,可以发布它。
这是完整的追溯
Traceback (most recent call last):
File "/home/mdz5032/pandasPractice.py", line 9, in <module>
fiddy_states = pd.read_html('https://simple.wikipedia.org/wiki/List_of_U.S._states')
File "/usr/local/lib/python3.4/dist-packages/pandas/io/html.py", line 874, in read_html
parse_dates, tupleize_cols, thousands, attrs, encoding)
File "/usr/local/lib/python3.4/dist-packages/pandas/io/html.py", line 726, in _parse
parser = _parser_dispatch(flav)
File "/usr/local/lib/python3.4/dist-packages/pandas/io/html.py", line 685, in _parser_dispatch
raise ImportError("lxml not found, please install it")
ImportError: lxml not found, please install it