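I'm still very new to Python, and this community has been a great help; I've learned a lot. I'm trying to adapt the existing code below so that it loops over multiple XML files in the same directory. At the moment, the code only reads one specific file. Any help is greatly appreciated!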

import pandas as pd
from bs4 import BeautifulSoup

soup = BeautifulSoup(open("your_file.xml", "r"), "xml")

d = {}
for tag in soup.RECORDING.find_all(recursive=False):
    d[tag.name] = tag.get_text(strip=True)

df = pd.DataFrame([d])
print(df)

2 Answers

You can use the walk function from the os module for this:

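import os
from bs4 import BeautifulSoup

path = 'path/to/directory'
# os.walk also descends into subdirectories of path
for (dirpath, dirnames, filenames) in os.walk(path):
    for filename in filenames:
        if not filename.lower().endswith(".xml"):
            continue  # skip non-XML files
        with open(os.path.join(dirpath, filename), "r") as f:
            soup = BeautifulSoup(f, "xml")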
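The loop above only opens each file; to end up with one row per file, the parsed values still need to be collected. Here is a minimal sketch of that idea. It deliberately swaps BeautifulSoup for the standard library's xml.etree.ElementTree so that it runs without third-party packages. The helper name collect_records, the record_tag parameter, and the extra source_file key are assumptions made for illustration; they do not come from the question.

```python
import os
import tempfile
import xml.etree.ElementTree as ET


def collect_records(root_dir, record_tag="RECORDING"):
    """Return one dict per XML file found under root_dir (searched recursively)."""
    rows = []
    for dirpath, _dirnames, filenames in os.walk(root_dir):
        for filename in sorted(filenames):
            if not filename.lower().endswith(".xml"):
                continue  # skip anything that is not an XML file
            root = ET.parse(os.path.join(dirpath, filename)).getroot()
            # The record may be the root element itself or one of its descendants.
            record = root if root.tag == record_tag else root.find(f".//{record_tag}")
            if record is None:
                continue  # this file has no matching element
            # Direct children only, roughly like find_all(recursive=False).
            row = {child.tag: "".join(child.itertext()).strip() for child in record}
            row["source_file"] = filename  # hypothetical extra column
            rows.append(row)
    return rows


if __name__ == "__main__":
    # Build two small sample files in a temporary directory and parse them.
    with tempfile.TemporaryDirectory() as tmp:
        for name, title in [("a.xml", "First"), ("b.xml", "Second")]:
            with open(os.path.join(tmp, name), "w", encoding="utf-8") as fh:
                fh.write(f"<RECORDING><TITLE>{title}</TITLE></RECORDING>")
        print(collect_records(tmp))
```

The returned list can be passed straight to pd.DataFrame(rows), which gives one row per file, in the same way the question builds a DataFrame from a single dict.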
answered 2021-04-09T21:16:59.163
This should help you...

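import pandas as pd
from bs4 import BeautifulSoup
import os

folder_path = "path/to/directory"  # placeholder: the folder that holds the XML files
files_in_folder = os.listdir(folder_path)

result = []  # one dict per file
for file in files_in_folder:
    if not file.lower().endswith(".xml"):
        continue  # skip non-XML files
    # os.listdir returns bare names, so join them with the folder path
    with open(os.path.join(folder_path, file), "r") as f:
        soup = BeautifulSoup(f, "xml")
    d = {}
    for tag in soup.RECORDING.find_all(recursive=False):
        d[tag.name] = tag.get_text(strip=True)
    result.append(d)  # append once per file, after all tags are read

df = pd.DataFrame(result)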
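
One detail that is easy to miss with this approach: os.listdir returns only the entry names, not full paths, and it lists every entry in the folder, not just XML files. The sketch below shows one way to turn that listing into usable paths. The helper name xml_paths is hypothetical and the sample files are created only for the demonstration.

```python
import os
import tempfile


def xml_paths(folder_path):
    """Return full paths of the .xml files directly inside folder_path."""
    paths = []
    for name in sorted(os.listdir(folder_path)):
        full_path = os.path.join(folder_path, name)  # listdir gives bare names
        if os.path.isfile(full_path) and name.lower().endswith(".xml"):
            paths.append(full_path)
    return paths


if __name__ == "__main__":
    with tempfile.TemporaryDirectory() as tmp:
        # Create three empty sample files; only two of them are XML.
        for name in ("a.xml", "b.xml", "readme.txt"):
            open(os.path.join(tmp, name), "w").close()
        print(os.listdir(tmp))  # bare names, in no guaranteed order
        print(xml_paths(tmp))   # full paths, .xml files only
```

Each returned path can be opened directly, regardless of the current working directory.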
answered 2021-04-09T21:21:50.450