python - 使用xml.etree.ElementTree在python中关闭一个标签后如何在嵌套xml中提取文本

Question

我想提取 xml 文档中的所有文本，但在以下情况下遇到问题：

...
<a>
hello
<B>
there
</B>
How was your day.

.....
</a>

在此代码段中，我可以获取文本“hello”和“there”，因为我可以使用以下标签获取它们：

a.text
b.text

但我不知道如何访问“你过得怎么样”。部分。

score 1 · Accepted Answer

您正在寻找元素的.tail属性：

>>> from xml.etree import ElementTree
>>> example = ElementTree.fromstring('''\
... <a>
... hello
... <B>
... there
... </B>
... How was your day.
... </a>
... '''
... )
>>> example
<Element 'a' at 0x10715d150>
>>> example.text
'\nhello\n'
>>> example.find('B')
<Element 'B' at 0x10715d7d0>
>>> example.find('B').text
'\nthere\n'
>>> example.find('B').tail
'\nHow was your day.\n'

python - 使用xml.etree.ElementTree在python中关闭一个标签后如何在嵌套xml中提取文本

1 回答 1

Related

Reference