我正在尝试使用 lxml 来做到这一点,但最终这是一个关于正确 xpath 的问题。我想从<pgBreak>
元素中选择直到其父元素结束,在这种情况下<p
>
XML 输入:
<root>
<pgBreak pgId="1"/>
<p>
some text to fill out a para
<pgBreak pgId="2"/>
some more text
<quote> A quoted block </quote>
remainder of para
</p>
</root>
XML 输出:
<root>
<pgBreak pgId="1"/>
<p>
some text to fill out a para
</p>
<pgBreak pgId="2"/>
<p>
some more text
<quote> A quoted block </quote>
remainder of para
</p>
</root>