通过 API,我得到了一个 XML 文件,我试图通过 org.w3c.dom 和 XPath 对其进行解析。XML 文件的一部分描述了 HTML 内容:
<Para>Since 2001, state and local health departments in the US have accelerated efforts to prepare for bioterrorism and other high-impact public health emergencies. These activities have been spurred by federal funding and guidance from the US Centers for Disease Control and Prevention (CDC) and the Health Resources and Services Administration (HRSA)
<CitationRef CitationID="B1">1</CitationRef>
<CitationRef CitationID="B2">2</CitationRef> . Over time, the emphasis of this guidance has expanded from bioterrorism to include "terrorism and non-terrorism events, including infectious disease, environmental and occupational related emergencies"
<CitationRef CitationID="B4">4</CitationRef> as well as pandemic influenza.
</Para>
这应该变成这样:
<p>Since 2001, state and local health departments in the US have accelerated efforts to prepare for bioterrorism and other high-impact public health emergencies. These activities have been spurred by federal funding and guidance from the US Centers for Disease Control and Prevention (CDC) and the Health Resources and Services Administration (HRSA)
<a href="link/B1">1</a>
<a href="link/B2">3</a> . Over time, the emphasis of this guidance has expanded from bioterrorism to include "terrorism and non-terrorism events, including infectious disease, environmental and occupational related emergencies"
<a href="link/B4">4</a> as well as pandemic influenza.
</p>
关于我如何做到这一点的任何建议?主要问题是检索标签并在保留其位置的同时替换它们。