While trying to parse an HTML document as XML (by adding an XML declaration at the top), I ran into a problem with an attribute inside a tag.
<tr>
<td class="yfnc_tabledata1" nowrap align="right">Jun 4, 2013</td>
<td class="yfnc_tabledata1" align="right">453.22</td>
<td class="yfnc_tabledata1" align="right">454.43</td>
<td class="yfnc_tabledata1" align="right">447.39</td>
<td class="yfnc_tabledata1" align="right">449.31</td>
<td class="yfnc_tabledata1" align="right">10,454,600</td>
<td class="yfnc_tabledata1" align="right">449.31</td>
</tr>
Normally this wouldn't matter (my XSLT code never actually references that attribute), but I get an error:
ERROR: 'Attribute name "nowrap" associated with an element type "td" must be followed by the ' = ' character.'
ERROR: 'com.sun.org.apache.xml.internal.utils.WrappedRuntimeException: Attribute name "nowrap" associated with an element type "td" must be followed by the ' = ' character.'
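So I'd like to know whether there is a way to suppress or ignore these errors. (I'm looking for an approach that doesn't involve a separate parsing pass to strip out every nowrap first.)
(For reference, XML: http://pastebin.com/TLD4bZkq, XSLT: http://pastebin.com/dPzDzeAX)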
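For anyone who wants to reproduce this without the full files, here is a minimal sketch. It assumes the JDK's built-in JAXP parser (which the stack trace above suggests); the class name `NowrapRepro` and the helper `parseError` are hypothetical names used only for illustration. It feeds a cut-down row to a `DocumentBuilder` and returns the parser's complaint, if any.

```java
import java.io.StringReader;
import javax.xml.parsers.DocumentBuilder;
import javax.xml.parsers.DocumentBuilderFactory;
import org.xml.sax.InputSource;
import org.xml.sax.SAXParseException;
import org.xml.sax.helpers.DefaultHandler;

public class NowrapRepro {
    // Hypothetical helper: returns the parser's message when the input is
    // not well-formed XML, or null when it parses cleanly.
    public static String parseError(String xml) throws Exception {
        DocumentBuilder builder =
                DocumentBuilderFactory.newInstance().newDocumentBuilder();
        // DefaultHandler rethrows fatal errors and keeps the parser from
        // printing its own diagnostics to stderr.
        builder.setErrorHandler(new DefaultHandler());
        try {
            builder.parse(new InputSource(new StringReader(xml)));
            return null;
        } catch (SAXParseException e) {
            return e.getMessage();
        }
    }

    public static void main(String[] args) throws Exception {
        // Same shape as the first cell of the row in the question.
        String bare = "<tr><td class=\"yfnc_tabledata1\" nowrap align=\"right\">"
                + "Jun 4, 2013</td></tr>";
        // Same cell, but with a value on the attribute.
        String valued = "<tr><td class=\"yfnc_tabledata1\" nowrap=\"nowrap\" "
                + "align=\"right\">Jun 4, 2013</td></tr>";

        System.out.println("bare attribute:   " + parseError(bare));
        System.out.println("valued attribute: " + parseError(valued));
    }
}
```

The bare `nowrap` is rejected because XML requires every attribute to have a value, which is why the failure happens during parsing, before the stylesheet is ever applied.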