I am parsing an RSS feed in which the <description> tag contains newline (\n) characters, like this:

    <?xml version="1.0" encoding="UTF-8"?>
    <rss version="2.0">
    <channel>
    <item>
          <guid>http://www.horseandhound.co.uk/blogs/7619/314307.html</guid>
          <title>Chloe Chubb&#039;s showing blog: Horse of the Year Show (HOYS) preparations</title>
          <description>So it is a week before HOYS and I have somehow managed to stay a bit oblivious to it all. It is only when I log onto Facebook and see the many, many statuses talking about it that I realise it’s only just around the corner! Then the butterflies begin.
    &lt;br /&gt;
    My pre-HOYS diet was going well until last weekend when me and my best mates headed for a weekend away by the beach. Cue huge amounts of cheese, wine and cake and with the 5 pounds I’d lost, 2 went straight back on.&amp;#8230;</description>
          <url>http://www.horseandhound.co.uk/blogs/7619/314307.html</url>
    </item>
    </channel>
    </rss>

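Here I can parse the text up to the end of the first line ("some text in line1.."), but the remaining text, from the next line onward, is not parsed. I am using a DOM parser. Please suggest how to resolve this.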

1 Answer

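You can use the character reference &#10; to represent a line feed in an XML attribute value.
&#13; represents a carriage return.
A Windows-style CRLF can be written as &#13;&#10;.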
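
As a rough illustration of the point above, the following sketch uses Python's standard `xml.dom.minidom` (an assumption; the question does not say which DOM parser is in use). The element name `note` and the attribute names `text`, `raw`, and `crlf` are made up for the example. It compares a character reference with a literal newline inside an attribute value:

```python
from xml.dom import minidom

# Hypothetical document: "text" uses a character reference, "raw" contains a
# literal newline in the XML source, "crlf" uses the CR+LF pair of references.
DOC = (
    '<note text="line one&#10;line two" '
    'raw="line one\nline two" '
    'crlf="a&#13;&#10;b"/>'
)

root = minidom.parseString(DOC).documentElement

text_value = root.getAttribute("text")  # line feed survives parsing
raw_value = root.getAttribute("raw")    # literal newline is normalized to a space
crlf_value = root.getAttribute("crlf")  # carriage return + line feed survive

print(repr(text_value))
print(repr(raw_value))
print(repr(crlf_value))
```

The literal newline is lost because a conforming XML parser normalizes whitespace in attribute values, whereas characters written as references are passed through unchanged.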

For more details, see the XML specification.
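
Note that the references above matter for attribute values; inside element content such as <description>, literal newlines are kept by the parser. One possible cause of the truncated text (an assumption, since the question does not show the parsing code) is reading only the first child text node of the element. A minimal sketch with Python's `xml.dom.minidom`, using a shortened stand-in feed and a hypothetical helper `element_text`, joins every text child instead:

```python
from xml.dom import Node, minidom

# Shortened stand-in for the feed in the question (not the real content).
FEED = """<rss version="2.0">
<channel>
<item>
  <description>First line of text.
&lt;br /&gt;
Second line of text.</description>
</item>
</channel>
</rss>"""


def element_text(element):
    """Concatenate all text and CDATA children of a DOM element."""
    parts = []
    for child in element.childNodes:
        if child.nodeType in (Node.TEXT_NODE, Node.CDATA_SECTION_NODE):
            parts.append(child.data)
    return "".join(parts)


doc = minidom.parseString(FEED)
description = doc.getElementsByTagName("description")[0]
full_text = element_text(description)
print(full_text)
```

Some DOM implementations split element content into several adjacent text nodes, for example around entity references, so collecting all of them (or calling the implementation's own "text content" accessor, if it has one) avoids stopping after the first line.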

Answered 2012-10-12T06:30:33.647