我正在使用 urllib 和 beautifulsoup 来解析 django 中的 xml 文件。我无法使用 CDATA 解析描述标签的内容。
我的 xml 标签。
<item>
<title>EU Confronting US Over Surveillance</title>
<description><![CDATA[Voice of America is an international news and broadcast organization serving Central and Eastern Europe, the Caucasus, Central Asia, Russia, the Middle East and Balkan countries]]></description>
<guid>http://www.voanews.com/content/eu-confronting-us-over-surveillance/1778928.html</guid>
</item>
这个描述标签在项目标签views.py中
for i in soup.findAll('item'):
print i.description.string
如果 CDATA 不存在意味着我可以解析描述标签内的内容。我不知道如何解析这个内容。请帮助我另外如何获取标签内的图像..
<description><img src='http://static.ibnlive.in.com/ibnlive/pix/sitepix/10_2013/tony-abbott-visits-afghanistan-says-australias-war-is-over_291013013344_338x225.jpg' width='90' height='62'><p>"Australia's longest war" is ending and its defence forces mission in Afghanistan will be complete by 2013 end, Prime Minister Tony Abbott announced in a statement on Tuesday.</p></description>