我的程序有问题,我需要一些帮助。我需要我的 saida.txt 和这些东西:
+, adj, 技术
+, adj, oiaã
+, 副词, 123
+, adv, oshi
-, adv, teste1
-, 副词, oi1
但我在我的 saida.txt 中只得到这些东西:
+, adv, 123
+, adv, oshi
-, adv ,teste1
-, adv ,oi1
我的 xml(“pedaco.xml”)是:
<data>
<ver>
<pontuacao>+</pontuacao>
<nuver>palavra1</nuver>
<tiver>palavra2</tiver>
<cl>
<nocl>adj</nocl>
<an>teçste</an>
<an>oiaã</an>
</cl>
<cl>
<nocl> adv</nocl>
<an> 123 </an>
<an> oshi </an>
</cl>
</ver>
<ver>
<pontuacao>-</pontuacao>
<nuver>palavra3</nuver>
<tiver>palavra4</tiver>
<cl>
<nocl>adv</nocl>
<an>teste1</an>
<an>oi1</an>
</cl>
</ver>
</data>
我的完整代码是
# -*- coding: utf-8 -*
from xml.dom import minidom
import sys
reload(sys)
sys.setdefaultencoding("utf-8")
xmldoc = minidom.parse("pedaco.xml")
arquivo = open('saida.txt','w')
ver = xmldoc.getElementsByTagName('ver')
for node in ver:
nuver = node.getElementsByTagName('nuver')
tiver = node.getElementsByTagName('tiver')
pontuacao = node.getElementsByTagName('pontuacao')
cl = node.getElementsByTagName('cl')
for c in cl:
an = c.getElementsByTagName('an')
nocl = c.getElementsByTagName('nocl')
for a in pontuacao:
printando1 = a.childNodes[0].nodeValue
for b in nocl:
printando2 = b.childNodes[0].nodeValue
for c in an:
printando3 = c.childNodes[0].nodeValue
arquivo.write(printando1+",")
arquivo.write(printando2+",")
arquivo.write(printando3+"\n")