I am learning Python and am having trouble understanding the behavior of the XML parser (ElementTree's XMLParser).
I modified the example from the documentation:
class MaxDepth:  # The target object of the parser
    path = ""

    def start(self, tag, attrib):  # Called for each opening tag.
        self.path += "/" + tag
        print '>>> Entering - ' + self.path

    def end(self, tag):  # Called for each closing tag.
        print '<<< Leaving - ' + self.path
        if self.path.endswith('/' + tag):
            self.path = self.path[:-(len(tag) + 1)]

    def data(self, data):
        if data:
            print '... data called ...'
            print data, 'length -', len(data)

    def close(self):  # Called when all data has been parsed.
        return self
It prints the following output:
>>> Entering - /a
... data called ...
length - 1
... data called ...
length - 2
>>> Entering - /a/b
... data called ...
length - 1
... data called ...
length - 2
<<< Leaving - /a/b
... data called ...
length - 1
... data called ...
length - 2
>>> Entering - /a/b
... data called ...
length - 1
... data called ...
length - 4
>>> Entering - /a/b/c
... data called ...
length - 1
... data called ...
length - 6
>>> Entering - /a/b/c/d
... data called ...
length - 1
... data called ...
length - 6
<<< Leaving - /a/b/c/d
... data called ...
length - 1
... data called ...
length - 4
<<< Leaving - /a/b/c
... data called ...
length - 1
... data called ...
length - 2
<<< Leaving - /a/b
... data called ...
length - 1
<<< Leaving - /a
<__main__.MaxDepth instance at 0x10e7dd5a8>
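My questions are:
- When is the data() method called?
- Why is it called twice before each start tag?
- I could not find API documentation with more details about the data method. Where can I find an API reference for the XMLParser class, something like the Javadoc for a class?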
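To make the data() calls easier to inspect, here is a minimal sketch that records every callback and prints the text with repr(), so whitespace such as newlines and indentation becomes visible. The EventRecorder class and the sample string are hypothetical (the XML input used above is not shown), and as far as I understand the parser is free to deliver one run of text in several chunks, so the number of data() calls is not something I would rely on.

```python
from xml.etree.ElementTree import XMLParser


class EventRecorder(object):
    """Hypothetical parser target that records every callback it receives."""

    def __init__(self):
        self.events = []

    def start(self, tag, attrib):
        self.events.append(("start", tag))

    def end(self, tag):
        self.events.append(("end", tag))

    def data(self, data):
        # Called with the text found between tags, including whitespace.
        self.events.append(("data", data))

    def close(self):
        # XMLParser.close() returns whatever the target's close() returns.
        return self.events


# Assumed sample input: the newline and indentation between tags are text too.
sample = "<a>\n  <b>text</b>\n</a>"

parser = XMLParser(target=EventRecorder())
parser.feed(sample)
events = parser.close()

for kind, value in events:
    # repr() shows '\n' and spaces explicitly instead of printing them raw.
    print("%s %r" % (kind, value))
```

Joining all recorded data chunks gives back the full text content of the document, whitespace included, regardless of how the parser splits it up.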