我正在尝试“规范化”一个 DefaultStyledDocument 子类,在我们拥有的意义上org.w3c.dom.Node.normalize()
:也就是说,合并相邻的文本“叶子”。在 DefaultStyledDocument 的情况下,如果两个相邻的叶子具有相同的属性(或没有属性),则这些叶子被标识为合并。
下面是一个简单的版本(我们不检查实际属性:这是一个用例,您要么拥有纯文本,要么拥有一种可能的标记样式的文本)。
def normalise( self ):
# recursive function:
def normalise_structure( el, depth = 0 ):
indent = ' ' * depth
start = el.startOffset
print( '%s# el %s |%s|' % ( indent, el, self.getText( start, el.endOffset - start )))
prev_attr_set = None
for i in range( el.elementCount ):
subelement = el.getElement( i )
normalise_structure( subelement, depth + 1 )
if subelement.leaf:
curr_attr_set = subelement.attributes
print( '%s # this is a leaf, attribs %s' % ( indent, curr_attr_set, ))
# this is a simple version: only works if there is only one possible attribute
if prev_attr_set and curr_attr_set and prev_attr_set.attributeCount == curr_attr_set.attributeCount:
print( '%s # %s leaf needs to be merged with previous leaf' % (
indent, 'marked-up' if prev_attr_set.attributeCount == 1 else 'plain'))
attr_set = prev_attr_set.getElement( 0 ) if prev_attr_set.attributeCount else None
prev_subelement = el.getElement( i - 1 )
prev_start = prev_subelement.startOffset
curr_end = subelement.endOffset
merged_element = javax.swing.text.AbstractDocument.LeafElement(
javax.swing.text.DefaultStyledDocument(), el, attr_set, prev_start, curr_end )
el.replace( prev_start, curr_end - prev_start, [ merged_element ] )
prev_attr_set = curr_attr_set
else:
print( '%s # NOT a leaf...' % ( indent, ))
prev_attr_set = None
for self_el in self.rootElements:
normalise_structure( self_el )
当我运行这个我得到这个错误:
javax.swing.text.AbstractDocument$BranchElement.replace(AbstractDocument.java:2290) 的 java.lang.System.arraycopy(Native Method) 的线程“AWT-EventQueue-0”java.lang.ArrayIndexOutOfBoundsException 中的异常
我赶紧补充一下,在尝试javax.swing.text.DefaultStyledDocument()
作为 LeafElement 构造函数中的参数 1 之前,我尝试了“ self
”(即normalise
在第一行调用的 DefaultStyledDocument):同样的错误。