4

我正在尝试使用 jDOM 拆分一个大型 XML 文件(500mb)(我知道我可能应该尝试 SAX,但是......)但我得到 org.jdom.IllegalAddException: The Content already has an existing parent "root" exception,如图所示在下面的代码中。

示例 xml 和代码如下。我相信所有索引检查和其他琐碎的东西都是正确的。

谢谢!!!

首先对大量代码感到抱歉。

<root>
  <metadata><md1>...</md1><md2>...</md2><metadata>
  <someOtherInfo><soi_1>...</soi_1></someOtherInfo>
  <collection>
    <item id="1">...</item><item id="2">...</item><item id="2">...</item>
  </collection>
</root>

split() {
    final String[] nodeNames = XmlUtils.getNodeNames(elementXpath); // returns {root, collection, item}

    // creates tree of 
    //<root>
    //  <metadata><md1>...</md1><md2>...</md2><metadata>
    //  <someOtherInfo><soi_1>...</soi_1></someOtherInfo>
    //  <collection>

    final Element originalDestination = importNodes(sourceDocument, nodeNames);  

    Element destination = null;

    // traverses to "collection" element
    Element source = sourceDocument.getRootElement();
    for (int tempCount = 1; tempCount < nodeNames.length - 1; ++tempCount) {
        source = source.getChild(nodeNames[tempCount]);
    }

    // get all "collection/item" elements
    for (Object obj : source.getChildren(nodeNames[nodeNames.length - 1])) {
        // makes sure that each out file has batchSize no of elements
        if (groupCount % batchSize == 0) {
            if (destination != null) {
                // traverse and go back up to the root
                Element root = destination;
                while (root.getParentElement() != null) {
                    root = root.getParentElement();
                }

                // this is where I get -- org.jdom.IllegalAddException: The Content already has an existing parent "root" -- exception
                final Document destDocument = new Document(destination);

                // write file to disk and reset counters
            } else {
                // create complete clone of originalDestination so that even its parents are cloned
                destination = createClone(originalDestination, nodeNames);
            }
        }

        // add this "item" element to destination "collection" element
        final Element element = (Element) obj;
        destination.addContent(((Element) element.clone()));
        count++;
        groupCount++;
    }

    if (groupCount > 0) {
        // write remaining "items" to file
    }

}

private Element createClone(final Element source, final String[] nodeNames) {
    Element destination = source;
    while (destination.getParentElement() != null) {
        destination = destination.getParentElement();
    }
    destination = (Element) destination.clone();
    for (int tempCount = 1; tempCount < nodeNames.length - 1; ++tempCount) {
        destination = destination.getChild(nodeNames[tempCount]);
    }
    return destination;
}

private Element importNodes(final Document document,
    final String[] nodeNames) {

    Element source = document.getRootElement();
    if (!source.getName().equals(nodeNames[0])) {
        return null;
    }

    Element destination = null;

    for (int count = 0; count < (nodeNames.length - 1); count++) {
        if (count > 0) {
            source = source.getChild(nodeNames[count]);
        }
        final Element child = new Element(nodeNames[count]);
        if (destination != null) {
            destination.setContent(child);
        }
        destination = child;

        // copy attributes -- don't want to clone here since this is one of the ancestors of "item"
        for (Object objAttb : source.getAttributes()) {
            Attribute attb = (Attribute) objAttb;
            destination.setAttribute(attb.getName(), attb.getValue());
        }

        // this is for <metadata> and <soneInfo> elements
        for (Object obj : source.getChildren()) {
            final Element childToClone = (Element) obj;
            if (!childToClone.getName().equals(nodeNames[count + 1])
                    && (ignoreWhiteSpaceNodes ? !childToClone.getName()
                            .equals("#text") : true)) {
                final Element clone = (Element) childToClone.clone();
                destination.addContent(clone);
            }
        }

    }

    return destination;
}
4

4 回答 4

14

在将元素插入另一个文档之前,您只需要从其父元素中分离()元素。

于 2011-08-03T10:48:38.853 回答
11

在 JDOM 实现中,每个元素都获取到父元素的链接:在新目标中添加元素之前,您必须将元素从原始结构中分离出来。

Element elemCopy = (Element)element.clone();
elemCopy.detach();
destination.addContent(elemCopy);
于 2012-07-24T16:40:31.543 回答
3

替换 JDOM 中的元素:

element.removeContent();

int size =  frEl.getContentSize();

for(int count = 0; count < size; count++) {
        element.addContent(frEl.getContent(0).detach());
}
于 2014-07-11T13:28:44.913 回答
2

如果它是元素列表,您可能需要执行类似于以下的操作。

for (int count = 0; count < resultEle.size(); count++) {
    destDocument.getRootElement().getChild("result").addContent(resultEle.get(count).detach());
}
于 2012-12-27T10:42:24.723 回答