c# - 用 HtmlAgilityPack 包裹一个元素？

Question

我有一个HtmlDocument可能或可能有一个适当的<head>和<body>部分，或者可能只是一个 html 片段。无论哪种方式，我都想通过一个函数来运行它，以确保它具有（更多）正确的 html 结构。

我知道我可以通过查看是否有身体来检查它是否有身体

doc.DocumentNode.SelectSingleNode("//body");

一片空白。如果它没有正文，我将如何将 doc.DocumentNode 的内容包装在一个<body>元素中并将其分配回HtmlDocument？

编辑：我想做什么似乎有些困惑。在 jquery 方面：

$doc = $(document);
if( !$doc.has('body') ) {
    $doc.wrapInner('body');
}

基本上，如果没有 body 元素，则在所有内容周围放置一个 body 元素。

score 4 · Accepted Answer

你可以这样做：

HtmlDocument doc = new HtmlDocument();
doc.Load(MyTestHtm);
HtmlNode body = doc.DocumentNode.SelectSingleNode("//body");
if (body == null)
{
    HtmlNode html = doc.DocumentNode.SelectSingleNode("//html");
    // we presume html exists

    body = CloneAsParentNode(html.ChildNodes, "body");
}


static HtmlNode CloneAsParentNode(HtmlNodeCollection nodes, string name)
{
    List<HtmlNode> clones = new List<HtmlNode>(nodes);
    HtmlNode parent = nodes[0].ParentNode;

    // create a new parent with the given name
    HtmlNode newParent = nodes[0].OwnerDocument.CreateElement(name);

    // insert before the first node in the selection
    parent.InsertBefore(newParent, nodes[0]);

    // clone all sub nodes
    foreach (HtmlNode node in clones)
    {
        HtmlNode clone = node.CloneNode(true);
        newParent.AppendChild(clone);
    }

    // remove all sub nodes
    foreach (HtmlNode node in clones)
    {
        parent.RemoveChild(node);
    }
    return newParent;
}

c# - 用 HtmlAgilityPack 包裹一个元素？

1 回答 1

Related

Reference