1

我正在使用此 XPath 查询来选择 Xhtml 文档中没有输入后代的元素:

//*[not(descendant-or-self::input | descendant-or-self::textarea | descendant-or-self::select | ancestor::select)]

使用以下示例 XHtml 文档:

<html>
    <head>
        <title>Title</title>
    </head>
    <body>
        <div id="one">
            <input type="text" />
        </div>
        <div id="two">
            <textarea></textarea>
        </div>
        <div id="three">
            <div id="four">
                Text
            </div>
        </div>
        <div id="five">
            <select>
                <option>One</option>
                <option>Two</option>
            </select>
        </div>
        <div id="six">
            <input type="text" />
        </div>
        <div id="seven">
            <div id="eight"></div>
        </div>
    </body>
</html>

...还有这个 PHP 代码:

// Populate $html and $query with above

$dom = new DOMDocument('1.0', 'UTF-8');
$dom->loadXML($html);

$xpath = new DOMXPath($dom);
$nodes = $xpath->query($query);

foreach($nodes as $node)
{
    echo $node->tagName;

    if($node->hasAttribute('id'))
        echo '#' . $node->getAttribute('id');

    echo ' ';
}

我明白了:head title div#three div#four div#seven div#eight

但我想要这个:head div#three div#seven

我将获取 XPath 查询的结果并从 DOMDocument 中删除元素。title div#four div#eight是 的孩子head div#three div#seven,它们已经在结果中。

请记住,此查询将用于任何 XHtml 文档,我将如何更改我的 XPath 1.0 查询以获得所需的结果?

4

2 回答 2

1

只需为父母重复条件:

[not(descendant-or-self::input | descendant-or-self::textarea | descendant-or-self::select | ancestor-or-self::select)
and 
(../descendant-or-self::input | ../descendant-or-self::textarea | ../descendant-or-self::select | ../ancestor-or-self::select)]
于 2012-02-14T10:57:45.073 回答
0

在这种情况下,扩展您当前的 xpath 以另外说明轴中不应有 adiv或 a似乎就足够了:headancestor

//*[not(descendant-or-self::input 
      | descendant-or-self::textarea 
      | descendant-or-self::select 
      | ancestor::select
      | ancestor::div
      | ancestor::head)]

在示例 xml 中,这将根据需要仅返回headand divs threeand seven

于 2012-02-14T10:18:42.390 回答