
I have the following HTML:

$html =  '<body><div style="font-color:#000">Hello</div>
<span style="what">My name is rasid</span><div>new to you 
</div><div style="rashid">New here</div></body>';

$dom = new DOMDocument();
$dom->loadHTML($html);

$elements = $dom->getElementsByTagName('body');

I tried:

foreach($elements as $child)
 {
        echo $child->nodeName;
 }

The output is:

body

But I need to loop through all the tags under body, not body itself. How can I do that?

I also tried, in the example above, replacing

$elements = $dom->getElementsByTagName('body');

with

$elements = $dom->getElementsByTagName('body')->item(0);

But that gives an error. Is there a solution?


3 Answers


Try this:

    $elements = $dom->getElementsByTagName('*');
    $i = 1; // counter: the loop visits "html body div span div div", so output starts from the 3rd element

    foreach ($elements as $child)
    {
        // skip the first two elements (html and body); outputs "div span div div"
        if ($i > 2) echo $child->nodeName . "<br>";
        ++$i;
    }
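
The counter above relies on html and body always being the first two elements. A possible alternative, sketched here as an assumption rather than as part of this answer, is to call getElementsByTagName('*') on the body element itself, so that only elements under body are returned. The variable names $body and $names are illustrative.

```php
<?php
// HTML from the question.
$html = '<body><div style="font-color:#000">Hello</div>
<span style="what">My name is rasid</span><div>new to you 
</div><div style="rashid">New here</div></body>';

$dom = new DOMDocument();
$dom->loadHTML($html);

// Take the body element, then ask it (not the document) for every element below it.
$body = $dom->getElementsByTagName('body')->item(0);

$names = [];
if ($body !== null) {
    foreach ($body->getElementsByTagName('*') as $element) {
        $names[] = $element->nodeName;
    }
}

echo implode(' ', $names), PHP_EOL;
```

For the HTML in the question this should list div, span, div and div, with no dependence on how many elements precede body.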
answered 2013-03-29T04:53:43.327

If you only want the child nodes of the body element, you can use:

$body = $dom->getElementsByTagName( 'body' )->item( 0 );
foreach( $body->childNodes as $node )
{
    echo $node->nodeName . PHP_EOL;
}
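
Note that childNodes contains every kind of node, so depending on how the markup is parsed it may also yield #text nodes for whitespace between tags. If only the tags are wanted, one option is to filter on nodeType. The following is a minimal sketch; elementChildNames is a hypothetical helper name, not something from the answer.

```php
<?php
// Hypothetical helper: names of the direct children that are elements,
// skipping text nodes, comments and other node types.
function elementChildNames(DOMNode $parent): array
{
    $names = [];
    foreach ($parent->childNodes as $node) {
        if ($node->nodeType === XML_ELEMENT_NODE) {
            $names[] = $node->nodeName;
        }
    }
    return $names;
}

// HTML from the question.
$html = '<body><div style="font-color:#000">Hello</div>
<span style="what">My name is rasid</span><div>new to you 
</div><div style="rashid">New here</div></body>';

$dom = new DOMDocument();
$dom->loadHTML($html);

$body = $dom->getElementsByTagName('body')->item(0);
$childNames = $body !== null ? elementChildNames($body) : [];

echo implode(PHP_EOL, $childNames), PHP_EOL;
```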

If you want all descendant nodes of the body element, you can use DOMXPath:

$xpath = new DOMXPath( $dom );
$bodyDescendants = $xpath->query( '//body//node()' );
foreach( $bodyDescendants as $node )
{
    echo $node->nodeName . PHP_EOL;
}
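
The node() test matches text nodes as well as elements. If only elements are of interest, the XPath wildcard * can be used instead. The sketch below uses a small made-up HTML string (not the one from the question) with one nested tag, to show the difference between direct children (//body/*) and all descendants (//body//*).

```php
<?php
// Illustrative HTML with a nested <b>, chosen to make the two queries differ.
$dom = new DOMDocument();
$dom->loadHTML('<body><div>Hello <b>world</b></div><span>Hi</span></body>');

$xpath = new DOMXPath($dom);

// Direct element children of body only.
$children = [];
foreach ($xpath->query('//body/*') as $node) {
    $children[] = $node->nodeName;
}

// Every element below body, at any depth, in document order.
$descendants = [];
foreach ($xpath->query('//body//*') as $node) {
    $descendants[] = $node->nodeName;
}

echo 'children: ', implode(' ', $children), PHP_EOL;
echo 'descendants: ', implode(' ', $descendants), PHP_EOL;
```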
answered 2013-03-29T13:24:38.623

Use this code:

    $elements = $dom->getElementsByTagName('*');

    foreach ($elements as $child)
    {
        echo $child->nodeName;
    }
answered 2013-03-29T05:01:11.423