这是HTML:
<article class="module_article featured">
<a title="Exclusive: Strictly's Vincent Simone welcomes baby boy" href="h/mother-and-baby/2013091914634/vincent-simone-baby-boy-born/"><h1 class="article_title">Exclusive: Strictly's Vincent Simone welcomes baby boy</h1></a> <a href="/healthandbeauty/mother-and-baby/2013091914634/vincent-simone-baby-boy-born/">
<img src="/imagenes/portadas/1-40-vincent-s.jpg">
</a>
<a href="/healthandbeauty/mother-and-baby/2013091914634/vincent-simone-baby-boy-born/">
<img src="/imagenes/portadas/1-40-vincent-s.jpg">
</a>
<p>HELLO! Online can exclusively reveal that Strictly Come Dancing professional Vincent...</p>
</article>
<article class="module_article featured">
<a title="Exclusive: Strictly's Vincent Simone welcomes baby boy" href="h/mother-and-baby/2013091914634/vincent-simone-baby-boy-born/"><h1 class="article_title">Exclusive: Strictly's Vincent Simone welcomes baby boy</h1></a> <a href="/healthandbeauty/mother-and-baby/2013091914634/vincent-simone-baby-boy-born/">
<img src="/imagenes/portadas/1-40-vincent-s.jpg">
</a>
<a href="/healthandbeauty/mother-and-baby/2013091914634/vincent-simone-baby-boy-born/">
<img src="/imagenes/portadas/1-40-vincent-s.jpg">
</a>
<p>HELLO! Online can exclusively reveal that Strictly Come Dancing professional Vincent...</p>
</article>
这是我的 XPATH:
$articleLinks = $finder->query('article[contains(@class,"module_article")]//@href');
如您所见,它同时抓住了两者hrefs
。我只需要第一个。