描述
这将跳过锚标记中的所有其他属性,即使它们的值看起来像嵌套在值中的属性。
<a(?=\s|>) # validate this is an anchor tag
(?! # start look ahead ! must not contain, = must contain
(?:[^>=]|='[^']*'|="[^"]*"|=[^'"][^\s>]*)*? # move through tag, skipping over quoted or non quoted values
\shref="[^"]*(?:jpg|png|gif)" # find href, capture value including quotes if they exist
) # end look ahead
[^>]*>.*?<\/a> # capture the entire to the close tag
PHP 代码示例:
示例文本
注意第二行
<a href="http://mysite.com">My Site</a>
<a wrongtag=" href='http://mysite.com/image.jpg' " href="http://mysite.com">My Site</a>
<a href="http://mysite.com/image.jpg"><img src="http://mysite.com/image.jpg"/></a>
<a href="http://mysite.com/image.gif"><img src="http://mysite.com/image.gif"/></a>
<a href="http://yoursite.com">Your Site</a>
代码
<?php
$sourcestring="your source string";
echo preg_replace('/<a(?=\s|>)
(?! # start look ahead ! must not contain, = must contain
(?:[^>=]|=\'[^\']*\'|="[^"]*"|=[^\'"][^\s>]*)*? # move through tag, skipping over quoted or non quoted values
\shref="[^"]*(?:jpg|png|gif)" # find href, capture value including quotes if they exist
) # end look ahead
[^>]*>.*?<\/a> # actually capture the string
/imsx','',$sourcestring);
?>
火柴
[0] => <a href="http://mysite.com/image.jpg"><img src="http://mysite.com/image.jpg"/></a>
[1] => <a href="http://mysite.com/image.gif"><img src="http://mysite.com/image.gif"/></a>