php - preg_match_all 未正确匹配 href 部分

Question

我在使用 preg_match_all 匹配链接的 href 部分时遇到问题，目前它正在捕获 3 个部分（完整链接、仅 url、仅链接文本），这是完美的，但仅 url 部分正在捕获位于 href 之后的任何其他标签标签。

另外，如何使“href”文本不区分大小写？

代码：

$content = '<a href="http://www.google.com" target="_blank">Google</a> is a search engine. <a href="http://www.yahoo.com" title="yahoo" target="_blank">Yahoo</a> is a search engine.';

preg_match_all('/<a href="([^<]*)">([^<]*)<\/a>/', $content, $matches);

print_r($matches);

结果：

Array
(
    [0] => Array
        (
            [0] => <a href="http://www.google.com" target="_blank">Google</a>
            [1] => <a href="http://www.yahoo.com" title="yahoo" target="_blank">Yahoo</a>
        )

    [1] => Array
        (
            [0] => http://www.google.com" target="_blank
            [1] => http://www.yahoo.com" title="yahoo" target="_blank
        )

    [2] => Array
        (
            [0] => Google
            [1] => Yahoo
        )

)

score 2 · Accepted Answer

您开始寻找 > 而没有考虑任何其他属性。尝试

/<a href="([^"]*)"[^>]+>([^<]*)<\/a>/

这现在将拉出 href，然后跳过其余属性，然后将 html 拉到下一个标签上

php - preg_match_all 未正确匹配 href 部分

1 回答 1

Related

Reference