php - 更改正则表达式以不包含 [ 和 ] 内的内容

Question

我在这里有这个自动链接正则表达式代码：

// turn any url into url bbcode that doesn't have it already - so we can auto link urls- thanks stackoverflow

$URLRegex = '/(?:(?<!(\[\/url\]|\[\/url=))(\s|^))'; // No [url]-tag in front and is start of string, or has whitespace in front
$URLRegex.= '(';                                    // Start capturing URL
$URLRegex.= '(https?|ftps?|ircs?):\/\/';            // Protocol
$URLRegex.= '\S+';                                  // Any non-space character
$URLRegex.= ')';                                    // Stop capturing URL
$URLRegex.= '(?:(?<![.,;!?:\"\'()-])(\/|\s|\.?$))/i';      // Doesn't end with punctuation and is end of string, or has whitespace after

$body = preg_replace($URLRegex,"$2[url=$3]$3[/url]$5", $body);

问题是，如果 url 在引用标签内，并且结束引用标签正好在链接上，那么结束引用标签就会包含在链接中，这当然会搞砸！

如何调整该正则表达式以在链接中不包含 [ 和 ] 内的任何内容？

样本输入：

[quote=liamdawe] Have you had a look at [url=http://pcgamingwiki.com/wiki/Serious_Sam_II#Linux_Installation]this howto[/url]? :)

http://pcgamingwiki.com/wiki/Serious_Sam_II#Linux_Installation[/quote]
Testing

正确的输出是：

<div class="quote"><strong>Quote from liamdawe</strong><br />  Have you had a look at <a href="http://pcgamingwiki.com/wiki/Serious_Sam_II#Linux_Installation" target="_blank">this howto</a>? <img src="/jscripts/sce/emoticons/smile.png" alt="" /><br />
<br />
<a href="http://pcgamingwiki.com/wiki/Serious_Sam_II#Linux_Installation" target="_blank">http://pcgamingwiki.com/wiki/Serious_Sam_II#Linux_Installation</a></div><br />
Testing

但我得到的输出是：

<div class="quote"><strong>Quote from liamdawe</strong><br />  Have you had a look at <a href="http://pcgamingwiki.com/wiki/Serious_Sam_II#Linux_Installation" target="_blank">this howto</a>? <img src="/jscripts/sce/emoticons/smile.png" alt="" /><br />
<br />
<a href="http://pcgamingwiki.com/wiki/Serious_Sam_II#Linux_Installation </div>" target="_blank">http://pcgamingwiki.com/wiki/Serious_Sam_II#Linux_Installation[/quote]</a><br />
Testing<br />

如您所见，它在链接中包含了 [/quote] 标记，因为它没有忽略自动链接器正则表达式中的 bbcode 标记。

如果需要，下面是执行该类型引用的代码： // 引用一个真实的人、书或任何东西 $pattern = '/[quote\=(.+?)](.+?)[/quote]/是';

$replace = "<div class=\"quote\"><strong>Quote from $1</strong><br />$2</div>";

while(preg_match($pattern, $body))
{
    $body = preg_replace($pattern, $replace, $body);
}

score 2 · Accepted Answer

试试这个

$URLRegex = '/(?:(?<!(\[\/url\]|\[\/url=))(\s|^))'; // No [url]-tag in front and is start of string, or has whitespace in front
$URLRegex.= '(';                                    // Start capturing URL
$URLRegex.= '(https?|ftps?|ircs?):\/\/';            // Protocol
$URLRegex.= '[\w\d\.\/#\_\-\?:=]+';                        // Any non-space character
$URLRegex.= ')';                                    // Stop capturing URL
$URLRegex.= '(?:(?<![.,;!?:\"\'()-])(\/|\[|\s|\.?$))/i';      // Doesn't end with punctuation and is end of string, or has whitespace after

$body = preg_replace($URLRegex,"$2[url=$3]$3[/url]$5", $body);

php - 更改正则表达式以不包含 [ 和 ] 内的内容

1 回答 1

Related

Reference