7

我有 preg_match_all 功能:

preg_match_all('#<h2>(.*?)</h2>#is', $source, $output, PREG_SET_ORDER);

它按预期工作,但问题是,它将所有项目 preg_matches 两次并放入一个像这样的巨大多维数组中,例如,它按预期 preg_matched 所需的所有 11 个项目,但两次并进入多维数组:

Array
(
    [0] => Array
        (
            [0] => <h2>10. <em>Cruel</em> by St. Vincent</h2>
            [1] => 10. <em>Cruel</em> by St. Vincent
        )

    [1] => Array
        (
            [0] => <h2>9. <em>Robot Rock</em> by Daft Punk</h2>
            [1] => 9. <em>Robot Rock</em> by Daft Punk
        )

    [2] => Array
        (
            [0] => <h2>8. <em>Seven Nation Army</em> by the White Stripes</h2>
            [1] => 8. <em>Seven Nation Army</em> by the White Stripes
        )

    [3] => Array
        (
            [0] => <h2>7. <em>Do You Want To</em> by Franz Ferdinand</h2>
            [1] => 7. <em>Do You Want To</em> by Franz Ferdinand
        )

    [4] => Array
        (
            [0] => <h2>6. <em>Teenage Dream</em> by Katie Perry</h2>
            [1] => 6. <em>Teenage Dream</em> by Katie Perry
        )

    [5] => Array
        (
            [0] => <h2>5. <em>Crazy</em> by Gnarls Barkley</h2>
            [1] => 5. <em>Crazy</em> by Gnarls Barkley
        )

    [6] => Array
        (
            [0] => <h2>4. <em>Kids</em> by MGMT</h2>
            [1] => 4. <em>Kids</em> by MGMT
        )

    [7] => Array
        (
            [0] => <h2>3. <em>Bad Romance</em> by Lady Gaga</h2>
            [1] => 3. <em>Bad Romance</em> by Lady Gaga
        )

    [8] => Array
        (
            [0] => <h2>2. <em>Pumped Up Kicks</em> by Foster the People</h2>
            [1] => 2. <em>Pumped Up Kicks</em> by Foster the People
        )

    [9] => Array
        (
            [0] => <h2>1. <em>Paradise</em> by Coldplay</h2>
            [1] => 1. <em>Paradise</em> by Coldplay
        )

    [10] => Array
        (
            [0] => <h2>Song That Get Stuck In Your Head YouTube Playlist</h2>
            [1] => Song That Get Stuck In Your Head YouTube Playlist
        )

)

如何将此数组转换为简单的数组并且没有那些重复的项目?非常感谢你。

4

2 回答 2

8

你总是会得到一个多维数组,但是,你可以像这样接近你想要的:

if (preg_match_all('#<h2>(.*?)</h2>#is', $source, $output, PREG_PATTERN_ORDER))
    $matches = $output[0]; // reduce the multi-dimensional array to the array of full matches only

如果您根本不需要子匹配,请使用非捕获分组:

if (preg_match_all('#<h2>(?:.*?)</h2>#is', $source, $output, PREG_PATTERN_ORDER))
    $matches = $output[0]; // reduce the multi-dimensional array to the array of full matches only

请注意,对 preg_match_all 的调用使用的是 PREG_PATTERN_ORDER 而不是 PREG_SET_ORDER:

PREG_PATTERN_ORDER 对结果进行排序,以便 $matches[0] 是完整模式匹配的数组,$matches[1] 是由第一个带括号的子模式匹配的字符串数组,依此类推。

PREG_SET_ORDER 排序结果,使得 $matches[0] 是第一组匹配的数组,$matches[1] 是第二组匹配的数组,依此类推。

见: http: //php.net/manual/en/function.preg-match-all.php

于 2012-11-14T05:48:52.450 回答
2

采用

#<h2>(?:.*?)</h2>#is 

作为你的正则表达式。如果您使用非捕获组(这是什么?:意思),则反向引用不会出现在数组中。

于 2012-11-14T05:48:44.573 回答