0

我有以下表格:

**visitors**
+---------------------+--------------+------+-----+---------+----------------+
| Field               | Type         | Null | Key | Default | Extra          |
+---------------------+--------------+------+-----+---------+----------------+
| visitors_id         | int(11)      | NO   | PRI | NULL    | auto_increment |
| visitors_path       | varchar(255) | NO   |     |         |                |
+---------------------+--------------+------+-----+---------+----------------+

**fedora_info**
+----------------+--------------+------+-----+---------+-------+
| Field          | Type         | Null | Key | Default | Extra |
+----------------+--------------+------+-----+---------+-------+
| pid            | varchar(255) | NO   | PRI |         |       |
| owner_uid      | int(11)      | YES  |     | NULL    |       |
+----------------+--------------+------+-----+---------+-------+

首先,我通过以下方式寻找visitors_path与特定页面相关的内容:

SELECT visitors_id, visitors_path
FROM visitors
WHERE visitors_path REGEXP '[[:<:]]fedora/repository/.*:[0-9]+$';

上述查询返回预期结果。

现在在第二个表中.*:[0-9]+提到的上述查询中。现在我想知道在第二个表中pid分组的上述查询中的结果计数。owner_uid

我怎样才能加入这个表?

编辑

样本数据:

visitors
+-------------+---------------------------------+
| visitors_id | visitors_path                   |
+-------------+---------------------------------+
|        4574 | fedora/repository/islandora:123 |
|        4575 | fedora/repository/islandora:123 |
|        4580 | fedora/repository/islandora:321 |
|        4681 | fedora/repository/islandora:321 |
|        4682 | fedora/repository/islandora:321 |
|        4704 | fedora/repository/islandora:321 |
|        4706 | fedora/repository/islandora:456 |
|        4741 | fedora/repository/islandora:456 |
|        4743 | fedora/repository/islandora:789 |
|        4769 | fedora/repository/islandora:789 |
+-------------+---------------------------------+

fedora_info
+-----------------+-----------+
| pid             | owner_uid |
+-----------------+-----------+
| islandora:123   |         1 |
| islandora:321   |         2 |
| islandora:456   |         3 |
| islandora:789   |         4 |
+-----------------+-----------+

Expected result:
+-----------------+-----------+
| count           | owner_uid |
+-----------------+-----------+
| 2               |         1 |
| 4               |         2 |
| 3               |         3 |
| 2               |         4 |
| 0               |         5 |
+-----------------+-----------+
4

2 回答 2

1

我建议你规范你的数据库。当在前端语言的visitors提取中插入行并将其放在单独的列中时(例如)。然后你就可以轻松加入了。pidfi_pid

以下查询可能对您有用。但这会占用很少的CPU资源。

SELECT 
       COUNT(a.visitors_id) as `count`,
       f.owner_uid
FROM   (SELECT visitors_id, 
               visitors_path, 
               SUBSTRING(visitors_path, ( LENGTH(visitors_path) - 
                                          LOCATE('/', REVERSE(visitors_path)) ) 
                                        + 2) AS 
                      pid 
        FROM   visitors 
        WHERE  visitors_path REGEXP '[[:<:]]fedora/repository/.*:[0-9]+$') AS `a`

JOIN fedora_info AS f 
         ON ( a.pid = f.pid ) 

GROUP  BY f.owner_uid 
于 2012-10-25T08:05:42.120 回答
0

以下查询返回预期结果,但速度很慢Query took 9.6700 sec

SELECT COUNT(t2.pid), t1.owner_uid
FROM fedora_info t1
JOIN (SELECT TRIM(LEADING 'fedora/repository/' FROM visitors_path) as pid
FROM visitors
WHERE visitors_path REGEXP '[[:<:]]fedora/repository/.*:[0-9]+$') t2 ON t1.pid = t2.pid
GROUP BY t1.owner_uid
于 2012-10-25T08:39:40.717 回答