4

我为工作编写了一个自定义帮助台,它一直运行良好……直到最近。一个查询确实变慢了。现在大约需要14秒!以下是相关表格:

CREATE TABLE `tickets` (
  `id` int(11) unsigned NOT NULL DEFAULT '0',
  `date_submitted` datetime DEFAULT NULL,
  `date_closed` datetime DEFAULT NULL,
  `first_name` varchar(50) DEFAULT NULL,
  `last_name` varchar(50) DEFAULT NULL,
  `email` varchar(50) DEFAULT NULL,
  `description` text,
  `agent_id` smallint(5) unsigned NOT NULL DEFAULT '1',
  `status` smallint(5) unsigned NOT NULL DEFAULT '1',
  `priority` tinyint(4) NOT NULL DEFAULT '0',
  PRIMARY KEY (`id`),
  KEY `date_closed` (`date_closed`)
) ENGINE=InnoDB DEFAULT CHARSET=utf8;

CREATE TABLE `solutions` (
  `id` int(10) unsigned NOT NULL,
  `ticket_id` mediumint(8) unsigned DEFAULT NULL,
  `date` datetime DEFAULT NULL,
  `hours_spent` float DEFAULT NULL,
  `agent_id` smallint(5) unsigned DEFAULT NULL,
  `body` text,
  PRIMARY KEY (`id`),
  KEY `ticket_id` (`ticket_id`),
  KEY `date` (`date`),
  KEY `hours_spent` (`hours_spent`)
) ENGINE=InnoDB DEFAULT CHARSET=utf8;

当用户提交票时,它会进入“票”表。然后,当代理解决问题时,他们会记录他们采取的行动。每个条目都进入“解决方案”表。换句话说,门票有很多解决方案。

变慢的查询的目标是从“tickets”表中提取所有字段,以及从“solutions”表中提取最新条目。这是我一直在使用的查询:

SELECT tickets.*,
    (SELECT CONCAT_WS(" * ", DATE_FORMAT(solutions.date, "%c/%e/%y"), solutions.hours_spent, CONCAT_WS(": ", solutions.agent_id, solutions.body))
    FROM solutions
    WHERE solutions.ticket_id = tickets.id
    ORDER BY solutions.date DESC, solutions.id DESC
    LIMIT 1
) AS latest_solution_entry
FROM tickets
WHERE tickets.date_closed IS NULL
OR tickets.date_closed >= '2012-06-20 00:00:00'
ORDER BY tickets.id DESC

以下是“latest_solution_entry”字段的示例:

6/20/12 * 1337 * 1: I restarted the computer and that fixed the problem. Yes, I took an hour to do this.

在 PHP 中,我拆分了“latest_solution_entry”字段并正确格式化。

当我注意到运行查询的页面速度变慢时,我在没有子查询的情况下运行了查询,而且速度非常快。然后我EXPLAIN在原始查询上运行并得到了这个:

+----+--------------------+-----------+-------+---------------+-----------+---------+---------------------+-------+-----------------------------+
| id | select_type        | table     | type  | possible_keys | key       | key_len | ref                 | rows  | Extra                       |
+----+--------------------+-----------+-------+---------------+-----------+---------+---------------------+-------+-----------------------------+
|  1 | PRIMARY            | tickets   | index | date_closed   | PRIMARY   | 4       | NULL                | 35804 | Using where                 |
|  2 | DEPENDENT SUBQUERY | solutions | ref   | ticket_id     | ticket_id | 4       | helpdesk.tickets.id |     1 | Using where; Using filesort |
+----+--------------------+-----------+-------+---------------+-----------+---------+---------------------+-------+-----------------------------+

所以我正在寻找一种方法来提高我的查询效率,但仍能实现相同的目标。有任何想法吗?

4

4 回答 4

19

让我总结一下我的理解:您想选择每张票及其最后的解决方案。

我喜欢对这类问题使用以下模式,因为它避免了子查询模式,因此在需要性能的地方相当不错。缺点是理解起来有点棘手:

SELECT
  t.*,
  s1.*
FROM tickets t
INNER JOIN solutions s1 ON t.id = s1.ticket_id
LEFT JOIN solutions s2 ON s1.ticket_id = s2.ticket_id AND s2.id > s1.id
WHERE s2.id IS NULL;

为了更好地理解,我只写了模式的核心。

关键是:

  • 表的 LEFT JOINsolutions与自身的s1.ticket_id = s2.ticket_id条件:它模拟GROUP BY ticket_id.

  • 条件s2.id > s1.id:它是“我只想要最后一个解决方案”的 SQL,它模拟MAX(). 我假设在您的模型中,the last意味着with the greatest id但您可以在此处使用日期条件。请注意,这s2.id < s1.id将为您提供第一个解决方案。

  • WHERE 子句s2.id IS NULL:最奇怪但绝对必要的子句……只保留您想要的记录。

试一试,让我知道:)

编辑1:我刚刚意识到第二点假设过于简单化了问题。这使它变得更加有趣:p 我正在尝试了解这种模式如何与您的date, id订单一起使用。

编辑2:好的,稍微扭曲一下效果很好。LEFT JOIN 的条件变为:

LEFT JOIN solutions s2 ON s1.ticket_id = s2.ticket_id
  AND (s2.date > s1.date OR (s2.date = s1.date AND s2.id > s1.id))
于 2012-06-21T08:18:48.570 回答
1

当您在 SELECT 子句中有一个内联视图时,它必须为每一行执行该选择。我发现在这种情况下最好在 FROM 子句中放置一个内联视图,而不是执行一次选择。

SELECT t.*, 
       Concat_ws(" * ", Date_format(s.date, "%c/%e/%y"), s.hours_spent, 
       Concat_ws(":", s.agent_id, s.body)) 
FROM   tickets t 
       INNER JOIN (SELECT solutions.ticket_id,
                          Max(solutions.date) maxdate 
                   FROM   solutions 
                   GROUP  BY solutions.ticket_id) last_solutions 
               ON t.id = last_solutions.ticket_id
       INNER JOIN (SELECT solutions.ticket_id,
                          solutions.date,
                          Max(solutions.id) maxid 
                   FROM   solutions 
                   GROUP  BY solutions.ticket_id,
                            solutions.date) last_solution
              ON last_solutions.ticket_id = last_solution.ticket_id 
                 and last_solutions.maxDate = last_solution.Date
       INNER JOIN solutions s 
               ON last_solution.maxid = s.id
WHERE  t.date_closed IS NULL 
        OR t.date_closed >= '2012-06-20 00:00:00' 
ORDER  BY t.id DESC 

注意:根据您的需要,您可能需要将其设为 LEFT join

于 2012-06-21T03:55:47.697 回答
1

尝试这个:

SELECT *
FROM (
  -- for each ticket get the most recent solution date
  SELECT ticket_id, MAX(solutions.date) as date
  FROM solutions
  GROUP BY ticket_id
) t
JOIN tickets ON t.ticket_id = tickets.id
WHERE tickets.date_closed IS NULL OR tickets.date_closed >= '2012-06-20 00:00:00'
ORDER BY tickets.id DESC

请注意,如果有一张包含 2 个同一日期的解决方案的票证,您的结果集中将有重复的记录。您将需要另一个连接来删除这些重复项或使用绝对序列,如序列(递增主键)。

于 2012-06-21T04:06:18.107 回答
1

根据目的,我给出一个想法:

SELECT DISTINCT s1.ticket_id, t.*,  s1.*
FROM tickets t
LEFT JOIN solutions s1 ON t.id = s1.ticket_id
于 2013-10-30T09:41:43.017 回答