0

我有一个查询需要几分钟才能在一个小数据集上执行,有什么问题?这是解释输出:

id  select_type table           type    possible_keys               key                 key_len   ref             rows  Extra

1   SIMPLE      BT_CALL_CLASS.. const   PRIMARY                     PRIMARY             4         const           1     Using index; Using temporary; Using filesort
1   SIMPLE      BT_FLAGS_FLAGS  const   PRIMARY                     PRIMARY             4         const           1     Using index
1   SIMPLE      BT_DIRECTORY... range   PRIMARY                     PRIMARY             4         NULL            308   Using where; Using index
1   SIMPLE      BT_CALL_FLAGS.. ref     PRIMARY,FKDF68A2ED9F150002  FKDF68A2ED9F150002  4         const           49    Using where; Using index
1   SIMPLE      BT_DEPARTMENT.. index   PRIMARY                     FK1F3A276188DDBD3   5         NULL            27    Using where; Using index; Using join buffer
1   SIMPLE      BT_USERS_USER.. index   PRIMARY                     FK6A68E086C27A155   5         NULL            233   Using where; Using index; Using join buffer
1   SIMPLE      BT_FCT_CALLS..  eq_ref  PRIMARY,FKDF68A2ED4F28..    PRIMARY             8         ...call_id      1     Using where

查询

 desc SELECT DISTINCT 
      BT_FCT_CALLS_FCT_CALLS.start_time AS COL0
     ,BT_FCT_CALLS_FCT_CALLS.calling_number AS COL1
     ,BT_FCT_CALLS_FCT_CALLS.called_number AS COL2
     ,BT_FCT_CALLS_FCT_CALLS.response AS COL3

 FROM departments BT_DEPARTMENTS_DEPARTMENTS_3 LEFT OUTER JOIN 
 ( 
  directory_numbers BT_DIRECTORY_NUMBERS_DIRECTORY_NUMBERS LEFT OUTER JOIN 
  ( 
   users BT_USERS_USERS_2 JOIN 
   ( 
    call_classification_dim BT_CALL_CLASSIFICATION_DIM_CALL_CLASSIFICATION_DIM JOIN 
    ( 
     flags BT_FLAGS_FLAGS JOIN 
     ( 
      fct_calls BT_FCT_CALLS_FCT_CALLS JOIN call_flags BT_CALL_FLAGS_CALL_FLAGS
      ON ( BT_FCT_CALLS_FCT_CALLS.id = BT_CALL_FLAGS_CALL_FLAGS.call_id )
     ) 
     ON ( BT_CALL_FLAGS_CALL_FLAGS.flag = BT_FLAGS_FLAGS.id AND (  BT_FLAGS_FLAGS.id  = 1 )  )
    ) 
    ON ( BT_FCT_CALLS_FCT_CALLS.call_direction_id = BT_CALL_CLASSIFICATION_DIM_CALL_CLASSIFICATION_DIM.id AND (  BT_CALL_CLASSIFICATION_DIM_CALL_CLASSIFICATION_DIM.id  = 2 )  )
   ) 
   ON ( (( BT_FCT_CALLS_FCT_CALLS.on_network_called_user_id  =  BT_USERS_USERS_2.id ) OR ( BT_FCT_CALLS_FCT_CALLS.on_network_calling_user_id  =  BT_USERS_USERS_2.id )) AND TRUE   )
  ) 
  ON ( (( BT_FCT_CALLS_FCT_CALLS.on_network_called_ext_id  =  BT_DIRECTORY_NUMBERS_DIRECTORY_NUMBERS.id ) OR ( BT_FCT_CALLS_FCT_CALLS.on_network_calling_ext_id  =  BT_DIRECTORY_NUMBERS_DIRECTORY_NUMBERS.id )) AND TRUE )
 ) 
 ON ( (( BT_FCT_CALLS_FCT_CALLS.on_network_called_department_id  =  BT_DEPARTMENTS_DEPARTMENTS_3.id ) OR ( BT_FCT_CALLS_FCT_CALLS.on_network_calling_department_id  =  BT_DEPARTMENTS_DEPARTMENTS_3.id )) AND TRUE )

  WHERE 
    (
      (
          NOT( BT_DEPARTMENTS_DEPARTMENTS_3.id  IN ( 1 ) )
      )
  AND (
          NOT( BT_DIRECTORY_NUMBERS_DIRECTORY_NUMBERS.id  IN ( 1 ) )
      )

  AND (
          NOT( BT_USERS_USERS_2.id  IN ( 1 ) )
      )
    )
ORDER BY 
      COL0
4

3 回答 3

0

正如我在评论中所说的那样,查询是自动生成的,我不能修改太多,无论如何,在升级到 mysql 5.6.14 之后,查询本身没有任何改变,性能变得正常(比较到具有类似复杂性的其他查询)!

于 2013-11-17T14:41:32.907 回答
0

首先,让我们从写得很糟糕的查询流程开始。通过简化直接关系与所有嵌套可以帮助清除一些事情。其次,在树下显示事物相关的关系。你到处蹦蹦跳跳。为了便于阅读,我还缩短了“别名”。最后,您不需要所有表,因为您没有从所有表中获取值,这使许多事情变得无用。无论如何,这是您原始查询的清理版本。如您所见,更容易看到表 A 链接到 B 链接到 C 链接到 D 等。

SELECT DISTINCT 
      calls.start_time AS COL0,
      calls.calling_number AS COL1,
      calls.called_number AS COL2,
      calls.response AS COL3
   FROM 
      fct_calls calls 
         JOIN departments dpt3 
            ON calls.on_network_called_department_id = dpt3.id
            OR calls.on_network_calling_department_id = dpt3.id
         JOIN users u2 
            ON calls.on_network_called_user_id = u2.id
            OR calls.on_network_calling_user_id = u2.id
         JOIN directory_numbers dirNums 
            ON calls.on_network_called_ext_id = dirNums.id 
            OR calls.on_network_calling_ext_id = dirNums.id 
         JOIN call_classification_dim classDim 
            ON calls.call_direction_id = classDim.id 
            AND classDim.id = 2
         JOIN call_flags callFlags
            ON calls.id = callFlags.call_id
            JOIN flags f 
               ON callFlags.flag = f.id 
               AND f.id = 1
  WHERE
         dpt3.id <> 1
     AND dirNums.id <> 1
     AND  u2.id = 1
  ORDER BY 
     COL0

现在,这就是我要运行的。同样,甚至没有使用很多东西,例如呼叫标志和目录号码。通过呼叫或被叫位置从具有您感兴趣的用户 ID 的所有呼叫开始。我会在( on_network_calling_user_id, on_network_calling_user_id )上的调用表上有一个索引。我会假设其他表在它们各自的主“ID”列上有索引。

SELECT DISTINCT 
      calls.start_time AS COL0,
      calls.calling_number AS COL1,
      calls.called_number AS COL2,
      calls.response AS COL3
   FROM 
      fct_calls calls 
         JOIN departments dpt3 
            ON calls.on_network_called_department_id = dpt3.id
            OR calls.on_network_calling_department_id = dpt3.id
         JOIN directory_numbers dirNums 
            ON calls.on_network_called_ext_id = dirNums.id 
            OR calls.on_network_calling_ext_id = dirNums.id 
  WHERE
     (    calls.on_network_called_user_id = 1
       OR calls.on_network_calling_user_id = 1 )
     AND dpt3.id <> 1
     AND dirNums.id <> 1
  ORDER BY 
     COL0

更进一步,由于考虑调用或被调用,您可能会通过执行 UNION 获得更好的性能,您可以尝试以下操作。因此,您正在寻找任何呼叫,其中进行呼叫的人是用户 ID 1 并且被呼叫的部门不是 1 并且目录扩展名不是 1...或者,正在呼叫用户 ID 1,但忽略来自部门 1 和扩展 1。

我将在呼叫表上有两个索引,用于这个谁被呼叫,以及呼叫来自哪里(on_network_calling_user_id,on_network_calling_department_id,on_network_calling_ext_id)这反过来......谁在呼叫,他们呼叫的部门/分机是什么( on_network_calling_user_id、on_network_calling_department_id、on_network_calling_ext_id)

SELECT
      calls.start_time AS COL0,
      calls.calling_number AS COL1,
      calls.called_number AS COL2,
      calls.response AS COL3
   FROM 
      fct_calls calls 
         LEFT JOIN departments dpt3 
            OR calls.on_network_calling_department_id = dpt3.id
         LEFT JOIN directory_numbers dirNums 
            OR calls.on_network_calling_ext_id = dirNums.id 
   WHERE
          calls.on_network_called_user_id = 1
      AND calls.on_network_calling_department_id <> 1
      AND calls.on_network_calling_ext_id <> 1
   ORDER BY 
      COL0
UNION
SELECT
      calls.start_time AS COL0,
      calls.calling_number AS COL1,
      calls.called_number AS COL2,
      calls.response AS COL3
   FROM 
      fct_calls calls 
         LEFT JOIN departments dpt3 
            OR calls.on_network_called_department_id = dpt3.id
         LEFT JOIN directory_numbers dirNums 
            OR calls.on_network_called_ext_id = dirNums.id 
   WHERE
          calls.on_network_calling_user_id = 1
      AND calls.on_network_called_department_id <> 1
      AND calls.on_network_called_ext_id <> 1
于 2013-10-01T14:58:31.413 回答
0

首先将您的 WHERE 子句更改为:

 WHERE 
 BT_DEPARTMENTS_DEPARTMENTS_3.id  <> 1 AND 
 BT_DIRECTORY_NUMBERS_DIRECTORY_NUMBERS.id  <> 1 AND 
 BT_USERS_USERS_2.id <> 1

它比使用 NOT IN(..) 更快

去掉AND TRUE不需要的。

于 2013-10-01T13:05:25.797 回答