2012-12-08 79 views
8

我创建了2个可用于执行相同功能的查询。它们都包含我想合并成单个查询的属性,但我一直无法。使用子查询与左连接的MySQL索引优化

查询1 - 给我的结果我想要的。慢(〜0.700秒)

查询2 - 给我很多行,我忽略和跳过。快速(〜0.005秒)

我的目标是修改QUERY 2,为每个项目删除除1之外的所有空行价格行。我似乎无法完成这项工作,因为我们对表现不感兴趣。这是由于我缺乏对MySQL中索引使用的经验和理解。

QUERY 1

用途设计不良的子查询不允许跨tbl_sale(e)中使用索引包含10K行。

SELECT b.id, b.sv, b.description, der.store_id, f.name, der.price 
FROM tbl_watch AS a 
    LEFT JOIN tbl_item AS b ON a.item_id = b.id 
LEFT JOIN (
    SELECT c.store_id, d.flyer_id, e.item_id, e.price 
    FROM tbl_storewatch AS c, tbl_storeflyer AS d 
    FORCE INDEX (storebeg_ndx) , tbl_sale AS e 
    WHERE c.user_id = '$user_id' 
    AND (
     d.store_id = c.store_id 
     AND d.date_beg = '20121206' 
     ) 
    AND e.flyer_id = d.flyer_id 
     ) AS der ON a.item_id = der.item_id 
LEFT JOIN tbl_store as f ON der.store_id = f.id 
WHERE a.user_id = '$user_id' 
ORDER BY b.description ASC 

这里是解释查询1

id select_type table  type possible_keys key    key_len  ref  rows Extra 
1 PRIMARY  a   ref  user_item_ndx user_item_ndx 4   const 30 Using index; Using temporary; Using filesort 
1 PRIMARY  b   eq_ref PRIMARY   PRIMARY   4   a.item_id 1 
1 PRIMARY  <derived2> ALL  NULL   NULL   NULL  NULL 300  
1 PRIMARY  f   eq_ref PRIMARY   PRIMARY   4   der.store_id 1 
2 DERIVED  c   ref  user_ndx  user_ndx  4     6 
2 DERIVED  e   ALL  NULL   NULL NULL NULL    9473 Using join buffer 
2 DERIVED  d   eq_ref storebeg_ndx storebeg_ndx 8   c.store_id 1 Using where 

QUERY 2

全部使用左联接,这是非常有效的(与ORDER BY除外)。索引用于每个连接。该查询返回tbl_watch中每个项目的所有可能匹配项。下面是该查询:

SELECT b.id, b.sv, b.description, c.store_id, f.name, e.price 
FROM tbl_watch AS a 
LEFT JOIN tbl_item AS b ON a.item_id = b.id 
LEFT JOIN tbl_storewatch AS c ON c.user_id = '$user_id' 
LEFT JOIN tbl_storeflyer AS d ON d.store_id = c.store_id 
    AND d.date_beg = '$s_date' 
LEFT JOIN tbl_sale AS e ON e.item_id = a.item_id 
    AND e.flyer_id = d.flyer_id 
LEFT JOIN tbl_store as f ON d.store_id = f.id 
WHERE a.user_id = '$user_id' 
ORDER BY b.description ASC 

这里是解释查询:

id select_type  table type possible_keys   key    key_len  ref      rows Extra 
1 SIMPLE   a  ref  user_item_ndx   user_item_ndx 4   const     6  Using index; Using temporary; Using filesort 
1 SIMPLE   b  eq_ref PRIMARY     PRIMARY   4   a.item_id    1 
1 SIMPLE   c  ref  user_ndx    user_ndx  4   const     2 
1 SIMPLE   d  eq_ref storebeg_ndx,storendx storebeg_ndx 8   c.store_id,const  1 
1 SIMPLE   e  eq_ref itemflyer_ndx   itemflyer_ndx 8   a.item_id,d.flyer_id 1 
1 SIMPLE   f  eq_ref PRIMARY     PRIMARY   4   d.store_id    1 

如何修改问题2(更有效)给我正是我需要像在查询1对行与吗?

感谢 迈克

+0

我不太确定第一个查询可能会给你想要的结果。左连接不是左外连接(虽然也许它在MySQL中,它不符合SQL),并且空值不是唯一值。我没有方便的MySQL,但把它放到PostgreSQL中并没有给出你描述的结果。我的回答如下... – PlexQ

回答

0

你的子查询中查询1使用隐式内部联接,而查询2是使用所有连接左明确连接。因此,有没有where子句中查询2.排除数据我会采取左侧开出的一对夫妇行(标示),看看如何改善的事情:

SELECT b.id, b.sv, b.description, c.store_id, f.name, e.price 
FROM tbl_watch AS a 
LEFT JOIN tbl_item AS b ON a.item_id = b.id 
LEFT JOIN tbl_storewatch AS c ON c.user_id = '$user_id' 
-- Left removed below 
JOIN tbl_storeflyer AS d ON d.store_id = c.store_id 
    AND d.date_beg = '$s_date' 
-- Left removed below 
JOIN tbl_sale AS e ON e.item_id = a.item_id 
    AND e.flyer_id = d.flyer_id 
LEFT JOIN tbl_store as f ON d.store_id = f.id 
WHERE a.user_id = '$user_id' 
ORDER BY b.description ASC` 

您也可以考虑服用并把它们移出来并移动到WHERE:

SELECT b.id, b.sv, b.description, c.store_id, f.name, e.price 
FROM tbl_watch AS a 
LEFT JOIN tbl_item AS b ON a.item_id = b.id 
LEFT JOIN tbl_storewatch AS c ON c.user_id = '$user_id' 
JOIN tbl_storeflyer AS d ON d.store_id = c.store_id 
JOIN tbl_sale AS e ON e.item_id = a.item_id 
LEFT JOIN tbl_store as f ON d.store_id = f.id 
WHERE a.user_id = '$user_id' 
AND d.date_beg = '$s_date' 
AND e.flyer_id = d.flyer_id 
ORDER BY b.description ASC 

最后,日期数学相当密集。在查询2中,使用外部连接,可以避免很多,但可能需要它。我想尝试通过使用子查询得到的ID和限制:

SELECT b.id, b.sv, b.description, c.store_id, f.name, e.price 
FROM tbl_watch AS a 
LEFT JOIN tbl_item AS b ON a.item_id = b.id 
LEFT JOIN tbl_storewatch AS c ON c.user_id = '$user_id' 
JOIN tbl_storeflyer AS d ON d.store_id = c.store_id 
JOIN tbl_sale AS e ON e.item_id = a.item_id 
LEFT JOIN tbl_store as f ON d.store_id = f.id 
WHERE a.user_id = '$user_id' 
AND e.flyer_id = d.flyer_id 
AND d.id in (select d.id from d where date_beg = '$s_date') 
ORDER BY b.description ASC 
+0

谢谢你的回应!这些解决方案确实为所有具有活动销售项目的项目(e.item_id = a.item_id AND e.flyer_id = d.flyer_id)提供行,但我也试图将每个项目包含在(b)域中的tbl_watch(a)中,即使它们不存在于tbl_sale(e)中。所以我最终会得到:id,sv,description,NULL,NULL,NULL。我只想让每行有NULL的行。我不知道如何完成这一点。 – ridgeback

+0

只是为了澄清我期望每件物品都属于以下三种情况之一:1 - 具有单一价格的物品。 2 - 具有多个价格的项目。 3 - 没有价格的项目。如果发生3,我仍然希望返回一个包含项目ID,sv和描述的行。 – ridgeback

+0

不确定date_beg这里是一个实际的日期字段,看起来它被用作某种字符。我不确定日期匹配在MySQL中很慢,但我会感到惊讶。日期通常是在内部长时间存储的,唯一的花费是将该字符串转换为长整型,所以我不相信它会增加任何开销。 – PlexQ

1

我觉得这个查询会给你想要的东西:

select a.id, a.sv, a.description, c.id, c.name, b.price 
    from 
    tbl_item a left outer join tbl_sale b on (a.id=b.item_id) 
     left outer join tbl_storeflyer d on (b.flyer_id=d.flyer_id and d.date_beg = '20120801') 
     left outer join tbl_store c on (d.store_id = c.id) 
     left outer join tbl_storewatch x on (c.id = x.store_id) 
     left outer join tbl_watch y on (a.id = y.item_id); 

与参与NULL值,你有可能会有一些左连接。替代的方法是使用一个联盟,这与MySQL可能会更快:

select a.id, a.sv, a.description, c.id as store_id, c.name, b.price 
    from 
    tbl_item a, 
    tbl_sale b, 
    tbl_storeflyer d, 
    tbl_store c, 
    tbl_storewatch x, 
    tbl_watch y 
    where 
    a.id = b.item_id and 
    b.flyer_id = d.flyer_id and 
    d.store_id = c.id and 
    c.id = x.store_id and 
    a.id = y.item_id and 
    d.date_beg = '20120801' 
union 
select a.id, a.sv, a.description, null as store_id, null as name, null as price 
    from 
    tbl_item a 
    where 
    a.id not in (select b.item_id from tbl_sale b); 

你可能会和工会作为一个左外下半场打加入,而不是一个“不”子查询 - 取决于如何你的MySQL版本会优化。