MySQL 如何执行关联查询

本文同时发表在https://github.com/zhangyachen/zhangyachen.github.io/issues/51

当前mysql执行的策略很简单：mysql对任何关联都执行嵌套循环操作，即mysql先在一个表中循环取出单条数据，然后再嵌套循环到下一个表中寻打匹配的行，依次下去，直到描述到所表表中匹配的行为止。然后根据各个表匹配的行，返回查询中需要的各个列。mysql会尝试在最后一个关联表中打到所有匹配的行，如果最后一个关联表无法找到更多的行以后，mysql返回到上一层次关联表，看是否能够找到更多的匹配记录，依此类推迭代执行。

按照这样的方式查找第一条表记录，再嵌套查询下一个关联表，然后回溯到上一个表，在mysql中是通过嵌套循环的方式来实现的--正如其名‘嵌套循环关联’。请看下面的例子中的简单查询：

select tbl1.col1,tbl2.col2

from tbl1 inner join tbl2 using(col3)

where tbl1.col1 in(5,6);

假设mysql按照查询中的表顺序进行关联操作，我们则可以用下面的伪代码表示mysql将如何完成这个查询：

outer_iter = iterator_over tbl1 where col1 in(3,4)

outer_row = outer_iter.next

    while outer_row

    inner_iter = iterator over tbl2 where col3=outer_row.col3

    inner_row = inner_iter.next

    while inner_row

　    output[outer_row.col1,inner_row.col2]

        inner_row = inner_iter.next

     end

     out_row = outer_iter.next

end

上面的执行计划对于单表查询和多表关联查询都适用，如果是一个单表查询，那么只需要完成上面的外层的基本操作。对于外连接和上面的执行过程任然适用。例如我们将上面的查询修改如下：

SELECT tbl1.col1 ,tbl2.col2 FROM tbl1 left outer join tbl2 using (col3) WHERE tbl1.col1 in (3,4)

对应的伪代码：

outer_iter = iterator over tbl1 where col1 in(3,4)

outer row = outer_iter.next

while outer_row

    inner_iter = iterator over tbl2 where col3 = outer_row.col3

    inner_row = inner_iter.next

    if inner row   -> 手动加粗

        while inner_row

            out_put [outer_row.col1,inner_row.col2]

            inner_row = inner_iter.next

        end

    else         -> 手动加粗

        out_put[outer_row.col1,NULL]         -> 手动加粗

    end

    outer_row = outer_iter.next

end

从上面两个例子也可以看出，对于主表来说，是先进行主表的where条件筛选，再进行表联接，而不是先进行整表联接再进行where条件的筛选。

举个例子：

数据表结构：

mysql> create table a(

    -> id int unsigned not null primary key

    -> );

mysql> create table b like a;

表中数据：

mysql> select * from a;

+----+

| id |

+----+

|  1 |

|  2 |

|  3 |

|  4 |

|  5 |

+----+

5 rows in set (0.00 sec)

mysql> select * from b;

+----+

| id |

+----+

|  4 |

|  5 |

|  6 |

|  7 |

+----+

4 rows in set (0.00 sec)

explain查询：

mysql> explain select a.id as aid,b.id as bid from a left join b using(id) where a.id>3;

+----+-------------+-------+--------+---------------+---------+---------+----------+------+--------------------------+

| id | select_type | table | type   | possible_keys | key     | key_len | ref      | rows | Extra                    |

+----+-------------+-------+--------+---------------+---------+---------+----------+------+--------------------------+

|  1 | SIMPLE      | a     | range  | PRIMARY       | PRIMARY | 4       | NULL     |    3 | Using where; Using index |

|  1 | SIMPLE      | b     | eq_ref | PRIMARY       | PRIMARY | 4       | com.a.id |    1 | Using index              |

+----+-------------+-------+--------+---------------+---------+---------+----------+------+--------------------------+

2 rows in set (0.00 sec)

可以看出，首先在a表上进行范围查询，筛选出a.id>3的数据，然后在进行"嵌套查询"。

注意，on后面的筛选条件主要是针对的是关联表，而对于主表筛选并不适用，比如：

mysql>  select a.id as aid,b.id as bid from a left join b on a.id=b.id and a.id>3;

+-----+------+

| aid | bid  |

+-----+------+

|   1 | NULL |

|   2 | NULL |

|   3 | NULL |

|   4 |    4 |

|   5 |    5 |

+-----+------+

5 rows in set (0.00 sec)

mysql> explain select a.id as aid,b.id as bid from a left join b on a.id=b.id and a.id>3;

+----+-------------+-------+--------+---------------+---------+---------+----------+------+-------------+

| id | select_type | table | type   | possible_keys | key     | key_len | ref      | rows | Extra       |

+----+-------------+-------+--------+---------------+---------+---------+----------+------+-------------+

|  1 | SIMPLE      | a     | index  | NULL          | PRIMARY | 4       | NULL     |    5 | Using index |

|  1 | SIMPLE      | b     | eq_ref | PRIMARY       | PRIMARY | 4       | com.a.id |    1 | Using index |

+----+-------------+-------+--------+---------------+---------+---------+----------+------+-------------+

2 rows in set (0.00 sec)

我们发现a表的id<=3的数据并未被筛选走，explain的结果是a表进行了index类型的查询，即主键索引的全部扫描。

如果在on的筛选条件是针对b表的呢，情况会怎么样？

下面的例子数据表结构和数据变了，我们只关注查询的结果区别：

mysql> select * from a left join b on a.data=b.data and b.id<=20;

+----+------+------+------+

| id | data | id   | data |

+----+------+------+------+

|  1 |    1 | NULL | NULL |

|  2 |    2 | NULL | NULL |

|  3 |    3 | NULL | NULL |

|  4 |    4 |    1 |    4 |

|  4 |    4 |    6 |    4 |

|  4 |    4 |   11 |    4 |

|  4 |    4 |   16 |    4 |

|  5 |    5 |    2 |    5 |

|  5 |    5 |    7 |    5 |

|  5 |    5 |   12 |    5 |

|  5 |    5 |   17 |    5 |

+----+------+------+------+

11 rows in set (0.00 sec)

mysql> select * from a left join b on a.data=b.data where b.id<=20;

+----+------+------+------+

| id | data | id   | data |

+----+------+------+------+

|  4 |    4 |    1 |    4 |

|  5 |    5 |    2 |    5 |

|  4 |    4 |    6 |    4 |

|  5 |    5 |    7 |    5 |

|  4 |    4 |   11 |    4 |

|  5 |    5 |   12 |    5 |

|  4 |    4 |   16 |    4 |

|  5 |    5 |   17 |    5 |

+----+------+------+------+

8 rows in set (0.00 sec)

由此，我们可以根据伪码来分析两者的区别：

outer_iter = iterator over a

outer row = outer_iter.next

while outer_row

    inner_iter = iterator over b where data = outer_row.date where id<=20

    inner_row = inner_iter.next

    if inner row

        while inner_row

            out_put [outer_row,inner_row]

            inner_row = inner_iter.next

        end

    else

        out_put[outer_row,NULL]

    end

    outer_row = outer_iter.next

end

outer_iter = iterator over a

outer row = outer_iter.next

while outer_row

    inner_iter = iterator over b where data = outer_row.date  ->手动加粗

    inner_row = inner_iter.next

    if inner row

        while inner_row

            out_put [outer_row,inner_row]

            inner_row = inner_iter.next

        end

    else

        out_put[outer_row,NULL]

    end

    outer_row = outer_iter.next

end

left join的结果集中 where b.id<=20  ->手动加粗

参考资料：《高性能MySQL》

MySQL 如何执行关联查询的更多相关文章

MySQL如何执行关联查询
MySQL中‘关联(join)’ 一词包含的意义比一般意义上理解的要更广泛.总的来说,MySQL认为任何一个查询都是一次‘关联’ --并不仅仅是一个查询需要到两个表的匹配才叫关联,索引在MySQL中, ...
mysql如何执行关联查询与优化
mysql如何执行关联查询与优化一.前言在数据库中执行查询(select)在我们工作中是非常常见的,工作中离不开CRUD,在执行查询(select)时,多表关联也非常常见,我们用的也比较多,那么m ...
Mysql多表表关联查询 inner Join left join right join
Mysql多表表关联查询 inner Join left join right join
JDBC MySQL 多表关联查询查询
public static void main(String[] args) throws Exception{ Class.forName("com.mysql.jdbc.Driver&q ...
MySQL多表关联查询与存储过程
-- **************关联查询(多表查询)**************** -- 需求:查询员工及其所在部门(显示员工姓名,部门名称) -- 1.1 交叉连接查询(不推荐.产生笛卡尔乘积 ...
mysql 无法执行select查询
场景:mysql无法执行select命令查询,对于已存在的数据库,除了mysql.information_schema数据库,其它诸如nova.keystone.cinder等数据库都有此现象. 日志 ...
MySQL 三种关联查询的方式: ON vs USING vs 传统风格
看看下面三个关联查询的 SQL 语句有何区别? 1SELECT * FROM film JOIN film_actor ON (film.film_id = film_actor.film_id) 2 ...
MySQL多表关联查询数量
//多表关联查询数量select user, t1.count1, t2.count2from user tleft join ( select user_id, count(sport_type) ...
[MySQL]多表关联查询技巧
示例表A: author_id author_name 1 Kimmy 2 Abel 3 Bill 4 Berton 示例表B: book_id author_id start_date end_da ...

随机推荐

unity插件开发
1.简单的svn集成: 查询svn的文档可以知道svn提供各种命令符操作.因此,原理非常简单,利用命令符操作调用svn即可.代码也非常简单: 更新:Process.Start("Tortoi ...
通过ELK快速搭建一个你可能需要的集中化日志平台
在项目初期的时候,大家都是赶着上线,一般来说对日志没有过多的考虑,当然日志量也不大,所以用log4net就够了,随着应用的越来越多,日志散落在各个服务器的logs文件夹下,确实有点不大方便,这个时候 ...
MySQL错误：2003-Can't connect to MySQL server on 'localhost'(10061 "unknown error")
今天数据库出了一点错误之后决定重装一下,结果卡在了一个问题上,连装了5遍,加上网上各种配置教程都没能结局,错误如下图所示: 最后忽然想到会不会是因为每一次卸载的时候没有彻底卸载干净,然后就彻彻底底卸载 ...
jQuery与别的js框架冲突
jQuery.noConflict()运行这个函数将变量$的控制权让渡给第一个实现它的那个库. 这有助于确保jQuery不会与其他库的$对象发生冲突. <script type="te ...
如何使用python将MySQL中的查询结果导出为Excel----xlwt的使用
如何在MySQL中执行的一条查询语句结果导出为Excel? 一.可选方法 1.使用sql yog等远程登录,执行查询语句并导出结果集为Excel 适用于较简单的查询结果集的导出如果需要多个SQL语句 ...
2016普及组t3海港
好的,说说这道题的思路,爆搜队列嘛: 用一个结构体队列存每个人来的时间和他的国籍,用一个vis数组存每个人来的次数,是第一次来sum便加一. 然后从前面第一个人开始扔(原谅我用这个词,因为我找不到更好 ...
在没有DOM操作的日子里，我是怎么熬过来的（终结篇）
前言在我写终结篇的日子里,Vue版本稳定在2.9.1.当我摸清Vue的脉络之后,以一个爬坑无数的亲历者的身份,谈谈我在MVVM时代里遇到的那些事儿. 接下来,正文从这开始~ 好多童鞋学习Vue都有灯 ...
POJ 1273 Drainage Ditches 网络流 FF
Drainage Ditches Time Limit: 1000MS Memory Limit: 10000K Total Submissions: 74480 Accepted: 2895 ...
手动安装Nginx
本分类下有一个环境一键安装.那这背后发生了什么呢?咱们手动使用源码进行安装.1.首先保证有一个能联网的centos.2.百度 ningx 官网点download http://nginx.or ...
bitcms内容管理系统 3.1版源码发布
开源bitcms内容管理系统采用ASP.NET MVC5+MySql的组合开发,更适应中小型系统低成本运行. bitcms的主要功能 1.重写了APS.NET MVC的路由机制.bitcms使用路由参 ...

MySQL 如何执行关联查询

MySQL 如何执行关联查询的更多相关文章

随机推荐

热门专题