[20180705] About hash join 2.txt

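--// Yesterday I tuned a SQL statement whose plan used HASH JOIN RIGHT ANTI SNA. Adding a NOT NULL constraint on column XX dropped the logical reads from over ten thousand to 50.
--// Hash join has many derived plan operations. It also supports a "right" variant for outer joins, and 11g added the Null-Aware / Single Null-Aware checks.
--// I found much of this confusing, so here is a short summary.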

--//NA =>  Null-Aware.
--//SNA => Single Null-Aware

1. Environment:
SCOTT@book> @ ver1
PORT_STRING                    VERSION        BANNER
------------------------------ -------------- --------------------------------------------------------------------------------
x86_64/Linux 2.4.xx            11.2.0.4.0     Oracle Database 11g Enterprise Edition Release 11.2.0.4.0 - 64bit Production

create table t1 as select level id   ,'t1'||to_char(level) name from dual connect by level<=4;
create table t2 as select level+1 id ,'t2'||to_char(level) name from dual connect by level<=4;
insert into t1 values (null,'t1null');
insert into t2 values (null,'t2null');
commit ;
--// Statistics gathering omitted.

2. Plans containing HASH JOIN:
SCOTT@test01p> select * from t1,t2 where t1.id=t2.id;
        ID NAME                         ID NAME
---------- -------------------- ---------- --------------------
         2 t12                           2 t21
         3 t13                           3 t22
         4 t14                           4 t23

Plan hash value: 1838229974
------------------------------------------------------------------------------------------------------------------------------------------------
| Id  | Operation          | Name | Starts | E-Rows |E-Bytes| Cost (%CPU)| E-Time   | A-Rows |   A-Time   | Buffers |  OMem |  1Mem | Used-Mem |
------------------------------------------------------------------------------------------------------------------------------------------------
|   0 | SELECT STATEMENT   |      |      1 |        |       |     8 (100)|          |      3 |00:00:00.01 |      15 |       |       |          |
|*  1 |  HASH JOIN         |      |      1 |      4 |    64 |     8   (0)| 00:00:01 |      3 |00:00:00.01 |      15 |  1888K|  1888K| 1063K (0)|
|*  2 |   TABLE ACCESS FULL| T1   |      1 |      4 |    32 |     4   (0)| 00:00:01 |      4 |00:00:00.01 |       7 |       |       |          |
|*  3 |   TABLE ACCESS FULL| T2   |      1 |      4 |    32 |     4   (0)| 00:00:01 |      4 |00:00:00.01 |       8 |       |       |          |
------------------------------------------------------------------------------------------------------------------------------------------------

--// To change the join order, add a hint:
SCOTT@book> select /*+ leading(t2 t1) */ * from t1,t2 where t1.id=t2.id;
        ID NAME                         ID NAME
---------- -------------------- ---------- --------------------
         2 t12                           2 t21
         3 t13                           3 t22
         4 t14                           4 t23

Plan hash value: 2959412835
------------------------------------------------------------------------------------------------------------------------------------------------
| Id  | Operation          | Name | Starts | E-Rows |E-Bytes| Cost (%CPU)| E-Time   | A-Rows |   A-Time   | Buffers |  OMem |  1Mem | Used-Mem |
------------------------------------------------------------------------------------------------------------------------------------------------
|   0 | SELECT STATEMENT   |      |      1 |        |       |     7 (100)|          |      3 |00:00:00.01 |       9 |       |       |          |
|*  1 |  HASH JOIN         |      |      1 |      4 |    56 |     7   (0)| 00:00:01 |      3 |00:00:00.01 |       9 |  1888K|  1888K|  830K (0)|
|   2 |   TABLE ACCESS FULL| T2   |      1 |      4 |    28 |     3   (0)| 00:00:01 |      4 |00:00:00.01 |       2 |       |       |          |
|*  3 |   TABLE ACCESS FULL| T1   |      1 |      4 |    28 |     4   (0)| 00:00:01 |      4 |00:00:00.01 |       7 |       |       |          |
------------------------------------------------------------------------------------------------------------------------------------------------

3. Plans containing HASH JOIN OUTER / HASH JOIN RIGHT OUTER:
SCOTT@book> select * from t1,t2 where t1.id=t2.id(+);
        ID NAME                         ID NAME
---------- -------------------- ---------- --------------------
         2 t12                           2 t21
         3 t13                           3 t22
         4 t14                           4 t23
           t1null
         1 t11

Plan hash value: 1823443478
------------------------------------------------------------------------------------------------------------------------------------------------
| Id  | Operation          | Name | Starts | E-Rows |E-Bytes| Cost (%CPU)| E-Time   | A-Rows |   A-Time   | Buffers |  OMem |  1Mem | Used-Mem |
------------------------------------------------------------------------------------------------------------------------------------------------
|   0 | SELECT STATEMENT   |      |      1 |        |       |     7 (100)|          |      5 |00:00:00.01 |       9 |       |       |          |
|*  1 |  HASH JOIN OUTER   |      |      1 |      5 |    84 |     7   (0)| 00:00:01 |      5 |00:00:00.01 |       9 |  1888K|  1888K|  935K (0)|
|   2 |   TABLE ACCESS FULL| T1   |      1 |      5 |    42 |     4   (0)| 00:00:01 |      5 |00:00:00.01 |       6 |       |       |          |
|   3 |   TABLE ACCESS FULL| T2   |      1 |      4 |    28 |     3   (0)| 00:00:01 |      4 |00:00:00.01 |       3 |       |       |          |
------------------------------------------------------------------------------------------------------------------------------------------------

--// Swapping the join inputs turns HASH JOIN OUTER into HASH JOIN RIGHT OUTER:
SCOTT@book> select /*+ SWAP_JOIN_INPUTS(t2) */ * from t1,t2 where t1.id=t2.id(+);
        ID NAME                         ID NAME
---------- -------------------- ---------- --------------------
         1 t11
         2 t12                           2 t21
         3 t13                           3 t22
         4 t14                           4 t23
           t1null

Plan hash value: 312430291
---------------------------------------------------------------------------------------------------------------------------------------------------
| Id  | Operation             | Name | Starts | E-Rows |E-Bytes| Cost (%CPU)| E-Time   | A-Rows |   A-Time   | Buffers |  OMem |  1Mem | Used-Mem |
---------------------------------------------------------------------------------------------------------------------------------------------------
|   0 | SELECT STATEMENT      |      |      1 |        |       |     8 (100)|          |      5 |00:00:00.01 |      15 |       |       |          |
|*  1 |  HASH JOIN RIGHT OUTER|      |      1 |      5 |    80 |     8   (0)| 00:00:01 |      5 |00:00:00.01 |      15 |  1888K|  1888K|  926K (0)|
|*  2 |   TABLE ACCESS FULL   | T2   |      1 |      4 |    32 |     4   (0)| 00:00:01 |      4 |00:00:00.01 |       7 |       |       |          |
|   3 |   TABLE ACCESS FULL   | T1   |      1 |      5 |    40 |     4   (0)| 00:00:01 |      5 |00:00:00.01 |       8 |       |       |          |
---------------------------------------------------------------------------------------------------------------------------------------------------

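--// Of nested loop, merge join and hash join, only hash join supports the "right" mode in an outer join. For the others, unless stated otherwise, an outer join is always the "left" form.
--// The nested loop algorithm cannot support the right mode at all.
--// Merge join could support it in theory, but both inputs have to be sorted anyway, so a right mode would gain nothing. (This is my own understanding and may be wrong.)
--// With hash join, if the right-side table builds the smaller hash table, it can perfectly well be used as the driving (build) table.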
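--// The two build orders can be illustrated with a small Python sketch. This is a simplified model for
--// illustration only, not Oracle's implementation: the function names are hypothetical, None stands
--// for NULL, and the rows are the T1/T2 test data from section 1.

```python
from collections import defaultdict

# Rows from the test tables in section 1: (id, name); None stands for NULL.
T1 = [(1, "t11"), (2, "t12"), (3, "t13"), (4, "t14"), (None, "t1null")]
T2 = [(2, "t21"), (3, "t22"), (4, "t23"), (5, "t24"), (None, "t2null")]

def build_hash(rows):
    """Hash rows on id; NULL keys are skipped because NULL never equals anything."""
    table = defaultdict(list)
    for row in rows:
        if row[0] is not None:
            table[row[0]].append(row)
    return table

def hash_join_outer(preserved, optional):
    """Build on the preserved table (T1), probe with the optional table (T2)."""
    table = build_hash(preserved)
    matched = set()  # build keys that found at least one partner
    out = []
    for row in optional:
        for p in table.get(row[0], []):
            out.append((p, row))
            matched.add(p[0])
    # Extra pass over the build side: emit preserved rows that never matched.
    out += [(p, None) for p in preserved if p[0] is None or p[0] not in matched]
    return out

def hash_join_right_outer(preserved, optional):
    """Build on the optional table (T2), probe with the preserved table (T1)."""
    table = build_hash(optional)
    out = []
    for p in preserved:
        partners = table.get(p[0], [])
        # Each probe row is emitted immediately, padded with None if unmatched.
        out += [(p, o) for o in partners] or [(p, None)]
    return out

a = hash_join_outer(T1, T2)
b = hash_join_right_outer(T1, T2)
print(sorted(a, key=repr) == sorted(b, key=repr))  # True
```

--// Both variants return the same five rows; they differ only in which input is hashed, which is why
--// the choice can be left to cost (the size of the hash table).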

4. Plans containing HASH JOIN ANTI / HASH JOIN RIGHT ANTI:
SCOTT@book> select * from t1 where id not in (select id from t2 where id is not null) and id is not null;
        ID NAME
---------- --------------------
         1 t11

--// Here both tables are restricted to id is not null.

Plan hash value: 2706079091
------------------------------------------------------------------------------------------------------------------------------------------------
| Id  | Operation          | Name | Starts | E-Rows |E-Bytes| Cost (%CPU)| E-Time   | A-Rows |   A-Time   | Buffers |  OMem |  1Mem | Used-Mem |
------------------------------------------------------------------------------------------------------------------------------------------------
|   0 | SELECT STATEMENT   |      |      1 |        |       |     8 (100)|          |      1 |00:00:00.01 |      12 |       |       |          |
|*  1 |  HASH JOIN ANTI    |      |      1 |      1 |    11 |     8   (0)| 00:00:01 |      1 |00:00:00.01 |      12 |  1888K|  1888K|  978K (0)|
|*  2 |   TABLE ACCESS FULL| T1   |      1 |      4 |    32 |     4   (0)| 00:00:01 |      4 |00:00:00.01 |       6 |       |       |          |
|*  3 |   TABLE ACCESS FULL| T2   |      1 |      4 |    12 |     4   (0)| 00:00:01 |      4 |00:00:00.01 |       6 |       |       |          |
------------------------------------------------------------------------------------------------------------------------------------------------

--// How can the join inputs be swapped?
SCOTT@book> select * from t1 where id not in (select /*+ SWAP_JOIN_INPUTS(t2) */ id from t2 where id is not null) and id is not null;
        ID NAME
---------- --------------------
         1 t11

Plan hash value: 629543484
--------------------------------------------------------------------------------------------------------------------------------------------------
| Id  | Operation            | Name | Starts | E-Rows |E-Bytes| Cost (%CPU)| E-Time   | A-Rows |   A-Time   | Buffers |  OMem |  1Mem | Used-Mem |
--------------------------------------------------------------------------------------------------------------------------------------------------
|   0 | SELECT STATEMENT     |      |      1 |        |       |     8 (100)|          |      1 |00:00:00.01 |      13 |       |       |          |
|*  1 |  HASH JOIN RIGHT ANTI|      |      1 |      1 |    11 |     8   (0)| 00:00:01 |      1 |00:00:00.01 |      13 |  2440K|  2440K|  855K (0)|
|*  2 |   TABLE ACCESS FULL  | T2   |      1 |      4 |    12 |     4   (0)| 00:00:01 |      4 |00:00:00.01 |       6 |       |       |          |
|*  3 |   TABLE ACCESS FULL  | T1   |      1 |      4 |    32 |     4   (0)| 00:00:01 |      4 |00:00:00.01 |       7 |       |       |          |
--------------------------------------------------------------------------------------------------------------------------------------------------

5. Plans containing HASH JOIN SEMI / HASH JOIN RIGHT SEMI:
SCOTT@book> select * from t1 where id  in (select id from t2 where id is not null) and id is not null;
        ID NAME
---------- --------------------
         2 t12
         3 t13
         4 t14

Plan hash value: 1713220790
------------------------------------------------------------------------------------------------------------------------------------------------
| Id  | Operation          | Name | Starts | E-Rows |E-Bytes| Cost (%CPU)| E-Time   | A-Rows |   A-Time   | Buffers |  OMem |  1Mem | Used-Mem |
------------------------------------------------------------------------------------------------------------------------------------------------
|   0 | SELECT STATEMENT   |      |      1 |        |       |     8 (100)|          |      3 |00:00:00.01 |      13 |       |       |          |
|*  1 |  HASH JOIN SEMI    |      |      1 |      3 |    33 |     8   (0)| 00:00:01 |      3 |00:00:00.01 |      13 |  1888K|  1888K|  935K (0)|
|*  2 |   TABLE ACCESS FULL| T1   |      1 |      4 |    32 |     4   (0)| 00:00:01 |      4 |00:00:00.01 |       6 |       |       |          |
|*  3 |   TABLE ACCESS FULL| T2   |      1 |      4 |    12 |     4   (0)| 00:00:01 |      4 |00:00:00.01 |       7 |       |       |          |
------------------------------------------------------------------------------------------------------------------------------------------------

--// And with the join inputs swapped?
SCOTT@book> select * from t1 where id  in (select /*+ SWAP_JOIN_INPUTS(t2) */ id from t2 where id is not null) and id is not null;
        ID NAME
---------- --------------------
         2 t12
         3 t13
         4 t14

Plan hash value: 1275841967
--------------------------------------------------------------------------------------------------------------------------------------------------
| Id  | Operation            | Name | Starts | E-Rows |E-Bytes| Cost (%CPU)| E-Time   | A-Rows |   A-Time   | Buffers |  OMem |  1Mem | Used-Mem |
--------------------------------------------------------------------------------------------------------------------------------------------------
|   0 | SELECT STATEMENT     |      |      1 |        |       |     8 (100)|          |      3 |00:00:00.01 |      13 |       |       |          |
|*  1 |  HASH JOIN RIGHT SEMI|      |      1 |      3 |    33 |     8   (0)| 00:00:01 |      3 |00:00:00.01 |      13 |  2440K|  2440K|  814K (0)|
|*  2 |   TABLE ACCESS FULL  | T2   |      1 |      4 |    12 |     4   (0)| 00:00:01 |      4 |00:00:00.01 |       6 |       |       |          |
|*  3 |   TABLE ACCESS FULL  | T1   |      1 |      4 |    32 |     4   (0)| 00:00:01 |      4 |00:00:00.01 |       7 |       |       |          |
--------------------------------------------------------------------------------------------------------------------------------------------------

5. Plans containing HASH JOIN FULL OUTER:
SCOTT@book> select * from t1 full outer join t2  on  t1.id=t2.id;
        ID NAME                         ID NAME
---------- -------------------- ---------- --------------------
         2 t12                           2 t21
         3 t13                           3 t22
         4 t14                           4 t23
                                         5 t24
                                           t2null
           t1null
         1 t11

7 rows selected.

Plan hash value: 53297166
-------------------------------------------------------------------------------------------------------------------------------------------------------
| Id  | Operation             | Name     | Starts | E-Rows |E-Bytes| Cost (%CPU)| E-Time   | A-Rows |   A-Time   | Buffers |  OMem |  1Mem | Used-Mem |
-------------------------------------------------------------------------------------------------------------------------------------------------------
|   0 | SELECT STATEMENT      |          |      1 |        |       |     8 (100)|          |      7 |00:00:00.01 |      13 |       |       |          |
|   1 |  VIEW                 | VW_FOJ_0 |      1 |      5 |   360 |     8   (0)| 00:00:01 |      7 |00:00:00.01 |      13 |       |       |          |
|*  2 |   HASH JOIN FULL OUTER|          |      1 |      5 |    80 |     8   (0)| 00:00:01 |      7 |00:00:00.01 |      13 |  1888K|  1888K|  936K (0)|
|   3 |    TABLE ACCESS FULL  | T1       |      1 |      5 |    40 |     4   (0)| 00:00:01 |      5 |00:00:00.01 |       6 |       |       |          |
|   4 |    TABLE ACCESS FULL  | T2       |      1 |      5 |    40 |     4   (0)| 00:00:01 |      5 |00:00:00.01 |       7 |       |       |          |
-------------------------------------------------------------------------------------------------------------------------------------------------------
--// Swapped join order omitted.

6. Plans containing HASH JOIN ANTI NA / HASH JOIN RIGHT ANTI NA:
SCOTT@book> select * from t1 where id not in (select id from t2 ) ;
no rows selected

Plan hash value: 1275484728
------------------------------------------------------------------------------------------------------------------------------------------------
| Id  | Operation          | Name | Starts | E-Rows |E-Bytes| Cost (%CPU)| E-Time   | A-Rows |   A-Time   | Buffers |  OMem |  1Mem | Used-Mem |
------------------------------------------------------------------------------------------------------------------------------------------------
|   0 | SELECT STATEMENT   |      |      1 |        |       |     8 (100)|          |      0 |00:00:00.01 |      10 |       |       |          |
|*  1 |  HASH JOIN ANTI NA |      |      1 |      2 |    22 |     8   (0)| 00:00:01 |      0 |00:00:00.01 |      10 |  1888K|  1888K|  936K (0)|
|   2 |   TABLE ACCESS FULL| T1   |      1 |      5 |    40 |     4   (0)| 00:00:01 |      5 |00:00:00.01 |       6 |       |       |          |
|   3 |   TABLE ACCESS FULL| T2   |      1 |      5 |    15 |     4   (0)| 00:00:01 |      5 |00:00:00.01 |       4 |       |       |          |
------------------------------------------------------------------------------------------------------------------------------------------------

--// NA means Null-Aware. At first I assumed the id on both sides had to be checked for NULL. But run the following:
SCOTT@book> select * from t1 where id not in (select id from t2 ) and id is not null;
no rows selected

Plan hash value: 1275484728
------------------------------------------------------------------------------------------------------------------------------------------------
| Id  | Operation          | Name | Starts | E-Rows |E-Bytes| Cost (%CPU)| E-Time   | A-Rows |   A-Time   | Buffers |  OMem |  1Mem | Used-Mem |
------------------------------------------------------------------------------------------------------------------------------------------------
|   0 | SELECT STATEMENT   |      |      1 |        |       |     8 (100)|          |      0 |00:00:00.01 |      10 |       |       |          |
|*  1 |  HASH JOIN ANTI NA |      |      1 |      1 |    11 |     8   (0)| 00:00:01 |      0 |00:00:00.01 |      10 |  1888K|  1888K|  936K (0)|
|*  2 |   TABLE ACCESS FULL| T1   |      1 |      4 |    32 |     4   (0)| 00:00:01 |      4 |00:00:00.01 |       6 |       |       |          |
|   3 |   TABLE ACCESS FULL| T2   |      1 |      5 |    15 |     4   (0)| 00:00:01 |      5 |00:00:00.01 |       4 |       |       |          |
------------------------------------------------------------------------------------------------------------------------------------------------

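--// This suggests that NA only checks whether the join column of the right-side table is NULL.
--// This kind of query is also very easy to misread: many developers, myself included, long assumed it would return at least the id=1 row.
--// In fact the NULL id in T2 is exactly what empties the result. Even written as below, the result is the same:
select * from t1 where id not in (select id from t2 ) and id is not null;
--// Also note that NA/SNA show up in the plan only for not in; not exists (see section 10) gives a plain HASH JOIN ANTI.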
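--// Why one NULL in T2 empties the result can be traced with a small three-valued-logic sketch in Python.
--// It is only a model of the standard SQL rules: None stands for NULL/UNKNOWN and the helper names are
--// hypothetical.

```python
# None stands for SQL NULL / UNKNOWN.
def sql_eq(a, b):
    """a = b under SQL rules: UNKNOWN if either side is NULL."""
    if a is None or b is None:
        return None
    return a == b

def sql_not(v):
    """NOT keeps UNKNOWN as UNKNOWN."""
    return None if v is None else not v

def sql_and(values):
    """Three-valued AND: FALSE wins, then UNKNOWN, otherwise TRUE."""
    values = list(values)
    if any(v is False for v in values):
        return False
    if any(v is None for v in values):
        return None
    return True

def sql_not_in(x, candidates):
    """x NOT IN (c1, c2, ...) is evaluated as x <> c1 AND x <> c2 AND ..."""
    return sql_and(sql_not(sql_eq(x, c)) for c in candidates)

t1_ids = [1, 2, 3, 4, None]
t2_ids = [2, 3, 4, 5, None]
t2_ids_not_null = [c for c in t2_ids if c is not None]

# A WHERE clause keeps a row only when the predicate is TRUE.
rows_with_null = [x for x in t1_ids if sql_not_in(x, t2_ids) is True]
rows_without_null = [x for x in t1_ids if sql_not_in(x, t2_ids_not_null) is True]
print(rows_with_null)     # []
print(rows_without_null)  # [1]
```

--// For id=1 the comparison with the NULL row is UNKNOWN, so the whole AND chain is UNKNOWN instead of
--// TRUE and the row is dropped.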

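--// How to swap the inputs:
select * from t1 where id not in (select /*+ SWAP_JOIN_INPUTS(t2) */ id from t2 ) ;
--// Written like this the hint does not work. It has to be written as below (note: I took the names from the plan outline); this is my guess at how the hint should be placed: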
SCOTT@book> select /*+ SWAP_JOIN_INPUTS(@"SEL$5DA710D3" "T2"@"SEL$2") */ * from t1 where id not in (select id from t2 ) ;
no rows selected

Plan hash value: 2739594415
-----------------------------------------------------------------------------------------------------------------------------------------------------
| Id  | Operation               | Name | Starts | E-Rows |E-Bytes| Cost (%CPU)| E-Time   | A-Rows |   A-Time   | Buffers |  OMem |  1Mem | Used-Mem |
-----------------------------------------------------------------------------------------------------------------------------------------------------
|   0 | SELECT STATEMENT        |      |      1 |        |       |     8 (100)|          |      0 |00:00:00.01 |       4 |       |       |          |
|*  1 |  HASH JOIN RIGHT ANTI NA|      |      1 |      2 |    22 |     8   (0)| 00:00:01 |      0 |00:00:00.01 |       4 |  1817K|  1817K|  841K (0)|
|   2 |   TABLE ACCESS FULL     | T2   |      1 |      5 |    15 |     4   (0)| 00:00:01 |      5 |00:00:00.01 |       4 |       |       |          |
|   3 |   TABLE ACCESS FULL     | T1   |      0 |      5 |    40 |     4   (0)| 00:00:01 |      0 |00:00:00.01 |       0 |       |       |          |
-----------------------------------------------------------------------------------------------------------------------------------------------------
--// Note a detail here: T1 was not actually scanned at all; its logical reads are 0 and Starts=0.
--// Aside: an additional test, adding some rows to T2:
insert into t2  select rownum+4 ,'t2'||to_char(rownum+4) from dual connect by level<=10000;
commit;

SCOTT@book> set autot traceonly
SCOTT@book> select count(*) from t2;
Execution Plan
----------------------------------------------------------
Plan hash value: 3321871023
-------------------------------------------------------------------
| Id  | Operation          | Name | Rows  | Cost (%CPU)| Time     |
-------------------------------------------------------------------
|   0 | SELECT STATEMENT   |      |     1 |    10   (0)| 00:00:01 |
|   1 |  SORT AGGREGATE    |      |     1 |            |          |
|   2 |   TABLE ACCESS FULL| T2   | 10005 |    10   (0)| 00:00:01 |
-------------------------------------------------------------------
Statistics
----------------------------------------------------------
          0  recursive calls
          0  db block gets
         30  consistent gets
          0  physical reads
          0  redo size
        528  bytes sent via SQL*Net to client
        520  bytes received via SQL*Net from client
          2  SQL*Net roundtrips to/from client
          0  sorts (memory)
          0  sorts (disk)
          1  rows processed
SCOTT@book> set autot off
--// A full table scan of T2 takes 30 logical reads.

SCOTT@book> select * from t1 where id not in (select id from t2 ) and id is not null;
no rows selected

Plan hash value: 1275484728
------------------------------------------------------------------------------------------------------------------------------------------------
| Id  | Operation          | Name | Starts | E-Rows |E-Bytes| Cost (%CPU)| E-Time   | A-Rows |   A-Time   | Buffers |  OMem |  1Mem | Used-Mem |
------------------------------------------------------------------------------------------------------------------------------------------------
|   0 | SELECT STATEMENT   |      |      1 |        |       |    14 (100)|          |      0 |00:00:00.01 |      11 |       |       |          |
|*  1 |  HASH JOIN ANTI NA |      |      1 |      1 |    12 |    14   (0)| 00:00:01 |      0 |00:00:00.01 |      11 |  1888K|  1888K| 1008K (0)|
|*  2 |   TABLE ACCESS FULL| T1   |      1 |      4 |    32 |     4   (0)| 00:00:01 |      4 |00:00:00.01 |       6 |       |       |          |
|   3 |   TABLE ACCESS FULL| T2   |      1 |  10005 | 40020 |    10   (0)| 00:00:01 |    459 |00:00:00.01 |       5 |       |       |          |
------------------------------------------------------------------------------------------------------------------------------------------------
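--// Note something odd: only 459 rows were actually read from T2 and Buffers=5, not 30. A row with id is null had already been
--// met, so the result set is bound to be empty and the scan of T2 stopped.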

SCOTT@book> column PARTITION_NAME noprint
SCOTT@book>  select * from dba_extents where segment_name='T2';
OWNER  SEGMENT_NAME         SEGMENT_TYPE       TABLESPACE_NAME EXTENT_ID    FILE_ID   BLOCK_ID      BYTES     BLOCKS RELATIVE_FNO
------ -------------------- ------------------ --------------- --------- ---------- ---------- ---------- ---------- ------------
SCOTT  T2                   TABLE              USERS                   0          4        552      65536          8            4
SCOTT  T2                   TABLE              USERS                   1          4        560      65536          8            4
SCOTT  T2                   TABLE              USERS                   2          4        672      65536          8            4
SCOTT  T2                   TABLE              USERS                   3          4        680      65536          8            4

SCOTT@book> select count(*) from t2 where DBMS_ROWID.ROWID_BLOCK_NUMBER(rowid) between 555 and 556;
  COUNT(*)
----------
       458
--// 458, one less than the 459 in the plan statistics. My inference is that only data blocks 555, 556 and 557 were read.
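--// The early stop seen in this test can be modelled with a small Python sketch. It is a simplified guess
--// at the behaviour, not Oracle's actual implementation: hash_join_anti_na and the probe list are
--// hypothetical, None stands for NULL, and the NULL is placed as the 459th probe row only to mirror the
--// A-Rows figure above.

```python
def hash_join_anti_na(build_rows, probe_ids):
    """Model of `build.id NOT IN (probe ids)`; returns (result, probe_rows_read)."""
    remaining = {}       # id -> build rows that are still candidates
    null_key_rows = []   # build rows whose id is NULL
    for row in build_rows:
        if row[0] is None:
            null_key_rows.append(row)
        else:
            remaining.setdefault(row[0], []).append(row)

    read = 0
    for pid in probe_ids:
        read += 1
        if pid is None:
            # One NULL in the subquery makes NOT IN non-TRUE for every row,
            # so the rest of the probe input does not need to be read.
            return [], read
        remaining.pop(pid, None)  # a match disqualifies those build rows

    result = [r for rows in remaining.values() for r in rows]
    if read == 0:
        # Empty subquery: NOT IN is TRUE even for a NULL id.
        result += null_key_rows
    return result, read

T1 = [(1, "t11"), (2, "t12"), (3, "t13"), (4, "t14"), (None, "t1null")]
# 10005 probe ids; the NULL position is an assumption made for illustration.
probe = list(range(2, 460)) + [None] + list(range(460, 10006))

rows_na, read_na = hash_join_anti_na(T1, probe)
rows_nn, read_nn = hash_join_anti_na(T1, [p for p in probe if p is not None])
print(rows_na, read_na)  # [] 459
print(rows_nn, read_nn)  # [(1, 't11')] 10004
```

--// With the NULL present the probe stops after 459 rows; with it filtered out every probe row is read
--// and only id=1 survives.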

7. Plans containing HASH JOIN ANTI SNA / HASH JOIN RIGHT ANTI SNA:

SCOTT@book> select * from t1 where id not in (select id from t2 where id is not null) ;
        ID NAME
---------- --------------------
         1 t11

Plan hash value: 1270581391
------------------------------------------------------------------------------------------------------------------------------------------------
| Id  | Operation          | Name | Starts | E-Rows |E-Bytes| Cost (%CPU)| E-Time   | A-Rows |   A-Time   | Buffers |  OMem |  1Mem | Used-Mem |
------------------------------------------------------------------------------------------------------------------------------------------------
|   0 | SELECT STATEMENT   |      |      1 |        |       |    14 (100)|          |      1 |00:00:00.01 |      36 |       |       |          |
|*  1 |  HASH JOIN ANTI SNA|      |      1 |      2 |    24 |    14   (0)| 00:00:01 |      1 |00:00:00.01 |      36 |  1888K|  1888K| 1008K (0)|
|   2 |   TABLE ACCESS FULL| T1   |      1 |      5 |    40 |     4   (0)| 00:00:01 |      5 |00:00:00.01 |       6 |       |       |          |
|*  3 |   TABLE ACCESS FULL| T2   |      1 |  10004 | 40016 |    10   (0)| 00:00:01 |  10004 |00:00:00.01 |      30 |       |       |          |
------------------------------------------------------------------------------------------------------------------------------------------------

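--// Note: with the T2 subquery restricted to id is not null, the query only has to check whether T1's id contains NULL. That is SNA.
--// SNA means Single Null-Aware: only the join column of the left-side table is checked for NULL.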

--//EXECUTE SYS.DBMS_STATS.SET_TABLE_STATS (OWNNAME=>'SCOTT', TABNAME=>'T1', NUMBLKS=> 800000);
--//EXECUTE SYS.DBMS_STATS.SET_TABLE_STATS (OWNNAME=>'SCOTT', TABNAME=>'T1', NUMROWS=> 8000);

--// Swapped order:
SCOTT@book> select /*+  SWAP_JOIN_INPUTS(@"SEL$5DA710D3" "T2"@"SEL$2") */ * from t1 where id not in (select id from t2 where id is not null) ;
        ID NAME
---------- --------------------
         1 t11

Plan hash value: 1521920066
------------------------------------------------------------------------------------------------------------------------------------------------------
| Id  | Operation                | Name | Starts | E-Rows |E-Bytes| Cost (%CPU)| E-Time   | A-Rows |   A-Time   | Buffers |  OMem |  1Mem | Used-Mem |
------------------------------------------------------------------------------------------------------------------------------------------------------
|   0 | SELECT STATEMENT         |      |      1 |        |       |    14 (100)|          |      1 |00:00:00.01 |      37 |       |       |          |
|*  1 |  HASH JOIN RIGHT ANTI SNA|      |      1 |      2 |    24 |    14   (0)| 00:00:01 |      1 |00:00:00.01 |      37 |  2440K|  2440K| 1900K (0)|
|*  2 |   TABLE ACCESS FULL      | T2   |      1 |  10004 | 40016 |    10   (0)| 00:00:01 |  10004 |00:00:00.01 |      30 |       |       |          |
|   3 |   TABLE ACCESS FULL      | T1   |      1 |      5 |    40 |     4   (0)| 00:00:01 |      5 |00:00:00.01 |       7 |       |       |          |
------------------------------------------------------------------------------------------------------------------------------------------------------

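Summary:
1. NA only checks whether the join column of the right-side table is NULL.
2. SNA only checks whether the join column of the left-side table is NULL.
3. Be careful with NULL tests: only NULL is NULL is true and NULL is not NULL is false; anything else, such as NULL=NULL, evaluates to unknown.
4. Only hash join supports the right join mode.
5. NA/SNA show up in the plan only for not in; not exists gives a plain HASH JOIN ANTI.
6. My recommendation: use exists/not exists instead of in/not in. In most cases that matches what the business logic and the query actually need.
7. Add NOT NULL to columns where appropriate to avoid some full table scans: entries whose key columns are all NULL are not stored in ordinary (B-tree) Oracle indexes, so the index cannot be used.
8. These tests show that hash join has many derived operations: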
HASH JOIN,HASH JOIN OUTER/HASH JOIN RIGHT OUTER,
HASH JOIN ANTI/HASH JOIN RIGHT ANTI,
HASH JOIN SEMI/HASH JOIN RIGHT SEMI,
HASH JOIN FULL OUTER,
HASH JOIN ANTI NA/HASH JOIN RIGHT ANTI NA,
HASH JOIN ANTI SNA/HASH JOIN RIGHT ANTI SNA.
9. All of the above is guesswork based on testing; I do not know whether it is correct.

10. Additional tests using exists/not exists (12c):
SCOTT@test01p> @ ver1
PORT_STRING                    VERSION        BANNER                                                                               CON_ID
------------------------------ -------------- -------------------------------------------------------------------------------- ----------
IBMPC/WIN_NT64-9.1.0           12.1.0.1.0     Oracle Database 12c Enterprise Edition Release 12.1.0.1.0 - 64bit Production              0

SCOTT@test01p> select * from t1 where not exists (select id from t2 where t2.id=t1.id) ;
        ID NAME
---------- --------------------
           t1null
         1 t11

Plan hash value: 2706079091
------------------------------------------------------------------------------------------------------------------------------------------------
| Id  | Operation          | Name | Starts | E-Rows |E-Bytes| Cost (%CPU)| E-Time   | A-Rows |   A-Time   | Buffers |  OMem |  1Mem | Used-Mem |
------------------------------------------------------------------------------------------------------------------------------------------------
|   0 | SELECT STATEMENT   |      |      1 |        |       |    14 (100)|          |      2 |00:00:00.02 |      38 |       |       |          |
|*  1 |  HASH JOIN ANTI    |      |      1 |      2 |    24 |    14   (0)| 00:00:01 |      2 |00:00:00.02 |      38 |  1888K|  1888K| 1040K (0)|
|   2 |   TABLE ACCESS FULL| T1   |      1 |      5 |    40 |     4   (0)| 00:00:01 |      5 |00:00:00.01 |       7 |       |       |          |
|   3 |   TABLE ACCESS FULL| T2   |      1 |  10005 | 40020 |    10   (0)| 00:00:01 |  10005 |00:00:00.01 |      31 |       |       |          |
------------------------------------------------------------------------------------------------------------------------------------------------
--// Note the condition is t2.id=t1.id, so rows where T2.id is null are excluded. Note that the result differs from not in:
SCOTT@test01p> select  * from t1 where id not in (select id from t2 where id is not null) ;
        ID NAME
---------- --------------------
         1 t11
--// To get the same result with not exists:
SCOTT@test01p> select * from t1 where  not exists (select id from t2 where t2.id=t1.id) and id is not null;
        ID NAME
---------- --------------------
         1 t11
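
--// The four not in / not exists variants can be compared side by side outside Oracle. The sketch below
--// uses Python's sqlite3 module with the original five-row T1/T2 data. This is an assumption-laden
--// stand-in: SQLite follows the same standard NULL semantics, but it says nothing about Oracle's plans.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
    create table t1 (id integer, name text);
    create table t2 (id integer, name text);
    insert into t1 values (1,'t11'),(2,'t12'),(3,'t13'),(4,'t14'),(null,'t1null');
    insert into t2 values (2,'t21'),(3,'t22'),(4,'t23'),(5,'t24'),(null,'t2null');
""")

def names(sql):
    """Run a query and return the name column as a sorted list."""
    return sorted(row[0] for row in conn.execute(sql))

not_in = names(
    "select name from t1 where id not in (select id from t2)")
not_in_nn = names(
    "select name from t1 where id not in (select id from t2 where id is not null)")
not_exists = names(
    "select name from t1 where not exists (select 1 from t2 where t2.id = t1.id)")
not_exists_nn = names(
    "select name from t1 where not exists (select 1 from t2 where t2.id = t1.id) "
    "and id is not null")

print(not_in)         # []
print(not_in_nn)      # ['t11']
print(not_exists)     # ['t11', 't1null']
print(not_exists_nn)  # ['t11']
```

--// not exists keeps the t1null row because the correlated equality is simply never satisfied for it,
--// whereas not in drops it because the comparison is unknown.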

--//exists
SCOTT@test01p> select * from t1 where  exists (select id from t2 where t2.id=t1.id) ;
        ID NAME
---------- --------------------
         2 t12
         3 t13
         4 t14

Plan hash value: 1713220790
------------------------------------------------------------------------------------------------------------------------------------------------
| Id  | Operation          | Name | Starts | E-Rows |E-Bytes| Cost (%CPU)| E-Time   | A-Rows |   A-Time   | Buffers |  OMem |  1Mem | Used-Mem |
------------------------------------------------------------------------------------------------------------------------------------------------
|   0 | SELECT STATEMENT   |      |      1 |        |       |    14 (100)|          |      3 |00:00:00.01 |      39 |       |       |          |
|*  1 |  HASH JOIN SEMI    |      |      1 |      4 |    48 |    14   (0)| 00:00:01 |      3 |00:00:00.01 |      39 |  1888K|  1888K| 1040K (0)|
|*  2 |   TABLE ACCESS FULL| T1   |      1 |      4 |    32 |     4   (0)| 00:00:01 |      4 |00:00:00.01 |       7 |       |       |          |
|   3 |   TABLE ACCESS FULL| T2   |      1 |  10005 | 40020 |    10   (0)| 00:00:01 |  10005 |00:00:00.01 |      32 |       |       |          |
------------------------------------------------------------------------------------------------------------------------------------------------
