PLSQL_Performance Tuning Series 17_Update Efficiency of Oracle Merge Into vs. Update

2015-05-21 Created By BaoXinjian

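I. Summary

I used to treat merge into as a convenience for a few particular situations. Only today did I discover how much faster it can be than update when modifying data.

The update branch of merge into does nothing different from a plain update. What differs is the execution plan Oracle chooses once merge into is used.

In the test below, merge was both the most concise and the fastest approach, so prefer it when updating large volumes of data.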

1. Basic syntax

merge into test1 using test2
on (test1.id = test2.id)
when matched then update
set test1.name = nvl2(test1.name,test2.name,test1.name);

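Update via an inline view: this approach requires a primary key on test2.id. (This is easy to understand: each test1.id must correspond to exactly one record in test2. If test2 held several matching records, which one should be used to update test1?)

Alternatively, compare several columns, e.g. on (test1.id = test2.id and test1.name = test2.name ...), so that the combination identifies a unique record, similar to a unique index.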
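To make the inline-view form concrete, here is a minimal sketch. It assumes the simplified test1(id, name) and test2(id, name) tables from the syntax example above, with no NULLs in test2.id; the constraint name test2_pk and the column aliases are invented for illustration. Without a primary key or unique constraint on test2.id, Oracle rejects this kind of update with ORA-01779, and a merge whose source returns several rows for one target row typically fails with ORA-30926, so the first query checks for duplicates.

```sql
-- Check that each id occurs at most once in the source table.
-- Any row returned here would make the update below ambiguous.
select id, count(*)
  from test2
 group by id
having count(*) > 1;

-- Declare the uniqueness so that test1 is key-preserved in the join.
-- (test2_pk is a hypothetical constraint name.)
alter table test2 add constraint test2_pk primary key (id);

-- Update through an inline view: each test1 row joins to at most one test2 row.
update (select t1.name as old_name,
               t2.name as new_name
          from test1 t1, test2 t2
         where t1.id = t2.id)
   set old_name = new_name;
```

If the duplicate check returns rows, the source has to be deduplicated (or the join condition widened) before either the inline-view update or the merge can be relied on.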

2. Using parallelism to speed up large updates:

merge /*+parallel(test1,4)*/ into test1 using test2
on (test1.id = test2.id)
when matched then update
set test1.name = nvl2(test1.name,test2.name,test1.name);
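One caveat worth sketching: a parallel hint on its own usually parallelizes only the read side of the statement. For the update itself to run in parallel, parallel DML has to be enabled in the session first. The sketch below reuses the simplified test1/test2 tables and the degree of 4 from the example above; whether parallelism actually helps depends on the system and is not measured in this article.

```sql
-- Parallel DML is disabled by default in a session; enable it explicitly.
alter session enable parallel dml;

merge /*+ parallel(test1, 4) */ into test1
using test2
on (test1.id = test2.id)
when matched then update
  set test1.name = nvl2(test1.name, test2.name, test1.name);

-- A table modified by parallel DML cannot be read again in the same
-- transaction (ORA-12838), so end the transaction before querying it.
commit;

alter session disable parallel dml;
```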

II. Test Case - Update / Merge Into

1. Create the test data

create table test1 as select * from dba_objects where rownum<=10000; -- 10000 rows

create table test2 as select * from dba_objects; -- 73056 rows
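The execution plans later in this article carry the note "dynamic sampling used", which suggests the two tables had no optimizer statistics when the test ran. As an optional step, statistics could be gathered right after creating the tables. This is a sketch and not part of the original test, so estimates and timings may differ from the ones reported here.

```sql
-- Optional: gather optimizer statistics for both test tables
-- (run as the owner of the tables).
begin
  dbms_stats.gather_table_stats(ownname => user, tabname => 'TEST1');
  dbms_stats.gather_table_stats(ownname => user, tabname => 'TEST2');
end;
/

-- Confirm the row counts that the rest of the test relies on.
select count(*) from test1;
select count(*) from test2;
```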

2. Time and efficiency of a direct Update

SQL> alter system flush shared_pool;

System altered.

SQL> alter system flush buffer_cache;

System altered.

SQL> set linesize 400 pagesize 400
SQL> set autot trace
SQL> set timing on
SQL> update test1 t1
  2     set t1.object_name = (select t2.object_name
  3                             from test2 t2
  4                            where t2.object_id = t1.object_id);

10000 rows updated.

Elapsed: 00:06:33.35

Execution Plan
----------------------------------------------------------
   0      UPDATE STATEMENT Optimizer=ALL_ROWS (Cost=2923252 Card=10011 Bytes=790869)
   1    0   UPDATE OF 'TEST1'
   2    1     TABLE ACCESS (FULL) OF 'TEST1' (TABLE) (Cost=40 Card=10011 Bytes=790869)
   3    1     TABLE ACCESS (FULL) OF 'TEST2' (TABLE) (Cost=292 Card=772 Bytes=60988)

Statistics
----------------------------------------------------------
        430  recursive calls
      11122  db block gets
   15275257  consistent gets
       1175  physical reads
    4058752  redo size
        520  bytes sent via SQL*Net to client
        668  bytes received via SQL*Net from client
          3  SQL*Net roundtrips to/from client
          7  sorts (memory)
          0  sorts (disk)
      10000  rows processed

3. Time and efficiency with Merge Into

SQL> alter system flush shared_pool;

System altered.

Elapsed: 00:00:00.45
SQL> alter system flush buffer_cache;

System altered.

Elapsed: 00:00:00.71
SQL> merge into test1 t1
  2  using test2 t2
  3  on (t1.object_id = t2.object_id)
  4  when matched then
  5    update set t1.object_name = t2.object_name;

10000 rows merged.

Elapsed: 00:00:00.92

Execution Plan
----------------------------------------------------------
   0      MERGE STATEMENT Optimizer=ALL_ROWS (Cost=1243 Card=10011 Bytes=1321452)
   1    0   MERGE OF 'TEST1'
   2    1     VIEW
   3    2       HASH JOIN (Cost=1243 Card=10011 Bytes=4264686)
   4    3         TABLE ACCESS (FULL) OF 'TEST1' (TABLE) (Cost=40 Card=10011 Bytes=2192409)
   5    3         TABLE ACCESS (FULL) OF 'TEST2' (TABLE) (Cost=292 Card=77163 Bytes=15972741)

Statistics
----------------------------------------------------------
       1224  recursive calls
      10279  db block gets
       1586  consistent gets
       1191  physical reads
    2803872  redo size
        526  bytes sent via SQL*Net to client
        634  bytes received via SQL*Net from client
          3  SQL*Net roundtrips to/from client
         12  sorts (memory)
          0  sorts (disk)
      10000  rows processed

III. Execution Plans

1. Execution plan of the Update

SQL> set autot off
SQL> update /*+gather_plan_statistics*/ test1 t1
  2     set t1.object_name = (select t2.object_name
  3                             from test2 t2
  4                            where t2.object_id = t1.object_id);

10000 rows updated.

Elapsed: 00:04:32.81
SQL> select * from table(dbms_xplan.display_cursor(null,null,'iostats'));

PLAN_TABLE_OUTPUT
--------------------------------------------------------------------------------------------
SQL_ID  c8qt9a54qgmqg, child number 0
-------------------------------------
update /*+gather_plan_statistics*/ test1 t1    set t1.object_name =
(select t2.object_name                            from test2 t2
                  where t2.object_id = t1.object_id)

Plan hash value: 3883393169

--------------------------------------------------------------------------------------
| Id  | Operation          | Name  | Starts | E-Rows | A-Rows |   A-Time   | Buffers |
--------------------------------------------------------------------------------------
|   0 | UPDATE STATEMENT   |       |      1 |        |      0 |00:04:32.73 |      10M|
|   1 |  UPDATE            | TEST1 |      1 |        |      0 |00:04:32.73 |      10M|
|   2 |   TABLE ACCESS FULL| TEST1 |      1 |  10011 |  10000 |00:00:00.17 |     133 |
|*  3 |   TABLE ACCESS FULL| TEST2 |  10000 |    772 |  10000 |00:04:31.51 |      10M|
--------------------------------------------------------------------------------------

Predicate Information (identified by operation id):
---------------------------------------------------

   3 - filter("T2"."OBJECT_ID"=:B1)

Note
-----
   - dynamic sampling used for this statement (level=2)

26 rows selected.

Elapsed: 00:00:01.38

2. Execution plan of the Merge Into

SQL> merge /*+gather_plan_statistics*/
  2  into test1 t1
  3  using test2 t2
  4  on (t1.object_id = t2.object_id)
  5  when matched then
  6    update set t1.object_name = t2.object_name;

10000 rows merged.

Elapsed: 00:00:00.52
SQL> select * from table(dbms_xplan.display_cursor(null,null,'iostats'));

PLAN_TABLE_OUTPUT
-------------------------------------------------------------------------------------------
SQL_ID  9n4tc6tvwaj9c, child number 0
-------------------------------------
merge /*+gather_plan_statistics*/ into test1 t1 using test2 t2 on
(t1.object_id = t2.object_id) when matched then   update set
t1.object_name = t2.object_name

Plan hash value: 818823782

----------------------------------------------------------------------------------------
| Id  | Operation            | Name  | Starts | E-Rows | A-Rows |   A-Time   | Buffers |
----------------------------------------------------------------------------------------
|   0 | MERGE STATEMENT      |       |      1 |        |      0 |00:00:00.47 |   11458 |
|   1 |  MERGE               | TEST1 |      1 |        |      0 |00:00:00.47 |   11458 |
|   2 |   VIEW               |       |      1 |        |  10000 |00:00:00.33 |    1179 |
|*  3 |    HASH JOIN         |       |      1 |  10011 |  10000 |00:00:00.25 |    1179 |
|   4 |     TABLE ACCESS FULL| TEST1 |      1 |  10011 |  10000 |00:00:00.08 |     133 |
|   5 |     TABLE ACCESS FULL| TEST2 |      1 |  77163 |  73056 |00:00:00.26 |    1046 |
----------------------------------------------------------------------------------------

Predicate Information (identified by operation id):
---------------------------------------------------

   3 - access("T1"."OBJECT_ID"="T2"."OBJECT_ID")

Note
-----
   - dynamic sampling used for this statement (level=2)

28 rows selected.

Elapsed: 00:00:00.15
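For reference, the row-source statistics above can also be collected without adding a hint to each statement. The sketch below is one possible variation, not the method used in this test: it raises statistics_level for the session and uses the 'ALLSTATS LAST' format, which reports the Starts, E-Rows, A-Rows and Buffers columns for the most recent execution. It assumes a SQL*Plus session with serveroutput switched off, since otherwise display_cursor(null, null) may pick up the wrong cursor.

```sql
set serveroutput off

-- Collect row-source statistics for every statement in this session,
-- instead of adding the gather_plan_statistics hint to each one.
alter session set statistics_level = all;

merge into test1 t1
using test2 t2
on (t1.object_id = t2.object_id)
when matched then
  update set t1.object_name = t2.object_name;

-- null, null = the last statement executed by this session.
select * from table(dbms_xplan.display_cursor(null, null, 'ALLSTATS LAST'));

-- Restore the default afterwards.
alter session set statistics_level = typical;
```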

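IV. Analysis of Results

1. Comparison of the test results: update and merge into both updated 10,000 rows.

update took about 6.5 minutes (00:06:33.35) and 15,275,257 logical reads (consistent gets);

merge into took under one second (00:00:00.92) and 1,586 logical reads, roughly 9,600 times fewer. The difference is enormous.

2. Looking at the execution plans, this result is easy to understand:

update works in a nested-loop-like fashion: for every row being updated, the table in the subquery is scanned once (Starts = 10000 on TEST2 in the update plan);

merge into chose a hash join here, so each table gets exactly one full table scan.

3. Oracle is also said to recommend using Merge Into in place of Update when updating large volumes of data.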
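The correlated update is slow in this test largely because test2 has no index on object_id, so every execution of the subquery is a full scan. A possible variation, not measured in this article, is to index the lookup column and rerun the update. The index name test2_objid_idx is invented for illustration. The where exists clause is an added safeguard: without it, rows of test1 that have no match in test2 would have object_name set to NULL, whereas merge leaves unmatched rows untouched.

```sql
-- Hypothetical index name; lets each subquery execution use an index
-- lookup instead of a full scan of test2.
create index test2_objid_idx on test2 (object_id);

-- The where exists clause restricts the update to rows that have a match,
-- mirroring the "when matched" branch of merge.
update test1 t1
   set t1.object_name = (select t2.object_name
                           from test2 t2
                          where t2.object_id = t1.object_id)
 where exists (select 1
                 from test2 t2
                where t2.object_id = t1.object_id);
```

Comparing the plan of this statement with the merge plan above would show whether the optimizer switches from repeated full scans to index access. No timing is claimed here.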

Thanks and Regards

Reference: http://blog.csdn.net/xiexbb/article/details/4242063
