oracle group by placement可能导致错误结果的bug
Last week I’ve mentioned on Twitter that we ran into wrong result bug. We found workaround quickly but I’ve decided to spend some time to reproduce error and write blog post to warn you about this optimizer behavior.
Special thanks to my colleague who spotted odd results which led us to this finding.
My test (virtual) environment is:
OS: Oracle Enterprise Linux 5.8
DB: Oracle EE 11.1.0.7.12
In test I will use three tables:
|
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
|
CONTName Null Type ------- ---- ------------- CUST_ID NUMBER(38) CODE VARCHAR2(100) CUSTName Null Type ------- -------- ---------- CUST_ID NOT NULL NUMBER(38) DRAGName Null Type ------- ---- --------- DRAG_ID NUMBER(6) |
To gather fresh statistics for the tables:
|
1
2
3
4
5
6
|
begin dbms_stats.gather_table_stats(ownname=>user,tabname=>'CONT',estimate_percent=>100, cascade=>TRUE); dbms_stats.gather_table_stats(ownname=>user,tabname=>'CUST',estimate_percent=>100, cascade=>TRUE); dbms_stats.gather_table_stats(ownname=>user,tabname=>'DRAG',estimate_percent=>100, cascade=>TRUE);end;/ |
More details about tables:
|
1
2
3
4
5
6
7
8
9
10
|
select table_name, num_rows, blocks, partitioned, last_analyzedfrom dba_tableswhere table_name in ('CONT','CUST','DRAG');TABLE_NAME NUM_ROWS BLOCKS PARTITIONED LAST_ANALYZED ------------ ---------- ---------- ----------- -------------------CONT 1181949 2892 NO 04.02.2014 14:49:24 DRAG 314 5 NO 04.02.2014 14:49:25 CUST 576233 902 NO 04.02.2014 14:49:25 |
Information about indexes:
|
1
2
3
4
5
6
7
8
9
|
select index_name, table_name, uniqueness, distinct_keys, clustering_factorfrom dba_indexeswhere table_name in ('CONT','CUST','DRAG');INDEX_NAME TABLE_NAME UNIQUENESS DISTINCT_KEYS CLUSTERING_FACTOR-------------- ------------ ---------- ------------- -----------------I_CUST_ID CONT NONUNIQUE 468738 753983 PK_CUST_ID CUST UNIQUE 576233 878 |
We have three small and simple tables with just two indexes. CUST table has primary key on “cust_id” column.
After this little introduction it is time for some tests.
I will flush buffer cache and shared pool before every query execution.
|
1
2
3
4
5
|
SQL> alter system flush shared_pool;System altered.SQL> alter system flush buffer_cache;System altered. |
First query execution and execution plan:
|
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
|
select /*+ gather_plan_statistics */ count(co.code) as cntfrom drag t, cust cus, cont cowhere 1=1and t.drag_id = cus.cust_idand cus.cust_id = co.cust_id(+) group by t.drag_id; CNT--------------- 2 2 2 2 1 2 2 2 2...303 rows |
|
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
|
SQL> select * from table(dbms_xplan.display_cursor(null,null,'ALLSTATS LAST'));PLAN_TABLE_OUTPUT----------------------------------------------------------------------------------------------------------------------------SQL_ID gpnrgy2vawafq, child number 0-------------------------------------select /*+ gather_plan_statistics */ count(co.code) as cnt fromdrag t, cust cus, cont co where 1=1 and t.drag_id =cus.cust_id and cus.cust_id = co.cust_id(+) group by t.drag_idPlan hash value: 3989628059---------------------------------------------------------------------------------------------------------------------------| Id | Operation | Name | Starts | E-Rows | A-Rows | A-Time | Buffers | Reads | OMem | 1Mem | Used-Mem |---------------------------------------------------------------------------------------------------------------------------| 0 | SELECT STATEMENT | | 1 | | 303 |00:00:00.62 | 3734 | 3724 | | | || 1 | HASH GROUP BY | | 1 | 303 | 303 |00:00:00.62 | 3734 | 3724 | 1096K| 1096K| 1264K (0)||* 2 | HASH JOIN OUTER | | 1 | 792 | 1084 |00:00:00.16 | 3734 | 3724 | 1206K| 1206K| 1244K (0)||* 3 | HASH JOIN | | 1 | 314 | 314 |00:00:00.04 | 890 | 885 | 1452K| 1452K| 1470K (0)|| 4 | TABLE ACCESS FULL| DRAG | 1 | 314 | 314 |00:00:00.01 | 7 | 6 | | | || 5 | TABLE ACCESS FULL| CUST | 1 | 576K| 576K|00:00:00.02 | 883 | 879 | | | || 6 | TABLE ACCESS FULL | CONT | 1 | 1181K| 1181K|00:00:00.01 | 2844 | 2839 | | | |---------------------------------------------------------------------------------------------------------------------------Predicate Information (identified by operation id):--------------------------------------------------- 2 - access("CUS"."CUST_ID"="CO"."CUST_ID") 3 - access("T"."DRAG_ID"="CUS"."CUST_ID") |
Check result of the query - this is correct query result.
Now to simulate what we experienced in production.
|
1
2
3
4
|
SQL> alter system flush shared_pool;System altered.SQL> alter system flush buffer_cache;System altered. |
With hint I want to force PK_CUST_ID index usage because this was preferred plan in production.
|
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
|
select /*+ gather_plan_statistics index(cus PK_CUST_ID) */ count(co.code) as cntfrom drag t, cust cus, cont cowhere 1=1and t.drag_id = cus.cust_idand cus.cust_id = co.cust_id(+) group by t.drag_id; CNT--------------- 0 0 0 0 0 0 0 0 0 0... 303 rows |
|
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
|
SQL> select * from table(dbms_xplan.display_cursor(null,null,'ALLSTATS LAST'));PLAN_TABLE_OUTPUT------------------------------------------------------------------------------------------------------------------------------------SQL_ID 9vf9uf7mhdmdz, child number 0-------------------------------------select /*+ gather_plan_statistics index(cus PK_CUST_ID) */count(co.code) as cnt from drag t, cust cus, cont co where1=1 and t.drag_id = cus.cust_id and cus.cust_id = co.cust_id(+) groupby t.drag_idPlan hash value: 3263881209-----------------------------------------------------------------------------------------------------------------------------------| Id | Operation | Name | Starts | E-Rows | A-Rows | A-Time | Buffers | Reads | OMem | 1Mem | Used-Mem |-----------------------------------------------------------------------------------------------------------------------------------| 0 | SELECT STATEMENT | | 1 | | 303 |00:00:00.70 | 3459 | 3094 | | | || 1 | HASH GROUP BY | | 1 | 303 | 303 |00:00:00.70 | 3459 | 3094 | 934K| 934K| 1267K (0)||* 2 | HASH JOIN OUTER | | 1 | 764 | 1046 |00:00:00.22 | 3459 | 3094 | 1134K| 1134K| 1198K (0)|| 3 | NESTED LOOPS | | 1 | 303 | 303 |00:00:02.02 | 615 | 255 | | | || 4 | VIEW | VW_GBC_9 | 1 | 303 | 303 |00:00:00.01 | 7 | 6 | | | || 5 | HASH GROUP BY | | 1 | 303 | 303 |00:00:00.01 | 7 | 6 | 1012K| 1012K| 1249K (0)|| 6 | TABLE ACCESS FULL| DRAG | 1 | 314 | 314 |00:00:00.01 | 7 | 6 | | | ||* 7 | INDEX UNIQUE SCAN | PK_CUST_ID | 303 | 1 | 303 |00:00:00.22 | 608 | 249 | | | || 8 | TABLE ACCESS FULL | CONT | 1 | 1181K| 1181K|00:00:00.01 | 2844 | 2839 | | | |-----------------------------------------------------------------------------------------------------------------------------------Predicate Information (identified by operation id):--------------------------------------------------- 2 - access("CUS"."CUST_ID"="CO"."CUST_ID") 7 - access("ITEM_1"="CUS"."CUST_ID") |
Check result of the query!
Count is displaying all 0 values because it received only NULLs to count.
Other functions like max and min are also affected by this error.
Check steps 4,5 and 6 in execution plan.
Instead of quick full scan on DRAG table Oracle transformed query and created inline view using smart group-by optimization.
In 10053 trace I could easily find what Oracle was doing.
|
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
|
SELECT /*+ INDEX ("CUS" "PK_CUST_ID") */ SUM("VW_GBC_9"."ITEM_2") "CNT"FROM (SELECT "T"."DRAG_ID" "ITEM_1", COUNT("CO"."CODE") "ITEM_2", "T"."DRAG_ID" "ITEM_3" FROM "ADMIN"."DRAG" "T" WHERE 1=1 GROUP BY "T"."DRAG_ID", "T"."DRAG_ID" ) "VW_GBC_9", "ADMIN"."CUST" "CUS", "ADMIN"."CONT" "CO"WHERE "VW_GBC_9"."ITEM_1"="CUS"."CUST_ID"AND "CUS"."CUST_ID" ="CO"."CUST_ID"(+)GROUP BY "VW_GBC_9"."ITEM_3"; |
Quick workaround to fix this bug:
- Set "_optimizer_group_by_placement"=FALSE.
You could check in 10053 trace value of this parameter.
In my case: _optimizer_group_by_placement = true
|
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
|
SQL> alter session set "_optimizer_group_by_placement"=FALSE;Session altered.select /*+ gather_plan_statistics index(cus PK_CUST_ID) */ count(co.code) as cntfrom drag t, cust cus, cont cowhere 1=1and t.drag_id = cus.cust_idand cus.cust_id = co.cust_id(+) group by t.drag_id; CNT---------- 2 2 2 2 1 2 2 2 2...303 rows |
|
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
|
SQL> select * from table(dbms_xplan.display_cursor(null,null,'ALLSTATS LAST'));PLAN_TABLE_OUTPUT----------------------------------------------------------------------------------------------------------------------------SQL_ID a91bzhvupzquh, child number 0-------------------------------------select /*+ gather_plan_statistics index(cus PK_CUST_ID)*/count(co.code) as cnt from drag t, cust cus, cont co where1=1 and t.drag_id = cus.cust_id and cus.cust_id = co.cust_id(+) groupby t.drag_idPlan hash value: 2460166079------------------------------------------------------------------------------------------------------------------------| Id | Operation | Name | Starts | E-Rows | A-Rows | A-Time | Buffers | OMem | 1Mem | Used-Mem |------------------------------------------------------------------------------------------------------------------------| 0 | SELECT STATEMENT | | 1 | | 303 |00:00:00.16 | 3481 | | | || 1 | HASH GROUP BY | | 1 | 303 | 303 |00:00:00.16 | 3481 | 1096K| 1096K| 1232K (0)||* 2 | HASH JOIN OUTER | | 1 | 792 | 1084 |00:00:00.01 | 3481 | 1206K| 1206K| 1529K (0)|| 3 | NESTED LOOPS | | 1 | 314 | 314 |00:00:00.01 | 637 | | | || 4 | TABLE ACCESS FULL| DRAG | 1 | 314 | 314 |00:00:00.01 | 7 | | | ||* 5 | INDEX UNIQUE SCAN| PK_CUST_ID | 314 | 1 | 314 |00:00:00.01 | 630 | | | || 6 | TABLE ACCESS FULL | CONT | 1 | 1181K| 1181K|00:00:00.01 | 2844 | | | |------------------------------------------------------------------------------------------------------------------------Predicate Information (identified by operation id):--------------------------------------------------- 2 - access("CUS"."CUST_ID"="CO"."CUST_ID") 5 - access("T"."DRAG_ID"="CUS"."CUST_ID")27 rows selected. |
Oracle Support note associated with "_optimizer_group_by_placement" parameter.
Note.8945586.8 Ext/Pub Bug 8945586 - Wrong results using GROUP BY placement:
Description
Wrong results can occur when using GROUP BY placement where the aggregate column gets pruned from select list.
I’ve even found that “_optimizer_group_by_placement” parameter was mentioned in "Oracle® Fusion Middleware Oracle WebCenter Analytics Installation and Upgrade Guide".
Oracle 11g (11.1.0.6 and above) in default or Oracle Real Application Clusters (RAC) configuration
When running Oracle 11g versions prior to 11.1.0.7.0 the Oracle system parameter _optimizer_group_by_placement must be set to false. This can either be set in the init.ora file of the respective database instances or by by issuing an ALTER SYSTEM command as follows:
SQLPLUS /nolog
CONNECT / AS SYSDBA
ALTER SYSTEM SET "_optimizer_group_by_placement"=false
group by的优化bug还是挺多的,还有比如_optimizer_aggr_groupby_elim,但是不出bug的情况下,性能提升还是非常明显的,大家一定要仔细检查结果,不要只看性能。
oracle group by placement可能导致错误结果的bug的更多相关文章
- oracle已知会导致错误结果的bug列表(Bug Issues Known to cause Wrong Results)
LAST UPDATE: 1 Dec 15, 2016 APPLIES TO: 1 2 3 4 Oracle Database - Enterprise Edition - Versi ...
- django继承修改 User表导致的问题 fields.E304(permissions/group都会有这样的错误)
问题: django继承修改 User表时,进行migrations操作时会导致的问题 fields.E304(permissions/group都会有这样的错误)如图: 根源: django文档中有 ...
- 登陆Oracle,报oracle initializationg or shutdown in progress 错误提示
前两天,登陆Oracle,发现登陆不上去了,报”oracle initializationg or shutdown in progress 错误提示” 错误. 然后就想着怎么去解决,首先自己到win ...
- oracle:数据库版本问题导致的bug
公司开发出来的系统,由于各现场oracle数据库版本有10.2.0.4.11.2.0.1.11.2.0.3.11.2.0.4: 进而会导致版本不一导致错误问题.下面列举2个: 1.wm_concat ...
- Oracle ORA-01033: ORACLE initialization or shutdown in progress 错误解决办法
Oracle ORA-01033: ORACLE initialization or shutdown in progress 错误解决办法 登陆数据库时提示 “ORA-01033”错误在命令窗口以s ...
- oracle group by中cube和rollup字句的使用方法及区别
oracle group by中rollup和cube的区别: Oracle的GROUP BY语句除了最基本的语法外,还支持ROLLUP和CUBE语句. 如果是ROLLUP(A, B, C)的话,先 ...
- oracle所在磁盘空间不足导致了数据库异常
oracle所在磁盘空间不足导致了数据库异常.需要减小数据文件的大小来解决. 1.检查数据文件的名称和编号 select file#,name from v$datafile; 2.看哪个数据文件所占 ...
- MVC4 路由参数带点 文件名后缀导致错误
错误描述 最近在研究office在线预览,用到mvc4 apicontroller 需要传参是文件名,如test.docx导致错误"指定的目录或文件在 Web 服务器上不存在", ...
- Oracle问题之ORA-12560TNS:协议适配器错误
Oracle问题之ORA-12560TNS:协议适配器错误 一.造成ORA-12560: TNS: 协议适配器错误的问题的原因有三个: 1.监听服务没有起起来.windows平台个一如下操作:开始-- ...
随机推荐
- [原]Django-issue(1)---postgresql数据库连接密码错误
环境: Django==1.9.13 psycopg2==2.7.5 Python 3.6.5 postgresql 1.18.1 配置django的时候出现问题 检查setting,问题点:由于安装 ...
- 解决 nginx 出现 413 Request Entity Too Large 的问题
1.若nginx用所用的 php 请求解析服务是 fpm, 则检查 /etc/php5/fpm/php.ini 文件中的参数 upload_max_filesize = 20M post_max_si ...
- share drive 无效
docker设置的share dirve怎么按都无效 试了几遍都不行,想想刚才电脑系统更新了,然后查了下百度,发现是电脑策略的问题,设置成经典的就可以了
- 3D Slicer Reconstruct CT/MRI
3D Slicer Reconstruct CT/MRI 1. Load DCM file of your CT/MRI 2. Go to Volume Rendering, click the ey ...
- 自己配置 vue 项目 知识体系(自己写脚手架 类似 vue-cli )
简单的目录结构: |-index.html |-main.js 入口文件 |-App.vue vue文件,官方推荐命名法 |-package.json 工程文件(项目依赖.名称.配置) npm ini ...
- Oracle考试题作业
新建一张学员信息表(student),要求:1. 字段如下:学号(sid),姓名(name),性别(sex),年龄(age),地址(address).2. 分别为字段添加约束:学号为主键,姓名为非空, ...
- Python学习之旅(二十三)
Python基础知识(22):进程和线程(Ⅰ) 1.多进程 (1)fork Python的os模块封装了常见的系统调用,其中就包括fork,可以在Python程序中轻松创建子进程 fork可以在Mac ...
- 实际体验Span<T> 的惊人表现
前言 最近做了一个过滤代码块功能的接口.就是获取一些博客文章做文本处理,然后这些博客文章的代码块太多了,很多重复的代码关键词如果被拿过来处理,那么会对文本的特征表示已经特征选择会有很大的影响.所以需要 ...
- 线段树 || BZOJ1756: Vijos1083 小白逛公园 || P4513 小白逛公园
题面:小白逛公园 题解: 对于线段树的每个节点除了普通线段树该维护的东西以外,额外维护lsum(与左端点相连的最大连续区间和).rsum(同理)和sum……就行了 代码: #include<cs ...
- (一)juc线程高级特性——volatile / CAS算法 / ConcurrentHashMap
1. volatile 关键字与内存可见性 原文地址: https://www.cnblogs.com/zjfjava/category/979088.html 内存可见性(Memory Visibi ...