11G新特性 -- Multicolumn Statistics (Column groups)

默认oracle会收集表中各个列的统计信息，但是会忽略列之间的关联关系。在大多情况下，优化器假设在复杂查询中的列之间是独立的。当where子句后指定了一个表的多个列条件时，优化器通常会将多个列的选择性（selectivity）相乘得到where语句的选择性，导致优化器做出错误判断！
Oracle 11g引入了多列统计信息概念，如果上面情况列关联性很好，可以做多列统计信息收集，让优化器做出正确判断。

在oracle 10g中，只有在一些特殊场合，优化器才会考虑列之间的关联关系：
-The optimizer used the number of distinct keys in an index to estimate selectivity provided all columns of a conjunctive predicate match all columns of a concatenated index key. In addition, the predicates must be equalities used in equijoins.
- If you set DYNAMIC_SAMPLING to level 4, the optimizer used dynamic sampling to estimate the selectivity of predicates involving multiple columns from a table. Because the sampling size is quite small, the results are dubious in most cases.

创建Column Groups:

DECLARE

  cg_name varchar2();

BEGIN

  cg_name := dbms_stats.create_extended_stats(null,'customers', '(cust_state_province,country_id)');

END;

/

查看Column Groups:

SQL> select extension_name, extension from dba_stat_extensions where table_name='CUSTOMERS';

EXTENSION_NAME                 EXTENSION

------------------------------ --------------------------------------------------------------------------------

SYS_STU#S#WF25Z#QAHIHE#MOFFMM_ ("CUST_STATE_PROVINCE","COUNTRY_ID")

或者

SQL>  select sys.dbms_stats.show_extended_stats_name ('sh','customers','(cust_state_province,country_id)') col_group_name from dual;

COL_GROUP_NAME

--------------------------------------------------

SYS_STU#S#WF25Z#QAHIHE#MOFFMM_

删除:

SQL> exec dbms_stats.drop_extended_stats('sh','customers','(cust_state_province, country_id)');

收集Column Groups的统计信息:

SQL> exec dbms_stats.gather_table_stats('sh','customers',method_opt =>'for all columns size skewonly for columns (cust_state_province,country_id) size skewonly');

监控Column Groups:

--查询多列统计信息

SQL> Select extension_name, extension from user_stat_extensions where table_name='CUSTOMERS';

EXTENSION_NAME                 EXTENSION

------------------------------ --------------------------------------------------------------------------------

SYS_STU#S#WF25Z#QAHIHE#MOFFMM_ ("CUST_STATE_PROVINCE","COUNTRY_ID")

SQL>

--查看distinct数和柱状图使用情况

SQL> select e.extension col_group, t.num_distinct, t.histogram from user_stat_extensions e, user_tab_col_statistics t where e.extension_name = t.column_name and e.table_name = t.table_name and t.table_name = 'CUSTOMERS';

COL_GROUP                                                                        NUM_DISTINCT HISTOGRAM

-------------------------------------------------------------------------------- ------------ ---------------

("CUST_STATE_PROVINCE","COUNTRY_ID")                                                       FREQUENCY

SQL>

实验：
1）当不使用多列统计信息时，真实结果是3341，执行计划是1132.

SQL> exec dbms_stats.drop_extended_stats('sh','customers','(cust_state_province,country_id)');

PL/SQL procedure successfully completed.

SQL> select count(*) from sh.customers where CUST_STATE_PROVINCE = 'CA' and country_id=;

  COUNT(*)

----------

Execution Plan

----------------------------------------------------------

Plan hash value: 

--------------------------------------------------------------------------------

| Id  | Operation          | Name      | Rows  | Bytes | Cost (%CPU)| Time     |

--------------------------------------------------------------------------------

|    | SELECT STATEMENT   |           |      |     |      ()| :: |

|    |  SORT AGGREGATE    |           |      |     |            |          |

|*   |   TABLE ACCESS FULL| CUSTOMERS |   |  |      ()| :: |

--------------------------------------------------------------------------------

Predicate Information (identified by operation id):

---------------------------------------------------

    - filter("CUST_STATE_PROVINCE"='CA' AND "COUNTRY_ID"=)

Statistics

----------------------------------------------------------

          recursive calls

            db block gets

         consistent gets

            physical reads

            redo size

          bytes sent via SQL*Net to client

          bytes received via SQL*Net from client

            SQL*Net roundtrips to/from client

           sorts (memory)

            sorts (disk)

            rows processed

2）当使用多列统计信息时，真实结果是3341，执行计划是3437.

SQL> EXEC DBMS_STATS.GATHER_TABLE_STATS('SH','CUSTOMERS',METHOD_OPT =>'FOR ALL COLUMNS SIZE SKEWONLY FOR COLUMNS (CUST_STATE_PROVINCE,COUNTRY_ID) SIZE SKEWONLY');

PL/SQL procedure successfully completed.

SQL>  select count(*) from sh.customers where CUST_STATE_PROVINCE = 'CA' and country_id=;

  COUNT(*)

----------

Execution Plan

----------------------------------------------------------

Plan hash value: 

--------------------------------------------------------------------------------

| Id  | Operation          | Name      | Rows  | Bytes | Cost (%CPU)| Time     |

--------------------------------------------------------------------------------

|    | SELECT STATEMENT   |           |      |     |      ()| :: |

|    |  SORT AGGREGATE    |           |      |     |            |          |

|*   |   TABLE ACCESS FULL| CUSTOMERS |   |  |      ()| :: |

--------------------------------------------------------------------------------

Predicate Information (identified by operation id):

---------------------------------------------------

    - filter("CUST_STATE_PROVINCE"='CA' AND "COUNTRY_ID"=)

Statistics

----------------------------------------------------------

            recursive calls

            db block gets

         consistent gets

            physical reads

            redo size

          bytes sent via SQL*Net to client

          bytes received via SQL*Net from client

            SQL*Net roundtrips to/from client

            sorts (memory)

            sorts (disk)

            rows processed

3）即以上情况，使用多列统计信息能让优化器得到更准确的判断！

11G新特性 -- Multicolumn Statistics (Column groups)的更多相关文章

11G新特性 -- Expression Statistics
当在查询中使用了function,返回值会受到影响. 比如: select count(*) from customers where lower(cust_state_province)='ca'; ...
11g新特性与12c新特性
1. 11g新特性概图管理新特性> 开发新特性> 2. 12c 新特性概图
11g新特性-自动sql调优(Automatic SQL Tuning)
11g新特性-自动sql调优(Automatic SQL Tuning) 在Oracle 10g中,引进了自动sql调优特性.此外,ADDM也会监控捕获高负载的sql语句. 在Oracle 11g中, ...
使用Oracle 11g新特性 Active Database Duplication 搭建Dataguard环境
Duplication Database 介绍 Duplicate database可以按照用途分为2种: duplicate database(复制出一个数据库) duplicate standby ...
Oracle 11g 新特性 --SQL Plan Management 说明
Oracle 11g 新特性 --SQL Plan Management 说明参见大神博主文章: http://blog.csdn.net/tianlesoftware/article/detail ...
Oracle 11g 新特性 – HM（Hang Manager）简介
在这篇文章中我们会对oracle 11g 新特性—hang 管理器(Hang Manager) 进行介绍.我们需要说明,HM 只在RAC 数据库中存在. 在我们诊断数据库问题的时候,经常会遇到一些数据 ...
11G 新特性之密码延迟认证
11G 新特性之密码延迟认证 11G 新特性之密码延迟认证 Table of Contents 1. 特性简述 2. 特性潜在引发问题 3. 关闭特性 1 特性简述为了防止用户密码的暴力破解,从 ...
11G新特性 -- Statistics Preferences
Statistics Preferences新特性可以实现对指定对象进行信息收集. 可以在table.schema.database.global级别设置statistics preference. ...
Oracle 11g新特性
文章转自网络 Oracle 11g于2007年7月11日美国东部时间11时(北京时间11日22时)正式发布,11g是甲骨文公司30年来发布的最重要的数据库版本,根据用户的需求实现了信息生命周期管理(I ...

随机推荐

POJ 2376 Cleaning Shifts【贪心】
POJ 2376 题意: 给出一给大区间和n各小区间,问最少可以用多少小区间覆盖整个大区间. 分析: 贪心法.设t为当前所有已确定区间的最右端,那我们可以每次都取所有可选的小区间(左端点<=t+ ...
Flink（三）Flink开发IDEA环境搭建与测试
一.IDEA开发环境 1.pom文件设置 <properties> <maven.compiler.source>1.8</maven.compiler.source&g ...
Codeforces 643C Levels and Regions 斜率优化dp
Levels and Regions 把dp方程列出来, 把所有东西拆成前缀的形式, 就能看出可以斜率优化啦. #include<bits/stdc++.h> #define LL lon ...
poj2186-Popular Cows【Tarjan】+(染色+缩点)（经典）
<题目链接> 题目大意: 有N(N<=10000)头牛,每头牛都想成为most poluler的牛,给出M(M<=50000)个关系,如(1,2)代表1欢迎2,关系可以传递,但 ...
mac OS X下Java项目环境搭建+IntelliJ IDEA Jrebel插件安装与破解+Office 2016破解版安装
一.mac OS X下Java项目环境搭建因为某些原因新入手了台最新版的MacBook Pro,意味着今天要花一天时间安装各种软件以及项目环境搭建╮(╯▽╰)╭ 项目环境搭建步骤: 1.安装jdk ...
IdentityServer4-用EF配置Client（一）
一.背景 IdentityServer4的介绍将不再叙述,百度下可以找到,且官网的快速入门例子也有翻译的版本.这里主要从Client应用场景方面介绍对IdentityServer4的应用. 首先简要介 ...
玩转SpringCloud（F版本）四．路由网关(zuul)
本篇文章基于: 01)玩转SpringCloud 一．服务的注册与发现(Eureka) 02) 玩转SpringCloud 二．服务消费者(1)ribbon+restTemplate 03) 玩转Sp ...
10.25 正睿停课训练 Day9
目录 2018.10.25 正睿停课训练 Day9 A 数独(思路 DP) B 红绿灯(最短路Dijkstra) C 轰炸(计算几何圆并) 考试代码 B C 2018.10.25 正睿停课训练 Da ...
洛谷P3373 [模板]线段树 2(区间增减.乘区间求和)
To 洛谷.3373 [模板]线段树2 题目描述如题,已知一个数列,你需要进行下面两种操作: 1.将某区间每一个数加上x 2.将某区间每一个数乘上x 3.求出某区间每一个数的和输入输出格式输入格 ...
3dmax多个版本软件的安装包以及安装教程
这个文档具体出自哪里,我也是记不得了,需要的看下,链接如果是失效,那我也无能为力了. 免费分享,链接永久有效 2014版3D MAX链接:http://pan.baidu.com/s/1nuFr7Xv ...

11G新特性 -- Multicolumn Statistics (Column groups)

11G新特性 -- Multicolumn Statistics (Column groups)的更多相关文章

随机推荐

热门专题