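Summary: Batch-processing jobs usually join large volumes of data, so using indexes is generally not recommended. Occasionally the optimizer misjudges the plan and chooses an index scan, which can cause severe performance problems. Starting from a typical case where an index caused a slowdown, this article analyzes the characteristics of this kind of problem, how to locate it, and how to fix it.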

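This article is shared from the Huawei Cloud community post "GaussDB(DWS) Performance Tuning: Identifying and Optimizing Performance Problems Caused by indexscan #[Mastering the PB-Scale Data Warehouse GaussDB(DWS)]", by 譡里个檔.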

1) During a POC test at a customer site, one SQL statement was found to be slow. The original SQL is as follows:

WITH
/**
etl_116583_7960703_994644
**/
LOADABLE as (select "boq_rel_type_id","to_pu_id","to_version","cycle_id",
"part_offset_flag","to_boq_id","descr","from_contract_id",
"from_version","from_pu_id","ss_id","to_contract_id",
"from_boq_id","enable_flag","last_update_date"
from (SELECT /*+ PARALLEL(4)*/
BOQ_REL.FROM_BOQ_ID,
BOQ_REL.TO_BOQ_ID,
BOQ_REL.FROM_PU_ID,
BOQ_REL.TO_PU_ID,
BOQ_REL.PART_OFFSET_FLAG,
BOQ_REL.DESCR,
BOQ_REL.SS_ID,
BOQ_REL.CYCLE_ID,
NVL(BOQ_REL.FROM_VERSION, 'SNULL') FROM_VERSION,
NVL(BOQ_REL.TO_VERSION, 'SNULL') TO_VERSION,
BOQ_REL.LAST_UPDATE_DATE,
FROM_CON.CONTRACT_ID AS FROM_CONTRACT_ID,
TO_CON.CONTRACT_ID AS TO_CONTRACT_ID,
CLA.CLASS_ID AS BOQ_REL_TYPE_ID,
BOQ_REL.ENABLE_FLAG
FROM (SELECT A.FROM_BOQ_ID,
A.TO_BOQ_ID,
A.FROM_PU_ID,
A.TO_PU_ID,
A.FROM_CONTRACT_NUMBER,
A.TO_CONTRACT_NUMBER,
A.BOQ_REL_TYPE_CODE,
A.PART_OFFSET_FLAG,
A.DESCR,
A.SS_ID,
A.FROM_VERSION,
A.TO_VERSION,
A.LAST_UPDATE_DATE,
A.CYCLE_ID,
A.ENABLE_FLAG,
DECODE(A.SS_ID, 2820, 2600, A.SS_ID) SS_ID_TMP,
ROW_NUMBER() OVER(PARTITION BY FROM_BOQ_ID, TO_BOQ_ID, FROM_PU_ID, TO_PU_ID, FROM_CONTRACT_NUMBER, TO_CONTRACT_NUMBER, BOQ_REL_TYPE_CODE, FROM_VERSION, TO_VERSION
ORDER BY DECODE(A.SS_ID, 2820, 1, 2600, 2, 3)) RN
FROM LDB_MD_BOQ_REL A) BOQ_REL,
(SELECT CONTRACT_ID,
HW_CONTRACT_NUM,
SS_ID
FROM DWI_MD_CONTRACT
WHERE CONTRACT_ID IS NOT NULL
AND END_TIME = TO_DATE('4712-12-31', 'YYYY-MM-DD')) FROM_CON,
(SELECT CONTRACT_ID,
HW_CONTRACT_NUM,
SS_ID
FROM DWI_MD_CONTRACT
WHERE CONTRACT_ID IS NOT NULL
AND END_TIME = TO_DATE('4712-12-31', 'YYYY-MM-DD')) TO_CON,
(SELECT CLASS_ID,
CODE,
CLASS_TYPE_ID,
SS_ID
FROM DWI_MD_CLASS
WHERE CLASS_TYPE_ID = 193) CLA
WHERE BOQ_REL.RN = 1
AND BOQ_REL.FROM_CONTRACT_NUMBER = FROM_CON.HW_CONTRACT_NUM
AND BOQ_REL.SS_ID = FROM_CON.SS_ID
AND BOQ_REL.TO_CONTRACT_NUMBER = TO_CON.HW_CONTRACT_NUM
AND BOQ_REL.SS_ID = TO_CON.SS_ID
AND BOQ_REL.BOQ_REL_TYPE_CODE = CLA.CODE
AND BOQ_REL.SS_ID_TMP = CLA.SS_ID
) t
),
BEFORE_TARGET as (select "from_contract_id","from_pu_id","ss_id","from_boq_id","from_version","to_version",
"crt_cycle_id","to_pu_id","to_boq_id","del_flag","last_upd_cycle_id","last_update_date",
"descr","enable_flag","crt_job_instance_id","dq_improve_flag","upd_job_instance_id",
"to_contract_id","part_offset_flag","boq_rel_type_id"
from (SELECT /*+PARALLEL(4)*/
FROM_BOQ_ID,
TO_BOQ_ID,
FROM_PU_ID,
TO_PU_ID,
FROM_CONTRACT_ID,
TO_CONTRACT_ID,
BOQ_REL_TYPE_ID,
PART_OFFSET_FLAG,
DESCR,
SS_ID,
CRT_CYCLE_ID,
LAST_UPD_CYCLE_ID,
DEL_FLAG,
DQ_IMPROVE_FLAG,
CRT_JOB_INSTANCE_ID,
UPD_JOB_INSTANCE_ID,
NVL(FROM_VERSION, 'SNULL') FROM_VERSION,
NVL(TO_VERSION, 'SNULL') TO_VERSION,
LAST_UPDATE_DATE,
ENABLE_FLAG
FROM DWI_MD_BOQ_REL
) t
),
CDC as (select LOADABLE."ss_id",LOADABLE."from_version",LOADABLE."from_boq_id",
LOADABLE."part_offset_flag",LOADABLE."from_pu_id",
case when BEFORE_TARGET.BOQ_REL_TYPE_ID is null and BEFORE_TARGET.FROM_BOQ_ID is null
and BEFORE_TARGET.FROM_CONTRACT_ID is null and BEFORE_TARGET.FROM_PU_ID is null
and BEFORE_TARGET.FROM_VERSION is null
and BEFORE_TARGET.TO_BOQ_ID is null and BEFORE_TARGET.TO_CONTRACT_ID is null
and BEFORE_TARGET.TO_PU_ID is null and BEFORE_TARGET.TO_VERSION is null
then 1
else 3
end as "change_code",
LOADABLE."to_version",LOADABLE."boq_rel_type_id",
LOADABLE."from_contract_id",LOADABLE."to_contract_id",
LOADABLE."descr",LOADABLE."last_update_date",
LOADABLE."to_pu_id",LOADABLE."enable_flag",LOADABLE."cycle_id",
LOADABLE."to_boq_id"
from LOADABLE
left join BEFORE_TARGET on LOADABLE.BOQ_REL_TYPE_ID = BEFORE_TARGET.BOQ_REL_TYPE_ID
and LOADABLE.FROM_BOQ_ID = BEFORE_TARGET.FROM_BOQ_ID and LOADABLE.FROM_CONTRACT_ID = BEFORE_TARGET.FROM_CONTRACT_ID
and LOADABLE.FROM_PU_ID = BEFORE_TARGET.FROM_PU_ID and LOADABLE.FROM_VERSION = BEFORE_TARGET.FROM_VERSION
and LOADABLE.TO_BOQ_ID = BEFORE_TARGET.TO_BOQ_ID and LOADABLE.TO_CONTRACT_ID = BEFORE_TARGET.TO_CONTRACT_ID
and LOADABLE.TO_PU_ID = BEFORE_TARGET.TO_PU_ID and LOADABLE.TO_VERSION = BEFORE_TARGET.TO_VERSION
),
TFM_FILTER_DATA_TARGET_OUTPUT_U as (select CDC."to_pu_id",CDC."boq_rel_type_id",CDC."ss_id",
Current_Timestamp() as "dw_last_update_date",CDC."to_version",
CDC."from_version",20230104000000 as "last_upd_cycle_id",
CDC."from_contract_id",CDC."last_update_date",CDC."descr",
'N' as "del_flag",CDC."from_boq_id",CDC."to_boq_id",
CDC."enable_flag",CDC."from_pu_id",-1 as "upd_job_instance_id",
'N' as "dq_improve_flag",CDC."to_contract_id",
CDC."part_offset_flag"
from CDC where CDC.change_code=3
)
update DWI_MD_BOQ_REL TARGET_U
set "dq_improve_flag" = TFM_FILTER_DATA_TARGET_OUTPUT_U."dq_improve_flag",
"dw_last_update_date" = TFM_FILTER_DATA_TARGET_OUTPUT_U."dw_last_update_date",
"upd_job_instance_id" = TFM_FILTER_DATA_TARGET_OUTPUT_U."upd_job_instance_id",
"descr" = TFM_FILTER_DATA_TARGET_OUTPUT_U."descr",
"part_offset_flag" = TFM_FILTER_DATA_TARGET_OUTPUT_U."part_offset_flag",
"last_update_date" = TFM_FILTER_DATA_TARGET_OUTPUT_U."last_update_date",
"del_flag" = TFM_FILTER_DATA_TARGET_OUTPUT_U."del_flag",
"last_upd_cycle_id" = TFM_FILTER_DATA_TARGET_OUTPUT_U."last_upd_cycle_id",
"enable_flag" = TFM_FILTER_DATA_TARGET_OUTPUT_U."enable_flag",
"ss_id" = TFM_FILTER_DATA_TARGET_OUTPUT_U."ss_id"
from TFM_FILTER_DATA_TARGET_OUTPUT_U
where TARGET_U."boq_rel_type_id" = TFM_FILTER_DATA_TARGET_OUTPUT_U."boq_rel_type_id"
and TARGET_U."to_version" = TFM_FILTER_DATA_TARGET_OUTPUT_U."to_version"
and TARGET_U."to_version" = TFM_FILTER_DATA_TARGET_OUTPUT_U."to_version"
and TARGET_U."to_pu_id" = TFM_FILTER_DATA_TARGET_OUTPUT_U."to_pu_id"
and TARGET_U."to_pu_id" = TFM_FILTER_DATA_TARGET_OUTPUT_U."to_pu_id"
and TARGET_U."to_contract_id" = TFM_FILTER_DATA_TARGET_OUTPUT_U."to_contract_id"
and TARGET_U."to_contract_id" = TFM_FILTER_DATA_TARGET_OUTPUT_U."to_contract_id"
and TARGET_U."to_boq_id" = TFM_FILTER_DATA_TARGET_OUTPUT_U."to_boq_id"
and TARGET_U."to_boq_id" = TFM_FILTER_DATA_TARGET_OUTPUT_U."to_boq_id"
and TARGET_U."from_version" = TFM_FILTER_DATA_TARGET_OUTPUT_U."from_version"
and TARGET_U."from_version" = TFM_FILTER_DATA_TARGET_OUTPUT_U."from_version"
and TARGET_U."from_pu_id" = TFM_FILTER_DATA_TARGET_OUTPUT_U."from_pu_id"
and TARGET_U."from_pu_id" = TFM_FILTER_DATA_TARGET_OUTPUT_U."from_pu_id"
and TARGET_U."from_contract_id" = TFM_FILTER_DATA_TARGET_OUTPUT_U."from_contract_id"
and TARGET_U."from_contract_id" = TFM_FILTER_DATA_TARGET_OUTPUT_U."from_contract_id"
and TARGET_U."from_boq_id" = TFM_FILTER_DATA_TARGET_OUTPUT_U."from_boq_id"
and TARGET_U."from_boq_id" = TFM_FILTER_DATA_TARGET_OUTPUT_U."from_boq_id"
and TARGET_U."boq_rel_type_id" = TFM_FILTER_DATA_TARGET_OUTPUT_U."boq_rel_type_id"
;

2) Query the warning field of this query's TopSQL record. The SQL self-diagnosis information contains index-related warnings.
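A lookup along the following lines is one way to pull the self-diagnosis warning for the slow statement. It is a minimal sketch, not the author's exact query: it assumes the historical TopSQL view is pgxc_wlm_session_info, that it exposes the queryid, start_time, duration, query and warning columns, and that the statement can be located by the etl_116583_7960703_994644 comment it carries. View and column names may differ between DWS versions.

```sql
-- Assumption: historical TopSQL records are exposed through
-- pgxc_wlm_session_info; adjust the view name for your DWS version.
SELECT queryid,
       start_time,
       duration,   -- total execution time in ms
       warning     -- SQL self-diagnosis messages, e.g. index-related warnings
  FROM pgxc_wlm_session_info
 WHERE query LIKE '%etl_116583_7960703_994644%'
 ORDER BY start_time DESC
 LIMIT 5;
```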

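3) Query the TopSQL record for this query and analyze the historical execution information (shown as a screenshot in the original post). The CStore Index Scan operator with id=20 took 90796.980 ms, while the whole SQL statement ran for 137135.658 ms, so this single operator accounts for about 66% of the total execution time.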
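The per-operator timings come from the recorded plan of the historical execution. The sketch below assumes the same pgxc_wlm_session_info view with a query_plan column, and the queryid value is a hypothetical placeholder to be replaced with the one found in the previous step. The second statement simply reproduces the 66% figure from the two durations quoted above.

```sql
-- Assumption: query_plan holds the executed plan with per-operator timings.
-- The queryid literal below is a placeholder, not a value from the article.
SELECT duration, query_plan
  FROM pgxc_wlm_session_info
 WHERE queryid = 0;  -- replace 0 with the real queryid

-- Share of the CStore Index Scan operator in the total run time.
SELECT round(90796.980 / 137135.658 * 100, 1) AS index_scan_pct;  -- 66.2
```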

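4) Go back to the original SQL statement and add a hint on the table dwimd.dwi_md_contract used in the query, forcing a sequential scan so that it no longer uses indexscan (the full statement is in the attachment).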
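The hinted statement itself is only available in the attachment, so the following is a sketch of what such a hint could look like rather than the author's exact text. It assumes the DWS plan-hint form tablescan(table), written immediately after the SELECT keyword of the query block that reads the table, and it uses the unqualified table name as in the original SQL.

```sql
-- Sketch: force a sequential (table) scan on DWI_MD_CONTRACT.
-- In the original statement this inner SELECT appears twice, as the
-- FROM_CON and TO_CON subqueries, so the hint would go into both blocks.
SELECT /*+ tablescan(DWI_MD_CONTRACT) */
       CONTRACT_ID,
       HW_CONTRACT_NUM,
       SS_ID
  FROM DWI_MD_CONTRACT
 WHERE CONTRACT_ID IS NOT NULL
   AND END_TIME = TO_DATE('4712-12-31', 'YYYY-MM-DD');
```

A hint scoped to one table keeps the rest of the plan under the optimizer's control, which is less intrusive than disabling index scans for the whole session.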

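5) Run explain verbose on the statement and check the plan. It matches expectations: the table dwimd.dwi_md_contract is now read with tablescan, which for a column-store table is displayed in the plan as CStore Scan.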

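6) Run EXPLAIN ANALYZE on the statement (which actually executes it) and check the real execution time. The SQL statement is now nearly 10 times faster. The full execution information is in the attachment.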
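The two verification steps can be sketched as follows. The SELECT is the hinted inner block from the original statement, and the UPDATE is a deliberately trivial, hypothetical stand-in for the full statement from the attachment. Because EXPLAIN ANALYZE really executes the statement, wrapping an UPDATE in a transaction and rolling it back is a common precaution when the data should stay unchanged; this is an added suggestion, not something the original post describes.

```sql
-- Step 5: show the plan only; nothing is executed.
-- Expect CStore Scan instead of CStore Index Scan on DWI_MD_CONTRACT.
EXPLAIN VERBOSE
SELECT /*+ tablescan(DWI_MD_CONTRACT) */
       CONTRACT_ID, HW_CONTRACT_NUM, SS_ID
  FROM DWI_MD_CONTRACT
 WHERE CONTRACT_ID IS NOT NULL
   AND END_TIME = TO_DATE('4712-12-31', 'YYYY-MM-DD');

-- Step 6: EXPLAIN ANALYZE runs the statement and reports real timings.
START TRANSACTION;
EXPLAIN ANALYZE
UPDATE DWI_MD_BOQ_REL
   SET "del_flag" = 'N'
 WHERE 1 = 0;   -- hypothetical stand-in for the full hinted UPDATE
ROLLBACK;       -- discard any changes made while measuring
```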
