KingbaseES例程之快速删除表数据

概述

快速删除表中的数据

delete语句删除数据

表中的数据被删除了，但是这个数据在硬盘上的真实存储空间不会被释放。

这种删除缺点是：删除效率比较低。

这种删除优点是：支持删除部分数据，支持回滚。
truncate语句删除数据

这种删除效率比较高，表被一次截断，物理删除。

这种删除缺点：不支持删除部分数据。

这种删除优点：快速，支持回滚。

案例：删除大表数据，但保留少量数据

一张表有100万条数据，分为1000组信息，仅保留每组的最后一条数据，如何快速删除其它99万余条数据？

方法一：删除每组非最大值的数据

explain  (analyse,buffers )

delete

from test10

where (c1,id) not in (select c1,max(id) from test10 group by c1)

returning *;

Delete on test10  (cost=36508.94..56943.94 rows=500000 width=6) (actual time=221.183..1732.834 rows=998999 loops=1)

  Buffers: shared hit=2012980

  ->  Seq Scan on test10  (cost=36508.94..56943.94 rows=500000 width=6) (actual time=221.128..583.449 rows=998999 loops=1)

        Filter: (NOT (hashed SubPlan 1))

        Rows Removed by Filter: 1001

        Buffers: shared hit=9547

        SubPlan 1

          ->  GroupAggregate  (cost=0.42..36506.44 rows=1001 width=8) (actual time=0.067..219.780 rows=1001 loops=1)

                Group Key: test10_1.c1

                Buffers: shared hit=4112

                ->  Index Only Scan using idx01 on test10 test10_1  (cost=0.42..31496.42 rows=1000000 width=8) (actual time=0.010..126.628 rows=1000000 loops=1)

                      Heap Fetches: 0

                      Buffers: shared hit=4112

Planning Time: 0.120 ms

Execution Time: 1799.063 ms

方法二：CTE获取每组最新行，删除每组非CTE的数据

explain  (analyse,buffers )

with recursive cte as (

        (select c1, ctid from test10 order by c1, id desc limit 1)

        union all

        (select test10.c1, test10.CTID

         from cte,

              lateral ( select CTID, c1

                        from test10

                        where cte.c1 < test10.c1

                        order by test10.c1, test10.id desc

                        limit 1) test10

        ))

delete from test10

where not exists (select  1 from cte where cte.ctid = test10.ctid )

returning *

;

Delete on test10  (cost=62.30..28121.41 rows=999899 width=36) (actual time=10.799..1627.548 rows=998999 loops=1)

  Buffers: shared hit=2013025

  CTE cte

    ->  Recursive Union  (cost=0.42..59.02 rows=101 width=10) (actual time=0.012..9.888 rows=1001 loops=1)

          Buffers: shared hit=4157

"          ->  Subquery Scan on ""*SELECT* 1""  (cost=0.42..0.49 rows=1 width=10) (actual time=0.010..0.013 rows=1 loops=1)"

                Buffers: shared hit=4

                ->  Limit  (cost=0.42..0.48 rows=1 width=14) (actual time=0.010..0.011 rows=1 loops=1)

                      Buffers: shared hit=4

                      ->  Index Scan using idx02 on test10 test10_1  (cost=0.42..54240.28 rows=1000000 width=14) (actual time=0.010..0.010 rows=1 loops=1)

                            Buffers: shared hit=4

          ->  Nested Loop  (cost=0.42..5.65 rows=10 width=10) (actual time=0.009..0.009 rows=1 loops=1001)

                Buffers: shared hit=4153

                ->  WorkTable Scan on cte cte_1  (cost=0.00..0.20 rows=10 width=4) (actual time=0.000..0.000 rows=1 loops=1001)

                ->  Limit  (cost=0.42..0.53 rows=1 width=14) (actual time=0.009..0.009 rows=1 loops=1001)

                      Buffers: shared hit=4153

                      ->  Index Scan using idx02 on test10 test10_2  (cost=0.42..33409.58 rows=333333 width=14) (actual time=0.009..0.009 rows=1 loops=1001)

                            Index Cond: (c1 > cte_1.c1)

                            Buffers: shared hit=4153

  ->  Hash Anti Join  (cost=3.28..28062.39 rows=999899 width=36) (actual time=10.727..422.146 rows=998999 loops=1)

        Hash Cond: (test10.ctid = cte.ctid)

        Buffers: shared hit=9592

        ->  Seq Scan on test10  (cost=0.00..15435.00 rows=1000000 width=6) (actual time=0.005..141.828 rows=1000000 loops=1)

              Buffers: shared hit=5435

        ->  Hash  (cost=2.02..2.02 rows=101 width=36) (actual time=10.713..10.714 rows=1001 loops=1)

              Buckets: 1024  Batches: 1  Memory Usage: 77kB

              Buffers: shared hit=4157

              ->  CTE Scan on cte  (cost=0.00..2.02 rows=101 width=36) (actual time=0.049..10.400 rows=1001 loops=1)

                    Buffers: shared hit=4157

Planning Time: 0.201 ms

Execution Time: 1691.687 ms

方法三：数组变量与truncate组合，支持事务回滚

do

$$

    declare

        v_rec test10[];

    begin

        v_rec := array(

                with recursive cte as (

                        (select id, c1, c2 from test10 order by c1, id desc limit 1)

                        union all

                        (select test10.id, test10.c1, test10.c2

                         from cte,

                              lateral ( select test10.id, test10.c1, test10.c2

                                        from test10

                                        where cte.c1 < test10.c1

                                        order by test10.c1, test10.id desc

                                        limit 1) test10

                        ))

                select (id, c1, c2)

                from cte);

        truncate test10;

        insert into test10

        select (t).*

        from (select unnest(v_rec) t) t;

        commit;

    exception

        when others then

            rollback;

    end;

$$

;

ANONYMOUS BLOCK

Time: 99.299 ms

TRUNCATE与DML操作的组合，实现通过少量数据的DML操作，实现DELETE大部分数据操作，可以减少执行时长。由于truncate支持事务回滚，可以在发生异常时回滚事务，或主动回滚事务，保证数据的完整性。

KingbaseES例程之快速删除表数据的更多相关文章

oracle 快速删除大批量数据方法（全部删除，条件删除，删除大量重复记录）
oracle 快速删除大批量数据方法(全部删除,条件删除,删除大量重复记录) 分类: ORACLE 数据库 2011-05-24 16:39 8427人阅读评论(2) 收藏举报 oracledel ...
oracle 快速备份表数据
oracle 快速备份表数据 CreateTime--2018年2月28日17:04:50 Author:Marydon UpdateTime--2017年1月20日11:45:07 1.1.9. ...
sql语句中----删除表数据drop、truncate和delete的用法
sql语句中----删除表数据drop.truncate和delete的用法 --drop drop table tb --tb表示数据表的名字,下同删除内容和定义,释放空间.简单来说就是把整 ...
sql语句中----删除表数据的"三兄弟"
说到删除表数据的关键字,大家记得最多的可能就是delete了然而我们做数据库开发,读取数据库数据.对另外的两兄弟用得就比较少了现在来介绍另外两个兄弟,都是删除表数据的,其实也是很容易理解的老大- ...
删除表数据drop、truncate和delete的用法
说到删除表数据的关键字,大家记得最多的可能就是delete了然而我们做数据库开发,读取数据库数据.对另外的两兄弟用得就比较少了现在来介绍另外两个兄弟,都是删除表数据的,其实也是很容易理解的老大- ...
SQLite Expert 删除表数据并重置自动增长列
用下面的语句肯定是行不通的,语句不支持 truncate table t_Records 方法:1.删除表数据 2.重置自动增长列 where name='t_Records' /*name :是表名 ...
sql有几种删除表数据的方式
有几种删除表数据的方式? truncate.delete和drop都可以删除数据. TRUNCATE TABLE删除表中的所有行,而不记录单个行删除操作. TRUNCATE TABLE 与没有 WHE ...
mysql进阶(二十一)删除表数据
MySQL删除表数据在MySQL中有两种方法可以删除数据,一种是DELETE语句,另一种是TRUNCATE TABLE语句.DELETE语句可以通过WHERE对要删除的记录进行选择.而使用TRUNC ...
数据库之删除表数据drop、truncate和delete的用法
数据库中删除表数据的关键字,最常用的可能就是delete了,另外其实还有drop和truncate两个关键字. 老大:drop 命令格式:drop table tb ---tb表示数据表的名字,下 ...

随机推荐

实现领域驱动设计 - 使用ABP框架 - 创建实体
用例演示 - 创建实体本节将演示一些示例用例并讨论可选场景. 创建实体从实体/聚合根类创建对象是实体生命周期的第一步.聚合/聚合根规则和最佳实践部分建议为Entity类创建一个主构造函数,以保证创 ...
ssm框架layui分页下标中文乱码，或者请选择中文乱码，提示乱码等
开始我以为是layui的bug 后来发现不是用过的方法: 1.修改layui的js文件将其中的中文变为encdoe 代码比如laypage.js下的中文 2.添加web.xml的过滤器该代码 ...
js 表面学习 - 认识事件
事件描述 onchange HTML 元素已被改变 onclick 用户点击了 HTML 元素 onmouseover 用户把鼠标移动到 HTML 元素上 onmouseout 用户把鼠标移开 HT ...
ansible概述、安装、模块介绍
一.Ansible介绍 Ansible是一个基于Python开发的配置管理和应用部署工具,现在也在自动化管理领域大放异彩. 它融合了众多老牌运维工具的优点,Pubbet和Saltstack能实现的功 ...
VIM学习笔记-1
VIM vim主要分为3个模式: Normal 模式 Insert模式 command模式 Insert 模式就是普通的编辑模式,没有太多可以介绍的,vim的主要功能都在 Normal 模式和 Com ...
Note -「因数的欧拉函数求和」
归档. 试证明:\(\sum \limits _{d | x} \varphi (d) = x\) Lemma 1. 试证明:\(\sum \limits _{d | p^k} \varphi (d) ...
Java 技术栈中间件优雅停机方案设计与实现全景图
欢迎关注公众号:bin的技术小屋,阅读公众号原文本系列 Netty 源码解析文章基于 4.1.56.Final 版本本文概要在上篇文章我为 Netty 贡献源码 | 且看 Netty 如何应对 ...
互联网界的IT巨变：从DOS的编辑器，到如今的无代码开发
众所周知,Borland Pascal.Turbo Pascal.Turbo C等这类开发工具,都习惯自带IDE. 因此,我产生了一个大胆的想法. DOS时代下的Turbo C 如果说Anders这类 ...
类型转换_float()函数
float()函数不能将文字类的字符串类型转换成小数类型同时将整数转换成浮点数类型的时候会在整数后买你加上.0 print(float(1))//output:1.0 print(float('1' ...
定时脚本删除docker容器中内容
今天在我同步mongo数据库的时候,服务器的磁盘突然就被占满了导致同步中断,mongo容器也停止工作了.然后就想要弄一个能够定时清理同步过程中留存在docker容器中的mongo数据的脚本.话不多说, ...

KingbaseES例程之快速删除表数据

概述

案例：删除大表数据，但保留少量数据

方法一：删除每组非最大值的数据

方法二：CTE获取每组最新行，删除每组非CTE的数据

方法三：数组变量与truncate组合，支持事务回滚

KingbaseES例程之快速删除表数据的更多相关文章

随机推荐

热门专题