KingbaseES例程之快速删除表数据

概述

快速删除表中的数据

delete语句删除数据

表中的数据被删除了，但是这个数据在硬盘上的真实存储空间不会被释放。

这种删除缺点是：删除效率比较低。

这种删除优点是：支持删除部分数据，支持回滚。
truncate语句删除数据

这种删除效率比较高，表被一次截断，物理删除。

这种删除缺点：不支持删除部分数据。

这种删除优点：快速，支持回滚。

案例：删除大表数据，但保留少量数据

一张表有100万条数据，分为1000组信息，仅保留每组的最后一条数据，如何快速删除其它99万余条数据？

方法一：删除每组非最大值的数据

explain  (analyse,buffers )

delete

from test10

where (c1,id) not in (select c1,max(id) from test10 group by c1)

returning *;

Delete on test10  (cost=36508.94..56943.94 rows=500000 width=6) (actual time=221.183..1732.834 rows=998999 loops=1)

  Buffers: shared hit=2012980

  ->  Seq Scan on test10  (cost=36508.94..56943.94 rows=500000 width=6) (actual time=221.128..583.449 rows=998999 loops=1)

        Filter: (NOT (hashed SubPlan 1))

        Rows Removed by Filter: 1001

        Buffers: shared hit=9547

        SubPlan 1

          ->  GroupAggregate  (cost=0.42..36506.44 rows=1001 width=8) (actual time=0.067..219.780 rows=1001 loops=1)

                Group Key: test10_1.c1

                Buffers: shared hit=4112

                ->  Index Only Scan using idx01 on test10 test10_1  (cost=0.42..31496.42 rows=1000000 width=8) (actual time=0.010..126.628 rows=1000000 loops=1)

                      Heap Fetches: 0

                      Buffers: shared hit=4112

Planning Time: 0.120 ms

Execution Time: 1799.063 ms

方法二：CTE获取每组最新行，删除每组非CTE的数据

explain  (analyse,buffers )

with recursive cte as (

        (select c1, ctid from test10 order by c1, id desc limit 1)

        union all

        (select test10.c1, test10.CTID

         from cte,

              lateral ( select CTID, c1

                        from test10

                        where cte.c1 < test10.c1

                        order by test10.c1, test10.id desc

                        limit 1) test10

        ))

delete from test10

where not exists (select  1 from cte where cte.ctid = test10.ctid )

returning *

;

Delete on test10  (cost=62.30..28121.41 rows=999899 width=36) (actual time=10.799..1627.548 rows=998999 loops=1)

  Buffers: shared hit=2013025

  CTE cte

    ->  Recursive Union  (cost=0.42..59.02 rows=101 width=10) (actual time=0.012..9.888 rows=1001 loops=1)

          Buffers: shared hit=4157

"          ->  Subquery Scan on ""*SELECT* 1""  (cost=0.42..0.49 rows=1 width=10) (actual time=0.010..0.013 rows=1 loops=1)"

                Buffers: shared hit=4

                ->  Limit  (cost=0.42..0.48 rows=1 width=14) (actual time=0.010..0.011 rows=1 loops=1)

                      Buffers: shared hit=4

                      ->  Index Scan using idx02 on test10 test10_1  (cost=0.42..54240.28 rows=1000000 width=14) (actual time=0.010..0.010 rows=1 loops=1)

                            Buffers: shared hit=4

          ->  Nested Loop  (cost=0.42..5.65 rows=10 width=10) (actual time=0.009..0.009 rows=1 loops=1001)

                Buffers: shared hit=4153

                ->  WorkTable Scan on cte cte_1  (cost=0.00..0.20 rows=10 width=4) (actual time=0.000..0.000 rows=1 loops=1001)

                ->  Limit  (cost=0.42..0.53 rows=1 width=14) (actual time=0.009..0.009 rows=1 loops=1001)

                      Buffers: shared hit=4153

                      ->  Index Scan using idx02 on test10 test10_2  (cost=0.42..33409.58 rows=333333 width=14) (actual time=0.009..0.009 rows=1 loops=1001)

                            Index Cond: (c1 > cte_1.c1)

                            Buffers: shared hit=4153

  ->  Hash Anti Join  (cost=3.28..28062.39 rows=999899 width=36) (actual time=10.727..422.146 rows=998999 loops=1)

        Hash Cond: (test10.ctid = cte.ctid)

        Buffers: shared hit=9592

        ->  Seq Scan on test10  (cost=0.00..15435.00 rows=1000000 width=6) (actual time=0.005..141.828 rows=1000000 loops=1)

              Buffers: shared hit=5435

        ->  Hash  (cost=2.02..2.02 rows=101 width=36) (actual time=10.713..10.714 rows=1001 loops=1)

              Buckets: 1024  Batches: 1  Memory Usage: 77kB

              Buffers: shared hit=4157

              ->  CTE Scan on cte  (cost=0.00..2.02 rows=101 width=36) (actual time=0.049..10.400 rows=1001 loops=1)

                    Buffers: shared hit=4157

Planning Time: 0.201 ms

Execution Time: 1691.687 ms

方法三：数组变量与truncate组合，支持事务回滚

do

$$

    declare

        v_rec test10[];

    begin

        v_rec := array(

                with recursive cte as (

                        (select id, c1, c2 from test10 order by c1, id desc limit 1)

                        union all

                        (select test10.id, test10.c1, test10.c2

                         from cte,

                              lateral ( select test10.id, test10.c1, test10.c2

                                        from test10

                                        where cte.c1 < test10.c1

                                        order by test10.c1, test10.id desc

                                        limit 1) test10

                        ))

                select (id, c1, c2)

                from cte);

        truncate test10;

        insert into test10

        select (t).*

        from (select unnest(v_rec) t) t;

        commit;

    exception

        when others then

            rollback;

    end;

$$

;

ANONYMOUS BLOCK

Time: 99.299 ms

TRUNCATE与DML操作的组合，实现通过少量数据的DML操作，实现DELETE大部分数据操作，可以减少执行时长。由于truncate支持事务回滚，可以在发生异常时回滚事务，或主动回滚事务，保证数据的完整性。

KingbaseES例程之快速删除表数据的更多相关文章

oracle 快速删除大批量数据方法（全部删除，条件删除，删除大量重复记录）
oracle 快速删除大批量数据方法(全部删除,条件删除,删除大量重复记录) 分类: ORACLE 数据库 2011-05-24 16:39 8427人阅读评论(2) 收藏举报 oracledel ...
oracle 快速备份表数据
oracle 快速备份表数据 CreateTime--2018年2月28日17:04:50 Author:Marydon UpdateTime--2017年1月20日11:45:07 1.1.9. ...
sql语句中----删除表数据drop、truncate和delete的用法
sql语句中----删除表数据drop.truncate和delete的用法 --drop drop table tb --tb表示数据表的名字,下同删除内容和定义,释放空间.简单来说就是把整 ...
sql语句中----删除表数据的"三兄弟"
说到删除表数据的关键字,大家记得最多的可能就是delete了然而我们做数据库开发,读取数据库数据.对另外的两兄弟用得就比较少了现在来介绍另外两个兄弟,都是删除表数据的,其实也是很容易理解的老大- ...
删除表数据drop、truncate和delete的用法
说到删除表数据的关键字,大家记得最多的可能就是delete了然而我们做数据库开发,读取数据库数据.对另外的两兄弟用得就比较少了现在来介绍另外两个兄弟,都是删除表数据的,其实也是很容易理解的老大- ...
SQLite Expert 删除表数据并重置自动增长列
用下面的语句肯定是行不通的,语句不支持 truncate table t_Records 方法:1.删除表数据 2.重置自动增长列 where name='t_Records' /*name :是表名 ...
sql有几种删除表数据的方式
有几种删除表数据的方式? truncate.delete和drop都可以删除数据. TRUNCATE TABLE删除表中的所有行,而不记录单个行删除操作. TRUNCATE TABLE 与没有 WHE ...
mysql进阶(二十一)删除表数据
MySQL删除表数据在MySQL中有两种方法可以删除数据,一种是DELETE语句,另一种是TRUNCATE TABLE语句.DELETE语句可以通过WHERE对要删除的记录进行选择.而使用TRUNC ...
数据库之删除表数据drop、truncate和delete的用法
数据库中删除表数据的关键字,最常用的可能就是delete了,另外其实还有drop和truncate两个关键字. 老大:drop 命令格式:drop table tb ---tb表示数据表的名字,下 ...

随机推荐

重学ES系列之模版字符串
<!DOCTYPE html> <html lang="en"> <head> <meta charset="UTF-8&quo ...
CAD图在线Web测量工具代码实现(测量距离、面积、角度等)
CAD如今在各个领域均得到了普遍的应用并大大提高了工程技术人员的工作效率.在桌面端,AutoCAD测量工具已经非常强大:然后在Web端,如何准确.快速的对CAD图在Web进行测量呢? 功能能Web在 ...
sap 获取设置的打印机参数
*&---------------------------------------------------------------------* *& Form FRM_SET_PRI ...
从一道算法题实现一个文本diff小工具
众所周知,很多社区都是有内容审核机制的,除了第一次发布,后续的修改也需要审核,最粗暴的方式当然是从头再看一遍,但是编辑肯定想弄死你,显然这样效率比较低,比如就改了一个错别字,再看几遍可能也看不出来,所 ...
Python+opencv打开修图的正确方式get
先逼逼两句: 图像是 Web 应用中除文字外最普遍的媒体格式. 流行的 Web 静态图片有 JPEG.PNG.ICO.BMP 等.动态图片主要是 GIF 格式.为了节省图片传输流量,大型互联网公司还会 ...
win10设置Python程序定时运行(设置计划任务)
今天来设置一下定时执行Pycharm内的脚本: 这个要基于win10 的任务计划程序(设置 > 控制面板 > 系统和安全 > 管理工具 > 任务计划程序) 1. create ...
JDBCTools 第一个版本
JDBCToolV1: package com.dgd.test; import com.alibaba.druid.pool.DruidDataSourceFactory; import javax ...
eclipse使用小记录
(手动狗头)之前用eclipse的时候左侧的project栏不知道为什么整没了....记录一下 1.击Window--how View--other 2.Project Explorer,就可以了
使用开源Cesium+Vue实现倾斜摄影三维展示
准备工作 VUE开发工具:Visual studio Code 倾斜摄影转换工具:CesiumLab-下载地址:http://www.cesiumlab.com/ 三维显示:Cesium,api参考网 ...
开源一个自动整理B站UWP客户端软件进行批量下载的视频文件的小工具BiliVideosReoganizeHelper
大家都知道B站是一个很受欢迎的视频学习网站,上面有很多无私的up主上传了大量优秀的教学视频,在此向B站致敬,向广大UP主致敬. 有时,我们需要下载收藏一些视频,以防止以后找不到了.那么我们可以用B ...

KingbaseES例程之快速删除表数据

概述

案例：删除大表数据，但保留少量数据

方法一：删除每组非最大值的数据

方法二：CTE获取每组最新行，删除每组非CTE的数据

方法三：数组变量与truncate组合，支持事务回滚

KingbaseES例程之快速删除表数据的更多相关文章

随机推荐

热门专题