Postgres是如何管理空值的

创建表test,y字段插入null.

test=# create table test(x bigint,y bigint,z text);

CREATE TABLE

test=# insert into test values(11,null,'abcdefg');

INSERT 0 1

test=# select * from test;

 x  | y |    z

----+---+---------

 11 |   | abcdefg

(1 row)

这条记录存储在数据页里面是:

(gdb) p	*lpp

$17 = {lp_off = 8152, lp_flags = 1, lp_len = 40}

占了40个字节,TupleHeader是24个字节，40-24=16。

我们再插入一条数据看看

test=# insert into test values(22,100,'xyz');

INSERT 0 1

test=# select * from test;

 x  |  y  |    z

----+-----+---------

 11 |     | abcdefg

 22 | 100 | abcdefg

(2 rows)

再看看新插入的记录:

(gdb) p	*lpp

$19 = {lp_off = 8104, lp_flags = 1, lp_len = 48}

这2条数据对比

第一条记录:

insert into test values(11,null,'abcdefg');

字段x是bigint,8字节

字段y是bigint,8字节

字段z是text,字符串strlen('abcdefg')=7,8字节对齐。

8152+40=8192

第二条记录：

insert into test values(22,100,'abcdefg');

8+8+8=24

24+TupleHeader=48字节

8104+48=8152

我们来看看第一条是如何存储NULL的：

代码:src/include/access/tupmacs.h

#define att_isnull(ATT, BITS) (!((BITS)[(ATT) >> 3] & (1 << ((ATT) & 0x07))))

这就是计算字段是否为NULL的宏

PG通过t_bits来标记是否为null.

struct HeapTupleHeaderData

{

	union

	{

		HeapTupleFields t_heap;

		DatumTupleFields t_datum;

	}			t_choice;

	ItemPointerData t_ctid;		/* current TID of this or newer tuple (or a

								 * speculative insertion token) */

	/* Fields below here must match MinimalTupleData! */

	uint16		t_infomask2;	/* number of attributes + various flags */

	uint16		t_infomask;		/* various flag bits, see below */

	uint8		t_hoff;			/* sizeof header incl. bitmap, padding */

	/* ^ - 23 bytes - ^ */

	bits8		t_bits[FLEXIBLE_ARRAY_MEMBER];	/* bitmap of NULLs */

	/* MORE DATA FOLLOWS AT END OF STRUCT */

};

t_bits是一个字节，那么就有8位。所有上面的宏需要根据ATT来移3位。ATT>>3

这是算这个字段是在第几个数组下标

例如我这里查询的是第一个字段1>>3 = 0 就是在第一个数组下标

ATT & 0x07 求字段顺序第一条是 0 & 0x07 = 0 ,如果是第八个字段也是0，范围就是0-7

test=# select * from test;

 x  |  y  |    z

----+-----+---------

 11 |     | abcdefg

 22 | 100 | abcdefg

(2 rows)

第一条数据t_bits是这样的

0 0 0 0 0 1 0 1

根据上面的宏计算结果:

第一个字段:

(BITS)[(ATT) >> 3] = BITS[0]

(ATT) & 0x07 = 0 & 0x07 = 0

1 << 0 = 1

结果就是

0 0 0 0 0 1 0 1

0 0 0 0 0 0 0 1

!(BITS[0] & 1) = 0

代表数据不算为NULL.

第二个字段:

(BITS)[(ATT) >> 3] = BITS[0]

(ATT) & 0x07 = 1 & 0x07 = 1

1 << 1 = 2

结果就是

0 0 0 0 0 1 0 1

0 0 0 0 0 0 1 0

!(BITS[0] & 2) = 1

代表第二个字段为NULL

我们再插入一条数据

test=# insert into test values(null,null,'abcdefg');

INSERT 0 1

test=# select * from test;

 x  |  y  |    z

----+-----+---------

 11 |     | abcdefg

 22 | 100 | abcdefg

    |     | abcdefg

(3 rows)

我们主要是看新增加的这条数据

(gdb) p	*lpp

$42 = {lp_off = 8072, lp_flags = 1, lp_len = 32}

(gdb)

新插入的数据只占了32个字节 TupleHeader+8

t_bits = 0 0 0 0 0 1 0 0

利用上面的宏也很好的算出前面2个字段为NULL

我们来看看超过8个字段的情况

test=# create table test_more_column(

test(# col1 bigint,

test(# col2 bigint,

test(# col3 bigint,

test(# col4 bigint,

test(# col5 bigint,

test(# col6 bigint,

test(# col7 bigint,

test(# col8 bigint,

test(# col9 bigint,

test(# col10 bigint

test(# );

CREATE TABLE

test=# insert into test_more_column values(1,2,null,4,5,null,7,8,null,10);

INSERT 0 1

test=# select * from test_more_column ;

 col1 | col2 | col3 | col4 | col5 | col6 | col7 | col8 | col9 | col10

------+------+------+------+------+------+------+------+------+-------

    1 |    2 |      |    4 |    5 |      |    7 |    8 |      |    10

(1 row)

test=#

首先看看item

(gdb) p	*lpp

$1 = {lp_off = 8104, lp_flags = 1, lp_len = 88}

(gdb)

总共88个字节

(gdb) p *tuple->t_data

$3 = {t_choice = {t_heap = {t_xmin = 4414, t_xmax = 0, t_field3 = {t_cid = 0, t_xvac = 0}}, t_datum = {datum_len_ = 4414, datum_typmod = 0, datum_typeid = 0}}, t_ctid = {ip_blkid =

 {bi_hi = 0, bi_lo = 0}, ip_posid = 1}, t_infomask2 = 10, t_infomask = 2305, t_hoff = 32 ' ', t_bits = 0x7f8c9af9ef5f "\333\002"}

首先看看头偏移量就改变了不是前面的24个字节。是因为字段超过了8 需要用2个bit来标记NULL，而PG又是8字节对齐所以是24+8=32

88-32=56 总共88字节减去头32 数据占56字节

7 * 8 = 56 上面总共有7个字段存储了值，每个占8字节就是56字节

(gdb) p	bp

$8 = (bits8 *) 0x7f8c9af9ef5f "\333\002"

(gdb) p	sizeof(bp)

$9 = 8

(gdb)

数组的值

(gdb) p	bp[0]

$10 = 219 '\333'

(gdb) p bp[1]

$11 = 2 '\002'

bp[0] = 1 1 0 1 1 0 1 1 = 1 +2 +8 +16 +64 +128 = 219

bp[1] = 0 0 0 0 0 0 1 0 = 2

bp[3] = 0 0 0 0 0 0 0 0 = 0

......

bp[7] = 0 0 0 0 0 0 0 0 = 0

根据上面的宏att_isnull 就能很好的判断出那个字段是NULL。这样就非常的节省了数据存储空间。

Postgres是如何管理空值的的更多相关文章

pgpool-II主备流复制的架设
1.环境 OS: CentOS release 6.4 (Final) DB: postgresql 9.3.6 pgpool服务器: pgpool 172.16.0.240 数据库主服务器:mast ...
[转帖] “王者对战”之 MySQL 8 vs PostgreSQL 10
原贴地址:https://www.oschina.net/translate/showdown-mysql-8-vs-postgresql-10?lang=chs&page=2# 英文原版地址 ...
Mysql 和 Postgresql(PGSQL) 对比
Mysql 和 Postgresql(PGSQL) 对比转载自:http://www.oschina.net/question/96003_13994 PostgreSQL与MySQL比较 MySQ ...
“王者对战”之 MySQL 8 vs PostgreSQL 10
既然 MySQL 8 和 PostgreSQL 10 已经发布了,现在是时候回顾一下这两大开源关系型数据库是如何彼此竞争的. 在这些版本之前,人们普遍认为,Postgres 在功能集表现更出色,也因其 ...
pgpool-II 高可用搭建
pgpool-II主备流复制的架设1.环境 OS: CentOS release 6.4 (Final)DB: postgresql 9.3.6pgpool服务器: pgpool 172.16.0.2 ...
如何快速搭建自己的ERP系统,4步源码快速安装odoo教程
上一篇内容:了解什么是Odoo,为二次开发做准备 1.下载odoo源码 Github地址:https://github.com/odoo/odoo Gitee地址:https://gitee.com/ ...
Centos7下安装BlockScout
简介 BlockScout是一个Elixir应用程序,允许用户搜索以太坊网络(包括所有叉子和侧链)上的交易,查看账户和余额以及验证智能合约.BlockScout为用户提供了一个全面,易于使用的界面,以 ...
第20课-数据库开发及ado.net 可空值类型,资料管理器,多条件查询,Case
第20课-数据库开发及ado.net 可空值类型,资料管理器,多条件查询,Case SqlHelper using System; using System.Collections.Generic; ...
postgres高可用学习篇二：通过pgbouncer连接池工具来管理postgres连接
安装pgbouncer yum install libevent -y yum install libevent-devel -y wget http://www.pgbouncer.org/down ...

随机推荐

Lua的函数调用和协程中，栈的变化情况
Lua的函数调用和协程中,栈的变化情况 1. lua_call / lua_pcall 对于这两个函数,对栈底是没有影响的--调用的时候,参数会被从栈中移除,当函数返回的时候,其返回值会从函数处 ...
Being a Good Boy in Spring Festival（尼姆博弈）
Being a Good Boy in Spring Festival Time Limit: 1000/1000 MS (Java/Others) Memory Limit: 32768/32768 ...
状压DP小结
看了一个多星期状压DP,总算有点明白,大概可以分为两种:数据是在矩阵中的,数据是线性的,在矩阵中的一般就是排兵布阵这一种的,还有一种线性结构中给定条件让你求最大权值,比如求最大权值路线,TSP问题等, ...
Codeforces 376B. Coupons and Discounts
B. Coupons and Discounts time limit per test 1 second memory limit per test 256 megabytes input stan ...
0_Simple__cudaOpenMP
在OpenMP的多线程程序中,各线程分别调用CUDA进行计算.OpenMP的简单示例. ▶ 源代码: #include <omp.h> #include <stdio.h> # ...
利用canvas 导出图片
1.使用canvas绘制图片,并将图片导出. 在本地直接访问静态网页时,无法使用toDataURL(),需要将网页发布后,canvas才能使用toDataURL获取画布上的内容.因为canvas不允许 ...
Cocoapods使用过程中遇到的问题
前言:记录一些在CocoaPods使用过程中遇到的问题,本地环境:Xcode9.0 发现有的时候在执行pod init的时候不能正常地创建出来pod File文件,显示的错误如下: ――― MARKD ...
向ASP.NET Core迁移
有人说.NET在国内的氛围越来越不行了,看博客园文章的浏览量也起不来.是不是要转Java呢? 没有必要扯起语言的纷争,Java也好C#都只是语言是工具,各有各的使用场景.以前是C#非开源以及不能在Li ...
EF错误
The model backing the 'XXXXDBContext' context has changed since the database was created. Either man ...
ZedBoard开发板学习记录（一）之开发环境的搭建（Ubuntu16.04）以及运行HelloWorld程序的测试
ZedBoard开发板由PL和PS两大部分组成, 对PS操作,一般有两个办法: (1).在Windows系统上面,使用SDK新建C Project SDK自带编译环境,编译后自动产生elf文件.使用U ...

Postgres是如何管理空值的

Postgres是如何管理空值的的更多相关文章

随机推荐

热门专题