redis源码分析之数据结构--dictionary

本文不讲hash算法，而主要是分析redis中的dict数据结构的特性--分步rehash。

首先看下数据结构：dict代表数据字典，每个数据字典有两个哈希表dictht，哈希表采用链式存储。

typedef struct dictEntry {//封装键值对

    void *key;

    union {//联合体表示不同数据类型，节省空间

        void *val;

        uint64_t u64;

        int64_t s64;

    } v;

    struct dictEntry *next;

} dictEntry;

typedef struct dictType {//字典类型，及相应的操作

    unsigned int (*hashFunction)(const void *key);

    void *(*keyDup)(void *privdata, const void *key);

    void *(*valDup)(void *privdata, const void *obj);

    int (*keyCompare)(void *privdata, const void *key1, const void *key2);

    void (*keyDestructor)(void *privdata, void *key);

    void (*valDestructor)(void *privdata, void *obj);

} dictType;

/* This is our hash table structure. Every dictionary has two of this as we

 * implement incremental rehashing, for the old to the new table. */

typedef struct dictht {//hash表

    dictEntry **table;

    unsigned long size;

    unsigned long sizemask;

    unsigned long used;

} dictht;

typedef struct dict {//数据字典

    dictType *type;

    void *privdata;

    dictht ht[2];//每个数据字典有两个hash表

    int rehashidx; /* rehashing not in progress if rehashidx == -1 */如果值为-1说明没有处于rehash的过程，否则说明指向当前正在rehash的链表的表头在字典中的索引。

    int iterators; /* number of iterators currently running */

} dict;

增加新节点函数，调用dictAddRaw，先增加节点的键，而不赋值，只有增加成功后才赋值。每次增加新节点，都要判断是否正在rehash，如果是则进行_dictRehashstep()，

/* Add an element to the target hash table */

int dictAdd(dict *d, void *key, void *val)

{

    dictEntry *entry = dictAddRaw(d,key);

    if (!entry) return DICT_ERR;

    dictSetVal(d, entry, val);

    return DICT_OK;

}

dictEntry *dictAddRaw(dict *d, void *key)

{

    int index;

    dictEntry *entry;

    dictht *ht;

    if (dictIsRehashing(d)) _dictRehashStep(d);

    /* Get the index of the new element, or -1 if

     * the element already exists. */

    if ((index = _dictKeyIndex(d, key)) == -1)

        return NULL;

    /* Allocate the memory and store the new entry */

    ht = dictIsRehashing(d) ? &d->ht[1] : &d->ht[0];//如果没有rehash，则还是在ht[0]上操作，否则将新节点加入到ht[1]上。

    entry = zmalloc(sizeof(*entry));

    entry->next = ht->table[index];

    ht->table[index] = entry;

    ht->used++;

    /* Set the hash entry fields. */

    dictSetKey(d, entry, key);

    return entry;

}

下面看一下，如何增量式rehash，

int dictRehash(dict *d, int n) {

    if (!dictIsRehashing(d)) return 0;

    while(n--) {

        dictEntry *de, *nextde;

        /* Check if we already rehashed the whole table... */

        if (d->ht[0].used == 0) {//如果表0已经为空，说明rehash完成了，释放表0

            zfree(d->ht[0].table);

            d->ht[0] = d->ht[1];

            _dictReset(&d->ht[1]);

            d->rehashidx = -1;

            return 0;

        }

        /* Note that rehashidx can't overflow as we are sure there are more

         * elements because ht[0].used != 0 */

        assert(d->ht[0].size > (unsigned)d->rehashidx);//防止越界

        while(d->ht[0].table[d->rehashidx] == NULL) d->rehashidx++;//从rehashidx+1开始执行

        de = d->ht[0].table[d->rehashidx];//取出当前链表的表头

        /* Move all the keys in this bucket from the old to the new hash HT */

        while(de) {//循环将当前链表的所以节点都从表0移除，加入到表1

            unsigned int h;

            nextde = de->next;

            /* Get the index in the new hash table */

            h = dictHashKey(d, de->key) & d->ht[1].sizemask;

            de->next = d->ht[1].table[h];//采用头插法将节点插入新表

            d->ht[1].table[h] = de;

            d->ht[0].used--;

            d->ht[1].used++;

            de = nextde;

        }

        d->ht[0].table[d->rehashidx] = NULL;

        d->rehashidx++;

    }

    return 1;

}

另外，在dictAdd函数中，调用_dictKeyIndex函数。_dictKeyIndex函数查找新的key所对应的桶的下标。_dictKeyIndex函数调用_dictExpandIfNeeded函数判断是否需要扩充ht[0]的table，如果当前正在进行增量rehash，则不扩展空间。_dictExpandIfNeeded函数调用dictExpand函数进行实际的扩充。dictExpand函数的代码如下：

/* Expand or create the hash table */

int dictExpand(dict *d, unsigned long size)

{

    dictht n; /* the new hash table */

    unsigned long realsize = _dictNextPower(size);

    /* the size is invalid if it is smaller than the number of

     * elements already inside the hash table */

    if (dictIsRehashing(d) || d->ht[0].used > size)

        return DICT_ERR;

    /* Allocate the new hash table and initialize all pointers to NULL */

    n.size = realsize;

    n.sizemask = realsize-1;

    n.table = zcalloc(realsize*sizeof(dictEntry*));

    n.used = 0;

    /* Is this the first initialization? If so it's not really a rehashing

     * we just set the first hash table so that it can accept keys. */

    if (d->ht[0].table == NULL) {

        d->ht[0] = n;

        return DICT_OK;

    }

    /* Prepare a second hash table for incremental rehashing */

    d->ht[1] = n;

    d->rehashidx = 0;

    return DICT_OK;

}

redis源码分析之数据结构--dictionary的更多相关文章

Redis源码分析-底层数据结构盘点
前段时间翻看了Redis的源代码(C语言版本,Git地址:https://github.com/antirez/redis), 过了一遍Redis数据结构,包括SDS.ADList.dict.ints ...
redis源码分析之数据结构：跳跃表
跳跃表是一种随机化的数据结构,在查找.插入和删除这些字典操作上,其效率可比拟于平衡二叉树(如红黑树),大多数操作只需要O(log n)平均时间,但它的代码以及原理更简单. 和链表.字典等数据结构被广泛 ...
redis源码分析之事务Transaction（下）
接着上一篇,这篇文章分析一下redis事务操作中multi,exec,discard三个核心命令. 原文地址:http://www.jianshu.com/p/e22615586595 看本篇文章前需 ...
Redis源码分析：serverCron - redis源码笔记
[redis源码分析]http://blog.csdn.net/column/details/redis-source.html Redis源代码重要目录 dict.c:也是很重要的两个文件,主要 ...
Redis源码分析（dict）
源码版本:redis-4.0.1 源码位置: dict.h:dictEntry.dictht.dict等数据结构定义. dict.c:创建.插入.查找等功能实现. 一.dict 简介 dict (di ...
redis源码分析之发布订阅（pub/sub）
redis算是缓存界的老大哥了,最近做的事情对redis依赖较多,使用了里面的发布订阅功能,事务功能以及SortedSet等数据结构,后面准备好好学习总结一下redis的一些知识点. 原文地址:htt ...
redis源码分析之事务Transaction（上）
这周学习了一下redis事务功能的实现原理,本来是想用一篇文章进行总结的,写完以后发现这块内容比较多,而且多个命令之间又互相依赖,放在一篇文章里一方面篇幅会比较大,另一方面文章组织结构会比较乱,不容易 ...
redis源码分析之有序集SortedSet
有序集SortedSet算是redis中一个很有特色的数据结构,通过这篇文章来总结一下这块知识点. 原文地址:http://www.jianshu.com/p/75ca5a359f9f 一.有序集So ...
Redis源码分析（intset）
源码版本:4.0.1 源码位置: intset.h:数据结构的定义 intset.c:创建.增删等操作实现 1. 整数集合简介 intset是Redis内存数据结构之一,和之前的 sds. skipl ...

随机推荐

18、nginx优化
一.性能优化概述基询imm能优化,那么在性能优化这一章,我们将分为如下几个方面做介绍 1.首先我们需要了解性能优化要考虑哪些方面. 2.然后我们需要了解性能优化必须要用到的压力测试工具ab. 3.最 ...
nginx服务学习第二章
nginx.config文件中字符串不显示高亮 nginx服务搭建完成后,查看nginx.config的时候发现没有高亮字符,要想配置文件出现高亮方便观看,需要修改一些配置文件,修改步骤如下: # m ...
Linux进程管理工具之ps
1.PS进程管理指令 ps -aux USER:用户名称 PID:进程号 %CPU:进程占用CPU的百分比 %MEM:进程占用物理内存的百分比 VSZ:进程占用的虚拟内存大小(单位:KB) RS ...
自学Python5.5-面向对象三大基本特征_继承
自学Python之路-Python基础+模块+面向对象自学Python之路-Python网络编程自学Python之路-Python并发编程+数据库+前端自学Python之路-django 自学Pyth ...
完美解决Mysql的Access denied for user 'root'@'%的'问题
背景:mysql5.6 root已授权所有数据库,执行过下面的语句 grant all privileges on *.* to 'root'@'%' identified by 'root' 当使用 ...
003-centos7:rsyslog简单配置客户端和服务器端
实现把一个主机作为客户端,把日志发送到指定的服务器端: [服务器端] 开放tcp端口,udp端口: vim /etc/rsyslog.conf: # Provides UDP syslog recep ...
1.Go-copy函数、sort排序、双向链表、list操作和双向循环链表
1.1.copy函数通过copy函数可以把一个切片内容复制到另一个切片中 (1)把长切片拷贝到短切片中 ? 1 2 3 4 5 6 7 8 9 10 11 12 package main imp ...
adb简介
Android 调试桥 (adb) 是一种功能多样的命令行工具,可让您与设备进行通信.adb 命令便于执行各种设备操作(例如安装和调试应用),并提供对 Unix shell(可用来在设备上运行各种命令 ...
Gym - 101630G The Great Wall (前缀和+树状数组+二分)
题意:有一个序列,一开始所有的元素都是ai,你可以选择两个长度相等的区间,如果某个元素被一个区间覆盖,那么变为bi,如果被两个区间都覆盖,那么变为ci.问所有区间的选择方法中产生的第k小的元素总和. ...
fedora 29 安装ALSA声音驱动
centos系列解决安装utils时遇到的问题 configure: error: this packages requires a curses library yum install ncurs ...

redis源码分析之数据结构--dictionary

redis源码分析之数据结构--dictionary的更多相关文章

随机推荐

热门专题