levelDB跳表实现

跳表的原理就是利用随机性建立索引，加速搜索，并且简化代码实现难度。具体的跳表原理不再赘述，主要是看了levelDB有一些实现细节的东西，凸显自己写的实现不足之处。

去除冗余的key

  template<typename Key, class Comparator>

  struct SkipList<Key,Comparator>::Node {

    explicit Node(const Key& k) : key(k) { }

    Key const key;

    // Accessors/mutators for links.  Wrapped in methods so we can

    // add the appropriate barriers as necessary.

    Node* Next(int n) {

      assert(n >= 0);

      // Use an 'acquire load' so that we observe a fully initialized

      // version of the returned Node.

      return reinterpret_cast<Node*>(next_[n].Acquire_Load());

    }

    void SetNext(int n, Node* x) {

      assert(n >= 0);

      // Use a 'release store' so that anybody who reads through this

      // pointer observes a fully initialized version of the inserted node.

      next_[n].Release_Store(x);

    }

    // No-barrier variants that can be safely used in a few locations.

    Node* NoBarrier_Next(int n) {

      assert(n >= 0);

      return reinterpret_cast<Node*>(next_[n].NoBarrier_Load());

    }

    void NoBarrier_SetNext(int n, Node* x) {

      assert(n >= 0);

      next_[n].NoBarrier_Store(x);

    }

   private:

    // Array of length equal to the node height.  next_[0] is lowest level link.

    port::AtomicPointer next_[1];

  };

这里使用一个Node节点表示所有相同key，不同高度的节点集合，仅保留了key和不同高度的向右指针，并且使用NewNode来动态分配随即高度的向右指针集合，而next_就指向这指针集合。这也是c/c++ tricky的地方。

  #include <stdio.h>

  struct Node {

  	char str[1];

  };

  int main() {

  	char* mem = new char[4];

  	for (int i = 0; i < 4; i++) {

  		mem[i] = i + '0';

  	}

  	Node* node = (Node*)mem;

  	char* const pstr = node->str;

  	for (int i = 0; i < 4; i++) {

  		printf("%c", pstr[i]);

  	}

  	return 0;

  }

就像上面这个简单的sample，成员str可以作为指针指向从数组下标0开始的元素，并且不受申明时的限制，不局限于大小1，索引至分配的最大的内存地址。

简易随机数生成

  uint32_t Next() {

      static const uint32_t M = 2147483647L;   // 2^31-1

      static const uint64_t A = 16807;  // bits 14, 8, 7, 5, 2, 1, 0

      // We are computing

      //       seed_ = (seed_ * A) % M,    where M = 2^31-1

      //

      // seed_ must not be zero or M, or else all subsequent computed values

      // will be zero or M respectively.  For all other values, seed_ will end

      // up cycling through every number in [1,M-1]

      uint64_t product = seed_ * A;

      // Compute (product % M) using the fact that ((x << 31) % M) == x.

      seed_ = static_cast<uint32_t>((product >> 31) + (product & M));

      // The first reduction may overflow by 1 bit, so we may need to

      // repeat.  mod == M is not possible; using > allows the faster

      // sign-bit-based test.

      if (seed_ > M) {

        seed_ -= M;

      }

      return seed_;

  }

可以看到，他使用A和M对种子进行运算，达到一定数据范围内不会重复的数集，而里面对于(product % M)，使用(product >> 31) + (product & M)进行运算优化，考虑右移和与操作的代价远小于取余操作。

简洁清晰的私有帮助方法，帮助寻找小于指定key的节点

  template<typename Key, class Comparator>

  typename SkipList<Key,Comparator>::Node*

  SkipList<Key,Comparator>::FindLessThan(const Key& key) const {

    Node* x = head_;

    int level = GetMaxHeight() - 1;

    while (true) {

      assert(x == head_ || compare_(x->key, key) < 0);

      Node* next = x->Next(level);

      if (next == NULL || compare_(next->key, key) >= 0) {

        if (level == 0) {

          return x;

        } else {

          // Switch to next list

          level--;

  	  }

      } else {

        x = next;

      }

    }

  }

levelDB跳表实现的更多相关文章

LevelDB学习笔记 (3): 长文解析memtable、跳表和内存池Arena
LevelDB学习笔记 (3): 长文解析memtable.跳表和内存池Arena 1. MemTable的基本信息我们前面说过leveldb的所有数据都会先写入memtable中,在leveldb ...
SkipList 跳表
1.定义描述跳跃列表(也称跳表)是一种随机化数据结构,基于并联的链表,其效率可比拟于二叉查找树(对于大多数操作需要O(log n)平均时间). 基本上,跳跃列表是对有序的链表增加 ...
[转载] 跳表SkipList
原文: http://www.cnblogs.com/xuqiang/archive/2011/05/22/2053516.html leveldb中memtable的思想本质上是一个skiplist ...
skiplist 跳表（1）
最近学习中遇到一种新的数据结构,很实用,搬过来学习. 原文地址:skiplist 跳表为什么选择跳表目前经常使用的平衡数据结构有:B树,红黑树,AVL树,Splay Tree, Treep等. ...
SkipList跳表基本原理
为什么选择跳表目前经常使用的平衡数据结构有:B树,红黑树,AVL树,Splay Tree, Treep等. 想象一下,给你一张草稿纸,一只笔,一个编辑器,你能立即实现一颗红黑树,或者AVL树出来吗 ...
C语言跳表(skiplist)实现
一.简介跳表(skiplist)是一个非常优秀的数据结构,实现简单,插入.删除.查找的复杂度均为O(logN).LevelDB的核心数据结构是用跳表实现的,redis的sorted set数据结构也 ...
K：跳表
跳表(SkipList)是一种随机化的数据结构,目前在redis和leveldb中都有用到它,它的效率和红黑树以及 AVL 树不相上下,但跳表的原理相当简单,只要你能熟练操作链表, 就能轻松实现一 ...
SkipList跳表（一）基本原理
一直听说跳表这个数据结构,说要学一下的,懒癌犯了,是该治治了为什么选择跳表目前经常使用的平衡数据结构有:B树.红黑树,AVL树,Splay Tree(这个树好像还没有听说过),Treep(也没有听 ...
深入理解跳表在Redis中的应用
本文首发于:深入理解跳表在Redis中的应用微信公众号:后端技术指南针持续输出干货欢迎关注前面写了一篇关于跳表基本原理和特性的文章,本次继续介绍跳表的概率平衡和工程实现, 跳表在Redis.Lev ...

随机推荐

easyui form提交文件(上传图片和文件)
<div id="dialogBtn"> <a class="easyui-linkbutton" href="#" on ...
poj 3273 Monthly Expence 简单二分
/** 大意: 有连续的n天,每一天有一定的花费,将其分成m份,每一份占一天或者连续的几天,求这m份中的最大值思路: 二分其最大上限,看在此最大上线,能分成多少份,若大于m份,说明上限过小,需要扩大 ...
PCIE卡槽还能这样用！
前几天去电脑城装了台i5的主机,当时就发现主板上只有2个PCIex2的槽.但奇怪的是2个还长的不一样,一个屁股后面是开口的:) 问装机的小伙子,他也不懂,而且就这电脑操作水平都跟哥差远了,让他给我硬盘 ...
09-C语言数组
目录: 一.使用xcode编辑工具二.数组三.数组遍历四.多维数组回到顶部一.使用xcode编辑工具 1 打开xcode程序 2 创建一个项目 OSX -> Application - ...
高质量程序设计指南C/C++语言——有了malloc/free为什么还要new/delete？
深究带PLL的错误复位设计
PLL复位通常犯的错误或者是像上一篇文章 FPGA知识大梳理(四)FPGA中的复位系统大汇总中的图一样,也是错误设计.为何呢?看ALTPLL (Phase-Locked Loop) IP Cor ...
Mathematica
Mathematica是一款科学计算软件,很好地结合了数值和符号计算引擎.图形系统.编程语言.文本系统.和与其他应用程序的高级连接.很多功能在相应领域内处于世界领先地位,它也是使用最广泛的数学软件之一 ...
24_Core Data Demo
今天开始学习Core Data,类似于数据库,可以永久保存数据.不过当把App从iPhone删掉之后就没有了.可以用来保存App的运行数据. 参考链接:iOS Swift教程 Core Data 概述 ...
delphi 文件夹权限设置（执行一个小脚本的笨办法）
如题,研究了一天,也没再网上找到比较好的方式,自己做了一个.方法如下: 1.创建一个 cmd 命令文件.2.调用该命令. 代码如下: S:='echo y|cacls h: /t /c /g ev ...
用elasticsearch索引mongodb数据
参照网页:单机搭建elasticsearch和mongodb的river 三个步骤: 一,搭建单机replicSet二,安装mongodb-river插件三,创建meta,验证使用第一步,搭建单机m ...

levelDB跳表实现

levelDB跳表实现的更多相关文章

随机推荐

热门专题