elasticsearch index 之 engine

elasticsearch对于索引中的数据操作如读写get等接口都封装在engine中，同时engine还封装了索引的读写控制，如流量、错误处理等。engine是离lucene最近的一部分。

engine的实现结构如下所示：

engine接口有三个实现类，主要逻辑都在InternalEngine中。ShadowEngine之实现了engine接口的部分读方法，主要用于对于索引的读操作。shardFSEngine在InternalEngine的基础上实现了recovery方法，它的功能跟InternalEngine基本相同只是它的recovery过程有区别，不会对Translog和index进行快照存储。

Engine类定义了一些index操作的主要方法和内部类，方法如create，index等。内部类如index，delete等。这些方法的实现是在子类中，这些方法的参数是这些内部类。首先看一下它的方法：

 public abstract void create(Create create) throws EngineException;

    public abstract void index(Index index) throws EngineException;

    public abstract void delete(Delete delete) throws EngineException;

    public abstract void delete(DeleteByQuery delete) throws EngineException;

这些抽象方法都在子类中实现，它们的参数都是一类，这些都是Engine的内部类，这些内部类类似于实体类，没有相关逻辑只是由很多filed及get方法构成。如Create和Index都继承自IndexOperation，它们所有信息都存储到IndexOperation的相关Field中，IndexOperation如下所示：

 public static abstract class IndexingOperation implements Operation {

        private final DocumentMapper docMapper;

        private final Term uid;

        private final ParsedDocument doc;

        private long version;

        private final VersionType versionType;

        private final Origin origin;

        private final boolean canHaveDuplicates;

        private final long startTime;

        private long endTime;

    ………………

}

无论是Index还是Create，相关数据和配置都在doc中，根据doc和docMapper就能够获取本次操作的所有信息，另外的一些字段如version，uid都是在类初始化时构建。这样传给实际方法的是一个class，在方法内部根据需求获取到相应的数据，如index方法的实现：

    private void innerIndex(Index index) throws IOException {

        synchronized (dirtyLock(index.uid())) {

            final long currentVersion;

            VersionValue versionValue = versionMap.getUnderLock(index.uid().bytes());

            if (versionValue == null) {

                currentVersion = loadCurrentVersionFromIndex(index.uid());

            } else {

                if (engineConfig.isEnableGcDeletes() && versionValue.delete() && (engineConfig.getThreadPool().estimatedTimeInMillis() - versionValue.time()) > engineConfig.getGcDeletesInMillis()) {

                    currentVersion = Versions.NOT_FOUND; // deleted, and GC

                } else {

                    currentVersion = versionValue.version();

                }

            }

            long updatedVersion;

            long expectedVersion = index.version();

            if (index.versionType().isVersionConflictForWrites(currentVersion, expectedVersion)) {

                if (index.origin() == Operation.Origin.RECOVERY) {

                    return;

                } else {

                    throw new VersionConflictEngineException(shardId, index.type(), index.id(), currentVersion, expectedVersion);

                }

            }

            updatedVersion = index.versionType().updateVersion(currentVersion, expectedVersion);

            index.updateVersion(updatedVersion);

            if (currentVersion == Versions.NOT_FOUND) {

                // document does not exists, we can optimize for create

                index.created(true);

                if (index.docs().size() > 1) {

                    indexWriter.addDocuments(index.docs(), index.analyzer());

                } else {

                    indexWriter.addDocument(index.docs().get(0), index.analyzer());

                }

            } else {

                if (versionValue != null) {

                    index.created(versionValue.delete()); // we have a delete which is not GC'ed...

                }

                if (index.docs().size() > 1) {

                    indexWriter.updateDocuments(index.uid(), index.docs(), index.analyzer());//获取IndexOperation中doc中字段更新索引

                } else {

                    indexWriter.updateDocument(index.uid(), index.docs().get(0), index.analyzer());

                }

            }

            Translog.Location translogLocation = translog.add(new Translog.Index(index));//写translog

            versionMap.putUnderLock(index.uid().bytes(), new VersionValue(updatedVersion, translogLocation));

            indexingService.postIndexUnderLock(index);

        }

    }

这就是Engine中create、index这些方法的实现方式。后面分析索引过程中会有更加详细说明。Engine中还有获取索引状态（元数据）及索引操作的方法如merge。这些方法也是在子类中调用lucene的相关接口，跟create，index，get很类似。因为没有深入Engine的方法实现，因此这里的分析比较简单，后面的分析会涉及这里面很多方法。

总结：这里只是从结构上对indexEngine进行了简单说明，它里面的方法是es对lucene索引操作方法的封装，只是增加了一下处理方面的逻辑如写translog，异常处理等。它的操作对象是shard，es所有对shard的写操作都是通过Engine来实现，后面的分析会有所体现。

elasticsearch index 之 engine的更多相关文章

ElasticSearch Index操作源码分析
ElasticSearch Index操作源码分析本文记录ElasticSearch创建索引执行源码流程.从执行流程角度看一下创建索引会涉及到哪些服务(比如AllocationService.Mas ...
elasticsearch index 之 put mapping
elasticsearch index 之 put mapping mapping机制使得elasticsearch索引数据变的更加灵活,近乎于no schema.mapping可以在建立索引时设 ...
elasticsearch index 功能源码概述
从本篇开始,对elasticsearch的介绍将进入数据功能部分(index),这一部分包括索引的创建,管理,数据索引及搜索等相关功能.对于这一部分的介绍,首先对各个功能模块的分析,然后详细分析数据索 ...
Add mappings to an Elasticsearch index in realtime
Changing mapping on existing index is not an easy task. You may find the reason and possible solutio ...
ElasticSearch Index API && Mapping
ElasticSearch NEST Client 操作Index var indexName="twitter"; var deleteIndexResponse = clie ...
Elasticsearch Index模块
1. Index Setting(索引设置) 每个索引都可以设置索引级别.可选值有: static :只能在索引创建的时候,或者在一个关闭的索引上设置 dynamic:可以动态设置 1.1. S ...
Elasticsearch index fields 重命名
reindex数据复制,重索引 POST _reindex { "source": { "index": "twitter" }, &quo ...
elasticsearch index tuning
一.扩容 tag_server当前使用ElasticSearch版本为5.6,此版本单个index的分片是固定的,一旦创建后不能更改. 1.扩容方法1,不适 ES6.1支持split index功能, ...
elasticsearch index 之 create index（-）
从本篇开始,就进入了Index的核心代码部分.这里首先分析一下索引的创建过程.elasticsearch中的索引是多个分片的集合,它只是逻辑上的索引,并不具备实际的索引功能,所有对数据的操作最终还是由 ...

随机推荐

python-网络-tcp
python-网络-tcp 标签(空格分隔): python TCP[client]-发送数据 from socket import * s = socket(AF_INET, SOCK_STREAM ...
SharePoint 修改完或制作完一定要发布
设置了匿名访问但是网站就是需要登录,找了很多问题. 首先想到的映射问题,然后努力检查,最后把代码删掉,然后把站删掉,最后测试出来问题. 点击上方[网站设置] 把修改过的文件发布. 母版也和布局页一定 ...
模拟select样式，自定义下拉列表为树结构
效果图如下: 首先,需要用到的库jQuery,zTree(官网API:http://www.treejs.cn/v3/api.php) 注意:因为zTree是基于jQuery的,所以应该先引入jQue ...
Pepper plugin implementation
For Developers‎ > ‎Design Documents‎ > ‎ Pepper plugin implementation This document provides a ...
caffe(5) 其他常用层及参数
本文讲解一些其它的常用层,包括:softmax_loss层,Inner Product层,accuracy层,reshape层和dropout层及其它们的参数配置. 1.softmax-loss so ...
洛谷 P2542 [AHOI2005]航线规划树链剖分_线段树_时光倒流_离线
Code: #include <map> #include <cstdio> #include <algorithm> #include <cstring&g ...
[SCOI2008]着色方案递推记忆化搜索
我们发现 $c_{i}$ 和 $k$ 的规模非常小我们还发现每种颜色的位置是不必知道的,只要这种颜色和相邻的颜色种类不同即可.定义状态 $f[a][b][c][d][e][last]$,代表有 $a$ ...
Generational GC (Part one )
目录什么是分代垃圾回收对象对的年龄新生代对象和老年对象 Ungar的分带垃圾回收堆的结构记录集写入屏障对象的结构分配新生代GC 幸存空间沾满了怎么办? 老年代GC 优缺点吞吐量得到 ...
首家5G体验厅在深圳建成
日前,深圳移动卓越时代营业厅推出5G全方位体验活动,让市民亲身感受5G时代到来.据悉,十大5G展示项目生动展现移动5G带来的生活巨变与产业升级,为5G发展汇聚各界力量加速创新落地. 现场有市民表示,5 ...
WHU 1548 Home 2-SAT
---恢复内容开始--- 题意: N个人想回家在至少一个时刻.至多两个时刻.并且,他们每个人都能独自回家. 定义:ai表示第i个人回家的时间, xij = abs(ai - aj) (i != j). ...

elasticsearch index 之 engine

elasticsearch index 之 engine的更多相关文章

随机推荐

热门专题