26.bulk批量操作

主要知识点

1、bulk语法

2、bulk使用时的注意事项

3、bulk size 对es性能的影响

一、bulk语法

每一个操作要两个json串（delete操作除外），每个json串占一行不能换行，语法如下：

{"action": {"metadata"}}

{"data"}

具体写法如下：

{"index": {"_index": "test_index", "_type", "test_type", "_id": "1"}}

{"test_field1": "test1", "test_field2": "test2"}

二、用法举例

POST /_bulk

{"delete":{"_index":"test_index","_type":"test_type","_id":10}}

{"create":{"_index":"test_index","_type":"test_type","_id":12}}

{"test_field": "test field 12"}

{"create":{"_index":"test_index","_type":"test_type","_id":2}}

{"test_field": "recreate test field 2"}

{"index":{"_index":"test_index","_type":"test_type","_id":2,"_retry_on_conflict":3}}

{"test_field2": "reindex test field2"}

{"update": {"_index":"test_index","_type":"test_type","_id":1}}

{"doc":{"test_field1": "partial update test field2"}}

执行结果如下：

{

"took": 189,

"errors": true,

"items": [

{

"delete": {

"found": true,

"_index": "test_index",

"_type": "test_type",

"_id": "10",

"_version": 2,

"result": "deleted",

"_shards": {

"total": 2,

"successful": 1,

"failed": 0

"status": 200

}

{

"create": {

"_index": "test_index",

"_type": "test_type",

"_id": "14",

"_version": 1,

"result": "created",

"_shards": {

"total": 2,

"successful": 1,

"failed": 0

"created": true,

"status": 201

}

{

"create": {

"_index": "test_index",

"_type": "test_type",

"_id": "2",

"status": 409,

"error": {

"type": "version_conflict_engine_exception",

"reason": "[test_type][2]: version conflict, document already exists (current version [5])",

"index_uuid": "d5YEp9EjTKevAC315oXfwA",

"shard": "2",

"index": "test_index"

}

{

"index": {

"_index": "test_index",

"_type": "test_type",

"_id": "2",

"_version": 6,

"result": "updated",

"_shards": {

"total": 2,

"successful": 1,

"failed": 0

"created": false,

"status": 200

}

{

"update": {

"_index": "test_index",

"_type": "test_type",

"_id": "1",

"_version": 3,

"result": "noop",

"_shards": {

"total": 2,

"successful": 1,

"failed": 0

"status": 200

}

]

}

可以看出，第一个deleter操作成功，第二个create操作成功，第三个create操作不成功（因为原_id=2的document已存在）第四个index操作成功，做全量替换，第五个update操作成功，

三、bulk可以执行的操作类型

（1）delete：删除一个文档，只要1个json串就可以了

（2）create：PUT /index/type/id/_create，强制创建

（3）index：普通的put操作，可以是创建文档，也可以是全量替换文档

（4）update：执行的partial update操作

四、其他写法

（1）当所有的操作都作用于同一个index时：

POST /test_index/_bulk

{ "delete": { "_type": "test_type", "_id": "3" }}

{ "create": { "_type": "test_type", "_id": "12" }}

{ "test_field": "test12" }

{ "index": { "_type": "test_type" }}

{ "test_field": "auto-generate id test" }

{ "index": { "_type": "test_type", "_id": "2" }}

{ "test_field": "replaced test2" }

{ "update": { "_type": "test_type", "_id": "1", "_retry_on_conflict" : 3} }

{ "doc" : {"test_field2" : "bulk test1"} }

（2）当所有的操作都作用于同一个index下的同一个type时：

POST /test_index/test_type/_bulk

{ "delete": { "_id": "3" }}

{ "create": { "_id": "12" }}

{ "test_field": "test12" }

{ "index": { }}

{ "test_field": "auto-generate id test" }

{ "index": { "_id": "2" }}

{ "test_field": "replaced test2" }

{ "update": { "_id": "1", "_retry_on_conflict" : 3} }

{ "doc" : {"test_field2" : "bulk test1"} }

五、bulk size对性能的影响

bulk request会加载到内存里，如果太大的话，性能反而会下降，因此需要反复尝试一个最佳的bulk size。一般从1000~5000条数据开始，尝试逐渐增加。另外，如果看大小的话，最好是在5~15MB之间。

六、 bulk语法的注意事项

1、bulk api对json的语法有严格的要求，每个json串不能换行，只能放一行，同时一个json串和一个json串之间，必须有一个换行。

2、bulk操作中，任意一个操作失败，是不会影响其他的操作的，但是在返回结果里，es会告诉异常日志

26.bulk批量操作的更多相关文章

ELK学习总结（2-4）bulk 批量操作-实现多个文档的创建、索引、更新和删除
bulk 批量操作-实现多个文档的创建.索引.更新和删除 ----------------------------------------------------------------------- ...
第三百六十三节，Python分布式爬虫打造搜索引擎Scrapy精讲—elasticsearch(搜索引擎)的mget和bulk批量操作
第三百六十三节,Python分布式爬虫打造搜索引擎Scrapy精讲—elasticsearch(搜索引擎)的mget和bulk批量操作注意:前面讲到的各种操作都是一次http请求操作一条数据,如果想 ...
四十二 Python分布式爬虫打造搜索引擎Scrapy精讲—elasticsearch(搜索引擎)的mget和bulk批量操作
注意:前面讲到的各种操作都是一次http请求操作一条数据,如果想要操作多条数据就会产生多次请求,所以就有了mget和bulk批量操作,mget和bulk批量操作是一次请求可以操作多条数据 1.mget ...
Elasticsearch之CURL命令的bulk批量操作
大家,也可去看看我下面的博客 Elasticsearch之批量操作bulk 官网上,是举例了新建一个requests文件. [hadoop@master elasticsearch-]$ pwd /h ...
ES bulk 批量操作
bulk允许在一个请求中进行多个操作(create.index.update.delete),也就是可以在一次请求裡做很多事情也由于这个关系,因此bulk的请求体和其他请求的格式会有点不同 bulk ...
elasticsearch使用bulk实现批量操作
本篇文章提供ES原生批量操作语法及使用bulk批量操作文档.文章依旧提供语法,具体实现大家根据语法,在对应处进行替换即可一.原生批量获取文档 1.获取指定文档值(1) 语法: GET /_mget ...
利用kibana插件对Elasticsearch进行批量操作
#############批量获取################# #获取所有数据 GET _mget { "docs": [ {"_index":" ...
java操作elasticsearch实现批量添加数据（bulk）
java操作elasticsearch实现批量添加主要使用了bulk 代码如下: //bulk批量操作(批量添加) @Test public void test7() throws IOExcepti ...
kibana——es的批量操作
一·_mget: 1.创建的索引如下: 2.批量查询: #查询两个 GET _mget { "docs":[ { "_index":"testdb&q ...

随机推荐

spark streaming updateStateByKey 使用方法
updateStateByKey 解释: 以DStream中的数据进行按key做reduce操作,然后对各个批次的数据进行累加在有新的数据信息进入或更新时.能够让用户保持想要的不论什么状.使用这个功 ...
[Android]Fragment源代码分析(三) 事务
Fragment管理中,不得不谈到的就是它的事务管理,它的事务管理写的很的出彩.我们先引入一个简单经常使用的Fragment事务管理代码片段: FragmentTransaction ft = thi ...
http自己定义超时检測方法、主动抛出异常
上次为了解决无网路由器超时的问题,将问题重心转移到了HttpClient. HttpUrLConnection上面,什么各种设置ReadTimeout. connectionTimeOut,还有所谓的 ...
Android系统Recovery工作原理之使用update.zip升级过程分析（九）---updater-script脚本语法简介以及执行流程【转】
本文转载自:http://blog.csdn.net/mu0206mu/article/details/7465603 Android系统Recovery工作原理之使用update.zip ...
Appium + python - input操作实例
import osimport time as t adb = 'adb shell input tap 400 500'os.system(adb) t.sleep(5) class keyeven ...
Zeppelin0.6.2+sparkR2.0.2环境搭建
0.序先吐槽一下网上旧版本的Zeppelin和R的安装,让我折腾了几个小时. 不过最终还是调通了也不容易,其实我现在一点R都没有学呢,只是刚看了一节课,但是这个工具既然出现在了Spark中,我想它还 ...
A - Supermarket
Problem description We often go to supermarkets to buy some fruits or vegetables, and on the tag the ...
# --with-http_stub_status_module模块
作用: 查看nginx的客户端状态环境检测 nginx -V 查看nginx已经编译的模块中是否包含--with-http_stub_status_module 语法: 效果
Centos 自动删除日志文件的Shell代码
#!/bin/bash # #判断文件夹内文件的大小,如果大于一定的数值,那么删除 # echo '判断文件夹内文件的大小,如果大于一定的数值,并且文件名称包含数字(年月日)的删除,那么删除' pat ...
微信公众号API使用总结
官网: https://mp.weixin.qq.com/ API: http://mp.weixin.qq.com/wiki/home/index.html 接口调试工具:h ...

26.bulk批量操作

26.bulk批量操作的更多相关文章

随机推荐

热门专题