前言

最近写的一个个人项目(传送门：全终端云书签)中需要用到全文检索功能，目前 mysql，es 都可以做全文检索，mysql 胜在配置方便很快就能搞定上线(参考这里),不考虑上手难度，es 在全文检索方面是完胜 mysql 的。

最后决定使用 es。使用最新的 7.2 版本。java 客户端使用 es 官方的 high level client(官方文档),为什么用这个有以下几点原因：

jest 毕竟不是官方的，更新速度较慢
transportClient,速度太慢，连官方都嫌弃它了。在 7.x 中已经被弃用，8.x 中将完全删除
high level client 的官方文档写的很清楚明了，虽然目前相关的中文资料还很少，也能够上手用起来

本文主要内容如下：

docker 部署 es(支持 ik 中文分词)
在 springboot 中进行增删改查

docker 部署 es(基于 linux)

es 的中文分词目前比较流行的分词插件为 ik(github 地址)。由于手写 docker 命令太繁杂，这里用 docker-compose 来管理。假定当前在/root 目录下

下载 ik release 到/root/es/ik 目录下，并解压到当前文件夹。
创建/root/es/data 目录，并将读写权限给所有用户.本目录用于存放 es 数据。由于 es 不能以 root 用户执行，所以对于此目录需要将读写权限给其他用户。

编写 es 配置文件，7.2 的配置文件变化还是较大的（之前用的是 2.x 版本），一个简单的配置如下：

cluster.name: elasticsearch

 # 配置的集群名称，默认是 elasticsearch，es 服务会通过广播方式自动连接在同一网段下的 es 服务，通过多播方式进行通信，同一网段下可以有多个集群，通过集群名称这个属性来区分不同的集群。

 node.name: bookmark-world

 # 当前配置所在机器的节点名，你不设置就默认随机指定一个 name 列表中名字，该 name 列表在 es 的 jar 包中 config 文件夹里 name.txt 文件中，其中有很多作者添加的有趣名字。

 node.master: true

 # 指定该节点是否有资格被选举成为 node（注意这里只是设置成有资格， 不代表该 node 一定就是 master），默认是 true，es 是默认集群中的第一台机器为 master，如果这台机挂了就会重新选举 master。

 node.data: true

 # 指定该节点是否存储索引数据，默认为 true。

 bootstrap.memory_lock: false

 # 设置为 true 来锁住内存不进行 swapping。因为当 jvm 开始 swapping 时 es 的效率 会降低，所以要保证它不 swap，可以把 ES_MIN_MEM 和 ES_MAX_MEM 两个环境变量设置成同一个值，并且保证机器有足够的内存分配给 es。 同时也要允许 elasticsearch 的进程可以锁住内存，linux 下启动 es 之前可以通过`ulimit -l unlimited`命令设置。

 # 设置为 true，会导致报警告实际未锁定内存，进而退出进程(es在生产模式下有警告就会退出)

 network.bind_host: 0.0.0.0

 # 设置绑定的 ip 地址，可以是 ipv4 或 ipv6 的，默认为 0.0.0.0，绑定这台机器的任何一个 ip。

 # 集群配置

 discovery.seed_hosts:

   - bookmark-es

 cluster.initial_master_nodes:

   - bookmark-world

编写/root/docker-compose.yml

version: "2"

services:

  bookmark-es:

  image: docker.elastic.co/elasticsearch/elasticsearch:7.2.0

container_name: bookmark-es

volumes:

  - /etc/localtime:/etc/localtime

  - ./es/data:/usr/share/elasticsearch/data

  - ./es/ik:/usr/share/elasticsearch/plugins/ik

  - ./es/elasticsearch.yml:/usr/share/elasticsearch/config/elasticsearch.yml

ports:

  - 9200:9200

  - 9300:9300

执行 docker-compose up -d启动 es

详细可参考这里：云书签 docker 部署。

springboot 整合

创建 springboot 项目

首先创建一个 springboot 项目，然后引入high level client的依赖，pom 文件如下：

<?xml version="1.0" encoding="UTF-8"?>

<project xmlns="http://maven.apache.org/POM/4.0.0" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance"

         xsi:schemaLocation="http://maven.apache.org/POM/4.0.0 http://maven.apache.org/xsd/maven-4.0.0.xsd">

    <modelVersion>4.0.0</modelVersion>

    <parent>

        <groupId>org.springframework.boot</groupId>

        <artifactId>spring-boot-starter-parent</artifactId>

        <version>2.1.6.RELEASE</version>

        <relativePath/> <!-- lookup parent from repository -->

    </parent>

    <groupId>com.fanxb</groupId>

    <artifactId>es-demo</artifactId>

    <version>0.0.1-SNAPSHOT</version>

    <name>es-demo</name>

    <description>Elasticsearch Demo project for Spring Boot</description>

    <properties>

        <java.version>1.8</java.version>

    </properties>

    <!--注意：如果使用了parent那么需要在此定义es版本号,因为spring-boot-start-parent中已经定义了es相关依赖的版本号

    ，high-level-client中的部分依赖会被覆盖成低版本的,导出出现莫名其妙的错误-->

    <dependencyManagement>

        <dependencies>

            <dependency>

                <groupId>org.elasticsearch.client</groupId>

                <artifactId>elasticsearch-rest-high-level-client</artifactId>

                <version>7.2.0</version>

            </dependency>

            <!-- https://mvnrepository.com/artifact/org.elasticsearch/elasticsearch -->

            <dependency>

                <groupId>org.elasticsearch</groupId>

                <artifactId>elasticsearch</artifactId>

                <version>7.2.0</version>

            </dependency>

            <!--&lt;!&ndash; https://mvnrepository.com/artifact/org.elasticsearch.client/elasticsearch-rest-client &ndash;&gt;-->

            <dependency>

                <groupId>org.elasticsearch.client</groupId>

                <artifactId>elasticsearch-rest-client</artifactId>

                <version>7.2.0</version>

            </dependency>

        </dependencies>

    </dependencyManagement>

    <dependencies>

        <dependency>

            <groupId>org.springframework.boot</groupId>

            <artifactId>spring-boot-starter-web</artifactId>

        </dependency>

        <dependency>

            <groupId>org.elasticsearch.client</groupId>

            <artifactId>elasticsearch-rest-high-level-client</artifactId>

            <version>7.2.0</version>

        </dependency>

        <dependency>

            <groupId>com.alibaba</groupId>

            <artifactId>fastjson</artifactId>

            <version>1.2.56</version>

        </dependency>

        <dependency>

            <groupId>org.springframework.boot</groupId>

            <artifactId>spring-boot-starter-test</artifactId>

            <scope>test</scope>

        </dependency>

    </dependencies>

    <build>

        <plugins>

            <plugin>

                <groupId>org.springframework.boot</groupId>

                <artifactId>spring-boot-maven-plugin</artifactId>

            </plugin>

        </plugins>

    </build>

</project>

注意:这里有一个依赖的大坑，要注意!

如果定义了<parent>，就必须在<dependencyManagement>中指定部分依赖的版本，否则会因为依赖版本不对出现各种莫名其妙的错误,上面注释中已经指出。

创建 util/EsUtil.java 工具类

主要功能函数如下：

预创建 index

虽然 es 在插入数据时会自动根据字段类型来创建字段定义，但是自动创建并不总是和需要相符的，比如想让某个字段不分词，或者使用其他的分词器。所以在代码中先判断 index(es7 中已经废弃了 mapping，也就是一个 index 相当于一个表)是否存在，如果不存在就创建 index.

主要代码如下：

//被@PostConstruct注释的方法将会在对应类注入到Spring后调用,确保index的生成

@PostConstruct

public void init() {

    try {

        if (client != null) {

            client.close();

        }

        client = new RestHighLevelClient(RestClient.builder(new HttpHost(host, port, scheme)));

        if (this.indexExist(INDEX_NAME)) {

            return;

        }

        CreateIndexRequest request = new CreateIndexRequest(INDEX_NAME);

        request.settings(Settings.builder().put("index.number_of_shards", 3).put("index.number_of_replicas", 2));

        request.mapping(CREATE_INDEX, XContentType.JSON);

        CreateIndexResponse res = client.indices().create(request, RequestOptions.DEFAULT);

        if (!res.isAcknowledged()) {

            throw new RuntimeException("初始化失败");

        }

    } catch (Exception e) {

        e.printStackTrace();

        System.exit(0);

    }

}

插入或者更新一个对象

通过指定 id，如果此 id 存在那么就是更新，否则是插入。

public void insertOrUpdateOne(String index, EsEntity entity) {

    IndexRequest request = new IndexRequest(index);

    request.id(entity.getId());

    request.source(JSON.toJSONString(entity.getData()), XContentType.JSON);

    try {

        client.index(request, RequestOptions.DEFAULT);

    } catch (Exception e) {

        throw new RuntimeException(e);

    }

}

批量插入

high level client 提供了方便的批量操作接口，如下所示：

public void insertBatch(String index, List<EsEntity> list) {

    BulkRequest request = new BulkRequest();

    list.forEach(item -> request.add(new IndexRequest(index).id(item.getId())

            .source(JSON.toJSONString(item.getData()), XContentType.JSON)));

    try {

        client.bulk(request, RequestOptions.DEFAULT);

    } catch (Exception e) {

        throw new RuntimeException(e);

    }

}

批量删除

和上面一样同样用到了BulkRequest

public <T> void deleteBatch(String index, Collection<T> idList) {

    BulkRequest request = new BulkRequest();

    idList.forEach(item -> request.add(new DeleteRequest(index, item.toString())));

    try {

        client.bulk(request, RequestOptions.DEFAULT);

    } catch (Exception e) {

        throw new RuntimeException(e);

    }

}

搜索

通过构建SearchSourceBuilder查询参数

public <T> List<T> search(String index, SearchSourceBuilder builder, Class<T> c) {

    SearchRequest request = new SearchRequest(index);

    request.source(builder);

    try {

        SearchResponse response = client.search(request, RequestOptions.DEFAULT);

        SearchHit[] hits = response.getHits().getHits();

        List<T> res = new ArrayList<>(hits.length);

        for (SearchHit hit : hits) {

            res.add(JSON.parseObject(hit.getSourceAsString(), c));

        }

        return res;

    } catch (Exception e) {

        throw new RuntimeException(e);

    }

}

delete by query

es 插入数据容易，删除就比较麻烦了，特别是根据条件删除。

public void deleteByQuery(String index, QueryBuilder builder) {

    DeleteByQueryRequest request = new DeleteByQueryRequest(index);

    request.setQuery(builder);

    //设置批量操作数量,最大为10000

    request.setBatchSize(10000);

    request.setConflicts("proceed");

    try {

        client.deleteByQuery(request, RequestOptions.DEFAULT);

    } catch (Exception e) {

        throw new RuntimeException(e);

    }

}

结束

可通过测试类com.fanxb.esdemo.service.BookServiceTest查看运行结果。

源码地址：github

本文原创发布于：https://www.tapme.top/blog/detail/2019-07-29-14-59

springboot整合elasticsearch(基于es7.2和官方high level client)的更多相关文章

SpringBoot整合ElasticSearch实现多版本的兼容
前言在上一篇学习SpringBoot中,整合了Mybatis.Druid和PageHelper并实现了多数据源的操作.本篇主要是介绍和使用目前最火的搜索引擎ElastiSearch,并和Spring ...
ElasticSearch(2)---SpringBoot整合ElasticSearch
SpringBoot整合ElasticSearch 一.基于spring-boot-starter-data-elasticsearch整合开发环境:springboot版本:2.0.1,elast ...
springboot整合elasticsearch入门例子
springboot整合elasticsearch入门例子 https://blog.csdn.net/tianyaleixiaowu/article/details/72833940 Elastic ...
Springboot整合elasticsearch以及接口开发
Springboot整合elasticsearch以及接口开发搭建elasticsearch集群搭建过程略(我这里用的是elasticsearch5.5.2版本) 写入测试数据新建索引book( ...
SpringBoot整合Elasticsearch详细步骤以及代码示例（附源码）
准备工作环境准备 JAVA版本 java version "1.8.0_121" Java(TM) SE Runtime Environment (build 1.8.0_121 ...
Springboot整合Elasticsearch报错availableProcessors is already set to [4], rejecting [4]
Springboot整合Elasticsearch报错今天使用SpringBoot整合Elasticsearch时候,相关的配置完成后,启动项目就报错了. nested exception is j ...
Springboot整合ElasticSearch进行简单的测试及用Kibana进行查看
一.前言搜索引擎还是在电商项目.百度.还有技术博客中广泛应用,使用最多的还是ElasticSearch,Solr在大数据量下检索性能不如ElasticSearch.今天和大家一起搭建一下,小编是看完 ...
😊SpringBoot 整合 Elasticsearch （超详细）.md
SpringBoot 整合 Elasticsearch (超详细) 注意: 1.环境搭建安装es Elasticsearch 6.4.3 下载链接为了方便,环境使用Windows 配置解压后配置 ...
SpringBoot整合elasticsearch
在这一篇文章开始之前,你需要先安装一个ElasticSearch,如果你是mac或者linux可以参考https://www.jianshu.com/p/e47b451375ea,如果是windows ...

随机推荐

tkinter + cefpython 仿美团桌面程序
使用js开发桌面程序目前是一个趋势,Electron是其中一个佼佼者,网上也不乏很多文章.今天主要是来讲一下cefpython. 用python的朋友,特别使用过tkinter开发过界面的,一定会觉得 ...
EXPLAIN说明
列名类型解释 id SELECT语句的ID编号,优先执行编号较大的查询,如果编号相同,则从上向下执行 select_type SIMPLE 一条没有UNION或子查询部分的SELECT语句 P ...
spark 源码分析之五 -- Spark内置RPC机制剖析之一创建NettyRpcEnv
在前面源码剖析介绍中,spark 源码分析之二 -- SparkContext 的初始化过程中的SparkEnv和 spark 源码分析之四 -- TaskScheduler的创建和启动过程中的C ...
002-pythn基础-循环、编码
1. 循环 while 条件: 代码块(循环体) else: 当上面的条件为假. 才会执行执行顺序: 判断条件是否为真. 如果真. 执行循环体. 然后再次判断条件....直到循环条件为假. 程序退出 ...
java word转html 报错 org/apache/poi/xwpf/usermodel/IRunBody
最终解决的办法是修改jar包版本,一定要对应上. <dependency> <groupId>org.apache.poi</groupId> <artifa ...
Django项目的创建和管理
1.主题这部分教程主要介绍如何通过Pycharm创建.管理.运行一个Django工程.对于Django模块的相关知识大家可以参考Python社区. 2.准备环境 Django版本为2.0或更高 Py ...
spring boot freemarker 导出word 带echarts图形报表
创建word文件内容如下将word导出为xml格式将文件后缀名改为 .ftl 在springboot项目中添加freemarker依赖  <dep ...
《C#并发编程经典实例》学习笔记—2.7 避免上下文延续
避免上下文延续在默认情况下,一个 async 方法在被 await 调用后恢复运行时,会在原来的上下文中运行. 为了避免在上下文中恢复运行,可让 await 调用 ConfigureAwait 方法 ...
kuangbin专题专题二搜索进阶 Nightmare Ⅱ HDU - 3085
题目链接:https://vjudge.net/problem/HDU-3085 题意:有两个鬼和两个人和墙,鬼先走,人再走,鬼每走过的地方都会复制一个新鬼, 但新鬼只能等待旧鬼走完一次行程之后,下一 ...
c++学习书籍推荐《C++ Primer Plus中文版(第6版)》下载
百度云及其他网盘下载地址:点我编辑推荐一本经久不衰的C++畅销经典教程:一本支持C++11新标准的程序设计图书. 它被誉为“开发人员学习C++的教程,没有之一”! Amazon网站“Langua ...

springboot整合elasticsearch(基于es7.2和官方high level client)

前言