一、创建索引时，自定义拼音分词和ik分词

PUT /my_index

{

    "index": {

        "analysis": {

            "analyzer": {

                "ik_pinyin_analyzer": {  自定义分词name

                    "type": "custom",

                    "tokenizer": "ik_smart",

                    "filter": ["my_pinyin", "word_delimiter"]

                },

                "pinyin_analyzer": {

                    "type": "custom",

                    "tokenizer": "ik_max_word",

                    "filter": ["my_pinyin", "word_delimiter"]

                }

            },

            "filter": {

                "my_pinyin": {

                    "type" : "pinyin",

                    "keep_separate_first_letter" : false, 启用该选项时，将保留第一个字母分开，例如：刘德华> l，d，h，默认：false，注意：查询结果也许是太模糊，由于长期过频

                    "keep_full_pinyin" : true,  当启用该选项，例如：刘德华> [ liu，de，hua]，默认值：true

                    "keep_original" : true, 启用此选项时，也将保留原始输入，默认值：false

                    "limit_first_letter_length" : 16, 设置first_letter结果的最大长度，默认值：16
                    "lowercase" : true,  小写非中文字母，默认值：true
                    "remove_duplicated_term" : true  启用此选项后，将删除重复的术语以保存索引，例如：de的> de，default：false，注意：位置相关的查询可能会受到影响
} 
} 
} 
} 
}

二、创建mapping时，设置字段分词(注：相同索引下建不同的type时，相同字段名属性必须设一样)

POST /my_index/user/_mapping

{

    "user": {

        "properties": {

          "id":{

            "type":"integer"

          },

            "userName": {

              "type": "text",

              "store": "no",

              "term_vector": "with_positions_offsets",

              "analyzer": "ik_pinyin_analyzer",   自定义分词器name

              "boost": 10,

              "fielddata" : true,

              "fields": {

                    "raw": {

                        "type": "keyword"    设置keyword时，对该字段不进行分析

                    }

                }

            },

            "reason":{

              "type": "text",

              "store": "no",  字段store为true，这意味着这个field的数据将会被单独存储。这时候，如果你要求返回field1（store：yes），es会分辨出field1已经被存储了，因此不会从_source中加载，而是从field1的存储块中加载。

              "term_vector": "with_positions_offsets",

              "analyzer": "ik_pinyin_analyzer",

              "boost": 10

            }

        }

    }

}

测试

PUT /my_index/user/1

{

  "id":1,

  "userName":"刘德华",

  "reason":"大帅哥"

}

PUT /my_index/user/2

{

  "id":2,

  "userName":"刘德华",

  "reason":"中华人民"

}

不分词查询

GET /my_index/user/_search

{

  "query": {

    "match": {

      "userName.raw": "刘德华"

    }

  }

}

{

  "took": 0,

  "timed_out": false,

  "_shards": {

    "total": 5,

    "successful": 5,

    "skipped": 0,

    "failed": 0

  },

  "hits": {

    "total": 2,

    "max_score": 0.2876821,

    "hits": [

      {

        "_index": "my_index",

        "_type": "user",

        "_id": "2",

        "_score": 0.2876821,

        "_source": {

          "id": 2,

          "userName": "刘德华",

          "reason": "中华人民"

        }

      },

      {

        "_index": "my_index",

        "_type": "user",

        "_id": "1",

        "_score": 0.2876821,

        "_source": {

          "id": 1,

          "userName": "刘德华",

          "reason": "大帅哥"

        }

      }

    ]

  }

}

分词查询

GET /my_index/user/_search

{

  "query": {

    "match": {

      "userName": "刘"

    }

  }

}

{

  "took": 0,

  "timed_out": false,

  "_shards": {

    "total": 5,

    "successful": 5,

    "skipped": 0,

    "failed": 0

  },

  "hits": {

    "total": 2,

    "max_score": 0.31331712,

    "hits": [

      {

        "_index": "my_index",

        "_type": "user",

        "_id": "2",

        "_score": 0.31331712,

        "_source": {

          "id": 2,

          "userName": "刘德华",

          "reason": "中华人民"

        }

      },

      {

        "_index": "my_index",

        "_type": "user",

        "_id": "1",

        "_score": 0.31331712,

        "_source": {

          "id": 1,

          "userName": "刘德华",

          "reason": "大帅哥"

        }

      }

    ]

  }

}

拼音分词

GET /my_index/user/_search

{

  "query": {

    "match": {

      "reason": "shuai"

    }

  }

}

{

  "took": 0,

  "timed_out": false,

  "_shards": {

    "total": 5,

    "successful": 5,

    "skipped": 0,

    "failed": 0

  },

  "hits": {

    "total": 1,

    "max_score": 3.4884284,

    "hits": [

      {

        "_index": "my_index",

        "_type": "user",

        "_id": "1",

        "_score": 3.4884284,

        "_source": {

          "id": 1,

          "userName": "刘德华",

          "reason": "大帅哥"

        }

      }

    ]

  }

}

分组聚合

GET /my_index/user/_search

{

  "size":2,

  "query": {

    "match": {

      "userName": "liu"

    }

  },

  "aggs": {

    "group_by_meetingType": {

      "terms": {

        "field": "userName.raw"

      }

    }

  }

}

{

  "took": 1,

  "timed_out": false,

  "_shards": {

    "total": 5,

    "successful": 5,

    "skipped": 0,

    "failed": 0

  },

  "hits": {

    "total": 2,

    "max_score": 3.133171,

    "hits": [

      {

        "_index": "my_index",

        "_type": "user",

        "_id": "2",

        "_score": 3.133171,

        "_source": {

          "id": 2,

          "userName": "刘德华",

          "reason": "中华人民"

        }

      },

      {

        "_index": "my_index",

        "_type": "user",

        "_id": "1",

        "_score": 3.133171,

        "_source": {

          "id": 1,

          "userName": "刘德华",

          "reason": "大帅哥"

        }

      }

    ]

  },

  "aggregations": {

    "group_by_meetingType": {

      "doc_count_error_upper_bound": 0,

      "sum_other_doc_count": 0,

      "buckets": [

        {

          "key": "刘德华",

          "doc_count": 2

        }

      ]

    }

  }

}

大神们这些都是个人理解哪里有一样的想法或建议欢迎评论！！！！！！！

Elasticsearch拼音和ik分词器的结合应用的更多相关文章

Elasticsearch下安装ik分词器
安装ik分词器(必须安装maven) 上传相应jar包解压到相应目录 unzip elasticsearch-analysis-ik-master.zip(zip包) cp -r elasticse ...
【ELK】【docker】【elasticsearch】2.使用elasticSearch+kibana+logstash+ik分词器+pinyin分词器+繁简体转化分词器 6.5.4 启动 ELK+logstash概念描述
官网地址:https://www.elastic.co/guide/en/elasticsearch/reference/current/docker.html#docker-cli-run-prod ...
Elasticsearch 7.x - IK分词器插件（ik_smart，ik_max_word）
一.安装IK分词器 Elasticsearch也需要安装IK分析器以实现对中文更好的分词支持. 去Github下载最新版elasticsearch-ik https://github.com/medc ...
linux（centos 7）下安装elasticsearch 5 的 IK 分词器
(一)到IK 下载对应的版本(直接下载release版本,避免mvn打包),下载后是一个zip压缩包 (二)将压缩包上传至elasticsearch 的安装目录下的plugins下,进行解压,运行如 ...
通过docker安装elasticsearch和安装ik分词器插件及安装kibana
前提: 已经安装好docker运行环境: 步骤: 1.安装elasticsearch 6.2.2版本,目前最新版是7.2.0,这里之所以选择6.2.2是因为最新的SpringBoot2.1.6默认支持 ...
【ELK】【docker】【elasticsearch】1. 使用Docker和Elasticsearch+ kibana 5.6.9 搭建全文本搜索引擎应用集群,安装ik分词器
系列文章:[建议从第二章开始] [ELK][docker][elasticsearch]1. 使用Docker和Elasticsearch+ kibana 5.6.9 搭建全文本搜索引擎应用集群,安 ...
docker 部署 elasticsearch + elasticsearch-head + elasticsearch-head跨域问题 + IK分词器
0. docker pull 拉取elasticsearch + elasticsearch-head 镜像 1. 启动elasticsearch Docker镜像 docker run -di ...
Docker 下Elasticsearch 的安装和ik分词器
(1)docker镜像下载 docker pull elasticsearch:5.6.8 (2)安装es容器 docker run -di --name=changgou_elasticsearch ...
Elasticsearch（ES）分词器的那些事儿
1. 概述分词器是Elasticsearch中很重要的一个组件,用来将一段文本分析成一个一个的词,Elasticsearch再根据这些词去做倒排索引. 今天我们就来聊聊分词器的相关知识. 2. 内置 ...

随机推荐

Dubbo架构学习整理
一. Dubbo诞生背景随着互联网的发展和网站规模的扩大,系统架构也从单点的垂直结构往分布式服务架构演进,如下图所示: 单一应用架构:一个应用部署所有功能,此时简化CRUD的ORM框架是关键垂直应 ...
jpa的查询语法
pytorch模型部署在MacOS或者IOS
pytorch训练出.pth模型如何在MacOS上或者IOS部署,这是个问题. 然而我们有了onnx,同样我们也有了coreML. ONNX: onnx是一种针对机器学习设计的开放式文件格式,用来存储 ...
仿照 ButterKnife 的 Android 注解实例
什么是注解 java.lang.annotation,接口 Annotation,在JDK5.0及以后版本引入. 注解处理器是 javac 的一个工具,它用来在编译时扫描和处理注解(Annotatio ...
Perl列表相关函数
内置的列表函数有: grep, join, map, qw//, reverse, sort, unpack join:将多个元素使用给定字符串联起来join grep:从列表中筛选符合条件的元素执行 ...
ARM 处理器寻址方式之间接寻址的几种表达
我们以 LDR 指令为例来分别举例分析. LDR 指令的格式为: LDR{条件} 目的寄存器,<存储器地址> LDR 指令是字加载指令,用于从存储器中将一个 32 位的字数据送到目的寄存器 ...
linux安装配置zookeeper-3.4.10
此文是基于上一篇文章:hadoop集群搭建安装zookeeper: [在各个slave节点安装zookeeper] 下载地址:http://mirror.bit.edu.cn/apache/zook ...
maven web工程缺少 src/main/java 和 src/test/java 资源文件夹的方法
右键打开:build path -> configure build path... 在弹出的界面,选择: 编辑后: 点击finish,即可完成
[PHP] 多进程通信-消息队列使用
向消息队列发送数据和获取数据的测试 <?php $key=ftok(__FILE__,'a'); //获取消息队列 $queue=msg_get_queue($key,0666); //发送消息 ...
Spring使用ajax异步上传文件
单文件上传  文件上传 :<input type="file" id="file" name="fi ...

Elasticsearch拼音和ik分词器的结合应用

大神们这些都是个人理解哪里有一样的想法或建议欢迎评论！！！！！！！

Elasticsearch拼音和ik分词器的结合应用的更多相关文章

随机推荐

热门专题