ElasticSearch中term和match探索

一.创建测试数据

1.创建一个index

curl -X PUT  http://127.0.0.1:9200/student?pretty -H "Content-Type: application/json" -d '{

    "settings": {

        "number_of_shards": 1,

        "number_of_replicas": 0

    },

    "mappings": {

        "_source": {

            "enabled": true

        },

        "properties": {

            "id": {

                "type": "integer"

            },

            "name": {

                "type": "text"

            },

            "age": {

                "type": "integer"

            },

            "class": {

                "type": "text",

                "analyzer": "ik_max_word"

            },

            "introduce": {

                "type": "text",

                "analyzer": "ik_max_word"

            }

        }

    }

}'

2.验证是否创建成功

curl -XGET "http://127.0.0.1:9200/student?pretty"

3.插入测试数据

curl -X PUT http://127.0.0.1:9200/student/_doc/1?pretty -H "Content-Type: application/json" -d '{

    "id":1,

    "name":"关云长",

    "age":30,

	"class":"蜀国一班"

}'

curl -X PUT http://127.0.0.1:9200/student/_doc/2?pretty -H "Content-Type: application/json" -d '{

    "id":2,

    "name":"吕蒙",

    "age":25,

	"class":"吴国一班"

}'

curl -X PUT http://127.0.0.1:9200/student/_doc/3?pretty -H "Content-Type: application/json" -d '{

    "id":3,

    "name":"吕布",

    "age":40,

	"class":"三姓一班"

}'

curl -X PUT http://127.0.0.1:9200/student/_doc/4?pretty -H "Content-Type: application/json" -d '{

    "id":4,

    "name":"张翼德",

    "age":30,

	"class":"蜀国二班"

}'

4.查询所有数据，验证是否正确

curl -XGET http://127.0.0.1:9200/student/_search?pretty -H "Content-Type: application/json" -d '

{

    "query": {

        "match_all": {}

    }

}'

二.验证



#关于term和match，下面两个查询，term没有结果，match有结果，为什么？

curl -XGET http://127.0.0.1:9200/student/_search?pretty -H "Content-Type: application/json" -d '{

    "query": {

           "term": {"name":"吕蒙"}

    }

}'

curl -XGET http://127.0.0.1:9200/student/_search?pretty -H "Content-Type: application/json" -d '{

    "query": {

           "match": {"name":"吕蒙"}

    }

}'

拿A去B里匹配，A能分词，B也能分词。term不会将A分词，match会将A分词，存储数据类型keyword不会将B分词，text会将B分词。

可以看到上面用term方式查找，没有结果，而用match方式查找，能查找到“吕蒙”和“吕布”两个结果

term是不分词（不拆分搜索字）查找目标字段中是否有要查找的文字，也就是完整查找“吕蒙”两个字，而name这个字段用的是text类型存储的，text类型数据默认是分词的，也就是elasticsearch会将name分词后（分成“吕”和“蒙”）再存储，这时候拿完整的搜索字“吕蒙”去存储的“吕”、“蒙”里找肯定是找不到的。

match是分词（拆分搜索字）查找目标字段，也就是说会先将要查找的搜索子“吕蒙”拆成“吕”和“蒙”，再分别去name里找“吕”，如果没有找到“吕”，还会去找“蒙”，而存储的数据里，text已经将“吕蒙”和“吕布”都分词成了“吕”，“蒙”，“吕”，“布”存储了，所以光通过一个“吕”字就能找到两条结果。

这里要区分搜索词的分词，以及字段存储的分词。拿A去B里匹配，A能分词，B也能分词。term不会将A分词，match会将A分词。

既然name的类型，存储的时候就是分词的，那能不能在存储的时候不分词了，可以用将text类型改成keyword类型

#删除所有文档

curl -XPOST "http://127.0.0.1:9200/student/_delete_by_query?pretty" -v -H "Content-Type: application/json" -d '

{

    "query": {

        "match_all": {}

    }

}'

#删除索引

curl -XDELETE "http://127.0.0.1:9200/student?pretty"

#重新创建索引，将name字段的类型改成keyword

curl -X PUT  http://127.0.0.1:9200/student?pretty -H "Content-Type: application/json" -d '{

    "settings": {

        "number_of_shards": 1,

        "number_of_replicas": 0

    },

    "mappings": {

        "_source": {

            "enabled": true

        },

        "properties": {

            "id": {

                "type": "integer"

            },

            "name": {

                "type": "keyword"

            },

            "age": {

                "type": "integer"

            },

            "class": {

                "type": "text",

                "analyzer": "ik_max_word"

            },

            "introduce": {

                "type": "text",

                "analyzer": "ik_max_word"

            }

        }

    }

}'

#重新插入上面四条数据

#请复制上面的语句，执行

#下面这条查询将返回“吕蒙”同学

curl -XGET http://127.0.0.1:9200/student/_search?pretty -H "Content-Type: application/json" -d '{

    "query": {

           "term": {"name":"吕蒙"}

    }

}'

#下面这条查询将返回0结果，因为存储时类型为keyword没有分词，所以存储的是“吕蒙”和“吕布”，这时候拿#“吕”去匹配，没有匹配的结果

curl -XGET http://127.0.0.1:9200/student/_search?pretty -H "Content-Type: application/json" -d '{

    "query": {

           "term": {"name":"吕"}

    }

}'

#下面的结果将只会返回“吕蒙”同学，没有匹配的结果，因为存储时类型为keyword没有分词，所以存储的“吕

#蒙”和“吕布”，这时候拿“吕蒙”去匹配，虽然用的match，会将搜索词拆分成“吕蒙”，“吕”，“蒙”去搜索，但

#“吕”和“蒙”都不会匹配的到存储的“吕蒙”和“吕布”

curl -XGET http://127.0.0.1:9200/student/_search?pretty -H "Content-Type: application/json" -d '{

    "query": {

           "match": {"name":"吕蒙"}

    }

}'

ElasticSearch中term和match探索的更多相关文章

elasticsearch 中的Multi Match Query
在Elasticsearch全文检索中,我们用的比较多的就是Multi Match Query,其支持对多个字段进行匹配.Elasticsearch支持5种类型的Multi Match,我们一起来深入 ...
ES查询－term VS match （转）
原文地址:https://blog.csdn.net/sxf_123456/article/details/78845437 elasticsearch 中term与match区别 term是精确查询 ...
Elasticsearch 5.0 中term 查询和match 查询的认识
Elasticsearch 5.0 关于term query和match query的认识一.基本情况前言:term query和match query牵扯的东西比较多,例如分词器.mapping ...
Elasticsearch学习系列之term和match查询
lasticsearch查询模式一种是像传递URL参数一样去传递查询语句,被称为简单查询 GET /library/books/_search //查询index为library,type为book ...
Elasticsearch学习系列之term和match查询实例
Elasticsearch查询模式一种是像传递URL参数一样去传递查询语句,被称为简单查询 GET /library/books/_search //查询index为library,type为boo ...
Elasticsearch中的Term查询和全文查询
目录前言 Term 查询 exists 查询 fuzzy 查询 ids 查询 prefix 查询 range 查询 regexp 查询 term 查询 terms 查询 terms_set 查询 t ...
在Elasticsearch中查询Term Vectors词条向量信息
这篇文章有点深度,可能需要一些Lucene或者全文检索的背景.由于我也很久没有看过Lucene了,有些地方理解的不对还请多多指正. 更多内容还请参考整理的ELK教程关于Term Vectors 额, ...
如何在Elasticsearch中安装中文分词器(IK+pinyin)
如果直接使用Elasticsearch的朋友在处理中文内容的搜索时,肯定会遇到很尴尬的问题--中文词语被分成了一个一个的汉字,当用Kibana作图的时候,按照term来分组,结果一个汉字被分成了一组. ...
elasticsearch中常用的API
elasticsearch中常用的API分类如下: 文档API: 提供对文档的增删改查操作搜索API: 提供对文档进行某个字段的查询索引API: 提供对索引进行操作,查看索引信息等查看API: ...

随机推荐

windows下去掉快捷方式图标的小箭头的几种方法
去掉快捷方式图标的小箭头的几种方法第一种: 点开始菜单,点运行,输入以下命令后回车.即可解决 cmd /k reg delete "HKEY_CLASSES_ROOT\lnkfile&qu ...
CF1208C
CF1208C 这场杜老师大战tourist的比赛怎么这么多人类智慧题... 题意: 构造一个 $ n \times n $ 的矩阵,使得该矩阵每一行与每一列的元素的异或和全部相等. 解法: 异或的神 ...
应用fluent二维单向流泥沙冲刷作用下河床变形代码【转载】
本代码转载自:http://www.codeforge.cn/read/260028/keti_2d_b.c__html #include "udf.h" #define Rho ...
Jenkins系统初始化配置
1.点击系统管理-->全局安全配置 2.设置允许用户注册,点击保存 3.配置全局工具 4.配置maven文件路径,使用文件系统中的settings文件 5.配置jdk 6.配置maven 7.我 ...
Linux命令行提交更新冲突
1.在harry目录下的hello文件第五行加一些内容 2.将修改后文件执行提交操作提交成功,文件版本升为5 3.在sally目录下同样修改hello文件第五行 4.sally进行提交操作发现提交 ...
puppeteer爬虫服务
爬虫文件 baidu.js const puppeteer = require("puppeteer"); const path = require('path'); const ...
expdp / impdp 用法详解，和exp / imp 的区别
一关于expdp和impdp 使用EXPDP和IMPDP时应该注意的事项:EXP和IMP是客户端工具程序,它们既可以在客户端使用,也可以在服务端使用.EXPDP和IMPDP是服务端的工具程 ...
第11组 Beta冲刺（4/5）
第11组 Beta冲刺(4/5) 队名不知道叫什么团队组长博客 https://www.cnblogs.com/xxylac/p/12018586.html 作业博客 https://edu. ...
html文字两行后，就用省略号代替剩下的
html文字两行后,就用省略号代替剩下的一.总结一句话总结: 实现原理很简单,将box的高度设置为行高的两倍,超出之后隐藏,这样就只有两行了,然后再用after属性绝对定位在第二行后面加几个点 . ...
一个有趣的BUG/按钮disabled之后还能触发click事件
一个很有意思的Bug 某天测试同学再次向我反馈,你这个删除按钮虽然置灰了,但是还是可以点击啊? 我:????(黑人问号) 卧槽?不可能啊,按钮都disabled了,怎么还可以点击?还能触发click事 ...