Top Hits Aggregation

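The top_hits metric aggregator keeps track of the most relevant documents being aggregated. It is intended to be used as a sub-aggregator, so that the top matching documents can be collected per bucket.

The top_hits aggregator is an effective way to group a result set by certain fields through a bucket aggregator. One or more bucket aggregators determine the properties by which the result set is sliced.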

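Options

from - The offset from the first result you want to fetch.
size - The maximum number of top matching hits to return per bucket. By default, the top three matching hits are returned.
sort - How the top matching hits should be sorted. By default, the hits are sorted by the score of the main query.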
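
To make the three options concrete, here is a minimal sketch in plain Python that mimics what a bucket aggregation combined with top_hits does: group documents, sort each group, then slice it. It is only a simulation and not how Elasticsearch implements the aggregation. The function name terms_top_hits, its parameters, and the sample documents are all hypothetical.

```python
from collections import defaultdict


def terms_top_hits(docs, group_field, sort_field="_score", from_=0, size=3):
    """Group docs by group_field, then keep a sorted slice of each group."""
    buckets = defaultdict(list)
    for doc in docs:
        buckets[doc[group_field]].append(doc)

    result = {}
    for key, members in buckets.items():
        # sort: best first (descending), like the default ordering by score
        ranked = sorted(members, key=lambda d: d[sort_field], reverse=True)
        # from: skip the first from_ hits; size: cap the hits kept per bucket
        result[key] = ranked[from_:from_ + size]
    return result


# Hypothetical documents, unrelated to the recipe example in this article
docs = [
    {"name": "a", "group": "x", "_score": 0.9},
    {"name": "b", "group": "x", "_score": 0.5},
    {"name": "c", "group": "x", "_score": 0.7},
    {"name": "d", "group": "x", "_score": 0.1},
    {"name": "e", "group": "y", "_score": 0.3},
]

top = terms_top_hits(docs, "group")
print([d["name"] for d in top["x"]])  # ['a', 'c', 'b']
print([d["name"] for d in top["y"]])  # ['e']
```

With the defaults (from 0, size 3, sorted by score), bucket x keeps three of its four documents, and bucket y keeps its only one.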

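Supported per hit features

The top_hits aggregation returns regular search hits, so many per-hit features are supported, including highlighting, explain, named queries, source filtering, stored fields, script fields, doc value fields, and version inclusion.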

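Example

A concrete example shows how this works, and it is simple to use.

First prepare the index and the data. The example uses recipes: name is the recipe name, type is the cuisine, and rating is the cumulative average user rating.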

PUT recipes

POST /recipes/type/_mapping
{
  "properties": {
    "name": {
      "type": "text"
    },
    "rating": {
      "type": "float"
    },
    "type": {
      "type": "keyword"
    }
  }
}

POST /recipes/_bulk
{ "index":  { "_index": "recipes", "_type": "type"}}
{"name":"清蒸鱼头","rating":1,"type":"湘菜"}
{ "index":  { "_index": "recipes", "_type": "type"}}
{"name":"剁椒鱼头","rating":2,"type":"湘菜"}
{ "index":  { "_index": "recipes", "_type": "type"}}
{"name":"红烧鲫鱼","rating":3,"type":"湘菜"}
{ "index":  { "_index": "recipes", "_type": "type"}}
{"name":"鲫鱼汤（辣）","rating":3,"type":"湘菜"}
{ "index":  { "_index": "recipes", "_type": "type"}}
{"name":"鲫鱼汤（微辣）","rating":4,"type":"湘菜"}
{ "index":  { "_index": "recipes", "_type": "type"}}
{"name":"鲫鱼汤（变态辣）","rating":5,"type":"湘菜"}
{ "index":  { "_index": "recipes", "_type": "type"}}
{"name":"广式鲫鱼汤","rating":5,"type":"粤菜"}
{ "index":  { "_index": "recipes", "_type": "type"}}
{"name":"鱼香肉丝","rating":2,"type":"川菜"}
{ "index":  { "_index": "recipes", "_type": "type"}}
{"name":"奶油鲍鱼汤","rating":2,"type":"西菜"}
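
The bulk body is newline-delimited JSON: one action line followed by one document line, with no blank lines in between and a newline after the last line. As a rough sketch, such a body could be generated in Python as shown below. The helper name build_bulk_body is hypothetical, only two of the nine recipes are included for brevity, and sending the request is left out.

```python
import json


def build_bulk_body(index, doc_type, docs):
    """Return an NDJSON string with one action line per document line."""
    lines = []
    for doc in docs:
        # Action line: index the next line into the given index and type
        lines.append(json.dumps({"index": {"_index": index, "_type": doc_type}}))
        # Document line: keep the Chinese text readable instead of \u escapes
        lines.append(json.dumps(doc, ensure_ascii=False))
    # The bulk body has to end with a newline
    return "\n".join(lines) + "\n"


recipes = [
    {"name": "清蒸鱼头", "rating": 1, "type": "湘菜"},
    {"name": "广式鲫鱼汤", "rating": 5, "type": "粤菜"},
]

body = build_bulk_body("recipes", "type", recipes)
print(body, end="")
```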

Now let's see what a plain query returns. Search for dishes whose name contains "鱼" (fish) and return 3 results:

POST recipes/type/_search
{
  "query": {
    "match": {
      "name": "鱼"
    }
  },
  "size": 3
}

Every hit is Hunan cuisine (湘菜). I have been avoiding spicy food lately, so this first page of results is useless to me:

{
  "took": 2,
  "timed_out": false,
  "_shards": {
    "total": 5,
    "successful": 5,
    "failed": 0
  },
  "hits": {
    "total": 9,
    "max_score": 0.26742277,
    "hits": [
      {
        "_index": "recipes",
        "_type": "type",
        "_id": "AVoESHYF_OA-dG63Txsd",
        "_score": 0.26742277,
        "_source": {
          "name": "鲫鱼汤（变态辣）",
          "rating": 5,
          "type": "湘菜"
        }
      },
      {
        "_index": "recipes",
        "_type": "type",
        "_id": "AVoESHXO_OA-dG63Txsa",
        "_score": 0.19100356,
        "_source": {
          "name": "红烧鲫鱼",
          "rating": 3,
          "type": "湘菜"
        }
      },
      {
        "_index": "recipes",
        "_type": "type",
        "_id": "AVoESHWy_OA-dG63TxsZ",
        "_score": 0.19100356,
        "_source": {
          "name": "剁椒鱼头",
          "rating": 2,
          "type": "湘菜"
        }
      }
    ]
  }
}

Next, let's sort by rating to see which dishes people like most, and whether any of them appeal to me. Run this query:

POST recipes/type/_search
{
  "query": {
    "match": {
      "name": "鱼"
    }
  },
  "sort": [
    {
      "rating": {
        "order": "desc"
      }
    }
  ],
  "size": 3
}

The results are slightly better, but 2 of the 3 hits are still Hunan dishes, which is not quite what I want:

{
  "took": 1,
  "timed_out": false,
  "_shards": {
    "total": 5,
    "successful": 5,
    "failed": 0
  },
  "hits": {
    "total": 9,
    "max_score": null,
    "hits": [
      {
        "_index": "recipes",
        "_type": "type",
        "_id": "AVoESHYF_OA-dG63Txsd",
        "_score": null,
        "_source": {
          "name": "鲫鱼汤（变态辣）",
          "rating": 5,
          "type": "湘菜"
        },
        "sort": [
          5
        ]
      },
      {
        "_index": "recipes",
        "_type": "type",
        "_id": "AVoESHYW_OA-dG63Txse",
        "_score": null,
        "_source": {
          "name": "广式鲫鱼汤",
          "rating": 5,
          "type": "粤菜"
        },
        "sort": [
          5
        ]
      },
      {
        "_index": "recipes",
        "_type": "type",
        "_id": "AVoESHX7_OA-dG63Txsc",
        "_score": null,
        "_source": {
          "name": "鲫鱼汤（微辣）",
          "rating": 4,
          "type": "湘菜"
        },
        "sort": [
          4
        ]
      }
    ]
  }
}

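Now I know what I want: a look at the other cuisines. This restaurant also serves Western, Cantonese, and other cuisines, so show me one dish from each. To do that, use a terms aggregation to get one bucket per unique term, then nest a top_hits aggregation inside it that returns the top hit by rating in each bucket. It sounds a little complicated, but the following query makes it clear: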

GET recipes/type/_search
{
  "query": {
    "match": {
      "name": "鱼"
    }
  },
  "sort": [
    {
      "rating": {
        "order": "desc"
      }
    }
  ],
  "aggs": {
    "type": {
      "terms": {
        "field": "type",
        "size": 10
      },
      "aggs": {
        "rated": {
          "top_hits": {
            "sort": [
              {
                "rating": {
                  "order": "desc"
                }
              }
            ],
            "size": 1
          }
        }
      }
    }
  },
  "size": 0,
  "from": 0
}
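
To show which parts of this request are fixed and which are parameters, the body can be assembled programmatically. The following is a hedged sketch: the function build_top_hits_query and its parameter names are hypothetical. It omits the top-level sort and from, on the assumption that they do not affect the aggregation when size is 0 and no top-level hits are returned.

```python
import json


def build_top_hits_query(match_field, text, group_field, sort_field,
                         per_bucket=1, max_buckets=10):
    """Build a search body: one bucket per group_field value, best hits inside."""
    return {
        "query": {"match": {match_field: text}},
        # No top-level hits are needed, only the aggregation result
        "size": 0,
        "aggs": {
            group_field: {
                # Bucket aggregation: one bucket per unique term
                "terms": {"field": group_field, "size": max_buckets},
                "aggs": {
                    "rated": {
                        # Sub-aggregation: best documents inside each bucket
                        "top_hits": {
                            "sort": [{sort_field: {"order": "desc"}}],
                            "size": per_bucket,
                        }
                    }
                },
            }
        },
    }


body = build_top_hits_query("name", "鱼", "type", "rating")
print(json.dumps(body, ensure_ascii=False, indent=2))
```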

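The JSON structure of the result below is somewhat complex, but it is exactly what we wanted: Hunan (湘菜), Cantonese (粤菜), Sichuan (川菜), and Western (西菜) cuisine each appear once, with no repeats: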

{
  "took": 4,
  "timed_out": false,
  "_shards": {
    "total": 5,
    "successful": 5,
    "failed": 0
  },
  "hits": {
    "total": 9,
    "max_score": 0,
    "hits": []
  },
  "aggregations": {
    "type": {
      "doc_count_error_upper_bound": 0,
      "sum_other_doc_count": 0,
      "buckets": [
        {
          "key": "湘菜",
          "doc_count": 6,
          "rated": {
            "hits": {
              "total": 6,
              "max_score": null,
              "hits": [
                {
                  "_index": "recipes",
                  "_type": "type",
                  "_id": "AVoESHYF_OA-dG63Txsd",
                  "_score": null,
                  "_source": {
                    "name": "鲫鱼汤（变态辣）",
                    "rating": 5,
                    "type": "湘菜"
                  },
                  "sort": [
                    5
                  ]
                }
              ]
            }
          }
        },
        {
          "key": "川菜",
          "doc_count": 1,
          "rated": {
            "hits": {
              "total": 1,
              "max_score": null,
              "hits": [
                {
                  "_index": "recipes",
                  "_type": "type",
                  "_id": "AVoESHYr_OA-dG63Txsf",
                  "_score": null,
                  "_source": {
                    "name": "鱼香肉丝",
                    "rating": 2,
                    "type": "川菜"
                  },
                  "sort": [
                    2
                  ]
                }
              ]
            }
          }
        },
        {
          "key": "粤菜",
          "doc_count": 1,
          "rated": {
            "hits": {
              "total": 1,
              "max_score": null,
              "hits": [
                {
                  "_index": "recipes",
                  "_type": "type",
                  "_id": "AVoESHYW_OA-dG63Txse",
                  "_score": null,
                  "_source": {
                    "name": "广式鲫鱼汤",
                    "rating": 5,
                    "type": "粤菜"
                  },
                  "sort": [
                    5
                  ]
                }
              ]
            }
          }
        },
        {
          "key": "西菜",
          "doc_count": 1,
          "rated": {
            "hits": {
              "total": 1,
              "max_score": null,
              "hits": [
                {
                  "_index": "recipes",
                  "_type": "type",
                  "_id": "AVoESHY3_OA-dG63Txsg",
                  "_score": null,
                  "_source": {
                    "name": "奶油鲍鱼汤",
                    "rating": 2,
                    "type": "西菜"
                  },
                  "sort": [
                    2
                  ]
                }
              ]
            }
          }
        }
      ]
    }
  }
}
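
Since the nesting is deep, a small helper makes the result easier to consume. The sketch below walks a response of the shape shown above and pulls out the top dish for each cuisine. The helper name best_per_bucket is hypothetical, and the response dictionary is a hand-trimmed copy of the real response that keeps only the fields the helper reads.

```python
def best_per_bucket(response, agg_name="type", sub_agg_name="rated"):
    """Map each bucket key to the name of its first (best) top_hits document."""
    best = {}
    for bucket in response["aggregations"][agg_name]["buckets"]:
        hits = bucket[sub_agg_name]["hits"]["hits"]
        if hits:  # defensive: skip a bucket with no hits
            best[bucket["key"]] = hits[0]["_source"]["name"]
    return best


def _bucket(key, count, name, rating):
    """Build one trimmed bucket in the same shape as the real response."""
    source = {"name": name, "rating": rating, "type": key}
    return {
        "key": key,
        "doc_count": count,
        "rated": {"hits": {"total": count, "hits": [{"_source": source}]}},
    }


# Trimmed copy of the response above
response = {
    "aggregations": {
        "type": {
            "buckets": [
                _bucket("湘菜", 6, "鲫鱼汤（变态辣）", 5),
                _bucket("川菜", 1, "鱼香肉丝", 2),
                _bucket("粤菜", 1, "广式鲫鱼汤", 5),
                _bucket("西菜", 1, "奶油鲍鱼汤", 2),
            ]
        }
    }
}

for cuisine, dish in best_per_bucket(response).items():
    print(cuisine, dish)
```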
