Top Hits Aggregation

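The top_hits metric aggregator keeps track of the most relevant documents being aggregated. It is intended to be used as a sub-aggregator, so that the top matching documents can be aggregated per bucket.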

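The top_hits aggregator can effectively be used to group a result set by certain fields via a bucket aggregator. One or more bucket aggregators determine the properties by which the result set is sliced.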

Options

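from - The offset of the first result to fetch.
size - The maximum number of top matching hits to return per bucket. By default, the top three matching hits are returned.
sort - How the top matching hits should be sorted. By default, hits are sorted by the score of the main query.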
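
As a rough illustration of how the three options fit together, the sketch below builds a top_hits aggregation body as a plain Python dict. The helper name `build_top_hits` and its parameter names are hypothetical and not part of any Elasticsearch client; only the keys `from`, `size`, and `sort` come from the option list above, and the `rating` field is borrowed from the example data used later in this article.

```python
import json


def build_top_hits(size=3, offset=0, sort=None):
    """Return a top_hits aggregation body as a plain dict.

    Hypothetical helper for illustration only; the keys "from", "size"
    and "sort" correspond to the options described above.
    """
    body = {"from": offset, "size": size}
    # Leaving "sort" out keeps the default ordering (score of the main query).
    if sort is not None:
        body["sort"] = sort
    return {"top_hits": body}


# One hit per bucket, highest "rating" first.
best_rated = build_top_hits(size=1, sort=[{"rating": {"order": "desc"}}])
print(json.dumps(best_rated, indent=2))
```

The resulting dict is meant to be nested under a bucket aggregation's `aggs` key, as the full query in the example section does.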

Supported per hit features

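The top_hits aggregation returns regular search hits, which is why it can support many per-hit features, such as highlighting, explain, and source filtering.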

Example

A concrete example makes all of this clear, and the aggregation turns out to be simple to use.

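First prepare the index and the data. We use recipes as the example: name is the recipe name, type is the cuisine, and rating is the cumulative average user rating.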

PUT recipes

POST /recipes/type/_mapping
{
  "properties": {
    "name": {
      "type": "text"
    },
    "rating": {
      "type": "float"
    },
    "type": {
      "type": "keyword"
    }
  }
}

POST /recipes/_bulk
{ "index":  { "_index": "recipes", "_type": "type"}}
{"name":"清蒸鱼头","rating":1,"type":"湘菜"}
{ "index":  { "_index": "recipes", "_type": "type"}}
{"name":"剁椒鱼头","rating":2,"type":"湘菜"}
{ "index":  { "_index": "recipes", "_type": "type"}}
{"name":"红烧鲫鱼","rating":3,"type":"湘菜"}
{ "index":  { "_index": "recipes", "_type": "type"}}
{"name":"鲫鱼汤（辣）","rating":3,"type":"湘菜"}
{ "index":  { "_index": "recipes", "_type": "type"}}
{"name":"鲫鱼汤（微辣）","rating":4,"type":"湘菜"}
{ "index":  { "_index": "recipes", "_type": "type"}}
{"name":"鲫鱼汤（变态辣）","rating":5,"type":"湘菜"}
{ "index":  { "_index": "recipes", "_type": "type"}}
{"name":"广式鲫鱼汤","rating":5,"type":"粤菜"}
{ "index":  { "_index": "recipes", "_type": "type"}}
{"name":"鱼香肉丝","rating":2,"type":"川菜"}
{ "index":  { "_index": "recipes", "_type": "type"}}
{"name":"奶油鲍鱼汤","rating":2,"type":"西菜"}
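
The bulk body alternates an action line with a document line, one JSON object per line. If the data lives in a script rather than in the console, a body of that shape could be generated roughly as follows. This is only a sketch: the function name `to_bulk_ndjson` is hypothetical, no client library is used, and the `_type` entry is kept only because the article's requests use mapping types, which newer Elasticsearch versions no longer accept.

```python
import json

# First three sample documents from the bulk request above.
recipes = [
    {"name": "清蒸鱼头", "rating": 1, "type": "湘菜"},
    {"name": "剁椒鱼头", "rating": 2, "type": "湘菜"},
    {"name": "红烧鲫鱼", "rating": 3, "type": "湘菜"},
]


def to_bulk_ndjson(docs, index="recipes", doc_type="type"):
    """Serialize docs into a newline-delimited _bulk request body (hypothetical helper)."""
    lines = []
    for doc in docs:
        # Action line first, then the document source on the following line.
        lines.append(json.dumps({"index": {"_index": index, "_type": doc_type}}))
        lines.append(json.dumps(doc, ensure_ascii=False))
    # The bulk body has to end with a newline character.
    return "\n".join(lines) + "\n"


payload = to_bulk_ndjson(recipes)
print(payload)
```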

Now let's see what an ordinary query returns: search for dishes whose name contains "鱼" (fish) and return 3 documents.

POST recipes/type/_search
{
  "query": {
    "match": {
      "name": "鱼"
    }
  },
  "size": 3
}

All Hunan dishes (湘菜). Good grief, I've been staying away from spicy food lately, so this first page of results is useless to me:

{
  "took": 2,
  "timed_out": false,
  "_shards": {
    "total": 5,
    "successful": 5,
    "failed": 0
  },
  "hits": {
    "total": 9,
    "max_score": 0.26742277,
    "hits": [
      {
        "_index": "recipes",
        "_type": "type",
        "_id": "AVoESHYF_OA-dG63Txsd",
        "_score": 0.26742277,
        "_source": {
          "name": "鲫鱼汤（变态辣）",
          "rating": 5,
          "type": "湘菜"
        }
      },
      {
        "_index": "recipes",
        "_type": "type",
        "_id": "AVoESHXO_OA-dG63Txsa",
        "_score": 0.19100356,
        "_source": {
          "name": "红烧鲫鱼",
          "rating": 3,
          "type": "湘菜"
        }
      },
      {
        "_index": "recipes",
        "_type": "type",
        "_id": "AVoESHWy_OA-dG63TxsZ",
        "_score": 0.19100356,
        "_source": {
          "name": "剁椒鱼头",
          "rating": 2,
          "type": "湘菜"
        }
      }
    ]
  }
}

Let's try again. This time I want to sort by rating, to see which dishes everyone likes and whether any of them appeal to me. Run the query:

POST recipes/type/_search
{
  "query": {
    "match": {
      "name": "鱼"
    }
  },
  "sort": [
    {
      "rating": {
        "order": "desc"
      }
    }
  ],
  "size": 3
}

The results are slightly better, but 2 of the 3 are still Hunan dishes, which is still not quite what I want:

{
  "took": 1,
  "timed_out": false,
  "_shards": {
    "total": 5,
    "successful": 5,
    "failed": 0
  },
  "hits": {
    "total": 9,
    "max_score": null,
    "hits": [
      {
        "_index": "recipes",
        "_type": "type",
        "_id": "AVoESHYF_OA-dG63Txsd",
        "_score": null,
        "_source": {
          "name": "鲫鱼汤（变态辣）",
          "rating": 5,
          "type": "湘菜"
        },
        "sort": [
          5
        ]
      },
      {
        "_index": "recipes",
        "_type": "type",
        "_id": "AVoESHYW_OA-dG63Txse",
        "_score": null,
        "_source": {
          "name": "广式鲫鱼汤",
          "rating": 5,
          "type": "粤菜"
        },
        "sort": [
          5
        ]
      },
      {
        "_index": "recipes",
        "_type": "type",
        "_id": "AVoESHX7_OA-dG63Txsc",
        "_score": null,
        "_source": {
          "name": "鲫鱼汤（微辣）",
          "rating": 4,
          "type": "湘菜"
        },
        "sort": [
          4
        ]
      }
    ]
  }
}

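Now I know what I want: a look at the other cuisines. Doesn't this place also serve Western food, Cantonese food, and so on? So give me one dish from each cuisine. To do that, use a terms agg to get one bucket per unique term, then combine it with a top_hits agg that returns the first hit sorted by rating. That sounds a bit complicated, but the query below makes it clear: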

GET recipes/type/_search
{
  "query": {
    "match": {
      "name": "鱼"
    }
  },
  "sort": [
    {
      "rating": {
        "order": "desc"
      }
    }
  ],
  "aggs": {
    "type": {
      "terms": {
        "field": "type",
        "size": 10
      },
      "aggs": {
        "rated": {
          "top_hits": {
            "sort": [
              {
                "rating": {
                  "order": "desc"
                }
              }
            ],
            "size": 1
          }
        }
      }
    }
  },
  "size": 0,
  "from": 0
}
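
To make the combination of terms and top_hits more tangible, here is a small in-memory simulation in Python of what the query asks for: group the matching documents by cuisine, then keep the best-rated one in each group. It is only a sketch of the idea, not how Elasticsearch computes the aggregation across shards; the function name `top_hit_per_bucket` is hypothetical, and breaking ties between equally sized buckets by key is an assumption made here for a deterministic order.

```python
from collections import defaultdict

# The nine sample documents; every name contains "鱼", so all of them match the query.
docs = [
    {"name": "清蒸鱼头", "rating": 1, "type": "湘菜"},
    {"name": "剁椒鱼头", "rating": 2, "type": "湘菜"},
    {"name": "红烧鲫鱼", "rating": 3, "type": "湘菜"},
    {"name": "鲫鱼汤（辣）", "rating": 3, "type": "湘菜"},
    {"name": "鲫鱼汤（微辣）", "rating": 4, "type": "湘菜"},
    {"name": "鲫鱼汤（变态辣）", "rating": 5, "type": "湘菜"},
    {"name": "广式鲫鱼汤", "rating": 5, "type": "粤菜"},
    {"name": "鱼香肉丝", "rating": 2, "type": "川菜"},
    {"name": "奶油鲍鱼汤", "rating": 2, "type": "西菜"},
]


def top_hit_per_bucket(documents, bucket_field="type", sort_field="rating", size=1):
    """Group documents by bucket_field and keep the `size` highest-sorted per group."""
    buckets = defaultdict(list)
    for doc in documents:
        buckets[doc[bucket_field]].append(doc)
    result = []
    # Largest buckets first; equal-sized buckets are ordered by key (assumption).
    for key, members in sorted(buckets.items(), key=lambda kv: (-len(kv[1]), kv[0])):
        best = sorted(members, key=lambda d: d[sort_field], reverse=True)[:size]
        result.append({"key": key, "doc_count": len(members), "top": best})
    return result


for bucket in top_hit_per_bucket(docs):
    print(bucket["key"], bucket["doc_count"], [d["name"] for d in bucket["top"]])
```

Each cuisine contributes exactly one dish, which is the "one per group" behaviour that a plain sorted search with size 3 could not provide.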

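Look at the result below. The JSON structure is a bit involved, but it is finally what we wanted: Hunan (湘菜), Cantonese (粤菜), Sichuan (川菜), and Western (西菜) dishes all show up, one of each, with no repeats: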

{
  "took": 4,
  "timed_out": false,
  "_shards": {
    "total": 5,
    "successful": 5,
    "failed": 0
  },
  "hits": {
    "total": 9,
    "max_score": 0,
    "hits": []
  },
  "aggregations": {
    "type": {
      "doc_count_error_upper_bound": 0,
      "sum_other_doc_count": 0,
      "buckets": [
        {
          "key": "湘菜",
          "doc_count": 6,
          "rated": {
            "hits": {
              "total": 6,
              "max_score": null,
              "hits": [
                {
                  "_index": "recipes",
                  "_type": "type",
                  "_id": "AVoESHYF_OA-dG63Txsd",
                  "_score": null,
                  "_source": {
                    "name": "鲫鱼汤（变态辣）",
                    "rating": 5,
                    "type": "湘菜"
                  },
                  "sort": [
                    5
                  ]
                }
              ]
            }
          }
        },
        {
          "key": "川菜",
          "doc_count": 1,
          "rated": {
            "hits": {
              "total": 1,
              "max_score": null,
              "hits": [
                {
                  "_index": "recipes",
                  "_type": "type",
                  "_id": "AVoESHYr_OA-dG63Txsf",
                  "_score": null,
                  "_source": {
                    "name": "鱼香肉丝",
                    "rating": 2,
                    "type": "川菜"
                  },
                  "sort": [
                    2
                  ]
                }
              ]
            }
          }
        },
        {
          "key": "粤菜",
          "doc_count": 1,
          "rated": {
            "hits": {
              "total": 1,
              "max_score": null,
              "hits": [
                {
                  "_index": "recipes",
                  "_type": "type",
                  "_id": "AVoESHYW_OA-dG63Txse",
                  "_score": null,
                  "_source": {
                    "name": "广式鲫鱼汤",
                    "rating": 5,
                    "type": "粤菜"
                  },
                  "sort": [
                    5
                  ]
                }
              ]
            }
          }
        },
        {
          "key": "西菜",
          "doc_count": 1,
          "rated": {
            "hits": {
              "total": 1,
              "max_score": null,
              "hits": [
                {
                  "_index": "recipes",
                  "_type": "type",
                  "_id": "AVoESHY3_OA-dG63Txsg",
                  "_score": null,
                  "_source": {
                    "name": "奶油鲍鱼汤",
                    "rating": 2,
                    "type": "西菜"
                  },
                  "sort": [
                    2
                  ]
                }
              ]
            }
          }
        }
      ]
    }
  }
}
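
Since the interesting part of the response is nested several levels deep, a small helper can flatten it on the client side. The sketch below assumes the response has already been parsed into a Python dict; the function name `best_per_bucket` is hypothetical, while the aggregation names `type` and `rated` are the ones chosen in the query above. The sample dict is a trimmed copy of the response with only two buckets and without the metadata fields.

```python
def best_per_bucket(response, agg_name="type", hits_name="rated"):
    """Map each bucket key to the _source of its first top hit (hypothetical helper)."""
    buckets = response["aggregations"][agg_name]["buckets"]
    result = {}
    for bucket in buckets:
        hits = bucket[hits_name]["hits"]["hits"]
        if hits:  # skip a bucket defensively if it carries no hits
            result[bucket["key"]] = hits[0]["_source"]
    return result


# Trimmed copy of the response shown above.
sample = {
    "aggregations": {
        "type": {
            "buckets": [
                {
                    "key": "湘菜",
                    "doc_count": 6,
                    "rated": {"hits": {"total": 6, "hits": [
                        {"_source": {"name": "鲫鱼汤（变态辣）", "rating": 5, "type": "湘菜"}}
                    ]}},
                },
                {
                    "key": "粤菜",
                    "doc_count": 1,
                    "rated": {"hits": {"total": 1, "hits": [
                        {"_source": {"name": "广式鲫鱼汤", "rating": 5, "type": "粤菜"}}
                    ]}},
                },
            ]
        }
    }
}

for cuisine, dish in best_per_bucket(sample).items():
    print(cuisine, dish["name"], dish["rating"])
```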
