我的网站集成ElasticSearch初体验

最近，我给我的网站(https://www.xiandanplay.com/)尝试集成了一下es来实现我的一个搜索功能，因为这个是我第一次了解运用elastic，所以如果有不对的地方，大家可以指出来，话不多说，先看看我的一个大致流程

这里我采用的sdk的版本是Elastic.Clients.Elasticsearch, Version=8.0.0.0，官方的网址Installation | Elasticsearch .NET Client [8.0] | Elastic

我的es最开始打算和我的应用程序一起部署到ubuntu上面，结果最后安装kibana的时候，各种问题，虽好无奈，只好和我的SqlServer一起安装到windows上面，对于一个2G内容的服务器来说，属实有点遭罪了。

1、配置es

在es里面，我开启了密码认证。下面是我的配置

"Search": {

    "IsEnable": "true",

    "Uri": "http://127.0.0.1:9200/",

    "User": "123",

    "Password": "123"

  }
然后新增一个程序集

然后再ElasticsearchClient里面去写一个构造函数去配置es

using Core.Common;

using Core.CPlatform;

using Core.SearchEngine.Attr;

using Elastic.Clients.Elasticsearch;

using Elastic.Clients.Elasticsearch.IndexManagement;

using Elastic.Transport;

namespace Core.SearchEngine.Client

{

    public class ElasticSearchClient : IElasticSearchClient

    {

        private ElasticsearchClient elasticsearchClient;

        public ElasticSearchClient()

        {

            string uri = ConfigureProvider.configuration.GetSection("Search:Uri").Value;

            string username = ConfigureProvider.configuration.GetSection("Search:User").Value;

            string password = ConfigureProvider.configuration.GetSection("Search:Password").Value;

            var settings = new ElasticsearchClientSettings(new Uri(uri))

                          .Authentication(new BasicAuthentication(username, password)).DisableDirectStreaming();

            elasticsearchClient = new ElasticsearchClient(settings);

        }

        public ElasticsearchClient GetClient()

        {

            return elasticsearchClient;

        }

    }

}

　　然后，我们看skd的官网有这个这个提示

客户端应用程序应创建一个该实例，该实例在整个应用程序中用于整个应用程序辈子。在内部，客户端管理和维护与节点的 HTTP 连接，重复使用它们以优化性能。如果您使用依赖项注入容器中，客户端实例应注册到单例生存期

所以我直接给它来一个AddSingleton

using Core.SearchEngine.Client;

using Microsoft.Extensions.DependencyInjection;

namespace Core.SearchEngine

{

    public static class ConfigureSearchEngine

    {

        public static void AddSearchEngine(this IServiceCollection services)

        {

            services.AddSingleton<IElasticSearchClient, ElasticSearchClient>();

        }

    }

}

2、提交文章并且同步到es

然后就是同步文章到es了，我是先写入数据库，再同步到rabbitmq，通过事件总线（基于事件总线EventBus实现邮件推送功能）写入到es

先定义一个es模型

using Core.SearchEngine.Attr;

using System;

using System.Collections.Generic;

using System.Linq;

using System.Text;

using System.Threading.Tasks;

using XianDan.Model.BizEnum;

namespace XianDan.Domain.Article

{

    [ElasticsearchIndex(IndexName ="t_article")]//自定义的特性，sdk并不包含这个特性

    public class Article_ES

    {

        public long Id { get; set; }

        /// <summary>

        /// 作者

        /// </summary>

        public string Author { get; set; }

        /// <summary>

        /// 标题

        /// </summary>

        public string Title { get; set; }

        /// <summary>

        /// 标签

        /// </summary>

        public string Tag { get; set; }

        /// <summary>

        /// 简介

        /// </summary>

        public string Description { get; set; }

        /// <summary>

        /// 内容

        /// </summary>

        public string ArticleContent { get; set; }

        /// <summary>

        /// 专栏

        /// </summary>

        public long ArticleCategoryId { get; set; }

        /// <summary>

        /// 是否原创

        /// </summary>

        public bool? IsOriginal { get; set; }

        /// <summary>

        /// 评论数

        /// </summary>

        public int? CommentCount { get; set; }

        /// <summary>

        /// 点赞数

        /// </summary>

        public int? PraiseCount { get; set; }

        /// <summary>

        /// 浏览次数

        /// </summary>

        public int? BrowserCount { get; set; }

        /// <summary>

        /// 收藏数量

        /// </summary>

        public int? CollectCount { get; set; }

        /// <summary>

        /// 创建时间

        /// </summary>

        public DateTime CreateTime { get; set; }

    }

}

然后创建索引

 string index = esArticleClient.GetIndexName(typeof(Article_ES));

            await esArticleClient.GetClient().Indices.CreateAsync<Article_ES>(index, s =>

            s.Mappings(

                x => x.Properties(

                    t => t.LongNumber(l => l.Id)

                         .Text(l=>l.Title,z=>z.Analyzer(ik_max_word))

                         .Keyword(l=>l.Author)

                         .Text(l=>l.Tag,z=>z.Analyzer(ik_max_word))

                         .Text(l=>l.Description,z=>z.Analyzer(ik_max_word))

                         .Text(l=>l.ArticleContent,z=>z.Analyzer(ik_max_word))

                         .LongNumber(l=>l.ArticleCategoryId)

                         .Boolean(l=>l.IsOriginal)

                         .IntegerNumber(l=>l.BrowserCount)

                         .IntegerNumber(l=>l.PraiseCount)

                         .IntegerNumber(l=>l.PraiseCount)

                         .IntegerNumber(l=>l.CollectCount)

                         .IntegerNumber(l=>l.CommentCount)

                         .Date(l=>l.CreateTime)

                    )

                )

            );

然后每次增删改文章的时候写入到mq，例如

 private async Task SendToMq(Article article, Operation operation)

        {

            ArticleEventData articleEventData = new ArticleEventData();

            articleEventData.Operation = operation;

            articleEventData.Article_ES = MapperUtil.Map<Article, Article_ES>(article);

            TaskRecord taskRecord = new TaskRecord();

            taskRecord.Id = CreateEntityId();

            taskRecord.TaskType = TaskRecordType.MQ;

            taskRecord.TaskName = "发送文章";

            taskRecord.TaskStartTime = DateTime.Now;

            taskRecord.TaskStatu = (int)MqMessageStatu.New;

            articleEventData.Unique = taskRecord.Id.ToString();

            taskRecord.TaskValue = JsonConvert.SerializeObject(articleEventData);

            await unitOfWork.GetRepository<TaskRecord>().InsertAsync(taskRecord);

            await unitOfWork.CommitAsync();

            try

            {

                eventBus.Publish(GetMqExchangeName(), ExchangeType.Direct, BizKey.ArticleQueueName, articleEventData);

            }

            catch (Exception ex)

            {

                var taskRecordRepository = unitOfWork.GetRepository<TaskRecord>();

                TaskRecord update = await taskRecordRepository.SelectByIdAsync(taskRecord.Id);

                update.TaskStatu = (int)MqMessageStatu.Fail;

                update.LastUpdateTime = DateTime.Now;

                update.TaskResult = "发送失败";

                update.AdditionalData = ex.Message;

                await taskRecordRepository.UpdateAsync(update);

                await unitOfWork.CommitAsync();

            }

        }

mq订阅之后写入es，具体的增删改的方法就不写了吧

3、开始查询es

等待写入文章之后，开始查询文章，这里sdk提供的查询的方法比较复杂，全都是通过lmbda一个个链式去拼接的，但是我又没有找到更好的方法，所以就先这样吧

先创建一个集合存放查询的表达式

List<Action<QueryDescriptor<Article_ES>>> querys = new List<Action<QueryDescriptor<Article_ES>>>();

然后定义一个几个需要查询的字段

我这里使用MultiMatch来实现多个字段匹配同一个查询条件，并且指定使用ik_smart分词

Field[] fields =

                {

                    new Field("title"),

                    new Field("tag"),

                    new Field("articleContent"),

                    new Field("description")

                };

 querys.Add(s => s.MultiMatch(y => y.Fields(Fields.FromFields(fields)).Analyzer(ik_smart).Query(keyword).Type(TextQueryType.MostFields)));

定义查询结果高亮，给查询出来的匹配到的分词的字段添加标签，同时前端需要对这个样式处理，

:deep(.search-words) em {

color: #ee0f29;

font-style: initial;

}

 Dictionary<Field, HighlightField> highlightFields = new Dictionary<Field, HighlightField>();

            highlightFields.Add(new Field("title"), new HighlightField()

            {

                PreTags = new List<string> { "<em>" },

                PostTags = new List<string> { "</em>" },

            });

            highlightFields.Add(new Field("description"), new HighlightField()

            {

                PreTags = new List<string> { "<em>" },

                PostTags = new List<string> { "</em>" },

            });

            Highlight highlight = new Highlight()

            {

                Fields = highlightFields

            };

为了提高查询的效率，我只查部分的字段

 SourceFilter sourceFilter = new SourceFilter();

            sourceFilter.Includes = Fields.FromFields(new Field[] { "title", "id", "author", "description", "createTime", "browserCount", "commentCount" });

            SourceConfig sourceConfig = new SourceConfig(sourceFilter);

            Action<SearchRequestDescriptor<Article_ES>> configureRequest = s => s.Index(index)

            .From((homeArticleCondition.CurrentPage - 1) * homeArticleCondition.PageSize)

            .Size(homeArticleCondition.PageSize)

            .Query(x => x.Bool(y => y.Must(querys.ToArray())))

            .Source(sourceConfig)

             .Sort(y => y.Field(ht => ht.CreateTime, new FieldSort() { Order=SortOrder.Desc}))

获取查询的分词结果

 var analyzeIndexRequest = new AnalyzeIndexRequest

            {

                Text = new string[] { keyword },

                Analyzer = analyzer

            };

            var analyzeResponse = await elasticsearchClient.Indices.AnalyzeAsync(analyzeIndexRequest);

            if (analyzeResponse.Tokens == null)

                return new string[0];

            return analyzeResponse.Tokens.Select(s => s.Token).ToArray();

到此，这个就是大致的查询结果，完整的如下

 public async Task<Core.SearchEngine.Response.SearchResponse<Article_ES>> SelectArticle(HomeArticleCondition homeArticleCondition)

        {

            string keyword = homeArticleCondition.Keyword.Trim();

            bool isNumber = Regex.IsMatch(keyword, RegexPattern.IsNumberPattern);

            List<Action<QueryDescriptor<Article_ES>>> querys = new List<Action<QueryDescriptor<Article_ES>>>();

            if (isNumber)

            {

                querys.Add(s => s.Bool(x => x.Should(

                    should => should.Term(f => f.Field(z => z.Title).Value(keyword))

                    , should => should.Term(f => f.Field(z => z.Tag).Value(keyword))

                    , should => should.Term(f => f.Field(z => z.ArticleContent).Value(keyword))

                    )));

            }

            else

            {

                Field[] fields =

                {

                    new Field("title"),

                    new Field("tag"),

                    new Field("articleContent"),

                    new Field("description")

                };

                querys.Add(s => s.MultiMatch(y => y.Fields(Fields.FromFields(fields)).Analyzer(ik_smart).Query(keyword).Type(TextQueryType.MostFields)));

            }

            if (homeArticleCondition.ArticleCategoryId.HasValue)

            {

                querys.Add(s => s.Term(t => t.Field(f => f.ArticleCategoryId).Value(FieldValue.Long(homeArticleCondition.ArticleCategoryId.Value))));

            }

            string index = esArticleClient.GetIndexName(typeof(Article_ES));

            Dictionary<Field, HighlightField> highlightFields = new Dictionary<Field, HighlightField>();

            highlightFields.Add(new Field("title"), new HighlightField()

            {

                PreTags = new List<string> { "<em>" },

                PostTags = new List<string> { "</em>" },

            });

            highlightFields.Add(new Field("description"), new HighlightField()

            {

                PreTags = new List<string> { "<em>" },

                PostTags = new List<string> { "</em>" },

            });

            Highlight highlight = new Highlight()

            {

                Fields = highlightFields

            };

            SourceFilter sourceFilter = new SourceFilter();

            sourceFilter.Includes = Fields.FromFields(new Field[] { "title", "id", "author", "description", "createTime", "browserCount", "commentCount" });

            SourceConfig sourceConfig = new SourceConfig(sourceFilter);

            Action<SearchRequestDescriptor<Article_ES>> configureRequest = s => s.Index(index)

            .From((homeArticleCondition.CurrentPage - 1) * homeArticleCondition.PageSize)

            .Size(homeArticleCondition.PageSize)

            .Query(x => x.Bool(y => y.Must(querys.ToArray())))

            .Source(sourceConfig)

             .Sort(y => y.Field(ht => ht.CreateTime, new FieldSort() { Order=SortOrder.Desc})).Highlight(highlight);

            var resp = await esArticleClient.GetClient().SearchAsync<Article_ES>(configureRequest);

            foreach (var item in resp.Hits)

            {

                if (item.Highlight == null)

                    continue;

                foreach (var dict in item.Highlight)

                {

                    switch (dict.Key)

                    {

                        case "title":

                            item.Source.Title = string.Join("...", dict.Value);

                            break;

                        case "description":

                            item.Source.Description = string.Join("...", dict.Value);

                            break;

                    }

                }

            }

            string[] analyzeWords = await esArticleClient.AnalyzeAsync(homeArticleCondition.Keyword);

            List<Article_ES> articles = resp.Documents.ToList();

            return new Core.SearchEngine.Response.SearchResponse<Article_ES>(articles, analyzeWords);

        }

4、演示效果

搞完之后，发布部署，看看效果，分词这里要想做的像百度那样，估计目前来看非常有难度的

那么这里我也向大家求教一下，如何使用SearchRequest封装多个查询条件，如下

SearchRequest searchRequest = new SearchRequest();
searchRequest.From = 0;
searchRequest.Size = 10;
searchRequest.Query=多个查询条件

因为我觉得这样代码读起来比lambda可读性高些，能更好的动态封装。

我的网站集成ElasticSearch初体验的更多相关文章

【docker Elasticsearch】Rest风格的分布式开源搜索和分析引擎Elasticsearch初体验
概述: Elasticsearch 是一个分布式.可扩展.实时的搜索与数据分析引擎. 它能从项目一开始就赋予你的数据以搜索.分析和探索的能力,这是通常没有预料到的. 它存在还因为原始数据如果只是躺在磁 ...
【ES】ElasticSearch初体验之使用Java进行最基本的增删改查~
好久没写博文了, 最近项目中使用到了ElaticSearch相关的一些内容, 刚好自己也来做个总结. 现在自己也只能算得上入门, 总结下自己在工作中使用Java操作ES的一些小经验吧. 本文总共分为三 ...
ElasticSearch初体验之使用
好久没写博文了, 最近项目中使用到了ElaticSearch相关的一些内容, 刚好自己也来做个总结.现在自己也只能算得上入门, 总结下自己在工作中使用Java操作ES的一些小经验吧. 本文总共分为三个 ...
Spring boot集成redis初体验
pom.xml: <?xml version="1.0" encoding="UTF-8"?> <project xmlns="ht ...
Spring boot集成Rabbit MQ使用初体验
Spring boot集成Rabbit MQ使用初体验 1.rabbit mq基本特性首先介绍一下rabbitMQ的几个特性 Asynchronous Messaging Supports mult ...
webpack初体验_集成插件_集成loader
webpack初体验如果没装 webpack 就先装一下,命令行输入npm i webpack -g 新建一个项目创建一个空的项目定义一个名称创建一个Module 选择静态 web 输入名称 ...
JAVA中使用最广泛的本地缓存？Ehcache的自信从何而来2 —— Ehcache的各种项目集成与使用初体验
大家好,又见面了. 本文是笔者作为掘金技术社区签约作者的身份输出的缓存专栏系列内容,将会通过系列专题,讲清楚缓存的方方面面.如果感兴趣,欢迎关注以获取后续更新. 在上一篇文章<JAVA中使用最广 ...
在同一个硬盘上安装多个 Linux 发行版及 Fedora 21 、Fedora 22 初体验
在同一个硬盘上安装多个 Linux 发行版以前对多个 Linux 发行版的折腾主要是在虚拟机上完成.我的桌面电脑性能比较强大,玩玩虚拟机没啥问题,但是笔记本电脑就不行了.要在我的笔记本电脑上折腾多个 ...
Question2Answer初体验
Question2Answer初体验高质量的问答社区十分有价值,很多无法解决的问题能通过问答社区找到解决办法,而对于站长来说,垂直的问答社区也很有潜力.最近盯上问答这一块,发现和我的一些思路很符 ...
Net Core平台灵活简单的日志记录框架NLog+Mysql组合初体验
Net Core平台灵活简单的日志记录框架NLog初体验前几天分享的"[Net Core集成Exceptionless分布式日志功能以及全局异常过滤][https://www.cnblog ...

随机推荐

oeasy教您玩转vim - 42 - # 剪切进入
剪切进入回忆上节课内容上次我们了解到了各种寄存器 :reg 无名寄存器"" 数字寄存器"0-"9 行内删除专用寄存器"- 指定寄存器" ...
CF1359A 题解
洛谷链接&CF 链接题目简述共有 \(T\) 组数据. 对于每组数据给出 \(n,m,k\),表示 \(k\) 名玩家打牌,共 \(n\) 张牌,\(m\) 张王,保证 \(k \mid ...
MapGIS路网数据发布
准备 1.MapGIS 10 桌面版(我用的10.5.6.10) 2.路网的shp文件数据导入 1.创建要素集,如果已有要素集可以不用创建: 2.导入路网要素类,选择准备好的shp文件后导入即可: ...
【JavaWeb】如何越过SpringMVC直接返回内容
来自前同事问的一个问题,因为项目里面的SpringMVC会封装好一个固定的JSON响应规范: 可以看见,data属性下面,又会有一层data, 数据的消费方提出要求,只需要里面data的数据,外面的J ...
【Zookeeper】Re01 安装与操作
Zookeeper基于JDK开发出来的运行环境至少需要JRE 快速安装JDK: yum install -y java-1.8.0-openjdk-devel.x86_64 # ZK镜像仓库 htt ...
【Vue】Re22 Axios
Axios[AJAX I\O System] 创建案例项目并且安装Axios npm install axios --save 接口测试网址: http://httpbin.org/ 案例提供的数据地 ...
中国AI领域超越美国的拐点在哪 —— 国产AI芯片量产化的成本接近于美国成熟AI芯片的成本
作为AI领域的一个大头兵,本是没有资格去谈论high level层面的东西的,只不过总有些忍不得说的事情. 今天这里就说下个人对中国AI发展的一个观点或是预测,在我看来中国AI领域超越美国的拐点就在于 ...
强化学习中atari游戏环境下帧的预处理操作
在网上找到一个Rainbow算法的代码(https://gitee.com/devilmaycry812839668/Rainbow),在里面找到了atari游戏环境下帧的预处理操作. 具体代码地址: ...
【转载】python的魔法方法———A Guide to Python's Magic Methods
原文地址: https://rszalski.github.io/magicmethods/ ===================================================== ...
Long Way to be Non-decreasing 题解
前言题目链接:洛谷:CF. 题意简述 yzh 喜欢单调不降序列. 她有一个序列 \(a\),最初为 \(a_1, \ldots, a_n\),其中每个元素都在 \([1, m]\) 内. 她希望使序 ...

我的网站集成ElasticSearch初体验

我的网站集成ElasticSearch初体验的更多相关文章

随机推荐

热门专题