SharePoint 2013: Search Architecture in SPC202
Presentation broken down into 4 parts:
- Feeding Chain(crawler and content processing)
- Index core(index components)
- Query Chain
- Analytics Component
A few steps back: In SharePoint 2010 we had a built-in search service, but also shipped FAST search service in addition to that. With SharePoint 2013, all different components are merged into a single cohesive search service.
In addition to SharePoint, Search component is also shared inside Exchange. Search with FAST technologies now spans everything from Outlook Web Access through eDiscovery through intranet search scenarios. Immense investments into Internet search as well, with such features as Product Catalogue and Cross-Site Publishing. We are now using search without even thinking about it in many areas that were not traditionally search-driven in the past. It's much more than the original "Search Box".
Feeding Chain
- Crawl Component: Crawler in SP2013 is much more specialized. Extensible through BCS, Local Disk Cache, OOB Connectors, Configurations stored in Admin database. Big Change: New crawl mode called Continuous crawl mode. Runs continuously due to parallel processing. Crawl components scale individually now, no longer interlocked scaling regarding crawl components->crawl db's.
- Content Processing Component: Stateless node, analyzes content for indexing, uses "Processing flows", schema mappings, stores links and anchors in Links db(analytics), extensible via web service call-outs, configurations stored in Admin database. Each CrawlerFlow processes one document at a time. Note: Security crawls should be done faster now, due to using Update Groups.
Index Core
- Index component: Index is a state-full component.
- New concepts: Partitions and Replicas - no more Columns and Rows. All nodes perform indexing. Journal Shipping from primary to replicas. Each partition can have many replicas. Different from SharePoint 2010(but typical for FAST), index is stored on local disk and not in the database, i.e how SharePoint 2010 stores it in Property db.
- Improvements to index freshness: as content comes in, it goes straight to memory and is immediately searchable. Eventually flushed to disk. In previous FAST products, documents would have to first be stored on disk prior to being searchable.
- Understanding Index schemas: Crawled property -> Managed property, Schema administration can now be done on a Site Collection level. Adding "Searchable" and Retrievable" to managed property properties.
Query Chain
- Web Front-End: REST/ODATA API, CSOM and SSOM, Portals and Publishing, Search Center, ContentWebPart, RefinerWebPart, Result Templates. Programming model that allows you to serve both in the cloud and on-premise. Can take advantage of these API's and develop applications running on Surface/phones/tablets.
- Query Processing Component: Stateless node, processing query flows, query analyzer, linguistics/dictionaries, result sources, schema mappings, query rules, query federation, configuration stored in Admin database.
Notes
- Query federation: Possible for on-premise/cloud federation.
- Query router uses various Search Provider Flows(i.e Best Bets, People Search, Exchange Search, Local SharePoint Search,, Remote SharePoint Search, Personal Favorites, etc). These are exposed to end-users and administrators via Result Sources/Query Rules and such.
- Query Rules: Captures Search Intent. Composed of 3 top level elements:
- Query Conditions: What queries should be handled
- Query Actions: What happens when a rule matches
- Publishing Options: Is the rule active and for how long.
Analytics Service
- Analytics Processing Component: Map/Reduce, learns by usage, Search Analytics, Usage Analytics, enriches index by updating index items, Usage reports in Analytics database.
Making Recommendations is made easy. Recommendations by user behavior.
Built-in recommendations:
Event stream analysis
Item-to-item recommendations
Stored in "recommendedfor" managed property. Up to 12 configurable, weighted events.
Note: Example: If i'm building an application using Java, can I take my events and feed them into the Analytics Engine. Yes, through the REST API or CSOM.
SharePoint 2013: Search Architecture in SPC202的更多相关文章
- SharePoint 2013 Search 配置总结
前言:SharePoint 2013集成了Fast搜索以后,搜索的配置有了些许改变,自己在配置过程中,也记录下了一些入门的东西,希望能够对大家有所帮助. 1.配置搜索服务,首先需要启用搜索功能,基本思 ...
- SharePoint 2013 Search REST API 使用示例
前言:在SharePoint2013中,提供Search REST service搜索服务,你可以在自己的客户端搜索方法或者移动应用程序中使用,该服务支持REST web request.你可以使用K ...
- 配置SharePoint 2013 Search 拓扑结构
在单台服务器上安装了 SharePoint Server 2013,并且创建了具有默认搜索拓扑的 Search Service 应用程序.在默认搜索拓扑中,所有搜索组件都位于承载管理中心的服务器上.S ...
- 修复SharePoint 2013 Search 拓扑错误
Problem 当创建和配置SharePoint 2013 Search Service Application成功之后,进入详细配置页后,在Search Application Topology节点 ...
- 探索 SharePoint 2013 Search功能
转载: http://www.cnblogs.com/OceanEyes/p/SharePont-2013-Search-Develop.html SharePoint 2013的搜索功能很强大,特别 ...
- 转载 SharePoint 2013 Search功能
转载原出处: http://www.cnblogs.com/OceanEyes/p/SharePont-2013-Search-Develop.html 阅读目录 启用Search Service A ...
- Customize the SharePoint 2013 search experience with a Content Enrichment web service
Did you ever wish you had more control over how your content is indexed and presented as search resu ...
- sharepoint 2013 search configuration
在建立search application之前完成以下命令 $hostA = Get-SPEnterpriseSearchServiceInstance -Identity "WTCSPS0 ...
- SharePoint 2013 入门教程
以下文章是自己在学习SharePoint的过程中,不断积累和总结的博文,现在总结一个目录,分享给大家.这个博客也是自己从SharePoint入门,到一个SharePoint开发的成长记录,里面记录的都 ...
随机推荐
- Eclipse:Cannot complete the install because of a conflicting dependency.问题解决
今天尝试在线更新ADT(22到23)的时候,遇到了这么个问题,从错误提示中初步看起来是存在引用的冲突: 估计大家在把22升级到23的时候都会遇上这个问题,新旧版冲突,感觉像是ADT自己的bug. 其实 ...
- jQuery手机端上拉刷新下拉加载更多页面
基于jQuery手机端上拉下拉刷新页面代码.这是一款类似QQ空间客户端或者微信下拉刷新页面特效代码.效果图如下: 在线预览 源码下载 实现的代码. html代码: <div id=" ...
- Unity3D内置着色器
Unity内部提供了一些可以直接使用的着色器,这些内置着色器包括以下6个方面: (1)Performance of Unity shaders 着色器的性能和两个方面有关:shader本身和rende ...
- [Python] Debugger in Pycharm
From: http://blog.csdn.net/u013088062/article/details/50214459 From: http://blog.csdn.net/u013088062 ...
- 自己动手模拟百分百<select>下拉列表
浏览器默认的下拉确实不好用啊,主要是样式不好修改和统一. (一)下手之前先理清一下<select>的流程: 1.结构:<select> <option value=&qu ...
- Feathers JS – 基于 Express 构建数据驱动的服务
Feathers 是一个轻量的 Web 应用程序框架,基于 NodeJS 最流行的 Web 框架——Express.这使得它很容易使用 socket.io 来创建 RESTful Web 服务和实 ...
- 读Java Concurrency in Practice. 第六章.
这一章开讲任务执行.绝大多数并发程序的工作都可以分解为抽象的.互不相关的工作单元,称之为任务(Task). 使用java线程来执行任务 以web服务器的实现举例, 此时将用户的一次连接,当做一个独立的 ...
- Laravel 5 服务的注册和使用
首先我们需要创建一个服务的类,比如叫CqhServiceProvider,最简单的方式就是用artisan来帮我们创建 php artisan make:provider CqhServiceProv ...
- javascript中&&和||的区别
javascript中&&和||的区别 &&和||操作符两边不是布尔类型时,系统会转换成布尔类型值再计算(空字符串.null.0都会被转成false),结果本身不变. ...
- WebGL on iOS8 终于等到了这一天
WWDC2014刚结束,这次的大会是名符其实的开发者大会,更贴切的应该说的确是一次软件开发者的大会,对于OSX和iOS的更多功能特性让人兴奋,Swift新语言促成了如上图片 但我更感兴趣的是WebGL ...