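Solr 5.2 and MySQL: incremental (delta) indexing

Background: when rows in the database are inserted, updated, or deleted, the corresponding Solr index has to be created or updated as well. The earlier procedure for importing data from the database and building the index was a full import. When the database holds a very large amount of data, rebuilding everything each time is too expensive, so the index needs to be built incrementally.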

1. In /usr/local/src/solr-5.2.1/server/solr/doc/conf, edit solrconfig.xml and add the following.

This handler performs the full import:

<requestHandler name="/dataimport" class="org.apache.solr.handler.dataimport.DataImportHandler">
    <lst name="defaults">
        <str name="config">data-config.xml</str>
    </lst>
</requestHandler>

This handler performs the incremental (delta) import:

<requestHandler name="/deltaimport" class="org.apache.solr.handler.dataimport.DataImportHandler">
    <lst name="defaults">
        <str name="config">delta-data-config.xml</str>
    </lst>
</requestHandler>
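Both handlers depend on the DataImportHandler contrib and on a MySQL JDBC driver being visible to the core. If the earlier full-import setup has not already taken care of this, solrconfig.xml may also need lib directives along the lines of the sketch below. The first line follows the pattern used in Solr's own DIH example configs; the second is an assumption: it supposes the MySQL Connector/J jar was copied by hand into a contrib/dataimporthandler/lib directory, which is a hypothetical location and should be adjusted to wherever the jar actually lives.

```xml
<!-- Loads the DataImportHandler jars shipped in the Solr distribution's dist/ directory -->
<lib dir="${solr.install.dir:../../../..}/dist/" regex="solr-dataimporthandler-.*\.jar" />

<!-- Assumption: mysql-connector-java-*.jar was copied into this (hypothetical) directory -->
<lib dir="${solr.install.dir:../../../..}/contrib/dataimporthandler/lib/" regex="mysql-connector-java-.*\.jar" />
```

If the driver is missing, the import typically fails with a class-loading error for com.mysql.jdbc.Driver, which is a quick way to tell whether these lines are needed.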

2. In the same directory, /usr/local/src/solr-5.2.1/server/solr/doc/conf, data-config.xml defines the full import:

<?xml version="1.0" encoding="UTF-8" ?>
<dataConfig>
    <dataSource type="JdbcDataSource" driver="com.mysql.jdbc.Driver" url="jdbc:mysql://localhost/documents" user="root" password="12345"/>
    <document>
        <!-- Full import: read every row of the document table and map each column to a Solr field -->
        <entity name="doc_import" pk="id" query="select id,file_name,file_type,file_path,file_content from document">
            <field column="id" name="id" />
            <field column="file_name" name="file_name" />
            <field column="file_type" name="file_type" />
            <field column="file_path" name="file_path" />
            <field column="file_content" name="file_content" />
        </entity>
    </document>
</dataConfig>

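3. In the same directory, /usr/local/src/solr-5.2.1/server/solr/doc/conf, delta-data-config.xml defines the incremental import.

The document table has a create_time column whose default value is CURRENT_TIMESTAMP.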

<dataConfig>
    <dataSource name="jdbcDataSource" type="JdbcDataSource" driver="com.mysql.jdbc.Driver" url="jdbc:mysql://localhost/documents" user="root" password="12345"/>
    <document name="doc">
        <!-- deltaQuery returns the ids of rows created since the last import;
             deltaImportQuery then loads each of those rows by id -->
        <entity dataSource="jdbcDataSource" name="doc_import_add" pk="id"
          query="select id,file_name,file_type,file_path,file_content from document"
          deltaImportQuery="select id,file_name,file_type,file_path,file_content from document where id = ${dih.delta.id}"
          deltaQuery="select id from document where create_time &gt; '${dih.last_index_time}'">
            <field column="id" name="id" />
            <field column="file_name" name="file_name" />
            <field column="file_type" name="file_type" />
            <field column="file_path" name="file_path" />
            <field column="file_content" name="file_content" />
        </entity>
    </document>
</dataConfig>
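Because create_time is set once when a row is inserted, a delta query on that column only picks up newly added rows. To also cover the updates and deletes mentioned in the background, DIH lets deltaQuery compare a modification timestamp and lets deletedPkQuery return the ids to remove from the index. The following is a minimal sketch, not the original setup: update_time (for example declared with ON UPDATE CURRENT_TIMESTAMP) and the soft-delete flag is_deleted are hypothetical columns that the original table is not stated to have, and the entity name doc_import_delta is made up for illustration.

```xml
<dataConfig>
    <dataSource name="jdbcDataSource" type="JdbcDataSource" driver="com.mysql.jdbc.Driver" url="jdbc:mysql://localhost/documents" user="root" password="12345"/>
    <document name="doc">
        <!-- Assumption: update_time and is_deleted are hypothetical columns added to the table.
             deltaQuery      -> ids of live rows inserted or modified since the last import
             deletedPkQuery  -> ids of rows flagged as deleted since the last import -->
        <entity dataSource="jdbcDataSource" name="doc_import_delta" pk="id"
          query="select id,file_name,file_type,file_path,file_content from document where is_deleted = 0"
          deltaQuery="select id from document where is_deleted = 0 and update_time &gt; '${dih.last_index_time}'"
          deltaImportQuery="select id,file_name,file_type,file_path,file_content from document where id = ${dih.delta.id}"
          deletedPkQuery="select id from document where is_deleted = 1 and update_time &gt; '${dih.last_index_time}'">
            <field column="id" name="id" />
            <field column="file_name" name="file_name" />
            <field column="file_type" name="file_type" />
            <field column="file_path" name="file_path" />
            <field column="file_content" name="file_content" />
        </entity>
    </document>
</dataConfig>
```

A soft-delete flag is used in this sketch because a physically deleted row leaves nothing behind for deletedPkQuery to select.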

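4. Restart Solr. Afterwards, an incremental import is started by calling the new handler with command=delta-import, for example http://localhost:8983/solr/doc/deltaimport?command=delta-import (this URL assumes the default port 8983 and a core named doc, as the directory above suggests).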
