lucene Hello World

一个lucene创建索引和查找索引的样例：

创建索引:

public class Indexer {

private  IndexWriter indexWriter;

/**

 * 构造器实例化indexWriter

 * @throws Exception

 */

public Indexer(String indexPath) throws Exception {

Directory directory = FSDirectory.open(Paths.get(indexPath));//索引存储的位置

Analyzer analyzer = new StandardAnalyzer();//标准分析器

IndexWriterConfig iwc = new IndexWriterConfig(analyzer);

indexWriter = new IndexWriter(directory, iwc);

}

/**

 * 关闭indexWriter

 * @param indexWriter

 * @throws IOException

 */

public void close() throws Exception {

indexWriter.close();

}

/**

 * 获取文档Document

 * @throws FileNotFoundException

 */

public Document getDocumnet(File f) throws Exception {

Document doc = new Document();

doc.add(new TextField("content", new FileReader(f)));

doc.add(new TextField("tittle",f.getName(),Field.Store.YES));

doc.add(new TextField("path",f.getCanonicalPath(), Field.Store.YES));

return doc;

}

/**

 * 索引当个文件

 * @throws Exception

 */

public void indexFile(File f) throws Exception {

System.out.println(f.getName());

Document doc = this.getDocumnet(f);

indexWriter.addDocument(doc);

}

/**

 * 索引一个目录下的所有文件

 * @param filePath 目录路径

 * @return 索引文件的个数

 * @throws Exception

 */

public int index(String filePath) throws Exception {

File[] files = new File(filePath).listFiles();

for(File f:files) {

this.indexFile(f);

}

return indexWriter.numDocs();

}

public static void main(String[] args) {

        String indexPath = "G:\\工作\\luence\\index";

        String dataPath = "G:\\工作\\luence\\data";

        Indexer indexer = null;

        int indexNum=0;

        try {

            indexer = new Indexer(indexPath);

            indexNum = indexer.index(dataPath);

        } catch (Exception e) {

            e.printStackTrace();

        }finally {

            try {

                indexer.close();

            } catch (Exception e) {

                e.printStackTrace();

            }

        }

        System.out.println("索引了"+indexNum+"个文件");

    }

}

查找索引:

public class Searcher {

public static void search(String indexPath,String searchStr) throws Exception {

Directory dir = FSDirectory.open(Paths.get(indexPath));

IndexReader indeReader = DirectoryReader.open(dir);

IndexSearcher indexSearch = new IndexSearcher(indeReader);

Analyzer analyzer = new StandardAnalyzer();//标准分词器

QueryParser parser = new QueryParser("content", analyzer);

Query query = parser.parse(searchStr);

TopDocs td = indexSearch.search(query, 10);

for(ScoreDoc sc:td.scoreDocs) {

Document doc = indexSearch.doc(sc.doc);

System.out.println(doc.get("tittle"));

System.out.println(doc.get("path"));

}

}

public static void main(String[] args) throws Exception {

Searcher.search("G:\\工作\\luence\\index\\", "Hollywood");

}

}

lucene Hello World的更多相关文章

lucene 基础知识点
部分知识点的梳理,参考<lucene实战>及网络资料 1.基本概念 lucence 可以认为分为两大组件: 1)索引组件 a.内容获取:即将原始的内容材料,可以是数据库.网站(爬虫).文本 ...
用lucene替代mysql读库的尝试
采用lucene对mysql中的表建索引,并替代全文检索操作. 备注:代码临时梳理很粗糙,后续修改. import java.io.File; import java.io.IOException; ...
Lucene的评分(score)机制研究
首先,需要学习Lucene的评分计算公式—— 分值计算方式为查询语句q中每个项t与文档d的匹配分值之和,当然还有权重的因素.其中每一项的意思如下表所示: 表3.5 评分公式中的因子评分因子描述 ...
Lucene的分析资料【转】
Lucene 源码剖析 1 目录 2 Lucene是什么 2.1.1 强大特性 2.1.2 API组成- 2.1.3 Hello World! 2.1.4 Lucene roadmap 3 索引文件结 ...
Lucene提供的条件判断查询
第一.按词条搜索 - TermQuery query = new TermQuery(new Term("name","word1"));hits = sear ...
Lucene 单域多条件查询
在Lucene 中 BooleanClause用于表示布尔查询子句关系的类,包括:BooleanClause.Occur.MUST表示and,BooleanClause.Occur.MUST_NOT表 ...
lucene自定义过滤器
先介绍下查询与过滤的区别和联系,其实查询(各种Query)和过滤(各种Filter)之间非常相似,可以这样说只要用Query能完成的事,用过滤也都可以完成,它们之间可以相互转换,最大的区别就是使用过滤 ...
lucene+IKAnalyzer实现中文纯文本检索系统
首先IntelliJ IDEA中搭建Maven项目(web):spring+SpringMVC+Lucene+IKAnalyzer spring+SpringMVC搭建项目可以参考我的博客整合Luc ...
全文检索解决方案（lucene工具类以及sphinx相关资料）
介绍两种全文检索的技术. 1. lucene+ 中文分词(IK) 关于lucene的原理,在这里可以得到很好的学习. http://www.blogjava.net/zhyiwww/archive/ ...
MySQL和Lucene索引对比分析
MySQL和Lucene都可以对数据构建索引并通过索引查询数据,一个是关系型数据库,一个是构建搜索引擎(Solr.ElasticSearch)的核心类库.两者的索引(index)有什么区别呢?以前写过 ...

随机推荐

gst-crypto GStreamer插件
gst-crypto GStreamer插件内容 1. gst-crypto概述 1.1gst-crypto GStreamer插件功能 1.2用例范例 2. GStreamer插件支持 3. 在本 ...
如何运行具有奇点的NGC深度学习容器
如何运行具有奇点的NGC深度学习容器 How to Run NGC Deep Learning Containers with Singularity 高性能计算机和人工智能的融合使新的科学突破成为可 ...
预测汽车级Linux专业技术的需求
预测汽车级Linux专业技术的需求 Anticipating need for Automotive Grade Linux expertise 在听了多年汽车级Linux(AGL)及其所有潜力之后, ...
Linux芯片驱动之SPI Controller
针对一款新的芯片,芯片厂商如何基于Linux编写对应的 SPI controller 驱动? 我们先看看 Linux SPI 的整体框架: 可以看到,最底层是硬件层,对应芯片内部 SPI contro ...
MySQL零散知识点（01）
内容概要 --- 表字段操作补充(掌握) --- python操作MySQL(掌握) --- 视图(了解) --- 触发器(了解) --- 存储过程(了解) --- 事务(掌握) --- 内置函数(了 ...
SpringBoot面试题 (史上最全、持续更新、吐血推荐)
文章很长,建议收藏起来,慢慢读! 疯狂创客圈为小伙伴奉上以下珍贵的学习资源: 疯狂创客圈经典图书 : <Netty Zookeeper Redis 高并发实战> 面试必备 + 大厂必备 ...
关于DWG文件转换成PDF
最近有这样一个需求,客户会提供DWG文件,因为DWG文件是不能直接在网页上显示的,所以必须对他做处理,要求是转换成PDF格式.我查了很久的资料,很多都是基于C#和.NET的方法,而且都是说的很模糊,不 ...
Spring Boot开发RESTful接⼝服务及单元测试
Spring Boot开发RESTful接⼝服务及单元测试常用注解解释说明: @Controller :修饰class,⽤来创建处理http请求的对象 @RestController :Spring ...
solidity基础知识
1.solidity是一种语法类似JavaScript的高级语言,它被设计成以编译的方式生成以太坊虚拟机代码.在后续的内容中你将会发现,使用它很容易创建用于投票.众筹.封闭拍卖.多重签名钱包等等的合约 ...
nginx访问fastdfs文件报错400 Bad Request
1.修改vi /etc/fdfs/mod_fastdfs.conf 2.将url_have_group_name = false 改为 url_have_group_name = true 3.重启 ...

lucene Hello World

lucene Hello World的更多相关文章

随机推荐

热门专题