Lucene Hello World

A sample that creates a Lucene index and then searches it:

Creating the index:

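import java.io.File;
import java.io.FileReader;
import java.io.IOException;
import java.nio.file.Paths;

import org.apache.lucene.analysis.Analyzer;
import org.apache.lucene.analysis.standard.StandardAnalyzer;
import org.apache.lucene.document.Document;
import org.apache.lucene.document.Field;
import org.apache.lucene.document.TextField;
import org.apache.lucene.index.IndexWriter;
import org.apache.lucene.index.IndexWriterConfig;
import org.apache.lucene.store.Directory;
import org.apache.lucene.store.FSDirectory;

public class Indexer {

    private final IndexWriter indexWriter;

    /**
     * Creates the IndexWriter for the given index location.
     *
     * @param indexPath directory in which the index is stored
     * @throws Exception if the index directory cannot be opened
     */
    public Indexer(String indexPath) throws Exception {
        Directory directory = FSDirectory.open(Paths.get(indexPath)); // where the index is stored
        Analyzer analyzer = new StandardAnalyzer(); // standard analyzer
        IndexWriterConfig iwc = new IndexWriterConfig(analyzer);
        indexWriter = new IndexWriter(directory, iwc);
    }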

    /**
     * Closes the IndexWriter, committing any pending changes.
     *
     * @throws IOException if the writer cannot be closed
     */
    public void close() throws IOException {
        indexWriter.close();
    }

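    /**
     * Builds the Lucene Document for a file.
     *
     * @throws Exception if the file cannot be read
     */
    public Document getDocument(File f) throws Exception {
        Document doc = new Document();
        // The content is tokenized from the reader and indexed, but not stored.
        doc.add(new TextField("content", new FileReader(f)));
        // The file name and path are stored so that search results can display them.
        // "tittle" is the field name the Searcher reads back, so it is kept as-is.
        doc.add(new TextField("tittle", f.getName(), Field.Store.YES));
        doc.add(new TextField("path", f.getCanonicalPath(), Field.Store.YES));
        return doc;
    }

    /**
     * Indexes a single file.
     *
     * @throws Exception if the file cannot be read or indexed
     */
    public void indexFile(File f) throws Exception {
        System.out.println(f.getName());
        Document doc = this.getDocument(f);
        indexWriter.addDocument(doc);
    }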

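    /**
     * Indexes every file directly inside a directory.
     *
     * @param filePath path of the directory
     * @return number of files indexed
     * @throws Exception if a file cannot be read or indexed
     */
    public int index(String filePath) throws Exception {
        File[] files = new File(filePath).listFiles();
        if (files == null) { // listFiles() returns null when the path is not a readable directory
            return 0;
        }
        int count = 0;
        for (File f : files) {
            if (f.isFile()) { // skip subdirectories, which FileReader cannot open
                this.indexFile(f);
                count++;
            }
        }
        // Count the files added in this run; the writer's own total would also
        // include documents left in the index by earlier runs.
        return count;
    }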

    public static void main(String[] args) {
        String indexPath = "G:\\工作\\luence\\index";
        String dataPath = "G:\\工作\\luence\\data";
        Indexer indexer = null;
        int indexNum = 0;
        try {
            indexer = new Indexer(indexPath);
            indexNum = indexer.index(dataPath);
        } catch (Exception e) {
            e.printStackTrace();
        } finally {
            if (indexer != null) { // the constructor may have failed before assignment
                try {
                    indexer.close();
                } catch (Exception e) {
                    e.printStackTrace();
                }
            }
        }
        System.out.println("Indexed " + indexNum + " files");
    }

}
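
File.listFiles(), which the index method relies on, returns null when the path is not a directory and also returns subdirectories alongside files. The following is a minimal sketch, using only the JDK, of one way to collect just the regular files before indexing them. DataFiles and listRegularFiles are hypothetical names chosen for illustration; they are not part of the original code or of Lucene.

```java
import java.io.IOException;
import java.nio.file.DirectoryStream;
import java.nio.file.Files;
import java.nio.file.Path;
import java.util.ArrayList;
import java.util.Collections;
import java.util.List;

/** Hypothetical helper, not part of Lucene or of the Indexer class. */
final class DataFiles {

    /** Returns the regular files directly inside dir, sorted by path; subdirectories are skipped. */
    static List<Path> listRegularFiles(Path dir) throws IOException {
        List<Path> files = new ArrayList<>();
        // newDirectoryStream throws an IOException if dir is not a directory,
        // instead of silently returning null as File.listFiles() does.
        try (DirectoryStream<Path> entries = Files.newDirectoryStream(dir)) {
            for (Path entry : entries) {
                if (Files.isRegularFile(entry)) {
                    files.add(entry);
                }
            }
        }
        Collections.sort(files);
        return files;
    }

    public static void main(String[] args) throws IOException {
        Path dir = Files.createTempDirectory("lucene-data");
        Files.createFile(dir.resolve("b.txt"));
        Files.createFile(dir.resolve("a.txt"));
        Files.createDirectory(dir.resolve("sub"));
        for (Path file : listRegularFiles(dir)) {
            System.out.println(file.getFileName()); // a.txt, then b.txt
        }
    }
}
```

Each returned Path can be passed on as path.toFile() to a method that expects a File, such as indexFile.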

Searching the index:

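import java.nio.file.Paths;

import org.apache.lucene.analysis.Analyzer;
import org.apache.lucene.analysis.standard.StandardAnalyzer;
import org.apache.lucene.document.Document;
import org.apache.lucene.index.DirectoryReader;
import org.apache.lucene.index.IndexReader;
import org.apache.lucene.queryparser.classic.QueryParser;
import org.apache.lucene.search.IndexSearcher;
import org.apache.lucene.search.Query;
import org.apache.lucene.search.ScoreDoc;
import org.apache.lucene.search.TopDocs;
import org.apache.lucene.store.Directory;
import org.apache.lucene.store.FSDirectory;

public class Searcher {

    public static void search(String indexPath, String searchStr) throws Exception {
        // try-with-resources closes the reader and the directory once the search is done
        try (Directory dir = FSDirectory.open(Paths.get(indexPath));
             IndexReader indexReader = DirectoryReader.open(dir)) {
            IndexSearcher indexSearcher = new IndexSearcher(indexReader);
            Analyzer analyzer = new StandardAnalyzer(); // same analyzer as used for indexing
            QueryParser parser = new QueryParser("content", analyzer); // search the "content" field by default
            Query query = parser.parse(searchStr);
            TopDocs td = indexSearcher.search(query, 10); // at most the 10 best hits
            for (ScoreDoc sc : td.scoreDocs) {
                Document doc = indexSearcher.doc(sc.doc);
                System.out.println(doc.get("tittle"));
                System.out.println(doc.get("path"));
            }
        }
    }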

    public static void main(String[] args) throws Exception {
        Searcher.search("G:\\工作\\luence\\index\\", "Hollywood");
    }

}
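
For readers new to search indexes, here is a toy sketch of the idea behind the two classes above: indexing builds a map from each term to the documents containing it (an inverted index), and searching looks a term up in that map. It uses only the JDK and is not how Lucene is implemented. ToyInvertedIndex and its methods are hypothetical names, the tokenizer is a crude stand-in for an analyzer such as StandardAnalyzer, and there is no scoring or query parsing.

```java
import java.util.ArrayList;
import java.util.Collections;
import java.util.List;
import java.util.Locale;
import java.util.Map;
import java.util.Set;
import java.util.TreeMap;
import java.util.TreeSet;

/** Toy inverted index: maps each lowercased term to the ids of the documents containing it. */
final class ToyInvertedIndex {

    private final Map<String, Set<Integer>> postings = new TreeMap<>();
    private final List<String> titles = new ArrayList<>();

    /** Adds a document and returns its id (ids are assigned in insertion order, starting at 0). */
    int addDocument(String title, String content) {
        int docId = titles.size();
        titles.add(title);
        for (String term : tokenize(content)) {
            postings.computeIfAbsent(term, t -> new TreeSet<>()).add(docId);
        }
        return docId;
    }

    /** Returns the titles of all documents containing the term, in insertion order. */
    List<String> search(String term) {
        List<String> hits = new ArrayList<>();
        String key = term.toLowerCase(Locale.ROOT);
        for (int docId : postings.getOrDefault(key, Collections.emptySet())) {
            hits.add(titles.get(docId));
        }
        return hits;
    }

    /** Splits on anything that is not a letter or digit, then lowercases each piece. */
    static List<String> tokenize(String text) {
        List<String> terms = new ArrayList<>();
        for (String raw : text.split("[^\\p{L}\\p{N}]+")) {
            if (!raw.isEmpty()) {
                terms.add(raw.toLowerCase(Locale.ROOT));
            }
        }
        return terms;
    }

    public static void main(String[] args) {
        ToyInvertedIndex index = new ToyInvertedIndex();
        index.addDocument("a.txt", "Hollywood studios release films");
        index.addDocument("b.txt", "Open source search libraries");
        index.addDocument("c.txt", "A film about hollywood");
        System.out.println(index.search("Hollywood")); // prints [a.txt, c.txt]
    }
}
```

Because both indexing and searching lowercase their input, "Hollywood" matches "hollywood". This is the same reason the Indexer and the Searcher above should use the same analyzer.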
