Hunspell介绍及试用
1、简介
Hunspell是一个为拥有多态和复杂组合词的语言所设计的拼写检查器,原本为匈牙利语设计。
Hunspell是一个自由软件,在GPL、LGPL和MPL三许可证下发行。
Hunspell对主要平台和编程语言都有接口和封装。Hunspell基于MySpell,并且与MySpell词典后端兼容。MySpell使用单字节字符编码,而Hunspell则可以使用Unicode UTF-8编码的词典。
2、以下应用程序使用Hunspell作为拼写检查器:
Mac OS X10.6 以及之后版本
Eclipse,使用Hunspell4Eclipse
Google Chrome,Google开发的一个网页浏览器
Evernote,笔记软件
LibreOffice和OpenOffice.org,开源办公组件
Mozilla Firefox和Thunderbird以及SeaMonkey
Opera,一个跨平台的网页浏览器
Scribus,桌面出版应用
Vim,一个文本编辑器
WPS Office,国产办公组件
3、使用docker镜像测试Hunspell的功能:
3.1查看可用字典
[root@host---- hunspell]# docker run --rm tmaier/hunspell -D
SEARCH PATH:
.::/usr/share/hunspell:/usr/share/myspell:/usr/share/myspell/dicts:/Library/Spelling:/root/.openoffice.org//user/wordbook:/root/.openoffice.org2/user/wordbook:/root/.openoffice.org2./user/w/lib/openoffice.org/basis3./share/dict/ooo:/opt/openoffice.org2./share/dict/ooo:/usr/lib/openoffice.org2./share/dict/ooo:/opt/openoffice.org2./share/dict/ooo:/usr/lib/openoffice.org2./shhare/dict/ooo:/opt/openoffice.org2./share/dict/ooo:/usr/lib/openoffice.org2./share/dict/ooo:/opt/openoffice.org2./share/dict/ooo:/usr/lib/openoffice.org2./share/dict/ooo
AVAILABLE DICTIONARIES (path is not mandatory for -d option):
/usr/share/hunspell/en_CA
/usr/share/hunspell/de_DE_comb
/usr/share/hunspell/en_ZA
/usr/share/hunspell/en_US
/usr/share/hunspell/en_GB
/usr/share/hunspell/en_AU
/usr/share/hunspell/de_CH
/usr/share/hunspell/de_DE_neu
/usr/share/hunspell/en_NZ
/usr/share/hunspell/de_AT
/usr/share/hunspell/default
LOADED DICTIONARY:
/usr/share/hunspell/default.aff
/usr/share/hunspell/default.dic
Hunspell 1.6.
3.2查看帮助信息
[root@host---- hunspell]# docker run --rm -v $(pwd):/workdir tmaier/hunspell -u3 -i utf- -d de_DE_neu,en_US,de_CH -p words -h
Usage: hunspell [OPTION]... [FILE]...
Check spelling of each FILE. Without FILE, check standard input. - check only first field in lines (delimiter = tabulator)
-a Ispell's pipe interface
--check-url check URLs, e-mail addresses and directory paths
--check-apostrophe check Unicode typographic apostrophe
-d d[,d2,...] use d (d2 etc.) dictionaries
-D show available dictionaries
-G print only correct words or lines
-h, --help display this help and exit
-H HTML input file format
-i enc input encoding
-l print misspelled words(只打印错误的单词)
-L print lines with misspelled words(打印错误单词所在行)
-m analyze the words of the input text
-n nroff/troff input file format
-O OpenDocument (ODF or Flat ODF) input file format
-p dict set dict custom dictionary
-r warn of the potential mistakes (rare words)
-P password set password for encrypted dictionaries
-s stem the words of the input text
-S suffix words of the input text
-t TeX/LaTeX input file format
-v, --version print version number
-vv print Ispell compatible version number
-w print misspelled words (= lines) from one word/line input.
-X XML input file format Example: hunspell -d en_US file.txt # interactive spelling
hunspell -i utf- file.txt # check UTF- encoded file
hunspell -l *.odt # print misspelled words of ODF files # Quick fix of ODF documents by personal dictionary creation # Make a reduced list from misspelled and unknown words: hunspell -l *.odt | sort | uniq >words # Delete misspelled words of the file by a text editor.
# Use this personal dictionary to fix the deleted words: hunspell -p words *.odt Bug reports: http://hunspell.github.io/
3.3检查某个文档的拼写(显示错误词所在行数及建议更改)原文:test1.TXT(链接:https://pan.baidu.com/s/17JRmtnebLblVsMG05CIm-w 密码:l3q9)
[root@host---- hunspell]# docker run --rm -v $(pwd):/workdir tmaier/hunspell -u3 -i utf- -d de_DE_neu,en_US,de_CH -p words test1.TXT
test1.TXT:: Locate: rans | Try: rand
test1.TXT:: Locate: wew | Try: woo
test1.TXT:: Locate: Sevenn | Try: Severn
test1.TXT:: Locate: cannt | Try: canny
test1.TXT:: Locate: Hmm | Try: Mm
test1.TXT:: Locate: Lele | Try: Lee
test1.TXT:: Locate: Lele | Try: Lee
test1.TXT:: Locate: Lele | Try: Lee
test1.TXT:: Locate: Lele | Try: Lee
test1.TXT:: Locate: Hmm | Try: Mm
test1.TXT:: Locate: Hmm | Try: Mm
test1.TXT:: Locate: ve | Try: be
test1.TXT:: Locate: ve | Try: be
test1.TXT:: Locate: ve | Try: be
test1.TXT:: Locate: Hmm | Try: Mm
test1.TXT:: Locate: ve | Try: be
test1.TXT:: Locate: hasn | Try: has
test1.TXT:: Locate: isn | Try: sin
test1.TXT:: Locate: ve | Try: be
test1.TXT:: Locate: ve | Try: be
test1.TXT:: Locate: Hmm | Try: Mm
test1.TXT:: Locate: Hmm | Try: Mm
test1.TXT:: Locate: wasn | Try: wans
test1.TXT:: Locate: isn | Try: sin
test1.TXT:: Locate: isn | Try: sin
test1.TXT:: Locate: vomeronasal | Try: astronomer
test1.TXT:: Locate: didn | Try: did
test1.TXT:: Locate: ve | Try: be
test1.TXT:: Locate: weren | Try: were
test1.TXT:: Locate: wasn | Try: wans
test1.TXT:: Locate: wouldn | Try: would
test1.TXT:: Locate: weren | Try: were
test1.TXT:: Locate: ve | Try: be
test1.TXT:: Locate: ve | Try: be
test1.TXT:: Locate: cefepime | Try: timepiece
test1.TXT:: Locate: amikacin | Try: Kamikaze
test1.TXT:: Locate: Mmm | Try: Mm
test1.TXT:: Locate: kuai | Try: Kauai
test1.TXT:: Locate: ve | Try: be
test1.TXT:: Locate: isn | Try: sin
test1.TXT:: Locate: ve | Try: be
test1.TXT:: Locate: aren | Try: earn
test1.TXT:: Locate: shouldn | Try: should
test1.TXT:: Locate: whould | Try: would
test1.TXT:: Locate: Hmm | Try: Mm
test1.TXT:: Locate: Hmm | Try: Mm
test1.TXT:: Locate: Hmm | Try: Mm
test1.TXT:: Locate: ve | Try: be
test1.TXT:: Locate: Hmm | Try: Mm
test1.TXT:: Locate: Hmm | Try: Mm
test1.TXT:: Locate: Uhh | Try: Shh
test1.TXT:: Locate: Chh | Try: Ch
test1.TXT:: Locate: Hmm | Try: Mm
test1.TXT:: Locate: isn | Try: sin
test1.TXT:: Locate: ve | Try: be
test1.TXT:: Locate: exfoliator | Try: defoliator
test1.TXT:: Locate: didn | Try: did
test1.TXT:: Locate: didn | Try: did
test1.TXT:: Locate: Hmm | Try: Mm
test1.TXT:: Locate: ve | Try: be
Hunspell介绍及试用的更多相关文章
- fasttext介绍和试用
http://fasttext.apachecn.org/cn/docs/v0.1.0/support.html fasttext介绍网站 https://github.com/facebookres ...
- firefox插件-HackBar介绍与试用
This toolbar will help you in testing sql injections, XSS holes and site security. It is NOT a tool ...
- Unity4.6新UI系统初探(uGUI)
一.引言 Unity终于在即将到来的4.6版本内集成了所见即所得的UI解决方案(视频).事实上从近几个版本开始,Unity就在为这套系统做技术扩展,以保证最终能实现较理想的UI系统.本文试图通过初步的 ...
- 打造自己博客(wordpress)的wap手机版本
这儿介绍我试用的两款插件:wordpress-mobile-edition和wp-t-wap.1.先说一下两者的区别. wordpress-mobile-edition插件使用后,可以用手机直接访问你 ...
- React中使用CSSTransitionGroup插件实现轮播图
动画效果,是一个页面上必不可少的功能,学习一个新的东西,当然就要学习,如何用新的东西,用它的方法去实现以前的东西啦.今天呢,我就在这里介绍一个试用react-addons-css-transition ...
- 『编程题全队』"Gugua"事务管理系统项目宣传文案
一.项目简介 1.项目简介 Gugua是为了解决有事务管理需要的人群的痛苦, 他们需要 一个便利和高效的个人和团体事务管理平台,但是现有的方案并没有很好地解决这些需求,我们有独特的办法是提供跨平台的软 ...
- javascript中 try catch用法
javascript中 try catch用法 投稿:hebedich 字体:[增加 减小] 类型:转载 时间:2015-08-16我要评论 JS try catch语句一般在什么情况下使用?是必须使 ...
- 活动助手Beta用户试用报告
用户试用报告 1.面向参与者用户 1.1 日常参加各类学习(水综测)活动中,有没有遇到以下问题: (1) 信息来源混乱,不知道靠不靠谱 (2) 每次报名都要重新填写自己的学号手机号,有时候填错了就没综 ...
- 达梦7的试用 与SQLSERVER的简单技术对比
达梦7的试用 与SQLSERVER的简单技术对比 达梦数据库公司推出了他们的数据库服务管理平台,可以在该平台使用达梦数据库而无须安装达梦7数据库 地址:http://online.dameng.com ...
随机推荐
- 编程菜鸟的日记-初学尝试编程-C++ Primer Plus 第5章编程练习4
#include <iostream>using namespace std;const MAXSIZE=12;int main(){ char *month[MAXSIZE]={&quo ...
- js之promise讲解
1 Promise概述 Promise对象是CommonJS工作组提出的一种规范,目的是为异步操作提供统一接口. 那么,什么是Promises? 首先,它是一个对象,也就是说与其他JavaScript ...
- (57)Wangdao.com第九天_JavaScript 对象的简介及分类
对象 引用数据类型. 对象是一种复合的数据类型,可以存储多个不同类型的变量. 以后我们看到的,只要不是String.Number.Boolean.Null.Undefined,那么就一定是 Objec ...
- 根据屏幕自适应宽度:@media
@media screen and (min-width: 1490px){ .w1224{ width: 1400px !important; }}@media screen and (max-wi ...
- Java根据链接生成二维码
Java根据链接生成二维码 相关 jar 包: core-3.1.0.jar 源码及 jar 包下载:http://files.cnblogs.com/files/liaolongjun/qrcode ...
- CentOS7安装MySQL并允许远程登录
下载 mysql 源安装包 $ curl -LO http://dev.mysql.com/get/mysql57-community-release-el7-11.noarch.rpm 安装 mys ...
- Linux系统的命令应该如何记?
Linux入门篇: 很多刚入门的同学,就像无头的苍蝇一样,到处找视频.书籍.网站帖子之类的学习方式,视频虽然讲得详细,但是时间的投入也是巨大的,播放时间,练习时间,加起来很吓人,其实啊很少有人能坚持把 ...
- linux CentOS 安装 nginx
官方网址:http://nginx.org/en/download.html 一.安装基础组件 yum install gcc-c++ yum install pcre yum install pcr ...
- /etc/inittab文件详解
/etc/inittab由/sbin/init程序解析调用,用于系统初始化,/sbin/init可参考源码busybox中init.c的实现方式. 原址如下: http://leejia.blog.5 ...
- STM32 内部flash的读写程序
/* Base address of the Flash sectors */ #define ADDR_FLASH_SECTOR_0 ((uint32_t)0x08000000) /* Base ...