Hunspell介绍及试用
1、简介
Hunspell是一个为拥有多态和复杂组合词的语言所设计的拼写检查器,原本为匈牙利语设计。
Hunspell是一个自由软件,在GPL、LGPL和MPL三许可证下发行。
Hunspell对主要平台和编程语言都有接口和封装。Hunspell基于MySpell,并且与MySpell词典后端兼容。MySpell使用单字节字符编码,而Hunspell则可以使用Unicode UTF-8编码的词典。
2、以下应用程序使用Hunspell作为拼写检查器:
Mac OS X10.6 以及之后版本
Eclipse,使用Hunspell4Eclipse
Google Chrome,Google开发的一个网页浏览器
Evernote,笔记软件
LibreOffice和OpenOffice.org,开源办公组件
Mozilla Firefox和Thunderbird以及SeaMonkey
Opera,一个跨平台的网页浏览器
Scribus,桌面出版应用
Vim,一个文本编辑器
WPS Office,国产办公组件
3、使用docker镜像测试Hunspell的功能:
3.1查看可用字典
[root@host---- hunspell]# docker run --rm tmaier/hunspell -D
SEARCH PATH:
.::/usr/share/hunspell:/usr/share/myspell:/usr/share/myspell/dicts:/Library/Spelling:/root/.openoffice.org//user/wordbook:/root/.openoffice.org2/user/wordbook:/root/.openoffice.org2./user/w/lib/openoffice.org/basis3./share/dict/ooo:/opt/openoffice.org2./share/dict/ooo:/usr/lib/openoffice.org2./share/dict/ooo:/opt/openoffice.org2./share/dict/ooo:/usr/lib/openoffice.org2./shhare/dict/ooo:/opt/openoffice.org2./share/dict/ooo:/usr/lib/openoffice.org2./share/dict/ooo:/opt/openoffice.org2./share/dict/ooo:/usr/lib/openoffice.org2./share/dict/ooo
AVAILABLE DICTIONARIES (path is not mandatory for -d option):
/usr/share/hunspell/en_CA
/usr/share/hunspell/de_DE_comb
/usr/share/hunspell/en_ZA
/usr/share/hunspell/en_US
/usr/share/hunspell/en_GB
/usr/share/hunspell/en_AU
/usr/share/hunspell/de_CH
/usr/share/hunspell/de_DE_neu
/usr/share/hunspell/en_NZ
/usr/share/hunspell/de_AT
/usr/share/hunspell/default
LOADED DICTIONARY:
/usr/share/hunspell/default.aff
/usr/share/hunspell/default.dic
Hunspell 1.6.
3.2查看帮助信息
[root@host---- hunspell]# docker run --rm -v $(pwd):/workdir tmaier/hunspell -u3 -i utf- -d de_DE_neu,en_US,de_CH -p words -h
Usage: hunspell [OPTION]... [FILE]...
Check spelling of each FILE. Without FILE, check standard input. - check only first field in lines (delimiter = tabulator)
-a Ispell's pipe interface
--check-url check URLs, e-mail addresses and directory paths
--check-apostrophe check Unicode typographic apostrophe
-d d[,d2,...] use d (d2 etc.) dictionaries
-D show available dictionaries
-G print only correct words or lines
-h, --help display this help and exit
-H HTML input file format
-i enc input encoding
-l print misspelled words(只打印错误的单词)
-L print lines with misspelled words(打印错误单词所在行)
-m analyze the words of the input text
-n nroff/troff input file format
-O OpenDocument (ODF or Flat ODF) input file format
-p dict set dict custom dictionary
-r warn of the potential mistakes (rare words)
-P password set password for encrypted dictionaries
-s stem the words of the input text
-S suffix words of the input text
-t TeX/LaTeX input file format
-v, --version print version number
-vv print Ispell compatible version number
-w print misspelled words (= lines) from one word/line input.
-X XML input file format Example: hunspell -d en_US file.txt # interactive spelling
hunspell -i utf- file.txt # check UTF- encoded file
hunspell -l *.odt # print misspelled words of ODF files # Quick fix of ODF documents by personal dictionary creation # Make a reduced list from misspelled and unknown words: hunspell -l *.odt | sort | uniq >words # Delete misspelled words of the file by a text editor.
# Use this personal dictionary to fix the deleted words: hunspell -p words *.odt Bug reports: http://hunspell.github.io/
3.3检查某个文档的拼写(显示错误词所在行数及建议更改)原文:test1.TXT(链接:https://pan.baidu.com/s/17JRmtnebLblVsMG05CIm-w 密码:l3q9)
[root@host---- hunspell]# docker run --rm -v $(pwd):/workdir tmaier/hunspell -u3 -i utf- -d de_DE_neu,en_US,de_CH -p words test1.TXT
test1.TXT:: Locate: rans | Try: rand
test1.TXT:: Locate: wew | Try: woo
test1.TXT:: Locate: Sevenn | Try: Severn
test1.TXT:: Locate: cannt | Try: canny
test1.TXT:: Locate: Hmm | Try: Mm
test1.TXT:: Locate: Lele | Try: Lee
test1.TXT:: Locate: Lele | Try: Lee
test1.TXT:: Locate: Lele | Try: Lee
test1.TXT:: Locate: Lele | Try: Lee
test1.TXT:: Locate: Hmm | Try: Mm
test1.TXT:: Locate: Hmm | Try: Mm
test1.TXT:: Locate: ve | Try: be
test1.TXT:: Locate: ve | Try: be
test1.TXT:: Locate: ve | Try: be
test1.TXT:: Locate: Hmm | Try: Mm
test1.TXT:: Locate: ve | Try: be
test1.TXT:: Locate: hasn | Try: has
test1.TXT:: Locate: isn | Try: sin
test1.TXT:: Locate: ve | Try: be
test1.TXT:: Locate: ve | Try: be
test1.TXT:: Locate: Hmm | Try: Mm
test1.TXT:: Locate: Hmm | Try: Mm
test1.TXT:: Locate: wasn | Try: wans
test1.TXT:: Locate: isn | Try: sin
test1.TXT:: Locate: isn | Try: sin
test1.TXT:: Locate: vomeronasal | Try: astronomer
test1.TXT:: Locate: didn | Try: did
test1.TXT:: Locate: ve | Try: be
test1.TXT:: Locate: weren | Try: were
test1.TXT:: Locate: wasn | Try: wans
test1.TXT:: Locate: wouldn | Try: would
test1.TXT:: Locate: weren | Try: were
test1.TXT:: Locate: ve | Try: be
test1.TXT:: Locate: ve | Try: be
test1.TXT:: Locate: cefepime | Try: timepiece
test1.TXT:: Locate: amikacin | Try: Kamikaze
test1.TXT:: Locate: Mmm | Try: Mm
test1.TXT:: Locate: kuai | Try: Kauai
test1.TXT:: Locate: ve | Try: be
test1.TXT:: Locate: isn | Try: sin
test1.TXT:: Locate: ve | Try: be
test1.TXT:: Locate: aren | Try: earn
test1.TXT:: Locate: shouldn | Try: should
test1.TXT:: Locate: whould | Try: would
test1.TXT:: Locate: Hmm | Try: Mm
test1.TXT:: Locate: Hmm | Try: Mm
test1.TXT:: Locate: Hmm | Try: Mm
test1.TXT:: Locate: ve | Try: be
test1.TXT:: Locate: Hmm | Try: Mm
test1.TXT:: Locate: Hmm | Try: Mm
test1.TXT:: Locate: Uhh | Try: Shh
test1.TXT:: Locate: Chh | Try: Ch
test1.TXT:: Locate: Hmm | Try: Mm
test1.TXT:: Locate: isn | Try: sin
test1.TXT:: Locate: ve | Try: be
test1.TXT:: Locate: exfoliator | Try: defoliator
test1.TXT:: Locate: didn | Try: did
test1.TXT:: Locate: didn | Try: did
test1.TXT:: Locate: Hmm | Try: Mm
test1.TXT:: Locate: ve | Try: be
Hunspell介绍及试用的更多相关文章
- fasttext介绍和试用
http://fasttext.apachecn.org/cn/docs/v0.1.0/support.html fasttext介绍网站 https://github.com/facebookres ...
- firefox插件-HackBar介绍与试用
This toolbar will help you in testing sql injections, XSS holes and site security. It is NOT a tool ...
- Unity4.6新UI系统初探(uGUI)
一.引言 Unity终于在即将到来的4.6版本内集成了所见即所得的UI解决方案(视频).事实上从近几个版本开始,Unity就在为这套系统做技术扩展,以保证最终能实现较理想的UI系统.本文试图通过初步的 ...
- 打造自己博客(wordpress)的wap手机版本
这儿介绍我试用的两款插件:wordpress-mobile-edition和wp-t-wap.1.先说一下两者的区别. wordpress-mobile-edition插件使用后,可以用手机直接访问你 ...
- React中使用CSSTransitionGroup插件实现轮播图
动画效果,是一个页面上必不可少的功能,学习一个新的东西,当然就要学习,如何用新的东西,用它的方法去实现以前的东西啦.今天呢,我就在这里介绍一个试用react-addons-css-transition ...
- 『编程题全队』"Gugua"事务管理系统项目宣传文案
一.项目简介 1.项目简介 Gugua是为了解决有事务管理需要的人群的痛苦, 他们需要 一个便利和高效的个人和团体事务管理平台,但是现有的方案并没有很好地解决这些需求,我们有独特的办法是提供跨平台的软 ...
- javascript中 try catch用法
javascript中 try catch用法 投稿:hebedich 字体:[增加 减小] 类型:转载 时间:2015-08-16我要评论 JS try catch语句一般在什么情况下使用?是必须使 ...
- 活动助手Beta用户试用报告
用户试用报告 1.面向参与者用户 1.1 日常参加各类学习(水综测)活动中,有没有遇到以下问题: (1) 信息来源混乱,不知道靠不靠谱 (2) 每次报名都要重新填写自己的学号手机号,有时候填错了就没综 ...
- 达梦7的试用 与SQLSERVER的简单技术对比
达梦7的试用 与SQLSERVER的简单技术对比 达梦数据库公司推出了他们的数据库服务管理平台,可以在该平台使用达梦数据库而无须安装达梦7数据库 地址:http://online.dameng.com ...
随机推荐
- centos7 yum安装LAMP
说明:我安装后的版本号分别是: apache : Apache/2.4.6 (CentOS)mysql:5.6.42php:5.6.39 一.配置网络. 我们首先需要让我们的虚拟机能够连接上外网,这样 ...
- day13_DOM
一,document可以获得HTML页面全部内容, 1.①document.getElementById:获取全部id标签,②document.getElementById("i1" ...
- JDBC API 事务的实践
使用了持久化框架几乎没有使用过原生的jdbc API ,发现原来使用jdbc API来实现事务也是很简单的. 数据库的链接connection具有一个属性autocommit,这个属性默认是true, ...
- js判断设备是否为安卓
var u = navigator.userAgent; var isAndroid = u.indexOf('Android') > -1 || u.indexOf('Adr') > - ...
- mysql中各种日期数据类型及其所占用的空间
DATETIME,8字节: DATE,3字节: TIMESTAMP,4字节: YEAR,1字节: TIME,3字节:
- Python 学习笔记5 变量-列表
列表是python常用的一种变量. 是由一些列按照特定顺序排列的元素组成的.你可以创建包含字母表中的所有字母,数字.可以将任何东西都加入到列表中. 通常情况下,列表中都包含多个元素,所以建议变量的名称 ...
- RoR - Nested Resources, Security ,pagination
root to: 'xxx' 默认root路径 Nested Resource: Rails.application.routes.draw do resources :books do resou ...
- table-一列细分为多列(合并单元格)
这个是一个很常见的一个表格展示需求,其中最要的就两个属性 rowspan 和 colspan .他们分别就是合并行 与 合并列. 要做的效果是如图下面这个,联系电话就是合并了单元格.这个说法类似于ex ...
- 洛谷P4456 交错序列[CQOI2018] dp+数论
正解:dp 解题报告: 传送门 首先可以先拆下这个贡献式,为了方便之后设状态什么的,把式子转成和ny有关,就成了 \(\sum \left ( n-i \right )^{a}\cdot i^{b}\ ...
- Spring Boot Thmeleaf的语法 day04
一.Thmeleaf语法的使用 html格式的页面放在classpath:/templates/就会自动渲染. 1.在pom.xml导入 <dependency> <groupId& ...