Flesch Reading Ease
Time Limit: 1000MS   Memory Limit: 65536K
Total Submissions: 2269   Accepted: 710

Description

Flesch Reading Ease, a readability test named after its deviser Rudolf Flesch, is among most ubiquitously used readability tests, which are principally employed for assessment of the difficulty to understand a reading passage written in English. The Flesch Reading Ease score of a passage relies solely on three statistics, namely the total numbers of sentences, words and syllables, of the passage. Specifically, the score is defined by the following formula:

.

As can be inferred from the above formula, a passage with a high Flesch Reading Ease score tends to favor shorter sentences and words, which is in compliance with commonsense in spite of partial accuracy. (Think of, for instance, the word "television". Long as it may seem, it is indeed one of the first words that any individual who studies English learns.) A related Wikipedia entry on Flesch Reading Ease [1] suggests that passages scoring 90~100 are comprehensible for an average American 5th grader, and 8th and 9th graders possess the ability to follow passages with a score in the range of 60~70, whereas passages not exceeding 30 in the score are best suitable for college graduates. The text of this problem, all sections taken into account, scores roughly 50 as per the calculation of Google Documents.

Despite the simplicity in its ideas, several aspects of its definition remains vague for any real-world implementation of Flesch Reading Ease. For the sake of precision and uniformity, the following restrictions adapted from [2] are adopted for this problem, to which you are to write a solution that effectively computes the Flesch Reading Ease score of a given passage of English text.

  1. Periods, explanation points, colons and semicolons serve as sentence delimiters.
  2. Each group of continuous non-blank characters with beginning and ending punctuation removed counts as a word.
  3. Each vowel (one of a, e, i, o, u and y) in a word is considered one syllable subject to that
    1. -es, -ed and -e (except -le) endings are ignored,
    2. words of three letters or shorter count as single syllables,
    3. consecutive vowels count as one syllable.

References

  1. Wikipedia contributors. Flesch-Kincaid Readability Test. Wikipedia, The Free Encyclopedia. August 30, 2007, 01:57 UTC. Available at: http://en.wikipedia.org/w/index.php?title=Flesch-Kincaid_Readability_Test&oldid=154509512. Accessed September 5, 2007.
  2. Talburt, J. 1985. The Flesch index: An easily programmable readability analysis algorithm. In Proceedings of the 4th Annual international Conference on Systems Documentation. SIGDOC '85. ACM Press, New York, NY, 114-122.

Input

The input contains a passage in English whose Flesch Reading Ease score is to be computed. Only letters of the English alphabet (both lowercase and uppercase), common punctuation marks (periods, question and exclamation marks, colons, semicolons as well as commas, quotation marks, hyphens and apostrophes), and spaces appear in the passage. The passage is of indefinite length and possibly occupies multiple lines. Additionally, it is guaranteed to be correct in punctuation. 
 

Output

Output the Flesch Reading Ease score of the given passage rounded to two
digits beyond decimal point. 
 

Sample Input

Flesch Reading Ease, a readability test named after its deviser Rudolf Flesch,
is among most ubiquitously used readability tests, which are principally
employed for assessment of the difficulty to understand a reading passage
written in English. The Flesch Reading Ease score of a passage relies solely
on three statistics, namely the total numbers of sentences, words and
syllables, of the passage.

Sample Output

26.09

Source

大致题意:
给出一篇规范的文章,求其 句子数、单词数 和 音节数把这3个值代入题目给出的公式,输出其结果,保留2位小数。
  标记单词分隔符: 逗号(,) 和 空格( )
  句子分隔符:句号(.) 问号(?) 冒号(:) 分号(;) 感叹号(!)
  音节处理要求:
  (1)当单词总长度<=3时,音节数无条件+1
  (2)
当单词总长度>3时,单词中每出现一个元音字母(a、e、i、o、u、y),音节数+1,但是连续的(>=2)元音字母只按1个音节计算,且当单词后缀为-es、-ed和-e时,后缀的元  音字母e不列为音节数计算。但是后缀-le例外,要计算音节数。

解题思路:

注意:
由于用while(cin>>msg)输入文章,因此是按 空字符
把文章分开若干片段,直到出现EOF时才结束输入,因此msg中的单词分隔符不会出现空格,只要当msg最后一个字符为字母时,就说明此时的单词分隔符为空格。

音节数是最难处理的,其规律如下:
(1) 当单词总长度<=3时,音节数无条件+1
(2)
当单词总长度>3时,单词中每出现一个元音字母(a、e、i、o、u、y),音节数+1,但是连续的元音字母只按1个音节计算,且当单词后缀为-es、-ed和-e时,后缀的元音字母e不列为音节数计算。但是后缀-le例外,要计算音节数。

注意:
(1)元音字母要判断12个,6个小写,6个大写。

(2)输入的文章每个字符只能扫描一次,若重复扫描会超时。

(3)G++用%.2f

; C++用%.2lf

AC代码+详细注释:

#include<cstdio>
#include<iostream>
using namespace std;
char msg[];
int word=;//单词数
int sentance=;//句子数
int syllable=;//音标数
bool isalpha(char ch){//检查字符ch是否为字母
if(ch>='A'&&ch<='Z') return ;
if(ch>='a'&&ch<='z') return ;
return ;
}
bool isvowel(char ch){//检查字符ch是否为元音字母
if(ch=='a'||ch=='e'||ch=='i'||ch=='o'||ch=='u'||ch=='y') return ;
if(ch=='A'||ch=='E'||ch=='I'||ch=='O'||ch=='U'||ch=='Y') return ;
return ;
}
bool isword(char ch){//检查字符ch是否为单词分隔符
if(ch==',')
return ;
return ;
}
bool issentance(char ch){//检查字符ch是否为句子分隔符
if(ch=='.'||ch=='?' ||ch==':'||ch==';'||ch=='!') return ;
return ;
}
int main(){
while(cin>>msg){//以空格为标记,截取文章片段
int wordlen=;
bool flag_frevowel=;//当当前字符为元音时,检查前一字符是否为元音的标记
int syl=;//假设当前单词长度>3时,记录音节数的变化量。若单词实际长度<=3,则syllable-syl
int i;
for(i=;msg[i];i++){
if(isalpha(msg[i])){//当前字符为 字母
wordlen++;//当前所处理的单词的已知长度 (已知长度<=实际长度)
if(wordlen<=){//当 已知长度<=3 时
if(!isalpha(msg[i+])){//检查单词实际长度是否<=3
syllable++;//当实际长度<=3时,syllable无条件+1
syllable-=syl;//实际音节数调整,单词实际长度<=3,则syllable减去 "假设单词长度>3时" 音节数的变化量syl
syl=;
continue;
}
}
if(isvowel(msg[i])){//当前字母为 元音字母
if(msg[i]=='e'){
if(!isalpha(msg[i+])&&msg[i-]=='l'){//-le
syllable++;
syl++;//由于不知道单词的实际长度,因此总音节数syllable与音节数变化量syl同时递增
continue;
}
else if(!isalpha(msg[i+])) continue;// -e
else if((msg[i+]=='d'||msg[i+]=='s')&&!isalpha(msg[i+])) continue;// -ed -es
}
/*处理连续或单个元音*/
if(!flag_frevowel){//当前字母为元音,但前一字符不是元音
flag_frevowel=;
syllable++;
syl++;
continue;
}
else continue;//当前字母为元音,但前一字母也是元音,即出现连续元音,syllable不计数
}
flag_frevowel=;//当前字母不是元音
}
else if(isword(msg[i])){//当前字符为 单词分隔符
flag_frevowel=;
wordlen=;//当前单词操作已结束,长度清零,计算下一单词
syl=;
word++;
}
else if(issentance(msg[i])){//当前字符为 句子分隔符
flag_frevowel=;
wordlen=;//当前单词操作已结束,长度清零,计算下一单词
word++;
syl=;
sentance++;
}
}
if(isalpha(msg[i-])) word++;//当前文章片段最后一个字符为 字母
}
printf("%.2lf",206.835-1.015*(double)word/(double)sentance-84.6*(double)syllable/(double)word);
return ;
}

poj3371的更多相关文章

  1. Flesch Reading Ease -POJ3371模拟

    Flesch Reading Ease Time Limit: 1000MS Memory Limit: 65536K Description Flesch Reading Ease, a reada ...

  2. poj分类 很好很有层次感。

    初期: 一.基本算法:      (1)枚举. (poj1753,poj2965)      (2)贪心(poj1328,poj2109,poj2586)      (3)递归和分治法.      ( ...

  3. 【转】POJ题目分类推荐 (很好很有层次感)

    OJ上的一些水题(可用来练手和增加自信) (poj3299,poj2159,poj2739,poj1083,poj2262,poj1503,poj3006,poj2255,poj3094)初期: 一. ...

  4. 【转】ACM训练计划

    [转] POJ推荐50题以及ACM训练方案 -- : 转载自 wade_wang 最终编辑 000lzl POJ 推荐50题 第一类 动态规划(至少6题, 和 必做) 和 (可贪心) (稍难) 第二类 ...

  5. POJ 题目分类(转载)

    Log 2016-3-21 网上找的POJ分类,来源已经不清楚了.百度能百度到一大把.贴一份在博客上,鞭策自己刷题,不能偷懒!! 初期: 一.基本算法: (1)枚举. (poj1753,poj2965 ...

  6. (转)POJ题目分类

    初期:一.基本算法:     (1)枚举. (poj1753,poj2965)     (2)贪心(poj1328,poj2109,poj2586)     (3)递归和分治法.     (4)递推. ...

  7. acm常见算法及例题

    转自:http://blog.csdn.net/hengjie2009/article/details/7540135 acm常见算法及例题  初期:一.基本算法:     (1)枚举. (poj17 ...

  8. poj分类

    初期: 一.基本算法:      (1)枚举. (poj1753,poj2965)      (2)贪心(poj1328,poj2109,poj2586)      (3)递归和分治法.      ( ...

  9. 转载 ACM训练计划

    leetcode代码 利用堆栈:http://oj.leetcode.com/problems/evaluate-reverse-polish-notation/http://oj.leetcode. ...

随机推荐

  1. ArcGIS制图之Sub Points点抽稀

    简介 Sub Points工具是 Esri 中国自主开发的一个插件,该工具优先考虑点在空间分布上的均匀合理性,并结合点数据中包含的 "优先级" 属性进行筛选.通过获取每个点在一定范 ...

  2. SharePoint 门户添加内网域名

    原理:在DNS服务器上,添加一条SharePoint门户所在主机的别名,当我们在浏览器里访问这个别名的时候,会自动到Dns去解析,解析出来这台主机,从而访问到我们的SharePoint门户. 1.打开 ...

  3. 通过源码理解UST(用户栈回溯)

    UST原理:如果gflags标志中包含了UST标志,堆管理器会为当前进程分配一块内存,这个内存区域就是UST数据库(user-mode stack trace database),并建立一个STACK ...

  4. Sharepoint学习笔记—习题系列--70-573习题解析 -(Q121-Q124)

    Question 121You develop a custom approval workflow. The workflow uses the CreateTask class to assign ...

  5. Docker: 解决Docker无法在电信网络中访问外网

    在电信网络中,Docker在build和run时会无法访问外网,原因是docker的默认dns地址是8.8.8.8,由于众所周知的原因,我们需要改写这个地址,方法如下: 修改/etc/sysconfi ...

  6. [leetcode] Contains Duplicate

    Contains Duplicate Given an array of integers, find if the array contains any duplicates. Your funct ...

  7. 理解Lucene索引与搜索过程中的核心类

    理解索引过程中的核心类 执行简单索引的时候需要用的类有: IndexWriter.ƒDirectory.ƒAnalyzer.ƒDocument.ƒField 1.IndexWriter IndexWr ...

  8. C安全编码--预处理

    建议和规则 建议: 用内联函数或静态函数代替与函数相似的宏 在宏参数名两边加上括号 宏替换列表应该加上括号 应该使用typedef定义编码类型 不要复用标准头文件名 理解连接标记或执行字符串化时的宏替 ...

  9. App开发流程之通用宏定义及头文件

    工欲善其事,必先利其器. 在正式实现各种炫酷的功能和UI前,做好准备工作是提高后续开发效率的必经之路. 所以,这个系列,我不是在各种堆技术,更关注的是“兵马动”之前的“粮草行”,有些繁琐,但当清晰理出 ...

  10. unity下载文件三(http异步下载)

    异步下载,顾名思义就是不影响你主线程使用客户端的时候,人家在后台搞你的明堂. 直接入主题,既然要下载,首先得请求,请求成功之后进行回调,这就是一个异步过程,异步回调的时间不可控. 1.首先请求下载. ...