BNC Part-of-speech codes
Extracted from the BNC Manual
- AJ0
- adjective (general or positive) e.g. good, old
- AJC
- comparative adjective e.g. better, older
- AJS
- superlative adjective, e.g. best, oldest
- AT0
- article, e.g. the, a, an, no . Note the inclusion of no: articles are defined as determiners which typically begin a noun phrase but cannot appear as its head.
- AV0
- adverb (general, not sub-classified as AVP or AVQ), e.g. often, well, longer, furthest. Note that adverbs, unlike adjectives, are not tagged as positive, comparative, or superlative. This is because of the relative rarity of comparative or superlative forms.
- AVP
- adverb particle, e.g. up, off, out. This tag is used for all prepositional adverbs, whether or not they are used idiomatically in phrasal verbs such as
Come out here
, orI can't hold out any longer
. - AVQ
- wh-adverb, e.g. when, how, why. The same tag is used whether the word is used interrogatively or to introduce a relative clause.
- CJC
- coordinating conjunction, e.g. and, or, but.
- CJS
- subordinating conjunction, e.g. although, when.
- CJT
- the subordinating conjunction that, when introducing a relative clause, as in
the day that follows Christmas
. Some theories treat that here as a relative pronoun; others as a conjunction. We have adopted the latter analysis. - CRD
- cardinal numeral, e.g. one, 3, fifty-five, 6609.
- DPS
- possessive determiner form, e.g. your, their, his.
- DT0
- general determiner: a determiner which is not a DTQ e.g. this both in
This is my house
andThis house is mine
. A determiner is defined as a word which typically occurs either as the first word in a noun phrase, or as the head of a noun phrase. - DTQ
- wh-determiner, e.g. which, what, whose, which. The same tag is used whether the word is used interrogatively or to introduce a relative clause.
- EX0
- existential there, the word thereappearing in the constructions
there is...
,there are ...
. - ITJ
- interjection or other isolate, e.g. oh, yes, mhm, wow.
- NN0
- common noun, neutral for number, e.g. aircraft, data, committee. Singular collective nouns such as committee take this tag on the grounds that they can be followed by either a singular or a plural verb.
- NN1
- singular common noun, e.g. pencil, goose, time, revelation.
- NN2
- plural common noun, e.g. pencils, geese, times, revelations.
- NP0
- proper noun, e.g. London, Michael, Mars, IBM. Note that no distinction is made for number in the case of proper nouns, since plural proper names are a comparative rarity.
- ORD
- ordinal numeral, e.g. first, sixth, 77th, next, last. No distinction is made between ordinals used in nominal and adverbial roles. next and last are included in this category, as general ordinals.
- PNI
- indefinite pronoun, e.g. none, everything, one (pronoun), nobody. This tag is applied to words which always function as heads of noun phrases. Words like some and these, which can also occur before a noun head in an article-like function, are tagged as determiners, DT0 or AT0.
- PNP
- personal pronoun, e.g. I, you, them, ours. Note that possessive pronouns such as ours and theirs are included in this category.
- PNQ
- wh-pronoun, e.g. who, whoever, whom. The same tag is used whether the word is used interrogatively or to introduce a relative clause.
- PNX
- reflexive pronoun, e.g. myself, yourself, itself, ourselves.
- POS
- the possessive or genitive marker 's or '. Note that this marker is tagged as a distinct word. For example,
Peter's or someone else's
is tagged Peter's or someone else's ]]> - PRF
- the preposition of. This word has a special tag of its own, because of its high frequency and its almost exclusively postnominal function.
- PRP
- preposition, other than of, e.g. about, at, in, on behalf of, with. Note that prepositional phrases like on behalf of or in spite of are treated as single words.
- TO0
- the infinitive marker to.
- UNC
unclassified
items which are not appropriately classified as items of the English lexicon. Examples include foreign (non-English) words; special typographical symbols; formulae; hesitation fillers such as errm in spoken language.- VBB
- the present tense forms of the verb be, except for is or 's am, are 'm, 're, be (subjunctive or imperative), ai (as in ain't).
- VBD
- the past tense forms of the verb be, was, were.
- VBG
- -ing form of the verb be, being.
- VBI
- the infinitive form of the verb be, be.
- VBN
- the past participle form of the verb be, been
- VBZ
- the -s form of the verb be, is, 's.
- VDB
- the finite base form of the verb do, do.
- VDD
- the past tense form of the verb do, did.
- VDG
- the -ing form of the verb do, doing.
- VDI
- the infinitive form of the verb do, do.
- VDN
- the past participle form of the verb do, done.
- VDZ
- the -s form of the verb do, does.
- VHB
- the finite base form of the verb have, have, 've.
- VHD
- the past tense form of the verb have, had, 'd.
- VHG
- the -ing form of the verb have, having.
- VHI
- the infinitive form of the verb have, have.
- VHN
- the past participle form of the verb have, had.
- VHZ
- the -s form of the verb have, has, 's.
- VM0
- modal auxiliary verb, e.g. can, could, will, 'll, 'd, wo (as in won't)
- VVB
- the finite base form of lexical verbs, e.g. forget, send, live, return. This tag is used for imperatives and the present subjunctive forms, but not for the infinitive (VVI).
- VVD
- the past tense form of lexical verbs, e.g. forgot, sent, lived, returned.
- VVG
- the -ing form of lexical verbs, e.g. forgetting, sending, living, returning.
- VVI
- the infinitive form of lexical verbs , e.g. forget, send, live, return.
- VVN
- the past participle form of lexical verbs, e.g. forgotten, sent, lived, returned.
- VVZ
- the -s form of lexical verbs, e.g. forgets, sends, lives, returns.
- XX0
- the negative particle not or n't.
- ZZ0
- alphabetical symbols, e.g. A, a, B, b, c, d.
The following portmanteau tags are used to indicate where the CLAWS system has indicated an uncertainty between two possible analyses:
- AJ0-AV0
- adjective or adverb
- AJ0-NN1
- adjective or singular common noun
- AJ0-VVD
- adjective or past tense verb
- AJ0-VVG
- adjective or -ing form of the verb
- AJ0-VVN
- adjective or past participle
- AVP-PRP
- adverb particle or preposition
- AVQ-CJS
- wh-adverb or subordinating conjunction
- CJS-PRP
- subordinating conjunction or preposition
- CJT-DT0
- that as conjunction or determiner
- CRD-PNI
- one as number or pronoun
- NN1-NP0
- singular common noun or proper noun
- NN1-VVB
- singular common noun or base verb form
- NN1-VVG
- singular common noun or -ing form of the verb
- NN2-VVZ
- plural noun or -s form of lexical verb
- VVD-VVN
- past tense verb or past participle
The following codes are used with c elements only:
- PUL
- left bracket (i.e. ( or [ )
- PUN
- any mark of separation ( . ! , : ; - ? ... )
- PUQ
- quotation mark ( ` ' `` '' )
- PUR
- right bracket (i.e. ) or ] )
Note that some punctuation marks (notably long dashes and ellipses) are not tagged as such in the corpus, but appear simply as entity references.
BNC Part-of-speech codes的更多相关文章
- Labels & Codes
Labels & Codes List of Codes Adjectives Nouns Verbs Other labels Adjectives adjective A word th ...
- UVA-146 ID Codes
It is 2084 and the year of Big Brother has finally arrived, albeit a century late. In order to exerc ...
- Lattice Codes
最近在做的一些关于lattice codes的工作,想记录下来. 首先,我认为lattice coding是一种联合编码调制技术,将消息序列映射到星座点.其中一个良好的性质是lattice point ...
- How to make a not-so-boring speech?
For almost 26 years, even a trivial boy like me, have made over 100 and listened uncountable speeche ...
- System Error Codes
很明显,以下的文字来自微软MSDN 链接http://msdn.microsoft.com/en-us/library/windows/desktop/ms681382(v=vs.85).aspx M ...
- Windows Locale Codes - Sortable list(具体一个语言里还可具体细分,中国是2052,法国是1036)
Windows Locale Codes - Sortable list NOTE: Code page is an outdated method for character encoding, y ...
- Bar codes in NetSuite Saved Searches(transport/reprint)
THIS IS A COPY FROM BLOG Ways of incorporating Bar Codes into your Netsuite Saved Searches. Code ...
- Secret Codes
Secret Codes This is a list of codes that can be entered into the dialer to output the listed info ...
- Disabling default console handler in Java Logger by codes
The open source packages usu. relies on log4j or Java Logger to print logs, by default the console h ...
随机推荐
- 2.NoSQL之Redis配置与优化
一.关系型数据库与非关系数据库 关系型数据库: 关系型数据库是一个结构化的数据库,创建在关系模型(二维表格模型)基础上,一般面向于记录. sQL语句(标准数据查询语言)就是一种基于关系型数据库的语言, ...
- 6.文本三剑客之sed
文本三剑客之sed 目录 文本三剑客之sed sed编辑器 sed概述 sed工作流程 sed用法 sed打印 sed删除 sed替换 sed增加行内容 sed剪切粘贴与复制粘贴 sed字符/字符串交 ...
- C语言学习之我见-strncpy()字符串复制函数(可控制范围)
strncpy()函数,用于两个字符串值的复制. (1)函数原型 char *strncpy(char * _Dest,const char * _Source,size_t _Count); (2) ...
- 关于vue打包上线遇到的坑
打包上线经常会经常遇到路径找不到,页面空白,那么下面我们就解决一下. 第一步.先找到config目录的index.js 改成如上图红框标注所示 第二步.找到build下的utils.js文件 加上如上 ...
- JavaScript产生随机颜色
//获取rgb类型的颜色 IE7不支持 function randomColor(){ var r = Math.floor(Math.random()*256); var g = Math.floo ...
- JavaScript扩展原型链浅析
前言 上文对原型和原型链做了一些简单的概念介绍和解析,本文将浅析一些原型链的扩展. javaScript原型和原型链 http://lewyon.xyz/prototype.html 扩展原型链 使用 ...
- 网络通讯之Socket-Tcp(一)
网络通讯之Socket-Tcp 分成3部分讲解: 网络通讯之Socket-Tcp(一): 1.如何理解Socket 2.Socket通信重要函数 网络通讯之Socket-Tcp(二): 1.简单So ...
- scrapy框架入门
scrapy迄今为止依然是世界上最好用,最稳定的爬虫框架,相比于其他直接由函数定义的程序, scrapy使用了面向对象并对网页请求的过程分成了很多个模块和阶段,实现跨模块和包的使用,大大提升了代码的稳 ...
- 使用高斯Redis实现二级索引
摘要:高斯Redis 搭建业务二级索引,低成本,高性能,实现性能与成本的双赢. 本文分享自华为云社区<华为云GaussDB(for Redis)揭秘第21期:使用高斯Redis实现二级索引> ...
- 虚拟机启动时报’A start job is running for /etc/rc.local .. Compatibility错误。
虚拟机启动时报'A start job is running for /etc/rc.local .. Compatibility错误. 问题已经存在很长时间了,但是不影响ssh登录,遂置之未理. 经 ...