BNC Part-of-speech codes
Extracted from the BNC Manual
- AJ0
- adjective (general or positive) e.g. good, old
- AJC
- comparative adjective e.g. better, older
- AJS
- superlative adjective, e.g. best, oldest
- AT0
- article, e.g. the, a, an, no. Note the inclusion of no: articles are defined as determiners which typically begin a noun phrase but cannot appear as its head.
- AV0
- adverb (general, not sub-classified as AVP or AVQ), e.g. often, well, longer, furthest. Note that adverbs, unlike adjectives, are not tagged as positive, comparative, or superlative. This is because of the relative rarity of comparative or superlative forms.
- AVP
- adverb particle, e.g. up, off, out. This tag is used for all prepositional adverbs, whether or not they are used idiomatically in phrasal verbs such as "Come out here", or "I can't hold out any longer".
- AVQ
- wh-adverb, e.g. when, how, why. The same tag is used whether the word is used interrogatively or to introduce a relative clause.
- CJC
- coordinating conjunction, e.g. and, or, but.
- CJS
- subordinating conjunction, e.g. although, when.
- CJT
- the subordinating conjunction that, when introducing a relative clause, as in "the day that follows Christmas". Some theories treat that here as a relative pronoun; others as a conjunction. We have adopted the latter analysis.
- CRD
- cardinal numeral, e.g. one, 3, fifty-five, 6609.
- DPS
- possessive determiner form, e.g. your, their, his.
- DT0
- general determiner: a determiner which is not a DTQ, e.g. this in both "This is my house" and "This house is mine". A determiner is defined as a word which typically occurs either as the first word in a noun phrase, or as the head of a noun phrase.
- DTQ
- wh-determiner, e.g. which, what, whose. The same tag is used whether the word is used interrogatively or to introduce a relative clause.
- EX0
- existential there, i.e. the word there appearing in the constructions "there is ...", "there are ...".
- ITJ
- interjection or other isolate, e.g. oh, yes, mhm, wow.
- NN0
- common noun, neutral for number, e.g. aircraft, data, committee. Singular collective nouns such as committee take this tag on the grounds that they can be followed by either a singular or a plural verb.
- NN1
- singular common noun, e.g. pencil, goose, time, revelation.
- NN2
- plural common noun, e.g. pencils, geese, times, revelations.
- NP0
- proper noun, e.g. London, Michael, Mars, IBM. Note that no distinction is made for number in the case of proper nouns, since plural proper names are a comparative rarity.
- ORD
- ordinal numeral, e.g. first, sixth, 77th, next, last. No distinction is made between ordinals used in nominal and adverbial roles. next and last are included in this category, as general ordinals.
- PNI
- indefinite pronoun, e.g. none, everything, one (pronoun), nobody. This tag is applied to words which always function as heads of noun phrases. Words like some and these, which can also occur before a noun head in an article-like function, are tagged as determiners, DT0 or AT0.
- PNP
- personal pronoun, e.g. I, you, them, ours. Note that possessive pronouns such as ours and theirs are included in this category.
- PNQ
- wh-pronoun, e.g. who, whoever, whom. The same tag is used whether the word is used interrogatively or to introduce a relative clause.
- PNX
- reflexive pronoun, e.g. myself, yourself, itself, ourselves.
- POS
- the possessive or genitive marker 's or '. Note that this marker is tagged as a distinct word. For example, in "Peter's or someone else's", each 's is tagged POS separately from the word it follows (Peter, else).
- PRF
- the preposition of. This word has a special tag of its own, because of its high frequency and its almost exclusively postnominal function.
- PRP
- preposition, other than of, e.g. about, at, in, on behalf of, with. Note that prepositional phrases like on behalf of or in spite of are treated as single words.
- TO0
- the infinitive marker to.
- UNC
- unclassified items which are not appropriately classified as items of the English lexicon. Examples include foreign (non-English) words; special typographical symbols; formulae; hesitation fillers such as errm in spoken language.
- VBB
- the present tense forms of the verb be, except for is or 's: am, are, 'm, 're, be (subjunctive or imperative), ai (as in ain't).
- VBD
- the past tense forms of the verb be, was, were.
- VBG
- -ing form of the verb be, being.
- VBI
- the infinitive form of the verb be, be.
- VBN
- the past participle form of the verb be, been.
- VBZ
- the -s form of the verb be, is, 's.
- VDB
- the finite base form of the verb do, do.
- VDD
- the past tense form of the verb do, did.
- VDG
- the -ing form of the verb do, doing.
- VDI
- the infinitive form of the verb do, do.
- VDN
- the past participle form of the verb do, done.
- VDZ
- the -s form of the verb do, does.
- VHB
- the finite base form of the verb have, have, 've.
- VHD
- the past tense form of the verb have, had, 'd.
- VHG
- the -ing form of the verb have, having.
- VHI
- the infinitive form of the verb have, have.
- VHN
- the past participle form of the verb have, had.
- VHZ
- the -s form of the verb have, has, 's.
- VM0
- modal auxiliary verb, e.g. can, could, will, 'll, 'd, wo (as in won't).
- VVB
- the finite base form of lexical verbs, e.g. forget, send, live, return. This tag is used for imperatives and the present subjunctive forms, but not for the infinitive (VVI).
- VVD
- the past tense form of lexical verbs, e.g. forgot, sent, lived, returned.
- VVG
- the -ing form of lexical verbs, e.g. forgetting, sending, living, returning.
- VVI
- the infinitive form of lexical verbs, e.g. forget, send, live, return.
- VVN
- the past participle form of lexical verbs, e.g. forgotten, sent, lived, returned.
- VVZ
- the -s form of lexical verbs, e.g. forgets, sends, lives, returns.
- XX0
- the negative particle not or n't.
- ZZ0
- alphabetical symbols, e.g. A, a, B, b, c, d.
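Many of the codes above share their first two characters with related codes (for example VVB, VVD, VVG, VVI, VVN and VVZ for lexical verbs). The sketch below uses that pattern to map a tag to a coarse word class. This is only an illustration: the function name, the dictionary, and the coarse class labels are hypothetical and are not defined by the BNC Manual, and grouping by prefix is an observation about the list rather than a rule the manual states.

```python
# Hypothetical coarse classes keyed by the first two characters of a tag.
# The labels are illustrative and are not part of the BNC tagset itself.
COARSE_CLASSES = {
    "AJ": "adjective",
    "AT": "article",
    "AV": "adverb",
    "CJ": "conjunction",
    "DP": "possessive determiner",
    "DT": "determiner",
    "NN": "common noun",
    "NP": "proper noun",
    "PN": "pronoun",
    "PR": "preposition",
    "VB": "verb be",
    "VD": "verb do",
    "VH": "verb have",
    "VM": "modal auxiliary verb",
    "VV": "lexical verb",
}


def coarse_class(tag: str) -> str:
    """Return a coarse class for a three-character tag such as 'VVD'.

    Tags whose prefix is not in the table (e.g. CRD, ORD, XX0) map to 'other'.
    """
    return COARSE_CLASSES.get(tag[:2], "other")
```

For instance, `coarse_class("VHZ")` looks up the prefix `VH`, while a tag such as `ZZ0` falls through to the `"other"` default.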
The following portmanteau tags are used to indicate where the CLAWS system has indicated an uncertainty between two possible analyses:
- AJ0-AV0
- adjective or adverb
- AJ0-NN1
- adjective or singular common noun
- AJ0-VVD
- adjective or past tense verb
- AJ0-VVG
- adjective or -ing form of the verb
- AJ0-VVN
- adjective or past participle
- AVP-PRP
- adverb particle or preposition
- AVQ-CJS
- wh-adverb or subordinating conjunction
- CJS-PRP
- subordinating conjunction or preposition
- CJT-DT0
- that as conjunction or determiner
- CRD-PNI
- one as number or pronoun
- NN1-NP0
- singular common noun or proper noun
- NN1-VVB
- singular common noun or base verb form
- NN1-VVG
- singular common noun or -ing form of the verb
- NN2-VVZ
- plural noun or -s form of lexical verb
- VVD-VVN
- past tense verb or past participle
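Since every portmanteau tag listed above joins two ordinary codes with a hyphen, a consumer of the corpus can recover both candidate analyses by splitting on that hyphen. The following is a minimal sketch under that assumption; the function names `split_tag` and `may_be` are hypothetical and do not come from the BNC Manual or any BNC tool.

```python
def split_tag(tag: str) -> tuple[str, ...]:
    """Return the candidate codes encoded in a tag.

    A simple tag such as 'NN1' yields one candidate; a portmanteau tag
    such as 'AJ0-VVN' yields the two codes it joins.
    """
    return tuple(tag.split("-"))


def may_be(tag: str, candidate: str) -> bool:
    """Report whether `candidate` is one of the analyses allowed by `tag`."""
    return candidate in split_tag(tag)
```

With this, a query for past participles could accept any word where `may_be(tag, "VVN")` holds, which covers VVN itself as well as AJ0-VVN and VVD-VVN.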
The following codes are used with c elements only:
- PUL
- left bracket (i.e. ( or [ )
- PUN
- any mark of separation ( . ! , : ; - ? ... )
- PUQ
- quotation mark ( ` ' `` '' )
- PUR
- right bracket (i.e. ) or ] )
Note that some punctuation marks (notably long dashes and ellipses) are not tagged as such in the corpus, but appear simply as entity references.
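The four punctuation codes can be written down as a small lookup table. The sketch below copies the marks exactly as they are listed above; the names `PUNCTUATION_CODES`, `MARK_TO_CODE` and `punctuation_code` are hypothetical, and marks that are not in the list (including anything the corpus stores as an entity reference) simply return `None`.

```python
from typing import Optional

# Marks copied from the list above; the dictionary itself is illustrative.
PUNCTUATION_CODES = {
    "PUL": ["(", "["],
    "PUN": [".", "!", ",", ":", ";", "-", "?", "..."],
    "PUQ": ["`", "'", "``", "''"],
    "PUR": [")", "]"],
}

# Invert the table so that a mark can be looked up directly.
MARK_TO_CODE = {
    mark: code for code, marks in PUNCTUATION_CODES.items() for mark in marks
}


def punctuation_code(mark: str) -> Optional[str]:
    """Return the punctuation code for a listed mark, or None if it is not listed."""
    return MARK_TO_CODE.get(mark)
```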