Zen-NE: 2016

POS tagging: (https://www.ling.upenn.edu/courses/Fall_2003/ling001/penn_treebank_pos.html)

Number	Tag	Description
1.	CC	Coordinating conjunction
2.	CD	Cardinal number
3.	DT	Determiner
4.	EX	Existential there
5.	FW	Foreign word
6.	IN	Preposition or subordinating conjunction
7.	JJ	Adjective
8.	JJR	Adjective, comparative
9.	JJS	Adjective, superlative
10.	LS	List item marker
11.	MD	Modal
12.	NN	Noun, singular or mass
13.	NNS	Noun, plural
14.	NNP	Proper noun, singular
15.	NNPS	Proper noun, plural
16.	PDT	Predeterminer
17.	POS	Possessive ending
18.	PRP	Personal pronoun
19.	PRP$	Possessive pronoun
20.	RB	Adverb
21.	RBR	Adverb, comparative
22.	RBS	Adverb, superlative
23.	RP	Particle
24.	SYM	Symbol
25.	TO	to
26.	UH	Interjection
27.	VB	Verb, base form
28.	VBD	Verb, past tense
29.	VBG	Verb, gerund or present participle
30.	VBN	Verb, past participle
31.	VBP	Verb, non-3rd person singular present
32.	VBZ	Verb, 3rd person singular present
33.	WDT	Wh-determiner
34.	WP	Wh-pronoun
35.	WP$	Possessive wh-pronoun
36.	WRB	Wh-adverb

Chunking: (highlighted are more commonly used)

ADJP: Adjective phrase

ADVP: Adverb phrase

NP: Noun phrase

PP: Prepositional phrase

S: Simple declarative clause

SBAR: Subordinate clause

SBARQ: Direct question introduced by wh-element

SINV: Declarative sentence with subject-aux inversion

SQ: Yes/no questions and subconstituent of SBARQ excluding wh-element

VP: Verb phrase

WHADVP: Wh-adverb phrase

WHNP: Wh-noun phrase

WHPP: Wh-prepositional phrase

X: Constituent of unknown or uncertain category

*: “Understood” subject of infinitive or imperative

0: Zero variant of that in subordinate clauses

T: Trace of wh-Constituent

IOB or IOB2 format for inside, outside, and beginning

B: beginning

I: inside

O: not in any chunk (, . and, etc.)

E: ending

S: standalone

NER

PER: person

ORG: organization

LOC: locations

MISC: misellaneous

Also sometimes including times and quantities (ordinals and numbers)

B: beginning

I: inside

E: ending

S: simple

Semantic Role Labeling (http://www.cs.upc.edu/~srlconll/)

A semantic role in language is the relationship that a syntactic constituent has with a predicate. Typical semantic arguments include Agent, Patient, Instrument, etc. and also adjunctive arguments indicating Locative, Temporal, Manner, Cause, etc. aspects. Recognizing and labeling semantic arguments is a key task for answering "Who", "When", "What", "Where", "Why", etc. questions in Information Extraction, Question Answering, Summarization, and, in general, in all NLP tasks in which some kind of semantic interpretation is needed.

The following sentence, taken from the PropBank corpus, exemplifies the annotation of semantic roles:

[_A0 He ] [_AM-MOD would ] [_AM-NEG n't ] [_V accept ] [_A1 anything of value ] from [_A2 those he was writing about ] .

Here, the roles for the predicate accept (that is, the roleset of the predicate) are defined in the PropBank Frames scheme as:

V: verb
A0: acceptor
A1: thing accepted
A2: accepted-from
A3: attribute
AM-MOD: modal
AM-NEG: negation

Parsing

Parsing may show the sentence structure (generally a tree structure) with the above tags.

Zen-NE

Sunday, January 10, 2016

NLP cheatsheet

POS tagging: (https://www.ling.upenn.edu/courses/Fall_2003/ling001/penn_treebank_pos.html)

Chunking: (highlighted are more commonly used)

NER

Semantic Role Labeling (http://www.cs.upc.edu/~srlconll/)

Parsing

About Me