Part-Of-Speech (POS) tagging#
Part-of-speech tagging (POS) is the process of assigning a POS (noun, adjective, verb, etc) to a word. In Wowool you can choose, whether you want to preserve all the POS when there is not enough evidence to assign a single POS to a word, or (the default) to run it as a deterministic process where every word will get the most likely POS.
The POS are used to simplify rule writing, as they provide another layer of abstraction: Nouns (Nn) are usually used to express concepts, like “mother” or “freedom”, Verbs (V) denote action, like “collect” or “jump”, adjectives (Adj) qualify nouns: “rich”, “mysterious”, “yellow”.
Literal Lemma POS
"Age" 'age' Nn-Sg
"is" 'be' V-Pres-Sg-be
"an" 'a' Det-Indef
"issue" 'issue' Nn-Sg
"of" 'of' Prep-of
"mind" 'mind' Nn-Sg
"over" 'over' Prep-Std
"matter" 'matter' Nn-Sg
"." '.' Punct-Sent
The POS tags can either be used fully (eg “V-Pres-Sg-be”), or as so-called umbrella tags (eg “V”).
Umbrella tags are simplified POS tags containing the POS until the first hyphen. Umbrella tags are consistent for all languages, whereas the full tags differ.
Umbrella Tag |
Meaning |
Example |
---|---|---|
Adj |
Adjective |
“big”, “incendiary” |
Adv |
Adverb |
“incredibly” |
Aux |
Auxiliary |
“do”, “will” |
Conj |
Conjunction |
“if”, “and” |
Det |
Determiner |
“the”, “my” |
Interj |
Interjection |
“yes” |
Nn |
Noun |
“car”, “imagination” |
Num |
Number |
“1”, “forty” |
Part |
Particle |
“to”, “not” |
Prep |
Preposition |
“in”, “for” |
Pron |
Pronoun |
“he”, “anybody” |
Prop |
Proper Noun |
“England”, “Michelle” |
Punct |
Punctuation |
“,”, “:” |
V |
Verb |
“is”, “came” |
What’s the use of Part Of Speech#
POS is used to generalize: it helps us to write rules in a simple way: for example, if we are looking for descriptions of wine, the adjectives can help us:
rule:
{
Adj 'wine'
} = WineType;
Output:
red winewhite winesdelicious winetasteless wine
It also helps us to disambiguate or distinguish a word’s meaning: for example ‘store’ can be a place where merchandise is offered for sale (a noun) or an action (a verb): ‘to put something in a place where it is available’. The same happens with the word ‘terminal’: it can be an adjective in ‘a terminal illness’ or a noun ‘passengers can proceed to the terminal’.
If you want to annotate places, you could use the POS to find the correct meaning:
rule:
{
(
<'store',Nn>
| <'terminal',Nn>
)
} = Place;
Note
POS can only be referred to in rules, not in lexicons, as you will see in the following chapters.