Korean IL0 Manual

Features on All Nodes
Nouns
Verbs and Auxiliary verbs
Adjectives and Adverbs

Copular Constructions
Conjunctions
Punctuation

Features on All Nodes

Each node in the dependency tree can be thought of as an attribute-value matrix, i.e., a bundle of features with values. All values must be set for each node in the tree. This will require checking each node before finishing the analysis. Here is a list of features:

Position (wpos). The linear position of the word in the sentence. This should not be modified or annotated, except for new empty nodes created by the annotator, which should always be given the wpos 100.

Word (lex). This is the inflected word form associated with the node. It is almost always correctly displayed already.

Part-of-Speech (POS). This is the lexical class, taken from a short list. Example: verb. Specific options:

¡Æ          V -- verbs

¡Æ          N -- common nouns

¡Æ          PN -- proper nouns

¡Æ          Adj -- adjectives (including indicative adjectives)

¡Æ          Adv -- adverbs (including conjunctive adverbs)

¡Æ          Conj – coordinating conjunctions

¡Æ          Pun -- punctuation marks

¡Æ          Sym -- various symbols (dollar signs and the like)

¡Æ          Uh -- speech-specifc sounds, even if meaningful (such as /UH HUH/)

¡Æ          Misc -- everything else

Base form (Root). This is the base form (lexeme) of the inflected form. A first "guess" will be included, which needs to be checked and corrected.

English Translation (Trans). This is an English translation for the given word and this should be added. If there exist more than one proper translation, all they should be given in the form of "gloss:translation1/translation2/translation3".

Deep role (DRole). This is the role of the node with respect to its mother, in some deeper representation. This is a little murky. We will use strictly syntactic criteria. DRole reflects the argument patterns of the the verb if it were in its active, non-ergative form.

¡Æ          Subj -- subject

¡Æ          Obj -- object

¡Æ          Obj2 -- indirect object

¡Æ          Mod -- Modifier

¡Æ          Root -- Root of sentence

Done. This feature is only a check to make sure that the default values have been checked. Set it to "Y" when you are done with the features for one node.

Nouns

Noun is (almost always) combined with Particle(we may call it noun-suffix). Particle gives grammatical function to noun such as conjunction, possession, object, subject, location, time, comparison, and so on.  We do not analyze particle at this point. We have a single node for a noun+particle and set noun as a base form of the node.

Nominal modifiers

The head of a noun phrase is the head noun. Any adnominals (e.g. demonstrative (pro)nouns ÀÌ ¡°this¡±, Àú ¡°that¡±, ±× ¡°the¡±) and adjectives are dependents of the noun. If there are multiple adjectives, the default structure will simply have each adjective as a direct dependent of the noun. For example, both the demonstrative adnominal ÀÌ "this" and the adjectival noun-based adnominal ÀÛÀº "small" are direct dependents of the noun µµ½Ã "city" in the noun phrase ÀÌ ÀÛÀº µµ½Ã "this small city".

Compound Nouns

Compound noun phrases, when clear, can have multiple noun phrases as dependents. For example, ÇÒÀÎÇ×°ø¿ä±Ý ¡°discount air fare¡± will have ¿ä±Ý ¡°fare¡± as the head and ÇÒÀÎ ¡°discount¡± and Ç×°ø ¡°air¡± as its direct dependents. A good test for this is to remove each noun in turn, to see if the phrase still retains part of its original sense. Because ÇÒÀÎÇ×°ø¿ä±Ý ¡°discount air fare¡± is a air fare and a discount fare, this analysis is the one we want.

In contrast, a phrase like Ç×°ø¿ä±ÝÇÒÀÎ ¡°air fare discount¡± should be annotated with ÇÒÀÎ ¡°discount¡± as the head, ¿ä±Ý ¡°fare¡± as its dependent, and Ç×°ø ¡°air¡± as a dependent of ¿ä±Ý ¡°fare¡±.

In cases where it's not clear whether or which nouns modify each other, the default compound structure will have all modifying nouns as direct dependents on the rightmost noun.

Proper Nouns

Proper nouns should have the value PN for feature POS. They are treated largely like nouns, except that compound proper nouns are not analyzed syntactically as if they were common nouns, but rather given single nodes. (The intuition is that they are really fixed phrases.)

Dependent Nouns (Functional Nouns)

Dependent nouns are nouns because they work as nouns in a sentence, but they are dependent because they are required to be modified by VP, ADJP, or NP, which means they cannot be used independently.  Dependent nouns sometimes add minor meanings (e.g. »Ó ¡°only¡±), but mainly they are used in functional purpose.  For example, there are ºÐ, °Í, ¹Ù, µûÀ§ (dependent noun-general), Áö, ¼ö, ¸® (dependent noun-subjective), µû¸§, »Ó, ÅÍ (dependent noun- descriptive), ÁÙ (dependent noun-objective), µí, ¸¸Å­ (dependent noun-adverbial), and ¸í, °³, ±×·ç (dependent noun-unit). 

Dependent noun phrase can be subject, object or any other part by adding an appropriate particle (noun-suffix) as other noun phrases, but it is often translated into gerund, to-infinitive, or relative clause (with ¡°what¡±) in English.  For example, the dependent noun °Í "thing" (actually it's not really "thing" since it is not used independently as "thing") is used in the following.

±×´Â ¾Æ¸§´Ù¿î °ÍÀ» ÁÁ¾ÆÇÑ´Ù. "He beautiful thing like" (He likes beautiful things)

±×¸¦ ¼³µæÇÏ´Â °ÍÀº ºÒ°¡´ÉÇÏ´Ù. "him persuade thing+eun impossible" (To persuade him is impossible)

³»°¡ ÇÒ ¼ö ÀÖ´Â °ÍÀº ÀÌ°Í »ÓÀÌ´Ù. "I do can thing this only be" (What I can do is this)

Dependent noun is head of the phrase and VP, ADVP, or NP modifying this dependent noun is a dependent of the noun. 

Numeral Nouns and Counters

Numeral nouns such as ÀÌ½Ê ¡°twenty¡±, ½Ê»ç ¡°fourteen¡± are usually followed by counters such as ¸í ¡°people¡±, ³â ¡°year¡±, ¹ø ¡°times¡±.  The counter is one kind of dependent nouns (unit) and it is head of numeral noun phrase as explained in Dependent Nouns section.  So in ¾ÆÈ© ¸í ¡°nine people¡±, ¡°people¡± is head and ¡°nine¡± modifies ¡°people¡± giving the information on how many people are there.

Numeral noun phrases may be preceded by adjectives such as Á¦ ¡°ordinal number¡±, ¾à ¡°about¡±,  ÃÑ ¡°in total¡±.  For example, in the phrase of ¾à ÀÌ½Ê ¸í ¡°about twenty people¡±, both ¡°about¡± and ¡°twenty¡± are independent dependents of ¡°people.¡±

Particles (Noun-suffixes, postpositions)

Nouns are followed by particles to have grammatical functions in a sentence.  There are many kinds of particles such as °¡ ¡°ka¡± (subjective), ¸¦ ¡°reul¡± (objective), ¿¡°Ô ¡°eghye¡± (indirect objective), ¿¡¼­ ¡°eseo¡± (location).  They don¡¯t appear as separate nodes but in one node with nouns.  (see the first part of Nouns section)

Verbs and Auxiliary verbs

Verbs are heads of sentences and clauses.

Choosing a Head

The head of any complete clausal utterance is the main verb. Incomplete utterances (NPs) should have as their head the usual head for that type of phrase. Verb form changes by adding a suffix to the stem word. Different suffix can make a verb into different closing forms (declarative, interrogative, imperative, exclamatory), conjunctive forms (coordinate, subordinate), or derivative forms (noun, adjective). 

Verbs are often followed by auxiliary verbs to add more meanings such as negative (¾Ê´Ù ¡°not¡±), continuous (ÀÖ´Ù ¡°ing¡±), in these cases, auxiliary verbs are dependents of a main verb.

Arguments and adjuncts

In distinguishing between arguments and adjuncts, consistency is the most important thing. This distinction will matter most for annotating empty categories. In addition, each argument will be annotated with a feature encoding its grammatical role. All non-arguments will be annotated as adjuncts, including function words.

Grammatical relations

The role of each argument (subject, object, indirect object) must be annotated as a feature of its node.

Empty categories and missing constituents

Missing arguments will appear as empty nodes. Missing adjuncts will not.

¡¤        Empty nominal nodes: big-PRO: 

1.        These are cases of empty nodes where the meaning can be derived from the syntactic context: when a sentence contains subordinate sentences (phrases).  Please see this example sentence, °¡ÀÔÀÚ°¡ ¹øÈ£ À̵¿À» ½ÅûÇÏ¸é »ç¾÷ÀÚ¸¦ º¯°æÇÒ ¼ö ÀÖ´Ù ¡°subscriber number move request provider change can¡± ( If a subscriber requests to change a number, he can change the provider.)  In this sentence, the phrase °¡ÀÔÀÚ°¡ ¹øÈ£ À̵¿À» ½ÅûÇÏ¸é ¡°If a subscriber requests to change a number¡± is a subordinate of the phrase »ç¾÷ÀÚ¸¦ º¯°æÇÒ ¼ö ÀÖ´Ù. ¡°can change the provider¡±.  The subject °¡ÀÔÀÚ°¡ ¡°subscriber¡± is a subject for both phrases, so we introduce an empty node for the subordinate phrase and identify the node with which it is co-referential.  We then copy the co-referential node's word and lexeme values to the empty node, but add brackets around the value: "<°¡ÀÔÀÚ°¡>".

¡¤        Empty nominal nodes: little-pro, missing argument in passive

1.        The missing argument in agentless passives.  For example, in ³ª¿¡°Ô Ã¥ÀÌ ÁÖ¾îÁ³´Ù "I was given a book", we know from syntax that there is another argument role which is not explicitly filled, namely the ddep subject.  This role is added at IL0.  We cannot tell syntactically what this is, only pragmatically.

2.        Arbitrary empty subjects, usually in adjunct clauses.  For example, in "Eating jicama is good for children's brain growth", the subject of "eating" is not specified.

In these cases, we label both the lexeme and the word feature of the new node "<pro>".  In case of doubt ("<pro>" or "<°¡ÀÔÀÚ°¡>"), ask yourself: can I tell from syntax alone what this node means?  If no, <pro>".  If yes, fill in the lexeme.

Questions

Questions are treated as any other sentence, with the question word taking the same position and grammatical relation as the answer to the question would take.

Imperatives

If an overt subject is not present, include an empty noun; otherwise an imperative will have the same analysis as a declarative sentence.  As mentioned in Choosing a Head section, the auxiliary verbs are dependents of a main verb.  For example, in ²ÉÀ» ÆÈ¾Æ ÁÖ¼¼¿ä ¡°flowers sell –juseyo¡±, ÆÈ¾Æ ¡°sell¡± is a main verb and ÁÖ¼¼¿ä ¡°-juseyo¡± is an auxiliary verb adding a meaning of ¡°do something for others¡±, yes, the sentence means ¡°Buy flowers¡±. So, ÆÈ¾Æ ¡°sell¡± is head of the sentence.

Passive

A sentence can have passive meaning by changing a suffix of a verb.  Following the meaning, the causer is given a role of subject and the caused is given a role of object.  For example, ±×°¡ ³ª¸¦ ¹«½ÃÇÏ´Ù ¡°he me ignore¡± (He ignores me) can be changed into passive form of ³ª´Â ±×¿¡°Ô ¹«½Ã´çÇÑ´Ù. ¡°I him-from ignore-danghanda¡± (I am ignored by him), and both sentences have a subject of ¡°he¡± and an object of ¡°I¡±.

Causative

A sentence can have causative meaning by changing a suffix of a verb. To give a proper role for each, make a separate node for causing suffix and let it be a head of the sentence.  The dependents will be the subject and the caused action.  For example, in ³»°¡ ±×¸¦ °øºÎ½ÃŲ´Ù ¡°I him study-sikinda¡± (I make him study), ½ÃŲ´Ù ¡°-sikinda¡± (make) is the head of the sentence, and ³»°¡ ¡°I¡± and ±×¸¦ °øºÎ ¡°him study¡± are dependents of the head as a subject and an object respectively.  ±×¸¦ °øºÎ ¡°him study¡± are analyzed as °øºÎ ¡°study¡± is a head of the phrase and ±×¸¦ ¡°him¡± is a dependent as a subject of the head ¡°study¡±.

Non-finite clauses

There is no non-finite clause in Korean.

Relative clauses

There is no relative clause in Korean.

Adjectives and Adverbs

Adjectives

Adjectives modify nouns. For example, in the phrase of ¾Æ¸§´Ù¿î ¼Ò³à ¡°beautiful girl¡±, the adjective ¾Æ¸§´Ù¿î ¡°beautiful¡± modifies ¼Ò³à ¡°girl¡± by identifying the appearance of the girl.

Predicate Adjectives

Adjectives can be predicates in sentences.  In this case, adjectives almost look like verbs grammatically, but they are not about action or movement but about quality or status.  Predicate adjectives can have different forms by changing a suffix like verbs, but they show slightly different conjugation from verbs. Predicative adjectives are simply thought as ¡°be + adjective¡± in English. For example, in ±×³à´Â ¾Æ¸§´ä´Ù ¡°she beautiful¡± (She is beautiful), ¾Æ¸§´ä´Ù ¡°beautiful¡± is head of a sentence.

Auxiliary Adjectives

As auxiliary verbs, there are auxiliary adjectives.  They can follow verbs or predicate adjectives, and they add meanings about status.  They are dependent of main verbs or adjectives.  For example, in ÇÏ°í ½Í´Ù ¡°do want¡± (want to do), ÇÏ°í ¡°do¡± is head.

Adverbs

Adverbs modify verbs. For example, ±×³à´Â ¿ì¾ÆÇÏ°Ô ÃãÃè´Ù "she danced gracefully.", the adverb ¿ì¾ÆÇÏ°Ô "gracefully" modifies the verb by specifying the manner in which the action was performed.

Copular Constructions

When a noun is used as a predicate, copula ¡°ÀÌ´Ù¡± is added to the end of the noun.  The predicate is a head of the sentence and the subject will have a role of ¡°subj¡±.  For example, in ³ª´Â ÇлýÀÌ´Ù ¡°I student-ida¡± (I am a student), ÇлýÀÌ´Ù ¡°student-ida¡± is the head.

Conjunctions

Conjunction has its own part-of-speech (Conj).

Coordinating Conjunctions

The coordinating conjunction is placed as a dependent of the first conjunct with role Mod, and the second conjunct is a dependent of the conjunction with role Obj.  If a comma acts as a conjunction, it is treated as such (given part-of-speech Conj and analyzed as in the above sentence).  If noun particle or verb suffix is used to connect two phrase or sentences without explicit conjunction, the second conjunct is a dependent of the head of the first conjunct.

Subordinating Conjunctions

Subordinating conjunctions are dependents the head of the principal clause in the sentence. Subordinates are dependents of the conjunctions.

Punctuation

Remove all punctuation, except meaningful punctuation.

Examples:

Quotes -- leave them (open and closed) attached to the constituent which is quoted. If the quoted passage is not a constituent, quote each pience separately.

Commas -- leave them when they are used as conjunction.

Do remove:

All non-conjunction commas.

All sentence-final punctuation.

All dashes and so on.