Difference between revisions of "Kaingang/Universal Dependencies"

From LING073
Jump to: navigation, search
(NMOD:POSS)
(OBJ)
Line 57: Line 57:
 
====OBJ====
 
====OBJ====
  
 +
We used OBJ to connect nouns to their dependent direct objects.
  
 
[[Category: sp19_UD]] [[Category: Kaingang]]
 
[[Category: sp19_UD]] [[Category: Kaingang]]

Revision as of 21:14, 30 April 2019

kgp.annotated2.ud.conllu and kgp.annotated2.nomorph.conllu are the files we chose to train the parser on (not kgp.annotated.ud.conllu or kgp.annotated.nomorph.conllu).

Evaluation

Number of forms, UAS, LAS scores for kgp.annotated2.ud.conllu

$ udpipe --accuracy --parse kgp.withmorph.udpipe ../kgp.annotated2.ud.conllu
Loading UDPipe model: done.
Parsing from gold tokenization with gold tags - forms: 160, UAS: 70.62%, LAS: 55.00%
  • Number of sentences in corpus: 18

Number of forms, UAS, LAS scores for kgp.annotated2.nomorph.conllu

$ udpipe --accuracy --parse kgp.nomorph.udpipe ../kgp.annotated2.nomorph.conllu
Loading UDPipe model: done.
Parsing from gold tokenization with gold tags - forms: 160, UAS: 70.62%, LAS: 55.00%
  • Number of sentences in corpus: 18

Dependency

(for each relation: a description of the relation, noting various ways it might be used in the language; and two examples of the relation from your corpus, preferably illustrating what you described.)

Five dependency relations that we used at least twice in our annotation

CASE

Kaingang uses a set of postpositions (or what our source dictionary calls "circumstance markers" in the same way that English and Portuguese uses prepositions. We connected the objects of said prepositions with their respective postpositions using the CASE.

Kaingang also uses a large set of different subject and topic markers that indicate what the subject and focus of the sentence is. We connected all of the nouns with their respective subject markers using CASE.

NSUBJ

We used the NSUBJ relation to relate subjects to non-verbal predicates. Where Portuguese and English use a copula "to be" verb, Kaingang instead uses a particular aspect marker "vẽ".


"Ka fej kusũg vẽ."

tree flower red ASP

"The tree's flower is red"

In the above example, we would connect the nonverbal predicate kusũg to its NSUBJ dependent fej.


"Inh mỹnh fi vẽ."

my mother her ASP

"She is my mother" (lit: my mother is her)

In the above example, we would connect the nonverbal predicate fi with its NSUBJ dependent mỹnh.

ADVMOD

Kaingang does not conjugate its verbs. Instead, Kaingang speakers use mode markers, aspect markers, and opinion markers to modify verbs, define verb aspect, and apply changes to modality, respectively.

We used ADVMOD to connect a verb to each of its individual mode, aspect, and opinion markers.

NMOD:POSS

Kaingang expresses noun possession and ownership by juxtaposing nouns with one another. They place the possessor in front of the possessed to make a genitive phrase. We connected a possessed noun to its preceding possessor using NMOD:POSS.

OBJ

We used OBJ to connect nouns to their dependent direct objects.