Difference between revisions of "Fijian/Transducer"

From LING073

Revision as of 14:30, 6 March 2018

Transducer Link

The morphological transducer for Fijian is at: https://github.swarthmore.edu/hwang11/ling073-fij

Analyser Evaluation

  • Total number of stems in transducer: 21
  • Morph Test:
-Transitivity of Verbs: Passes 15, Total 15;
-First person inclusive pronouns: Passes 19, Total 32;
-First person exclusive pronouns: Passes 15, Total 21;
-Second person pronouns: Passes 10, Total 32;
-Third person pronouns: Passes 21, Total 24;
  • Current Coverage: Running aq-covtest on my fij.corpus.basic.txt, I got:
-Number of tokenised words in the corpus: 1306
-Coverage: 4.67%
  • Initial top unknown words:

na 113

a 75

e 62

o 40

ni 40

mai 35

ka 33

i 30

sa 28

E 23

ko 19

vei 18

me 17

u 16

dua 13

kina 11

ki 10

Viti 10

koya 9

vaka 9

-The top unknown words na, a, and o are articles <art>: na and a are used with common nouns and o is used with pronouns and proper names.
-The unknown word me is a relator <rel>.
  • After adding the analyses of na, a, o, and me to my transducer, the coverage went from 4.67% to 24.35%.
-Current unknown words:

e 62

ni 40

mai 35

ka 33

i 30

sa 28

E 23

ko 19

vei 18

u 16

dua 13

kina 11

ki 10

Viti 10

koya 9

vaka 9

kena 8

levu 8

tu 8

vena 8
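
As a rough sanity check on the coverage figures above, the arithmetic can be sketched in Python. This is only an estimate under stated assumptions: it is not how aq-covtest computes coverage, the helper names are hypothetical, and it assumes that the only newly analysed tokens are the exact surface forms na, a, o, and me with the frequencies shown in the initial unknown-word list.

```python
# Figures taken from the evaluation above.
TOTAL_TOKENS = 1306
INITIAL_COVERAGE = 4.67  # percent, as reported by aq-covtest

# Corpus frequencies of the four added forms (from the initial unknown-word list).
# Assumption: only these exact surface forms became analysable.
ADDED_FORMS = {"na": 113, "a": 75, "o": 40, "me": 17}


def known_tokens(coverage_percent: float, total: int) -> int:
    """Approximate number of analysed tokens implied by a coverage percentage."""
    return round(coverage_percent / 100 * total)


def coverage(known: int, total: int) -> float:
    """Coverage as the percentage of tokens that receive an analysis."""
    return 100 * known / total


initially_known = known_tokens(INITIAL_COVERAGE, TOTAL_TOKENS)
newly_known = sum(ADDED_FORMS.values())
estimate = coverage(initially_known + newly_known, TOTAL_TOKENS)

print(f"tokens known initially: {initially_known}")
print(f"tokens gained from listed forms: {newly_known}")
print(f"estimated coverage: {estimate:.2f}%")
```

Under these assumptions the estimate comes out at about 23.43%, a little below the reported 24.35%. The listed counts alone do not account for the difference; one possible explanation (an assumption, not checked against the corpus) is that other surface forms of the same words, such as capitalised variants, are also analysed now.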

Generator Evaluation

Initial evaluation of morphological generation

  • Number of passed morphological analysis tests:
  • Current coverage: 24.35%

Final Evaluation

  • Number of passing and failing tests:
  • Number of twol rules added: 5

Notes