Difference between revisions of "Fijian/Transducer"
From LING073
(→Notes) |
(→Analyser Evaluation) |
||
Line 5: | Line 5: | ||
==Analyser Evaluation== | ==Analyser Evaluation== | ||
* Total number of stems in transducer: 21 | * Total number of stems in transducer: 21 | ||
− | + | * Morph Test: | |
+ | :-Transitivity of Verbs: Passes 15, Total 15; | ||
+ | :-Fist person inclusive pronouns: Passes 19, Total 32; | ||
+ | :-First person exclusive pronouns: Passes 15, Total 21; | ||
+ | :-Second person pronouns: Passes 10, Total 32; | ||
+ | :-Third person pronouns: Passes 21, Total 24; | ||
*Current Coverage: Running aq-covtest on my fij.corpus.basic.txt, I got: | *Current Coverage: Running aq-covtest on my fij.corpus.basic.txt, I got: | ||
Revision as of 14:30, 6 March 2018
Contents
Transducer Link
The morphological transducer for Fijian is at: https://github.swarthmore.edu/hwang11/ling073-fij
Analyser Evaluation
- Total number of stems in transducer: 21
- Morph Test:
- -Transitivity of Verbs: Passes 15, Total 15;
- -Fist person inclusive pronouns: Passes 19, Total 32;
- -First person exclusive pronouns: Passes 15, Total 21;
- -Second person pronouns: Passes 10, Total 32;
- -Third person pronouns: Passes 21, Total 24;
- Current Coverage: Running aq-covtest on my fij.corpus.basic.txt, I got:
- -Number of tokenised words in the corpus: 1306
- -Coverage: 4.67%
- Initial top unknown words:
na 113
a 75
e 62
o 40
ni 40
mai 35
ka 33
i 30
sa 28
E 23
ko 19
vei 18
me 17
u 16
dua 13
kina 11
ki 10
Viti 10
koya 9
vaka 9
- -The top unknown words na, a, and o are articles <art>: na and a are used with common nouns and o is used with pronouns and proper names.
- -The unknown words me is a relator <rel>.
- After adding the analyses of na, a, o, and me to my transducer, the coverage went from 4.67% to 24.35%.
- -Current unknown words:
e 62
ni 40
mai 35
ka 33
i 30
sa 28
E 23
ko 19
vei 18
u 16
dua 13
kina 11
ki 10
Viti 10
koya 9
vaka 9
kena 8
levu 8
tu 8
vena 8
Generator Evaluation
Initial evaluation of morphological generation
- Number of passed morphological analysis tests:
- Current coverage: 24.35%
Final Evaluation
- Number of passing and failing tests:
- Number of twol rules added: 5