Difference between revisions of "Tiwi and English/Structural transfer"
(→Tagger) |
(→Gendered Nouns and Adjectives) |
||
(4 intermediate revisions by 2 users not shown) | |||
Line 36: | Line 36: | ||
=== Gendered Nouns and Adjectives === | === Gendered Nouns and Adjectives === | ||
− | ==== Eng | + | ==== Tiw-Eng ==== |
(tiw) ngarra kijinga → (eng) he is small | (tiw) ngarra kijinga → (eng) he is small | ||
+ | |||
==== Tagger ==== | ==== Tagger ==== | ||
− | + | {{transferMorphTest|tiw|eng|ngarra{{tag|prn}}{{tag|pers}}{{tag|p3}}{{tag|m}}{{tag|sg}} kiji {{tag|adj}}{{tag|f}}| prpers{{tag|prn}}{{tag|subj}}{{tag|p3}}{{tag|m}}{{tag|sg}} be {{tag|vbser}}{{tag|pres}}{{tag|p3}}{{tag|sg}} small {{tag|adj}}{{tag|sint}} }} | |
+ | |||
==== Biltrans ==== | ==== Biltrans ==== | ||
^ngarra<prn><m><p3><sg>/he<prn><m><p3><sg>$ ^kiji<adj><f>/small<adj><sint><f>$^.<sent>/.<sent>$ | ^ngarra<prn><m><p3><sg>/he<prn><m><p3><sg>$ ^kiji<adj><f>/small<adj><sint><f>$^.<sent>/.<sent>$ | ||
Line 46: | Line 48: | ||
=== Insert "Be" Verbs === | === Insert "Be" Verbs === | ||
− | ==== Eng | + | ==== Tiw-Eng ==== |
(tiw) kamini naki? → (eng) what is this? | (tiw) kamini naki? → (eng) what is this? | ||
==== Tagger ==== | ==== Tagger ==== | ||
Line 57: | Line 59: | ||
=== Word to Word Phrase === | === Word to Word Phrase === | ||
− | ==== Eng | + | ==== Tiw-Eng ==== |
{{transferTest|tiw|eng|jupijupi awi kiyija mirrawu | soup and a little bit of tobacco}} | {{transferTest|tiw|eng|jupijupi awi kiyija mirrawu | soup and a little bit of tobacco}} | ||
==== Tagger ==== | ==== Tagger ==== |
Latest revision as of 14:34, 8 May 2021
Contents
Pre-Evaluation
Statistics about input files
Number of words in reference: 41
Number of words in test: 28
Number of unknown words (marked with a star) in test: 13
Percentage of unknown words: 14.63%
Results when removing unknown-word marks (stars)
Edit distance: 36
Word error rate (WER): 87.8%
Number of position-independent correct words: 5
Position-independent word error rate (PER): 87.8%
Results when unknown-word marks (stars) are not removed
Edit distance: 36
Word error rate (WER): 87.8%
Number of position-independent correct words: 5
Position-independent word error rate (PER): 87.8%
Statistics about the translation of unknown words
Number of unknown words which were free rides: 0
Percentage of unknown words that were free rides: 0%
Examples
Gendered Nouns and Adjectives
Tiw-Eng
(tiw) ngarra kijinga → (eng) he is small
Tagger
(tiw) ngarra<prn><pers><p3><m><sg> kiji <adj><f> → (eng) prpers<prn><subj><p3><m><sg> be <vbser><pres><p3><sg> small <adj><sint>
Biltrans
^ngarra<prn><m><p3><sg>/he<prn><m><p3><sg>$ ^kiji<adj><f>/small<adj><sint><f>$^.<sent>/.<sent>$
Transfer
^he<prn><p3><m><sg>$ ^be<vbser><pres><p3><sg>$ ^small<adj><sint>$^.<sent>$
Insert "Be" Verbs
Tiw-Eng
(tiw) kamini naki? → (eng) what is this?
Tagger
(tiw) kami<prn><itg><m> naki<prn><dem><sg> → (eng) what <prn><itg><mf><sp> be <vbser><pres><p3><sg> this<prn><dem><mf><sg>
Biltrans
^kami<prn><itg><m>/what<prn><itg><m>/which<prn><itg><m>$ ^naki<prn><dem><sg>/this><prn><dem><sg>$^.<sent>/.<sent>$
Transfer
^what<prn><itg><m>$ ^be<vbser><pres><p3><sg>$ ^this><prn><dem><sg>$^.<sent>$
Word to Word Phrase
Tiw-Eng
(tiw) jupijupi awi kiyija mirrawu → (eng) soup and a little bit of tobacco
Tagger
(tiw) jupijupi<n> awi <cnjcoo> kiyija<prn><qnt> mirrawu <n> → (eng) soup<n><sg> and <cnjcoo> a little bit <adv> of <pr> tobacco <n><sg>
Biltrans
^jupijupi<n>/soup<n>$ ^awi<cnjcoo>/and<cnjcoo>$ ^kiyija<prn><qnt>/a little bit<adv>$ ^mirrawu<n>/tobacco<n>$^.<sent>/.<sent>$
Transfer
^soup<n><sg>$ ^and<cnjcoo>$ ^a little bit<adv>$ ^of<pr>$ ^tobacco<n><sg>$^.<sent>$
Post Evaluation
Statistics about input files
Number of words in reference: 41
Number of words in test: 28
Number of unknown words (marked with a star) in test: 13
Percentage of unknown words: 46.43%
Results when removing unknown-word marks (stars)
Edit distance: 36
Word error rate (WER): 87.8%
Number of position-independent correct words: 5
Position-independent word error rate (PER): 87.8%
Results when unknown-word marks (stars) are not removed
Edit distance: 36
Word error rate (WER): 87.8%
Number of position-independent correct words: 5
Position-independent word error rate (PER): 87.8%
Statistics about the translation of unknown words
Number of unknown words which were free rides: 0
Percentage of unknown words that were free rides: 0%