Difference between revisions of "Tiwi and English/Structural transfer"
(update modes) |
(→Gendered Nouns and Adjectives) |
||
(24 intermediate revisions by 2 users not shown) | |||
Line 1: | Line 1: | ||
== Pre-Evaluation == | == Pre-Evaluation == | ||
− | + | ===Statistics about input files=== | |
+ | Number of words in reference: 41 | ||
+ | |||
+ | Number of words in test: 28 | ||
+ | |||
+ | Number of unknown words (marked with a star) in test: 13 | ||
+ | |||
+ | Percentage of unknown words: 14.63% | ||
+ | |||
+ | ===Results when removing unknown-word marks (stars)=== | ||
+ | Edit distance: 36 | ||
+ | |||
+ | Word error rate (WER): 87.8% | ||
+ | |||
+ | Number of position-independent correct words: 5 | ||
+ | |||
+ | Position-independent word error rate (PER): 87.8% | ||
+ | |||
+ | ===Results when unknown-word marks (stars) are not removed=== | ||
+ | Edit distance: 36 | ||
+ | |||
+ | Word error rate (WER): 87.8% | ||
+ | |||
+ | Number of position-independent correct words: 5 | ||
+ | |||
+ | Position-independent word error rate (PER): 87.8% | ||
+ | |||
+ | ===Statistics about the translation of unknown words=== | ||
+ | Number of unknown words which were free rides: 0 | ||
+ | |||
+ | Percentage of unknown words that were free rides: 0% | ||
== Examples == | == Examples == | ||
− | + | ||
+ | === Gendered Nouns and Adjectives === | ||
+ | ==== Tiw-Eng ==== | ||
+ | (tiw) ngarra kijinga → (eng) he is small | ||
+ | |||
+ | ==== Tagger ==== | ||
+ | {{transferMorphTest|tiw|eng|ngarra{{tag|prn}}{{tag|pers}}{{tag|p3}}{{tag|m}}{{tag|sg}} kiji {{tag|adj}}{{tag|f}}| prpers{{tag|prn}}{{tag|subj}}{{tag|p3}}{{tag|m}}{{tag|sg}} be {{tag|vbser}}{{tag|pres}}{{tag|p3}}{{tag|sg}} small {{tag|adj}}{{tag|sint}} }} | ||
+ | |||
+ | ==== Biltrans ==== | ||
+ | ^ngarra<prn><m><p3><sg>/he<prn><m><p3><sg>$ ^kiji<adj><f>/small<adj><sint><f>$^.<sent>/.<sent>$ | ||
+ | ==== Transfer ==== | ||
+ | ^he<prn><p3><m><sg>$ ^be<vbser><pres><p3><sg>$ ^small<adj><sint>$^.<sent>$ | ||
+ | |||
+ | === Insert "Be" Verbs === | ||
+ | ==== Tiw-Eng ==== | ||
+ | (tiw) kamini naki? → (eng) what is this? | ||
+ | ==== Tagger ==== | ||
+ | {{transferMorphTest|tiw|eng|kami{{tag|prn}}{{tag|itg}}{{tag|m}} naki{{tag|prn}}{{tag|dem}}{{tag|sg}} | what {{tag|prn}}{{tag|itg}}{{tag|mf}}{{tag|sp}} be {{tag|vbser}}{{tag|pres}}{{tag|p3}}{{tag|sg}} this{{tag|prn}}{{tag|dem}}{{tag|mf}}{{tag|sg}} }} | ||
+ | |||
+ | ==== Biltrans ==== | ||
+ | ^kami<prn><itg><m>/what<prn><itg><m>/which<prn><itg><m>$ ^naki<prn><dem><sg>/this><prn><dem><sg>$^.<sent>/.<sent>$ | ||
+ | ==== Transfer ==== | ||
+ | ^what<prn><itg><m>$ ^be<vbser><pres><p3><sg>$ ^this><prn><dem><sg>$^.<sent>$ | ||
+ | |||
+ | === Word to Word Phrase === | ||
+ | ==== Tiw-Eng ==== | ||
+ | {{transferTest|tiw|eng|jupijupi awi kiyija mirrawu | soup and a little bit of tobacco}} | ||
+ | ==== Tagger ==== | ||
+ | {{transferMorphTest|tiw|eng|jupijupi{{tag|n}} awi {{tag|cnjcoo}} kiyija{{tag|prn}}{{tag|qnt}} mirrawu {{tag|n}} | soup{{tag|n}}{{tag|sg}} and {{tag|cnjcoo}} a little bit {{tag|adv}} of {{tag|pr}} tobacco {{tag|n}}{{tag|sg}} }} | ||
+ | ==== Biltrans ==== | ||
+ | ^jupijupi<n>/soup<n>$ ^awi<cnjcoo>/and<cnjcoo>$ ^kiyija<prn><qnt>/a little bit<adv>$ ^mirrawu<n>/tobacco<n>$^.<sent>/.<sent>$ | ||
+ | ==== Transfer ==== | ||
+ | ^soup<n><sg>$ ^and<cnjcoo>$ ^a little bit<adv>$ ^of<pr>$ ^tobacco<n><sg>$^.<sent>$ | ||
== Post Evaluation == | == Post Evaluation == | ||
+ | ===Statistics about input files=== | ||
+ | Number of words in reference: 41 | ||
+ | |||
+ | Number of words in test: 28 | ||
+ | |||
+ | Number of unknown words (marked with a star) in test: 13 | ||
+ | |||
+ | Percentage of unknown words: 46.43% | ||
+ | |||
+ | ===Results when removing unknown-word marks (stars)=== | ||
+ | Edit distance: 36 | ||
+ | |||
+ | Word error rate (WER): 87.8% | ||
+ | |||
+ | Number of position-independent correct words: 5 | ||
+ | |||
+ | Position-independent word error rate (PER): 87.8% | ||
+ | |||
+ | ===Results when unknown-word marks (stars) are not removed=== | ||
+ | Edit distance: 36 | ||
+ | |||
+ | Word error rate (WER): 87.8% | ||
+ | |||
+ | Number of position-independent correct words: 5 | ||
+ | |||
+ | Position-independent word error rate (PER): 87.8% | ||
+ | |||
+ | ===Statistics about the translation of unknown words=== | ||
+ | Number of unknown words which were free rides: 0 | ||
+ | |||
+ | Percentage of unknown words that were free rides: 0% | ||
− | |||
[[Category:Sp21_StructuralTransfer]] [[Category:Tiwi]] [[Category:English]] | [[Category:Sp21_StructuralTransfer]] [[Category:Tiwi]] [[Category:English]] |
Latest revision as of 13:34, 8 May 2021
Contents
Pre-Evaluation
Statistics about input files
Number of words in reference: 41
Number of words in test: 28
Number of unknown words (marked with a star) in test: 13
Percentage of unknown words: 14.63%
Results when removing unknown-word marks (stars)
Edit distance: 36
Word error rate (WER): 87.8%
Number of position-independent correct words: 5
Position-independent word error rate (PER): 87.8%
Results when unknown-word marks (stars) are not removed
Edit distance: 36
Word error rate (WER): 87.8%
Number of position-independent correct words: 5
Position-independent word error rate (PER): 87.8%
Statistics about the translation of unknown words
Number of unknown words which were free rides: 0
Percentage of unknown words that were free rides: 0%
Examples
Gendered Nouns and Adjectives
Tiw-Eng
(tiw) ngarra kijinga → (eng) he is small
Tagger
(tiw) ngarra<prn><pers><p3><m><sg> kiji <adj><f> → (eng) prpers<prn><subj><p3><m><sg> be <vbser><pres><p3><sg> small <adj><sint>
Biltrans
^ngarra<prn><m><p3><sg>/he<prn><m><p3><sg>$ ^kiji<adj><f>/small<adj><sint><f>$^.<sent>/.<sent>$
Transfer
^he<prn><p3><m><sg>$ ^be<vbser><pres><p3><sg>$ ^small<adj><sint>$^.<sent>$
Insert "Be" Verbs
Tiw-Eng
(tiw) kamini naki? → (eng) what is this?
Tagger
(tiw) kami<prn><itg><m> naki<prn><dem><sg> → (eng) what <prn><itg><mf><sp> be <vbser><pres><p3><sg> this<prn><dem><mf><sg>
Biltrans
^kami<prn><itg><m>/what<prn><itg><m>/which<prn><itg><m>$ ^naki<prn><dem><sg>/this><prn><dem><sg>$^.<sent>/.<sent>$
Transfer
^what<prn><itg><m>$ ^be<vbser><pres><p3><sg>$ ^this><prn><dem><sg>$^.<sent>$
Word to Word Phrase
Tiw-Eng
(tiw) jupijupi awi kiyija mirrawu → (eng) soup and a little bit of tobacco
Tagger
(tiw) jupijupi<n> awi <cnjcoo> kiyija<prn><qnt> mirrawu <n> → (eng) soup<n><sg> and <cnjcoo> a little bit <adv> of <pr> tobacco <n><sg>
Biltrans
^jupijupi<n>/soup<n>$ ^awi<cnjcoo>/and<cnjcoo>$ ^kiyija<prn><qnt>/a little bit<adv>$ ^mirrawu<n>/tobacco<n>$^.<sent>/.<sent>$
Transfer
^soup<n><sg>$ ^and<cnjcoo>$ ^a little bit<adv>$ ^of<pr>$ ^tobacco<n><sg>$^.<sent>$
Post Evaluation
Statistics about input files
Number of words in reference: 41
Number of words in test: 28
Number of unknown words (marked with a star) in test: 13
Percentage of unknown words: 46.43%
Results when removing unknown-word marks (stars)
Edit distance: 36
Word error rate (WER): 87.8%
Number of position-independent correct words: 5
Position-independent word error rate (PER): 87.8%
Results when unknown-word marks (stars) are not removed
Edit distance: 36
Word error rate (WER): 87.8%
Number of position-independent correct words: 5
Position-independent word error rate (PER): 87.8%
Statistics about the translation of unknown words
Number of unknown words which were free rides: 0
Percentage of unknown words that were free rides: 0%