Difference between revisions of "Miskito and English"
From LING073
(→Final Evaluation) |
(→Final Evaluation) |
||
Line 82: | Line 82: | ||
** Stems in Transducer: 344 | ** Stems in Transducer: 344 | ||
− | ====MT miq | + | ====MT miq → eng==== |
+ | Tests run on miq.longer.txt from the bilingual repo | ||
+ | |||
+ | * Word error rate (WER): 91.11 % | ||
+ | * Position-independent word error rate (PER): 73.33 % | ||
==Developed Resources== | ==Developed Resources== |
Revision as of 01:28, 22 May 2021
Note: Resources for machine translation between Miskito and English
Initial Miq → Eng Evaluation
Corpus Coverage
Monolingual coverage: 52 / 116 (~0.44827586206896551724)
Bilingual coverage: coverage: 37 / 118 (~0.31355932203389830508)
Sentence Analysis
Sentence 1
Yang wal wîna kalila piras
- Intended: I don’t eat meat and chicken.
- ^Yang<prn><p1>/I<prn><p1><sg>$ ^wal<post>/with<pr>$ ^wîna<n>/meat<n>$ ^kalila<n>/chicken<n>$ ^piaia<v><pres><neg>/eat<vblex><pres><neg>$^.<sent>/.<sent>$
- #I with #meat #chicken #eat
Sentence 2
Yang siknis brisna mango.
- Intended: I’m allergic to mangos.
- ^Yang<prn><p1>/I<prn><p1><sg>$ ^siknis<n>/allergy<n>$ ^briaia<v><pres><p1>/have<vbhaver><pres><p1>$ ^mango<n>/mango<n>$^.<sent>/.<sent>$
- #I #allergy #have #mango
Sentence 3
Dia prais brisma cuartocana timia kumi?
- Intended: How much is a room?
- ^Dia<det><itg>/What<det><itg>$ ^prais<n>/cost<n>$ ^briaia<v><pres><p2>/have<vbhaver><pres><p2>$ ^cuartocana<n>/room<n>$ ^timia<n>/night<n>$ ^kumi<adj>/one<adj>$^?<sent>/?<sent>$^.<sent>/.<sent>$
- #What #cost #have #room #one #night?
Sentence 4
- Intended: Is there a place to sleep?
- Lexical transfer: *Manuapo *ansara #place *barsara *yapaya?
- Miskito: Manuapo ansara pleiski barsara yapaya?
Sentence 5
- Intended: Is there a place to wash clothes ?
- Lexical transfer: *Nara *pleis *kum *apo *cuala *tuskaya?
- Miskito: Nara pleis kum apo cuala tuskaya?
Sentence 6
- Intended: Tomorrow, I will stay at this lovely hotel.
- Lexical transfer: #I *yauhka *takash *keisna *naha *hotel *pain.
- Miskito: Yang yauhka takash keisna naha hotel pain.
Sentence 7
- Intended: Is there a place to rent bikes?
- Lexical transfer: *Pleeskum *barsekei *baisikil *rin *monaya?
- Miskito: Pleeskum barsekei baisikil rin monaya?
Sentence 8
- Intended: What are you doing tonight?
- Lexical transfer: *Dia *dow *keisma *na #night.
- Miskito: Dia dow keisma na timia.
Sentence 9
- Intended: What do you think of the government?
- Lexical transfer: #You *dia *lukisma *naho *gabament?
- Miskito: Man dia lukisma naho gabament?
Sentence 10
- Intended: Today is Friday.
- Lexical transfer: *Naiwa *lika *praidi.
- Miskito: Naiwa lika praidi.
Final Evaluation
Additions
- 200+ stems added
- 2 new lexical selection rules
- sangni
- taya
- 3 new disambiguation rules in miq transducer for possessive nouns/verbs and object pronouns
- 1 new transfer rule for possessive nouns
- Also debugged plural nouns transfer rule
Evaluations
- Precision and Recall
- Coverage over large corpus
- ~70.08%
- Tokens in corpus: 13, 819
- Stems in Transducer: 344
MT miq → eng
Tests run on miq.longer.txt from the bilingual repo
- Word error rate (WER): 91.11 %
- Position-independent word error rate (PER): 73.33 %