Difference between revisions of "Fijian and English"
From LING073
(→fij → eng evaluation) |
(→eng → fij evaluation) |
||
Line 43: | Line 43: | ||
------------------------------------------------------- | ------------------------------------------------------- | ||
Number of words in reference: 62 | Number of words in reference: 62 | ||
− | Number of words in test: | + | Number of words in test: 56 |
− | Number of unknown words (marked with a star) in test: | + | Number of unknown words (marked with a star) in test: 2 |
− | Percentage of unknown words: | + | Percentage of unknown words: 3.57 % |
Results when removing unknown-word marks (stars) | Results when removing unknown-word marks (stars) | ||
Line 51: | Line 51: | ||
Edit distance: 52 | Edit distance: 52 | ||
Word error rate (WER): 83.87 % | Word error rate (WER): 83.87 % | ||
− | Number of position-independent correct words: | + | Number of position-independent correct words: 13 |
− | Position-independent word error rate (PER): | + | Position-independent word error rate (PER): 79.03 % |
Results when unknown-word marks (stars) are not removed | Results when unknown-word marks (stars) are not removed | ||
------------------------------------------------------- | ------------------------------------------------------- | ||
− | Edit distance: | + | Edit distance: 52 |
− | Word Error Rate (WER): | + | Word Error Rate (WER): 83.87 % |
Number of position-independent correct words: 13 | Number of position-independent correct words: 13 | ||
Position-independent word error rate (PER): 79.03 % | Position-independent word error rate (PER): 79.03 % | ||
Line 63: | Line 63: | ||
Statistics about the translation of unknown words | Statistics about the translation of unknown words | ||
------------------------------------------------------- | ------------------------------------------------------- | ||
− | Number of unknown words which were free rides: | + | Number of unknown words which were free rides: 0 |
− | Percentage of unknown words that were free rides: | + | Percentage of unknown words that were free rides: 0.00 % |
+ | |||
Revision as of 10:41, 3 April 2018
Resources for machine translation between Fijian and English
Contents
fij → eng evaluation
Current WER and PER:
Test file: 'fij-eng.tests.txt' Reference file 'eng.tests.txt' Statistics about input files ------------------------------------------------------- Number of words in reference: 56 Number of words in test: 59 Number of unknown words (marked with a star) in test: Percentage of unknown words: 0.00 % Results when removing unknown-word marks (stars) ------------------------------------------------------- Edit distance: 50 Word error rate (WER): 89.29 % Number of position-independent correct words: 12 Position-independent word error rate (PER): 83.93 % Results when unknown-word marks (stars) are not removed ------------------------------------------------------- Edit distance: 50 Word Error Rate (WER): 89.29 % Number of position-independent correct words: 12 Position-independent word error rate (PER): 83.93 % Statistics about the translation of unknown words ------------------------------------------------------- Number of unknown words which were free rides: 0 Percentage of unknown words that were free rides: 0%
eng → fij evaluation
Current WER and PER :
Test file: 'eng-fij.tests.txt' Reference file 'fij.tests.txt' Statistics about input files ------------------------------------------------------- Number of words in reference: 62 Number of words in test: 56 Number of unknown words (marked with a star) in test: 2 Percentage of unknown words: 3.57 % Results when removing unknown-word marks (stars) ------------------------------------------------------- Edit distance: 52 Word error rate (WER): 83.87 % Number of position-independent correct words: 13 Position-independent word error rate (PER): 79.03 % Results when unknown-word marks (stars) are not removed ------------------------------------------------------- Edit distance: 52 Word Error Rate (WER): 83.87 % Number of position-independent correct words: 13 Position-independent word error rate (PER): 79.03 % Statistics about the translation of unknown words ------------------------------------------------------- Number of unknown words which were free rides: 0 Percentage of unknown words that were free rides: 0.00 %
Lexical Selection
eng → fij one-to-many mapping
- Case 1: Pelu and lo’i describe two different kinds of bending action.
(eng) bend → (fij) pelu (e.g. bend of metal)
(eng) bend → (fij) lo’i (e.g. bend at a joint)
- Case 2: The third person singular pronoun in English does not distinguish between nominative and accusative case.
(eng) it → (fij) e (subj)
(eng) it → (fij) koya (obj)
fij → eng one-to-many mapping
- Case 1: The word levu can be used either as an adjective meaning "big", or a verb meaning "be a lot".
(fij) levu → (eng) big (adj)
(fij) levu → (eng) be a lot (v)
- Case2: Fijian does not distinguish genders on pronouns.
(fij) koya → (eng) him
(fij) koya → (eng) her
(fij) koya → (eng) it
Contrastive Grammar
https://wikis.swarthmore.edu/ling073/Fijian_and_English/Contrastive_Grammar
Developed Resources for Machine Translation
https://github.swarthmore.edu/hwang11/ling073-fij-eng
https://github.swarthmore.edu/hwang11/ling073-fij-eng-corpus