Difference between revisions of "Wamesa and Tongan"
(→Lexical Selection) |
|||
Line 24: | Line 24: | ||
Documentation for Lexical Selection can be found [[Wamesa_and_Tongan/Lexical_Selection | here]] | Documentation for Lexical Selection can be found [[Wamesa_and_Tongan/Lexical_Selection | here]] | ||
====wad → ton==== | ====wad → ton==== | ||
− | + | In Wamesa, the 3rd person plural is indicated by a morpheme, but in Tongan, it can appear as a few different words.<br/> | |
− | + | * (wad) se- → ou (ton) | |
− | + | * (wad) se- → kinautolu (ton) | |
− | + | Wamesa only has one word for ''sick'', while Tongan has a few.<br/> | |
− | + | * (wad) pota → puke (ton) | |
− | + | * (wad) pota → ʻa (ton) | |
− | + | (re = eye, landward; bo = row, fruit) | |
− | The Wamesa word ''re'' can mean ''eye'' or ''landward/inland'' | + | The Wamesa word ''re'' can mean ''eye'' or ''landward/inland''. |
*(wad) re → (eye) mata (ton) | *(wad) re → (eye) mata (ton) | ||
*(wad) re → (landward) (ton) | *(wad) re → (landward) (ton) | ||
+ | In Wamesa, ''bo'' means ''fruit'' or ''to row''. | ||
*(wad) bo → (fruit) foʻi ʻakau (ton) | *(wad) bo → (fruit) foʻi ʻakau (ton) | ||
*(wad) bo → (to row) (ton) | *(wad) bo → (to row) (ton) |
Revision as of 12:12, 28 March 2017
This page is a resource for machine translation between Wamesa and Tongan.
Contents
wad → ton evaluation
On the tests
file, 77.47% of the words translate correctly, and the unknown word rate is 28.57%. The WER and PER are both 71.43% when *'d words are removed, and 100% when not removing *'s.
ton → wad evaluation
Percentage of unknown words in ton.sentences.txt (bilingual corpus): 7% Percentage of unknown words in test file: 0%
Results when removing unknown word marks (stars) | WER: 100% PER: 100% Number of position-independent correct words: 1
Results when unknown word marks (stars) not removed | WER: 100% PER: 100%
- There is still one bug that we are trying to fix. siʻaku is being translated to #<poss>, but it should translate to nothing as there is no equivalent in Wamesa. The percentages above are when the incorrect #<poss> is taken out of the ton-wad.tests.txt file. Otherwise, the percentage increases to 114.29%.
Lexical Selection
Documentation for Lexical Selection can be found here
wad → ton
In Wamesa, the 3rd person plural is indicated by a morpheme, but in Tongan, it can appear as a few different words.
- (wad) se- → ou (ton)
- (wad) se- → kinautolu (ton)
Wamesa only has one word for sick, while Tongan has a few.
- (wad) pota → puke (ton)
- (wad) pota → ʻa (ton)
(re = eye, landward; bo = row, fruit)
The Wamesa word re can mean eye or landward/inland.
- (wad) re → (eye) mata (ton)
- (wad) re → (landward) (ton)
In Wamesa, bo means fruit or to row.
- (wad) bo → (fruit) foʻi ʻakau (ton)
- (wad) bo → (to row) (ton)
ton → wad
Tongan only has one word for to swim, while Wamesa has a few.
- (ton) kakau → as (wad)
- (ton) kakau → kubira (wad)
Tongan has one word for ocean, while Wamesa has a couple.
- (ton) moana → sasi (wad)
- (ton) moana → rawanam (wad)
Documentation
Contrastive Grammar Lexical Selection