Difference between revisions of "Nuosu and Mandarin"
From LING073
(→External Resources) |
|||
Line 5: | Line 5: | ||
*[https://github.swarthmore.edu/Ling073-sp22/apertium-zho Mandarin(zho) Transducer Repository] | *[https://github.swarthmore.edu/Ling073-sp22/apertium-zho Mandarin(zho) Transducer Repository] | ||
*[https://github.com/AgnesWang000/Nuosu-Transducer Nuosu(iii) Transducer] | *[https://github.com/AgnesWang000/Nuosu-Transducer Nuosu(iii) Transducer] | ||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
==Polished RBMT System== | ==Polished RBMT System== | ||
Line 39: | Line 23: | ||
** Coverage: | ** Coverage: | ||
+ | == Additions == | ||
+ | ===Disambiguation=== | ||
+ | * Added a rule that distinguishes adverbs and main verbs (ex. ꃅ) | ||
+ | * Added a rule that distinguishes exhaustion particles and main verbs for (ex. ꌐ) | ||
+ | * Added a rule that distinguishes end-of-sentence particles and main verbs for (ex. ꄉ) | ||
+ | ===Structural Transfer=== | ||
+ | * Added one more sentence for each of the five rules, bringing the total to 10 sentences | ||
+ | * Added a rule which added an adjective-specific marker when translating to Mandarin. | ||
+ | * Added a rule which specified the explicit topic marker in Nuosu | ||
+ | * Changed the rule that specified the organization of NP and ClP phrases | ||
− | + | ===Adding Stems=== | |
+ | * Added 100 new stems | ||
[[Category: Nuosu]] [[Category: sp22_TranslationPairs]] | [[Category: Nuosu]] [[Category: sp22_TranslationPairs]] |
Revision as of 21:04, 12 May 2022
Contents
Resources for machine translation between Nuosu and English
Developed Resources
- Nuosu-Mandarin Machine Translation Repository
- Mandarin(zho) Transducer Repository
- Nuosu(iii) Transducer
Polished RBMT System
Nuosu Transducer
- Precision: %
- Recall: %
- Coverage over large corpus: 182179 / 243953 (~0.74677909269408451628) remaining unknown forms: 61774
- Stems in transducer: 276
MT iii->zho
- Over : iii.longer.text
- Word Error Rate (WER): %
- Position-independent word error rate (PER): %
- Percentage of unknown words: %
- Number of position-independent correct words:
- Coverage:
Additions
Disambiguation
- Added a rule that distinguishes adverbs and main verbs (ex. ꃅ)
- Added a rule that distinguishes exhaustion particles and main verbs for (ex. ꌐ)
- Added a rule that distinguishes end-of-sentence particles and main verbs for (ex. ꄉ)
Structural Transfer
- Added one more sentence for each of the five rules, bringing the total to 10 sentences
- Added a rule which added an adjective-specific marker when translating to Mandarin.
- Added a rule which specified the explicit topic marker in Nuosu
- Changed the rule that specified the organization of NP and ClP phrases
Adding Stems
- Added 100 new stems