Difference between revisions of "User:Cayoh1/Final project"
(→Issues) |
(→Issues) |
||
Line 20: | Line 20: | ||
Some issues I've run into are implementing correctly the Yoruba characters and structure with apertium tools. Yoruba is a tonal language and has many characters in the vocabulary are accented. This led to some difficulty in the beginning of characters not being represented correctly. | Some issues I've run into are implementing correctly the Yoruba characters and structure with apertium tools. Yoruba is a tonal language and has many characters in the vocabulary are accented. This led to some difficulty in the beginning of characters not being represented correctly. | ||
− | [[File:Vowels. | + | [[File:Vowels.png]] |
How you're approaching the solution to the overall problem, including how you're implementing the solution. You can talk here about smaller individual issues that have arisen as well. | How you're approaching the solution to the overall problem, including how you're implementing the solution. You can talk here about smaller individual issues that have arisen as well. |
Revision as of 18:29, 18 May 2021
Contents
Outline
I have created a very preliminary, basic transducer for Yoruba, my native Nigerian Language. I have used what we've learned in class to build morphological analysis capabilities for my transducer.
- I added noun, verb, pronoun, adjectives, and interjections to the
lexd
file. - In my grammar documentation, I created
morphTests
to test morphological analysis. - I started writing rules in the
twol
file.
Code/ Resources
Solution
Background
Essentially, even though there are up to 55 million speakers of Yoruba from Southwestern Nigeria, it is still a relatively low resource language. It does have limited Google Translate functionality and some documentation, but they could use a lot of improvement.
Approach
With my knowledge of the language, growing up in a Yoruba speaking household and my knowledge of morphological analysis, I aimed to create a viable apertium transducer for the language that reflects how current speakers of the language use it. I have been checking with my family members to ensure my implementation is adequate.
Issues
Some issues I've run into are implementing correctly the Yoruba characters and structure with apertium tools. Yoruba is a tonal language and has many characters in the vocabulary are accented. This led to some difficulty in the beginning of characters not being represented correctly.
How you're approaching the solution to the overall problem, including how you're implementing the solution. You can talk here about smaller individual issues that have arisen as well.
Evaluation
Moving Forward
Your thoughts on who might benefit from your project and in what way. E.g., might a language community be able to find a use for what you're doing, or maybe [computational] linguists working on a language or issue? Remember that Bird (2020) advocates for evaluating in this way—it just might not be possible to do so within the context of this class.