Difference between revisions of "User:Cayoh1/Final project"

From LING073
Jump to: navigation, search
(Issues)
(Issues)
Line 20: Line 20:
 
Some issues I've run into are implementing correctly the Yoruba characters and structure with apertium tools. Yoruba is a tonal language and has many characters in the vocabulary are accented. This led to some difficulty in the beginning of characters not being represented correctly.  
 
Some issues I've run into are implementing correctly the Yoruba characters and structure with apertium tools. Yoruba is a tonal language and has many characters in the vocabulary are accented. This led to some difficulty in the beginning of characters not being represented correctly.  
  
[[File:Vowels.png]]  
+
[[File:Vowels.png]?maxwidth=600]  
  
 
How you're approaching the solution to the overall problem, including how you're implementing the solution. You can talk here about smaller individual issues that have arisen as well.
 
How you're approaching the solution to the overall problem, including how you're implementing the solution. You can talk here about smaller individual issues that have arisen as well.

Revision as of 18:30, 18 May 2021

Outline

I have created a very preliminary, basic transducer for Yoruba, my native Nigerian Language. I have used what we've learned in class to build morphological analysis capabilities for my transducer.

  • I added noun, verb, pronoun, adjectives, and interjections to the lexd file.
  • In my grammar documentation, I created morphTests to test morphological analysis.
  • I started writing rules in the twol file.

Code/ Resources

Github

Grammar Documentation

Solution

Background

Essentially, even though there are up to 55 million speakers of Yoruba from Southwestern Nigeria, it is still a relatively low resource language. It does have limited Google Translate functionality and some documentation, but they could use a lot of improvement.

Approach

With my knowledge of the language, growing up in a Yoruba speaking household and my knowledge of morphological analysis, I aimed to create a viable apertium transducer for the language that reflects how current speakers of the language use it. I have been checking with my family members to ensure my implementation is adequate.

Issues

Some issues I've run into are implementing correctly the Yoruba characters and structure with apertium tools. Yoruba is a tonal language and has many characters in the vocabulary are accented. This led to some difficulty in the beginning of characters not being represented correctly.

[[File:Vowels.png]?maxwidth=600]

How you're approaching the solution to the overall problem, including how you're implementing the solution. You can talk here about smaller individual issues that have arisen as well.

Evaluation

Moving Forward

Your thoughts on who might benefit from your project and in what way. E.g., might a language community be able to find a use for what you're doing, or maybe [computational] linguists working on a language or issue? Remember that Bird (2020) advocates for evaluating in this way—it just might not be possible to do so within the context of this class.