Difference between revisions of "User:Cayoh1/Final project"

From LING073
Jump to: navigation, search
(Issues)
(Issues)
Line 22: Line 22:
 
[[File:Vowels.png|upright=0.5]]  
 
[[File:Vowels.png|upright=0.5]]  
  
How you're approaching the solution to the overall problem, including how you're implementing the solution. You can talk here about smaller individual issues that have arisen as well.
+
Also Yoruba doesn't have gendered or plural forms of nouns or adjectives, you express gender in some cases by adding another word, for example: <code>older sister</code> in Yoruba would be <code>ẹ̀gbọ́n obìnrin</code> with the first word meaning older sibling and the second word signifying female. Similar to Romance languages like French and Spanish, Yoruba also distinguishes verbs and interjections by familiarity; here are different ways to say things depending on the recipient.
  
 
== Evaluation ==
 
== Evaluation ==

Revision as of 18:36, 18 May 2021

Outline

I have created a very preliminary, basic transducer for Yoruba, my native Nigerian Language. I have used what we've learned in class to build morphological analysis capabilities for my transducer.

  • I added noun, verb, pronoun, adjectives, and interjections to the lexd file.
  • In my grammar documentation, I created morphTests to test morphological analysis.
  • I started writing rules in the twol file.

Code/ Resources

Github

Grammar Documentation

Solution

Background

Essentially, even though there are up to 55 million speakers of Yoruba from Southwestern Nigeria, it is still a relatively low resource language. It does have limited Google Translate functionality and some documentation, but they could use a lot of improvement.

Approach

With my knowledge of the language, growing up in a Yoruba speaking household and my knowledge of morphological analysis, I aimed to create a viable apertium transducer for the language that reflects how current speakers of the language use it. I have been checking with my family members to ensure my implementation is adequate.

Issues

Some issues I've run into are implementing correctly the Yoruba characters and structure with apertium tools. Yoruba is a tonal language and has many characters in the vocabulary are accented. This led to some difficulty in the beginning of characters not being represented correctly.

Vowels.png

Also Yoruba doesn't have gendered or plural forms of nouns or adjectives, you express gender in some cases by adding another word, for example: older sister in Yoruba would be ẹ̀gbọ́n obìnrin with the first word meaning older sibling and the second word signifying female. Similar to Romance languages like French and Spanish, Yoruba also distinguishes verbs and interjections by familiarity; here are different ways to say things depending on the recipient.

Evaluation

Moving Forward

Your thoughts on who might benefit from your project and in what way. E.g., might a language community be able to find a use for what you're doing, or maybe [computational] linguists working on a language or issue? Remember that Bird (2020) advocates for evaluating in this way—it just might not be possible to do so within the context of this class.