Difference between revisions of "Okinawan and Miyako"

From LING073

Revision as of 22:45, 7 May 2017

Resources for machine translation between Okinawan and Miyako

Lexical Selection

  • In Miyako, はい can mean "field", "needle", or "south", and is also a causative auxiliary verb. If it is followed by the word for "string", it is probably "needle".
  • "Hand" and "arm" are the same word in Okinawan (てぃい), but different words in Miyako. If てぃい is followed by an instrumental, we assume it means "hand".
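
The two heuristics above can be sketched in Python as follows. This is only an illustration of the logic, not the actual lexical selection rules in the pair. The token representation, the function names, the "ins" tag name, and the set of "string" lemmas passed in by the caller are all assumptions made for the example.

```python
from typing import FrozenSet, List, Optional, Tuple

# A token is (lemma, set of tags). This representation is an assumption
# made for the sketch, not the format used by the actual rules.
Token = Tuple[str, FrozenSet[str]]


def select_hai(tokens: List[Token], i: int,
               string_lemmas: FrozenSet[str]) -> Optional[str]:
    """Miyako: pick 'needle' for はい when the next lemma means 'string'.

    string_lemmas is supplied by the caller (hypothetical parameter).
    Returns None when the heuristic does not apply.
    """
    if tokens[i][0] != "はい":
        return None
    if i + 1 < len(tokens) and tokens[i + 1][0] in string_lemmas:
        return "needle"
    return None


def select_tii(tokens: List[Token], i: int,
               instrumental_tag: str = "ins") -> Optional[str]:
    """Okinawan: pick 'hand' for てぃい when the next token is instrumental.

    The tag name 'ins' is an assumption. Returns None otherwise.
    """
    if tokens[i][0] != "てぃい":
        return None
    if i + 1 < len(tokens) and instrumental_tag in tokens[i + 1][1]:
        return "hand"
    return None
```

Returning None rather than a default sense leaves every other context undecided, since the notes above only say what happens in these two contexts.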

Evaluation

Evaluation as of lexical selection

ryu → mvi Evaluation

Evaluation of tests
  • WER: 100%
  • PER: 100%
  • Coverage: 89%
Evaluation of sentences
  • WER: 90.32%
  • PER: 90.32%
  • Coverage: 62.1%

mvi → ryu Evaluation

Evaluation of tests
  • WER: 88.89%
  • PER: 77.78%
  • Coverage: 100%
Evaluation of sentences
  • WER: 100%
  • PER: 100%
  • Coverage: 80.65%
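
For reference, WER and PER figures of the kind listed above can be computed roughly as in this Python sketch. It assumes whitespace-separated tokens and uses one common definition of PER (bag-of-words matching). The evaluation tool that produced the numbers above may tokenize or define PER differently, and the function names here are hypothetical.

```python
from collections import Counter


def wer(reference: str, hypothesis: str) -> float:
    """Word error rate: word-level edit distance / reference length."""
    ref, hyp = reference.split(), hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")
    # Levenshtein distance over words, keeping only one row at a time.
    prev = list(range(len(hyp) + 1))
    for i, r in enumerate(ref, 1):
        cur = [i]
        for j, h in enumerate(hyp, 1):
            cur.append(min(prev[j] + 1,              # deletion
                           cur[j - 1] + 1,           # insertion
                           prev[j - 1] + (r != h)))  # substitution or match
        prev = cur
    return prev[-1] / len(ref)


def per(reference: str, hypothesis: str) -> float:
    """Position-independent error rate: like WER, but ignoring word order."""
    ref, hyp = reference.split(), hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")
    # Words shared by both sides, counted with multiplicity.
    matches = sum((Counter(ref) & Counter(hyp)).values())
    errors = max(len(ref), len(hyp)) - matches
    return errors / len(ref)
```

Both functions return a fraction; multiply by 100 to get a percentage. PER is never higher than WER under these definitions, which matches the figures above.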

Final Evaluation

ryu Evaluation

  • Precision:
  • Recall:
  • Coverage of large:
  • Number of words in large:
  • Number of stems in the transducer:

mvi Evaluation

  • Precision:
  • Recall:
  • Coverage of large:
  • Number of words in large:
  • Number of stems in the transducer:

ryu → mvi Evaluation

Evaluation of longer
  • WER:
  • PER:
  • Coverage:
  • Trimmed coverage:
  • Number of tokens:
Evaluation of large
  • Trimmed coverage:
  • Number of tokens:

mvi → ryu Evaluation

Evaluation of longer
  • WER:
  • PER:
  • Coverage:
  • Trimmed coverage:
  • Number of tokens:
Evaluation of large
  • Trimmed coverage:
  • Number of tokens:

Expansion of Miyako transducer and ryu → mvi

Morphology
  • I added more numbers. 6 is not included because I am not sure what its form is.
  • I expanded the classifiers, adding days, portion, group, and people.
  • I changed how the focus marker works: it is now a lexicon rather than being hard-coded. This also means that the verbs which take focus markers can now get them.
    Old output: ^ぼーしなてぃどぅ/*ぼーしなてぃどぅ$
    New output: ^ぼーしなてぃどぅ/ぼーし<n><abs>+な<mod><quot><foc>$
    Old output: ^そぅだてぃどぅ/*そぅだてぃどぅ$
    New output: ^そぅだてぃどぅ/そぅだてぃ<v><cvb_abs><foc>$
  • I implemented the resultative and causal.
    Old output: ^あいば/*あいば$
    New output: ^あいば/あい<vaux><vaux>$
  • I hard-coded the forms I have for "do".
  • I added an additional lexicon that certain verb forms go through in order to attach "but" to the end of them.
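
The old and new outputs above are in the Apertium stream format, where an analysis beginning with * marks an unknown form. As a rough way to check whether a change like these turns unknown forms into analysed ones, a naive coverage figure can be computed with a sketch like the following. The function names are hypothetical, and the parser is simplified: it assumes no escaped characters in the stream.

```python
import re
from typing import List, Tuple

# One lexical unit looks like ^surface/analysis1/analysis2$.
# Simplification: escaped ^, / and $ are not handled.
LEXICAL_UNIT = re.compile(r"\^([^$]*)\$")


def parse_units(stream: str) -> List[Tuple[str, List[str]]]:
    """Split analyser output into (surface form, list of analyses)."""
    units = []
    for match in LEXICAL_UNIT.finditer(stream):
        surface, *analyses = match.group(1).split("/")
        units.append((surface, analyses))
    return units


def is_known(analyses: List[str]) -> bool:
    """A form is known if at least one analysis does not start with *."""
    return any(not a.startswith("*") for a in analyses)


def coverage(stream: str) -> float:
    """Fraction of lexical units that received at least one analysis."""
    units = parse_units(stream)
    if not units:
        return 0.0
    return sum(is_known(a) for _, a in units) / len(units)
```

For example, running coverage over the old and new output of the same text gives a quick before/after comparison for a morphology change.
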
twol
  • I added a rule such that the accusative marker becomes う after a form ending in a or u.
  • I added a rule such that the accusative marker takes the appropriate form in other contexts, for example becoming ぬ after a form ending in ん.
    Old output: ^みんぬ/みん<n><gen>$
    New output: ^みんぬ/みん<n><acc>/みん<n><gen>$
  • I added a rule such that {っ} goes to じ after a word ending in ず.
  • I added a rule such that ず goes to っ before the topic and accusative markers.
  • I added a rule such that す goes to っ before the accusative marker.
  • I added a rule such that {っ} goes to そ after a word ending in す and before an accusative marker.
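
The first two accusative rules can be illustrated with a small Python sketch. It is not the twol source, only a simulation of the surface alternation under several assumptions: stems are written in hiragana, "ending in a or u" is read off the final kana, and the kana sets below are my own and probably incomplete. Stems ending in す or ず are left undecided because the later rules in the list treat them specially, and stems ending in a long-vowel mark are also left undecided.

```python
from typing import Optional

# Kana whose vowel is a or u (assumed sets, not taken from the twol file).
A_FINAL = set("あかさたなはまやらわがざだばぱぁゃ")
U_FINAL = set("うくつぬふむゆるぐづぶぷぅゅ")  # す and ず deliberately left out


def accusative_surface(stem: str) -> Optional[str]:
    """Surface form of the accusative marker after stem, or None.

    None means this sketch does not cover the context.
    """
    if not stem:
        return None
    last = stem[-1]
    if last == "ん":
        return "ぬ"
    if last in A_FINAL or last in U_FINAL:
        return "う"
    return None


def attach_accusative(stem: str) -> Optional[str]:
    """Stem plus accusative marker, or None if the context is not covered."""
    marker = accusative_surface(stem)
    return None if marker is None else stem + marker
```

With the example from the output above, attach_accusative("みん") gives みんぬ, the form that now receives the <acc> analysis alongside <gen>.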

Expansion of Okinawan transducer and mvi → ryu