Bikol and Chamorro

From LING073

Resources for machine translation between Bikol and Chamorro

Developed Resources

Translation Repositories

Initial Evaluation

bcl → cha Evaluation

  • WER = 94.74%
  • PER = 84.21%
  • Proportion of stems translated correctly = 100.00%

cha → bcl Evaluation

  • WER = 90.91%
  • PER = 86.36%
  • Proportion of stems translated correctly = 100.00%
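
For reference, the two error rates can be computed along the following lines. This is a minimal Python sketch under common definitions: WER is the word-level edit distance divided by the reference length, and PER is the same kind of ratio with word order ignored. The function names are illustrative, the figures above were not produced with this code, and the evaluation tool actually used may define PER slightly differently.

```python
from collections import Counter


def edit_distance(ref, hyp):
    """Word-level Levenshtein distance between two token lists."""
    prev = list(range(len(hyp) + 1))
    for i, r in enumerate(ref, start=1):
        curr = [i]
        for j, h in enumerate(hyp, start=1):
            cost = 0 if r == h else 1
            curr.append(min(prev[j] + 1,          # word missing from hypothesis
                            curr[j - 1] + 1,      # extra word in hypothesis
                            prev[j - 1] + cost))  # match or substitution
        prev = curr
    return prev[-1]


def wer(reference, hypothesis):
    """Word error rate in percent, relative to the reference length."""
    ref, hyp = reference.split(), hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")
    return 100.0 * edit_distance(ref, hyp) / len(ref)


def per(reference, hypothesis):
    """Position-independent error rate in percent (word order ignored)."""
    ref, hyp = reference.split(), hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")
    # Bag-of-words overlap: each word counts as often as it occurs in both.
    matches = sum((Counter(ref) & Counter(hyp)).values())
    errors = max(len(ref), len(hyp)) - matches
    return 100.0 * errors / len(ref)
```

A hypothesis containing exactly the reference words in a different order scores a PER of 0 but a non-zero WER, which is why PER is never higher than WER under these definitions.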

Lexical Selection

Find at least two cases in each direction of a one-to-many mapping.

Bikol-Chamorro

bóla

In Bikol, bóla means ball or compliment. In Chamorro, ball = bola and compliment = komplimenta.

Lexical selection between bola and komplimenta is guided by the following rules:

  • Bola (ball) is chosen when:
    • Preceded by a verb such as throw, toss, catch, play, buy, borrow, lend, bounce
    • Followed by an adjective such as round or a color term
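
The context test above can be sketched in Python as follows. This is only an illustration of the logic, not the rule file used in the translation pair. English glosses stand in for the Bikol verb and adjective lemmas, which the rules do not list; the function name and the three-token look-back window are assumptions. The sketch returns None when no ball rule fires, because the page does not state when komplimenta is chosen.

```python
# English glosses stand in for Bikol lemmas (assumption: the rules above
# name the contexts only by their English meaning).
BALL_VERBS = {"throw", "toss", "catch", "play", "buy", "borrow", "lend", "bounce"}
BALL_ADJECTIVES = {"round"}  # color terms would be added here as well


def select_bola(tokens, index, window=3):
    """Return "bola" if a ball-context rule fires for tokens[index], else None.

    tokens is a list of lemmas or glosses; window is how many preceding
    tokens are searched for a ball verb (a hypothetical setting).
    """
    preceding = tokens[max(0, index - window):index]
    following = tokens[index + 1:index + 2]
    if any(tok in BALL_VERBS for tok in preceding):
        return "bola"
    if any(tok in BALL_ADJECTIVES for tok in following):
        return "bola"
    return None  # no rule matched; the choice is left to the system default
```

For instance, with the verb glossed as borrow and the other words left in Bikol, select_bola(["borrow", "mo", "si", "bóla", "ki", "Jose"], 3) selects bola.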

Example sentences

  • Bola (ball) is chosen in the following example:
    • Sublion mo si bola ki Jose (Borrow the ball from Jose)
  • Komplimenta (compliment) is chosen in the following example:
    • <example here>

glóbo

In Bikol, glóbo means map or balloon. In Chamorro, map = mapa and balloon = abubu.

For now, the default selection translates glóbo as abubu (balloon).

Chamorro-Bikol

  • In Chamorro, guakse means to scrub (clothes) or to turn inside out. In Bikol, to scrub (clothes) is kuró-kusó and to turn inside out is balík.
  • In Chamorro, maggem means cautious or damp. In Bikol, cautious is máan and damp is basá.


Structural Transfer

Final Evaluation

Polished RBMT System

Bcl-Cha

Additional Stems
  • 100 additional stems (verbs, adjectives, and nouns) were added to the bilingual dictionaries and both monolingual dictionaries.
Lexical Selection Rules
  • In Bikol, the adjective sadít can mean either (1) small, tiny; or (2) low, soft (as of a voice).
    • Despasio (low, soft) is chosen if the adjective sadít follows the noun bóses (voice).
    • Balaskes (small, tiny) is the default and is chosen in all other circumstances.
  • In Bikol, the adjective pasípiko can mean either (1) Pacific or (2) peaceful.
    • Pasífiku (Pacific) is chosen if the adjective pasípiko is in the exact phrase dágat na pasípiko (Pacific Ocean).
    • Mahgong (peaceful) is the default and is chosen in all other circumstances.
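
The two rules above share one pattern: a specific context selects the marked translation, and everything else falls through to the default. A minimal Python sketch of that pattern follows. It is not the rule file itself; the function names are hypothetical, target lemmas are written in lower case, and the sketch assumes that the linker na may stand between bóses and sadít, by analogy with dágat na pasípiko.

```python
LINKER = "na"  # linker as seen in the phrase "dágat na pasípiko"


def select_sadit(lemmas, index):
    """Translate sadít at lemmas[index]: despasio after bóses, else balaskes."""
    # Assumption: a linker between noun and adjective is skipped.
    before = [lemma for lemma in lemmas[:index] if lemma != LINKER]
    if before and before[-1] == "bóses":
        return "despasio"
    return "balaskes"  # default


def select_pasipiko(lemmas, index):
    """Translate pasípiko at lemmas[index]: pasífiku only in the exact phrase."""
    if lemmas[max(0, index - 2):index + 1] == ["dágat", "na", "pasípiko"]:
        return "pasífiku"
    return "mahgong"  # default
```

Keeping the default as the last return makes it explicit that exactly one translation is produced for every occurrence of the word.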

Bcl Transducer

Disambiguation (x2)
sa

The Bikol word "sa" has two readings, each illustrated by an example sentence.

  • Determiner: the
    • Nagdumán akó sa haróng na dakúla. (I went to the big house.)
    • Correct analysis: ^Nagdumán/dumán<vblex><past>$ ^akó/akó<prn><p1><sg>$ ^sa/sa<det><def>$ ^haróng/haróng<n>$ ^na/na<link>$ ^dakúla/dakúla<adj><sg>$^./.<sent>$
  • Preposition: to
    • Nagdumán akó sa Píli. (I went to Píli.)
    • Correct analysis: ^Nagdumán/dumán<vblex><past>$ ^akó/akó<prn><p1><sg>$ ^sa/sa<pr>$ ^Píli/Píli<cog>$^./.<sent>$
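
The analyses on this page use the ^surface/lemma<tag>...$ notation. To make the notation concrete, the following Python sketch splits such a string into (surface, lemma, tags) triples. It is a simplified reader written for illustration: it assumes one reading per unit and no escaped characters, and the name parse_analysis is hypothetical.

```python
import re

# One lexical unit: ^surface/lemma<tag1><tag2>...$
# Simplification: a single reading per unit and no escaped characters.
LU_PATTERN = re.compile(r"\^([^/$]+)/([^<$/]+)((?:<[^>]+>)*)\$")


def parse_analysis(stream):
    """Return a list of (surface, lemma, [tags]) for each unit in stream."""
    units = []
    for surface, lemma, tags in LU_PATTERN.findall(stream):
        units.append((surface, lemma, re.findall(r"<([^>]+)>", tags)))
    return units


example = ("^Nagdumán/dumán<vblex><past>$ ^akó/akó<prn><p1><sg>$ "
           "^sa/sa<pr>$ ^Píli/Píli<cog>$^./.<sent>$")
for unit in parse_analysis(example):
    print(unit)
```

Read this way, the two analyses of sa differ only in the tag list of the third unit: det and def in the first sentence, pr in the second.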

dakúla

The Bikol word "dakúla" has two readings, each illustrated by an example sentence.

  • Adjective: big
    • Nagdumán akó sa haróng na dakúla. (I went to the big house.)
    • Correct analysis: ^Nagdumán/dumán<vblex><past>$ ^akó/akó<prn><p1><sg>$ ^sa/sa<det><def>$ ^haróng/haróng<n>$ ^na/na<link>$ ^dakúla/dakúla<adj><sg>$^./.<sent>$
  • Verb: to grow bigger
    • Dakúla an búrak. (A flower grows.)
    • Correct analysis: ^Dakúla/dakúla<vblex>$ ^an/an<det><ind>$ ^búrak/búrak<n>$^./.<sent>$
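
The page lists the target analyses but not the rules that select them. As an illustration only, the Python sketch below encodes one heuristic that is consistent with the four example sentences: sa is read as a preposition before a unit tagged cog and as a determiner otherwise, and dakúla is read as an adjective after the linker and as a verb otherwise. These conditions are assumptions inferred from the examples, not the transducer's actual constraints, and choose_reading is a hypothetical name.

```python
def choose_reading(tokens, i):
    """Pick one reading for tokens[i].

    tokens is a list of (surface, readings); each reading is a (lemma, tags)
    pair whose tags are a tuple of strings, part of speech first.
    """
    _, readings = tokens[i]
    if len(readings) == 1:
        return readings[0]

    # All tags seen on the neighbouring units, whatever their readings.
    prev_tags = {t for _, tags in tokens[i - 1][1] for t in tags} if i > 0 else set()
    next_tags = ({t for _, tags in tokens[i + 1][1] for t in tags}
                 if i + 1 < len(tokens) else set())

    lemma = readings[0][0]
    if lemma == "sa":
        # Assumed rule: preposition before a name (cog), determiner otherwise.
        wanted = "pr" if "cog" in next_tags else "det"
    elif lemma == "dakúla":
        # Assumed rule: adjective after the linker, verb otherwise.
        wanted = "adj" if "link" in prev_tags else "vblex"
    else:
        return readings[0]

    for reading in readings:
        if reading[1] and reading[1][0] == wanted:
            return reading
    return readings[0]  # fall back to the first reading if none matches
```

A real constraint grammar would need more contexts than these four sentences provide; the sketch only shows how neighbouring tags can decide between two readings of the same surface form.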