Biak/Disambiguation

From LING073
Revision as of 14:27, 4 April 2021 by Tmou1 (talk | contribs) (Final Evaluation of Ambiguity)

Jump to: navigation, search

Our Code

The .rlx file on our Github

Initial Evaluation of Ambiguity

  • Our initial ambiguity = 1.012
  • Some examples:
    • mran<v><iv><p1><pl><inc> ra<pr>/ra<cnjsub> vanda<n> ↔ komran ra vanda # We walk along the seaside
    • saroi<n>+i<prn><pers><p3><sg><spc> orn<v><tv><p3><sg> si<prn><pers><p3><pl><an><spc>+ya<det><dem> inm<v><tv><p3><an> monda<adv> ra<pr>/ra<cnjsub> mar<v><iv><p3><pl><an> ↔ saroiya dorn siya, siya sinm monda ra simar # The whale ate some, others drank (river water) only until they died.

Final Evaluation of Ambiguity

  • Final ambiguity = 1.010
  • We created a rule which selects the conjunction if the next word is a verb, in which case it cannot be a preposition. This removed the ambiguity described in the examples above.
  • We found ambiguity between the possessive inalienable noun 'nari' (tooth 1/3sg) and 'nari' meaning will, to indicate some sort of future tense.
    • There is a lack of examples and documentation on these forms, particularly the tense form, which was unmentioned in our comprehensive grammar. We have yet to determine a steadfast rule to disambiguate these words grammatically.
    • We were able to cover some cases, where we remove the auxiliary 'will' if there are no more verbs in the sentence. This will be helpful when the noun is being used as a subject.
  • Ambiguity should be able to be lowered by increasing corpus coverage (in particular, adding more verb stems).