Difference between revisions of "Central Kurdish/Disambiguation"

From LING073
Jump to: navigation, search
(Constraint Example with Sentences)
(Constraint Example with Sentences)
 
Line 19: Line 19:
  
 
* "ئەو سەگەکەی هێنا" (he brought the dog)
 
* "ئەو سەگەکەی هێنا" (he brought the dog)
Here the ێ should be analyzed as a personal suffix, because it is immediately followed by a transitive verb. The output was:
+
Here the ێ should be analyzed as a personal suffix, because it is immediately followed by a transitive verb. The output was: '''^سەگ<n><def><sg>+ئەو<prn><pers><p3><sg>$'''
 
 
'''^ئەو<prn><pers><p3><sg>$ ^سەگ<n><def><sg>+ئەو<prn><pers><p3><sg>$ ^هێنان<v><tv><past>$'''
 
  
 
== Final Evaluation of Ambiguity ==
 
== Final Evaluation of Ambiguity ==
  
 
* Final ambiguity in the corpus (Wikipedia): ~1.029 (-35%)
 
* Final ambiguity in the corpus (Wikipedia): ~1.029 (-35%)

Latest revision as of 20:16, 8 April 2021


Code

GitHub

Initial Evaluation of Ambiguity

  • Ambiguity in the transducer: ~1.07
  • Initial ambiguity in the corpus (Wikipedia): ~1.045

Constraint Example with Sentences

One source of ambiguity came from the ی suffix at the end of nouns. The suffix could be analyzed either as an izafa or as a personal suffix for a 3rd-person singular subject. The two sentences below demonstrate the successful disambiguation between the two forms in two specific scenarios. Both examples are contrived (not from an external corpus), though since the programmer is a native speaker of Sorani Kurdish it can essentially be considered authentic.

  • "سەگی گەورە" (big dog)

‌Here one would expect the ی to output as an izafa, because it is followed by a noun. The output was verified to be: ^سەگ<n><def><sg>+ی<pr>$

  • "ئەو سەگەکەی هێنا" (he brought the dog)

Here the ێ should be analyzed as a personal suffix, because it is immediately followed by a transitive verb. The output was: ^سەگ<n><def><sg>+ئەو<prn><pers><p3><sg>$

Final Evaluation of Ambiguity

  • Final ambiguity in the corpus (Wikipedia): ~1.029 (-35%)