Difference between revisions of "Waray/Disambiguation"
From LING073
(→Added Rules) |
(→After Disambiguation) |
||
Line 41: | Line 41: | ||
Ambiguity after disambiguation: ~1.05008077544426494346* | Ambiguity after disambiguation: ~1.05008077544426494346* | ||
− | * ''The ambiguity is the same because there is not yet enough context in the lexicon for the disambiguation to properly disambiguate'' | + | * Total number of forms: 136 |
+ | * Number of unique forms: 133 | ||
+ | |||
+ | ''*The ambiguity is the same because there is not yet enough context in the lexicon for the disambiguation to properly disambiguate'' | ||
+ | |||
[[Category:Sp21_Disambiguation]] [[Category:Waray]] | [[Category:Sp21_Disambiguation]] [[Category:Waray]] |
Revision as of 14:56, 29 April 2021
Contents
Morphological Disambiguation
Initial Level(s) of Ambiguity
Ambiguity before disambiguation: ~1.05008077544426494346
Differentiating a Pronoun from a Determiner
There are two readings of 'hira' ('they') and its Class II possessive variation 'nira' ('their'): while common nouns usually proceed the word 'hira', the following words can help determine whether or not the word is a pronoun or a determiner. In the determiner case, 'hira' is usually followed by at least two—usually proper—nouns (a compound subject).
- Pronoun Reading
- Kwarta nira [Their money] (Not in corpus)
- Before Disambiguation:
- ^Kwarta/kwarta<n>$ ^nira/hira<prn><pers><p3><pl><pos>/ni<det><pos>+ra<det><pl>$^./.<sent>$
- After Disambiguation:
- ^Kwarta<n>$ ^hira<prn><pers><p3><pl><pos>$^.<sent>$
- Before Disambiguation:
- Ngan amo ini an gintikangan pagbuhat nira [They have begun to do this]
- Before Disambiguation:
- After Disambiguation:
- ^Ngan<det><pl>$ ^*amo$ ^ini<prn><det><dem>$ ^an<det><nom>$ ^*gintikangan$ ^*pagbuhat$ ^hira<prn><pers><p3><pl><pos>$^.<sent>$
- Determiner Reading
- Kwarta nira Rico ngan Bobong [Rico and Bobong's money OR Money of Rico and Bobong]
- Before Disambiguation:
- ^Kwarta/kwarta<n>$ ^nira/hira<prn><pers><p3><pl><pos>/ni<det><pos>+ra<det><pl>$ ^Rico/Rico<np><ant><m>$ ^ngan/ngan<det><pl>$ ^Bobong/Bobong<np><ant><m>$^./.<sent>$
- After Disambiguation:
- ^Kwarta<n>$ ^ni<det><pos>+ra<det><pl>$ ^Rico<np><ant><m>$ ^ngan<det><pl>$ ^Bobong<np><ant><m>$^.<sent>$
- Before Disambiguation:
- Mgbantay pa hiya ha balay nira ni Rhoda ngan Romulo [He will still guard Rhoda and Romulo’s house]
- Before Disambiguation:
- After Disambiguation:
- ^*Mgbantay$ ^pa<adv>$ ^hiya<prn><pers><p3><sg><nom>$ ^ha<det><obl>$ ^*balay$ ^hira<prn><pers><p3><pl><pos>$ ^ni<det><pos>$ ^*Rhoda$ ^ngan<det><pl>$ ^*Romulo$^.<sent>$
Added Rules
- If word precedes at least one proper noun, select determiner reading
- SELECT Determiner IF (0 (prn)) (1 (np)) ;
- If word precedes a determiner and at least one proper noun, select determiner reading
- SELECT Determiner IF (0 (prn)) (1 (det)) (2 (np)) ;
- Pronoun case
- REMOVE Determiner IF (0 (prn)) (1 EOS) ;
After Disambiguation
Ambiguity after disambiguation: ~1.05008077544426494346*
- Total number of forms: 136
- Number of unique forms: 133
*The ambiguity is the same because there is not yet enough context in the lexicon for the disambiguation to properly disambiguate