Mixe/Disambiguation
Link to Git repository: https://github.swarthmore.edu/Ling073-sp22/ling073-mto
Contents
Initial evaluation of ambiguity
Using the script shown in the Calculating ambiguity section of the Morphological Disambiguator wiki page, the initial level of ambiguity in our corpus is about 1.025.
EDIT: After determining that juuꞌ is only a relativizer, not a demonstrative, corpus ambiguity is apparently 1.
forms in transducer: 204288
unique forms in transducer:106575
- The reason there are so many is because it's outputting all possible spelling variations for each token. I'm not sure how to prevent this.
Ambiguous terms
pö’ö
Verb stem pö’ö "break" and noun pö’ö "sand"
"break"
I had to make up a sentence because (1) I couldn't find any example sentences with this word and, (2) I had trouble finding any other cases of ambiguity.
I started with an example sentence that includes a verb from the same non-agentive ambitransitive verb paradigm as pö’ö.
- xi nëëj ’ixtaamp
- xii nëëj ø=’ëjx-tam-p
- that water 3S.IND=BASE-spill-IDT.ICPL.INTR
- "That water spilled." (p75 of 312)
First, I switched out nëëj "wind" with pu'u "board/table", to try and make the sentence semantically sound. Then, I replaced independent incompletive aspect with independent completive aspect, thus changing the aspect suffix from -p to -ø. Finally, I switched the verb stem to pö’ö. This gets us, to the best of my knowledge:
- xi pu'u pö’ö
- xii pu'u ø=pö’ö-ø
- that board 3S.IND=break-IDT.CPL.INTR
- that board broke
The main thing I'm unclear on is whether the verb stem would take a different form. Some verb stems change more than others depending on aspect, transitivity, dependency, and construction type (direct, inverse). I haven't seen any clear pattern that would allow me to predict what shape pö’ö takes when it is an independent, completive, intransitive verb.
Transducer analysis for this sentence:
^xi/xii<det><dem>$ ^pu'u/pu'u<n><sg>$ ^pö’ö/pö’ö<v><idt><cpl><p3>/pö’ö<v><dep><cpl><p3>/pö’ö<v><dep><cpl><p1>/pö’ö<v><dep><icpl><p1>/pö’ö<v><dep><icpl><p3>/pö’ö<v><idt><cpl><p1>/pö’ö<n>$^./.<sent>$
"sand"
- kumöön pöꞌö yäkpätsïmjä
- kumöön pö’ö y=yäk-pä’äv-tsëm-jäy-I
- town sand 3S.DEP=PAS-EDGE-load-APL.R-ICPL.DEP
- "The sand is carried to the municipality."
Note that I'm treating pä’äv-tsëm as a verb stem pätsïm, since pä’äv is a lexical prefix.
Transducer analysis for this sentence:
^kumöön/kumöön<n><sg>$ ^pöꞌö/pöꞌö<n>/pöꞌö<v><idt><cpl><p3>$ ^yäkpätsïmjä/yäkpä’ävtsëm<v><tv><dep><icpl><pas><apl><p3>$^./.<sent>$
constraints
If there is another verb in the sentence, pöꞌö is likely a noun. If there is no other verb in the sentence, pöꞌö is almost certainly a verb.