Purépecha/Transducer
From LING073
Contents
Code
Github Repo[1]
Tests
- As of now, our Transducer passes 110/197 tests generated from our Wikipedia page
Lexical Info
- Lexicons: 10
- Lexicon entries: 80
- Patterns: 2
- Pattern entries: 5
Counts for individual lexicons:
- NounRoot: 3
- RegNounInfl: 2
- ObjectRoot: 19
- Object: 1
- Punctuation: 22
- V-Stem: 13
- AspectTime: 10
- ModeInterrogative: 9
- All anonymous lexicons: 1
Coverage
- Current coverage: 14.9%, (16590/111120)
«by adding "ka<det> ↔ and" "Jose<n><sg> ↔ Joseph" "ma<num> ↔ one" "Mariani<n><sg> ↔ Maria" "Babilonia<n>{{tag}sg}} ↔ Babylon" "jimbo<det> ↔ for" to the transducer, coverage went from 14.9% to 33.7%»
Notes
- There are some more complex grammar forms that we aren't sure how to code yet.
- Originally, our corpus was primarily taken from tweets by a native Purepechan, but we were able to find a Bible in Purepechan that we added to our corpus.