Central Kurdish/Transducer
From LING073
Contents
Code
Evaluation
Stems
The total number of stems can be found below:
- 8 N-Stems
- 4 Definite/Plural
- 4 Verbs_Inf (infinitives)
- 4 V-Stems_1
- 4 V-Stems_2
- 6 Subject_Prn
- 4 Imperatives
- 6 Prns
- 3 Adj-Stem
- 2 Comparatives
- 3 Prepositions
- 3 Conjunctions
- 2 Adverbs
- 2 Npast
Coverage
The total coverage over the corpus was is 12.02%. This was an increase of 3 percentage points that came after adding three common words for "water", "earth", and "god" (all just <n>), two prepositions for "which" and "on" (<pr>), and one conjunction for "so" (<conjcoo>). The current top unknown words are:
- بوو
- هەموو
- با
- ئەمە
- فەرمووی
I know the meanings, but not the analyses, of all these words.
Tests
The transducer currently passes 70/101 (69%) tests on the main yaml file and 3/6 (50%) on the commonwords file. It seems to do well with noun morphology and most verb morphology.