Central Kurdish/Transducer

From LING073
Revision as of 09:13, 19 March 2021 by Rkamal1 (talk | contribs) (Tests)


Code

GitHub Repository

Evaluation

Stems

The number of stems in each lexicon category is listed below:

  • 8 N-Stems
  • 4 Definite/Plural
  • 4 Verbs_Inf (infinitives)
  • 4 V-Stems_1
  • 4 V-Stems_2
  • 6 Subject_Prn
  • 4 Imperatives
  • 6 Prns
  • 3 Adj-Stem
  • 2 Comparatives
  • 3 Prepositions
  • 3 Conjunctions
  • 2 Adverbs
  • 2 Npast

Coverage

The total coverage over the corpus is 12.02%. This is an increase of 3 percentage points, which came from adding three common words for "water", "earth", and "god" (all just <n>), two prepositions for "which" and "on" (<pr>), and one conjunction for "so" (<conjcoo>). The current top unknown words are:

  • بوو
  • هەموو
  • با
  • ئەمە
  • فەرمووی

I know the meanings, but not the analyses, of all these words.
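As a rough illustration of how a coverage figure and a top-unknowns list like the ones above can be derived, here is a minimal sketch. It assumes naive token-level coverage (tokens with at least one analysis divided by all tokens). The function name `coverage_report`, the `is_known` callback, and the toy tokens are all hypothetical and are not taken from the repository; a real run would ask the compiled transducer whether each corpus token has an analysis.

```python
from collections import Counter


def coverage_report(tokens, is_known, top_n=5):
    """Return (coverage percent, most frequent unknown tokens).

    tokens   -- list of corpus tokens (strings)
    is_known -- hypothetical callback: True if the analyser returns
                at least one analysis for the token
    """
    if not tokens:
        return 0.0, []
    # Count every token the analyser cannot analyse.
    unknown = Counter(t for t in tokens if not is_known(t))
    known_count = len(tokens) - sum(unknown.values())
    percent = 100.0 * known_count / len(tokens)
    # Most frequent unknown tokens first.
    top_unknown = [word for word, _ in unknown.most_common(top_n)]
    return percent, top_unknown


# Toy data standing in for a corpus and a lexicon (placeholders only).
toy_tokens = ["a", "b", "a", "c", "b", "b"]
toy_lexicon = {"a"}

percent, unknowns = coverage_report(toy_tokens, lambda t: t in toy_lexicon)
print(f"{percent:.2f}% covered; top unknown tokens: {unknowns}")
```

Sorting the unknown tokens by frequency is what makes the list useful for deciding which stems to add next, since each high-frequency addition raises coverage the most.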

Tests

The transducer currently passes 70 of 101 tests (69%) on the main yaml file and 3 of 6 (50%) on the commonwords file. It seems to do well with noun morphology and most verb morphology.
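The percentages above follow directly from the pass counts. A small sketch of that arithmetic is given here; the helper name `pass_rate` is hypothetical, and the counts are the ones reported in this section.

```python
def pass_rate(passed, total):
    """Return the share of passing tests as a percentage."""
    if total <= 0:
        raise ValueError("total must be positive")
    return 100.0 * passed / total


# Counts as reported in this section.
results = [("main yaml file", 70, 101), ("commonwords file", 3, 6)]

for name, passed, total in results:
    print(f"{name}: {passed}/{total} ({pass_rate(passed, total):.0f}%)")
```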

Notes