Central Kurdish
From LING073
Below is a list of resources relevant to the Sorani Kurdish language. Resources are categorized according to type of content. I own italicized resources in PDF format. I have also flagged resources I have not yet obtained.
For the corpus assembly lab, a repository of plain text files has been created using excerpts from some of the resources listed below.
Contents
External Resources
Computational Resources
- Keyboard Layout (seems to be quite common)
- .txt lists of words, word-level digrams, and character-level trigrams
- Latin-based phonetic keyboard layout that I own on my computer
- does this mean that it came with your OS or that it's a custom one you (or someone else?) created? -Jonathan
- Rule-based Kurdish Text Transliteration System: Latin-Arabic orthography conversion
- Kurdish Speech to Text: early attempt to develop speech recognition for Sorani Kurdish
- Towards Machine Translation for the Kurdish Language: Sorani machine translation model
Dictionaries
- Sorani Vocabulary: vocab list with Latin script transliterations, by Harvard
- The Kurdish Dictionary: translates to and from any combination of ckb/kur/eng via query
- Dictionary of Scientific Terms: includes Sorani definitions of terms like "atom"
- Diccionaire Fondamental Kurde-Français-Sorani: French-Sorani dict with phrases & alphabet
Grammatical Descriptions
- Sorani Grammar: high-level description of important grammatical properties of Sorani
- Kurdish Basic Course: has helpful chapters on grammar and word lists
- A Reference Grammar with Selected Readings: extensive descriptions of Sorani grammar
Scientific Works
- Building a Lemmatizer and a Spell-Checker for Sorani Kurdish: includes background on Sorani morphology
- Kurdish Romanization Table: potentially useful information on Sorani alphabet and orthography
- Kurdish Interdialect Machine Translation: suggests a method for Kurmanji-Sorani machine translation
- Building a Test Collection for Sorani Kurdish: outlines a Test Collection project + list of affixes
Corpora
- OLAC Resources (how do I access these?)
- Try clicking the links on that page and then looking for "Identifier (URI)" - Daniel
- Sorani Wikipedia: many articles averaging a few paragraphs in length
- Sorani Bible: one of a few different versions of the Bible
- JW Website: few dozen entries accompanied by voice narration
- Hawler Gov: various entries on the Kurdish capital
- شازاده چکۆله: Sorani version of The Little Prince, by Aso Abdullah
- Abas Shiwan's blog (with Latin script transliterations)
Developed Resources
- Keyboard (see Wiki page)
- Transducer (see Wiki page)