Difference between revisions of "Central Kurdish"
From LING073
(→Computational Resources) |
(→Computational Resources) |
||
Line 11: | Line 11: | ||
* Latin-based keyboard layout that I own on my computer | * Latin-based keyboard layout that I own on my computer | ||
{{comment|: does this mean that it came with your OS or that it's a custom one you (or someone else?) created? -Jonathan}} | {{comment|: does this mean that it came with your OS or that it's a custom one you (or someone else?) created? -Jonathan}} | ||
+ | {{comment|: Someone else created it, I think Unikurd is its name, I cannot find a link to it but it is on my computer.}} | ||
* [https://arxiv.org/pdf/1811.10278.pdf Rule-based Kurdish Text Transliteration System]: Latin-Arabic orthography conversion | * [https://arxiv.org/pdf/1811.10278.pdf Rule-based Kurdish Text Transliteration System]: Latin-Arabic orthography conversion |
Revision as of 20:16, 22 February 2021
Below is a list of resources relevant to the Sorani Kurdish language. Resources are categorized according to type of content. I own italicized resources in PDF format. I have also flagged resources I have not yet obtained.
Contents
Computational Resources
- .txt lists of words, word-level digrams, and character-level trigrams
- Keyboard Layout: seems to be the most common layout
- Latin-based keyboard layout that I own on my computer
- does this mean that it came with your OS or that it's a custom one you (or someone else?) created? -Jonathan
- Someone else created it, I think Unikurd is its name, I cannot find a link to it but it is on my computer.
- Rule-based Kurdish Text Transliteration System: Latin-Arabic orthography conversion
- Kurdish Speech to Text: early attempt to develop speech recognition for Sorani Kurdish
- Towards Machine Translation for the Kurdish Language: Sorani machine translation model
Dictionaries
- Sorani Vocabulary: vocab list with Latin script transliterations, by Harvard
- The Kurdish Dictionary: translates to and from any combination of ckb/kur/eng via query
- Dictionary of Scientific Terms: includes Sorani definitions of terms like "atom"
- Diccionaire Fondamental Kurde-Français-Sorani: French-Sorani dict with phrases & alphabet
Grammatical Descriptions
- Sorani Grammar: high-level description of important grammatical properties of Sorani
- Kurdish Basic Course: has helpful chapters on grammar and word lists
- A Reference Grammar with Selected Readings: extensive descriptions of Sorani grammar
Scientific Works
- Building a Lemmatizer and a Spell-Checker for Sorani Kurdish: includes background on Sorani morphology
- Kurdish Romanization Table: potentially useful information on Sorani alphabet and orthography
- Kurdish Interdialect Machine Translation: suggests a method for Kurmanji-Sorani machine translation
- Building a Test Collection for Sorani Kurdish: outlines a Test Collection project + list of affixes
Corpora
Books & Encyclopediae
- Sorani Wikipedia: many articles averaging a few paragraphs in length
- Sorani Bible: one of a few different versions of the Bible
- JW Website: few dozen entries accompanied by voice narration
- Hawler Gov: various entries on the Kurdish capital
- شازاده چکۆله: Sorani version of The Little Prince, by Aso Abdullah
News Sites
This is an aggregation of news sites, all written in Sorani, that contain written media on the order of ~300 words per article, though long-form works of journalism can also be found.
Blog Pages
Long-form posts that exceed ~1000 words can be found here. Each link goes to a specific person's blog archive, containing anywhere between 10-100 articles.
- Abas Shiwan (with Latin script transliterations)