Difference between revisions of "Central Kurdish"

From LING073

Revision as of 11:06, 23 February 2021


Below is a list of resources relevant to the Sorani Kurdish language, categorized by type of content. Italicized resources are ones I own in PDF format. Resources I have not yet obtained are flagged.

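For the corpus assembly lab (https://wikis.swarthmore.edu/ling073/Initial_corpus_assembly), a repository (https://github.swarthmore.edu/Ling073-sp21/ling073-ckb-corpus) of plain text files has been created using excerpts from some of the resources listed below.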

Computational Resources

  • .txt lists of words, word-level digrams, and character-level trigrams
  • Latin-based keyboard layout that is installed on my computer
Does this mean that it came with your OS, or that it's a custom one you (or someone else?) created? -Jonathan
Someone else created it. I think it is called Unikurd. I cannot find a link to it, but it is on my computer.
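
As a rough illustration of how word lists, word-level digrams, and character-level trigrams like those in the first item above could be produced from a plain text file, here is a minimal Python sketch. It is not the method used to build the listed files. The function names are hypothetical, whitespace tokenization is an assumption (punctuation and the zero-width non-joiner may need extra handling in Sorani text), and the sample string is just the book title listed further down this page.

```python
from collections import Counter


def word_list(text):
    # Assumption: words are separated by whitespace only.
    return text.split()


def word_digrams(words):
    # Count each pair of adjacent words.
    return Counter(zip(words, words[1:]))


def char_trigrams(text):
    # Count every run of three consecutive characters, spaces included.
    return Counter(text[i:i + 3] for i in range(len(text) - 2))


if __name__ == "__main__":
    sample = "شازاده چکۆله"  # illustrative input only
    words = word_list(sample)
    print(words)
    print(word_digrams(words).most_common(5))
    print(char_trigrams(sample).most_common(5))
```

To run this over a corpus file, read it with encoding="utf-8" and pass the resulting string to the same functions.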

Dictionaries

  • Dictionary of Scientific Terms: includes Sorani definitions of terms like "atom"
  • Diccionaire Fondamental Kurde-Français-Sorani: French-Sorani dictionary with phrases and an alphabet

Grammatical Descriptions

  • Sorani Grammar: high-level description of important grammatical properties of Sorani

Scientific Works

Corpora

Try clicking the links on that page and then looking for "Identifier (URI)" - Daniel

Books & Encyclopediae

  • JW Website: a few dozen entries accompanied by voice narration
  • Hawler Gov: various entries on the Kurdish capital
  • شازاده چکۆله: Sorani version of The Little Prince, by Aso Abdullah

News Sites

This is an aggregation of news sites, all written in Sorani, whose articles typically run about 300 words, though long-form works of journalism can also be found.

Blog Pages

Long-form posts that exceed roughly 1000 words can be found here. Each link goes to a specific person's blog archive, containing anywhere from 10 to 100 articles.