User:Cdalton2/Language selection

From LING073
< User:Cdalton2
Revision as of 19:10, 24 January 2018 by Cdalton2 (talk | contribs)

Jump to: navigation, search


Morphological Typology

Rohingya seems to be a fusional language. A single "verb-form suffix" encodes person, tense, and aspect; that is, each grammatical person has its own set of tense/aspect suffixes.


Rohingya (rhg) is an Indo-Aryan language spoken by 1.7 to 1.8 million people worldwide. As of 2012, about 800,000 of these speakers live in the Rakhine state of Myanmar. Due to persecution and military crackdowns by Myanmar's government on the Rohingya ethnic group, an estimated 1 million Rohingyas have fled to Bangladesh and a number of other countries. As Ethnologue notes, the nature of the situation is such that the actual population numbers are very uncertain. Most Rohingya speakers also use Rakhine (rki), and some use Burmese (mya). I have also seen the term "Rohingyalish" used to refer to this language.

Rohingya has been written in several different scripts. It was first written in the 1800s using the Arabic script. In the mid to late 1900s, there were a couple of attempts to modify the Arabic or Urdu scripts to make them more suitable for Rohingya. One system that seems to have caught on a bit is the Hanifi script, which was developed by Molana Hanifi in the 1980s. It is based on the Arabic alphabet with borrowings from the Latin and Burmese scripts. It is not currently supported by Unicode, but language technologist Anshuman Pandey has been working on it, and it is expected to be incorporated into the 2018 Unicode standard. There is also a system for writing Rohingya in the Latin alphabet.

Textual Resources

The Rohingya Language Foundation has a number of texts in Rohingya, including:

  • Several translations of the Quran and parts of the Quran
  • Dictionaries and textbooks
  • A document explaining Rohingya morphology with some example sentences

putting this here for now


Morphological Typology

As best as I can determine, Standard Tibetan falls generally into the agglutinative category. It's similar to some isolating languages in that the number and gender of a noun is usually determined by context; however, nouns do receive case marking, and the case suffixes are just added onto the noun phrase without affecting the form of the noun itself.


Tibetan (bod) is a Tibeto-Burman language spoken by about 1.2 million people, mostly in the Tibetan Autonomous Region of the People's Republic of China, but also in some parts of India and Nepal. Standard Tibetan is considered to be the dialect spoken in Lhasa, but there is a lot of variation within the Tibetic languages group, since the Tibetan people range across a very large area. Speakers of Tibetan may also speak Mandarin Chinese (cmn), as well as a whole host of other, smaller languages. Tibetan is the main language of instruction in primary schools in the Tibetan Autonomous Region, where it is considered an official language, and it does not appear to be having many issues in terms of transmission to younger generations.

The Tibetan alphabet is said to have been developed by Thonmi Sambhota in the middle of the 7th century. The script has not changed much over the years, so there is quite a large disparity between the spelling and the modern pronunciation of the language. When writing Tibetan in Latin script, most Western scholars use the Wylie transliteration, but the Chinese government officially uses Tibetan pinyin.

Textual Resources

The Tsadra Foundation has links to databases of Buddhist texts in Tibetan. In particular, Lotsawa House has a pretty large corpus.