I'm interested in working with Courtney Dalton on this project. I think we'd work well together because we're both Bryn Mawr students and I'm a Comp Sci student and she's a Linguistics student!
Analytic, ergative and tonal. 
1.2 million from 1990 census, spoken in the "Tibet Autonomous Region," probably also exposed to Chinese, Nepalese, and Hindi, very conservative orthography. ISO codes include ISO 639-1, bo, ISO 639-2, tib and bod, and ISO 639-3, bod. 
Lots of texts available online, from translations to coursebooks to libraries. 
152,000 speakers, spoken in Iraq (Dahuk and Ninawa governorates), member of macrolanguage (Syriac) and lots of exposure to related languages in the region, often with a good degree of intelligibility. Written in the Syriac alphabet.  ISO codes include ISO 639-2 and 639-3 are both syc. 
Good number of texts available (at least there are bible translations and a Universal Declaration of Human Rights.) 
Agglutinative, no passives, non-tonal. 
SourcesTweets?!?!? That's so cool!,,,