Difference between revisions of "Magahi"

From LING073
Jump to: navigation, search
(Scientific works)
(Added a few resources, the github, and moved things around a bit.)
Line 1: Line 1:
 
== Computational Resources ==
 
== Computational Resources ==
* Kimi Linguistics Magahi, Apache License 2.0 <ref name="Apache License">https://github.com/kmi-linguistics/magahi/blob/master/LICENSE</ref>
+
* Kimi Linguistics Magahi, Apache License 2.0 [https://github.com/kmi-linguistics/magahi/] Github repository with some computational resources for Magahi, notably a large corpus.
** "Magahi Morph Analyser (magahi-morph) - a rule-based analyser."<ref name="GitHub">https://github.com/kmi-linguistics/magahi</ref>
+
** "Magahi Morph Analyser (magahi-morph) - a rule-based analyser."
** "Magahi BIS POS Tagger (magahi-pos) - a maximum entropy based pos-tagger." <ref name="GitHub">https://github.com/kmi-linguistics/magahi</ref>
+
** "Magahi BIS POS Tagger (magahi-pos) - a maximum entropy based pos-tagger."
** "Corpus Search Tool (msearchit) - searches through the specified corpus; comes with a support for regex."<ref name="GitHub">https://github.com/kmi-linguistics/magahi</ref>
+
** "Corpus Search Tool (msearchit) - searches through the specified corpus; comes with a support for regex." Doesn't actually appear to work.
** "POS tagger for annotating with Universal Dependencies POS tags."<ref name="GitHub">https://github.com/kmi-linguistics/magahi</ref>
+
** "POS tagger for annotating with Universal Dependencies POS tags."
 +
 
 
== Dictionaries/Phrasebooks/Glossaries ==
 
== Dictionaries/Phrasebooks/Glossaries ==
* Magahi Skeleton Grammar, 1927, Motilal Banarsidass. Brief overview of Magahi Grammar in English; hard to read. https://archive.org/details/in.ernet.dli.2015.306861/mode/2up
 
 
* Magahi Grammar Dictionary (Hindi) 1965 https://archive.org/details/in.ernet.dli.2015.306861/page/n23/mode/2up
 
* Magahi Grammar Dictionary (Hindi) 1965 https://archive.org/details/in.ernet.dli.2015.306861/page/n23/mode/2up
* Small online Phrasebook (only 2 phrases) https://glosbe.com/mag/en
+
* Small online Phrasebook https://omniglot.com/language/phrases/magahi.htm
* "A Comparative dictionary of the Bihārī language" https://archive.org/details/acomparativedic00griegoog/page/n33/mode/2up
+
* "A Comparative dictionary of the Bihārī language" https://archive.org/details/acomparativedic00griegoog/page/n33/mode/2up. Bihārī is the term for a subfamily of Indo-Aryan including Magahi.
 +
 
 
== Grammatical Descriptions ==
 
== Grammatical Descriptions ==
*
+
* ''The Indo-Aryan Languages'' (Jain, Danesh ; Cardona, George) on Tripod, particularly chapter 13 has a nice overview of Magahi.
 +
* ''The Linguistic Survey of India'' (Grierson, George) [https://archive.org/details/LinguisticSurveyOfIndiaVolVPartIiIndoAryanFamilyEasternGroup/page/n1/mode/2up here], particularly volume 5 part 2, starts page 30, gives an overview of Magahi, including a skeleton grammar on page 38.
 +
* ''A Reference Grammar of Maithili'' (Yadav, Ramawatar) on Tripod. Not Magahi, but a very closely related language, and Grierson outlines the differences between Magahi and Maithili in ''The Linguistic Survey of India''.
  
 
== Scientific works ==
 
== Scientific works ==
Line 19: Line 22:
  
 
== Corpora ==
 
== Corpora ==
* U.N. Charter of Human Rights in Magahi, PDF. <ref name="Charter">https://ia800300.us.archive.org/3/items/rosettaproject_mag_undec-1/rosettaproject_mag_undec-1.pdf</ref>
+
[https://github.swarthmore.edu/Ling073-sp21/ling073-mag-corpus Corpus on Github]. Consists mostly of the corpus from Kimi Linguistics Magahi, which is made up of blog posts and a story, as well as some data from ''The Linguistic Survey of India'' mentioned above, which has numerous glossed and analyzed samples from various dialects of Magahi, but only one was transcribed for the corpus.
* <i>Tinaḍiṛiyā = Tindiria : Magahi story collection</i> by Swarnkiran. Physical copies at Library of Congress, Columbia University Libraries, and others. <ref name="story collection">https://www.worldcat.org/title/tinadiriya-tindiria-magahi-story-collection/oclc/8931793&referer=brief_results</ref>
 
*
 

Revision as of 14:59, 23 February 2021

Computational Resources

  • Kimi Linguistics Magahi, Apache License 2.0 [1] Github repository with some computational resources for Magahi, notably a large corpus.
    • "Magahi Morph Analyser (magahi-morph) - a rule-based analyser."
    • "Magahi BIS POS Tagger (magahi-pos) - a maximum entropy based pos-tagger."
    • "Corpus Search Tool (msearchit) - searches through the specified corpus; comes with a support for regex." Doesn't actually appear to work.
    • "POS tagger for annotating with Universal Dependencies POS tags."

Dictionaries/Phrasebooks/Glossaries

Grammatical Descriptions

  • The Indo-Aryan Languages (Jain, Danesh ; Cardona, George) on Tripod, particularly chapter 13 has a nice overview of Magahi.
  • The Linguistic Survey of India (Grierson, George) here, particularly volume 5 part 2, starts page 30, gives an overview of Magahi, including a skeleton grammar on page 38.
  • A Reference Grammar of Maithili (Yadav, Ramawatar) on Tripod. Not Magahi, but a very closely related language, and Grierson outlines the differences between Magahi and Maithili in The Linguistic Survey of India.

Scientific works

Corpora

Corpus on Github. Consists mostly of the corpus from Kimi Linguistics Magahi, which is made up of blog posts and a story, as well as some data from The Linguistic Survey of India mentioned above, which has numerous glossed and analyzed samples from various dialects of Magahi, but only one was transcribed for the corpus.