Difference between revisions of "Magahi"
From LING073
(→Scientific works) |
(Added a few resources, the github, and moved things around a bit.) |
||
Line 1: | Line 1: | ||
== Computational Resources == | == Computational Resources == | ||
− | * Kimi Linguistics Magahi, Apache License 2.0 | + | * Kimi Linguistics Magahi, Apache License 2.0 [https://github.com/kmi-linguistics/magahi/] Github repository with some computational resources for Magahi, notably a large corpus. |
− | ** "Magahi Morph Analyser (magahi-morph) - a rule-based analyser." | + | ** "Magahi Morph Analyser (magahi-morph) - a rule-based analyser." |
− | ** "Magahi BIS POS Tagger (magahi-pos) - a maximum entropy based pos-tagger." | + | ** "Magahi BIS POS Tagger (magahi-pos) - a maximum entropy based pos-tagger." |
− | ** "Corpus Search Tool (msearchit) - searches through the specified corpus; comes with a support for regex." | + | ** "Corpus Search Tool (msearchit) - searches through the specified corpus; comes with a support for regex." Doesn't actually appear to work. |
− | ** "POS tagger for annotating with Universal Dependencies POS tags." | + | ** "POS tagger for annotating with Universal Dependencies POS tags." |
+ | |||
== Dictionaries/Phrasebooks/Glossaries == | == Dictionaries/Phrasebooks/Glossaries == | ||
− | |||
* Magahi Grammar Dictionary (Hindi) 1965 https://archive.org/details/in.ernet.dli.2015.306861/page/n23/mode/2up | * Magahi Grammar Dictionary (Hindi) 1965 https://archive.org/details/in.ernet.dli.2015.306861/page/n23/mode/2up | ||
− | * Small online Phrasebook | + | * Small online Phrasebook https://omniglot.com/language/phrases/magahi.htm |
− | * "A Comparative dictionary of the Bihārī language" https://archive.org/details/acomparativedic00griegoog/page/n33/mode/2up | + | * "A Comparative dictionary of the Bihārī language" https://archive.org/details/acomparativedic00griegoog/page/n33/mode/2up. Bihārī is the term for a subfamily of Indo-Aryan including Magahi. |
+ | |||
== Grammatical Descriptions == | == Grammatical Descriptions == | ||
− | * | + | * ''The Indo-Aryan Languages'' (Jain, Danesh ; Cardona, George) on Tripod, particularly chapter 13 has a nice overview of Magahi. |
+ | * ''The Linguistic Survey of India'' (Grierson, George) [https://archive.org/details/LinguisticSurveyOfIndiaVolVPartIiIndoAryanFamilyEasternGroup/page/n1/mode/2up here], particularly volume 5 part 2, starts page 30, gives an overview of Magahi, including a skeleton grammar on page 38. | ||
+ | * ''A Reference Grammar of Maithili'' (Yadav, Ramawatar) on Tripod. Not Magahi, but a very closely related language, and Grierson outlines the differences between Magahi and Maithili in ''The Linguistic Survey of India''. | ||
== Scientific works == | == Scientific works == | ||
Line 19: | Line 22: | ||
== Corpora == | == Corpora == | ||
− | + | [https://github.swarthmore.edu/Ling073-sp21/ling073-mag-corpus Corpus on Github]. Consists mostly of the corpus from Kimi Linguistics Magahi, which is made up of blog posts and a story, as well as some data from ''The Linguistic Survey of India'' mentioned above, which has numerous glossed and analyzed samples from various dialects of Magahi, but only one was transcribed for the corpus. | |
− | |||
− |
Revision as of 14:59, 23 February 2021
Contents
Computational Resources
- Kimi Linguistics Magahi, Apache License 2.0 [1] Github repository with some computational resources for Magahi, notably a large corpus.
- "Magahi Morph Analyser (magahi-morph) - a rule-based analyser."
- "Magahi BIS POS Tagger (magahi-pos) - a maximum entropy based pos-tagger."
- "Corpus Search Tool (msearchit) - searches through the specified corpus; comes with a support for regex." Doesn't actually appear to work.
- "POS tagger for annotating with Universal Dependencies POS tags."
Dictionaries/Phrasebooks/Glossaries
- Magahi Grammar Dictionary (Hindi) 1965 https://archive.org/details/in.ernet.dli.2015.306861/page/n23/mode/2up
- Small online Phrasebook https://omniglot.com/language/phrases/magahi.htm
- "A Comparative dictionary of the Bihārī language" https://archive.org/details/acomparativedic00griegoog/page/n33/mode/2up. Bihārī is the term for a subfamily of Indo-Aryan including Magahi.
Grammatical Descriptions
- The Indo-Aryan Languages (Jain, Danesh ; Cardona, George) on Tripod, particularly chapter 13 has a nice overview of Magahi.
- The Linguistic Survey of India (Grierson, George) here, particularly volume 5 part 2, starts page 30, gives an overview of Magahi, including a skeleton grammar on page 38.
- A Reference Grammar of Maithili (Yadav, Ramawatar) on Tripod. Not Magahi, but a very closely related language, and Grierson outlines the differences between Magahi and Maithili in The Linguistic Survey of India.
Scientific works
- The case system of Eastern Indo-Aryan languages : a typological overview: https://www.worldcat.org/title/case-system-of-eastern-indo-aryan-languages-a-typological-overview/oclc/1201667050&referer=brief_results
- Phonology: https://www.worldcat.org/title/magahi-phonology-a-descriptive-study/oclc/261342906&referer=brief_results
- Magahi semantics: https://www.worldcat.org/title/magahi-arthavijnana-visleshanatmaka-nirvacana-an-analytical-approach-to-magahi-semantics/oclc/11045154&referer=brief_results
Corpora
Corpus on Github. Consists mostly of the corpus from Kimi Linguistics Magahi, which is made up of blog posts and a story, as well as some data from The Linguistic Survey of India mentioned above, which has numerous glossed and analyzed samples from various dialects of Magahi, but only one was transcribed for the corpus.