Difference between revisions of "Magahi/Final Project"
From LING073
Revision as of 17:39, 20 May 2021
Initial Evaluation
- Coverage: 1732745 / 3678603 (~0.47103343307228314662)
- Totals: 142 forms, 339 tp, 12 fp, 0 tn, 65 fn
- Precision: 96.58120%
- Recall: 83.91089%
- Unknown words
31801 ^हे/*हे$ he
17942 ^ऊ/*ऊ$ U
17754 ^1/*1$
15114 ^तो/*तो$ to
13642 ^2/*2$
12158 ^न/*न$ na
9535 ^नयँ/*नयँ$ nayaM
9129 ^3/*3$
7072 ^नसध॰/*नसध॰$ nasadh
6821 ^5/*5$
6626 ^ले/*ले$ se
6158 ^घर/*घर$ ghar
5972 ^की/*की$ kI
5842 ^जे/*जे$ je
5752 ^त/*त$ ta
5606 ^अप्पन/*अप्पन$ appan
5513 ^mso/*mso$
5487 ^10/*10$
5384 ^4/*4$
5318 ^6/*6$
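The precision and recall figures follow directly from the tp/fp/fn totals. A minimal sketch of that arithmetic, assuming the standard definitions (precision = tp / (tp + fp), recall = tp / (tp + fn)); the function name `precision_recall` is hypothetical and is not part of the evaluation script we ran:

```python
def precision_recall(tp, fp, fn):
    """Return (precision, recall) as percentages from raw counts."""
    # Precision: share of the analyses we produced that were correct.
    precision = 100.0 * tp / (tp + fp) if (tp + fp) else 0.0
    # Recall: share of the expected analyses that we actually produced.
    recall = 100.0 * tp / (tp + fn) if (tp + fn) else 0.0
    return precision, recall


# Totals from the initial evaluation: 339 tp, 12 fp, 65 fn.
p, r = precision_recall(339, 12, 65)
print(f"Precision: {p:.5f}%")
print(f"Recall: {r:.5f}%")
```

The tn count does not enter either formula, which is why a value of 0 tn has no effect on the reported figures.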
Final Evaluation
- Coverage: 150272 / 187701 (~0.80059243158001289285)
- Totals: 142 forms, 340 tp, 103 fp, 0 tn, 64 fn
- Precision: 76.74944% (down from 96.58120%: massively expanding the lexicon introduced ambiguity, and the earlier figure was artificially high because our entire lexicon was based on that story)
- Recall: 84.15842%
- Unknown words
58 ^डॉ०/*डॉ०$
21 ^हौले/*हौले$
20 ^गाड/*गाड$
19 ^सुनावे/*सुनावे$
19 ^छो/*छो$
18 ^हलूं/*हलूं$
18 ^विदवान/*विदवान$
18 ^तोरे/*तोरे$
18 ^जुगाड़/*जुगाड़$
18 ^गते/*गते$
17 ^होते/*होते$
17 ^हिंछा/*हिंछा$
17 ^हाँथ/*हाँथ$
17 ^सथान/*सथान$
17 ^सजल/*सजल$
17 ^वाह/*वाह$
17 ^योगदान/*योगदान$
17 ^मुनचुन/*मुनचुन$
17 ^महाकवि/*महाकवि$
17 ^मरदाना/*मरदाना$
- Remaining unknown forms: 37429
- Total number of forms: 1172
- Lexical forms (not morphology): 1012