Difference between revisions of "Magahi/Final Project"

From LING073
Jump to: navigation, search
(Initial Evaluation)
Line 1: Line 1:
 
= Initial Evaluation =
 
= Initial Evaluation =
coverage: 1732745 / 3678603 (~0.47103343307228314662)<br>
+
* Coverage: 1732745 / 3678603 (~0.47103343307228314662)<br>
Totals: 142 forms, 339 tp, 12 fp, 0 tn, 65 fn<br>
+
* Totals: 142 forms, 339 tp, 12 fp, 0 tn, 65 fn<br>
Precision: 96.58120%<br>
+
* Precision: 96.58120%<br>
Recall: 83.91089%<br>
+
* Recall: 83.91089%<br>
 
+
* Unknown words
 
<code>
 
<code>
 
   31801 ^हे/*हे$      he
 
   31801 ^हे/*हे$      he
Line 29: Line 29:
  
 
== Final Evaluation ==
 
== Final Evaluation ==
 +
* Coverage: 150272 / 187701 (~0.80059243158001289285)
 +
* Totals: 142 forms, 340 tp, 103 fp, 0 tn, 64 fn
 +
* Precision: 76.74944% (went down because we introduced ambiguity by massively expanding the lexicon; it was artificially high before because all of our lexicon was based on that story)
 +
* Recall: 84.15842%
 +
* Unknown words
 
<code>
 
<code>
TOP UNKNOWN WORDS:
 
 
     58 ^डॉ०/*डॉ०$
 
     58 ^डॉ०/*डॉ०$
 
     21 ^हौले/*हौले$
 
     21 ^हौले/*हौले$
Line 51: Line 55:
 
     17 ^महाकवि/*महाकवि$
 
     17 ^महाकवि/*महाकवि$
 
     17 ^मरदाना/*मरदाना$
 
     17 ^मरदाना/*मरदाना$
coverage: 150272 / 187701 (~0.80059243158001289285)
+
* Remaining unknown forms: 37429
remaining unknown forms: 37429
+
* Total number of forms: 1172
 +
* Lexical forms (not morphology): 1012
 
</code>
 
</code>
  
 
[[Category:sp21_FinalProjects]] [[Category:Magahi]]
 
[[Category:sp21_FinalProjects]] [[Category:Magahi]]

Revision as of 16:39, 20 May 2021

Initial Evaluation

  • Coverage: 1732745 / 3678603 (~0.47103343307228314662)
  • Totals: 142 forms, 339 tp, 12 fp, 0 tn, 65 fn
  • Precision: 96.58120%
  • Recall: 83.91089%
  • Unknown words

 31801 ^हे/*हे$      he
 17942 ^ऊ/*ऊ$     U
 17754 ^1/*1$
 15114 ^तो/*तो$     to
 13642 ^2/*2$
 12158 ^न/*न$      na
  9535 ^नयँ/*नयँ$    nayaM
  9129 ^3/*3$
  7072 ^नसध॰/*नसध॰$ nasadh
  6821 ^5/*5$
  6626 ^ले/*ले$     se
  6158 ^घर/*घर$     ghar
  5972 ^की/*की$    kI
  5842 ^जे/*जे$     je
  5752 ^त/*त$      ta
  5606 ^अप्पन/*अप्पन$   appan
  5513 ^mso/*mso$ 
  5487 ^10/*10$
  5384 ^4/*4$
  5318 ^6/*6$

Final Evaluation

  • Coverage: 150272 / 187701 (~0.80059243158001289285)
  • Totals: 142 forms, 340 tp, 103 fp, 0 tn, 64 fn
  • Precision: 76.74944% (went down because we introduced ambiguity by massively expanding the lexicon; it was artificially high before because all of our lexicon was based on that story)
  • Recall: 84.15842%
  • Unknown words

    58 ^डॉ०/*डॉ०$
    21 ^हौले/*हौले$
    20 ^गाड/*गाड$
    19 ^सुनावे/*सुनावे$
    19 ^छो/*छो$
    18 ^हलूं/*हलूं$
    18 ^विदवान/*विदवान$
    18 ^तोरे/*तोरे$
    18 ^जुगाड़/*जुगाड़$
    18 ^गते/*गते$
    17 ^होते/*होते$
    17 ^हिंछा/*हिंछा$
    17 ^हाँथ/*हाँथ$
    17 ^सथान/*सथान$
    17 ^सजल/*सजल$
    17 ^वाह/*वाह$
    17 ^योगदान/*योगदान$
    17 ^मुनचुन/*मुनचुन$
    17 ^महाकवि/*महाकवि$
    17 ^मरदाना/*मरदाना$
  • Remaining unknown forms: 37429
  • Total number of forms: 1172
  • Lexical forms (not morphology): 1012