Difference between revisions of "Waray/Final Project"

From LING073
Jump to: navigation, search
 
Line 1: Line 1:
 +
==Expanding Our Morphological Transducer==
 +
===Initial Anaylser Evaluation===
 +
* Coverage: 742 / 1239 (~0.59887005649717514124)
 +
* Remaining Unknown Words: 487
  
 +
====Lexical Information====
 +
* Lexicons: 17       
 +
* Lexicon entries: 120
 +
* Patterns: 1
 +
* Pattern entries: 9
  
[[Category:Sp21_FinalProjects]] [[Caegory:Waray]]
+
====Counts for individual lexicon====
 +
* NounRoot: 23
 +
* Determiners: 13
 +
* PluralDet: 2
 +
* VerbPrefixes: 5
 +
* VerbStems: 5
 +
* Pronouns: 31
 +
* ProperNouns: 1
 +
* Adverbs: 11
 +
* Auxiliary: 1
 +
* Punctuaion: 22
 +
* All anonymous lexicons: 6
 +
 
 +
====Tests====
 +
* '''war.yaml''': Total passes: 57, Total fails: 14, Total: 71
 +
* '''commonWords.yaml''': Total passes 4, Total fails: 16, Total: 20
 +
 
 +
====Current Unknown Words====
 +
TOP UNKNOWN WORDS:
 +
    15 ^nagkaada/*nagkaada$
 +
    12 ^hito/*hito$
 +
    11 ^uyon/*uyon$
 +
    11 ^linarang/*linarang$
 +
    10 ^may/*may$
 +
    10 ^langit/*langit$
 +
      9 ^klase/*klase$
 +
      9 ^hayop/*hayop$
 +
      8 ^katubigan/*katubigan$
 +
      7 ^tagsa/*tagsa$
 +
      7 ^ngatanan/*ngatanan$
 +
      7 ^kalamrag/*kalamrag$
 +
      7 ^basi/*basi$
 +
      6 ^nalupad/*nalupad$
 +
      6 ^liso/*liso$
 +
      6 ^iton/*iton$
 +
      6 ^haluag/*haluag$
 +
      6 ^espasyo/*espasyo$
 +
      6 ^bawbaw/*bawbaw$
 +
      6 ^aga/*aga$
 +
 
 +
===Notes===
 +
* Tests for verbalized nouns not implemented yet
 +
 
 +
==Initial Generator Evaluation==
 +
 
 +
 
 +
* Currently, we have 4 rules in our twol file to handle verb conjugation.
 +
 
 +
[[Category: Sp21_Transducers]] [[Category: Waray]]
 +
 
 +
 
 +
 
 +
[[Category:Sp21_FinalProjects]] [[Category:Waray]]

Revision as of 18:00, 20 May 2021

Expanding Our Morphological Transducer

Initial Anaylser Evaluation

  • Coverage: 742 / 1239 (~0.59887005649717514124)
  • Remaining Unknown Words: 487

Lexical Information

  • Lexicons: 17
  • Lexicon entries: 120
  • Patterns: 1
  • Pattern entries: 9

Counts for individual lexicon

  • NounRoot: 23
  • Determiners: 13
  • PluralDet: 2
  • VerbPrefixes: 5
  • VerbStems: 5
  • Pronouns: 31
  • ProperNouns: 1
  • Adverbs: 11
  • Auxiliary: 1
  • Punctuaion: 22
  • All anonymous lexicons: 6

Tests

  • war.yaml: Total passes: 57, Total fails: 14, Total: 71
  • commonWords.yaml: Total passes 4, Total fails: 16, Total: 20

Current Unknown Words

TOP UNKNOWN WORDS:

    15 ^nagkaada/*nagkaada$
    12 ^hito/*hito$
    11 ^uyon/*uyon$
    11 ^linarang/*linarang$
    10 ^may/*may$
    10 ^langit/*langit$
     9 ^klase/*klase$
     9 ^hayop/*hayop$
     8 ^katubigan/*katubigan$
     7 ^tagsa/*tagsa$
     7 ^ngatanan/*ngatanan$
     7 ^kalamrag/*kalamrag$
     7 ^basi/*basi$
     6 ^nalupad/*nalupad$
     6 ^liso/*liso$
     6 ^iton/*iton$
     6 ^haluag/*haluag$
     6 ^espasyo/*espasyo$
     6 ^bawbaw/*bawbaw$
     6 ^aga/*aga$

Notes

  • Tests for verbalized nouns not implemented yet

Initial Generator Evaluation

  • Currently, we have 4 rules in our twol file to handle verb conjugation.