Difference between revisions of "User:Doldham1/Final project"
From LING073
Revision as of 22:39, 12 May 2017
Project Details
What I did
Corpus
Transducer
- I implemented numbers with a digits lexicon that loops back on itself, so a number can contain any number of digits, and that then attaches forms such as year, day, and hour to the end of the number.
- I wrote four twol rules to handle the conjugation of u-verbs in the た, て, and ない forms.
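The looping digits lexicon can be pictured with a small Python sketch. This is not the project's lexc source: the lexicon names (Root, Digits, DigitLoop, Counters), the use of ASCII digits, and the choice of 年, 日, and 時 as the year, day, and hour forms are all assumptions made for illustration.

```python
# Hypothetical continuation lexicons, loosely modelled on how lexc chains
# lexicons together. Each entry is (surface form, next lexicon); "#" ends a word.
LEXICONS = {
    "Root": [("", "Digits")],
    # Every digit continues to DigitLoop, which can return here: this is the loop.
    "Digits": [(digit, "DigitLoop") for digit in "0123456789"],
    "DigitLoop": [("", "Digits"), ("", "Counters")],
    # Assumed suffixes for year, day, and hour.
    "Counters": [("年", "#"), ("日", "#"), ("時", "#")],
}


def accepts(word, lexicon="Root"):
    """Return True if `word` can be spelled out by walking the lexicons."""
    if lexicon == "#":
        # End of word: accept only if nothing is left over.
        return word == ""
    for form, continuation in LEXICONS[lexicon]:
        if word.startswith(form) and accepts(word[len(form):], continuation):
            return True
    return False


if __name__ == "__main__":
    for candidate in ["2017年", "12日", "5", "年"]:
        print(candidate, accepts(candidate))
```

In this sketch a bare number without a suffix is rejected, because the Counters lexicon has no empty entry. Whether the real transducer accepts bare numbers is not stated here.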
What I did not do
- Tokenisation
- Prefixes (e.g. 毎 for "every", and お and ご for honorifics)
Repositories
Evaluation
- Coverage over the Hiroshima article:
- Coverage over the grapefruit article:
- Coverage over the shibori article:
- Coverage over the orc article:
- Coverage over all articles:
Precision and recall would also be useful metrics, but I did not have time to work on them.
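As a rough illustration of these metrics, here is a minimal Python sketch. It assumes coverage means the share of tokens that receive at least one analysis, and that precision and recall compare the analyses the transducer produces against a hand-checked gold set. The function names and the data layout are hypothetical and are not taken from the project.

```python
def coverage(token_analyses):
    """Fraction of tokens that received at least one analysis.

    `token_analyses` is a list of (token, set_of_analyses) pairs.
    """
    if not token_analyses:
        return 0.0
    analysed = sum(1 for _, analyses in token_analyses if analyses)
    return analysed / len(token_analyses)


def precision_recall(predicted, gold):
    """Precision and recall of a set of predicted analyses against a gold set."""
    true_positives = len(predicted & gold)
    precision = true_positives / len(predicted) if predicted else 0.0
    recall = true_positives / len(gold) if gold else 0.0
    return precision, recall


if __name__ == "__main__":
    # Made-up example data, only to show the call shapes.
    sample = [("a", {"x"}), ("b", set()), ("c", {"y"}), ("d", {"z"})]
    print(coverage(sample))
    print(precision_recall({"a", "b", "c"}, {"a", "b", "d", "e"}))
```

Coverage only needs the transducer's output, whereas precision and recall need gold-standard analyses to compare against, which makes them more work to compute.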