Difference between revisions of "Latin and Mandarin Chinese/Structural transfer"

From LING073
Jump to: navigation, search
Line 2: Line 2:
  
 
==Pre-evaluation==
 
==Pre-evaluation==
 +
 +
=== Latin corpus coverage ===
 +
 +
Number of tokenised words in the corpus: 380
 +
Coverage: 88.68%
 +
Top unknown words in the corpus:
 +
2 potest
 +
2 facet
 +
2 possit
 +
2 quo
 +
1 tibi
 +
1 Mariaene
 +
1 quid
 +
1 audit
 +
1 possum
 +
1 matrae
 +
1 James
 +
1 loquent
 +
1 audire
 +
1 not
 +
1 Videbasne
 +
1 duo
 +
1 poterunt
 +
1 eae
 +
1 aliquid
 +
1 posset
 +
 +
=== Chinese corpus coverage ===
 +
 +
Number of tokenised words in the corpus: 447
 +
Coverage: 100.00%
 +
 +
=== lat → zho ===
 +
 +
WER: 586.79%
 +
 +
PER: 586.79%
 +
 +
=== zho → lat ===
 +
 +
WER: 97.42%
 +
 +
PER: 93.55%
  
 
[[Category:Sp17_StructuralTransfer]]
 
[[Category:Sp17_StructuralTransfer]]

Revision as of 19:52, 11 April 2018

This is the page for the structural transfer of Latin and Mandarin Chinese. The main page for this language pair can be found here.

Pre-evaluation

Latin corpus coverage

Number of tokenised words in the corpus: 380 Coverage: 88.68% Top unknown words in the corpus: 2 potest 2 facet 2 possit 2 quo 1 tibi 1 Mariaene 1 quid 1 audit 1 possum 1 matrae 1 James 1 loquent 1 audire 1 not 1 Videbasne 1 duo 1 poterunt 1 eae 1 aliquid 1 posset

Chinese corpus coverage

Number of tokenised words in the corpus: 447 Coverage: 100.00%

lat → zho

WER: 586.79%

PER: 586.79%

zho → lat

WER: 97.42%

PER: 93.55%