Difference between revisions of "Dhivehi and English/Structural transfer"

From LING073
Jump to: navigation, search
(Definite Articles for Nouns)
Line 47: Line 47:
 
<pre> $ echo ތަރައްގީ | apertium -d . div-eng-interchunk
 
<pre> $ echo ތަރައްގީ | apertium -d . div-eng-interchunk
 
  apertium-interchunk: Rule 1 noun<SN><sg><def><dir>{^improvement<n><sg>$}
 
  apertium-interchunk: Rule 1 noun<SN><sg><def><dir>{^improvement<n><sg>$}
^the<det>{^the<det><def><sp>$}$ ^noun<SN><sg><def><dir>{^improvement<n><sg>$}$^sent<SENT>{^.<sent>$}$</pre>
+
^the<det>{^the<det><def><sp>$}$ ^noun<SN><sg><def><dir>{^improvement<n><sg>$}$^sent<SENT>{^.<sent>$}$</pre>
  
 
<pre> $ echo ތަރައްގީ | apertium -d . div-eng-postchunk
 
<pre> $ echo ތަރައްގީ | apertium -d . div-eng-postchunk
 
  ^the<det><def><sp>$ ^improvement<n><sg>$^.<sent>$</pre>
 
  ^the<det><def><sp>$ ^improvement<n><sg>$^.<sent>$</pre>
 
  
 
== Post-Evaluation ==  
 
== Post-Evaluation ==  

Revision as of 09:31, 18 April 2019

Pre-evaluation

Statistics about input files
-------------------------------------------------------
Number of words in reference: 48
Number of words in test: 37
Number of unknown words (marked with a star) in test: 
Percentage of unknown words: 0.00 %

Results when removing unknown-word marks (stars)
-------------------------------------------------------
Edit distance: 47
Word error rate (WER): 97.92 %
Number of position-independent correct words: 2
Position-independent word error rate (PER): 95.83 %

Results when unknown-word marks (stars) are not removed
-------------------------------------------------------
Edit distance: 47
Word Error Rate (WER): 97.92 %
Number of position-independent correct words: 2
Position-independent word error rate (PER): 95.83 %

Statistics about the translation of unknown words
-------------------------------------------------------
Number of unknown words which were free rides: 0
Percentage of unknown words that were free rides: 0%

Definite Articles for Nouns

 $ echo ތަރައްގީ | apertium -d . div-eng
 the improvement
 $ echo ތަރައްގީ | apertium -d . div-eng-disam
 ^ތަރައްގީ<n><nhum><sg><def><dir>$^.<sent>$
 $ echo ތަރައްގީ | apertium -d . div-eng-biltrans 
 ^ތަރައްގީ<n><nhum><sg><def><dir>/improvement<n><sg><def><dir>$^.<sent>/.<sent>$
 $ echo ތަރައްގީ | apertium -d . div-eng-chunker
 
 apertium-transfer: Rule 3 ތަރައްގީ<n><nhum><sg><def><dir>/improvement<n><sg><def><dir>

 apertium-transfer: Rule 1 .<sent>/.<sent>
 ^noun<SN><sg><def><dir>{^improvement<n><sg>$}$^sent<SENT>{^.<sent>$}$
 $ echo ތަރައްގީ | apertium -d . div-eng-interchunk
 apertium-interchunk: Rule 1 noun<SN><sg><def><dir>{^improvement<n><sg>$}
 ^the<det>{^the<det><def><sp>$}$ ^noun<SN><sg><def><dir>{^improvement<n><sg>$}$^sent<SENT>{^.<sent>$}$
 $ echo ތަރައްގީ | apertium -d . div-eng-postchunk
 ^the<det><def><sp>$ ^improvement<n><sg>$^.<sent>$

Post-Evaluation

Statistics about input files
-------------------------------------------------------
Number of words in reference: 48
Number of words in test: 49
Number of unknown words (marked with a star) in test: 
Percentage of unknown words: 0.00 %

Results when removing unknown-word marks (stars)
-------------------------------------------------------
Edit distance: 39
Word error rate (WER): 81.25 %
Number of position-independent correct words: 16
Position-independent word error rate (PER): 68.75 %

Results when unknown-word marks (stars) are not removed
-------------------------------------------------------
Edit distance: 39
Word Error Rate (WER): 81.25 %
Number of position-independent correct words: 16
Position-independent word error rate (PER): 68.75 %

Statistics about the translation of unknown words
-------------------------------------------------------
Number of unknown words which were free rides: 0
Percentage of unknown words that were free rides: 0%