Difference between revisions of "Dzongkha and English/Structural Transfer"

From LING073
Jump to: navigation, search
 
(Adding Prepositions)
 
(24 intermediate revisions by the same user not shown)
Line 1: Line 1:
 +
 +
== Pre-Evaluation ==
 +
=== Statistics about input file ===
 +
* Number of words in reference: 31
 +
* Number of words in test: 10
 +
* Number of unknown words (marked with a star) in test: 0
 +
* Percentage of unknown words: 0.00 %
 +
 +
=== Results when removing unknown-word marks (stars) ===
 +
* Edit distance: 31
 +
* Word error rate (WER): 100.00 %
 +
* Number of position-independent correct words: 0
 +
* Position-independent word error rate (PER): 100.00 %
 +
 +
=== Results when unknown-word marks (stars) are not removed ===
 +
* Edit distance: 31
 +
* Word Error Rate (WER): 100.00 %
 +
* Number of position-independent correct words: 0
 +
* Position-independent word error rate (PER): 100.00 %
 +
 +
=== Statistics about the translation of unknown words ===
 +
* Number of unknown words which were free rides: 0
 +
* Percentage of unknown words that were free rides: 0%
 +
 +
=== Coverage ===
 +
* Bilingual Coverage: 100%
 +
 +
== Example ==
 +
 +
=== Determiner After Noun ===
 +
 +
==== Dzo-Eng ====
 +
* {{transferTest|dzo|eng|བྱི་ལི་དེ་|the cat}}
 +
 +
==== Tagger ====
 +
* {{transferMorphTest|dzo|eng|བྱི་ལི་{{tag|n}} དེ་{{tag|det}}{{tag|def}}|the{{tag|det}}{{tag|def}} cat{{tag|n}}{{tag|sg}} }}
 +
 +
==== Biltrans ====
 +
^བྱི་ལི་{{tag|n}}/cat{{tag|n}}$^དེ་{{tag|det}}/the{{tag|det}}{{tag|def}}$
 +
 +
==== Transfer ====
 +
^the{{tag|det}}{{tag|def}}{{tag|sg}}$ ^cat{{tag|n}}{{tag|m}}{{tag|sg}}$
 +
 +
==== Lex ====
 +
^default{{tag|NP}}{{tag|ND}}{
 +
 +
: ^བྱི་ལི་{{tag|n}}/cat{{tag|n}}$
 +
 +
: ^དེ་{{tag|det}}/the{{tag|det}}{{tag|def}}$
 +
 +
}$
 +
 +
=== Verb-Noun Order ===
 +
 +
==== Dzo-Eng ====
 +
* {{transferTest|dzo|eng|བྱི་ཙི་དེ་ཟུང་|catch the rat}}
 +
 +
==== Tagger ====
 +
* {{transferMorphTest|dzo|eng|བྱི་ཙི་{{tag|n}} དེ་{{tag|det}}{{tag|def}} ཟུང་{{tag|v}}{{tag|tv}}|catch{{tag|vblex}} the{{tag|det}}{{tag|def}} rat{{tag|n}}{{tag|sg}} }}
 +
 +
==== Biltrans ====
 +
^བྱི་ཙི་{{tag|n}}/rat{{tag|n}}$^དེ་{{tag|det}}/the{{tag|det}}{{tag|def}}$^ཟུང་{{tag|v}}{{tag|tv}}/catch{{tag|vblex}}$
 +
 +
==== Transfer ====
 +
^catch{{tag|vblex}}{{tag|tv}}{{tag|p3}}{{tag|sg}}$ ^the{{tag|det}}{{tag|def}}{{tag|sg}}$ ^rat{{tag|n}}{{tag|m}}{{tag|sg}}$
 +
 +
==== Lex ====
 +
^default{{tag|NP}}{{tag|ND}}{
 +
 +
: ^བྱི་ལི་{{tag|n}}/cat{{tag|n}}$
 +
 +
: ^དེ་{{tag|det}}/the{{tag|det}}{{tag|def}}$
 +
 +
}$
 +
 +
^catch{{tag|VP}}{{tag|tv}}{{tag|PD}}{{tag|ND}}{
 +
^default{{tag|NP}}{{tag|ND}}{
 +
^བྱི་ཙི་{{tag|n}}/rat{{tag|n}}$
 +
^དེ་{{tag|det}}/the{{tag|det}}{{tag|def}}$
 +
}$
 +
^ཟུང་{{tag|v}}{{tag|tv}}/catch{{tag|vblex}}$
 +
}$
 +
 +
== Post-Evaluation ==
 +
=== Statistics about input files ===
 +
*Number of words in reference: 31
 +
*Number of words in test: 13
 +
*Number of unknown words (marked with a star) in test:
 +
*Percentage of unknown words: 0.00 %
 +
 +
=== Results when removing unknown-word marks (stars) ===
 +
*Edit distance: 28
 +
*Word error rate (WER): 90.32 %
 +
*Number of position-independent correct words: 3
 +
*Position-independent word error rate (PER): 90.32 %
 +
 +
=== Results when unknown-word marks (stars) are not removed ===
 +
*Edit distance: 28
 +
*Word Error Rate (WER): 90.32 %
 +
*Number of position-independent correct words: 3
 +
*Position-independent word error rate (PER): 90.32 %
 +
 +
=== Statistics about the translation of unknown words ===
 +
*Number of unknown words which were free rides: 0
 +
*Percentage of unknown words that were free rides: 0%
 +
 +
=== Coverage ===
 +
*Bilingual Coverage: 100%
 +
  
  
 
[[Category:Sp21_StructuralTransfer]] [[Category:Dzongkha]][[Category:English]]
 
[[Category:Sp21_StructuralTransfer]] [[Category:Dzongkha]][[Category:English]]

Latest revision as of 02:41, 25 May 2021

Pre-Evaluation

Statistics about input file

  • Number of words in reference: 31
  • Number of words in test: 10
  • Number of unknown words (marked with a star) in test: 0
  • Percentage of unknown words: 0.00 %

Results when removing unknown-word marks (stars)

  • Edit distance: 31
  • Word error rate (WER): 100.00 %
  • Number of position-independent correct words: 0
  • Position-independent word error rate (PER): 100.00 %

Results when unknown-word marks (stars) are not removed

  • Edit distance: 31
  • Word Error Rate (WER): 100.00 %
  • Number of position-independent correct words: 0
  • Position-independent word error rate (PER): 100.00 %

Statistics about the translation of unknown words

  • Number of unknown words which were free rides: 0
  • Percentage of unknown words that were free rides: 0%

Coverage

  • Bilingual Coverage: 100%

Example

Determiner After Noun

Dzo-Eng

  • (dzo) བྱི་ལི་དེ་ → (eng) the cat

Tagger

  • (dzo) བྱི་ལི་<n> དེ་<det><def> → (eng) the<det><def> cat<n><sg>

Biltrans

^བྱི་ལི་<n>/cat<n>$^དེ་<det>/the<det><def>$

Transfer

^the<det><def><sg>$ ^cat<n><m><sg>$

Lex

^default<NP><ND>{

^བྱི་ལི་<n>/cat<n>$
^དེ་<det>/the<det><def>$

}$

Verb-Noun Order

Dzo-Eng

  • (dzo) བྱི་ཙི་དེ་ཟུང་ → (eng) catch the rat

Tagger

  • (dzo) བྱི་ཙི་<n> དེ་<det><def> ཟུང་<v><tv> → (eng) catch<vblex> the<det><def> rat<n><sg>

Biltrans

^བྱི་ཙི་<n>/rat<n>$^དེ་<det>/the<det><def>$^ཟུང་<v><tv>/catch<vblex>$

Transfer

^catch<vblex><tv><p3><sg>$ ^the<det><def><sg>$ ^rat<n><m><sg>$

Lex

^default<NP><ND>{

^བྱི་ལི་<n>/cat<n>$
^དེ་<det>/the<det><def>$

}$

^catch<VP><tv><PD><ND>{ ^default<NP><ND>{ ^བྱི་ཙི་<n>/rat<n>$ ^དེ་<det>/the<det><def>$ }$ ^ཟུང་<v><tv>/catch<vblex>$ }$

Post-Evaluation

Statistics about input files

  • Number of words in reference: 31
  • Number of words in test: 13
  • Number of unknown words (marked with a star) in test:
  • Percentage of unknown words: 0.00 %

Results when removing unknown-word marks (stars)

  • Edit distance: 28
  • Word error rate (WER): 90.32 %
  • Number of position-independent correct words: 3
  • Position-independent word error rate (PER): 90.32 %

Results when unknown-word marks (stars) are not removed

  • Edit distance: 28
  • Word Error Rate (WER): 90.32 %
  • Number of position-independent correct words: 3
  • Position-independent word error rate (PER): 90.32 %

Statistics about the translation of unknown words

  • Number of unknown words which were free rides: 0
  • Percentage of unknown words that were free rides: 0%

Coverage

  • Bilingual Coverage: 100%