Difference between revisions of "Fijian and English/Structural transfer"

From LING073
Jump to: navigation, search
(eng → fij)
(eng → fij)
Line 73: Line 73:
 
*Taggers: <code>^this/this<det><dem><sg>$ ^big/big<adj><sint>$ ^village/village<n><sg>$^./.<sent>$</code>
 
*Taggers: <code>^this/this<det><dem><sg>$ ^big/big<adj><sint>$ ^village/village<n><sg>$^./.<sent>$</code>
 
*Biltrans: <code>^this<det><dem><sg>/yai<det><dem><sg>$ ^big<adj><sint>/levu<adj>$ ^village<n><sg>/’oro<n><sg>$^.<sent>/.<sent>$</code>
 
*Biltrans: <code>^this<det><dem><sg>/yai<det><dem><sg>$ ^big<adj><sint>/levu<adj>$ ^village<n><sg>/’oro<n><sg>$^.<sent>/.<sent>$</code>
 +
Step 1: remove number tags
 
*Chunker: <code>^dem<det><dem>{^yai<det><dem>$}$ ^adj<adj><sint>{^levu<adj>$}$ ^nom<n>{^’oro<n>$}$^sent<SENT>{^.<sent>$}$</code>
 
*Chunker: <code>^dem<det><dem>{^yai<det><dem>$}$ ^adj<adj><sint>{^levu<adj>$}$ ^nom<n>{^’oro<n>$}$^sent<SENT>{^.<sent>$}$</code>
 +
Step 2: change word orders
 
*Interchunk: <code>^nom<n>{^’oro<n>$}$ ^adj<adj><sint>{^levu<adj>$}$ ^dem<det><dem>{^yai<det><dem>$}$^sent<SENT>{^.<sent>$}$</code>
 
*Interchunk: <code>^nom<n>{^’oro<n>$}$ ^adj<adj><sint>{^levu<adj>$}$ ^dem<det><dem>{^yai<det><dem>$}$^sent<SENT>{^.<sent>$}$</code>
 +
*Postchunk: <code>^’oro<n>$ ^levu<adj>$ ^yai<det><dem>$^.<sent>$</code>
 +
*Translation: <code>’oro levu yai</code>
 +
*Problem: how to add an extra word--the article ''a'' in Fijian?
  
 
==fij → eng==
 
==fij → eng==

Revision as of 10:56, 12 April 2018

Initial Evaluation

  • eng → fij
-WER:75.81%
-PER:70.97%
Test file: 'eng-fij.tests.txt'
Reference file 'fij.tests.txt'

Statistics about input files
-------------------------------------------------------
Number of words in reference: 62
Number of words in test: 57
Number of unknown words (marked with a star) in test: 
Percentage of unknown words: 0.00 %

Results when removing unknown-word marks (stars)
-------------------------------------------------------
Edit distance: 47
Word error rate (WER): 75.81 %
Number of position-independent correct words: 18
Position-independent word error rate (PER): 70.97 %

Results when unknown-word marks (stars) are not removed
-------------------------------------------------------
Edit distance: 47
Word Error Rate (WER): 75.81 %
Number of position-independent correct words: 18
Position-independent word error rate (PER): 70.97 %

Statistics about the translation of unknown words
-------------------------------------------------------
Number of unknown words which were free rides: 0
Percentage of unknown words that were free rides: 0%
  • fij → eng
-WER:82.14%
-PER:76.79%
Test file: 'fij-eng.tests.txt'
Reference file 'eng.tests.txt'

Statistics about input files
-------------------------------------------------------
Number of words in reference: 56
Number of words in test: 57
Number of unknown words (marked with a star) in test: 
Percentage of unknown words: 0.00 %

Results when removing unknown-word marks (stars)
-------------------------------------------------------
Edit distance: 46
Word error rate (WER): 82.14 %
Number of position-independent correct words: 14
Position-independent word error rate (PER): 76.79 %

Results when unknown-word marks (stars) are not removed
-------------------------------------------------------
Edit distance: 46
Word Error Rate (WER): 82.14 %
Number of position-independent correct words: 14
Position-independent word error rate (PER): 76.79 %

Statistics about the translation of unknown words
-------------------------------------------------------
Number of unknown words which were free rides: 0
Percentage of unknown words that were free rides: 0%

eng → fij

I am working on the first contrastive grammar point--word orders within NPs. An English NP has the demonstrative and adjectives preceding the head noun, while Fijian has all of the modifiers following the head noun. Besides, the article in front of the head noun is obligatory even if a demonstrative is present, but its corresponding translation in English--the determiner the--is not allowed to occur when there is a demonstrative in the NP. In terms of tags, Fijian does not distinguish numbers on either nouns or demonstratives.

  • Example phrase: (eng) this big village → (fij) a ’oro levu yai
  • Current translation to Fijian:#yai levu #’oro
  • Taggers: ^this/this<det><dem><sg>$ ^big/big<adj><sint>$ ^village/village<n><sg>$^./.<sent>$
  • Biltrans: ^this<det><dem><sg>/yai<det><dem><sg>$ ^big<adj><sint>/levu<adj>$ ^village<n><sg>/’oro<n><sg>$^.<sent>/.<sent>$

Step 1: remove number tags

  • Chunker: ^dem<det><dem>{^yai<det><dem>$}$ ^adj<adj><sint>{^levu<adj>$}$ ^nom<n>{^’oro<n>$}$^sent<SENT>{^.<sent>$}$

Step 2: change word orders

  • Interchunk: ^nom<n>{^’oro<n>$}$ ^adj<adj><sint>{^levu<adj>$}$ ^dem<det><dem>{^yai<det><dem>$}$^sent<SENT>{^.<sent>$}$
  • Postchunk: ^’oro<n>$ ^levu<adj>$ ^yai<det><dem>$^.<sent>$
  • Translation: ’oro levu yai
  • Problem: how to add an extra word--the article a in Fijian?

fij → eng

Post-evaluation