Difference between revisions of "Fijian and English/Structural transfer"

From LING073
Jump to: navigation, search
(eng → fij)
(eng → fij)
Line 73: Line 73:
 
*Taggers: <code>^this/this<det><dem><sg>$ ^big/big<adj><sint>$ ^village/village<n><sg>$^./.<sent>$</code>
 
*Taggers: <code>^this/this<det><dem><sg>$ ^big/big<adj><sint>$ ^village/village<n><sg>$^./.<sent>$</code>
 
*Biltrans: <code>^this<det><dem><sg>/yai<det><dem><sg>$ ^big<adj><sint>/levu<adj>$ ^village<n><sg>/’oro<n><sg>$^.<sent>/.<sent>$</code>
 
*Biltrans: <code>^this<det><dem><sg>/yai<det><dem><sg>$ ^big<adj><sint>/levu<adj>$ ^village<n><sg>/’oro<n><sg>$^.<sent>/.<sent>$</code>
Step 1: remove number tags
+
Step 1: remove number tags and add ''a<art>''
*Chunker: <code>^dem<det><dem>{^yai<det><dem>$}$ ^adj<adj><sint>{^levu<adj>$}$ ^nom<n>{^’oro<n>$}$^sent<SENT>{^.<sent>$}$</code>
+
*Chunker: <code>^dem<det><dem>{^yai<det><dem>$}$ ^adj<adj><sint>{^levu<adj>$}$ ^nom<n>{^a<art> ^’oro<n>$}$^sent<SENT>{^.<sent>$}$</code>
 
Step 2: change word orders
 
Step 2: change word orders
*Interchunk: <code>^nom<n>{^’oro<n>$}$ ^adj<adj><sint>{^levu<adj>$}$ ^dem<det><dem>{^yai<det><dem>$}$^sent<SENT>{^.<sent>$}$</code>
+
*Interchunk: <code>^nom<n>{^a<art> ^’oro<n>$}$ ^adj<adj><sint>{^levu<adj>$}$ ^dem<det><dem>{^yai<det><dem>$}$^sent<SENT>{^.<sent>$}$</code>
*Postchunk: <code>^’oro<n>$ ^levu<adj>$ ^yai<det><dem>$^.<sent>$</code>
+
*Postchunk: <code>^a<art>$ ^’oro<n>$ ^levu<adj>$ ^yai<det><dem>$^.<sent>$</code>
*Translation: <code>’oro levu yai</code>
+
*Translation: <code>a ’oro levu yai</code>
*Problems:
 
:-How to add an extra word--the article ''a'' in Fijian?
 
:-Both demonstratives and adjectives are nonobligatory, but the rule above to change word orders only works when both appear in the NP. (i.e. the word orders for "this village" and "the big village" are still incorrect.) Do I have to write separate rules for them?
 
  
 
==fij → eng==
 
==fij → eng==

Revision as of 16:39, 12 April 2018

Initial Evaluation

  • eng → fij
-WER:75.81%
-PER:70.97%
Test file: 'eng-fij.tests.txt'
Reference file 'fij.tests.txt'

Statistics about input files
-------------------------------------------------------
Number of words in reference: 62
Number of words in test: 57
Number of unknown words (marked with a star) in test: 
Percentage of unknown words: 0.00 %

Results when removing unknown-word marks (stars)
-------------------------------------------------------
Edit distance: 47
Word error rate (WER): 75.81 %
Number of position-independent correct words: 18
Position-independent word error rate (PER): 70.97 %

Results when unknown-word marks (stars) are not removed
-------------------------------------------------------
Edit distance: 47
Word Error Rate (WER): 75.81 %
Number of position-independent correct words: 18
Position-independent word error rate (PER): 70.97 %

Statistics about the translation of unknown words
-------------------------------------------------------
Number of unknown words which were free rides: 0
Percentage of unknown words that were free rides: 0%
  • fij → eng
-WER:82.14%
-PER:76.79%
Test file: 'fij-eng.tests.txt'
Reference file 'eng.tests.txt'

Statistics about input files
-------------------------------------------------------
Number of words in reference: 56
Number of words in test: 57
Number of unknown words (marked with a star) in test: 
Percentage of unknown words: 0.00 %

Results when removing unknown-word marks (stars)
-------------------------------------------------------
Edit distance: 46
Word error rate (WER): 82.14 %
Number of position-independent correct words: 14
Position-independent word error rate (PER): 76.79 %

Results when unknown-word marks (stars) are not removed
-------------------------------------------------------
Edit distance: 46
Word Error Rate (WER): 82.14 %
Number of position-independent correct words: 14
Position-independent word error rate (PER): 76.79 %

Statistics about the translation of unknown words
-------------------------------------------------------
Number of unknown words which were free rides: 0
Percentage of unknown words that were free rides: 0%

eng → fij

I am working on the first contrastive grammar point--word orders within NPs. An English NP has the demonstrative and adjectives preceding the head noun, while Fijian has all of the modifiers following the head noun. Besides, the article in front of the head noun is obligatory even if a demonstrative is present, but its corresponding translation in English--the determiner the--is not allowed to occur when there is a demonstrative in the NP. In terms of tags, Fijian does not have number tags on either nouns or demonstratives.

  • Example phrase: (eng) this big village → (fij) a ’oro levu yai
  • Current translation to Fijian:#yai levu #’oro
  • Taggers: ^this/this<det><dem><sg>$ ^big/big<adj><sint>$ ^village/village<n><sg>$^./.<sent>$
  • Biltrans: ^this<det><dem><sg>/yai<det><dem><sg>$ ^big<adj><sint>/levu<adj>$ ^village<n><sg>/’oro<n><sg>$^.<sent>/.<sent>$

Step 1: remove number tags and add a<art>

  • Chunker: ^dem<det><dem>{^yai<det><dem>$}$ ^adj<adj><sint>{^levu<adj>$}$ ^nom<n>{^a<art> ^’oro<n>$}$^sent<SENT>{^.<sent>$}$

Step 2: change word orders

  • Interchunk: ^nom<n>{^a<art> ^’oro<n>$}$ ^adj<adj><sint>{^levu<adj>$}$ ^dem<det><dem>{^yai<det><dem>$}$^sent<SENT>{^.<sent>$}$
  • Postchunk: ^a<art>$ ^’oro<n>$ ^levu<adj>$ ^yai<det><dem>$^.<sent>$
  • Translation: a ’oro levu yai

fij → eng

Post-evaluation