Misc tools

From LING073
Jump to: navigation, search

Installing on your own system

NOTICE: These tools should be installed on the lab computers already; this section is just for those of you who would like to use the tools on your own system.

You can install miscellaneous tools from this repo, like morphTests2regtest and lexccounter, as follows:

cd ~/ling073
git clone git@github.swarthmore.edu:Ling073-sp22/tools.git
cd tools
make install

After you've already cloned the repo, you can update to the most recent version like this:

cd tools
git pull
make install

morphTests2regtest

You can scrape {{morphTest}}s from any page on this wiki for use with apertium-regtest like this:

morphTests2regtest "Language/Grammar" -l xyz

Run this from inside your transducer repository. It populates the test/ directory in your transducer. Be sure to commit the files.

morphTests2yaml

NOTICE: We're using morphTests2regtest (see above) now.

You can scrape {{morphTest}}s from any page on this wiki for use with morph-test or aq-morphtest like this:

morphTests2yaml "Language/Grammar" -l xyz

Put the file in a tests/ directory in your transducer repo and commit the file.

lexccounter

This script counts the number of unique entries in a lexc file.

$ lexccounter apertium-xyz.xyz.lexc 
Unique entries: 127

apertium-eval-translator

You need two files:

  • one test translation, and
  • one reference translation

The reference translation is the parallel text in your corpus, e.g. abc.tests.txt. To get a test translation, run the source text (xyz.tests.txt) through apertium and direct the output into a new file, e.g.

 cat xyz.tests.txt | apertium -d . xyz-abc > xyz-abc.tests.txt

scrapeTransferTests

You can scrape the transferTests from your contrastive grammar page into a small parallel corpus. E.g.,

 scrapeTransferTests -p abc-xyz "Language1_and_Language2/Contrastive_Grammar"

will result in an abc.tests.txt and xyz.tests.txt file that contain the respective sides of any transferTests on your contrastive grammar page specified as being for abc to xyz translation.


2017 version of this page