Code Monkey home page Code Monkey logo

emnlp2016's Introduction

## Lexicon - Experiment ##
#### Preparation
1. Extract the code and the data:
    $ tar -xvzf data.tar.gz   
    $ tar -xvzf chainn.tar.gz

2. Install all third party software:
    - Moses    : http://www.statmt.org/moses/                   # PBMT
    - GIZA++   : https://github.com/moses-smt/giza-pp           # Word Alignment
    - Travatar : http://www.phontron.com/travatar/              # Hiero
    - Chainer  : http://chainer.org/                            # Neural Network Toolkit
    - MTEval   : https://github.com/odashi/mteval               # Evaluator to calculate NIST and BLEU

3. Copy the script/bash/config-run.sh script/bash/config.sh
    $ cp script/bash/config-run.sh script/bash/config.sh

4. Open config.sh and resolve all the dependencies there!
    $ vi script/bash/config.sh

5. Run prepare.sh, this will create the filtered data and will extract lexical probability from eijiro
    $ ./prepare.sh

##### Execution
6. For convenience, you need to run the baseline experiment first because we will use the lexical translation probabilities
   produced from them:
    $ ./hiero.sh
    $ ./pbmt.sh

7. Let's make the hybrid lexicon first!
    $ ./script/hybrid.sh    

8. Once we have lexicons, you can reproduce the experiments by running btec.sh and kftt.sh:
    $ ./btec.sh
    $ ./kftt.sh

That's it. You can find the result in test/$experiment/$method/test-13.{out,align,result,nist.result}
Enjoy :)

emnlp2016's People

Watchers

James Cloos avatar Philip Arthur avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.