Code Monkey home page Code Monkey logo

htparser's Introduction

Easy-first Parser

Easy-first dependency parser based on Hierarchical Tree LSTMs

The techniques behind the parser are described in the paper Easy-First Dependency Parsing with Hierarchical Tree LSTMs. Further materials could be found here.

Required software

Train a parsing model

The software requires having a training.conll and development.conll files formatted according to the CoNLL data format.

To train a parsing model with for either parsing architecture type the following at the command prompt:

python src/parser.py --outdir [results directory] --train training.conll --dev development.conll [--extrn path_to_external_embeddings_file]

We use the same external embedding used in Transition-Based Dependency Parsing with Stack Long Short-Term Memory which can be downloaded from the authors github repository and directly here.

Note 1: The reported test result is the one matching the highest development score.

Note 2: The parser calculates (after each iteration) the accuracies excluding punctuation symbols by running the eval.pl script from the CoNLL-X Shared Task and stores the results in directory specified by the --outdir.

Note 3: The external embeddings parameter is optional and could be omitted.

Parse data with your parsing model

The command for parsing a test.conll file formatted according to the CoNLL data format with a previously trained model is:

python src/parser.py --predict --outdir [results directory] --test test.conll [--extrn extrn.vectors] --model [trained model file] --params [param file generate during training]

The parser will store the resulting conll file in the out directory (--outdir).

Citation

If you make use of this software for research purposes, we'll appreciate citing the following:

@article{DBLP:journals/tacl/KiperwasserG16a,
    author    = {Eliyahu Kiperwasser and
                Yoav Goldberg},
    title     = {Easy-First Dependency Parsing with Hierarchical Tree LSTMs},
    journal   = {{TACL}},
    volume    = {4},
    pages     = {445--461},
    year      = {2016},
    url       = {https://transacl.org/ojs/index.php/tacl/article/view/798},
    timestamp = {Tue, 09 Aug 2016 14:51:09 +0200},
    biburl    = {http://dblp.uni-trier.de/rec/bib/journals/tacl/KiperwasserG16a},
    bibsource = {dblp computer science bibliography, http://dblp.org}
}

License

This software is released under the terms of the Apache License, Version 2.0.

Contact

For questions and usage issues, please contact [email protected]

Credits

Eliyahu Kiperwasser

Yoav Goldberg

htparser's People

Contributors

elikip avatar

Stargazers

 avatar

Watchers

 avatar  avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.