Le's Lesk

A fast Python 3 Word-Sense Disambiguation package (WSD) using the extended LESK algorithm
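The gloss-overlap idea behind Lesk-style disambiguation can be sketched in a few lines. This is a toy illustration only, not lelesk's implementation: the sense inventory `TOY_SENSES`, the stopword list, and the function names are all hypothetical, whereas lelesk draws its sense data from WordNet and uses an extended variant of the algorithm.

```python
# Toy illustration of the gloss-overlap idea behind Lesk-style WSD.
# Everything here is hypothetical and hand-written for the example;
# it is NOT how lelesk is implemented.
STOPWORDS = {"a", "an", "the", "to", "of", "i", "at", "and", "that", "or", "for", "in"}

# Hypothetical two-sense inventory for "bank" with hand-written glosses.
TOY_SENSES = {
    "bank": {
        "bank.financial": "an institution that accepts deposits of money and lends money",
        "bank.river": "sloping land beside a river or lake",
    }
}


def tokenize(text):
    """Lowercase, strip basic punctuation, and drop stopwords."""
    tokens = {w.strip(".,!?").lower() for w in text.split()}
    return tokens - STOPWORDS - {""}


def simplified_lesk(word, sentence, senses=TOY_SENSES):
    """Return the sense whose gloss shares the most words with the sentence."""
    context = tokenize(sentence)
    best, best_score = None, -1
    for sense_id, gloss in senses[word].items():
        score = len(context & tokenize(gloss))
        if score > best_score:
            best, best_score = sense_id, score
    return best
```

With this toy inventory, "money" in the context favours the financial sense and "river" favours the riverside sense. A real system needs a full lexical resource and better tie-breaking than "first sense wins".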

Install

lelesk is available on PyPI and can be installed with pip:

pip install lelesk

lelesk uses the NLTK lemmatizer and the yawlib WordNet API. To install the required NLTK data, start a Python prompt, import nltk, and download the data:

$ python3
Python 3.6.9 (default, Jan 26 2021, 15:33:00) 
[GCC 8.4.0] on linux
Type "help", "copyright", "credits" or "license" for more information.
>>> import nltk
>>> nltk.download(['stopwords', 'punkt', 'averaged_perceptron_tagger', 'wordnet'])

Download the yawlib pre-built databases and extract them to ~/wordnet.

For more information:

Command-line tools

To disambiguate a sentence, run this command in the terminal:

python3 -m lelesk wsd "I go to the bank to get money."

To perform word-sense disambiguation on a text file, prepare a file that contains one sentence per line.

For example, here is the content of the file demo.txt:

I go to the bank to withdraw money.
I sat at the river bank.

You can then run either of the following commands:

# output to TTL/JSON (a single file)
python3 -m lelesk file demo.txt demo_wsd_output.json --ttl json

# output to TTL/TSV (multiple TSV files)
python3 -m lelesk file demo.txt demo_wsd_output.json --ttl tsv
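If the sentences come from another Python program, the input file and the command line can be prepared from a script. The sketch below only wraps the `file` command shown above and relies on nothing from lelesk's Python API; the helper names `write_sentences` and `build_file_command` are made up for this example.

```python
import sys
from pathlib import Path


def write_sentences(path, sentences):
    """Write one sentence per line, the layout the `file` command expects."""
    text = "\n".join(s.strip() for s in sentences) + "\n"
    Path(path).write_text(text, encoding="utf-8")


def build_file_command(infile, outfile, ttl_format="json"):
    """Build the argument list for `python3 -m lelesk file ...` (hypothetical helper).

    ttl_format mirrors the --ttl option from the commands above ("json" or "tsv").
    """
    return [sys.executable, "-m", "lelesk", "file",
            str(infile), str(outfile), "--ttl", ttl_format]
```

To run the command, pass the returned list to `subprocess.run(cmd, check=True)` in an environment where lelesk, the NLTK data, and the yawlib databases are installed.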

Issues

If you run into any issues, please report them at https://github.com/letuananh/lelesk/issues

lelesk's Issues

installation problem

The install procedure is not clear. It looks like lelesk is installed with NLTK? I installed nltk and after that I got

$ python3 lelesk/wsdtk.py -W car -x 'the car is on the road'
Traceback (most recent call last):
  File "lelesk/wsdtk.py", line 60, in <module>
    from .main import LeLeskWSD, LeskCache
ModuleNotFoundError: No module named '__main__.main'; '__main__' is not a package
leme:lelesk ar$ pwd
/Users/ar/work/lelesk

It is not clear how to install this package. Can you elaborate on the instructions?

Doc for Python import and use inside scripts or program

Hello and thank you for making your code available to the public.
A couple of questions:

  • Is there any documentation, with usage examples, for the methods and classes that can be imported from an IPython shell or from within Python code? (The alternative would be to dig into your code directly.) Concretely, I am interested in using 'lelesk' in a way similar to 'lesk', i.e. pass in an ambiguous term and a context sentence and get back the most likely synset.

  • I installed yawlib as well as the db files in ~/wordnet and checked that yawlib is operational (python -m yawlib lemma research), but python -m lelesk wsd "I go to the bank to get money" produces a segmentation fault from within a Python 3.7.1 virtual environment. Any idea why that could be? My virtual environment is set up with pyenv, for what it's worth.
