Code Monkey home page Code Monkey logo

tokenizer_api's Introduction

1) Files included in this package:
- tokenizer_api (root folder):
     - README: this file
     - setup.py: script to build a python package with setuptools, including
             a description of the package's contents, metadata and dependencies
     - setup.cfg: configuration options for setup.py
     - MANIFEST.in: more details on the configuration of setup.py; it is meant 
             to allow non-'.py' files to be installed as well.
     - pytest.ini: setting some of the parameters for pytest's configuration
     - usage_example.py: code to ilustrate how to lauch the RESTful API
     - .gitignore: lists the files that should not be uploaded to git
     - core (the folder with the package's code):
         - __init__.py: empty file to mark where the package's code is to be found
         - tokenizer.py: code to build and launch the RESTful API,
                 as well as to tokenize the strings passed as input
         - collocations_dict.pickle: python dictionary containing 
                 the Oxford Online Collocations Dictionary
     - test (folder with code to test the package):
         - api_test.py: code to test the accessibility and error handling
                 of the RESTful API's POST method
         - conftest.py: code to generate a pytest.fixture for api_test.py
         - process_text_test.py: code to test the function designed to
                 tokenize strings, including its ability to raise the necessary
                 exceptions
     - documentation (folder with pdf files):
         - Problem Statement.pdf: the PDF the company looking to hire me supplied me with, 
         which includes the problem statement
         - to_git_or_not.pdf: the PDF where I explain how I have decided to use 
                 git and why

2) Package Installation:
    To install the package, please run 
    >> python setup.py build
    from the root directory */tokenizer_api
    
    Once that is done, you should run
    >> python -m pip install .
    which will install the package in your active python environment
    
    You can now import custom_tokenizer (for an example, see usage_example.py)
    

3) Test Suite:
    The tests has been developed using pytest. 
    To run the tests, simply execute
    >> python setup.py test
    from the root directory */tokenizer_api
    
    Don't be surprised to find out there are two tests the system is failing.
    This is part of its expected behaviour. 
    

tokenizer_api's People

Contributors

rtapiaoregui avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.