Code Monkey home page Code Monkey logo

neji's Introduction

Neji

Neji is a flexible and powerful platform for biomedical information extraction from scientific texts, such as patents, publications and electronic health records.

Table of contents

What is new in Neji 2?

  • Neji Web Server
    • Management of annotation services and respective dictionaries and machine-learning models
    • Web page with interactive annotation for each service
    • REST API for each service
  • Gimli for machine learning NER training
    • Gimli is now easier to use with faster training and processing times. Its functionalities are now integrated into Neji, providing the same high accuracy previously achieved
  • Multiple linguistic parsers support, for general text and multi-language
  • Support to additional input and output formats, including BioC
  • SDK usability improvements
  • Performance improvements
  • Stability improvements

What you can do with Neji?

With Neji you can build text mining processing pipelines for:

  • Rapidly create REST services and interactive web pages for text mining tasks
  • Concept recognition:
    • Dictionary-based, Machine learning-based and Rule-based
  • Train machine learning models for NER (Named Entity Recognition):
    • Normalization with dictionary matching and Stopword filtering
  • Linguistic parsing:
    • Sentence splitting, Tokenisation, Lemmatisation, Chunking and Dependency parsing
  • Convert between corpora formats:
    • Input formats: BioC, XML, HTML and Text
    • Output formats: JSON, A1, BC2, Base64, BioC, CoNLL, IeXML, Pipe and PipeExtended

Quick start

  1. Download and extract the latest version of Neji
  2. Use neji.sh to annotate
  3. Use nejiTrain.sh to train new NER models

Documentation

Neji's documentation is available at https://github.com/BMDSoftware/neji/wiki.

Usage notification

If you are using Neji in your projects, please let us know by sending an e-mail to [email protected] or [email protected].

Bugs and features requests

Have a bug or a feature request?

If your problem or idea is not addressed yet, please open a new issue.

Support and consulting

BMD Software

Please contact BMD Software for professional support and consulting services.

Copyright and license

Copyright (C) 2016 BMD Software and University of Aveiro

Neji is licensed under the Creative Commons Attribution-NonCommercial-ShareAlike 3.0 Unported License. To view a copy of this license, visit http://creativecommons.org/licenses/by-nc-sa/3.0/.

neji's People

Contributors

andrejeronimo avatar shalonteoh avatar davidcampos avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.