Code Monkey home page Code Monkey logo

german-nlp's Introduction

German-NLP

Curated list of open-access/open-source/off-the-shelf resources and tools developed with a particular focus on German Awesome

Resources and tools which can be used either off-the-shelf or with minor adjustments and which are currently maintained are primarily chosen for this list. It is deliberately biased in terms of usability and user-friendliness.

Pull requests and suggestions are welcome! See contributing guidelines.

Table of Contents

Text corpora

General-purpose

Historical

Specialized

Swiss German

Learner and Error Corpora

Word lists

Data acquisition

Lists of corpora

Generic resources

Frameworks

Treebanks

Deep learning models and transformers

Annotation

Standards

Linguistic processing

Tokenization / Sentence boundary detection

Stemming

Lemmatization

Morphological analysis

Normalization

Phonology

POS-tagging

Syntactical parsing

Named Entity Recognition

Misc

Text generation

Industry/Applications

Evaluation

Semantic analysis

Datasets

Word embeddings and senses

Sentiment analysis datasets / polarity clues

Sentiment detection

GermEval

(category to improve)

Discourse

Summarization

Psycholinguistics

Speech NLP

Machine Translation

(category to improve)

Parallel corpora

Teaching resources and tutorials

More lists

German

General

Comparable lists

Larger institutional GitHub groups

Contributors

See the list of contributors.

License

CC-BY

german-nlp's People

Contributors

adbar avatar akron avatar heyarne avatar hoffart avatar malteos avatar reckart avatar susannehaaf avatar tsterbak avatar zesch avatar

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.