Code Monkey home page Code Monkey logo

awesome-weak-supervision-1's Introduction

awesome-weak-supervision

A curated list of weak and distant supervision papers and tools.

Table of Contents

Introduction

Weak supervision and distant supervision provide ways to (semi-) automatically generate training data for machine learning systems in a fast and efficient manner where normal, supervised training data is lacking. This idea is popular in fields like natural language processing and computer vision and is actively researched. Here, we list interesting papers and tools to help newcomers from both the research and the application side try out weak supervision.

This list was started by the organizers for the WeaSuL Workshop on Weakly Supervised Learning at ICLR'21 and we welcome contributions to extend it.

Contributing

If you want to contribute to this list, just create a pull-request or a new issue. For a paper or tool, please provide all the necessary information (authors, title, conference, link, topic tags, short description). If you are unsure, feel free to open an issue to discuss it. If you encounter any typos, just let us know. Thanks!

Overview Texts

Texts that give a quick start into the topic.

Surveys

Surveys give a broad overview of a field and can allow you to quickly get insights into current trends and issues for future work.

Foundational Papers

Important steps in how we came to the current state of the art.

Books

Libraries and Tools

Open-source libraries and tools already providing implementations that get you started quickly.

  • Cleanlab [ML, CV, NLP] "Python package for machine learning with noisy labels. cleanlab cleans labels and supports finding, quantifying, and learning with label errors in datasets."
  • Knodle [ML, CV, NLP] "Modular weakly supervised learning with PyTorch."
  • Snorkel [ML, CV, NLP] "Programmatically build and manage training data.โ€
  • ANEA [NLP] "A tool to automatically annotate named entities in unlabeled text based on entity lists for the use as distant supervision"

Datasets and Benchmarks

Datasets generated through weak and distant supervision. These works can provide both insights into how to generate weakly supervised data as well as to evaluate your learning algorithms on them.

awesome-weak-supervision-1's People

Contributors

michael-aloys avatar reemalyami avatar beroth avatar bplank avatar

Watchers

James Cloos avatar  avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.