Code Monkey home page Code Monkey logo

paper-notes's Introduction

paper-notes

paper notes will be moved to medium for easier publishing

paper notes on machine learning

2016-11 (ICLR 2017)

Reinforcement Learning:

  • A Connection between Generative Adversarial Networks, Inverse Reinforcement Learning, and Energy-Based Models [arXiv]
  • The Predictron: End-To-End Learning and Planning [OpenReview]
  • Third-Person Imitation Learning [OpenReview]
  • Generalizing Skills with Semi-Supervised Reinforcement Learning [OpenReview]
  • Sample Efficient Actor-Critic with Experience Replay [OpenReview]
  • [Reinforcement Learning with Unsupervised Auxiliary Tasks][OpenReview]
  • Neural Architecture Search with Reinforcement Learning [OpenReview]
  • Towards Information-Seeking Agents [OpenReview]
  • Multi-Agent Cooperation and the Emergence of (Natural) Language [OpenReview]
  • Improving Policy Gradient by Exploring Under-appreciated Rewards [OpenReview]
  • Stochastic Neural Networks for Hierarchical Reinforcement Learning [OpenReview]
  • Tuning Recurrent Neural Networks with Reinforcement Learning [OpenReview]
  • RL^2: Fast Reinforcement Learning via Slow Reinforcement Learning [arXiv]
  • Learning Invariant Feature Spaces to Transfer Skills with Reinforcement Learning [OpenReview]
  • Learning to Perform Physics Experiments via Deep Reinforcement Learning [OpenReview]
  • Reinforcement Learning through Asynchronous Advantage Actor-Critic on a GPU [OpenReview]
  • Learning to Compose Words into Sentences with Reinforcement Learning[OpenReview]
  • Deep Reinforcement Learning for Accelerating the Convergence Rate [OpenReview]
  • Exploration: A Study of Count-Based Exploration for Deep Reinforcement Learning [OpenReview]
  • Learning to Compose Words into Sentences with Reinforcement Learning [OpenReview]
  • Learning to Navigate in Complex Environments [arXiv]
  • Unsupervised Perceptual Rewards for Imitation Learning [OpenReview]
  • Q-Prop: Sample-Efficient Policy Gradient with An Off-Policy Critic [OpenReview]
  • Neural Architecture Search with Reinforcement Learning [OpenReview]

NLP

  • Google's Multilingual Neural Machine Translation System: Enabling Zero-Shot Translation [arXiv]
  • [Neural Machine Translation with Reconstruction][arXiv]
  • Iterative Refinement for Machine Translation [OpenReview]
  • A Convolutional Encoder Model for Neural Machine Translation [arXiv]
  • Improving Neural Language Models with a Continuous Cache [OpenReview]
  • Vocabulary Selection Strategies for Neural Machine Translation [OpenReview]
  • Towards an automatic Turing test: Learning to evaluate dialogue responses [OpenReview]
  • Dialogue Learning With Human-in-the-Loop [OpenReview]
  • Batch Policy Gradient Methods for Improving Neural Conversation Models [OpenReview]
  • Learning through Dialogue Interactions [OpenReview]
  • [Dual Learning for Machine Translation][arXiv]
  • Unsupervised Pretraining for Sequence to Sequence Learning [arXiv]
  • On orthogonality and learning recurrent networks with long term dependencies[OpenReview]
  • Efficient Vector Representation for Documents through Corruption [OpenReview]
  • Generating Long and Diverse Responses with Neural Conversation Models[OpenReview]
  • A Joint Many-Task Model: Growing a Neural Network for Multiple NLP Tasks[OpenReview]
  • Structured Sequence Modeling with Graph Convolutional Recurrent Networks[OpenReview]
  • Wav2Letter: an End-to-End ConvNet-based Speech Recognition System[OpenReview]
  • LEARNING A NATURAL LANGUAGE INTERFACE WITH NEURAL PROGRAMMER [OpenReview]
  • Dynamic Coattention Networks For Question Answering[OpenReview]
  • Quasi-Recurrent Neural Networks[OpenReview]
  • A Joint Many-Task Model: Growing a Neural Network for Multiple NLP Tasks [OpenReview]
  • Adversarial Training Methods for Semi-Supervised Text Classification[OpenReview]
  • A Neural Knowledge Language Model[OpenReview]

One-shot learning/ transfer-learning

  • Optimization as a Model for Few-Shot Learning[OpenReview]
  • Learning to Remember Rare Events[OpenReview]
  • Transfer Learning for Sequence Tagging with Hierarchical Recurrent Networks[OpenReview]

CNN

  • LOCALLY CONNECTED CONVOLUTIONAL NEURAL NETWORKS ON GRAPH-STRUCTURED DATA[OpenReview]
  • Semi-Supervised Classification with Graph Convolutional Networks[OpenReview] semi-supervised learning -Neural Graph Machines: Learning Neural Networks Using Graphs[OpenReview]

misc

  • An Analysis of Deep Neural Network Models for Practical Applications[OpenReview]

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.