Code Monkey home page Code Monkey logo

Hi there 👋

I am a third year PhD student at ETH Zürich in Switzerland, supervised by Prof. Mrinmaya Sachan and Nicholas Monath from Google DeepMind. My PhD research investigate how generative LLMs reason, what factors affect their reasoning performance, and how to optimize their reasoning skills.

Twitter

Kumar Shridhar's Projects

litbank-1 icon litbank-1

Annotated dataset of 100 works of fiction to support tasks in natural language processing and the computational humanities.

llama icon llama

Inference code for LLaMA models

longtonotes icon longtonotes

LongtoNotes: OntoNotes with Longer Coreference Chains

pypostal icon pypostal

Python bindings to libpostal for fast international address parsing/normalization

pytorch icon pytorch

Tensors and Dynamic neural networks in Python with strong GPU acceleration

pytorch-bayesiancnn icon pytorch-bayesiancnn

Bayesian Convolutional Neural Network with Variational Inference based on Bayes by Backprop in PyTorch.

pytorch-nlp icon pytorch-nlp

Supporting Rapid Prototyping with a Toolkit (incl. Datasets and Neural Network Layers)

pyvarinf icon pyvarinf

Python package facilitating the use of Bayesian Deep Learning methods with Variational Inference for PyTorch

screws icon screws

SCREWS: A Modular Framework for Reasoning with Revisions

sentence-vae icon sentence-vae

PyTorch Re-Implementation of "Generating Sentences from a Continuous Space" by Bowman et al 2015 https://arxiv.org/abs/1511.06349

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.