Code Monkey home page Code Monkey logo

SERIS Research Lab (Secure Efficient Reproducible Systems)

Multimodal Transformers

Ben's GitHub stats GitHub Streak

Ben Chou's Projects

mae icon mae

PyTorch implementation of MAE https//arxiv.org/abs/2111.06377

mask_rcnn icon mask_rcnn

Mask R-CNN for object detection and instance segmentation on Keras and TensorFlow

midi-ddsp icon midi-ddsp

Synthesis of MIDI with DDSP (https://midi-ddsp.github.io/)

miditok icon miditok

MIDI / symbolic music tokenizers for Deep Learning models 🎶

mir_eval icon mir_eval

Evaluation functions for music/audio information retrieval/signal processing algorithms.

mt3 icon mt3

MT3: Multi-Task Multitrack Music Transcription

multi-modal-transformer-reading icon multi-modal-transformer-reading

The repository collects many various multi-modal transformer architectures, including image transformer, video transformer, image-language transformer, video-language transformer and self-supervised learning models. Additionally, it also collects many useful tutorials and tools in these related domains.

muzic icon muzic

Muzic: Music Understanding and Generation with Artificial Intelligence

rse icon rse

Residual Shuffle-Exchange

slic_superpixels icon slic_superpixels

source code for SLIC super-pixel implementation in Matlab with 2D and 3D examples

tensorflow-yolov4-tflite icon tensorflow-yolov4-tflite

YOLOv4, YOLOv4-tiny, YOLOv3, YOLOv3-tiny Implemented in Tensorflow 2.0, Android. Convert YOLO v4 .weights tensorflow, tensorrt and tflite

tesseract icon tesseract

Tesseract Open Source OCR Engine (main repository)

traffic_sign_classifier icon traffic_sign_classifier

A classifier model implemented with Keras trained to classify images. Goal is to have validation accuracy above 93%. As of initial commit, baseline model I have implemented is le-net performing at 89%.

videomaev2 icon videomaev2

[CVPR 2023] VideoMAE V2: Scaling Video Masked Autoencoders with Dual Masking

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.