Linhui Xiao's Projects
Official implementation of PVT series
All Algorithms implemented in Python
Many Class Activation Map methods implemented in PyTorch for CNNs and Vision Transformers, with examples for classification, object detection, segmentation, embedding networks and more. Includes Grad-CAM, Grad-CAM++, Score-CAM, Ablation-CAM and XGrad-CAM
PyTorch image models, scripts, pretrained weights -- ResNet, ResNeXT, EfficientNet, EfficientNetV2, NFNet, Vision Transformer, MixNet, MobileNet-V3/V2, RegNet, DPN, CSPNet, and more
PyTorch 1.x tutorials, examples, and some books I found (updated irregularly): a curated collection of the latest PyTorch 1.x tutorials, examples, and books
Kalman filtering via RcppArmadillo
Referring Expression Parser
RelTR: Relation Transformer for Scene Graph Generation: https://arxiv.org/abs/2201.11460v2
RoBERTa for Chinese: a Chinese pre-trained RoBERTa model
A new codebase for popular Scene Graph Generation methods (2020). Visualization and Scene Graph Extraction on custom images/datasets are provided. It is also a PyTorch implementation of the paper “Unbiased Scene Graph Generation from Biased Training” (CVPR 2020)
Image scene graph generation benchmark
A Python toolkit for parsing captions (in natural language) into scene graphs (as symbolic representations).
Annotated ORB-SLAM2 source code, based on the annotated version by 泡泡机器人 (Paopao Robot)
Code release for SLIP: Self-supervision meets Language-Image Pre-training
PySlowFast: video understanding codebase from FAIR for reproducing state-of-the-art video models.
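Exercises from Barfoot's book, State Estimation for Robotics
A collection of books, papers and docs.
Mainly a collection of good articles I have read day to day, kept for my own convenience and shared with everyone. Articles I have read get a brief commentary; those I have not read yet are listed first, and the commentary is added once I have read them. Covers search, recommendation, and natural language processing.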
An Open Source Machine Learning Framework for Everyone
Plot vector graphics for attention-based text visualisation
🤗 Transformers: State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX.
Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities
Reference models and tools for Cloud TPUs.
A Robust and Versatile Monocular Visual-Inertial State Estimator
Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in PyTorch
Official PyTorch implementation of our CVPR 2022 paper: Beyond a Pre-Trained Object Detector: Cross-Modal Textual and Visual Context for Image Captioning