zgcr / simpleaicv_pytorch_training_examples

SimpleAICV:pytorch training and testing examples.

License: MIT License

Python 16.51% Shell 0.16% C++ 0.02% Cuda 0.21% Jupyter Notebook 83.11%

pytorch darknet fcos resnet retinanet centernet ttfnet repvgg mae dino vit deeplabv3plus kd regnetx u2net solov2 yolact sam segment-anything

simpleaicv_pytorch_training_examples's Introduction

My column
📢 News!
Introduction
All task training results
Environments
Download my pretrained models and experiments records
Prepare datasets
How to train and test model
How to use gradio demo
Reference
Citation

My column

https://www.zhihu.com/column/c_1692623656205897728

📢 News!

2024/04/15: added support for the segment-anything model: training, testing, a Jupyter example, and a Gradio demo.

Introduction

This repository provides simple training and testing examples for the following tasks:

Task	Supported datasets	Supported networks
Image classification task	CIFAR100, ImageNet1K (ILSVRC2012), ImageNet21K (Winter 2021 release), ACCV2022	ResNet, DarkNet, RepVGG, RegNetX, ViT, VAN
Object detection task	VOC2007 and VOC2012, COCO2017, Objects365 (v2, 2020)	RetinaNet, FCOS, CenterNet, TTFNet, DETR, DINO-DETR
Semantic segmentation task	ADE20K	DeepLabv3+, U2Net
Instance segmentation task	COCO2017	YOLACT, SOLOv2
Knowledge distillation task	ImageNet1K (ILSVRC2012)	KD loss (for ResNet), DML loss (for ResNet)
Contrastive learning task	ImageNet1K (ILSVRC2012)	DINO (for ResNet)
Masked image modeling task	ImageNet1K (ILSVRC2012), ACCV2022	MAE (for ViT)
OCR text detection task	/	DBNet
OCR text recognition task	/	CTC Model
Human matting task	/	PFAN Matting model
Salient object detection task	/	PFAN Segmentation model
Face detection task	/	RetinaFace
Interactive segmentation task	/	SAM (segment-anything)
Image inpainting task	CelebA-HQ, Places365-standard, Places365-challenge	AOT-GAN, TRANSX-LKA-AOT-GAN
Diffusion model task	CIFAR10, CIFAR100, CelebA-HQ, FFHQ	DDPM, DDIM

All task training results

See all task training results in results.md.

Environments

1. This repository only supports Ubuntu (version >= 18.04 LTS).

2. This repository only supports single-node training with PyTorch DDP, on either one GPU or multiple GPUs.

3. Make sure your Python version is >= 3.7.

4. Make sure your PyTorch version is >= 1.10.

5. To use the torch.compile() function, your PyTorch version must be >= 2.0. Use PyTorch 2.0, 2.2, or 2.3; do not use PyTorch 2.1.
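
The version rules above can be checked before starting a run. The following is a minimal sketch, not part of this repository: the function names are hypothetical, and it treats only the PyTorch releases named above (2.0, 2.2, 2.3) as listed for torch.compile(), since later releases are not mentioned here.

```python
import re

# PyTorch (major, minor) releases that the list above names for torch.compile().
# Assumption: anything not named there is treated as "not listed", not as broken.
COMPILE_LISTED = {(2, 0), (2, 2), (2, 3)}


def parse_version(text):
    """Return the leading (major, minor) of a version string such as '2.1.0+cu118'."""
    match = re.match(r"(\d+)\.(\d+)", text)
    if match is None:
        raise ValueError(f"unrecognised version string: {text!r}")
    return int(match.group(1)), int(match.group(2))


def check_environment(python_version, torch_version):
    """Apply the minimum-version rules and return a list of problems (empty if none)."""
    problems = []
    if parse_version(python_version) < (3, 7):
        problems.append("Python >= 3.7 is required")
    if parse_version(torch_version) < (1, 10):
        problems.append("PyTorch >= 1.10 is required")
    return problems


def compile_listed_as_supported(torch_version):
    """True only for the PyTorch releases listed for torch.compile() (2.1 is excluded)."""
    return parse_version(torch_version) in COMPILE_LISTED
```

In a real environment you would pass `platform.python_version()` and `torch.__version__` to these helpers; they are kept free of a torch import so they can run before PyTorch is installed.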

Use pip or conda to install the following packages in your Python environment:

torch
torchvision
pillow
numpy
Cython
colormath
pycocotools
opencv-python
scipy
einops
scikit-image
pyclipper
shapely
imagesize
nltk
tqdm
yapf
onnx
onnxruntime
onnxsim
thop
gradio==4.26.0
transformers==4.41.2
open-clip-torch==2.24.0
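
After installing, it can help to confirm that every package in the list is present and that the three pinned versions match. The sketch below is one possible check, not a script shipped with this repository: `installed_version` and `audit` are hypothetical names, and it relies on `importlib.metadata`, which needs Python 3.8 or newer (one minor version above the minimum stated earlier).

```python
from importlib import metadata

# Exact pins copied from the package list above.
PINNED = {
    "gradio": "4.26.0",
    "transformers": "4.41.2",
    "open-clip-torch": "2.24.0",
}

# Packages the list names without a version constraint.
UNPINNED = [
    "torch", "torchvision", "pillow", "numpy", "Cython", "colormath",
    "pycocotools", "opencv-python", "scipy", "einops", "scikit-image",
    "pyclipper", "shapely", "imagesize", "nltk", "tqdm", "yapf",
    "onnx", "onnxruntime", "onnxsim", "thop",
]


def installed_version(distribution):
    """Return the installed version of a pip distribution, or None if it is absent."""
    try:
        return metadata.version(distribution)
    except metadata.PackageNotFoundError:
        return None


def audit(pinned=PINNED, unpinned=UNPINNED, lookup=installed_version):
    """Return (missing, mismatched): absent packages and pins whose version differs."""
    missing, mismatched = [], {}
    for name in list(unpinned) + list(pinned):
        found = lookup(name)
        if found is None:
            missing.append(name)
        elif name in pinned and found != pinned[name]:
            mismatched[name] = (pinned[name], found)  # (expected, found)
    return missing, mismatched
```

Calling `audit()` with no arguments inspects the current environment. The lookup uses pip distribution names (for example `opencv-python`), not import names (`cv2`), which is why the list can be reused as written.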

If you want to use xformers, install the xformers package from the official GitHub repository:

https://github.com/facebookresearch/xformers

If you want to use the DINO-DETR model, install the MultiScaleDeformableAttention package in your Python environment:

Change into simpleAICV/detection/compile_multiscale_deformable_attention, then run:

chmod +x make.sh
./make.sh

Download my pretrained models and experiments records

You can download all my pretrained models and all my experiment records and checkpoints from Hugging Face or Baidu Netdisk.

If you only want the pretrained models (saved as model.state_dict()), download the pretrained_models folder.

# huggingface
https://huggingface.co/zgcr654321/classification_training/tree/main
https://huggingface.co/zgcr654321/contrastive_learning_training/tree/main
https://huggingface.co/zgcr654321/detection_training/tree/main
https://huggingface.co/zgcr654321/image_inpainting_training/tree/main
https://huggingface.co/zgcr654321/diffusion_model_training/tree/main
https://huggingface.co/zgcr654321/distillation_training/tree/main
https://huggingface.co/zgcr654321/instance_segmentation_training/tree/main
https://huggingface.co/zgcr654321/masked_image_modeling_training/tree/main
https://huggingface.co/zgcr654321/ocr_text_detection_training/tree/main
https://huggingface.co/zgcr654321/ocr_text_recognition_training/tree/main
https://huggingface.co/zgcr654321/human_matting_training/tree/main
https://huggingface.co/zgcr654321/salient_object_detection_training/tree/main
https://huggingface.co/zgcr654321/face_detection_training/tree/main
https://huggingface.co/zgcr654321/interactive_segmentation_training/tree/main
https://huggingface.co/zgcr654321/semantic_segmentation_training/tree/main
https://huggingface.co/zgcr654321/pretrained_models/tree/main

# Baidu-Netdisk
Link: https://pan.baidu.com/s/1yhEwaZhrb2NZRpJ5eEqHBw
Extraction code: rgdo
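
The Hugging Face repositories listed above can also be fetched from Python instead of the browser. This is a hedged sketch under two assumptions: the `huggingface_hub` package is installed (it is not in the package list above), and the URLs point to ordinary model repositories. The helper names `repo_id_from_url` and `download_repo` are hypothetical.

```python
from urllib.parse import urlparse


def repo_id_from_url(url):
    """Turn 'https://huggingface.co/<user>/<repo>/tree/main' into '<user>/<repo>'."""
    parts = [part for part in urlparse(url).path.split("/") if part]
    if len(parts) < 2:
        raise ValueError(f"no '<user>/<repo>' found in {url!r}")
    return "/".join(parts[:2])


def download_repo(url, local_dir):
    """Download every file of the repository behind `url` into `local_dir`."""
    # Imported lazily so repo_id_from_url works without huggingface_hub installed.
    from huggingface_hub import snapshot_download

    return snapshot_download(repo_id=repo_id_from_url(url), local_dir=local_dir)
```

For example, `download_repo("https://huggingface.co/zgcr654321/pretrained_models/tree/main", "./pretrained_models")` would fetch only the pretrained weights. The experiment-record repositories may be much larger, so downloading them one at a time is probably safer.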

Prepare datasets