Code Monkey home page Code Monkey logo

cvpr-2023-track2-8th's Introduction

CVPR 2023第一届大模型比赛Track2 第8名方案

We follow the code described in the CVPR2023 paper titled "Cross-Modal Implicit Relation Reasoning and Aligning for Text-to-Image Person Retrieval"

Requirements

Usage

Requirements

we use single RTX3090 24G GPU for training and evaluation.

pytorch 1.9.0
torchvision 0.10.0
prettytable
easydict

Prepare Datasets

Organize your data folder as follows:

|-- data/
|       |-- train
|           |-- train_images/
|           |-- train_label.txt
|       |-- test
|           |-- test_images/
|           |-- test_label.txt
|       |-- val
|           |-- val_images/
|           |-- val_label.txt
|-- logs/
|       |-- retrival1
|       |-- retrival2

The logs can be download from: 链接:https://pan.baidu.com/s/1pcxTXMNkKEaQoXGTPWT7qg 提取码:0wfp

Training

CUDA_VISIBLE_DEVICES=0 python train.py \
--name iira \
--img_aug \
--batch_size 96 \
--MLM \
--loss_names 'sdm+id' \
--dataset_name 'ImageRetri' \
--root_dir './data' \
--num_epoch 30 \
--output_dir logs/retrival1

CUDA_VISIBLE_DEVICES=0 python train.py \
--name iira \
--img_aug \
--batch_size 96 \
--MLM \
--loss_names 'sdm+id+oim' \
--dataset_name 'ImageRetri' \
--root_dir './data' \
--num_epoch 30 \
--output_dir logs/retrieval2

Testing

CUDA_VISIBLE_DEVICES=0 python inference_person.py --config_file ./logs/retrival1/configs.yaml
CUDA_VISIBLE_DEVICES=0 python inference_person.py --config_file ./logs/retrival2/configs.yaml
python ensemble.py

cvpr-2023-track2-8th's People

Contributors

echoxvf avatar

Stargazers

Chengxin avatar 摇曳的快乐 avatar

Watchers

摇曳的快乐 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.