Code Monkey home page Code Monkey logo

lgur's Introduction

Learning Granularity-Unified Representations for Text-to-Image Person Re-identification

This is the codebase for our ACM MM 2022 paper.

datasets
└── cuhkpedes
    ├── captions.json
    └── imgs
        ├── cam_a
        ├── cam_b
        ├── CUHK01
        ├── CUHK03
        ├── Market
        ├── test_query
        └── train_query
└──icfgpedes
    ├── ICFG-PEDES.json
    └── ICFG_PEDES
        ├── test
        └── train

Download DeiT-small weights

wget https://dl.fbaipublicfiles.com/deit/deit_small_distilled_patch16_224-649709d9.pth

Process image and text datasets

python processed_data_singledata_CUHK.py
python processed_data_singledata_ICFG.py

Train

python train_mydecoder_pixelvit_txtimg_3_bert.py

Citation

If you find this project useful for your research, please use the following BibTeX entry.

@inproceedings{shao2022learning,
  title={Learning Granularity-Unified Representations for Text-to-Image Person Re-identification},
  author={Shao, Zhiyin and Zhang, Xinyu and Fang, Meng and Lin, Zhifeng and Wang, Jian and Ding, Changxing},
  booktitle={Proceedings of the 30th ACM International Conference on Multimedia},
  year={2022}
}

lgur's People

Contributors

zhiyinshao-h avatar

Stargazers

Xinyi Wu avatar  avatar L avatar zzp avatar YukiWoo avatar  avatar  avatar  avatar  avatar  avatar  avatar tttbbq avatar hugyyy avatar Lin Min avatar ampulla avatar Feng Chen avatar Liviu-Daniel avatar  avatar Xinyu Zhang avatar Aotle avatar Vibhu Dubey avatar  avatar  avatar zhu zhang avatar  avatar ZhaoShuai avatar  avatar runcongma avatar An-zhi WANG avatar xmu-xiaoma666 avatar panqihe avatar cyberPanda avatar

Watchers

Feng Chen avatar Zijie Wang avatar  avatar

lgur's Issues

baseline

您好,感谢您的开源!
关于文章的baseline在代码中并没有显式的控制方式,请问baseline的实现大概是什么样的呢?
对于ImageExtract和TextExtract的输出并不是相同维度的,似乎也无法直接计算idloss之类,您方便讲解一下吗

关于论文中ID loss,有些疑问

我不太明白的地方是,是怎么用prototype获取的identity概率,感觉文章里描述的不是特别清楚,看文章里的意思,ID loss应该是与映射后的特征没有关系,那这样不就意味着四个不同的映射后的特征计算出来的loss应该是一样的吗?可能是我哪里理解的有误吧, 希望拨冗指教

Corss-Domain的结果

cross-domain的结果感觉报的高了,实现不出来论文里的结果,能否给个pretrain的结果,就是你们训练好的最好结果的模型,看看cross-domain的结果是否正确。

About the implementation detail on ICPG dataset.

Hi, thank you for your elegant work. When I run the code on ICPG-PEDES, the result is something different. I directly change the dataset name in the code and keep other parameters. Should I change other parameters on this dataset?
image

Code

您好,请问您提供的代码是最终的版本吗?我保持参数一致,但训练结果低于您论文中的数据。

dataset

请问您的数据集从哪里下载呢

Update "model_TransREID/backbones/vit_pytorch.py" code

line line 30 of "model_TransREID/backbones/vit_pytorch.py", should be updated to

TORCH_MAJOR = int(torch.__version__.split('.')[0])
TORCH_MINOR = int(torch.__version__.split('.')[1])
if TORCH_MAJOR == 1 and TORCH_MINOR < 8:
        from torch._six import container_abcs,int_classes
else:
        import collections.abc as container_abcs

processed_tokens

There is no "processed_tokens" in the json file in the dataset. How can I get "tokens"? Thank you for your generous answer

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.