Comments (7)
This is my setup for MSVD: DATA_PATH=./Cap4Video/MSVD-Frames python -m torch.distributed.launch --nproc_per_node=2 --master_port 2963 train_video.py --do_train --num_thread_reader=4 --epochs=5 --batch_size=128 --n_display=20 --data_path ./Cap4Video/MSVD-Frames/Frames --features_path ./Cap4Video/MSVD-Frames/Frames/MSVD_frames --output_dir ckpts/MSVD-resume --lr 1e-4 --max_words 32 --max_frames 12 --batch_size_val 16 --datatype msvd --feature_framerate 1 --coef_lr 1e-3 --freeze_layer_num 0 --slice_framepos 2 --loose_type --linear_patch 2d --sim_header seqTransf --strategy 2 --pretrained_clip_name ViT-B/32 --interaction wti --text_pool_type transf_avg --world_size 2 ;\
nproc_per_node and world_size is the number of GPUs, in your case it would be 1
According to your design, the first stage train_ Can video. py run? Because I see that this configuration file lacks a lot of path information compared to msrvtt.
May I ask if it is possible to share a successful running code with me? Just send me my email( [email protected] )I want to debug and see the specific process operation inside
Sorry to bother you again, thank you very much for your help!
(按照您的设计第一阶段train_video.py可以运行吗?因为我看这个配置文件相比较于msrvtt少了好多路径信息。
请问是否可以分享我一个成功的运行代码呢?发我邮箱就好([email protected]),我想要debug看看里面的具体流程运转
很抱歉再次打扰到您,万分感谢您的帮助!)
from cap4video.
Hi @shams2023 were you able to reproduce the results for all the datasets?
from cap4video.
你好@shams2023您能够重现所有数据集的结果吗?
No, did you reproduce the msvd dataset?
from cap4video.
I trained the stage 1, but not the stage 2 yet
from cap4video.
我训练了第 1 阶段,但还没有训练第 2 阶段
May I ask how to configure the file and how to set it up? Mine is a single card 3090.
This is the following file:
from cap4video.
This is my setup for MSVD:
DATA_PATH=./Cap4Video/MSVD-Frames
python -m torch.distributed.launch --nproc_per_node=2 --master_port 2963
train_video.py
--do_train --num_thread_reader=4 --epochs=5 --batch_size=128 --n_display=20
--data_path ./Cap4Video/MSVD-Frames/Frames
--features_path ./Cap4Video/MSVD-Frames/Frames/MSVD_frames
--output_dir ckpts/MSVD-resume
--lr 1e-4 --max_words 32 --max_frames 12 --batch_size_val 16
--datatype msvd
--feature_framerate 1 --coef_lr 1e-3
--freeze_layer_num 0 --slice_framepos 2
--loose_type --linear_patch 2d --sim_header seqTransf
--strategy 2
--pretrained_clip_name ViT-B/32
--interaction wti --text_pool_type transf_avg
--world_size 2 ;\
nproc_per_node and world_size is the number of GPUs, in your case it would be 1
from cap4video.
This is my setup: DATA_PATH=./Cap4Video/MSVD-Frames python -m torch.distributed.launch --nproc_per_node=2 --master_port 2963 train_video.py --do_train --num_thread_reader=4 --epochs=5 --batch_size=128 --n_display=20 --data_path ./Cap4Video/MSVD-Frames/Frames --features_path ./Cap4Video/MSVD-Frames/Frames/MSVD_frames --output_dir ckpts/MSVD-resume --lr 1e-4 --max_words 32 --max_frames 12 --batch_size_val 16 --datatype msvd --feature_framerate 1 --coef_lr 1e-3 --freeze_layer_num 0 --slice_framepos 2 --loose_type --linear_patch 2d --sim_header seqTransf --strategy 2 --pretrained_clip_name ViT-B/32 --interaction wti --text_pool_type transf_avg --world_size 2 ;\
nproc_per_node and world_size is the number of GPUs, in your case it would be 1
Thank you for your help! I hope to continue communicating with you!
(感谢你的帮助!希望能继续和你保持交流!)
from cap4video.
Related Issues (20)
- 1
- Which part of the code is the interaction module implemented in? HOT 1
- Question about implementation details. HOT 17
- The parameter batch_first=True is causing an error
- Training requirements HOT 4
- > 在我们的论文中,查询视频分支和查询标题分支是分开训练的。我们首先训练查询视频分支 5 个周期。一旦训练了该分支,我们就继续训练查询标题分支。 HOT 1
- Training script for MSVD, DiDeMo, VATEX
- Requesting code to generate the frames from the MSRVTT dataset HOT 2
- Python, Pytorch, Torchvision, Cudatoolkit versions
- Resume training HOT 1
- some questions HOT 1
- Preprocess for other datasets HOT 1
- Questions about [SEP] token
- Inference on pretrained model
- 在其他数据集上训练
- FileNotFoundError: [Errno 2] No such file or directory: 'data/MSRVTT_test_website_titles.json' HOT 2
- lr=optimizer.get_lr()[0] IndexError: list index out of range
- 是否可以把co_attention_transformer_module.py模块移植到(图像-文本对的交互上面)
- can you provide inference code using text query ?
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from cap4video.