Comments (4)
@zhuhongyue I trained on two Titan X, and finished in about 12 hours for 30 iterations. The GPU-Util
of your posted info is 0%
, but the Memory-Usage
is normal. I guess that there is some problem with your dataloader, maybe because of the io of your disc. You may try watch -n 0.5 nvidia-smi
to see whether the GPU-Util
always stays around 0%
.
from domainadaptivereid.
@LcDog
Thank you for your reply!
I noticed the issue about GPU-Util which remain 0% most of time. Thus, considering your suggestion, I check my disk state and find nothing strange. I guess the problem is related to the dataloader.
So I run your program step by step and I notice that:
source_features, _ = extract_features(model, src_extfeat_loader, print_freq=args.print_freq)
target_features, _ = extract_features(model, tgt_extfeat_loader, print_freq=args.print_freq)
rerank_dist = re_ranking(source_features, target_features, lambda_value=args.lambda_value)
these three code cost around 30 miniutes every iteration and I suppose it means only these data preprocessing cost about 15 hours in 30 iterations.
I am wondering how much time these code cost in your environment?
In addition, because of the limitation of gpu resources, I cannot test how much time the train part cost every iteration while another gpu program is running. Hence I will check the training time tommorow.
from domainadaptivereid.
@zhuhongyue extract_features
is done on GPU, but re_ranking
is written by numpy, thus the process is running on CPU. Our CPU is Intel(R) Xeon(R) CPU E5-2620 v3 @ 2.40GHz
and re_ranking
cost less than 5mins. Besides, extract_features
copy CUDA variables to CPU memory, and this process also heavily depends on the hardware...
You can set lambda_value=0
for a quiker version. Though the results maybe more unstable with this setting.
from domainadaptivereid.
When I set only one gpu visible, the whole training process cost about 15 hours and the gpu utilization rate become normal as expected.
I suppose this issue is solved.
by the way, my command is
CUDA_VISIBLE_DEVICES=0 python2 selftraining.py --src_dataset dukemtmc\
--tgt_dataset market1501\
--resume dukemtmc_trained.pth.tar\
--data_dir data \
--logs_dir log
from domainadaptivereid.
Related Issues (20)
- Question about Dataloader HOT 5
- Training without d_w HOT 1
- How to solve the model saturation? HOT 2
- Where is this paper published? HOT 1
- duke dataset link Invalid. Can you please update? HOT 2
- Question about Msmt dataset HOT 2
- Errors in run
- UnicodeDecodeError: 'ascii' codec can't decode byte 0x91 in position 0: ordinal not in range(128) HOT 2
- When I run the shell : sh run.sh it got an error like this: ImportError: cannot import name 'pinvh'.
- Hello,i find you use "tri_mat = np.triu(rerank_dist, 1)" in selftraining.py HOT 1
- duke2market results HOT 2
- i train the model on dataset dukemtmc, how can i evaluate the model on dataset market1501 before Adaptation HOT 1
- MSMT17 dataset process
- Data selection of PKU-VehicleID HOT 1
- about re_rank code HOT 1
- about pre train model
- MSMT17 setting HOT 2
- source_train.py
- requirements
- Dukemtmc is unavailable HOT 3
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from domainadaptivereid.