Comments (3)
The code release in this repo is meant for reproducing the results of research papers.
We help users who have trouble using the code to reproduce results in the paper, but in general we do not provide suggestions on other issues such as how to apply the method to a new task or dataset.
from moco.
Hi,i did unsupervised pre-training of a ResNet-50 model on a dataset which contains 122,208 unlabeled bird images and the last epoch log is below:
the loss stucks at ~6.90 which is similar to another closed issue #12. In that issue it seems not tha bad. Is this normal?
Then i use this pretrained model to train and eval on a dataset which contains 3,959 train images and 2000 val images. These images are in 200 categories of birds. I follow the
'''
python main_lincls.py
-a resnet50
--lr 30.0
--batch-size 256
--pretrained [your checkpoint path]/checkpoint_0199.pth.tar
--dist-url 'tcp://localhost:10001' --multiprocessing-distributed --world-size 1 --rank 0
[your imagenet-folder with train and val folders]
'''
however the validate accuracy is quite low (~12%), which is much lower than supervised training method(~60%). I tried serval learning rate (0.1, 5,10, 100.0) but the results seems still bad.
So can i ask how do you set these hyperparameters?Or, the pretrained model is bad? how can i check this probelm?
Thanks!
Hi, how many GPU did you use for training? I have 8 Tesla v100 (32G) GPU, but it still can not afford batch_size 256.
from moco.
嗨,我在包含 122,208 个未标记鸟类图像的数据集上对 ResNet-50 模型进行了无监督预训练,最后一个纪元日志如下: 损失停留在 ~6.90,这类似于另一个已关闭的问题#12。在那个问题上似乎还不错。这是正常的吗?
![]()
然后我使用这个预训练模型对包含 3,959 个训练图像和 2000 个 val 图像的数据集进行训练和评估。这些图像属于 200 种鸟类。我按照 ''' python main_lincls.py -a resnet50 --lr 30.0 --batch-size 256 --pretrained [你的检查点路径]/checkpoint_0199.pth.tar --dist-url 'tcp://localhost:10001 ' --multiprocessing-distributed --world-size 1 --rank 0 [带有 train 和 val 文件夹的 imagenet 文件夹] ''' 但是验证准确度非常低(~12%),远低于监督训练方法(~60%)。我尝试了 serval learning rate (0.1, 5,10, 100.0) 但结果似乎仍然很糟糕。 那请问这些超参数是怎么设置的? 或者,预训练的模型不好?我如何检查这个问题? 谢谢!
I also encountered this problem, have you solved it
from moco.
Related Issues (20)
- cannot reproduce the results of moco-v2 HOT 2
- Question about transfering to COCO with Mocov1 and Mocov2 checkpoint
- Issue about dequeue_and_enqueue HOT 3
- Question about the queue for key encoder HOT 2
- Why labels are all zeros, should first columns of labels be ones? HOT 4
- Issue with batch size HOT 1
- Low Accuracy
- One question about single GPU HOT 2
- How to load the Hyperparameters without command line code Argument Parser?
- About training HOT 2
- what information is leaked due to intra-batch communication? HOT 2
- What is the label format of the cifar-10 dataset? HOT 1
- Concerns about feature dimensionality in MoCo self-training
- Can you tell me dataset structure and how images are named in the dataset HOT 1
- why pretrain from encoder_q? HOT 1
- Question about queue dimension
- How is BN in key-encoder updated (in Moco v1)? HOT 1
- Why is labels = zeros(N) set to zero? HOT 3
- The size of the dictionary HOT 2
- About License
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from moco.