Comments (4)
Note: if your dataset is small and you still use the default queue size of 65536, you should revise the code to handle this. For example, if the dataset has just 5k images, the queue is more than 10x the dataset size (65536 / 5000 ≈ 13), which means it contains more than 10 positive samples for each query, all of which are treated as negatives. These positive samples should be removed from the negatives for the loss to make sense. This is not a problem on ImageNet with 1.28M images, as the queue is negligibly small by comparison. But it should be handled with some care for smaller datasets.
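The removal of positives from the queue can be sketched as follows. This is a minimal pure-Python illustration, not the repository's code: it assumes you additionally store the dataset index of every queue entry (`queue_ids`, a hypothetical piece of bookkeeping the original queue does not have), and it uses a temperature of 0.07 as an assumed default.

```python
import math


def expected_copies_in_queue(queue_size, dataset_size):
    """Average number of queue entries per dataset image, assuming the queue
    is filled by repeated full passes over the dataset."""
    return queue_size / dataset_size


def masked_info_nce(l_pos, l_neg, queue_ids, query_id, temperature=0.07):
    """InfoNCE loss for a single query, skipping queue entries that come from
    the same image as the query.

    l_pos     -- similarity between the query and its positive key
    l_neg     -- similarities between the query and each queue entry
    queue_ids -- dataset index of each queue entry (hypothetical bookkeeping)
    query_id  -- dataset index of the query image
    """
    # Drop queue entries that are actually positives for this query.
    kept = [s for s, i in zip(l_neg, queue_ids) if i != query_id]
    logits = [l_pos / temperature] + [s / temperature for s in kept]
    # Numerically stable log-sum-exp; the positive sits at position 0.
    m = max(logits)
    log_sum = m + math.log(sum(math.exp(x - m) for x in logits))
    return log_sum - logits[0]


if __name__ == "__main__":
    print(expected_copies_in_queue(65536, 5000))
    # The entry with id 7 is a stale feature of the query image itself.
    print(masked_info_nce(1.0, [1.0, 0.0], queue_ids=[7, 3], query_id=7))
```

In a batched tensor implementation the same idea would usually be expressed by setting the masked logits to a large negative value before the cross-entropy, rather than filtering a list.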
The initial increase in loss occurs because the queue is being filled: the task gets harder as real features replace the initial noise in the queue. Other than that, we do not have enough information to diagnose your case, as you used a different dataset. In the latest version of our arXiv paper (Appendix A.9), there is a curve on ImageNet for your reference, and the ImageNet curves can be reproduced if you run our code following our settings.
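As a rough guide to how long this fill-up phase lasts, the number of iterations needed to replace the initial noise can be estimated from the queue size and the batch size. The sketch below assumes every iteration enqueues exactly one full batch of keys; the batch size of 256 is an assumption for illustration and is not stated in this thread.

```python
import math


def iterations_to_fill_queue(queue_size, batch_size):
    """Iterations until every initial (noise) entry in the queue has been
    replaced, assuming one full batch of keys is enqueued per iteration."""
    return math.ceil(queue_size / batch_size)


if __name__ == "__main__":
    # Assumed batch size of 256 with the default queue size of 65536.
    print(iterations_to_fill_queue(65536, 256))
```

A rising loss over roughly this many iterations at the start of training is consistent with the explanation above; a rise that continues well beyond it would point to something else.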
I ran into the same problem on a different (non-image) dataset. However, when I trained on only the PASCAL VOC 07 dataset, which has just 5000 images, I did not see this problem. So I think this problem is not related to dataset size.
Hi, I tried the same dataset and it didn't work. What was your queue size, and after how many epochs did you see convergence?