Comments (3)
Sorry @ABCCASa , I don't understand the question.
So do we still need to use index=0
What does that refer to?
from vision.
Hi, @NicolasHug , Thanks for your reply,
I mean, for example, should I use ["background", "apple", "cat", "bottle", "people"] or ["apple", "cat", "bottle", "people"].
The value at index = 0 should always be background in faster rcnn. But in retinanet, it may not. I looked at the source code, and it threatens each class separately as a binary problem. For a box, if the value is close to 1 (e.g. [0.9, 0.1, 0.1]), then it belongs to this class. Otherwise, it's not. If the box does not belong to any of the classes, it will be the background (e.g., [0.001, 0.001, 0.013]). The retinanet uses sigmoid, not softmax, all class scores sum up do not need to be 1.
from vision.
Does this help: #4106 (comment) ?
the num_classes should include the background which is encoded with 0. During inference the model predicts labels starting from 1.
from vision.
Related Issues (20)
- AttributeError: module 'torchvision.transforms' has no attribute 'v2' HOT 1
- Run all torchvision models in one script. HOT 1
- Build fails: error: unknown type name 'j_decompress_ptr' HOT 3
- Differences in CPU vs CUDA resize for uint8 images HOT 2
- Enable Video models for other tasks
- Can't use gaussian_blur if sigma is a tensor on gpu HOT 3
- Mask r-cnn training runs infinitely without output or error HOT 1
- detection AnchorGenerator Source code issues HOT 1
- Video Reader's get_metadata function fails on videos with sound HOT 2
- Difficulty building on macOS HOT 3
- -
- Typo at `permutate_channels`
- MPS test jobs are failing HOT 3
- Add vision-language models HOT 1
- Add mobilenetv4 support and pretrained models? HOT 5
- Allow passing a file-like object to torchvision.io.video.read_video HOT 6
- cutmix alpha argument in references/classification/transforms.py HOT 1
- torchvision manywheel py 3.11, cuda failure HOT 6
- MacOS test job are failing and install `torch-2.2.0.dev20231010`
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from vision.