Comments (7)
What exactly have you changed in the code?
Be wary of erasing things like
global best_bleu4, epochs_since_improvement, checkpoint, start_epoch, fine_tune_encoder, data_name, word_map
PEP checks will mark those as warnings, but here they have a good use.
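To illustrate why that `global` line matters, here is a minimal sketch in plain Python. The two variable names come from the line quoted above; the helper functions `update_with_global` and `update_without_global` are hypothetical and are not part of train.py.

```python
# Module-level training state, named after two of the globals quoted above.
best_bleu4 = 0.0
epochs_since_improvement = 0


def update_with_global(bleu4):
    # `global` makes the assignments below rebind the module-level names.
    global best_bleu4, epochs_since_improvement
    if bleu4 > best_bleu4:
        best_bleu4 = bleu4
        epochs_since_improvement = 0
    else:
        epochs_since_improvement += 1


def update_without_global(bleu4):
    # Without `global`, this assignment only creates a local variable;
    # the module-level best_bleu4 is left untouched.
    best_bleu4 = bleu4
    return best_bleu4


update_without_global(0.5)
print(best_bleu4)  # still 0.0: the module-level value did not change

update_with_global(0.3)
update_with_global(0.2)
print(best_bleu4, epochs_since_improvement)  # 0.3 1
```

If the `global` statement is deleted to silence a warning, the training state silently stops being updated across functions, which is the kind of side effect to watch for.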
from a-pytorch-tutorial-to-image-captioning.
I only changed the code "scores, _ = pack_padded_sequence(scores, decode_lengths, batch_first=True)" to "scores = pack_padded_sequence(scores, decode_lengths, batch_first=True).data" to get rid of the error. I also changed some data parameters at the beginning of train.py, but I don't think that would have much influence. I didn't change the code with the global variables. Do you know how to make the loss converge? Should I lower the learning rate?
Have you tried this fix instead?
Yeah, I just deleted the '_', but the cross-entropy loss must accept two tensor arguments, so I added '.data' to the end of that line.
That's true. They should be the same in the loss when using .data. Curious, is your loss just not decreasing, or is it getting worse?
My train.py runs, but the loss just doesn't decrease.
I changed the code to "scores = pack_padded_sequence(scores, decode_lengths, batch_first=True)[0]", because the new version requires it. After making this change, I didn't run into your problem.
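For anyone comparing the two fixes discussed in this thread, here is a small self-contained sketch. The tensor shapes and values are made up for illustration and are not taken from train.py. It assumes a recent PyTorch, where `pack_padded_sequence` returns a `PackedSequence` with more than two fields (which is presumably why the old `scores, _ = ...` unpacking fails). Under that assumption, `.data` and `[0]` should give the same tensor, so either form can be passed to the loss.

```python
import torch
import torch.nn as nn
from torch.nn.utils.rnn import pack_padded_sequence

# Toy shapes (assumptions): batch of 2, max decode length 3, vocabulary of 5.
scores = torch.randn(2, 3, 5)
targets = torch.randint(0, 5, (2, 3))
decode_lengths = [3, 2]  # must be sorted in decreasing order by default

packed_scores = pack_padded_sequence(scores, decode_lengths, batch_first=True)
packed_targets = pack_padded_sequence(targets, decode_lengths, batch_first=True)

# Both spellings select the flat tensor of non-padded timesteps.
scores_flat = packed_scores.data
targets_flat = packed_targets[0]
same = torch.equal(scores_flat, packed_scores[0])

# CrossEntropyLoss needs plain tensors: (N, num_classes) scores and (N,) targets.
loss = nn.CrossEntropyLoss()(scores_flat, targets_flat)
print(same, scores_flat.shape, targets_flat.shape, loss.item())
```

Since both forms hand the loss identical tensors, the choice between `.data` and `[0]` is unlikely by itself to explain a loss that does not decrease.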