Comments (11)
Sorry to hear that, 1.5steps/s is not bad, although it should be around 3 for batch size 32.
from forwardtacotron.
Hi, could be that the older pytorch does not autocast the values in l1_loss. I just pushed an explicit cast to master that should fix this, could you pull and try again?
from forwardtacotron.
Hi
Thank you for the fast response.
I pulled master and tried but got this:
Traceback (most recent call last):
File "train_forward.py", line 98, in <module>
trainer.train(model, optimizer)
File "D:\speech\ForwardTacotron-master\ForwardTacotron-master\trainer\forward_trainer.py", line 37, in train
self.train_session(model, optimizer, session)
File "D:\speech\ForwardTacotron-master\ForwardTacotron-master\trainer\forward_trainer.py", line 71, in train_session
loss.backward()
File "C:\Users\Josh\AppData\Local\Programs\Python\Python37\lib\site-packages\torch\tensor.py", line 118, in backward
torch.autograd.backward(self, gradient, retain_graph, create_graph)
File "C:\Users\Josh\AppData\Local\Programs\Python\Python37\lib\site-packages\torch\autograd\__init__.py", line 93, in backward
allow_unreachable=True) # allow_unreachable flag
RuntimeError: CUDA error: unspecified launch failure
Do I need to reprocess and retrain the first network or it should be working?
from forwardtacotron.
Hi
I think maybe I am running out of vram, I have an RTX 2070 with 8GB ram, I might need to lower batch size.
from forwardtacotron.
It is working in cpu mode, will try to reduce batch size. Thank you!
from forwardtacotron.
Batch size of 4 works,
8, 16, 32 do not. Does lower batch size affect quality or just takes longer to train?
from forwardtacotron.
hmmm. might not be a vram issue... even at 4 it does not get through an epoch before giving the same error as before hmmm....
vram usage only at 4GB...
from forwardtacotron.
Seems to be a bug in cudnn, disabling it is very slow but works pytorch/pytorch#27588
from forwardtacotron.
adding torch.autograd.set_detect_anomaly(True) fixes the issue, but it is still a bit slower, but still much faster than disabling cudnn
from forwardtacotron.
I could imagine that upgrading pytorch/nvcc would help, but I understand that can be quite cumbersome.
from forwardtacotron.
Upgrading pytorch to 1.5.1 and getting latest nvidia drivers produced the same result. Does not seem to happen on 2080xx cards,just lower tier like mine. No big deal though, 1.5 steps/sec is much better than 0.26 without cudnn at all!
from forwardtacotron.
Related Issues (20)
- symbols.py for Arabic letters
- Feature request: model compatible to export into onnx
- Cast error details: Unable to cast [Array] to Tensor HOT 9
- Adding pauses to the input text HOT 2
- confuse about duration extract HOT 10
- preprocess.py issues - RAM usage close to 100% but CPU usage is nonexistant HOT 16
- ValueError not enough values to unpack (expected 2 got 0) HOT 2
- making the system available for use with assistive technologies on windows HOT 1
- Bad Alignment HOT 1
- ValueError: need at least one array to stack train_tacotron.py line 192 HOT 1
- Facing problem at preprocessing
- Need instructions for fine tunning
- Problems with attention for dataset consisting of longer samples
- how to train a dataset using a pre-trained model?
- preprocess.py misuses Espeak backend, resulting in slow performance and memory leak HOT 2
- preprocess.py: list index out of range HOT 5
- Multispeaker and new neural voice creation HOT 12
- Non-Latin alphabets
- Bad Attention!
- Training a model twice using a different dataset
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from forwardtacotron.