Comments (2)
Thanks for your interest and sorry for the delay! I would love to hear some sound examples when you have the time.
You can probably run the generation pretty quickly on a regular laptop; you shouldn't need to use a cloud service unless your application requires generating a ton of content. You can get an idea of how fast the model would run on your laptop by going to this web demo and pressing "Change" on one of the sounds: https://chrisdonahue.com/wavegan/
One thing you can do is to take the trained WaveGAN and just generate a ton of sounds from it offline (e.g. 100k). Then you can just take a random sample from set in your real-time application.
We trained the models for the posted examples to between 100k and 200k steps, but yes we observed that even after only 10-20k steps the model was producing reasonable results.
Best of luck and let me know if you have more questions!
from wavegan.
Thanks for your reply @chrisdonahue .
Here are some of the results from my training experiment from waveGAN.
Here is the dataset, for reference: https://www.youtube.com/watch?v=wXV39pybgJU
And here are my results (stopped at checkpoint 30k or so): https://www.dropbox.com/sh/hp4jk6d7gzuy2qz/AAAfgWpGnuh30LI7EwEAfa8ya?dl=0
IMO, they are pretty good quality and useful as samples for further processing in an electronic music production workflow.
Currently I'm experimenting with mixing heterogeneous datasets, meaning that I use very different .wavs as datasets and see if the model actually "mixes them" in the output as it learns from the them.
Another Issue I'm having is to actually be able to load in my own checkpoints with the example code that is published in the Jupyter notebook.
- Is it possible to implement it as a stand-alone .py script in which I can batch output, for example 100 .wavs from a random or specific latent space vector?
Currently I am generating each sample one by one using a generator script someone else posted in another issue thread, which is kind of tedious.
Thanks again!
UPDATE: I managed to make a .py script for interpolating between latent space vectors, as in the Google Colab Notebook. Here is one interpolation using the above generated checkpoints: https://soundcloud.com/h-e-x-o-r-c-i-s-m-o-s/espacio_latente?fbclid=IwAR239cENr7yFQQq7Xi8CaOar8_H1k2_yHi7pOiwSQ5QYrM_iGrXdVwMyo-k
from wavegan.
Related Issues (20)
- Transfer Learning using pre-trained checkpoints HOT 2
- Invalid argument: You must feed a value for placeholder tensor 'ngl' with dtype int32 HOT 2
- Can we change (increase) amount of training data in between sessions?
- Last layer of the generator in the CNN (size 16384) HOT 1
- Tensorflow2.4 support HOT 5
- Multi-channel audio doesn't work with --data_num_channels 2 in Jupyter Lab HOT 1
- Can you fix collab
- Generate MFCC
- Can't get this to run anymore - Need information on environment
- Single-frequency noisy sound in the result HOT 4
- Reading WaveGAN models using Tensorflow C API
- Code for continue training model from last ckpt? HOT 1
- Training starts but no updates are dumping to checkpoints? HOT 1
- Training higher quality audio for ~5 seconds
- Overtraining HOT 2
- Can't train the model
- Can't run HOT 4
- OSS License compatibility question
- Massive Tensorflow Error Message When Attempting to Train WaveGAN
- shuffle buffer size is 4096 but I wish to use a dataset of 49k+ samples?
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from wavegan.