posenhuang / deeplearningsourceseparation

Deep Recurrent Neural Networks for Source Separation

License: Other

MATLAB 97.83% TeX 0.41% M 0.03% C 1.72%

audio-separation deep-learning matlab rnn source-separation speech-denoising speech-separation

deeplearningsourceseparation's Issues

Training ERROR with "Not enough input arguments."

Hi, I am a beginner in MATLAB.
I'm interested in the singing voice separation project,
so I cloned your project and tried to run "train_mir1k_demo" in MATLAB.
However, I got:
Error in train_mir1k_demo (line 58)
eI.MFCCorlogMelorSpectrum=MFCCorlogMelorSpectrum;

How do I set the MFCCorlogMelorSpectrum value when running this code from the terminal?
Thank you!

How to train the model?

Hi,
How do I train the model for separating singing and voice?
Any advice or suggestions would be appreciated.
Thanks

drnn folder missing

Warning: Name is nonexistent or not a directory: ..\..\..\codes\denoising\drnn 
> In path (line 109)
  In addpath (line 88)
  In run_test_single_model (line 8)

training code for voice denoise

Can you upload the code used for training a voice denoising model? Thank you very much.

Faced some errors after running run_test_single_model.m

Hi, I recently started using MATLAB, so I apologise in advance if this is a stupid question.

I am trying to work on the TIMIT dataset. When I run run_test_single_model.m after downloading the pretrained model, I get the following errors:

Error using horzcat
Dimensions of arrays being concatenated are not consistent.

Error in stft2 (line 82)
x = [zeros( 1, sz+pd-hp, cl) s zeros( 1, sz+pd, cl)]';

Error in compute_features_stft2 (line 15)
spectrum_mix = scf * stft2( dmix, nFFT, hop, 0, wn);

Error in formulate_data_test (line 54)
[DATA, mixture_spectrum, eI]=compute_features_stft2(dmix, eI);

Error in test_timit_general_kl_recurrent (line 38)
[test_data_cell, target_ag, mixture_spectrum]=formulate_data_test(mixture, eI, testmode);

Error in run_test_single_model (line 41)
test_timit_general_kl_recurrent(eI.modelname, theta, eI, 'done', j);

Would be great if you could help me out, thank you!

tensorflow based implementation

Does anyone have a TensorFlow-based implementation of this?

denoising training?

Why is no training code available for the denoising experiment? I am very interested in this, and it would be awesome if you could provide additional code for training the denoising model. Thanks!

Timit

Hi, I am a beginner in MATLAB.
I'm interested in the singing voice separation project,
so I cloned your project and tried to run "train_timit_demo" in MATLAB.
However, I got:
Error in train_timit_demo (line 58)
eI.MFCCorlogMelorSpectrum=MFCCorlogMelorSpectrum;

How do I set the MFCCorlogMelorSpectrum value when running this code from the terminal?
Thank you so much!

I found your answer to the same question for "train_mir1k_demo". Can I use the same values?

TSP

Running this .m file requires some parameters. Could you give the values needed to run the example model
TSP_model_RNN1_win1_h300_l2_r0_64ms_1000000_softabs_linearout_RELU_logmel_trn0_c1e-10_c0.001_bsz100000_miter10_bf50_c0_d0_7650.mat

train_TSP_demo_mini_clip(context_win, hidden_units, num_layers, isdropout, isRNN, iscleanonly,...
circular_step , isinputL1, MFCCorlogMelorSpectrum, framerate, pos_neg_r, outputnonlinear, opt, act, ...
train_mode, const, const2, isGPU, batchsize, MaxIter, bfgs_iter, clip, lambda,...
data_mode)

Could it be used in real time?

Hi,
I want to use this method to suppress howling in RTC (Real-Time Communication),
so could I separate the noise and the voice frame by frame?

Any reference would be appreciated.

Thanks
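
The thread does not say whether the trained recurrent models can run causally, so the following is only a sketch of the buffering half of frame-by-frame processing: it collects audio chunks of arbitrary size and emits fixed-length, overlapping frames as soon as they are complete. It is written in Python for illustration; the class name `FrameBuffer` and the parameters `frame_len` and `hop` are hypothetical and are not part of this repository.

```python
class FrameBuffer:
    """Collects streamed samples and emits overlapping fixed-length frames.

    Hypothetical helper for illustration only; not part of the repository.
    """

    def __init__(self, frame_len, hop):
        if not 0 < hop <= frame_len:
            raise ValueError("hop must satisfy 0 < hop <= frame_len")
        self.frame_len = frame_len
        self.hop = hop
        self._buf = []

    def push(self, chunk):
        """Append new samples and return every frame that is now complete."""
        self._buf.extend(chunk)
        frames = []
        while len(self._buf) >= self.frame_len:
            # Copy out one full frame, then advance the buffer by one hop.
            frames.append(self._buf[:self.frame_len])
            del self._buf[:self.hop]
        return frames


# Feed chunks of uneven size, as an audio callback might deliver them.
fb = FrameBuffer(frame_len=4, hop=2)
out = []
for chunk in ([0, 1, 2], [3, 4], [5, 6, 7]):
    out.extend(fb.push(chunk))
```

Each emitted frame would then be passed to whatever per-frame separation step is used. The minimum latency of such a scheme is at least one frame length, plus any future context frames the model needs.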

License is missing

Hi, we want to reference your implementation in a paper and need to know the software license of your work.

What is the license of this software? Maybe you can add a LICENSE.md file in your root folder.

After training, where is the model saved?

Hi, posenhuang. Thank you for your wonderful work. I have tried to train the denoising demo with the TIMIT corpus. The training process seems to have succeeded; it took 5h:43m:44.3s. The problem is that I can't find the trained model. Could you please tell me where it is saved?

The codes can't run

Your code has some bugs: the minFunc_2012 tools do not support your code, and the TSP and denoising demos can't run.

some errors

Excuse me. When I run the demos, there are some errors. MATLAB reports that there is no stft3 function and that horzcat is used incorrectly. The project is a little big, so it will be challenging to find the bugs. I hope you can give me some advice. Thank you very much.

how to control the gain

I notice that the gain is 1 by default. I guess this means the SNR is 0 dB. If I want to mix the waves at 5 dB, how should I set the parameter 'gain'?
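
How the repository applies 'gain' is not stated in this thread, so the following is a sketch under a stated assumption: the mixture is formed as signal + gain * noise. Under that assumption, a gain of 1 gives 0 dB only when the two waves have equal RMS power, and the gain for a target SNR follows from SNR = 20 log10(rms(signal) / (gain * rms(noise))). The example is in Python for illustration, and all function names are hypothetical.

```python
import math


def rms(x):
    """Root-mean-square level of a sequence of samples."""
    return math.sqrt(sum(v * v for v in x) / len(x))


def gain_for_snr(signal, noise, snr_db):
    """Gain g such that signal + g * noise has the requested SNR in dB.

    Assumes the gain scales the noise; this is an assumption, not the
    repository's documented convention.
    """
    return rms(signal) / rms(noise) * 10 ** (-snr_db / 20)


def measured_snr_db(signal, noise):
    """SNR in dB between a signal and an already-scaled noise sequence."""
    return 20 * math.log10(rms(signal) / rms(noise))


# Two synthetic one-second waves at 16 kHz with different levels.
signal = [math.sin(2 * math.pi * 440 * t / 16000) for t in range(16000)]
noise = [0.3 * math.sin(2 * math.pi * 97 * t / 16000 + 1.0) for t in range(16000)]

g = gain_for_snr(signal, noise, 5.0)
scaled_noise = [g * v for v in noise]
achieved = measured_snr_db(signal, scaled_noise)  # close to 5.0 dB
```

If the code instead scales the signal rather than the noise, the sign of the exponent flips, so it is worth checking which wave the parameter multiplies before using the formula.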

denoising training code

I wonder if you can upload denoising training code.

Denoising Demo not working

Hi, it seems like the code in the denoising folder is not up to date. First I received an error that the model file was not found so I renamed that file. Then I received an error that the formulate_data_test function is missing. If I add it (from the timit or TSP folder) I get yet another error:

Error using -
Matrix dimensions must agree.

Error in test_denoising_general_kl_bss3 (line 56)
output.source_noise= spectrum.mix-output.source_signal;

The previous error looked like this:

run_test_single_model
Warning: Name is nonexistent or not a directory: ......\codes\denoising\Data
In path (line 109)
In addpath (line 88)
In run_test_single_model (line 7)
Warning: Name is nonexistent or not a directory: ......\codes\denoising\drnn
In path (line 109)
In addpath (line 88)
In run_test_single_model (line 8)
Undefined function or variable 'formulate_data_test'.

Error in test_denoising_general_kl_bss3 (line 14)
formulate_data_test(mixture, eI, testmode);

Error in run_test_single_model (line 38)
output = test_denoising_general_kl_bss3(x', theta, eI, 'testall', 0);

I will try to fix it myself but if you have time I would appreciate your help. Thank you.

minFunc returns 'Step direction is illegal!'

Hi, posenhuang.
I used 3 hours of Mandarin speech to train mir1k. The SNRs are [-5, 0, 5, 10, 15, 20] dB, and the clean data is the infinite-SNR condition. The left channel of the clean data is filled with zeros.
In minFunc.m, lines 963-967, a check for legal numbers is performed, and I got the error 'Step direction is illegal!'. I think some of the data must be invalid, but I don't know where the problem is. Would you please
help me solve this problem?
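
The cause is not confirmed in this thread. One possibility worth ruling out is that the all-zero channel produces non-finite feature values (for example, the log of a zero magnitude is -Inf in MATLAB), which can then propagate into the gradient and the step direction. The sketch below, in Python for illustration, shows the two checks involved: flooring magnitudes before taking the log, and testing features for NaN or Inf. The function names and the floor value 1e-10 are hypothetical.

```python
import math


def log_magnitude(mags, floor=1e-10):
    """Log of each magnitude, floored so that exact zeros stay finite.

    Note: Python's math.log(0.0) raises ValueError, whereas MATLAB's
    log(0) returns -Inf; the floor avoids both outcomes.
    """
    return [math.log(max(m, floor)) for m in mags]


def all_finite(values):
    """True if no value is NaN, +Inf, or -Inf."""
    return all(math.isfinite(v) for v in values)


# An all-zero (silent) channel stays finite once the floor is applied.
silent_channel = [0.0] * 8
features = log_magnitude(silent_channel)
silent_ok = all_finite(features)

# A feature vector containing -Inf is flagged before it reaches the optimizer.
bad_ok = all_finite([1.0, float("-inf"), 2.0])
```

Running an equivalent finiteness check on the training features and targets before calling the optimizer would show whether the input data is the source of the illegal values.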

Is there an implementation in another language?

Hi,
Is there an implementation in any other language,
such as Python or C/C++?
Any advice or suggestions would be appreciated!
Thanks

Would you please share the demo for denoising_model training?

First, what brilliant work you have done!
I found that your denoising demo does not include model training code. Would you please share an m-file for training the denoising model? Your reply will be highly appreciated! Thank you.

Which version of MATLAB do you use?

Are you using the RNN for denoising task?

Hi, posenhuang. Do you use an RNN in the denoising task?

Script for Inferencing

Hi @posenhuang ,
Thanks for creating this really helpful repository. Can you please share an inference script so that I can test your trained models on a custom dataset?

Thanks.

Timit and TSP separation code - not working

Hi, the Timit speech separation model seems to run, as a lot of output appears in the command window, but it does NOT appear to save the split files.

The TSP model does NOT run at all and errors out :(.

Would be very helpful if you could address these.

Thx!

Training (ERROR)

Hi, excuse me, I am a beginner in programming. I am working on an audio source separation project and am really interested in this code. I would be grateful for your help with two problems:
1. Error in the training code: codes/mir1k/train_mir1k_demo.m
Undefined function or variable "data_ag".

Error in formulate_data (line 223)
theoutputs = {data_ag, target_ag, mixture_ag};

Error in train_mir1k_demo (line 220)
[data_cell, targets_cell, mixture_spectrum]=formulate_data(train_files, eI, eI.train_mode); %0 -- chunk, 2--no chunk, 3- icassp

Error in tr (line 62)
train_mir1k_demo(context_win, hidden_units, num_layers, isdropout, ...
2. I have a problem installing HTK; I can't install it. Does the error above come from that? If so, how can I install it, given that I have already downloaded the file?