Code Monkey home page Code Monkey logo

ghostvlad-speaker's Introduction

ghostvlad-speaker

An tensorflow implementation of ghostvlad, pretrained model can be downloaded in ghostvlad-speaker.tar

prerequisit

1. pydub, also need ffmpeg to be installed.
2. tensorflow
3. librosa

Steps:

1. generate speaker labels from dataset voxceleb2

python voxcele2json.py

2. Training

python train.py
Please change the args_params in main

3. predict

python predict.py

ghostvlad-speaker's People

Contributors

taylorlu avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar

ghostvlad-speaker's Issues

About the convergence cost

Hello, I have used the vox2 dataset(including 5994 speakers), and the default parameters.
However, the cost converges at around 19.50, and the speaker verfication result is not good.
I am wondering what's wrong with it, and what is your convergence cost. Thanks!

cluster center stay the same during training

Hi there

During training using the GhostVLAD layer, I have experienced that the cluster centres do not change their value, which means that they always stay at the same place during training. Is this an expected behaviour, or is this a bug in my training process?

To give you some context, I used the GhostVLAD layer after a ResNet18 architecture for an audio task. I adapted your exact code to fit my structure. All the initialization, layers and so on are like in your code.

I would appreciate your help 😄.

A question about decay_steps

In the original paper, decaying step of learning rate is every 36 epochs.

But your code, the step is 5000 iterations.

How can I make the same configuration as in the original paper?

Thank you!

About pretrained model settings

Hi taylorlu, I want to know the settings of the pretrained model. (epoch, batch size, learning rate decay...)

I used the model and it seems works nicely.

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.