Code Monkey home page Code Monkey logo

Comments (4)

antoniomo avatar antoniomo commented on August 11, 2024 1

But of course feel free to open another issue if you can work around that and still want to give this a shot :)

from speaker-diarization.

antoniomo avatar antoniomo commented on August 11, 2024

Hi!

Telephone conversations are usually of quite poor quality, however, you can try some things:

call(['./spk-change-detection.py', vad_recipe, args.feapath,
        '-o', spkchange_recipe, '-m', 'gw', '-d', 'BIC', '-w', '1.0',
        '-st', '3.0', '-dws', '0.1', '-l', '0.75'])  # Lambda to 0.75
  • Try using GLR with a fine-tuned distance threshold for the spk-clustering2.py:
call(['./spk-clustering.py', spkchange_recipe, args.feapath,
          '-o', outfile, '-m', 'hi', '-d', 'GLR', '-t', '3000'])  # Distance: GLR, Threshold: 3000 (fine tune for your data)

You'll find that code in the wrapper script spk-diarization2.py, make the changes there :)

I hope this helps, let me know to close the issue if possible :)

from speaker-diarization.

amenegola avatar amenegola commented on August 11, 2024

Hi! Awesome answer, thanks!

I tried to implement the suggestions, they didn't work very well. Sometimes, it really looks like it is working, but it is not most of the time. I understood how changing the lambda and GLR Threshold impacts on segmentation and clustering, but it seems that the problem lies elsewhere. Also, tuning the parameters until I get good results in one audio may not generalize to the rest.

You can close the issue. Thank you very much for the help!

from speaker-diarization.

antoniomo avatar antoniomo commented on August 11, 2024

Sorry I couldn't be of more help! You are totally right, those parameters are data-dependent, so if the recording conditions vary a lot on your dataset, this won't work too well, and you'll need something more advanced than what this package can provide :(

from speaker-diarization.

Related Issues (15)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.