Comments (5)

antoniomo commented on August 11, 2024

Hi!

Glad you like it :)

About your requirement: do you want one .wav file per speaker, containing only that speaker's audio? That is:

speaker_1/audio.wav  # A wav file with the concatenation of every speaker_1 part of audio.wav
speaker_2/audio.wav  # A wav file with the concatenation of every speaker_2 part of audio.wav

Is that it? If so, the package does no audio post-processing as-is, so it won't do this automatically. You could write a Python or awk script that takes the output, parses each start-time=xxx, end-time=xxx and speaker=xxx with a regex, and calls ffmpeg to cut the matching segment out of the original .wav. You can then join all the speaker_x segments together.

This answer is pretty close to what you want; you just need to adapt the parsing part to this format: https://unix.stackexchange.com/a/400032/6301
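
To make the parse-then-cut idea concrete, here is a minimal Python sketch. It is not part of the package. It assumes each output line contains start-time=..., end-time=... and speaker=... in that order, separated by whitespace, with times in seconds; the function names and the part_0000.wav naming are made up for illustration. Adjust the regex if your output looks different.

```python
import re
import subprocess
from collections import defaultdict
from pathlib import Path

# Assumed line shape: "... start-time=0.0 end-time=9.5 speaker=speaker_1"
SEGMENT_RE = re.compile(
    r"start-time=(?P<start>\d+(?:\.\d+)?)\s+"
    r"end-time=(?P<end>\d+(?:\.\d+)?)\s+"
    r"speaker=(?P<speaker>\S+)"
)


def parse_segments(lines):
    """Group (start, end) pairs by speaker label; non-matching lines are skipped."""
    segments = defaultdict(list)
    for line in lines:
        match = SEGMENT_RE.search(line)
        if match:
            segments[match["speaker"]].append(
                (float(match["start"]), float(match["end"]))
            )
    return dict(segments)


def cut_commands(wav_path, segments, out_dir):
    """Build one ffmpeg command per segment; nothing is executed here."""
    commands = []
    for speaker, spans in segments.items():
        for index, (start, end) in enumerate(spans):
            # Hypothetical layout: <out_dir>/<speaker>/part_0000.wav, ...
            target = Path(out_dir) / speaker / f"part_{index:04d}.wav"
            commands.append([
                "ffmpeg", "-y", "-i", str(wav_path),
                "-ss", str(start), "-to", str(end),
                "-c", "copy", str(target),
            ])
    return commands


def run(commands):
    """Create each output directory, then run the ffmpeg commands in order."""
    for command in commands:
        Path(command[-1]).parent.mkdir(parents=True, exist_ok=True)
        subprocess.run(command, check=True)
```

Typical use would be `run(cut_commands("audio.wav", parse_segments(open("output.txt")), "out"))`, where output.txt is a hypothetical file holding the saved diarization output. Joining the parts of each speaker is a separate step; ffmpeg's concat demuxer is one option for that.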

from speaker-diarization.

antoniomo commented on August 11, 2024

Is the audio in Korean? The included language model is for English. It would be possible to train a Korean language model using https://github.com/aalto-speech/AaltoASR and a lot of annotated data, but that's quite a big task and I can't guide you through it :(

melonicedlatte commented on August 11, 2024

Thank you!! < ^ o ^ > !!

I'm using Python and am now writing the script to split the file.

But I've run into a new problem.
I'm Korean, and I tested one .wav file.
The performance was not good.

How do you think I can solve this?

melonicedlatte commented on August 11, 2024

Thank you. 👍👍👍
I will test other Korean .wav files.
I also have to look at that source and confirm whether I can use it.

Thank you for your quick answer.

antoniomo commented on August 11, 2024

Closing as it seems this was answered :)
