personalvad's People

Contributors

dependabot[bot], pirxus

personalvad's Issues

model release

Hi @pirxus, thank you for kindly sharing the implementation.
It would be great for users if you could share a checkpoint for fast evaluation.

test data

Hi, thank you for uploading your code!
I want to use the pretrained models to test my own data, but something seems to go wrong during feature extraction. How should I use the model to test my own data?

Issues with ntss and tss confusion

Hi,

I am currently trying to replicate the results from the Personal VAD paper, and I am having some issues with the model not properly distinguishing between target speaker and non-target speaker speech, with a heavy bias towards target speaker speech.
Have you had any such issues?

Best regards,
Holger Severin Bovbjerg

personalVAD 2.0

Hi, will you try to implement Personal VAD 2.0?
Personal VAD 2.0: Optimizing Personal Voice Activity Detection for On-Device Speech Recognition

extract feature

Hi, thank you for your personal VAD code!

When I try to extract features (command: bash prepare_dataset_features.sh 3), I run into the following error:

  File "/usr/lib/python3.6/multiprocessing/pool.py", line 119, in worker
    result = (True, func(*args, **kwds))
  File "/usr/lib/python3.6/multiprocessing/pool.py", line 47, in starmapstar
    return list(itertools.starmap(args[0], args[1]))
  File "src/extract_features.py", line 176, in extract_features
    for utt_id, (sr, arr) in wav_scp:
  File "/home/oscar/.local/lib/python3.6/site-packages/kaldiio/highlevel.py", line 136, in __iter__
    k, v = next(self.generator)
  File "/home/oscar/.local/lib/python3.6/site-packages/kaldiio/matio.py", line 127, in load_scp_sequential
    mat = _load_mat(arkfd, offset, slices, endian=endian)
  File "/home/oscar/.local/lib/python3.6/site-packages/kaldiio/matio.py", line 331, in _load_mat
    array = read_kaldi(fd, endian)
  File "/home/oscar/.local/lib/python3.6/site-packages/kaldiio/matio.py", line 441, in read_kaldi
    array = read_ascii_mat(fd)
  File "/home/oscar/.local/lib/python3.6/site-packages/kaldiio/matio.py", line 605, in read_ascii_mat
    assert len(string) != 0
AssertionError

Thank you for your help. I look forward to hearing from you.

Sincerely,
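
A note on narrowing this down: the assertion in the traceback above fires when the reader gets no data back for an entry, so one plausible (unconfirmed) cause is an scp entry that points to a missing or empty file. The sketch below is not part of the repository; it assumes the scp file holds lines of the form "<utt_id> <path>" (optionally "<path>:<offset>"), and the helper name find_bad_scp_entries is made up for illustration. Entries that are piped commands (ending in "|") are only flagged, not executed.

```python
import os
import tempfile


def find_bad_scp_entries(scp_path):
    """Return (utt_id, path, reason) tuples for suspicious scp entries.

    Assumption: each non-empty line is '<utt_id> <path>' or
    '<utt_id> <path>:<byte_offset>'. This is a hypothetical helper,
    not a function from personalvad or kaldiio.
    """
    problems = []
    with open(scp_path, encoding="utf-8") as scp:
        for line in scp:
            line = line.strip()
            if not line:
                continue
            parts = line.split(maxsplit=1)
            if len(parts) != 2:
                problems.append((parts[0], "", "no path given"))
                continue
            utt_id, target = parts
            if target.endswith("|"):
                # Piped commands would have to be run to be verified.
                problems.append((utt_id, target, "piped command, not checked"))
                continue
            # Drop a trailing ':<offset>' if present (ark-style entries).
            head, sep, tail = target.rpartition(":")
            path = head if sep and tail.isdigit() else target
            if not os.path.isfile(path):
                problems.append((utt_id, path, "file not found"))
            elif os.path.getsize(path) == 0:
                problems.append((utt_id, path, "file is empty"))
    return problems


if __name__ == "__main__":
    # Small self-contained demo with one good, one empty, one missing file.
    with tempfile.TemporaryDirectory() as tmp:
        good = os.path.join(tmp, "good.wav")
        empty = os.path.join(tmp, "empty.wav")
        missing = os.path.join(tmp, "missing.wav")
        with open(good, "wb") as f:
            f.write(b"RIFF")
        open(empty, "wb").close()
        scp_file = os.path.join(tmp, "wav.scp")
        with open(scp_file, "w", encoding="utf-8") as f:
            f.write(f"utt1 {good}\nutt2 {empty}\nutt3 {missing}\n")
        for utt_id, path, reason in find_bad_scp_entries(scp_file):
            print(utt_id, path, reason)
```

If this reports nothing for the scp file used by the feature extraction script, the cause probably lies elsewhere (for example in how the entries are read), and the check above only rules out the simplest case.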
