adirockzz95 / piwho
Speaker recognition library based on MARF for Raspberry Pi and other SBCs.
License: MIT License
Every time I try to run it, this error appears:
from watchdog.observers import Observer
ImportError: No module named observers
While training a voice, I used this code:
from piwho import recognition
from piwho import vad
def find_speaker():
    recog = recognition.SpeakerRecognizer()
    # Record voice until silence is detected, then save it as a WAV file.
    vad.record()
    # Identify the speaker using the newly recorded file.
    name = recog.identify_speaker()
    return name
if __name__ == "__main__":
    speaker = find_speaker()
    print(speaker[0])
The speakers.txt file is generated successfully, but the .gzbin file is not generated.
Please reply.
Thanks
If I use more than 10 speakers with the code below:
from piwho import recognition
recog = recognition.SpeakerRecognizer()
name = []
name = recog.identify_speaker('audio.wav')
print(name[0])
print(name[1])
Then, instead of the speaker's name, I get this output:
[options]
Please tell me what the reason is.
Thanks.
Why am I getting this error? I simply ran these lines of code in the Python shell:
from piwho import recognition
recog = recognition.SpeakerRecognizer()
recog.debug = True
recog.train_new_data('/home/royal/linkites_pd/wav_all/akshat3.wav','one')
After the last line, this error occurs:
FileNotFoundError: [Errno 2] No such file or directory: 'java'
Waiting for a reply.
Thank you
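The message above names `java` as the thing that could not be found, not the WAV file. MARF is a Java framework, so a Java runtime presumably has to be installed and reachable through PATH before training can work. A minimal sketch for checking this from Python 3.7 or later follows; the helper names `find_java` and `java_version` are hypothetical and are not part of piwho.

```python
import shutil
import subprocess


def find_java(executable="java"):
    """Return the full path of the Java launcher, or None if it is not on PATH."""
    return shutil.which(executable)


def java_version(java_path):
    """Return the banner printed by `java -version` for the given launcher."""
    result = subprocess.run(
        [java_path, "-version"],
        capture_output=True,
        text=True,
        check=False,
    )
    # Many JVMs write the version banner to stderr; fall back to stdout.
    return (result.stderr or result.stdout).strip()


if __name__ == "__main__":
    path = find_java()
    if path is None:
        print("java was not found on PATH; install a Java runtime or fix PATH.")
    else:
        print("Found java at", path)
        print(java_version(path))
```

If the script reports that `java` is missing, installing a Java runtime (or adding its `bin` directory to PATH) is the first thing to try before calling `train_new_data` again.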
Hi, nice work with the python wrapper.
I understand from the name that you're only interested in recognising who is speaking, not what they've said. But maybe you know how difficult or easy it would be to extend it so that it can match the sound instead?
I mean, if I have wavs of speaker1 saying "whoosh" and "beep" and "neenaw", I would like it to recognise which of those three things speaker1 says into the microphone, but I don't want to recognise speaker1 saying "nawnee" even if it sounds exactly like speaker1, and I don't really care about recognising speaker2 saying anything. Is that something that piwho could do, or would it need MARF to be extended in order to do this?
Is there a way using get_speaker_scores() to get the score for a single name?
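One possible approach, assuming `get_speaker_scores()` returns a dict-like mapping from speaker name to score (an assumption, since the return type is not shown in this thread), is to look up the name in that mapping. The helper `score_for` and the sample values below are hypothetical.

```python
def score_for(scores, name):
    """Return the score for one speaker as a float, or None if the name is absent.

    `scores` is assumed to be a mapping of speaker name -> score, where the
    score may be stored as a number or as a numeric string.
    """
    value = scores.get(name)
    return None if value is None else float(value)


# Hypothetical sample data standing in for the real result, e.g.
#   scores = recog.get_speaker_scores()   # assumed to return a mapping
sample_scores = {"alice": "0.41", "bob": "0.87"}

print(score_for(sample_scores, "alice"))  # 0.41
print(score_for(sample_scores, "carol"))  # None
```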
Hello,
I have successfully trained and have a .gzbin file and speakers.txt. When I try to return the name list from recog.identify_speaker() like so:
name = []
vad.record()
name = recog.identify_speaker()
print(name)
name does not contain anything. Is there something I am missing? Thanks
I trained the model on four different speakers, and it is able to recognize them. However, when I run recognition on audio from a different speaker (not one of the four I trained on), it returns the name of one of the four trained speakers. Is there any way to classify such a speaker as unknown?
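Nothing in this thread shows a built-in "unknown" label, so one workaround is to reject the best match when its score is not good enough. The sketch below assumes that speaker scores are available as a mapping of name to score and that a lower score means a closer match; both are assumptions to verify against your own recordings. The helper `closest_or_unknown` and the threshold value are hypothetical.

```python
def closest_or_unknown(scores, max_distance, unknown="unknown"):
    """Return the best-matching speaker, or `unknown` if no match is close enough.

    Assumes `scores` maps speaker name -> distance-like score (number or
    numeric string) where LOWER means a closer match. `max_distance` has to be
    tuned by hand on recordings of known and unknown speakers.
    """
    if not scores:
        return unknown
    numeric = {name: float(value) for name, value in scores.items()}
    best = min(numeric, key=numeric.get)
    return best if numeric[best] <= max_distance else unknown


# Hypothetical sample data; real values would come from the recognizer.
print(closest_or_unknown({"alice": "0.2", "bob": "0.9"}, max_distance=0.5))
print(closest_or_unknown({"alice": "0.8", "bob": "0.9"}, max_distance=0.5))
```

If higher scores turn out to mean a better match in your setup, swap `min` for `max` and flip the comparison.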
Do the training input files need to have the speakers saying the same phrase?
Also, when running the recognition program, does the new file need to have the speaker saying the same phrase?
In other words, is this system text-independent or text-dependent?
Thank you for your time,
David
Hi,
First of all thank you for sharing this awesome library!
I'm trying to implement the audio recording script, which records audio and recognizes the speaker, but when I run it, it only says:
* Listening mic. recording..
It never stops. After looking in the vad.py file, I don't see where the .wav file is stored, so I'm probably missing something.
The only other thing I have done that could be causing an issue is that I changed the audio settings to:
CHUNK = 512
RATE = 48000
This was done so PyAudio could work with my C-Media USB audio card.
Any help is appreciated!
Thanks
Hi everyone,
I'd like to know if your team could help us integrate your solution into Jarvis, a lightweight, configurable, multi-language Jarvis-like bot meant for home automation on slow computers.
More information here: https://github.com/alexylem/jarvis
Even though its creator is French, the project is mostly conducted in English.
We would like to use your solution to adapt commands to the person making the request.
Example: playing a personal music playlist
=> If John says "Play some music", Jarvis answers "OK, let's play some rock!"
=> If Jane says "Play some music", Jarvis answers "OK, let's play some pop."
Thanks in advance
Hi,
I ran the lines of code below in the Python shell:
from piwho import recognition
recog = recognition.SpeakerRecognizer()
name = []
name = recog.identify_speaker('/home/user/piwho2_virtual/akshat1.wav')
Issue:
File "/usr/lib/python2.7/subprocess.py", line 1343, in _execute_child
raise child_exception
OSError: [Errno 2] No such file or directory
But the file actually exists, see:
(mypython) user@user-OptiPlex-3020M:~/piwho2_virtual$ ls
10:17:49:21.wav 11:11:31:03.wav gender testing.py
11:11:18:49.wav 11:11:33:32.wav mypython test.py
11:11:30:49.wav akshat1.wav speakers.txt
One more question: how do I get the .gzbin file?
Please tell me what to do.