Comments (2)
Do you have the docopt
dependency, and are you using python2?
from speaker-diarization.
yeah.. Thankyou. I was missing docopt
dependency. Now Im getting this result...
userk@PSSHSRDT034:~/speaker-diarization$ python spk-diarization2.py /mnt/c/users/karthikeyan/Downloads/proper.wav Reading file: /mnt/c/users/karthikeyan/Downloads/proper.wav Writing output to: stdout Using feacat from: /home/userk/speaker-diarization/feacat Writing temporal files in: /tmp Writing lna files in: /home/userk/speaker-diarization/lna Writing exp files in: /home/userk/speaker-diarization/exp Writing features in: /home/userk/speaker-diarization/fea Performing exp generation and feacat concurrently tokenpass: ./VAD/tokenpass/test_token_pass Reading recipe: /tmp/initzDxEk1.recipe Using model: ./hmms/mfcc_16g_11.10.2007_10 Writing
.lnafiles in: /home/userk/speaker-diarization/lna Writing
.exp` files in: /home/userk/speaker-diarization/exp
Processing file 1/1
Input: /mnt/c/users/karthikeyan/Downloads/proper.wav
Output: /home/userk/speaker-diarization/lna/proper.lna
FAN OUT: 0 nodes, 0 arcs
FAN IN: 0 nodes, 0 arcs
Prefix tree: 3 nodes, 6 arcs
WARNING: No tokens in final nodes. The result will be incomplete. Try increasing beam.
Calling voice-detection2.py
Reading recipe from: /tmp/initzDxEk1.recipe
Reading .exp files from: /home/userk/speaker-diarization/exp
Writing output to: /tmp/vadTalccO.recipe
Sample rate set to: 125
Minimum speech turn duration: 0.5 seconds
Minimum nonspeech between-turns duration: 1.5 seconds
Segment before expansion set to: 0.0 seconds
Segment end expansion set to: 0.0 seconds
Waiting for feacat to end.
Calling spk-change-detection.py
Reading recipe from: /tmp/vadTalccO.recipe
Reading feature files from: /home/userk/speaker-diarization/fea
Feature files extension: .fea
Writing output to: /tmp/spkcxxYN9G.recipe
Conversion rate set to frame rate: 125.0
Using a growing window
Deltaws set to: 0.096 seconds
Using BIC as distance measure, lambda = 1.0
Window size set to: 1.0 seconds
Window step set to: 3.0 seconds
Threshold distance: 0.0
Useful metrics for determining the right threshold:
Average between windows distance: -789.417532303
Maximum between windows distance: 35.230502772707496
Minimum between windows distance: -1378.4592347022503
Total windows: 23
Total segments: 2
Average between detected segments distance: 56.7217946043
Maximum between detected segments distance: 56.72179460426196
Minimum between detected segments distance: 56.72179460426196
Total detected speaker changes: 1
Calling spk-clustering.py
('===', '/tmp/spkcxxYN9G.recipe')
Reading recipe from: /tmp/spkcxxYN9G.recipe
Reading feature files from: /home/userk/speaker-diarization/fea
Feature files extension: .fea
Writing output to: stdout
Conversion rate set to frame rate: 125.0
Using hierarchical clustering
Using BIC as distance measure, lambda = 1.3
Threshold distance: 0.0
Maximum speakers: 0
Initial cluster with: 2 speakers
Merging: 1 and 2 distance: -2548.5851870160886
Final speakers: 1
Useful metrics for determining the right threshold:
Maximum between segments distance: 0
Minimum between segments distance: -2548.5851870160886
Total segments: 2
Total detected speakers: 1`
from this how can I get the info of 'number of audio segments can be generated with respect to each speaker'. like speaker 1
has around 5 audio segments and the duration (from where to where I should crop the audio) .... and the wav file has two speakers but it shows total detected speakers: 1 ..
from speaker-diarization.
Related Issues (15)
- Issue with Running Audio File HOT 3
- Dockerfile is missing the step to install ffmpeg HOT 1
- Questions about BIC distance calculation HOT 1
- DockerHub-hosted image not up to date HOT 1
- I want to make a wav file using stdout, how can i make? HOT 5
- Speed is too slow. Is this docker can use gpu? HOT 1
- Building my own model HOT 3
- Excessive CPU HOT 1
- Docker Image for Speech Diarization HOT 3
- separate `.wav` files per speaker HOT 2
- Can I recognize only one speaker? HOT 2
- Tunable Parameters HOT 4
- OSError: [Errno 2] No such file or directory HOT 1
- WAV files format HOT 4
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from speaker-diarization.