Comments (6)
I posted my own question on the DSP stack exchange to help get a better handle on this. It includes a nice graph of the filter array :)
http://dsp.stackexchange.com/questions/37450/please-help-me-understand-this-audio-downsampling-code
from speech-javascript-sdk.
Nice! So you're planning to support downsampling to 8 kHz too? Is this to minimize the bandwidth sent to Watson? I'm wondering whether going below 16 kHz actually affects recognition quality.
Well, the service supports 8 kHz "narrowband" models and 16 kHz "broadband" models. Right now, when you select a narrowband model, the audio is downsampled to 16 kHz here, and then I think it gets downsampled again to 8 kHz at the service. So downsampling straight to 8 kHz here, in a single step, should both save bandwidth and increase quality.
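To put a rough number on the bandwidth part, here is a minimal sketch. It assumes the audio is sent as uncompressed 16-bit mono PCM, which is an assumption made for illustration; `pcmBytesPerSecond` is a hypothetical helper, not something from the SDK.

```javascript
// Rough bandwidth estimate for raw PCM audio.
// Assumption: uncompressed 16-bit mono PCM (not confirmed by this thread).
// pcmBytesPerSecond is a hypothetical helper, not part of the SDK.
function pcmBytesPerSecond(sampleRateHz, bitsPerSample = 16, channels = 1) {
  return sampleRateHz * (bitsPerSample / 8) * channels;
}

const broadbandRate = pcmBytesPerSecond(16000); // 16 kHz "broadband"
const narrowbandRate = pcmBytesPerSecond(8000); // 8 kHz "narrowband"

console.log(`16 kHz: ${broadbandRate} bytes/s`);
console.log(`8 kHz: ${narrowbandRate} bytes/s`);
console.log(`ratio: ${narrowbandRate / broadbandRate}`);
```

Under that assumption, sending 8 kHz audio takes half the bytes per second of 16 kHz audio.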
Ah, right, I forgot about the narrowband model. So really this is about bandwidth savings when opting into the non-broadband model. Thanks for the clarification.
@nfriedly very nice!
Question about this line:
Input audio is typically 48kHz, this downsamples it to 16kHz.
I've seen several browsers provide different sample rates. For example, Chrome right now produces 44.1 kHz samples. I'm wondering whether this filtering logic is going to slightly distort that input.
Oh, let me adjust that. With 44.1 kHz input, the cutoff point of the low-pass filter ends up slightly lower, but not enough to affect human speech.
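A quick sketch of why the cutoff moves: a filter with fixed coefficients has its cutoff at a fixed fraction of the input sample rate, so feeding it a lower rate lowers the cutoff in proportion. The 8 kHz design cutoff below (the Nyquist frequency of 16 kHz output) is an assumption for illustration, and `effectiveCutoffHz` is a hypothetical helper, not the SDK's code.

```javascript
// A fixed set of filter coefficients has its cutoff at a fixed fraction
// of the input sample rate, so the cutoff in Hz scales with that rate.
// Assumptions: the filter was designed for 48 kHz input with an 8 kHz
// cutoff; effectiveCutoffHz is a hypothetical helper, not SDK code.
function effectiveCutoffHz(designCutoffHz, designRateHz, actualRateHz) {
  return designCutoffHz * (actualRateHz / designRateHz);
}

const designCutoff = 8000; // assumed design cutoff in Hz
const at48k = effectiveCutoffHz(designCutoff, 48000, 48000);
const at44k = effectiveCutoffHz(designCutoff, 48000, 44100);

console.log(`48 kHz input: cutoff ${at48k} Hz`);
console.log(`44.1 kHz input: cutoff ${at44k.toFixed(0)} Hz`);
```

Under these assumptions the cutoff drops by a little under 10 percent.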