assemblyai / assemblyai-go-sdk Goto Github PK
View Code? Open in Web Editor NEWOfficial Go SDK
License: MIT License
Official Go SDK
License: MIT License
Ideally I would be able to make a request to get the temporary token https://www.assemblyai.com/docs/guides/real-time-streaming-transcription#creating-temporary-authentication-tokens
The structs are there, but are not used anywhere. Either the package should be updated to include that function, or there should be a generic http function to make calls to unimplemented AssemblyAI endpoints without having to do all of the set up manually for one endpoint
https://github.com/AssemblyAI/assemblyai-go-sdk/blob/main/types.go#L90
https://github.com/AssemblyAI/assemblyai-go-sdk/blob/main/types.go#L280
Hello and thanks for the recent release, especially the features around ForceEndUtterance
and SetEndUtteranceSilenceThreshold
. Will be using those updates in the future.
I had a question about how the realtime portion of this SDK formats and sends audio. I noticed on the official API docs it says:
When sending audio over the WebSocket connection, you can use the websocket's binary mode to send raw audio data. This can be the raw data recorded directly from a microphone or read from an audio file.
and slightly lower down...
HEADS UP
Sending audio_data via JSON is also supported but will be deprecated in the future. Use the binary mode instead.
If I look in the codebase I find the following:
Line 310 in 8e723bf
Given that &msg
here is using a json.RawMessage type, it seems the realtime SDK is sending the bytes of a json message, but not the raw audio bytes themselves. Is that coming in a future release?
Its possible I am mistaken in how the SDK is working or I don't understand what the API docs are referring to when it comes to "raw audio bytes"? I would assume it means PCM bytes. Looking for clarification.
Thanks for the help!
A declarative, efficient, and flexible JavaScript library for building user interfaces.
๐ Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. ๐๐๐
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google โค๏ธ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.