-
Notifications
You must be signed in to change notification settings - Fork 25
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
FLAC in browser #5
Comments
it probably depends on which audio codec is used in |
Hi @russaa Sorry, this isn't relevant to your project. But I am looking for some insight on how to record good quality speech that can be transcribed with good quality. PS: I am having no issues with English speaker, it is non-english speaker/languages causing problems. |
I haven't really used But you probably also need to check, if the specific browser supports the container-format & codec you want to use, see MediaRecorder.isTypeSupported(). For comparison of various lossy codecs, the Opus website provides some information. If you do not have bandwidth concerns, you could also send the raw WAV data to your server. Also: maybe the French speech recognition of Google AI just generally delivers poorer results(?) |
|
Hi @wingedrasengan927 This should give you FLAC file which can be used with Google Speech API. Also, I would like to recommend that record the voice using separate stream and use libflac.js to encode audio stream. This approach really improves quality. |
thank you for the advice @carbonrider. would definitely try it out. |
Tested it. Works like charm. Thank you so much! |
Awesome @wingedrasengan927 |
I stuck into the similar problem. Could you please share your working snippet? |
@carbonrider - can you tell me which ffmpeg you are using? |
@wingedrasengan927 @carbonrider - do you both have sample code I can look at? |
Hi,
Heads up: This question is to understand which approach will have better results.
We are facing some issues with French speech recognition (we are using Google AI services). The current process is as follows
Record using MediaRecorder JS API --> Save as webm ---> Extract FLAC using FFMPEG --> Invoke Google Speech API
After looking at your library, it seems that you are converting audio bits received from MediaRecorder to FLAC in browser itself (which implies server side processing is not required).
Now moving to the main point, do you think any of the above approach will have impact on the Transcription quality?
The text was updated successfully, but these errors were encountered: