google-speech-api

audio file isn't being parsed with Google Speech

左心房为你撑大大i 提交于 2019-12-24 06:48:04
问题 This question is a followup to a previous question. The snippet of code below almost works...it runs without error yet gives back a None value for results_list . This means it is accessing the file (I think) but just can't extract anything from it. I have a file, sample.wav , living publicly here: https://storage.googleapis.com/speech_proj_files/sample.wav I am trying to access it by specifying source_uri='gs://speech_proj_files/sample.wav' . I don't understand why this isn't working. I don't

Google Speech API “Sample rate in request does not match FLAC header”

北城余情 提交于 2019-12-24 06:38:00
问题 I'm trying to convert an mp4 video clip into a FLAC audio file and then have google speech spit out the words from the video so that I can detect if specific words were said. I have everything working except that I am getting an error from the Speech API: { "error": { "code": 400, "message": "Sample rate in request does not match FLAC header.", "status": "INVALID_ARGUMENT" } } I am using FFMPEG in order to convert the mp4 into a FLAC file. I am specifying that the FLAC file be 16 bits in the

accessing audio files from Google Cloud Storage when using Google Speech

安稳与你 提交于 2019-12-24 02:07:21
问题 I have used this bit of code below to successfully parse a .wav file which contains speech, to text, using Google Speech. But I want to access a different .wav file, which I have placed on Google Cloud Storage (publicly), instead of on my local hard drive. Why doesn't simply changing speech_file = 'my/local/system/sample.wav' to speech_file = 'https://console.cloud.google.com/storage/browser/speech_proj_files/sample.wav' work acceptably? Here is my code: speech_file = 'https://console.cloud

accessing audio files from Google Cloud Storage when using Google Speech

╄→гoц情女王★ 提交于 2019-12-24 02:06:50
问题 I have used this bit of code below to successfully parse a .wav file which contains speech, to text, using Google Speech. But I want to access a different .wav file, which I have placed on Google Cloud Storage (publicly), instead of on my local hard drive. Why doesn't simply changing speech_file = 'my/local/system/sample.wav' to speech_file = 'https://console.cloud.google.com/storage/browser/speech_proj_files/sample.wav' work acceptably? Here is my code: speech_file = 'https://console.cloud

INVALID_ARGUMENT: Request payload size exceeds the limit: 10485760 bytes

时光总嘲笑我的痴心妄想 提交于 2019-12-23 08:08:37
问题 I'm using for the first time the GCS Speech API for a project to convert a series of audio files to text. Each file has around 60 minutes and is a person talking continuously during the whole time. I've installed the GC SDK and I'm using it to perform the requests as shown bellow: gcloud ml speech recognize-long-running \ "/path/to/file/audio.flac" \ --language-code="pt-PT" --async Every time I run this on one of my recording, it gives the following error message: ERROR: (gcloud.ml.speech

INVALID_ARGUMENT: Request payload size exceeds the limit: 10485760 bytes

爱⌒轻易说出口 提交于 2019-12-23 08:08:21
问题 I'm using for the first time the GCS Speech API for a project to convert a series of audio files to text. Each file has around 60 minutes and is a person talking continuously during the whole time. I've installed the GC SDK and I'm using it to perform the requests as shown bellow: gcloud ml speech recognize-long-running \ "/path/to/file/audio.flac" \ --language-code="pt-PT" --async Every time I run this on one of my recording, it gives the following error message: ERROR: (gcloud.ml.speech

How to convert the float32Array format of native html5 recorded audio to proper bytes for Google Speech-to-Text service?

匆匆过客 提交于 2019-12-23 04:48:25
问题 If you follow this tutorial: https://medium.com/ideas-at-igenius/delivering-a-smooth-cross-browser-speech-to-text-experience-b1e1f1f194a2 you will manage to create a script processor to which you add a listener scriptProcessor = inputPoint.context.createScriptProcessor(bufferSize, in_channels, out_channels) //... scriptProcessor.addEventListener('audioprocess', streamAudioData) Inside the callback by calling this line: callback_param.inputBuffer.getChannelData(0) one receives a javascript

Google Cloud Speech to Text API - Speaker Diarization

与世无争的帅哥 提交于 2019-12-23 04:08:04
问题 When i am trying to do a speech to text transcribe of a live phone call using web socket. Already included const Speech = require('@google-cloud/speech').v1p1beta1; const speech = new Speech.SpeechClient(); With following config. encoding: 'LINEAR16', sampleRateHertz: 8000, languageCode: 'en-US', useEnhanced: true, enableSpeakerDiarization: true, diarizationSpeakerCount: 2, enableWordConfidence: true, model: `phone_call`, I am getting following response { "results": [ { "alternatives": [ {

Grpc proxy server for Google speech streaming api is not responding ( Grpc.Core.RpcException )

耗尽温柔 提交于 2019-12-22 17:11:12
问题 We are trying to implement GRPC proxy server ( ASP.NET Core console app ) which will proxy incoming data to Google Speech streaming api. Code is pretty much the same as in sample application. Proxy server purpose is multiple language recognition and api key protection. For testing purpose I have created Windows forms application which sends available data ( audio chunks via DataAvailable handler ) to proxy server. Server starts recognizing transcripts, but after few recognitions (

Trouble passing string variable to return data from python function to be used globally anywhere in a python script or program - EDITED for clarity

僤鯓⒐⒋嵵緔 提交于 2019-12-22 01:07:34
问题 I am editing my question to reflect the issue I am having in my application. I am trying to take a streamed audio and convert it to text using Google text to speech. Then pass that that text as input to a conversation not on Watson. Watson then returns its answer. The latter half works great. The issue I am having is that I can't get the script to pass the text from the recorded speech to the Watson service I created. I don't get an error, I just get nothing. The mic is working (I tested it