google-speech-api | 易学教程

audio file isn't being parsed with Google Speech

阅读更多关于 audio file isn't being parsed with Google Speech

问题 This question is a followup to a previous question. The snippet of code below almost works...it runs without error yet gives back a None value for results_list . This means it is accessing the file (I think) but just can't extract anything from it. I have a file, sample.wav , living publicly here: https://storage.googleapis.com/speech_proj_files/sample.wav I am trying to access it by specifying source_uri='gs://speech_proj_files/sample.wav' . I don't understand why this isn't working. I don't

Google Speech API “Sample rate in request does not match FLAC header”

阅读更多关于 Google Speech API “Sample rate in request does not match FLAC header”

问题 I'm trying to convert an mp4 video clip into a FLAC audio file and then have google speech spit out the words from the video so that I can detect if specific words were said. I have everything working except that I am getting an error from the Speech API: { "error": { "code": 400, "message": "Sample rate in request does not match FLAC header.", "status": "INVALID_ARGUMENT" } } I am using FFMPEG in order to convert the mp4 into a FLAC file. I am specifying that the FLAC file be 16 bits in the

accessing audio files from Google Cloud Storage when using Google Speech

阅读更多关于 accessing audio files from Google Cloud Storage when using Google Speech

问题 I have used this bit of code below to successfully parse a .wav file which contains speech, to text, using Google Speech. But I want to access a different .wav file, which I have placed on Google Cloud Storage (publicly), instead of on my local hard drive. Why doesn't simply changing speech_file = 'my/local/system/sample.wav' to speech_file = 'https://console.cloud.google.com/storage/browser/speech_proj_files/sample.wav' work acceptably? Here is my code: speech_file = 'https://console.cloud

accessing audio files from Google Cloud Storage when using Google Speech

阅读更多关于 accessing audio files from Google Cloud Storage when using Google Speech

INVALID_ARGUMENT: Request payload size exceeds the limit: 10485760 bytes

阅读更多关于 INVALID_ARGUMENT: Request payload size exceeds the limit: 10485760 bytes

问题 I'm using for the first time the GCS Speech API for a project to convert a series of audio files to text. Each file has around 60 minutes and is a person talking continuously during the whole time. I've installed the GC SDK and I'm using it to perform the requests as shown bellow: gcloud ml speech recognize-long-running \ "/path/to/file/audio.flac" \ --language-code="pt-PT" --async Every time I run this on one of my recording, it gives the following error message: ERROR: (gcloud.ml.speech

INVALID_ARGUMENT: Request payload size exceeds the limit: 10485760 bytes

阅读更多关于 INVALID_ARGUMENT: Request payload size exceeds the limit: 10485760 bytes

How to convert the float32Array format of native html5 recorded audio to proper bytes for Google Speech-to-Text service?

阅读更多关于 How to convert the float32Array format of native html5 recorded audio to proper bytes for Google Speech-to-Text service?

问题 If you follow this tutorial: https://medium.com/ideas-at-igenius/delivering-a-smooth-cross-browser-speech-to-text-experience-b1e1f1f194a2 you will manage to create a script processor to which you add a listener scriptProcessor = inputPoint.context.createScriptProcessor(bufferSize, in_channels, out_channels) //... scriptProcessor.addEventListener('audioprocess', streamAudioData) Inside the callback by calling this line: callback_param.inputBuffer.getChannelData(0) one receives a javascript

Google Cloud Speech to Text API - Speaker Diarization

阅读更多关于 Google Cloud Speech to Text API - Speaker Diarization

问题 When i am trying to do a speech to text transcribe of a live phone call using web socket. Already included const Speech = require('@google-cloud/speech').v1p1beta1; const speech = new Speech.SpeechClient(); With following config. encoding: 'LINEAR16', sampleRateHertz: 8000, languageCode: 'en-US', useEnhanced: true, enableSpeakerDiarization: true, diarizationSpeakerCount: 2, enableWordConfidence: true, model: `phone_call`, I am getting following response { "results": [ { "alternatives": [ {

Grpc proxy server for Google speech streaming api is not responding ( Grpc.Core.RpcException )

阅读更多关于 Grpc proxy server for Google speech streaming api is not responding ( Grpc.Core.RpcException )

问题 We are trying to implement GRPC proxy server ( ASP.NET Core console app ) which will proxy incoming data to Google Speech streaming api. Code is pretty much the same as in sample application. Proxy server purpose is multiple language recognition and api key protection. For testing purpose I have created Windows forms application which sends available data ( audio chunks via DataAvailable handler ) to proxy server. Server starts recognizing transcripts, but after few recognitions (

Trouble passing string variable to return data from python function to be used globally anywhere in a python script or program - EDITED for clarity

阅读更多关于 Trouble passing string variable to return data from python function to be used globally anywhere in a python script or program - EDITED for clarity

问题 I am editing my question to reflect the issue I am having in my application. I am trying to take a streamed audio and convert it to text using Google text to speech. Then pass that that text as input to a conversation not on Watson. Watson then returns its answer. The latter half works great. The issue I am having is that I can't get the script to pass the text from the recorded speech to the Watson service I created. I don't get an error, I just get nothing. The mic is working (I tested it