google-speech-api | 易学教程

WebException: Error: NameResolutionFailure in unity3d

阅读更多关于 WebException: Error: NameResolutionFailure in unity3d

问题 Here is my script that request Google Speech API. public string HttpUploadFile(string url, string file, string paramName, string contentType) { file = @"C:\Users\Faizan Khan\AppData\LocalLow\DefaultCompany\GoogleSpeechTest\testing\testing7.060799.wav"; Debug.Log(string.Format("Uploading {0} to {1}", file, url)); HttpWebRequest wr = (HttpWebRequest)WebRequest.Create(url); wr.ContentType = "audio/l16; rate=44100"; wr.Method = "POST"; wr.KeepAlive = true; wr.Credentials = System.Net

Google speech API internal server error -83104

阅读更多关于 Google speech API internal server error -83104

问题 I am newly touching Google speech API and trying to make the official Node.js demo work. But when I using a .flac file (record from my phone and converted) to test the demo, I got this error from Google speech API server: Got audio file! Analyzing speech... { Error: Aborted: internal server error -83104 at Request._callback (~/work/demo/nodejs-docs-samples/speech/node_modules/google-auth-library/lib/transporters.js:85:15) at Request.self.callback (~/work/demo/nodejs-docs-samples/speech/node

Fetch “transcript” values from Google speech api

阅读更多关于 Fetch “transcript” values from Google speech api

问题 I am trying to fetch the "transcript" value from the following result: { transcript: "1 2 3 4" confidence: 0.902119 words { start_time { nanos: 200000000 } end_time { nanos: 700000000 } word: "1" } words { start_time { nanos: 700000000 } end_time { nanos: 900000000 } word: "2" } words { start_time { nanos: 900000000 } end_time { seconds: 1 } word: "3" } words { start_time { seconds: 1 } end_time { seconds: 1 nanos: 300000000 } word: "4" } } The code I am writing to get it is : for result in

Google Speech Api get text from audio file returning {“result”:[]} in C#

阅读更多关于 Google Speech Api get text from audio file returning {“result”:[]} in C#

问题 I'm trying to create a windows application where I can take an audio file I have and transcribe the voice in it to a text file with the Google Speech Recognition API. Here is what I did: 1) I went here https://groups.google.com/a/chromium.org/forum/?fromgroups#!forum/chromium-dev and became a member. 2) I went to my Google Developers Console and generated an API key successfully. 3) I got some code online and ran it: private void btnGoogle_Click(object sender, EventArgs e) { string path = @"Z

What is Google Speech To Text Rest APi Address?

阅读更多关于 What is Google Speech To Text Rest APi Address?

问题 I have got the speech API ke y as below image depicted but i am unable to find where to request? where is URL given? "http://www.google.com/speech-api/v2/recognize?output=json&lang=en-us&key="+ apiKey; "https://speech.googleapis.com/v1/speech:recognize?key=" + apiKey; "https://speech.googleapis.com/v1beta1/speech:syncrecognize?key=${" + apiKey + "}"; All are i guess wrong as i am getting DNS name resolution error . what is the correct URL (remember i didn't purchase the service yet), where i

Google Speech API how to pickup results of my asyncrecognize request

阅读更多关于 Google Speech API how to pickup results of my asyncrecognize request

问题 I executed the following curl -s -k -H "Content-Type: application/json" -H "Authorization: Bearer ${MY_ACCESS_TOKEN} " https://speech.googleapis.com/v1beta1/speech:asyncrecognize -d @json/automator.json which returned { "name": "376034185739864225" } I tried using the interactive GET available here` passing in that name value above, (about 12 hours later), but got back { "error": { "code": 404, "message": "Requested entity was not found.", "status": "NOT_FOUND" } } Am I missing something?

How to use StreamingRecognize for more than 1 minute?

阅读更多关于 How to use StreamingRecognize for more than 1 minute?

问题 I am quite new to using Google speech API. My application requiers me to contiouously stream audio requsts for speech recognition. the usage will be more than 1 minute continuously. However, the service halts after 60 seconds as per the Usage Limits. Is there a way around this issue? Any help is greatly appreciated. Thanks 回答1: Buried deep in the Google cloud console is a link to a form where you can request increase in some of the limits. However, if possible, use the Async recognition which

Detect fluency from google speech api results

阅读更多关于 Detect fluency from google speech api results

问题 Trying to determine fluency of a speaker using google speech (to text) api. So far i have found that api (betav1) can show the time taken to speak a word ( its starting time and ending time ). And from wikipedia, Oral fluency or speaking fluency is a measurement both of production and reception of speech, as a fluent speaker must be able to understand and respond to others in conversation. Spoken language is typically characterized by seemingly non-fluent qualities (e.g., fragmentation,

Google Python Speech Recognition with wake word

阅读更多关于 Google Python Speech Recognition with wake word

问题 I am using the Google python speech recognition on a Raspberry Pi and it works really well. However, it is constantly listening and I would prefer to have a wake word so that it only listens when I need it to. At the moment it listens until there is a silence (or a return to whatever the ambient background noise is) and then sends the sound clip to the API for speech to text recognition. Is there a way for me to shorten the length of time that it is listening? This way it can be listening to

How to increase Google's Speech Recognition accuracy for separated numbers

阅读更多关于 How to increase Google's Speech Recognition accuracy for separated numbers

问题 We give this image to our users: enter image description here This picture is representing separate numbers. And all of our users read it as "11-0-9-5" into their microphones. We use Google Speech Engine, and it interprets this result: "1109 5". This makes it impossible for us to compare the spoken words with the expected result. And we're stuck in this phase. Is there a way to tell Google's Speech Recognition to understand spoken numbers literally and separately, and do not join them