google-speech-api | 易学教程

HotwordDetector using VoiceInteractionService in android

阅读更多关于 HotwordDetector using VoiceInteractionService in android

I am creating the voice recognition service like "Ok Google" with custom command. For this I have used the VoiceInteractionService and I refered the code from here . Also, I am getting the STATE_HARDWARE_UNAVAILABLE error in onAvailabilityChanged . public class VoiceCommandService extends VoiceInteractionService { private static final String TAG = "AlwaysOnHotwordDetector"; Locale locale = new Locale("en-US"); protected SpeechRecognizer mSpeechRecognizer; protected Intent mSpeechRecognizerIntent; public final Callback mHotwordCallback = new Callback() { @Override public void

How to translate live streaming using google speech api?

阅读更多关于 How to translate live streaming using google speech api?

问题 this is the google speech API docs: https://cloud.google.com/speech/docs/sync-recognize i try this API for 2 weeks. but still cant solved my main purpose (translate live streaming). I'm using PHP. (other languange suggestion is allowed, i will find by myself) What i can do in my 2 weeks: Synchronous Speech Recognition (<=1min) Asynchronous Speech Recognition (>1min and <=80min). Note : i can modify this to accept 3hours video. Live speech recognition from mic : https://www.google.com/intl/en

Different between Google Speech API and Web Speech API

阅读更多关于 Different between Google Speech API and Web Speech API

问题 I am working on web speech recognition. And I found that Google provide a API which call "Google speech API V2" to developer. But I notice there is a limit on every day to use it. After that I found there is a native WEB Speech API also can implement the speech recognition. And it just working on google chrome and opera: http://caniuse.com/#feat=speech-recognition So 1. What is the different Google Speech API and Web Speech API? Are they have any relations? The speech recognition result json

How to translate live streaming using google speech api?

阅读更多关于 How to translate live streaming using google speech api?

this is the google speech API docs: https://cloud.google.com/speech/docs/sync-recognize i try this API for 2 weeks. but still cant solved my main purpose (translate live streaming). I'm using PHP. (other languange suggestion is allowed, i will find by myself) What i can do in my 2 weeks: Synchronous Speech Recognition (<=1min) Asynchronous Speech Recognition (>1min and <=80min). Note : i can modify this to accept 3hours video. Live speech recognition from mic : https://www.google.com/intl/en/chrome/demos/speech.html UPDATE: Perform streaming API with audio less than 6sec duration. What can't i

Different between Google Speech API and Web Speech API

阅读更多关于 Different between Google Speech API and Web Speech API

I am working on web speech recognition. And I found that Google provide a API which call "Google speech API V2" to developer. But I notice there is a limit on every day to use it. After that I found there is a native WEB Speech API also can implement the speech recognition. And it just working on google chrome and opera: http://caniuse.com/#feat=speech-recognition So 1. What is the different Google Speech API and Web Speech API? Are they have any relations? The speech recognition result json is return from google. Is that the google speech api will be more accurate than web speech api? Thank

Google Speech API V2

阅读更多关于 Google Speech API V2

问题 I have followed this link to generate API keys, but the Speech API was not found on the console. The application example I'm following is here: https://github.com/gillesdemey/google-speech-v2/ I keep on getting an error: Your client does not have permission to get URL. What's the problem? 回答1: You have to be a member of chromium-dev discussion list before you can enable the Speech API in your Google Developers Console. Go to Chromium Dev group and click Join the list. Do not post to the group

Is there a way to force Google Speech api to return only words as response?

阅读更多关于 Is there a way to force Google Speech api to return only words as response?

I am using Googles this api :- https://www.google.com/speech-api/v2/recognize?output=json&lang= "+ language_code+"&key="My key" for speech recognition and it's working very well. The issue is with numbers i.e, if I say one two three four the result will be 1234 and if I say one thousand two hundred thirty four the result is still 1234 . Another issue is that with other languages i.e. the word elf in German is eleven . If you say elf the result is 11 , instead of elf. I know we have no control over the api but is there any parameters or hacks we can add to this api to force it to return only

Google Speech Recognition API: timestamp for each word?

阅读更多关于 Google Speech Recognition API: timestamp for each word?

It's possible to use Google's Speech recognition API to get a transcription for an audio file (WAV, MP3, etc.) by doing a request to http://www.google.com/speech-api/v2/recognize?... Example: I have said " one two three for five " in a WAV file. Google API gives me this: { u'alternative': [ {u'transcript': u'12345'}, {u'transcript': u'1 2 3 4 5'}, {u'transcript': u'one two three four five'} ], u'final': True } Question: is it possible to get the time (in seconds) at which each word has been said? With my example: ['one', 0.23, 0.80], ['two', 1.03, 1.45], ['three', 1.79, 2.35], etc. i.e. the

GOOGLE_APPLICATION_CREDENTIALS error

阅读更多关于 GOOGLE_APPLICATION_CREDENTIALS error

I am trying to use the google speech API for converting an audio file to text, but facing some problems with the documentation.I have enabled the speech API and created an service account file. I also got the service account credentials activated. Then I wrote the following command to retrieve the authorization token using my service account. gcloud auth application-default print-access-token But it is returning this message I don't understand why this is happening because I already have an VM instance created in my Compute Engine. Can anybody explain where am I going wrong with this? Also, is

how to speed up google cloud speech

阅读更多关于 how to speed up google cloud speech

问题 I am using a microphone which records sound through a browser, converts it into a file and sends the file to a java server. Then, my java server sends the file to the cloud speech api and gives me the transcription. The problem is that the transcription is super long (around 3.7sec for 2sec of dialog). So I would like to speed up the transcription. The first thing to do is to stream the data (if I start the transcription at the beginning of the record. The problem is that I don't really