google-speech-api

HotwordDetector using VoiceInteractionService in android

≡放荡痞女 提交于 2019-12-04 04:08:38
I am creating the voice recognition service like "Ok Google" with custom command. For this I have used the VoiceInteractionService and I refered the code from here . Also, I am getting the STATE_HARDWARE_UNAVAILABLE error in onAvailabilityChanged . public class VoiceCommandService extends VoiceInteractionService { private static final String TAG = "AlwaysOnHotwordDetector"; Locale locale = new Locale("en-US"); protected SpeechRecognizer mSpeechRecognizer; protected Intent mSpeechRecognizerIntent; public final Callback mHotwordCallback = new Callback() { @Override public void

How to translate live streaming using google speech api?

限于喜欢 提交于 2019-12-03 16:59:24
问题 this is the google speech API docs: https://cloud.google.com/speech/docs/sync-recognize i try this API for 2 weeks. but still cant solved my main purpose (translate live streaming). I'm using PHP. (other languange suggestion is allowed, i will find by myself) What i can do in my 2 weeks: Synchronous Speech Recognition (<=1min) Asynchronous Speech Recognition (>1min and <=80min). Note : i can modify this to accept 3hours video. Live speech recognition from mic : https://www.google.com/intl/en

Different between Google Speech API and Web Speech API

ぐ巨炮叔叔 提交于 2019-12-03 16:17:12
问题 I am working on web speech recognition. And I found that Google provide a API which call "Google speech API V2" to developer. But I notice there is a limit on every day to use it. After that I found there is a native WEB Speech API also can implement the speech recognition. And it just working on google chrome and opera: http://caniuse.com/#feat=speech-recognition So 1. What is the different Google Speech API and Web Speech API? Are they have any relations? The speech recognition result json

How to translate live streaming using google speech api?

我的未来我决定 提交于 2019-12-03 05:59:07
this is the google speech API docs: https://cloud.google.com/speech/docs/sync-recognize i try this API for 2 weeks. but still cant solved my main purpose (translate live streaming). I'm using PHP. (other languange suggestion is allowed, i will find by myself) What i can do in my 2 weeks: Synchronous Speech Recognition (<=1min) Asynchronous Speech Recognition (>1min and <=80min). Note : i can modify this to accept 3hours video. Live speech recognition from mic : https://www.google.com/intl/en/chrome/demos/speech.html UPDATE: Perform streaming API with audio less than 6sec duration. What can't i

Different between Google Speech API and Web Speech API

家住魔仙堡 提交于 2019-12-03 05:25:27
I am working on web speech recognition. And I found that Google provide a API which call "Google speech API V2" to developer. But I notice there is a limit on every day to use it. After that I found there is a native WEB Speech API also can implement the speech recognition. And it just working on google chrome and opera: http://caniuse.com/#feat=speech-recognition So 1. What is the different Google Speech API and Web Speech API? Are they have any relations? The speech recognition result json is return from google. Is that the google speech api will be more accurate than web speech api? Thank

Google Speech API V2

混江龙づ霸主 提交于 2019-12-03 03:43:43
问题 I have followed this link to generate API keys, but the Speech API was not found on the console. The application example I'm following is here: https://github.com/gillesdemey/google-speech-v2/ I keep on getting an error: Your client does not have permission to get URL. What's the problem? 回答1: You have to be a member of chromium-dev discussion list before you can enable the Speech API in your Google Developers Console. Go to Chromium Dev group and click Join the list. Do not post to the group

Is there a way to force Google Speech api to return only words as response?

前提是你 提交于 2019-12-03 01:56:25
I am using Googles this api :- https://www.google.com/speech-api/v2/recognize?output=json&lang= "+ language_code+"&key="My key" for speech recognition and it's working very well. The issue is with numbers i.e, if I say one two three four the result will be 1234 and if I say one thousand two hundred thirty four the result is still 1234 . Another issue is that with other languages i.e. the word elf in German is eleven . If you say elf the result is 11 , instead of elf. I know we have no control over the api but is there any parameters or hacks we can add to this api to force it to return only

Google Speech Recognition API: timestamp for each word?

ぃ、小莉子 提交于 2019-12-03 01:48:56
It's possible to use Google's Speech recognition API to get a transcription for an audio file (WAV, MP3, etc.) by doing a request to http://www.google.com/speech-api/v2/recognize?... Example: I have said " one two three for five " in a WAV file. Google API gives me this: { u'alternative': [ {u'transcript': u'12345'}, {u'transcript': u'1 2 3 4 5'}, {u'transcript': u'one two three four five'} ], u'final': True } Question: is it possible to get the time (in seconds) at which each word has been said? With my example: ['one', 0.23, 0.80], ['two', 1.03, 1.45], ['three', 1.79, 2.35], etc. i.e. the

GOOGLE_APPLICATION_CREDENTIALS error

倾然丶 夕夏残阳落幕 提交于 2019-12-03 00:22:06
I am trying to use the google speech API for converting an audio file to text, but facing some problems with the documentation.I have enabled the speech API and created an service account file. I also got the service account credentials activated. Then I wrote the following command to retrieve the authorization token using my service account. gcloud auth application-default print-access-token But it is returning this message I don't understand why this is happening because I already have an VM instance created in my Compute Engine. Can anybody explain where am I going wrong with this? Also, is

how to speed up google cloud speech

元气小坏坏 提交于 2019-12-02 18:49:47
问题 I am using a microphone which records sound through a browser, converts it into a file and sends the file to a java server. Then, my java server sends the file to the cloud speech api and gives me the transcription. The problem is that the transcription is super long (around 3.7sec for 2sec of dialog). So I would like to speed up the transcription. The first thing to do is to stream the data (if I start the transcription at the beginning of the record. The problem is that I don't really