google-speech-api

Google Speech API V2

限于喜欢 submitted on 2019-12-02 15:58:36
I followed this link to generate API keys, but the Speech API was not listed in the console. The example application I'm following is here: https://github.com/gillesdemey/google-speech-v2/ I keep getting an error: Your client does not have permission to get URL. What's the problem?
Gui Ambros: You have to be a member of the chromium-dev discussion list before you can enable the Speech API in your Google Developers Console. Go to the Chromium Dev group and click Join the list. Do not post to the group regarding the Google Speech API, as it is completely off topic. Go back to Google Developers

How to speed up Google Cloud Speech

℡╲_俬逩灬. submitted on 2019-12-02 09:09:49
I am using a microphone that records sound through a browser, converts it into a file and sends the file to a Java server. My Java server then sends the file to the Cloud Speech API and returns the transcription. The problem is that the transcription takes very long (around 3.7 s for 2 s of dialog), so I would like to speed it up. The first thing to do is to stream the data (that is, start the transcription at the beginning of the recording). The problem is that I don't really understand the API. For instance, if I want to transcribe my audio stream from the source (browser/microphone

NodeJS Convert Int16Array binary Buffer to LINEAR16 encoded raw stream for Google Speech API

淺唱寂寞╮ submitted on 2019-12-02 07:36:18
I'm trying to convert speech to text on a Node server, where the speech is recorded in the browser using AudioContext. I'm able to send the Int16Array buffer (the recorded data) to my Node server through a WebSocket connection with binaryType: arraybuffer.
    this.processor.onaudioprocess = (e) => {
      // this.processAudio(e)
      for (
        var float32Array = e.inputBuffer.getChannelData(0) || new Float32Array(this.bufferSize),
          len = float32Array.length,
          int16Array = new Int16Array(len);
        len--;
      )
        int16Array[len] = 32767 * Math.min(1, float32Array[len]);
      this.socket.send(int16Array.buffer);
    };
On the server, data is received
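The float-to-16-bit conversion in the browser snippet can be sketched as follows. This is a minimal illustration in Python rather than the poster's Node code, assuming LINEAR16 means little-endian signed 16-bit PCM; the function name float_to_linear16 is hypothetical. Unlike the snippet, which only caps values at 1, it also clamps at -1 so that out-of-range samples cannot wrap around.

```python
import struct


def float_to_linear16(samples):
    """Convert floats in [-1.0, 1.0] to little-endian signed 16-bit PCM bytes.

    Hypothetical helper for illustration; not part of any Google library.
    """
    # Clamp on both sides before scaling to the 16-bit range.
    clamped = (max(-1.0, min(1.0, s)) for s in samples)
    # int() truncates toward zero, like assignment into a JS Int16Array.
    ints = [int(s * 32767) for s in clamped]
    # "<" = little-endian, "h" = signed 16-bit integer, one per sample.
    return struct.pack("<%dh" % len(ints), *ints)
```

The returned bytes are raw samples without a WAV header, so a recognizer would also need to be told the sample rate the browser recorded at.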

Google Streaming Speech Recognition on an Audio Stream Python

假装没事ソ submitted on 2019-12-01 16:23:05
I have searched through all of Google's available docs, but I could not find an example of streaming speech recognition on an audio stream in Python. Currently, I am using Speech Recognition for Python in Django to get the audio from the user and then listen to the audio. I can then save the file and run Google speech recognition on it, or run it directly on the audio instance that was created. Can somebody show me how to perform streaming speech recognition on an audio stream?
Google provides an example of the streaming Python API here. Rather than opening an audio file to create the stream (as on

ImportError: No module named google.cloud

泪湿孤枕 submitted on 2019-11-30 16:23:26
Question: I am unable to import google.cloud.speech with: from google.cloud import speech. I installed it using: pip install --upgrade google-cloud-speech -t dir-name. Importing it from dir-name gives the error below: ImportError: No module named google.cloud. The google package with all its subpackages is present there, but none of the subpackages contains an __init__.py. How can I import these packages without adding __init__.py to the package folders? PS: I have also tried from __future__
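The excerpt stops before any answer, so the following is only a hedged sketch of one possible workaround, not a confirmed fix. It assumes the installed distribution relies on .pth files for the google namespace, which Python only processes in site directories, so a plain sys.path entry for a `-t` target would skip them. The standard-library call site.addsitedir registers a directory and processes its .pth files; the helper name add_vendored_dir is hypothetical.

```python
import os
import site
import sys


def add_vendored_dir(path):
    """Register `path` as a site directory so its .pth files are processed.

    Hypothetical helper; `path` would be the `-t dir-name` target from the
    question. Returns True if the directory ended up on sys.path.
    """
    abs_path = os.path.abspath(path)
    # Unlike sys.path.append, addsitedir also reads *.pth files in the directory.
    site.addsitedir(abs_path)
    return abs_path in sys.path
```

It would be called once, for example add_vendored_dir("dir-name"), before the from google.cloud import speech line.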

Is Google Cloud Platform available in China?

大兔子大兔子 submitted on 2019-11-29 12:46:21
Question: I want to find out whether there are any restrictions on accessing GCP in China. Is all of GCP available in China? If not, why? Answer 1: There are no locations in China at the moment, and Google seems quite reluctant to bring GCP to the region (https://cloud.google.com/about/locations/). The main reasons probably include that servers operating in mainland China have to comply with a number of regulations, which is basically also why even for operators that have servers in China (e.g. Amazon),

Request had invalid authentication credentials. Expected OAuth 2 access token error in cloud speech api

孤街浪徒 submitted on 2019-11-29 11:45:56
Question: I followed the Google Cloud Speech API quickstart and made the request using curl -s -H "Content-Type: application/json" \ -H "Authorization: Bearer "$(gcloud auth application-default print-access-token) \ https://speech.googleapis.com/v1/speech:recognize \ -d @sync-request.json and the following link, but I got this error: { "error": { "code": 401, "message": "Request had invalid authentication credentials. Expected OAuth 2 access token, login cookie or other valid authentication credential.

Google-speech-api transcribing spoken numbers incorrectly

半世苍凉 submitted on 2019-11-29 07:31:56
I started using the Google Speech API to transcribe audio. The audio being transcribed contains many numbers spoken one after the other, e.g. 273 298, but the transcription comes back as 270-3298. My guess is that it is being interpreted as some sort of phone number. What I want is unparsed output, e.g. "two seventy three two ninety eight", which I can deal with and parse on my own. Is there a setting or support for this kind of thing? Thanks.
Try passing a speech context with some phrase hints. How to use it is documented here: https://cloud.google.com/speech/docs/basics#phrase-hints Give it the spelled
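The phrase-hints suggestion can be sketched as a JSON request body for the v1 speech:recognize REST endpoint that appears elsewhere on this page. This is a minimal sketch, not a confirmed fix: the helper name build_recognize_request, the gs:// URI and the example phrases are hypothetical, and the excerpt does not establish whether hints actually stop the number formatting.

```python
import json


def build_recognize_request(audio_uri, phrases, language_code="en-US"):
    """Build a speech:recognize request body carrying phrase hints.

    Hypothetical helper; it only assembles the JSON and sends nothing.
    """
    return {
        "config": {
            "languageCode": language_code,
            # Phrase hints are passed as a list of speech contexts.
            "speechContexts": [{"phrases": list(phrases)}],
        },
        # Placeholder Cloud Storage URI; inline base64 "content" is the alternative.
        "audio": {"uri": audio_uri},
    }


if __name__ == "__main__":
    body = build_recognize_request(
        "gs://example-bucket/numbers.flac",  # hypothetical location
        ["two seventy three", "two ninety eight"],
    )
    print(json.dumps(body, indent=2))
```

The printed JSON could be saved to a file and posted with curl in the same way as the sync-request.json of the quickstart. Raw audio without a header would additionally need encoding and sampleRateHertz in config.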

How to use Google speech recognition API in Python? [closed]

不羁的心 submitted on 2019-11-27 07:53:09
Stack Overflow might not be the best place to ask this question, but I need help. I have an mp3 file and I want to use Google's speech recognition to get the text out of that file. Any ideas on where I can find documentation or examples would be appreciated.
Take a look at Google Cloud Speech API that enables developers to convert audio to text [...] The API recognizes over 80 languages and variants [...] You can create a free account to get a limited number of API requests. HOW TO: You first need to install the gcloud Python module and the google-api-python-client module with: pip install --upgrade gcloud

How to use Google speech recognition API in Python? [closed]

怎甘沉沦 submitted on 2019-11-26 13:45:38
Question: Stack Overflow might not be the best place to ask this question, but I need help. I have an mp3 file and I want to use Google's speech recognition to get the text out of that file. Any ideas on where I can find documentation or examples would be appreciated. Answer 1: Take a look at Google Cloud Speech API that enables developers to convert audio to text [...] The API recognizes over 80 languages and variants [...] You can create a free account to get a limited number of API requests. HOW TO: You need