speech-recognition

Google speech API [closed]

喜欢而已 提交于 2019-11-26 15:29:25
问题 I'm now working with my project and I'm about to build a Siri-like application for the desktop computer. I am thinking if Google Speech API is reliable and accurate for speech recognition? Can you suggest to me what speech API is the most accurate in terms of speech recognition? Most preferably a free API. Thank you. 回答1: While the Google speech API is free it is not an official public API. Some people have reverse engineered it, as is discussed in this blog. If you are planning on accessing

How to use Speech Recognition inside the iOS SDK? [closed]

谁说胖子不能爱 提交于 2019-11-26 15:13:43
问题 Closed. This question is off-topic. It is not currently accepting answers. Want to improve this question? Update the question so it's on-topic for Stack Overflow. Closed 4 years ago . I know that there is no public API for the SIRI-Services, but is there an API for simple Speech-Recognition? So if I have a textfield and the user taps onto that textfield, a keyboard with the typically microphone button appears and if he pressed it the speech get recognized and transformed into a string object?

Help with SAPI v5.1 SpeechRecognitionEngine always gives same wrong result with C#

假如想象 提交于 2019-11-26 14:48:40
问题 I was playing around with this SAPI v5.1 library. So I was testing a sample WAV file I have. (Download it from here). Anyway, the sound in that file is clear and easy. It contains only one word which is number three. Now when I run the following code, I get number 8 or "eight". If I remove it, I get 7. If I try to randomize the list I get different results and so on. I'm really getting confused and started to think that SpeachRecognition in SAPI library doesn't work at all... Anyway here is

Getting FFT peaks from data

让人想犯罪 __ 提交于 2019-11-26 14:45:16
I am developing a speech recognition system from scratch using Octave. I am trying to detect phonemes by detecting differences in frequency. Currently I have read in a wav file, organized the values into blocks and applied fft to the overall data. After, I plot the new data with plot(abs(real(fft(q)))) which creates this graph: How could I get the frequency values (the peaks of the graph)? If you don't have access to findpeaks , the basic premise behind how it works is that for each point in your signal, it searches a three element window that is centred at this point and checks to see whether

Google Speech Recognition API Result is Empty

时光总嘲笑我的痴心妄想 提交于 2019-11-26 14:42:16
I'm performing an asynchronous request to Google Cloud Speech API, and I do not know how to get the result of operation: Request POST: https://speech.googleapis.com/v1beta1/speech:asyncrecognize Body: { "config":{ "languageCode" : "pt-BR", "encoding" : "LINEAR16", "sampleRate" : 16000 }, "audio":{ "uri":"gs://bucket/audio.flac" } } Which returns: { "name": "469432517" } So, I do a POST: https://speech.googleapis.com/v1beta1/operations/469432517 Which returns: { "name": "469432517", "metadata": { "@type": "type.googleapis.com/google.cloud.speech.v1beta1.AsyncRecognizeMetadata", "progressPercent

Error ERROR_RECOGNIZER_BUSY with offline speech recognition

两盒软妹~` 提交于 2019-11-26 14:36:35
问题 I have made research on google offline speech recognition. but it works fine in google nexus 5(OS:-4.4) but same build if I implement in Samsung galaxy s5(OS:-5.0) it is not recognizing and it is showing this error: 8- ERROR_RECOGNIZER_BUSY. Below is my code. By keeping this link as reference I have made a changes http://www.truiton.com/2014/06/android-speech-recognition-without-dialog-custom-activity/ Without internet voice must recognize. I have worked on Pocket sphinx but it take lot of

Text-to-speech (voice generation) and speech-to-text (voice recognition) APIs?

∥☆過路亽.° 提交于 2019-11-26 14:22:51
Is there a comprehensive list of known APIs for desktop or browser environments? Michael Levy I'll rehash and update an answer from Speech recognition in C or Java or PHP? . This is by no means comprehensive, but it might be a start for you From watching these questions for few months, I've seen most developer choices break down like this: Windows folks - use the System.Speech features of .Net or Microsoft.Speech and install the free recognizers Microsoft provides. Windows 7 includes a full speech engine. Others are downloadable for free. There is a C++ API to the same engines known as SAPI.

How to use google speech recognition api in c#?

▼魔方 西西 提交于 2019-11-26 14:16:20
问题 I want to get the audio file from c# and send to google speech recognition API for get the "speech to text" answer. My code is like this: try { byte[] BA_AudioFile = GetFile(filename); HttpWebRequest _HWR_SpeechToText = null; _HWR_SpeechToText = (HttpWebRequest)HttpWebRequest.Create( "https://www.google.com/speech-api/v2/recognize?output=json&lang=" + DEFAULT_LANGUAGE + "&key=" + key); _HWR_SpeechToText.Credentials = CredentialCache.DefaultCredentials; _HWR_SpeechToText.Method = "POST"; _HWR

Build NEW Acoustic model, Dictionary , Language model for uncommon language speech recognition

半世苍凉 提交于 2019-11-26 14:10:06
问题 I want to build NEW Acoustic model ,New Dictionary ,New Language model for " Sinhala Language speech recognition " Sinhala language Characters are Unicode based. for an example A=අ,I=ඉ,U=උ,KA=ක,BA=බ. I did go through CMUSphinx Tutorial For Developers. But it did not help me. It works for English language. Language model should be ARPA model. and How can I map Sinhala Unicode with English phonemes and how to train Language model with Different voices. Is there any tool available for generate

Writing speech-recognition engine [closed]

耗尽温柔 提交于 2019-11-26 13:54:45
问题 As it currently stands, this question is not a good fit for our Q&A format. We expect answers to be supported by facts, references, or expertise, but this question will likely solicit debate, arguments, polling, or extended discussion. If you feel that this question can be improved and possibly reopened, visit the help center for guidance. Closed 8 years ago . So, like many others I decided to create my own speech-recognition engine. As it turned out, it's not easy at all, instead, it's