speech | 易学教程

Using Google Speech API

What is the code for implementing the Google Speech API in my C#-based application? I found out that it is possible to create an audio file, send it to http://slides.html5rocks.com/#speech-input, and receive it back as text. Could you please explain how to do this, or provide the code if you have attempted this before? I have been stuck here for a while now. Much appreciated. Code so far: SpeechRecognitionEngine rec = new SpeechRecognitionEngine(); SpeechSynthesizer dummy = new SpeechSynthesizer(); public Form1() { InitializeComponent(); Choices searching = new Choices("Porsche"); GrammarBuilder

Google Cloud Speech API word Hints

Can you give an example of using word hints in the Google Cloud Speech API? I tried the REST API executor with brook.flac. I entered the phrase Brooklin (instead of Brooklyn), but the result is the same. Do they work at all? From https://cloud.google.com/speech/docs/basics#phrase-hints: For any given recognition task, you may also pass a speechContext (of type SpeechContext) that provides information to aid in processing the given audio. Currently, a context can hold a list of phrases to act as "hints" to the recognizer; these phrases can boost the probability that such words or phrases will be
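
A minimal sketch of how a phrase hint might be attached to a recognition request. It only builds the JSON body and sends nothing over the network. The field names (`speechContexts`, `phrases`, `languageCode`, `encoding`) are assumed from the v1 REST API; the passage quoted above uses the singular `speechContext`, so the exact names may differ by API version. The helper name `build_recognize_request` and the placeholder audio bytes are hypothetical.

```python
import base64
import json


def build_recognize_request(audio_bytes, phrases, language_code="en-US"):
    """Build a request body carrying phrase hints (assumed v1 field names)."""
    return {
        "config": {
            "encoding": "FLAC",
            "languageCode": language_code,
            # Each context holds a list of phrases that act as hints.
            "speechContexts": [{"phrases": list(phrases)}],
        },
        # Inline audio is sent as base64-encoded text.
        "audio": {"content": base64.b64encode(audio_bytes).decode("ascii")},
    }


# Placeholder bytes stand in for the contents of a real .flac file.
body = build_recognize_request(b"fake-flac-bytes", ["Brooklyn"])
print(json.dumps(body, indent=2))
```

Hints nudge the recognizer toward a spelling but do not guarantee it, so an unchanged transcript does not by itself show that the hint was ignored.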

How to install and use additional voices in SpeechSynthesizer()?

I need to use a male voice in SpeechSynthesizer in addition to the default Anna voice. I installed some files from Microsoft, and the GetInstalledVoices() method now reports Microsoft Anna, Microsoft Mary, Microsoft Mike and Sample TTS Voice. When I then try to set the voice with speaker.SelectVoice("Microsoft Mike"); my program throws an exception: Cannot set voice. No matching voice is installed or the voice was disabled. What exactly should I install, and how do I enable Mike's voice? The system is Windows 7.

Dork: After trying many suggestions from the web, I found a working solution: - install Microsoft Speech Platform SDK

text to phonemes converter

Question: I'm searching for a tool that converts text to phonemes (like text-to-speech software). I could program one myself, but it would not be error-free and would take a lot of time. So my question is: is there a simple tool for converting e.g. "hello" to "HH AH0 L OW1", maybe some command-line tool so I can capture the stdout? I'm looking for phonemes in 'Arpabet' style (see the 'hello' example). espeak does something like that, but the output is not in Arpabet style and the phonemes are not split by some
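
One possible approach, sketched under assumptions, is a plain dictionary lookup against a pronunciation lexicon in the CMU Pronouncing Dictionary text format (one word per line followed by its Arpabet phonemes). The two-line `SAMPLE` lexicon and the function names are hypothetical stand-ins; a real lexicon file would be loaded the same way. Words missing from the lexicon come back as `None` rather than being guessed.

```python
# Tiny stand-in lexicon in "WORD  PH1 PH2 ..." format (assumed layout).
SAMPLE = """\
HELLO  HH AH0 L OW1
WORLD  W ER1 L D
"""


def load_pronunciations(text):
    """Parse lexicon lines into {word: [phoneme, ...]}."""
    table = {}
    for line in text.splitlines():
        # Skip blank lines and ';;;' comment lines.
        if not line.strip() or line.startswith(";;;"):
            continue
        word, phones = line.split(None, 1)
        # Keep only the first pronunciation seen for each word.
        table.setdefault(word.lower(), phones.split())
    return table


def to_phonemes(sentence, table):
    """Look up each whitespace-separated word; unknown words map to None."""
    return [table.get(word.lower()) for word in sentence.split()]


table = load_pronunciations(SAMPLE)
print(" ".join(to_phonemes("hello", table)[0]))
```

A lookup like this handles only words present in the lexicon; out-of-vocabulary words would need a separate grapheme-to-phoneme step.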

JAVA using google speech recognition API

Question: I'm trying to use the Google speech recognition API. Here's the code I've written: http://pastebin.com/zJEhnJ74 It works. I get an answer from the server: {"status":5,"id":"8803471b14a2310dfcf917754e8bd4a7-1","hypotheses":[]} Now the problem is "status:5". In fact, here are the status codes: status: 0 – correct, status: 4 – missing audio file, status: 5 – incorrect audio file. My problem is "incorrect audio file". I don't understand whether it is a .flac file error (you can download my test .flac file here:
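
A small sketch of how such a reply could be decoded on the client side, using only the three status codes listed in the question. The mapping covers nothing beyond those codes, and the function name `describe_response` is hypothetical.

```python
import json

# Status codes exactly as listed in the question; others are unknown here.
STATUS_MEANINGS = {
    0: "correct",
    4: "missing audio file",
    5: "incorrect audio file",
}


def describe_response(raw):
    """Return (status, meaning, hypotheses) for a JSON reply string."""
    reply = json.loads(raw)
    status = reply.get("status")
    meaning = STATUS_MEANINGS.get(status, "unknown status")
    return status, meaning, reply.get("hypotheses", [])


raw = '{"status":5,"id":"8803471b14a2310dfcf917754e8bd4a7-1","hypotheses":[]}'
status, meaning, hypotheses = describe_response(raw)
print(status, meaning, hypotheses)
```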

API or SDK for speech to text (speech recognition) iPhone

Hi, I want a speech recognition API or SDK that recognises the speech spoken by the user and returns it as text. A detailed description follows: in my application I need to play an audio file whose text I already have. When the audio starts playing, the word being spoken (from the audio file) should be highlighted. So if I can get the word from an API or SDK, it becomes possible to highlight it. I googled a lot for an API and came across ceedvocalsdk, but it's not available as a free trial. If someone can suggest any alternative suited to my

Python Speaker Recognition [closed]

Question: I have an audio file, a recorded telephone conversation between two people, and I need to separate the voices of the two speakers automatically. I am new to speech recognition; I looked at Python's wave module but failed to find any useful information. Please help me get started. Also please suggest free python

How to capture audio in javascript?

I am currently using getUserMedia(), which only works in Firefox and Chrome; moreover, it has been deprecated and works only over https (in Chrome). Is there any other or better way to get speech input in JavaScript that works on all platforms? E.g. how do websites like web.whatsapp.com record audio? getUserMedia() prompts first-time users to permit audio recording, whereas the WhatsApp application doesn't require the user's permission. The getUserMedia() call I am currently using looks like this: navigator.getUserMedia( { "audio": { "mandatory": { "googEchoCancellation": "false",

pyspeech (python) - Transcribe mp3 files?

I'd like to transcribe mp3 (speech-to-text) using the pyspeech API. I don't know if this is possible, though. Is it? How?

pyspeech seems to be merely a Python interface to the regular Windows speech APIs. Most likely you'd create some method of treating mp3 playback as an audio source for that speech API to listen to.

Michael Levy: I don't know about pyspeech, but if it is a Python wrapper around the Microsoft speech APIs, then some other posts may be helpful. Microsoft Speech engines do not require microphone input. They can accept audio files. If you are doing transcription, you will need a

API or SDK to make speech recognition only for numbers (between 1 and 10000)?

Question: I need a specialized solution optimized to detect numbers between 1 and 1000 for use on a smartphone. The best solution would be an SDK that works offline. Any ideas? I cannot find any configuration in Google Speech or Amazon Transcribe that allows "numbers only".

Answer 1: It is not quite right to strictly expect numbers from people; they usually say many things like "I don't know" or "wait a bit" even if you ask them for numbers. You will harm the experience significantly. You have to analyze
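
One way to cope with free-form replies, sketched under assumptions, is to leave the recognizer unrestricted and filter its transcript afterwards. The sketch assumes the recognizer already returns numbers as digits; spelled-out numbers ("forty-two") would need extra handling. The function name `extract_numbers` and the default range are hypothetical, with the range kept as a parameter because the question mentions both 1000 and 10000 as the upper bound.

```python
import re


def extract_numbers(transcript, low=1, high=10000):
    """Return the integers in the transcript that fall within [low, high]."""
    found = []
    # \b\d+\b matches standalone runs of digits only.
    for token in re.findall(r"\b\d+\b", transcript):
        value = int(token)
        if low <= value <= high:
            found.append(value)
    return found


numbers = extract_numbers("wait a bit, 42 I think, or 20000")
print(numbers)
```

An empty result signals that the user said something other than a number, which the application can answer with a follow-up prompt instead of an error.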