speech-recognition | 易学教程

free speech recognition engines for iOS?

阅读更多关于 free speech recognition engines for iOS?

I am looking for some free speech recognition engines to use in my iphone application... can you suggest any? There are a couple of wrappers of the Sphinx speech recognition engine (http://cmusphinx.sourceforge.net/) available for iOS https://github.com/KingOfBrian/VocalKit Through reading his summary you can see he is actually pointing people towards http://www.politepix.com/openears Nuance just opened the doors for developers to the Dragon Mobile SDK (they are industry-leaders). have a look at NDEV Mobile 来源： https://stackoverflow.com/questions/7596816/free-speech-recognition-engines-for-ios

Can I use OS X 10.8's speech recognition/dictation without a GUI?

阅读更多关于 Can I use OS X 10.8's speech recognition/dictation without a GUI?

Ideally I would like to use 10.8's/Mountain Lion's dictation feature from Python, without a GUI. As far as I can tell dictation is currently only supported by using the UITextInput protocol , which implies a GUI (correct?). You can create a dummy text field and programmatically start dictating by emulating the keypress (tried out successfully here) 来源： https://stackoverflow.com/questions/11851713/can-i-use-os-x-10-8s-speech-recognition-dictation-without-a-gui

Voice Recognition in PHP?

阅读更多关于 Voice Recognition in PHP?

I am looking for Voice Recognition in PHP. I have a lot of wav files (10-15 seconds) and I would like Voice Recognition to recognise 1 or 2 words from each wav file and then flag it to database or csv. All the files have the same voice and accent (same person lol) Example pseudocode: $voice = new play('file.wav'); $result = $voice->recognise("Good Morning"); if ($result) { echo "Matched Good Morning"; //flag to database or csv } else { echo "No match found"; } Which PHP Voice Recognition library can do this? PHP doesn't have this functionality built in but there are APIs available for this.

Speech recognition response is poor in sphinx4

阅读更多关于 Speech recognition response is poor in sphinx4

Currently we are investigating into using sphinx4 for speech recognition. We are trying to achieve a good response for a dictation type application. The input is a wav file and we wish to transcribe it. I have looked into the LatticeDemo and Transcriber demo provided by Sphinx4. When i utilize the same configuration , the response is pretty poor. I have tried to tweak in the configuration files but it simply does not recognize the words. the transcriber demo provided is for digits, i have modified the config file to understand words. But i am not sure if i am missing something. I have attached

Speech recognition framework for iOS that supports Spanish [closed]

阅读更多关于 Speech recognition framework for iOS that supports Spanish [closed]

Is there a speech to text framework for iOS that supports Spanish out of the box? Commercial or OS is ok. There are a bunch of commercial IOS librariers for speech recognition. The names I keep hearing are Nuance, iSpeech, and Yapme. Each offers cloud speech recognition (off the device) and a client library and SDK to build into your app. Nuance seems to support Spanish - http://blog.dragonmobileapps.com/2011/01/mobile-app-developer-dragon-mobile-sdk.html ...you can speech-enable your app for including US and UK English, European Spanish, European French, German, Italian and Japanese---with

Listening for keywords at all times, like “Ok google” on 4.4 [closed]

阅读更多关于 Listening for keywords at all times, like “Ok google” on 4.4 [closed]

I've checked the new Kitkat libraries to make sure that I'm not missing something. I want my application to always listen to a keyword before performing an action, just like how Google Now always listens for the keywords "Ok google" on the homescreen. How did Google accomplish this? What you need is called keyword spotting . Apart from online speech API Android has offline recognizer which is a bit more sophisticated. It is not mentioned in official documentation, but you can see the sources here , for example. I'm not sure you will find keyword spotting there, but at least you will have

Google Speech Recognition API

阅读更多关于 Google Speech Recognition API

I'm trying to use the Google Speech API v2 (at address https://www.google.com/speech-api/v2/recognize?... ) I need to use my Api Key, but when I use it I get error 403 Forbidden When I use an API key that was on the example project I downloaded it is working fine. I saw that at the Google Developers Console I can enable a lot of api options, but didn't find any Speech-API option. Is there anything else I need to enable to get access to this API using my key? Thank you! Instructions are here : http://www.chromium.org/developers/how-tos/api-keys !! Do not forget to activate the API "Speech API"

Speaker Recognition [closed]

阅读更多关于 Speaker Recognition [closed]

How could I differentiate between two people speaking? As in if someone says "hello" and then another person says "hello" what kind of signature should I be looking for in the audio data? periodicity? Thanks a lot to anyone who can answer this! The solution to this problem lies in Digital Signal Processing (DSP). Speaker recognition is a complex problem which brings computers and communication engineering to work hand in hand. Most techniques of speaker identification require signal processing with machine learning (training over the speaker database and then identification using training data

Microsoft Speech Recognition Platform

阅读更多关于 Microsoft Speech Recognition Platform

I wrote an app in C# for speech recognition using System.Speech which works fine on Windows 7. However I'm after creating the same app that will work on windows 2003 (x86). My programming environment: Windows 7 x64 Pro Visual Studio 2008 In order to develop this application in my programming environment I installed: 1.Microsoft Speech Platform - Server Runtime (Version 10.1) (x86) http://www.microsoft.com/downloads/details.aspx?FamilyID=674356C4-E742-4855-B3CC-FC4D5522C449&displaylang=en&displaylang=en 2.Microsoft Speech Platform - Software Development Kit (SDK) (Version 10.1) (x86) http://www

Voice to text on Android in offline mode

阅读更多关于 Voice to text on Android in offline mode

Is there anyway in which I can use the Voice to Text feature of Android in offline mode. In the given example VoiceRecognition.java, it starts and activity with the intent RecognizerIntent.ACTION_RECOGNIZE_SPEECH. Does it mean that any other apk needs to be installed before hand for this to work or do I need to write my own application to launch on this intent. I have been searching for this for a long time but is getting confused... Here is the code I used.. private static final int VOICE_RECOGNITION_REQUEST_CODE = 1234; private ListView mList; /** * Called with the activity is first created.