speech-recognition | 易学教程

Stop speech recognition when user is finished speaking

阅读更多关于 Stop speech recognition when user is finished speaking

问题 How is Siri is able to determine when I'm finished speaking. The reason I would like to know is that I would like to implement similar functionality with Apple's Speech Recognition API with my app. Is this doable, or is the only way to know when the user has stopped speaking is via user input? 回答1: You can use a timer, i had the same problem and I could not solve it with an elegant method. fileprivate var timer:Timer? func startRecordingTimer() { lastString = "" createTimerTimer(4) } func

What's a good open source VoiceXML implementation?

阅读更多关于 What's a good open source VoiceXML implementation?

I am trying to find out if it's possible to build a complete IVR application by cobbling together parts from open source projects. Is anyone using a non-commercial VoiceXML implementation to build speech-enabled systems? I've tried JVoiceXML in the past and had some luck with it. http://jvoicexml.sourceforge.net/ It's java of course, but that wasn't a problem for my situation. Voiceglue ( http://www.voiceglue.org/ ) is an implementation of voicexml using openvxi and asterisk. It may be a good option for you, it is GPL licensed. matt b You might want to take a look at OpenVXI, I believe that a

How to hide toast message “Your audio will be sent to google to provide speech recognition service.” in Android?

阅读更多关于 How to hide toast message “Your audio will be sent to google to provide speech recognition service.” in Android?

问题 I am using google speech recognizer for integrating voice services in Android but while pressing on mic button this annoying toast message is showing. Please suggest me a way to hide this toast message. This is my java code public class FormActivity extends AppCompatActivity { AppCompatEditText mFeedbackView; ImageView mFeedbackVoiceView; private final int REQ_CODE_SPEECH_INPUT_FEEDBACK = 100; @Override protected void onCreate(Bundle savedInstanceState) { super.onCreate(savedInstanceState);

Open-source tool for gender-recognition using voice [closed]

阅读更多关于 Open-source tool for gender-recognition using voice [closed]

Closed. This question is off-topic. It is not currently accepting answers. Learn more . Want to improve this question? Update the question so it's on-topic for Stack Overflow. Good evening, I'm working on a project with Kinect and I need to perform a gender-recognition using the voice information caught by the microphone of the sensor. The recognition could also be text-based , i.e. the sentence said by the user could always be the same. I "just" need the binary answer male/female. I've checked the quite popular open-source Sphinx but I can't understand if it could be used for my needs. Any

Burmese speech to text conversion in android?

阅读更多关于 Burmese speech to text conversion in android?

Can we add custom language for RecognizerIntent? I have search many SO Question like https://stackoverflow.com/questions/2080401/is-there-a-speech-to-text-api-by-google That solve my problem of using limited number of language during Speech to Text conversion. My problem is that, I need to used Burmese(Local Language of Burma) Speech and convert it to text. any other help can be appreciated. UPDATE : Google's servers currently support English, Mandarin Chinese, and Japanese. Speech Input API for android Google will unlikely support small languages any time soon. The only way to custom language

iOS / C: Algorithm to detect phonemes

阅读更多关于 iOS / C: Algorithm to detect phonemes

问题 I am searching for an algorithm to determine whether realtime audio input matches one of 144 given (and comfortably distinct) phoneme-pairs. Preferably the lowest level that does the job. I'm developing radical / experimental musical training software for iPhone / iPad. My musical system comprises 12 consonant phonemes and 12 vowel phonemes, demonstrated here. That makes 144 possible phoneme pairs. The student has to sing the correct phoneme pair 'laa duu bee' etc in response to visual

Java voice recognition

阅读更多关于 Java voice recognition

问题 Is there Anyone that has experience with any open source, or relatively cheap voice recognition API for java? I'm pretty much looking for something that will turn spoken words into text. From the java speech recognition page on sun, it seems that it is something that is rather dead. My requirements is something that at the least runs on linux. Can anyone recommend something? Pure java would be a bonus, else a linux based solution could be considered. And since this is a home project... the

Open Source Software For Transcribing Speech in Audio Files

阅读更多关于 Open Source Software For Transcribing Speech in Audio Files

问题 Can anyone recommend reliable open source software for transcribing English speech in wav files? The two main programs I've researched are Sphinx and Julius, but I've never been able to get either to work, and the documentation with each on transcribing files is sketchy at best. I'm developing on 64-bit Ubuntu 10.04, whose repos include sphinx2 and julius, as well as voxforge's julius acoustic modal for English. I'm focussing on transcribing files, instead of directly processing sound from a

How to convert human voice into digital format?

阅读更多关于 How to convert human voice into digital format?

I am working on a project where biometric system is used to secure the system. We are planning to use human voice to secure the system. Idea is to allow the person to say some words or sentences and system will store that voice in digital format. Next time person wants to enter the system, he/she has to speak some words which may or may not be different from the words used earlier. We don't want to match words but want to match voice frequency. I have read some research papers regarding this system but those papers don't have any implementation details. So just want to know whether there is

How to translate live streaming using google speech api?

阅读更多关于 How to translate live streaming using google speech api?

this is the google speech API docs: https://cloud.google.com/speech/docs/sync-recognize i try this API for 2 weeks. but still cant solved my main purpose (translate live streaming). I'm using PHP. (other languange suggestion is allowed, i will find by myself) What i can do in my 2 weeks: Synchronous Speech Recognition (<=1min) Asynchronous Speech Recognition (>1min and <=80min). Note : i can modify this to accept 3hours video. Live speech recognition from mic : https://www.google.com/intl/en/chrome/demos/speech.html UPDATE: Perform streaming API with audio less than 6sec duration. What can't i