voice-recognition | 易学教程

Why did RecognitionListener stop working in JellyBean?

Question: For everyone using Android's voice recognition API, there used to be a handy RecognitionListener you could register that would push various events to your callbacks. In particular, there was the following onBufferReceived(byte[]) method: public abstract void onBufferReceived (byte[] buffer). Since: API Level 8. More sound has been received. The purpose of this function is to allow giving feedback to the user regarding the captured audio. There is no guarantee that this method will be called.

Do Chrome extensions support WebSpeech API?

Question: Do Google Chrome extensions support Chrome's Web Speech speech recognition API? I have included some JavaScript to create a speech recognition object, but when I launch my extension, I am not prompted for microphone access. This is not an issue with my code. I have searched on Google, but I can't find any information on whether Chrome extensions support the Web Speech API. I just want a yes/no answer. Note: I believe the Web Speech API won't work for local files. Answer 1: Update: Based on RobW's

What are language codes in Chrome's implementation of the HTML5 speech recognition API?

Question: Chrome implemented the HTML5 speech recognition API, and many languages are supported. I want to know which languages are supported and the code for each language, as used in the HTML element's lang attribute. For instance: Polish (pl-PL), Turkish (tr-TR). Thank you! Answer 1: OK, if it is not published, we can at least try to figure this out. Let me put this table up as a starting point, and we will refine it if someone has more information. I'm making the assumption that supported languages shall be

Python detect keywords

I'm building an application that does the following: 1: If the microphone detects some noise, it starts recording audio until no more noise is detected. After that, the audio is saved to a WAV file. 2: I have to detect some words in it. There are only 5 to 10 words to detect. So far, my code only does the first part (detect noise and record audio). Now I have a list with the following words: help, please, yes, no, could, you, after, tomorrow. I need an offline way to detect whether my sound contains these words. Is this possible? How can I do that? I'm using Linux and there is no way to change
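The first step described above (start recording when noise is detected, stop when it goes quiet) can be sketched as an energy gate over fixed-size frames of PCM samples. This is only an illustration, not the asker's code: the function names, the threshold of 500, and the three-frame silence tolerance are all assumptions, and the word-detection step is left out because it depends on whichever offline recognizer is chosen.

```python
import math
from typing import Iterable, List, Sequence, Tuple


def frame_rms(frame: Sequence[int]) -> float:
    """Root-mean-square amplitude of one frame of PCM samples."""
    if not frame:
        return 0.0
    return math.sqrt(sum(s * s for s in frame) / len(frame))


def find_sound_segments(
    frames: Iterable[Sequence[int]],
    threshold: float = 500.0,    # hypothetical value; tune for the microphone
    max_silent_frames: int = 3,  # quiet frames tolerated before a segment ends
) -> List[Tuple[int, int]]:
    """Return (start, end) frame indices, end exclusive, of noisy stretches."""
    segments: List[Tuple[int, int]] = []
    start = None      # index of the first loud frame of the open segment
    silent_run = 0    # consecutive quiet frames seen inside the open segment
    index = -1
    for index, frame in enumerate(frames):
        loud = frame_rms(frame) >= threshold
        if start is None:
            if loud:
                start = index
                silent_run = 0
        elif loud:
            silent_run = 0
        else:
            silent_run += 1
            if silent_run >= max_silent_frames:
                # Close the segment just after its last loud frame.
                segments.append((start, index - silent_run + 1))
                start = None
    if start is not None:
        # Input ended while a segment was still open.
        segments.append((start, index + 1 - silent_run))
    return segments
```

Each returned segment could then be written to a WAV file and passed to a keyword spotter. Tolerating a few quiet frames keeps short pauses between words from splitting one utterance into several recordings.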

Human face, emotion and voice recognition

I am looking for a good face, emotion and voice recognition method in C#. For face recognition I was previously using Emgu CV, which is not accurate and performs very poorly in low light. I also need to detect the user's emotion, such as sad or happy, but I found that is not easy with Emgu CV. For voice recognition I have not found any solution yet; I found speech recognition, but it is not what I need. I don't want to use any online APIs. Can anybody suggest SDKs or algorithms with which I can implement face, emotion and voice recognition? gliderkite Face recognition

Integrate Google Voice Recognition in Android app

I want to introduce a new feature into my app: permanent voice recognition. First of all I followed these posts: "Voice recognition", "Speech recognition in Android", "Offline Speech Recognition In Android (JellyBean)" and several others, plus posts from other websites. Problem: What I'm actually trying to do is have permanent voice recognition without displaying Google's voice activity. For example: when I start the application, the voice recognition should start and listen. When the recognizer matches certain words, my app will perform different actions accordingly. I do not like to press a

How to install VOCE for C++?

Question: I am trying to install the VOCE API, which is a voice recognition API built for both C++ and Java. This is the second time I am working with external C++ libraries, and the first time working with a Java + C++ API. Voce link: http://voce.sourceforge.net/ This API ships no .lib files at all, only .jar files! I read the whole "read me" file but found no good clue on how to set this up for C++. I am using Qt, the latest version that works with the Visual Studio 2010 compiler. How can I set this up?

How to Recognise when user START & STOP speaking in android? (Voice Recognition in Android)

I have done a lot of R&D and gone through a lot of resources to resolve my problem, but I have FAILED to find a proper solution. I have developed an app, and now I want to add voice-based functionality to it. The required features are: 1) when the USER starts speaking, it should record the audio/video, and 2) when the user stops speaking, it should play the recorded audio/video. Note: Here video means whatever the user does within the app during that period of time, for example clicking buttons or some kind of animation. I don't want to use Google's Voice Recognizer available by default in the

Why did RecognitionListener stop working in JellyBean?

For everyone using Android's voice recognition API, there used to be a handy RecognitionListener you could register that would push various events to your callbacks. In particular, there was the following onBufferReceived(byte[]) method: public abstract void onBufferReceived (byte[] buffer). Since: API Level 8. More sound has been received. The purpose of this function is to allow giving feedback to the user regarding the captured audio. There is no guarantee that this method will be called. Parameters: buffer: a buffer containing a sequence of big-endian 16-bit integers representing a single
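To make the buffer layout quoted above concrete, here is a minimal Python sketch that decodes a byte buffer of big-endian 16-bit integers and turns it into a 0.0 to 1.0 level that could drive a volume indicator. It is not Android code. The function names are hypothetical, and treating the integers as signed samples is an assumption, since the quoted documentation is cut off before it says more.

```python
import math
import struct
from typing import List, Sequence


def decode_big_endian_pcm16(buffer: bytes) -> List[int]:
    """Unpack a byte buffer into 16-bit integers, big-endian byte order.

    Assumption: the samples are signed; a trailing odd byte is ignored.
    """
    count = len(buffer) // 2
    return list(struct.unpack(f">{count}h", buffer[: count * 2]))


def audio_level(samples: Sequence[int]) -> float:
    """RMS amplitude scaled to the range 0.0..1.0 for user feedback."""
    if not samples:
        return 0.0
    rms = math.sqrt(sum(s * s for s in samples) / len(samples))
    return min(rms / 32768.0, 1.0)


# Usage with a hand-built buffer holding the samples 1, -1 and 32767.
samples = decode_big_endian_pcm16(b"\x00\x01\xff\xff\x7f\xff")
level = audio_level(samples)
```

Getting the byte order wrong (for example reading the same bytes as little-endian) would turn quiet audio into what looks like loud noise, which is why the format string starts with `>`.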

How to track rate of speech

Question: I am developing an iPhone app that tracks rate of speech, and I am hoping to use Nuance SpeechKit (https://developer.nuance.com/public/Help/DragonMobileSDKReference_iOS/SpeechKit_Guide/Basics.html). Is there a way to track rate of speech (e.g., updating WPM every few seconds) with the framework? Right now it seems to just do speech-to-text at the end of a long utterance, as opposed to every word or so (i.e., returning partial results). Answer 1: There are easier ways, for example you can use CMUSphinx with
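Whichever recognizer is used, the "updating WPM every few seconds" part of the question reduces to counting words in a sliding time window. The sketch below assumes the recognizer can report a timestamp for each word as it arrives. The class name, the method names, and the 10-second window are hypothetical, and it does not reproduce the truncated answer's approach.

```python
from collections import deque
from typing import Deque


class SpeechRateTracker:
    """Words per minute over a trailing window of recognized words."""

    def __init__(self, window_seconds: float = 10.0) -> None:
        if window_seconds <= 0:
            raise ValueError("window_seconds must be positive")
        self.window_seconds = window_seconds
        self._times: Deque[float] = deque()  # timestamps of recent words

    def _evict(self, now: float) -> None:
        # Drop words that fell out of the trailing window.
        while self._times and self._times[0] <= now - self.window_seconds:
            self._times.popleft()

    def add_word(self, timestamp: float) -> None:
        """Record one recognized word; timestamps must be non-decreasing."""
        self._times.append(timestamp)
        self._evict(timestamp)

    def words_per_minute(self, now: float) -> float:
        """Scale the word count in the window up to a per-minute rate."""
        self._evict(now)
        return len(self._times) * 60.0 / self.window_seconds


# Usage: five words in the first 2.5 seconds of a 10-second window.
tracker = SpeechRateTracker(window_seconds=10.0)
for t in (0.5, 1.0, 1.5, 2.0, 2.5):
    tracker.add_word(t)
rate = tracker.words_per_minute(now=3.0)
```

Dividing by the full window length means the rate reads low until the first window has elapsed. A shorter window reacts faster but fluctuates more.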