voice-recognition

Why did RecognitionListener stop working in JellyBean?

℡╲_俬逩灬. 提交于 2019-12-03 05:09:13
问题 For everyone using Android's voice recognition API, there used to be a handy RecognitionListener you could register that would push various events to your callbacks. In particular, there was the following onBufferReceived(byte[]) method: public abstract void onBufferReceived (byte[] buffer) Since: API Level 8 More sound has been received. The purpose of this function is to allow giving feedback to the user regarding the captured audio. There is no guarantee that this method will be called.

Do Chrome extensions support WebSpeech API?

情到浓时终转凉″ 提交于 2019-12-03 05:04:15
问题 Do google Chrome extensions support Chrome's Web Speech speech recognition API? I have included some javascript to create a speech recognition object, but when I launch my extension, I am not prompted for microphone access. This is not an issue with my code. I have searched on google, but I can't find any information on whether Chrome extensions support the Web Speech API. I just want a yes/no answer. Note: I believe the WebSpeech API won't work for local files. 回答1: Update: Based on RobW's

What are language codes in Chrome's implementation of the HTML5 speech recognition API?

白昼怎懂夜的黑 提交于 2019-12-03 01:39:58
问题 Chrome implemented the HTML5 speech recognition API. Many languages are supported. I wanna know which languages are supported and each language's corresponding code which is used in the HTML element's lang attribute. For instance: Polish (pl-PL) Turkish (tr-TR) Thank you! 回答1: Ok, if it is not published, we can try to at least figure this out. Let me put this table for the beginning and we will refine it if someone has more information. I'm making assumption that supported languages shall be

Python detect keywords

核能气质少年 提交于 2019-12-03 01:27:00
I'm doing and application that do the fallowing: 1: If some noise is detected by the microphone, its starts to record audio, until no noise is detected. After it, the audio is recorded to a wav file. 2: I have to detect some words on it. There are only, 5 to 10 words to detect. So far, my code only does the first part (detect noise and record audio). Now, I have a list with the following words: help, please, yes, no, could, you, after, tomorrow . I need an offline way to detect if my sound contains these words. Is this possible? How can I do that? I'm using linux and there is no way to change

Human face, emotion and voice recognition

让人想犯罪 __ 提交于 2019-12-03 00:37:03
I am looking for a good face, emotion and voice recognition method in C# . For face recognition I was early using Emgu CV which is not accurate and performance is very low in low light conditions. Also I need to find user's emotion. Whether sad or happy like that. But I found its not easy with Emgu CV. Also for voice recognition I am not able to find any solutions yet, I found speech recognition but it is not what I need. I don't want to use any online API's. Can anybody suggest me any SDKs or Algorithms using which I a implement face, emotion and voice recognition? gliderkite Face recognition

Integrate Google Voice Recognition in Android app

断了今生、忘了曾经 提交于 2019-12-02 20:34:24
I want to introduce a new feature into my app: permanent voice recognition . First of all I followed these posts: Voice recognition Speech recognition in Android Offline Speech Recognition In Android (JellyBean) and more others, plus other posts from different websites. Problem: What actually I'm trying to do is to have a permanent voice recognition without displaying google's voice activity. For example: When I start the application the voice recognition should start and listen. When the recognizer matches some words then my app will do different actions accordingly. I do not like to press a

How to install VOCE for C++?

时光总嘲笑我的痴心妄想 提交于 2019-12-02 20:31:40
问题 I am trying to install the VOCE api, which is a voice recognition API built for both C++ and Java. This is the second time I am working with external C++ libs, and first time working with Java + C++ api. Voce link - http://voce.sourceforge.net/ In this API there is really no .lib files, but only .jar files! I read the whole "read me" file but no good clue at how to set this up for C++. I am using QT, the latest version which works with Visual Studio 2010 compiler. How can I set this up?

How to Recognise when user START & STOP speaking in android? (Voice Recognition in Android)

拟墨画扇 提交于 2019-12-02 19:51:41
I have done a lot of R&D and gone through a lot of resources to resolve my problem but I have FAILED to get any proper solution. I have developed an app, now i want to add Voice based functionality to it. The required features are 1) when USER starts speaking, it should record the audio/video and 2) when user stops speaking, it should play the recorded audio/video . Note :Here video means whatever user performs within app during that period of time. For example, clicks on the buttons or some kind of animation, etc. I don't want to use Google's Voice Recognizer available by default in the

Why did RecognitionListener stop working in JellyBean?

泄露秘密 提交于 2019-12-02 18:25:39
For everyone using Android's voice recognition API , there used to be a handy RecognitionListener you could register that would push various events to your callbacks. In particular, there was the following onBufferReceived(byte[]) method: public abstract void onBufferReceived (byte[] buffer) Since: API Level 8 More sound has been received. The purpose of this function is to allow giving feedback to the user regarding the captured audio. There is no guarantee that this method will be called. Parameters buffer a buffer containing a sequence of big-endian 16-bit integers representing a single

How to track rate of speech

前提是你 提交于 2019-12-02 18:13:47
问题 I am developing an iPhone app that tracks rate of speech, and hoping to use Nuance Speechkit (https://developer.nuance.com/public/Help/DragonMobileSDKReference_iOS/SpeechKit_Guide/Basics.html) Is there a way to track rate of speech (e.g., updating WPM every few seconds) with the framework? Right now it seems to just do speech-to-text at the end of a long utterance, as opposed to every word or so (i.e., return partial results). 回答1: There are easier ways, for example you can use CMUSphinx with