speech-recognition

C# Speech Recognition from System Audio (Speaker Sound)

若如初见. posted on 2019-11-30 21:51:32
Question: I've seen speech recognition from input devices (obviously) and I've seen speech recognition from files (http://gotspeech.net/forums/thread/6835.aspx). However, I was wondering whether it would be possible to run speech recognition on system audio in real time. By system audio, I mean the sound that comes out of your speakers. It would be a great tool for those who are hard of hearing: while they watch YouTube videos, the C# application could transcribe what's being said. How could I go about

Vista Speech Recognition in Delphi

不打扰是莪最后的温柔 posted on 2019-11-30 21:08:43
I would like to be able to dictate into my Delphi application using Microsoft Vista's speech recognition. However, when I attempt to dictate into a TMemo, it simply does not work. I noticed that Firefox 3.0 has the same issue; after its developers contacted Microsoft about the problem, they were told that they need to implement the Text Services Framework in their application. I am wondering whether there is any way to implement this in Delphi so that I can dictate into a TMemo or a TRichEdit. Searching Google for a solution didn't return any relevant results. Where should I start looking for a solution?

UWP suitable project solution

荒凉一梦 posted on 2019-11-30 20:52:10
Question: I want to rewrite a C# WinForms desktop application for the Universal Windows Platform, but first I'm trying to figure out what would suit my goal. The reason I want to use UWP is the quality of its speech recognition (it might also work on other devices, but that is currently very secondary). Here is my previous question, Speech recognition for windows desktop application, where I was advised to use UWP with Speech recognition Quickstart: Recognize speech with the Speech SDK for .NET Framework

Speech to text sdk freezes after video playback

蹲街弑〆低调 posted on 2019-11-30 20:37:54
Question: I'm using the speech-to-text SDK provided by https://github.com/todoroo/iPhone-Speech-To-Text. The recognizer works just fine until the moment I play back a video using MPMoviePlayerController. Here is the code I'm using to call the recognizer: - (IBAction)actionBtRecognition:(id)sender { if (recognizer == nil) { recognizer = [[SpeechToTextModule alloc] init]; } [recognizer beginRecording]; } To play back the movie I used this tutorial. So, once I play back a movie and call the recognizer, it just freezes. When I

Offline google voice recognition on android

时光毁灭记忆、已成空白 posted on 2019-11-30 19:47:08
I am working on an Android application that will listen for voice commands and trigger actions accordingly. I have some questions: Is Google's offline voice recognition library available for use in Android apps? Can we create our own dictionary of words and commands with a wake-up word? The application should work in offline mode (without Internet). Christopher: Is Google's offline voice recognition library available for use in Android apps? I think this question is already answered here: Offline Speech Recognition In Android (JellyBean). Can we create our own dictionary of words & commands with wake up

Microsoft.Speech SpeechRecognitionEngine.SetInputToDefaultAudioDevice() method throwing System.InvalidOperationException

◇◆丶佛笑我妖孽 posted on 2019-11-30 18:52:10
Question: I'm using Microsoft.Speech (NOT System.Speech!) and I'm trying to set the input of the audio stream to my microphone, which is the default audio device, but it throws this exception: 'System.InvalidOperationException' occurred in Microsoft.Speech.dll but was not handled in user code. Additional information: Cannot find the requested data item, such as a data key or value. This was just a test; ideally I want to set the audio stream to my speakers, but if I can get this to work I can just

'SAPI does not implement phonetic alphabet selection' exception

◇◆丶佛笑我妖孽 posted on 2019-11-30 18:15:41
Question: Whenever I attempt to write any speech recognition program on my laptop, I always get the same messages, as mentioned below. I can always compile my code and get the Windows Forms application running, but the program does not detect my voice, so it doesn't work. I am quite sure my code is fine, as I usually take it from YouTube videos such as https://www.youtube.com/watch?v=KR0-UYUGYgA and many more. I am using the .NET Framework 4 Client Profile for my projects. I

RecognizerIntent.ACTION_GET_LANGUAGE_DETAILS in Oreo

若如初见. posted on 2019-11-30 18:08:27
On most Android devices, the RecognitionService is supplied by Google's native 'Now/Assistant' application. Up until Android Oreo, I was able to query the languages supported by the Google Recognizer with the following simple code: final Intent vrIntent = new Intent(RecognizerIntent.ACTION_GET_LANGUAGE_DETAILS); // vrIntent.setPackage("com.google.android.googlequicksearchbox"); getContext().sendOrderedBroadcast(vrIntent, null, new BroadcastReceiver() { @Override public void onReceive(final Context context, final Intent intent) { // final Bundle bundle = intent.getExtras(); final Bundle

AVAudioEngine inputNode installTap crash when restarting recording

一个人想着一个人 posted on 2019-11-30 17:56:30
I am implementing speech recognition in my app. When I first present the view controller with the speech recognition logic, everything works fine. However, when I try to present the view controller again, I get the following crash: ERROR: [0x190bf000] >avae> AVAudioNode.mm:568: CreateRecordingTap: required condition is false: IsFormatSampleRateAndChannelCountValid(format) *** Terminating app due to uncaught exception 'com.apple.coreaudio.avfaudio', reason: 'required condition is false: IsFormatSampleRateAndChannelCountValid(format)' Here is the code used for starting and stopping recording:

Is there any way to send audio file to the speech-to-text recognition

五迷三道 posted on 2019-11-30 17:26:10
I want the Android speech recognition system to analyse an audio file rather than the default incoming voice from the microphone. Is there any way to do that? Thank you. I suppose it works in a similar way to the Chrome API - http://mikepultz.com/2011/03/accessing-google-speech-api-chrome-11/ As he mentions, you can convert the microphone recording into a .flac file and send it to the speech API, and you will get the same result. So you can use SoX and convert it yourself. Hope it helps. Dias: cmusphinx.sourceforge.net/wiki/tutorialandroid Just found that link; sounds like someone has created an Android
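
The answer in this entry suggests converting the recording to a .flac file with SoX before sending it to a speech API. As a rough sketch of the conversion step only, the Python helper below builds and runs a SoX command line. The function names are hypothetical, and the 16 kHz mono defaults are an assumption rather than something stated in the thread; check what your speech service actually expects. The upload step is left out because the thread does not specify the endpoint.

```python
import shutil
import subprocess


def build_sox_flac_command(src, dst, sample_rate=16000, channels=1):
    """Return the argv list for a SoX call that converts `src` to FLAC.

    SoX infers the output format from the `.flac` extension; `-r` and `-c`
    placed before the output file set its sample rate and channel count.
    The 16 kHz mono defaults are an assumption, not taken from the thread.
    """
    if not dst.lower().endswith(".flac"):
        raise ValueError("destination must end in .flac")
    return ["sox", src, "-r", str(sample_rate), "-c", str(channels), dst]


def convert_to_flac(src, dst, **kwargs):
    """Run the conversion; raises if the sox binary is not installed."""
    if shutil.which("sox") is None:
        raise RuntimeError("sox executable not found on PATH")
    subprocess.run(build_sox_flac_command(src, dst, **kwargs), check=True)


if __name__ == "__main__":
    # Only prints the command; call convert_to_flac() to actually run SoX.
    print(build_sox_flac_command("recording.wav", "recording.flac"))
```

Keeping the command construction separate from the subprocess call makes it possible to inspect or log the exact SoX invocation without having SoX installed.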