speech-recognition

Delphi and SAPI

[亡魂溺海] submitted on 2019-12-01 10:06:07
Question: What resources can I learn from to use SAPI from Delphi? How do I use speech recognition from Delphi applications? Answer 1: "Speech Synthesis & Speech Recognition: Using SAPI 5.1", from the makers of Delphi, sounds like a good place to begin. Source: https://stackoverflow.com/questions/1021490/delphi-and-sapi

Is it possible to use Windows Speech Recognition Engine in a word pronunciation game?

孤人 submitted on 2019-12-01 09:28:17
I am trying to create an application that uses the Windows speech recognition engine (SAPI). It is a pronunciation game that gives you a score when you pronounce a word correctly. When I started experimenting with SAPI, recognition was poor unless I loaded an XML grammar, which gives the best recognition results. The problem now is that the closest pronunciation to the input text is recognized. For example: Database -> dedebase -> correct. Even if you mispronounce the word, it is accepted as correct. Without the XML grammar, when you say database it gives you "in the base/the base

Microsoft.Speech.Synthesis does not work for Text To Speech BUT System.Speech.Synthesis works. Why?

你离开我真会死。 submitted on 2019-12-01 09:27:48
I am just trying to run a simple Microsoft example for text-to-speech using Microsoft.Speech.dll:

    using System;
    using Microsoft.Speech.Synthesis;

    namespace TTS
    {
        class Program
        {
            public static void Main(string[] args)
            {
                Console.WriteLine("Testing TTS!");

                // Initialize a new instance of the SpeechSynthesizer.
                using (SpeechSynthesizer synth = new SpeechSynthesizer())
                {
                    // Output information about all of the installed voices.
                    Console.WriteLine("Installed voices -");
                    foreach (InstalledVoice voice in synth.GetInstalledVoices())
                    {
                        VoiceInfo info = voice.VoiceInfo;
                        Console.WriteLine(" Voice Name: " +

simple speech recognition methods

╄→尐↘猪︶ㄣ submitted on 2019-12-01 08:58:40
Yes, I'm aware that speech recognition is fairly complicated (to put it mildly). What I'm looking for is a method for distinguishing between maybe 20-30 phrases. An ability to split words (discrete speech is fine) would be nice, but isn't required. The software will be user-dependent (i.e. for use by me). I'm not looking for existing software, but for a good way of going about doing this myself. I've looked into various existing methods and it seems like splitting the sound into phonemes, while common, is somewhat excessive for my needs. For some context, I'm just looking for a way to
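One simple approach that fits a small, speaker-dependent vocabulary is template matching with dynamic time warping (DTW): keep one recorded example per phrase as a sequence of feature vectors, then pick the template with the smallest warped distance to the new utterance. The question does not name this technique, so treat the following as a minimal sketch rather than the asker's method. The names `dtw_distance` and `classify` and the toy one-dimensional features are illustrative assumptions; a real system would use per-frame audio features such as MFCCs.

```python
import math


def dtw_distance(a, b):
    """Dynamic time warping distance between two sequences of feature vectors."""
    n, m = len(a), len(b)
    inf = float("inf")
    # cost[i][j] holds the best cumulative cost of aligning a[:i] with b[:j].
    cost = [[inf] * (m + 1) for _ in range(n + 1)]
    cost[0][0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            d = math.dist(a[i - 1], b[j - 1])  # Euclidean distance between frames
            # Extend the cheapest of the three allowed predecessor alignments.
            cost[i][j] = d + min(cost[i - 1][j], cost[i][j - 1], cost[i - 1][j - 1])
    return cost[n][m]


def classify(utterance, templates):
    """Return the label of the template closest to the utterance under DTW."""
    return min(templates, key=lambda label: dtw_distance(utterance, templates[label]))


# Toy data (hypothetical): each frame is a 1-dimensional feature vector.
templates = {
    "yes": [(0.0,), (1.0,), (2.0,), (1.0,)],
    "no": [(2.0,), (2.0,), (0.0,), (0.0,)],
}
# A slower rendition of "yes": same shape, stretched in time.
utterance = [(0.0,), (0.0,), (1.0,), (2.0,), (2.0,), (1.0,)]
best = classify(utterance, templates)
```

DTW tolerates differences in speaking rate because it may repeat frames on either side of the alignment, which is why the stretched utterance still matches its template. The cost is quadratic in utterance length per template, which is usually acceptable for 20-30 short phrases.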

Why does PocketSphinx Android demo require cygwin

僤鯓⒐⒋嵵緔 submitted on 2019-12-01 08:27:02
Question: I am trying to develop a speech recognition program for Android, and so far I have seen that the most popular option seems to be CMU Sphinx, with PocketSphinx being used for Android. From what I have understood, the source code for CMU Sphinx is provided in Java for running on a computer, so it will not run directly on an Android phone. So I will first have to compile the CMU Sphinx code into a sort of library (something like creating a DLL out of code I need to share if I were working in C),

Android Speech Recognition Service much slower on <= Ice Cream Sandwich

我与影子孤独终老i submitted on 2019-12-01 08:04:32
Question: I have a service that implements RecognitionListener (like this: Android Speech Recognition as a service on Android 4.1 & 4.2), where I have to override the onBufferReceived(byte[]) method. I was investigating why speech recognition is much slower to call onResults() on <= ICS. The only difference I could find was that onBufferReceived is called on phones running <= ICS. On Jelly Bean, onBufferReceived() is never called and onResults() is called significantly faster, and I'm thinking it's because

Is it possible to use Windows Speech Recognition Engine in a word pronunciation game?

夙愿已清 submitted on 2019-12-01 05:45:54
Question: I am trying to create an application that uses the Windows speech recognition engine (SAPI). It is a pronunciation game that gives you a score when you pronounce a word correctly. When I started experimenting with SAPI, recognition was poor unless I loaded an XML grammar, which gives the best recognition results. The problem now is that the closest pronunciation to the input text is recognized. For example: Database -> dedebase -> correct. Even if you mispronounce the word, it gives you

I can't find System.Speech

做~自己de王妃 submitted on 2019-12-01 04:24:42
Question: I am working on a project that uses the Windows speech recognition libraries, and I am trying to add a reference to System.Speech since I am using: SpeechSynthesizer synth = new SpeechSynthesizer(); However, I can't find System.Speech in the .NET tab when I try to add a reference. I updated my .NET Framework and still can't find it. Is there a way to simply add it? Answer 1: Check your project type, and the version of the framework it's targeting. SpeechSynthesizer exists in .NET 3.0, 3.5 and 4 -

Speech recognition with Flash or Silverlight

╄→гoц情女王★ submitted on 2019-12-01 02:33:45
I'm developing a web user interface for entering some information that is not very complex but needs to be loaded in real time. I think the application could make use of speech recognition to make the task easier. The core of the interface is being built with JavaScript and jQuery, but it can easily include a Flash or Silverlight component. I believe that's probably the way to go... I don't need to recognize everything the user says, only a few prerecorded commands. Also, I don't want the user to have to click a button to mark the beginning and the end of the spoken command. It should be

C# Speech Recognition from System Audio (Speaker Sound)

泪湿孤枕 submitted on 2019-12-01 01:58:02
I've seen speech recognition from input devices (obviously) and I've seen speech recognition from files ( http://gotspeech.net/forums/thread/6835.aspx ). However, I was wondering whether it would be possible to run speech recognition on system audio in real time. By system audio, I mean the sound that comes out of your speakers. It would be a great tool for those who are hard of hearing: as they watch YouTube videos, the C# application could transcribe what's being said. How could I go about doing this? Very easily - Go to the sound mixer, choose input and enable/unmute "Stereo Mix". You