speech-recognition | 易学教程

How to Reduce Noise in Sphinx4 application

I am new to Sphinx4 and to speech recognition, and I am building a speech application with Sphinx4. The problem is noise: the program produces recognition results even when the user says nothing, i.e. it converts "speech" to text when I don't speak into the microphone at all, which hurts accuracy. My main question is how to implement noise reduction. I searched online for noise reduction but found hardly any clear information on it. Though there is some information
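One common way to stop a recognizer from reacting to background noise is to drop low-energy audio frames before they reach it. The sketch below is not Sphinx4 code and not what the asker used; it is a minimal Python illustration of an energy gate, assuming 16-bit little-endian mono PCM frames. The function names and the threshold of 500 are hypothetical and would need tuning for a real microphone.

```python
import math
import struct


def rms(frame: bytes) -> float:
    """Root-mean-square amplitude of a frame of 16-bit little-endian mono PCM."""
    count = len(frame) // 2
    if count == 0:
        return 0.0
    samples = struct.unpack("<%dh" % count, frame[: count * 2])
    return math.sqrt(sum(s * s for s in samples) / count)


def speech_frames(frames, threshold=500.0):
    """Yield only the frames whose RMS energy reaches the (assumed) threshold."""
    for frame in frames:
        if rms(frame) >= threshold:
            yield frame
```

Only the frames yielded by `speech_frames` would then be passed on to the recognizer, so silence and low-level hum never produce a hypothesis.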

Speech Recognition PlatformNotSupportedException

Question: I'm running a sample that uses speech recognition, and it's not working on Windows 8 or Windows Server 2012. On those systems the following code returns no records, although it does on Windows 7:

    foreach (RecognizerInfo rec in System.Speech.Recognition.SpeechRecognitionEngine.InstalledRecognizers())
    {
        Console.WriteLine(rec.Culture);
    }

If I call LoadGrammar on Windows 8 or Windows Server 2012, it raises an exception:

    _recognizer.LoadGrammar (new Grammar (new GrammarBuilder (new Choices (_numbersGrammar.Keys

Android SpeechRecognizer should only be used from the application's main thread

I am trying to integrate some of Android's speech APIs into my AndEngine-based game. I placed my code within the BaseGame activity, but this error appears at runtime:

    05-06 23:51:28.955: ERROR/AndroidRuntime(553): java.lang.RuntimeException: SpeechRecognizer should be used only from the application's main thread

How do I access the application's main thread, and how can I ensure my speech code runs on it? The setup is as follows: the main class is a BaseGameActivity, and there is a separate SpeechClass. The main class instantiates a new SpeechClass and calls its method, and then I get the above error. Please advise on how I can do

C# system.speech.recognition alternate words

I am currently using the Microsoft.Speech API to dictate utterances into text, but what I really need is the alternative dictations the program could have chosen. I am using this for my honours thesis, for which I want to know the top 10 interpretations of any utterance. A very similar, if not identical, question was asked in 2011 ("C# system.speech.recognition alternates") but was never answered. My question is thus: how does one get the alternatives to an interpretation of a dictation using the Microsoft.Speech API?

Answer: This MSDN page handles what you're asking quite nicely. For reference, I'll post the

web speech api - speech synthesis .lang property not working

I'm trying to use the Web Speech API to speak a word in Portuguese. I set the lang property to 'pt-BR' (unfortunately, European Portuguese is not supported), but it always replies in English. Can someone help? Thanks. Code:

    <script type="text/javascript">
    var synth = window.speechSynthesis;
    function falatarea(){
        var utteranceY = new SpeechSynthesisUtterance();
        utteranceY.text = "teste";
        utteranceY.lang = "pt-BR";
        utteranceY.voice = "pt-BR";
        window.speechSynthesis.speak(utteranceY);
    }
    </script>

Answer: Seems like Chrome broke recently. I had code working, and now it's not. https://code.google.com/p/chromium/issues

Interfacing LIVE SPEECH with Tkinter GUI

I want to interface pocketsphinx LiveSpeech with a Python tkinter GUI so that the GUI is visible in the foreground while LiveSpeech works in the background. But when I merge the tkinter code with the LiveSpeech code, the LiveSpeech code always runs first and the GUI does not show until I stop the code, so I can't perform my required task.

    #*********************************** IMPORTING MODULES*****************
    import tkinter
    from tkinter import*
    import tkinter.messagebox
    import sqlite3
    import os
    from pocketsphinx import LiveSpeech, get_model_path
    conn = sqlite3.connect('portal.db')
    c = conn.cursor()
    window =
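The symptom described here is typical of a blocking loop running on the same thread as the GUI: iterating over the recognizer never returns, so the tkinter main loop never starts. A common pattern, sketched below under assumptions, is to run the blocking loop in a daemon thread and hand results to the GUI through a queue that the GUI polls with `after`. The function names are hypothetical, and `phrases` stands for any iterable of recognized phrases; this is not the asker's code.

```python
import queue
import threading


def recognise_in_background(phrases, results):
    """Consume the blocking `phrases` iterable in a daemon thread, queueing each item as text."""
    def worker():
        for phrase in phrases:
            results.put(str(phrase))

    thread = threading.Thread(target=worker, daemon=True)
    thread.start()
    return thread


def drain(results):
    """Return every phrase currently waiting in the queue, without blocking."""
    items = []
    while True:
        try:
            items.append(results.get_nowait())
        except queue.Empty:
            return items


def run_gui(phrases):
    """Show the window first, then poll the queue from Tk's own event loop."""
    import tkinter  # imported here so the helpers above work without a display

    window = tkinter.Tk()
    label = tkinter.Label(window, text="Listening...")
    label.pack()
    results = queue.Queue()
    recognise_in_background(phrases, results)

    def poll():
        for text in drain(results):
            label.config(text=text)  # widgets are only touched on the GUI thread
        window.after(100, poll)      # check again in 100 ms

    poll()
    window.mainloop()
```

With pocketsphinx installed, calling `run_gui(LiveSpeech())` should, under these assumptions, show the window immediately while recognition continues in the background.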

How to use the function GetStorageFileName for adding training files to registry?

I am using SAPI 5.4 and I am able to retain the audio for training, but when it comes to adding it to the registry entries as described in "Acoustic training using SAPI 5.3 Speech API", I don't understand which arguments to use for ISpObjectToken::GetStorageFileName. Can somebody help me out with it?

Answer: ISpObjectToken::GetStorageFileName isn't the best documented API in the world. Luckily, I have some worked examples to help improve the documentation.

clsidCaller: Set this if you need the file name associated with the object token. For training, this should be CLSID_NULL. (If you're updating engine-related

Parse speech output to a JSON to call Application API

Question: Here is an idea: we have web applications that expose RESTful APIs accepting JSON. How about using the Google speech APIs to take the user's voice input, convert it to text, somehow translate that text into the JSON required by the APIs, and then call those application APIs with the JSON? Is there any library to translate text into a specified JSON format? Has anybody used this approach?

Answer 1: This is called "intent analysis". There are libraries for it, for example RASA. For example, your input is "show me
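To make the idea of mapping transcribed text to a JSON request concrete, here is a minimal rule-based sketch. It does not use RASA or any speech API; the intent names, slot names, and phrasings are all hypothetical, and a real system would more likely rely on a trained intent model than on regular expressions.

```python
import json
import re

# Hypothetical intent patterns, for illustration only.
PATTERNS = [
    ("show_orders", re.compile(r"show me (?:the )?orders for customer (?P<customer_id>\d+)")),
    ("create_user", re.compile(r"create (?:a )?user named (?P<name>\w+)")),
]


def text_to_request(text):
    """Map transcribed text to a JSON string, or return None if no pattern matches."""
    cleaned = text.strip().lower()
    for intent, pattern in PATTERNS:
        match = pattern.fullmatch(cleaned)
        if match:
            payload = {"intent": intent, "slots": match.groupdict()}
            return json.dumps(payload, sort_keys=True)
    return None
```

The returned string could then be sent as the body of the REST call, while a `None` result would be the cue to ask the user to rephrase.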

PlatformNotSupportedException Using .NET Speech Recognition

Question: I'm trying out voice recognition in C# using System.Speech.Recognition. I searched around on the internet and tried several pieces of code for basic speech recognition; the best one I could find was this:

    using System;
    using System.Text;
    using System.Windows.Forms;
    using System.Speech.Recognition;

    namespace SpeechRecognition
    {
        public partial class MainForm : Form
        {
            SpeechRecognitionEngine recognitionEngine;

            public MainForm()
            {
                InitializeComponent();
                Initialize();
            }