speech-recognition | 易学教程

Python Voice Recognition Library - Always Listen?

阅读更多关于 Python Voice Recognition Library - Always Listen?

I've recently been working on using a speech recognition library in python in order to launch applications. I Intend to ultimately use the library for voice activated home automation using the Raspberry Pi GPIO. I have this working, it detects my voice and launches application. The problem is that it seems to hang on the one word I say (for example, I say internet and it launches chrome an infinite number of times) This is unusual behavior from what I have seen of while loops. I cant figure out how to stop it looping. Do I need to do something out of the loop to make it work properly? Please

Accuracy of MS System.Speech.Recognizer and the SpeechRecognitionEngine

阅读更多关于 Accuracy of MS System.Speech.Recognizer and the SpeechRecognitionEngine

问题 I am currently testing the SpeechRecognitionEngine by loading from an xml file a pretty simple rule. In fact it is a simple between ("decrypt the email", "remove encryption") or ("encrypt the email", "add encryption"). I have trained my Windows 7 PC and additionally added the words encrypt and decrypt as I realize they are very similar. The recognizer already has a problem with making a difference between these two. The issue I am having is that it recognizes things too often. I have set the

ios speech recognition Error Domain=kAFAssistantErrorDomain Code=216 “(null)”

阅读更多关于 ios speech recognition Error Domain=kAFAssistantErrorDomain Code=216 “(null)”

Basically I am learning ios speech recognition module following this tutorial: https://medium.com/ios-os-x-development/speech-recognition-with-swift-in-ios-10-50d5f4e59c48 But when I test it on my iphone6, I always got this error: Error Domain=kAFAssistantErrorDomain Code=216 "(null)" I searched it on the internet, but find very rare info about this. Here is my code: // // ViewController.swift // speech_sample // // Created by Peizheng Ma on 6/22/17. // Copyright © 2017 Peizheng Ma. All rights reserved. // import UIKit import AVFoundation import Speech class ViewController: UIViewController,

Is there a voice authentication library? [closed]

阅读更多关于 Is there a voice authentication library? [closed]

Does anyone know if there's a good voice authentication library available? Or do I need to find a good voice recognition library and build from there? I've tried searching google using the tearm "voice authentication" library with no luck. There are several open source packages implementing speaker recognition. For example Mistral http://mistral.univ-avignon.fr/ There are clones of mistral too http://code.google.com/p/improved-mistral/ Speech Recognition is usually quite different from voice authentication (often called voice biometrics). Here is a good overview document - http://www.sans.org

Microsoft Speech Recognition - what reference do I have to add?

阅读更多关于 Microsoft Speech Recognition - what reference do I have to add?

问题 I'm trying to make a C# program that uses the Microsoft Speech Recognition API (with Kinect) but I'm struggling to get started. I have the using statements using Microsoft.Speech.AudioFormat; using Microsoft.Speech.Recognition; But it's giving me the error "the type or namespace name 'Speech' does not exist in the namespace 'Microsoft'. Are you missing an assembly reference?" Does anyone know what to add? System.Speech doesn't do the trick. 回答1: Make sure you add this assembly to your project

Widget that calls speech recognition app

阅读更多关于 Widget that calls speech recognition app

问题 I'm trying to create a widget that contains a single ImageView which, when clicked, starts speech recognition application. I've never worked with widgets and pending intents, so I'm confused: how to create a pending intent for starting speech recognition activity? I tried with something like this, but it, of course, fails: Intent intent = new Intent(); Intent voiceIntent = new Intent(RecognizerIntent.ACTION_RECOGNIZE_SPEECH); voiceIntent.putExtra(RecognizerIntent.EXTRA_LANGUAGE_MODEL,

Server-side Voice Recognition [closed]

阅读更多关于 Server-side Voice Recognition [closed]

Anyone know of any good server side voice recognition engines that are already hosted? I.e. I want to be able to call a simple web API posting some sound data and get text back. Doesn't have to be free - but hopefully free to experiment with. David J. Liszewski There are several IVR services which host an entire VOIP session (telephone call) as a complete application, rather than offer individual service transactions "àla carte". If you were to make your program look like a VOIP call, you might be able to get it done with some of these services. Voxeo published a list of free (and low cost)

How to handle ERROR_RECOGNIZER_BUSY

阅读更多关于 How to handle ERROR_RECOGNIZER_BUSY

问题 In my voice recognition based app, I sometimes receive ERROR_RECOGNIZER_BUSY. Intuitively, this calls for... retries , right? The problem is that this error is very undocumented, so obviously I have questions that perhaps someone more experienced in the field is able to answer: What triggers such an error? Is it really only busy server (at Google)? or this could also hint at a bug in my app? Do I have to explicitly close/reopen a session before a retry? How often to retry? once every 1-second

Voice/Speech to text [closed]

阅读更多关于 Voice/Speech to text [closed]

I need an API or library (preferably free) that will convert voice/speech through a microphone, into text (string). Additionally, I will need an API or library that can do text-to-speech. I'd like to use C# and .NET, but other languages will suffice. Thanks. ShahidAzim You can use CMU Sphinx as it is pretty open and scalable solution and I think it can be used at both client and server side: http://cmusphinx.sourceforge.net/ If you are looking for a Microsoft desktop solution then you can use SAPI: http://msdn.microsoft.com/en-us/magazine/cc163663.aspx On server side, you can use Microsoft

Speech Recognition Android App

阅读更多关于 Speech Recognition Android App

问题 I'm making an app that takes commands from User and write it in real time. What would be the Best option for me to take? Third Party software like sphinx or should I use the built in (android speech recognition)? Secondly I want it to write in real time, like when I speak it starts writing? 回答1: You should use the built in Android Speech recognition. Specifically, you will need to operate the SpeechRecognier API so that there is no popup dialog box. Also, do not expect SpeechRecognizer to