speech-recognition | 易学教程

Detect fluency from google speech api results

Question: I am trying to determine the fluency of a speaker using the Google Speech (to text) API. So far I have found that the API (betav1) can show the time taken to speak a word (its start time and end time). And from Wikipedia: Oral fluency or speaking fluency is a measurement both of production and reception of speech, as a fluent speaker must be able to understand and respond to others in conversation. Spoken language is typically characterized by seemingly non-fluent qualities (e.g., fragmentation,
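As a rough illustration of what per-word start and end times make possible, the sketch below derives a few timing-based indicators (speaking rate, articulation rate, pause count). It is only an assumption about how fluency might be approximated, not a method given in the question: the `Word` class, the `fluency_metrics` function, the 0.5-second pause threshold, and the sample data are all hypothetical, and the timestamps are assumed to have already been extracted from the API response.

```python
from dataclasses import dataclass


@dataclass
class Word:
    text: str
    start: float  # seconds from the beginning of the audio
    end: float    # seconds from the beginning of the audio


def fluency_metrics(words, pause_threshold=0.5):
    """Compute simple timing-based indicators from word timestamps.

    pause_threshold is a hypothetical cut-off (in seconds): any gap between
    two consecutive words at least this long is counted as a pause.
    """
    if not words:
        return {"words_per_minute": 0.0, "articulation_rate": 0.0,
                "pause_count": 0, "total_pause_time": 0.0}

    # Time from the first word's start to the last word's end.
    total_time = words[-1].end - words[0].start
    # Time actually spent articulating words (silences excluded).
    speaking_time = sum(w.end - w.start for w in words)

    # Silent gaps between each word and the next one.
    gaps = [nxt.start - cur.end for cur, nxt in zip(words, words[1:])]
    pauses = [g for g in gaps if g >= pause_threshold]

    return {
        "words_per_minute": 60.0 * len(words) / total_time if total_time > 0 else 0.0,
        "articulation_rate": 60.0 * len(words) / speaking_time if speaking_time > 0 else 0.0,
        "pause_count": len(pauses),
        "total_pause_time": sum(pauses),
    }


# Made-up sample: three words with a one-second hesitation before the last.
sample = [
    Word("the", 0.0, 0.25),
    Word("quick", 0.25, 0.5),
    Word("fox", 1.5, 2.0),
]
metrics = fluency_metrics(sample)
```

Timing alone covers only the production side of the Wikipedia definition quoted above; it says nothing about comprehension or about whether the recognized words are correct.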

How to control Microsoft Speech Recognition app?

Question: I want to know if it's possible to control "Microsoft Speech Recognition" using C#. Is it possible, for instance, to simulate the click on "On: Listen to everything I say" programmatically using C# or Python? Answer 1: JRobert had the right idea. If you were using C++, then you would call ISpRecognizer::SetRecoState(SPRST_ACTIVE), and then, if you're running on

Detecting known words using the Web Speech API

Question: I'm in the planning stages of a web app that is intended to help children learn vocabulary. We would like to make the app as interactive as possible. For example, we would show a picture of an apple and ask the child to identify the object. The child would then say "apple" and we would determine whether they are correct, etc. The new Web Speech API seems like a promising tool for this project. However, looking through the documentation, I believe it will only produce transcripts from speech
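Even if the recognizer only returns transcripts, the "is the child correct?" check can be done on those transcripts. The sketch below assumes the app has already collected the alternative transcripts for one utterance (the Web Speech API can return several per result) and simply tests whether any of them contains the expected word. The function names `normalize` and `said_target` are hypothetical, and the logic is written in Python purely for illustration; in the browser the same comparison would be done in JavaScript.

```python
import string


def normalize(text):
    """Lowercase, strip punctuation, and split into words."""
    table = str.maketrans("", "", string.punctuation)
    return text.lower().translate(table).split()


def said_target(alternatives, target):
    """Return True if any alternative transcript contains the target word(s).

    alternatives: list of transcript strings for a single utterance.
    target: the expected word or phrase, e.g. "apple" or "ice cream".
    """
    target_tokens = normalize(target)
    if not target_tokens:
        return False
    n = len(target_tokens)
    for transcript in alternatives:
        tokens = normalize(transcript)
        # Slide a window over the transcript so multi-word targets also match.
        if any(tokens[i:i + n] == target_tokens
               for i in range(len(tokens) - n + 1)):
            return True
    return False


# Made-up alternatives for a child looking at a picture of an apple.
correct = said_target(["An apple!", "a chapel"], "apple")
```

Matching on whole words means "pineapple" does not count as "apple"; whether that strictness suits young speakers is a design choice the question leaves open.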

Can I get speech recognition in language other than English on Android?

Question: I am trying to build an application (which would use Hindi and other regional languages) that accepts voice commands. I also need text-to-speech functionality in my application. I was wondering if there is any way I could get a speech recognition library on Android. I did a quick Google search and found a couple of libraries for Hindi on the Internet, but I am not sure if I can include them in my Android project. Can I? Answer 1: You can use pocketsphinx for your app. It has Java and

Google Python Speech Recognition with wake word

Question: I am using the Google Python speech recognition on a Raspberry Pi and it works really well. However, it is constantly listening, and I would prefer to have a wake word so that it only listens when I need it to. At the moment it listens until there is a silence (or a return to whatever the ambient background noise is) and then sends the sound clip to the API for speech-to-text recognition. Is there a way for me to shorten the length of time that it is listening? This way it can be listening to
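One simple way to approximate a wake word, sketched here under assumptions, is to transcribe short clips and act only when the transcript contains the wake word, treating whatever follows it as the command. The function `extract_command` and the wake word "jarvis" are hypothetical, and the transcript is assumed to come from whatever recognizer is already in use. Note that this still sends every clip to the recognizer; a local keyword spotter would be needed to avoid that.

```python
def extract_command(transcript, wake_word="jarvis"):
    """Return the text spoken after the wake word, or None if it was not said.

    transcript: recognized text for one audio clip.
    wake_word: hypothetical wake word or phrase; matched case-insensitively.
    """
    words = transcript.lower().split()
    wake = wake_word.lower().split()
    n = len(wake)
    for i in range(len(words) - n + 1):
        if words[i:i + n] == wake:
            # Everything after the wake word is treated as the command.
            return " ".join(words[i + n:])
    return None


command = extract_command("hey jarvis turn on the light")
ignored = extract_command("turn on the light")
```

An empty string means the wake word was heard with nothing after it, which a caller could use as the cue to start a longer listening window.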

Discord receive audio

Question: I want to receive audio from Discord in order to perform speech recognition. I haven't found anything for this in the Python Discord API. The speech recognition is no problem, but I have no idea how to receive audio from Discord. Maybe someone can help me. Answer 1: This functionality doesn't really exist. There is a VoiceClient.poll_voice_ws coroutine, but that just reads directly from the socket. You'll have to come up with some way of decoding that into audio and further handling it yourself. Source: https:/

Local spoken command recognition on mobile devices

Question: I need to implement spoken command recognition in Android and iPhone apps without the use of internet communication. It all has to be processed locally on the devices, and it must be able to distinguish 20-50 commands. In other words: local, limited-vocabulary, speaker-independent speech recognition running on both Android and iOS. Searching Google has not enlightened me enough, so I was hoping for some advice from people who have worked with this subject. Any advice? Answer 1: You can use CMUSphinx.

CMU Sphinx4 phoneme dictation

Question: How can I configure sphinx4 to detect only phonemes in a dictation? I've already read about partial results: "You can control how often the result listener is fired by setting the configuration variable 'featureBlockSize' in the decoder." But my problem is that a grammar is always needed, like hello.gram in the helloworld example. I need to be able to detect and recognize phonemes from continuous speech. Answer 1: This is what Sphinx has to say about it: Phoneme Recognition

Remove profanity censor from Google Speech Recognition

Question: I am trying to convert speech to text in an iOS application using Google's Speech to Text API. I am simply sending some audio data to the URL "https://www.google.com/speech-api/v1/recognize?xjerr=1&client=chromium&lang=en-US" and it returns the (mostly) correct words I say. However, it replaces any profanity with '####'. How can I replace the '####' with the actual curse words? Just some additional information: I am using the todoroo SpeechToText library. The code for the request

CMU Sphinx 4 - 5 pre alpha install guide

Question: Are there any good, preferably step-by-step install guides for CMU Sphinx 4 - 5prealpha (latest version)? I couldn't find any, and the official guide is outdated. I've been trying to follow the steps, but after I'm done I'm missing 2 jars (jsapi and tags), and if I try to get them from the previous version, Eclipse doesn't find any of the classes. I haven't had much experience with setting up such projects for Eclipse; all I did before was maybe create some custom libraries, so that's why I'm asking