speech-recognition | 易学教程

Open-source tool for gender-recognition using voice [closed]

阅读更多关于 Open-source tool for gender-recognition using voice [closed]

问题 Closed. This question is off-topic. It is not currently accepting answers. Want to improve this question? Update the question so it's on-topic for Stack Overflow. Closed 4 years ago . Good evening, I'm working on a project with Kinect and I need to perform a gender-recognition using the voice information caught by the microphone of the sensor. The recognition could also be text-based , i.e. the sentence said by the user could always be the same. I "just" need the binary answer male/female. I

Android record audio while doing speech recognition

阅读更多关于 Android record audio while doing speech recognition

问题 I am doing speech recognition using a third party cloud service on Android, and it works well with Android API SpeechRecognizer. Code below: Intent recognizerIntent = new Intent(RecognizerIntent.ACTION_RECOGNIZE_SPEECH); recognizerIntent.putExtra(RecognizerIntent.EXTRA_LANGUAGE_MODEL, RecognizerIntent.LANGUAGE_MODEL_WEB_SEARCH); // accept partial results if they come recognizerIntent.putExtra(RecognizerIntent.EXTRA_PARTIAL_RESULTS, true); //need to have a calling package for it to work if (

How to implement speech-to-text via Speech framework [closed]

阅读更多关于 How to implement speech-to-text via Speech framework [closed]

问题 Closed . This question needs to be more focused. It is not currently accepting answers. Want to improve this question? Update the question so it focuses on one problem only by editing this post. Closed 2 years ago . I want to do speech recognition in my Objective-C app using the iOS Speech framework. I found some Swift examples but haven't been able to find anything in Objective-C. Is it possible to access this framework from Objective-C? If so, how? 回答1: After spending enough time looking

Using Tensorflow's Connectionist Temporal Classification (CTC) implementation

阅读更多关于 Using Tensorflow's Connectionist Temporal Classification (CTC) implementation

问题 I'm trying to use the Tensorflow's CTC implementation under contrib package (tf.contrib.ctc.ctc_loss) without success. First of all, anyone know where can I read a good step-by-step tutorial? Tensorflow's documentation is very poor on this topic. Do I have to provide to ctc_loss the labels with the blank label interleaved or not? I could not be able to overfit my network even using a train dataset of length 1 over 200 epochs. :( How can I calculate the label error rate using tf.edit_distance?

Any OpenCV-like C/C++ library for Audio processing? [closed]

阅读更多关于 Any OpenCV-like C/C++ library for Audio processing? [closed]

问题 Closed. This question is off-topic. It is not currently accepting answers. Want to improve this question? Update the question so it's on-topic for Stack Overflow. Closed 4 years ago . Is there anything more out there, that resembles (in spirit) OpenCV, but for processing audio and deriving some intelligence from it ? Capabilities could range from:- Multiplatform audio capture and audio playback DSP - Audio filters Tone detection Tonal property analysis Tone synthesis (various standard

SpeechSynthesizer - How do I play/save the wav file?

阅读更多关于 SpeechSynthesizer - How do I play/save the wav file?

问题 I have the following code snippet in an ASP.NET app (non Silverlight) string sText = "Test text"; SpeechSynthesizer ss = new SpeechSynthesizer(); MemoryStream ms = new MemoryStream(); ss.SetOutputToWaveStream(ms); ss.Speak(sText); //Need to send the ms Memory stream to the user for listening/downloadin How do I: Play this file on the browser Prompt for the user to download a wav file? Can anyone help with completing the code? EDIT: Any help is appreciated. 回答1: Here's the main bit to an

web speech api - speech synthesis .lang property not working

阅读更多关于 web speech api - speech synthesis .lang property not working

问题 im trying to use web speech api to transcript a word in Portuguese, i set the property to 'pt-BR' ( unfortunately Portuguese - european is not supported) but, always replies in english. Can someone help? Thanks code: <script type="text/javascript"> var synth = window.speechSynthesis; function falatarea(){ var utteranceY = new SpeechSynthesisUtterance(); utteranceY.text = "teste"; utteranceY.lang = "pt-BR"; utteranceY.voice = "pt-BR"; window.speechSynthesis.speak(utteranceY); } </script> 回答1:

Speech recognition for windows desktop application

阅读更多关于 Speech recognition for windows desktop application

问题 In Ionic Typescript application I'm using import { SpeechRecognition, SpeechRecognitionListeningOptionsAndroid, SpeechRecognitionListeningOptionsIOS } from '@ionic-native/speech-recognition'; which work impressive on mobile device, gives you number of correct or close recognition variants result even with a quite low speech. In WinForms C# desktop application, I'm using System.Speech.Recognition; , which works fine, but not enough in comparison, fairly often event gives many non-accurate

C# system.speech.recognition alternate words

阅读更多关于 C# system.speech.recognition alternate words

问题 I am currently using the Microsoft.Speech API to dictate utterances into text, but what I really need is the alternative dictations the program could use. I am using this for my honours thesis, and for it I wish to know the top 10 interpretations of any utterance. A very similar, if not exact question was asked in 2011: C# system.speech.recognition alternates But was never answered. My question thus is: how does one get the alternatives to an interpretation of a dictation using the Microsoft

How to use the function GetStorageFileName for adding training files to registry?

阅读更多关于 How to use the function GetStorageFileName for adding training files to registry?

问题 I am using sapi 5.4 and I am able to retain the audio for training but for adding it to registry entries as given in this Acoustic training using SAPI 5.3 Speech API I am not able to understand the arguments to be used for ISpObjectToken::GetStorageFileName can somebody help me out with it? 回答1: ISpObjectToken::GetStorageFileName isn't the best documented API in the world. Luckily, I have some worked examples to help improve the documentation. clsidCaller Set this if you need the file name