voice

PlatformNotSupportedException Using .NET Speech Recognition

廉价感情. 提交于 2019-12-01 20:32:52
So I'm trying voice recognition for C#, I'm using System.Speech.Recognition, and, I was searching around on the internet, trying out several pieces of code for some basic speech recognition, the best one I could find was this: using System; using System.Text; using System.Windows.Forms; using System.Speech.Recognition; namespace SpeechRecognition { public partial class MainForm : Form { SpeechRecognitionEngine recognitionEngine; public MainForm() { InitializeComponent(); Initialize(); } private void Initialize() { recognitionEngine = new SpeechRecognitionEngine(); recognitionEngine

play raw audio file in python in realtime

北战南征 提交于 2019-12-01 09:57:32
问题 I have a udp server in python that continuously receives voice packets from a client in raw format, array of bytes. How can I play the voice on the server side in real time? Any recommended libraries or ways to do it? Here is my very simple server code if needed (which I doubt) import socket UDP_IP = "192.168.1.105" UDP_PORT = 5005 sock = socket.socket(socket.AF_INET, # Internet socket.SOCK_DGRAM) # UDP sock.bind((UDP_IP, UDP_PORT)) while True: data, addr = sock.recvfrom(1024) # buffer size

Jelly Bean Google Voice text-to-speech API?

我的未来我决定 提交于 2019-12-01 05:27:27
问题 Is there a way I can use the text-to-speech from the new Google Voice? In Jelly Bean the pronunciation is really smooth so I was thinking may be there is some kind of API for access to that service? Thanks! 回答1: Yes, it is possible to access the Google Now voice using the Android TTS APIs. You need to use "KEY_FEATURE_NETWORK_SYNTHESIS" in the "params" parameter for the TextToSpeech.speak() call. See http://developer.android.com/reference/android/speech/tts/TextToSpeech.Engine.html#KEY

How to distinguish between voice call and video call on Android?

北城以北 提交于 2019-11-30 16:36:17
I want to log voice call and video call. I must distinguish them. Where is the difference between the intent for ACTION_NEW_OUTGOING_CALL of video call and the intent for ACTION_NEW_OUTGOING_CALL of voice call? There is no difference, because there are no video calls in Android at the time of this writing. Any video chat application will not be using ACTION_NEW_OUTGOING_CALL in all likelihood, so all broadcasts of ACTION_NEW_OUTGOING_CALL are for voice calls. 来源: https://stackoverflow.com/questions/4537924/how-to-distinguish-between-voice-call-and-video-call-on-android

How to distinguish between voice call and video call on Android?

丶灬走出姿态 提交于 2019-11-30 16:08:40
问题 I want to log voice call and video call. I must distinguish them. Where is the difference between the intent for ACTION_NEW_OUTGOING_CALL of video call and the intent for ACTION_NEW_OUTGOING_CALL of voice call? 回答1: There is no difference, because there are no video calls in Android at the time of this writing. Any video chat application will not be using ACTION_NEW_OUTGOING_CALL in all likelihood, so all broadcasts of ACTION_NEW_OUTGOING_CALL are for voice calls. 来源: https://stackoverflow

Any simple VAD implementation?

ぐ巨炮叔叔 提交于 2019-11-30 11:37:32
问题 I'm looking for some C/C++ code for VAD (Voice Activity Detection). Basically, my application is reading PCM frames from the device. I would like to know when the user is talking. I'm not looking for any speech recognition algorithm but only for voice detection. I would like to know when the user is talking and when he finishes: bool isVAD(short* pcm,size_t count); 回答1: There are open source implementations in the Sphinx and Freeswitch projects. I think they are all energy based detectors do

Algorithm to remove vocal from sound track [closed]

本秂侑毒 提交于 2019-11-30 10:18:04
问题 Closed . This question needs to be more focused. It is not currently accepting answers. Want to improve this question? Update the question so it focuses on one problem only by editing this post. Closed last year . I want to remove vocals from mp3 sound tracks. I searched google, and tried few softwares but none of them are convincing. I am planning to read the mp3 file, get a waveform and remove the waveform that is above a specified limit. do you have any suggestions on how to proceed. --

Is it possible to send/receive voice from PC to Android device over Bluetooth

落花浮王杯 提交于 2019-11-30 10:07:00
I have a PC (where java program is running), Bluetooth dongle (which supports A2DP) and Android device (2.3.6, which also supports A2DP). I want to: Send real time voice data from PC's microphone to android device via bluetooth Send real time voice data from Android device's microphone to PC via bluetooth I found this question , where Dennis says that "Phones are typically only Source devices (Source of the stream that can stream to sink devices), Sinks are Headsets or Bluetooth speakers.". So it seems it is possible to implement case 2? Or not? And what about case 1, is it impossible to

Is there a way to have change female to male voice during the conversation in DialogFlow (Api.ai)

六眼飞鱼酱① 提交于 2019-11-30 07:36:30
问题 We a building a chatbot app for Google Home using Api.ai (Dialogflow now) which has both male and female historical figures. We are using Actions on google. Google lets you default to a male or female voice when deploying the app. Is there a way to switch between male to female voice dynamically, for example, using code in webhook? 回答1: Maybe . Although SSML supports a <voice> tag, the documentation for Actions does not list it as supported. However, as you noted, there appears to be some

Is there a voice authentication library? [closed]

让人想犯罪 __ 提交于 2019-11-30 03:54:42
Does anyone know if there's a good voice authentication library available? Or do I need to find a good voice recognition library and build from there? I've tried searching google using the tearm "voice authentication" library with no luck. There are several open source packages implementing speaker recognition. For example Mistral http://mistral.univ-avignon.fr/ There are clones of mistral too http://code.google.com/p/improved-mistral/ Speech Recognition is usually quite different from voice authentication (often called voice biometrics). Here is a good overview document - http://www.sans.org