speech | 易学教程

Speech on WinRT

Question: Is a Speech API available in WinRT (as in Windows Phone 8)? Can I use a third-party speech API instead? I want to implement TTS in my Windows 8/WinRT application; does anyone have suggestions?

Answer 1: Speech is now available for WinRT. Voice Commands and Speech Recognition are available on phone only. TTS is available for all WinRT devices.

Answer 2: There is no Speech API available in WinRT. There may be third-party WinRT components; you would have to look for those. I am not aware of any, but I

Is it possible to write a speech-recognition engine from scratch?

Question: Using some existing math libraries, though. The point is that there is literally no research in this field in my language (Georgian). Is that possible? How long would it take? I know that this also depends on skill, but still?

Answer 1: Also answered at "Speech to text conversion for non-english language". On "Is it possible to write a speech-recognition engine from scratch?": you do not need to write an engine from scratch. There are many engines already available, and you can just pick one, like CMUSphinx

Number of Zero-crossings - Equation

Question: I have written an algorithm that calculates the number of zero-crossings within a signal, that is, the number of times a value changes from + to - and vice versa. The algorithm works like this. Given the elements v1 = {90, -4, -3, 1, 3}, multiply each value by the value next to it (v[i] * v[i+1]), then take the sign of the product, sign(val), to determine whether it is positive or negative. Example: e1 = {90 * -4} = -360 -> signum(e1) = -1; e2 = {-4 * -3} = 12 -> signum(e2) =
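
The product-sign idea described in the question can be sketched in a few lines of Python. This is only an illustration of that description, not the asker's actual code: the function name `count_zero_crossings` is made up for this example, and the choice to ignore samples that are exactly zero (a zero product is not counted as a crossing) is an assumption.

```python
def count_zero_crossings(samples):
    """Count sign changes between adjacent samples.

    Two neighbours have opposite signs exactly when their product is
    negative. Assumption: a product of zero (one sample is exactly 0)
    is not counted as a crossing.
    """
    crossings = 0
    for current, following in zip(samples, samples[1:]):
        if current * following < 0:
            crossings += 1
    return crossings


v1 = [90, -4, -3, 1, 3]
print(count_zero_crossings(v1))  # prints 2 (90 -> -4 and -3 -> 1)
```

For the sample data the adjacent products are -360, 12, -3 and 3, so two of them are negative and the count is 2. A signal that touches zero without changing sign, or passes through an exact zero sample, would need extra handling beyond this sketch.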

No podspec found for googleapis in

Question: Installing the googleapis pod gives the error "No podspec found for googleapis in ." I am trying to install pod 'googleapis', :path => '.' but it fails with the following error:

    Analyzing dependencies
    Fetching podspec for googleapis from .
    [!] No podspec found for googleapis in .

My Podfile is:

    # Uncomment the next line to define a global platform for your project
    platform :ios, '8.1'
    target 'appName' do
    # Uncomment the next line if you're using Swift or would like to use dynamic frameworks
    # use

Weird pocketsphinx accuracy degradation

Question: A little bit of context: I'm creating a pocketsphinx application for Android using JSGF grammars (though the problem persists with FSG grammars as well). My code is based on the pocketsphinx demo for Android. Pocketsphinx is producing a bizarre issue where the accuracy appears to degrade after only a couple of queries to the engine. The first query has near-perfect accuracy: it can recognize relatively complicated and convoluted phrases without difficulty. However the second, and

Matching wildcard/dictation in Microsoft Speech Grammar

Question: I'm using the Microsoft Speech API to load a grxml grammar:

    Grammar grammar = new Grammar(file);
    grammar.Enabled = true;
    SpeechRecognitionEngine sre = GetEngine();
    sre.LoadGrammarAsync(grammar);

In MSDN I cannot find a tag to match a wildcard or free spoken text, like:

    <item>My message is {dictation}</item>

It seems to be available in code with a DictationGrammar and appendDictation(). It's also available in WSRMacro XML using *, but I do not know how to do it in XML. The skip text but I need to

Detect fluency from google speech api results

Question: I am trying to determine the fluency of a speaker using the Google Speech (to text) API. So far I have found that the API (betav1) can report the time taken to speak a word (its start time and end time). And from Wikipedia: Oral fluency or speaking fluency is a measurement both of production and reception of speech, as a fluent speaker must be able to understand and respond to others in conversation. Spoken language is typically characterized by seemingly non-fluent qualities (e.g., fragmentation,
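
Since the question mentions per-word start and end times, one possible starting point is to derive simple timing measures from them, such as speaking rate and the pauses between words. The sketch below is an assumption-laden illustration, not the Google API and not an established fluency metric: the `(word, start, end)` tuple layout, the function name `timing_stats`, the 0.5-second pause threshold, and the sample values are all hypothetical.

```python
def timing_stats(words, pause_threshold=0.5):
    """Summarise word timings.

    words: list of (word, start_seconds, end_seconds) in spoken order.
    This layout is hypothetical; real API results would need converting.
    pause_threshold: minimum silent gap, in seconds, counted as a pause
    (an arbitrary value chosen for illustration).
    """
    if not words:
        return {"words_per_minute": 0.0, "pause_count": 0, "total_pause": 0.0}

    # Silent gap between the end of one word and the start of the next.
    gaps = [nxt[1] - cur[2] for cur, nxt in zip(words, words[1:])]
    pauses = [gap for gap in gaps if gap >= pause_threshold]

    duration = words[-1][2] - words[0][1]
    rate = len(words) * 60 / duration if duration > 0 else 0.0
    return {
        "words_per_minute": rate,
        "pause_count": len(pauses),
        "total_pause": sum(pauses),
    }


# Made-up timings, for illustration only.
sample = [("hello", 0.0, 0.5), ("my", 0.5, 1.0), ("name", 2.0, 2.5), ("is", 2.5, 3.0)]
print(timing_stats(sample))
```

These numbers capture only the timing side of speech production. They say nothing about comprehension or conversational response, which the quoted definition also treats as part of fluency.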

Speech recognition and sound comparison with musicg

Question: I'm trying to make an Android application with speech recognition, but unfortunately Google doesn't support my language (Macedonian), so I'm trying to compare two recorded sounds instead. I'm using http://code.google.com/p/musicg/ to record and compare speech, and I'm stuck on initializing the settings for detecting speech. Can someone tell me how to rewrite this init function for speech detection, or suggest some other way to do it? It's very important to me. This is the initialization for

Small-size speech recognition on Android to look for keywords

Question: I'm developing a voice command app and need to use speech-to-text on Android. I want my app to work offline. So far that is possible only on Jelly Bean, and it requires downloading a huge database and keeping it on the device. But I don't need the whole database; I just want a few keywords for the conversions. Is it possible to record .wav files of our own, associate each with a particular word, and, when a voice input is given, match the two voice tracks and recognize the corresponding

Is there a way to rank the difficulty of pronunciation of a word?

Question: I'm trying to build a collection of English words that are difficult to pronounce. I was wondering whether there is an algorithm or a theory that can be used to show how difficult a word is to pronounce. Does this appear to you as something that can be computed? As this seems to be a very subjective thing, let me make it more objective: say, the hardest words to pronounce for text-to-speech technologies.

Answer 1: One approach would be to build a list with two versions of each word. One the