speech-recognition

Android record audio while doing speech recognition

萝らか妹 posted on 2019-12-03 03:44:23
I am doing speech recognition using a third-party cloud service on Android, and it works well with the Android API SpeechRecognizer. Code below:
    Intent recognizerIntent = new Intent(RecognizerIntent.ACTION_RECOGNIZE_SPEECH);
    recognizerIntent.putExtra(RecognizerIntent.EXTRA_LANGUAGE_MODEL, RecognizerIntent.LANGUAGE_MODEL_WEB_SEARCH);
    // accept partial results if they come
    recognizerIntent.putExtra(RecognizerIntent.EXTRA_PARTIAL_RESULTS, true);
    // need to have a calling package for it to work
    if (!recognizerIntent.hasExtra(RecognizerIntent.EXTRA_CALLING_PACKAGE)) {
        recognizerIntent.putExtra

How Shazam or Sound Hound works? [closed]

懵懂的女人 posted on 2019-12-03 03:23:46
Question (closed 3 years ago as needing more focus): I'm developing an iOS application with the SDK for iOS 5.0 and Xcode 4.2. I want to develop an application that recognizes sounds. I see there is an application called Sound Hound that recognizes music and tells you the artist and title. How can I do something similar? I want to compare a

Using Mac OSX Dictation with Speech API

懵懂的女人 posted on 2019-12-03 03:22:35
In OS X Mavericks, speech dictation is now included, and it is very useful. I am trying to use the dictation capability to create my own digital life assistant, but I can't find out how to use the recognition functionality to get the speech into an application rather than a text box. I have looked into NSSpeechRecognizer, but that seems to be geared toward programming speakable commands with a predefined grammar rather than dictation. It doesn't matter what programming language I use, but Python or Java would be nice... Thanks for your help! You can use SFSpeechRecognizer (mirror) (requires macOS 10

Pocket Sphinx on Android

[亡魂溺海] posted on 2019-12-03 02:51:08
What are the steps required to use PocketSphinx on Android? I have found various hints around the web that it is possible, but no real answers, hence I ask my question here. Nick Ruiz: This is the most comprehensive resource I've found to date. I still have some compilation errors, though. The blog gives you a tutorial on how to build the PocketSphinxAndroidDemo app, which was developed (and discontinued) on GitHub. Personally, I'm having compilation errors similar to those in the Stack Overflow question here, but my rep is too low to ask for clarification on the answer. The latest and most

How do I search content, within audio files/streams? [closed]

半城伤御伤魂 posted on 2019-12-03 02:48:14
Question (closed 7 years ago as not a good fit for the Q&A format): I have always wondered how many different search techniques exist for searching text, images, and even videos.

Is there a way to force Google Speech api to return only words as response?

前提是你 posted on 2019-12-03 01:56:25
I am using this Google API: https://www.google.com/speech-api/v2/recognize?output=json&lang="+language_code+"&key="My key" for speech recognition, and it works very well. The issue is with numbers: if I say one two three four, the result is 1234, and if I say one thousand two hundred thirty four, the result is still 1234. The same issue occurs in other languages. For example, the German word elf means eleven; if you say elf, the result is 11 instead of elf. I know we have no control over the API, but are there any parameters or hacks we can add to force it to return only
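One client-side workaround is to spell the digits back out after the response arrives. The following is a minimal sketch in Python, not a feature of the API: `DIGIT_WORDS` and `spell_digits` are hypothetical names, the table covers English only, and the approach cannot tell "one two three four" from "one thousand two hundred thirty four", because both arrive as 1234.

```python
# Hypothetical post-processing step: spell out digit-only tokens one digit at a time.
# English only; a German table would be needed to turn "11" back into "elf".
DIGIT_WORDS = {
    "0": "zero", "1": "one", "2": "two", "3": "three", "4": "four",
    "5": "five", "6": "six", "7": "seven", "8": "eight", "9": "nine",
}


def spell_digits(text):
    """Replace each token made only of ASCII digits with its digits spelled out."""
    words = []
    for token in text.split():
        if all(ch in DIGIT_WORDS for ch in token):
            # Digit-only token: emit one word per digit.
            words.extend(DIGIT_WORDS[ch] for ch in token)
        else:
            # Anything else (words, mixed tokens) passes through unchanged.
            words.append(token)
    return " ".join(words)


print(spell_digits("1234"))  # one two three four
```

Tokens that mix letters and digits are left untouched, so the function only rewrites what it can spell unambiguously.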

Google Speech Recognition API: timestamp for each word?

ぃ、小莉子 posted on 2019-12-03 01:48:56
It's possible to use Google's speech recognition API to get a transcription of an audio file (WAV, MP3, etc.) by sending a request to http://www.google.com/speech-api/v2/recognize?... Example: I said "one two three four five" in a WAV file. The Google API gives me this: { u'alternative': [ {u'transcript': u'12345'}, {u'transcript': u'1 2 3 4 5'}, {u'transcript': u'one two three four five'} ], u'final': True } Question: is it possible to get the time (in seconds) at which each word was said? With my example: ['one', 0.23, 0.80], ['two', 1.03, 1.45], ['three', 1.79, 2.35], etc. i.e. the
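For reference, the response shape quoted in the question can be unpacked as below. This is a minimal sketch in Python that assumes the result has already been parsed into a dict with exactly the structure shown; the helper names `transcripts` and `spelled_out` are hypothetical. The quoted response contains only transcript strings, so the sketch makes no attempt to recover per-word timings.

```python
# Response shape copied from the example in the question, as a parsed dict.
response = {
    "alternative": [
        {"transcript": "12345"},
        {"transcript": "1 2 3 4 5"},
        {"transcript": "one two three four five"},
    ],
    "final": True,
}


def transcripts(result):
    """Return the transcript strings of one result dict, in the order given."""
    return [
        alt["transcript"]
        for alt in result.get("alternative", [])
        if "transcript" in alt
    ]


def spelled_out(result):
    """Return only the alternatives that contain no digits (hypothetical helper)."""
    return [t for t in transcripts(result) if not any(ch.isdigit() for ch in t)]


print(spelled_out(response))  # ['one two three four five']
```

Picking the digit-free alternative only works when the service happens to return one, as it does in this example.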

What are language codes in Chrome's implementation of the HTML5 speech recognition API?

白昼怎懂夜的黑 posted on 2019-12-03 01:39:58
Question: Chrome implements the HTML5 speech recognition API, and many languages are supported. I want to know which languages are supported and the corresponding code for each language, as used in the HTML element's lang attribute. For instance: Polish (pl-PL), Turkish (tr-TR). Thank you! Answer 1: OK, if it is not published, we can at least try to figure it out. Let me start with this table, and we will refine it if someone has more information. I'm making the assumption that supported languages shall be

How to implement speech-to-text via Speech framework [closed]

风格不统一 posted on 2019-12-03 01:02:57
(Closed: this question needs to be more focused.) I want to do speech recognition in my Objective-C app using the iOS Speech framework. I found some Swift examples but haven't been able to find anything in Objective-C. Is it possible to access this framework from Objective-C? If so, how? After spending a lot of time looking for Objective-C samples, even in the Apple documentation, I couldn't find anything decent, so I figured it out myself

Java voice recognition

跟風遠走 posted on 2019-12-02 21:00:56
Does anyone have experience with an open-source or relatively cheap voice recognition API for Java? I'm looking for something that will turn spoken words into text. Judging from the Java speech recognition page on Sun's site, that API seems to be rather dead. My requirement is something that runs on Linux at the very least. Can anyone recommend something? Pure Java would be a bonus; otherwise a Linux-based solution could be considered. And since this is a home project, the cheaper the better. Edit: CMU Sphinx. As Amit pointed out, CMU Sphinx http://cmusphinx.sourceforge.net/html