microsoft-cognitive | 易学教程

What audio formats are supported by Azure Cognitive Services' Speech Service (SST)?

阅读更多关于 What audio formats are supported by Azure Cognitive Services' Speech Service (SST)?

问题 Bearing in mind that the Microsoft/Azure Cognitive Services' "Speech Service" is currently going through a rationalisation exercise, as far as I can tell from looking at https://docs.microsoft.com/en-us/azure/cognitive-services/speech-service/rest-apis#speech-to-text https://docs.microsoft.com/en-us/azure/cognitive-services/speech/home only .wav binaries are acceptable, with anything else giving the response: {"Message":"Unsupported audio format"} Is there any other way to discover the

How to send a local image instead of URL to Microsoft Cognitive Face API using JAVA

阅读更多关于 How to send a local image instead of URL to Microsoft Cognitive Face API using JAVA

问题 Am trying to play with Face API of Microsoft Cognitive Services. Am wondering how to send a local image through rest API calls to Face API and request for the results from it using JAVA . Can anyone help me with this please? The Testing opting provided by Microsoft on their site only takes URL, I Tried to convert my local path to URL and give it as input but that doesn't work. 回答1: One option is to use the FileEntity class. // This sample uses the Apache HTTP client from HTTP Components (http

Using Project Oxford's Emotion API

阅读更多关于 Using Project Oxford's Emotion API

问题 I came across Project Oxford and became really interested in it and using its API, specifically the emotion one. Microsoft provides sample code ########### Python 2.7 ############# import httplib, urllib, base64 headers = { # Request headers 'Content-Type': 'application/json', 'Ocp-Apim-Subscription-Key': 'add key', } params = urllib.urlencode({ # Request parameters 'faceRectangles': '{string}', }) try: conn = httplib.HTTPSConnection('api.projectoxford.ai') conn.request("POST", "/emotion/v1.0

Could not load file or assembly Bond.IO

阅读更多关于 Could not load file or assembly Bond.IO

问题 Using Microsoft.Bing.Speech nuget package and Net Framework 4.6.1 I'm having this exception when calling RecognizeAsync() Could not load file or assembly 'Bond.IO, Version=1.0.0.0, Culture=neutral, PublicKeyToken=31bf3856ad364e35' or one of its dependencies. The located assembly's manifest definition does not match the assembly reference. (Exception from HRESULT: 0x80131040) My code: public static async Task SpeechToTextStreamPO(Stream audioStream, string textResult) { var subscriptionKey =

Can Microsoft Bing Speech be configured to return only numbers / letters?

阅读更多关于 Can Microsoft Bing Speech be configured to return only numbers / letters?

问题 Can the Microsoft Bing Speech API be configured to only return numbers and letters, as opposed to full words? The use case is translating Canadian postal codes. Ex. M 1 B 0 R 3. Microsoft may return "Em 1 Be 0 Are 3" Our audio file is 8000hz and encoded with "M-ULAW". We have no flexibility in changing the sample rate or encoding. We are using the "SMD" scenario, but I can't find any documentation on what this does. Base request URI: https://speech.platform.bing.com/recognize?scenarios=smd

How does navigation work with LUIS subdialogs?

阅读更多关于 How does navigation work with LUIS subdialogs?

问题 I have a question... Unfortunately all the samples on the web are too shallow and don't really cover this well: I have a RootDialog that extends the LuisDialog. This RootDialog is responsible for figuring out what the user wants to do. It could be multiple things, but one of them would be initiating a new order. For this, the RootDialog would forward the call to the NewOrderDialog, and the responsibility of the NewOrderDialog would be to figure out some basic details (what does the user want

Posting An Image from Webcam to Azure Face Api

阅读更多关于 Posting An Image from Webcam to Azure Face Api

问题 I am trying to upload an image that I get from my webcam to the Microsoft Azure Face Api. I get the image from canvas.toDataUrl(‘image/png’) which contains the Data Uri. I change the Content Type to application/octet-stream and when I attach the Data Uri to the post request, I get a Bad Request (400) Invalid Face Image. If I change the attached data to a Blob, I stop receiving errors however I only get back an empty array instead of a JSON object. I would really appreciate any help for

Microsoft Cognitive services - Speech customization testing processing seems freezed

阅读更多关于 Microsoft Cognitive services - Speech customization testing processing seems freezed

问题 I upload sucessfully data to speech customization (wav audio+ txt transcription) for just one audio in a zip file according to Microsoft docs: https://docs.microsoft.com/en-us/azure/cognitive-services/speech-service/how-to-custom-speech-test-data. When i click to add a test i choose data and it's taking an eternity to process results and never stops processing. My audio is in pt-BR model. Any idea? I cannot interrupt or deleting tests while it's processing 回答1: There is currently an issue in

requests to issue auth token from Microsoft Cognitive API return 500 with stack trace

阅读更多关于 requests to issue auth token from Microsoft Cognitive API return 500 with stack trace

问题 I am posting this as instructed by @AzureSupport on Twitter. All calls to the Cognitive auth API are returning 500 with a stack trace and this happens for all my subscription keys. This was working correctly and suddenly stopped. This happens from my application, from curl, and even from the Microsoft test form here. curl: curl --header 'Ocp-Apim-Subscription-Key: <my-key>' --data "" 'https://api.cognitive.microsoft.com/sts/v1.0/issueToken' The pertinent error in the stack trace appears to be

Is new ms botbuilder directline speech good fit for call center scenario?

阅读更多关于 Is new ms botbuilder directline speech good fit for call center scenario?

问题 MS recently introduced direct speech channel and some samples for web frontend to use it. But i was wondering is it a good fit for use in call center scenario using some SIP or services like twilio phone? If so i would like to see some docs how to use direct line speech api and wire it up to some telephony? I've already created github issue but it stay wo attention https://github.com/MicrosoftDocs/bot-docs/issues/1162 PS: also i have related problem, i can't find any docs on how to exachange