What is the difference between AWS Transcribe > Streaming Transcription feature and Kinesis Video Streams(For Audio Input) for live streaming audio

大城市里の小女人 提交于 2019-12-13 05:05:58

问题


Hi My requirement is I have live audio stream as input, say a call between 2 people, now to convert that audio to text on live and pick certain keywords from that extracted text and insert in Database.

As per architecture in https://github.com/aws-samples/amazon-connect-realtime-transcription Both AWS Kinesis Video Streams service and AWS Transcribe used for live streaming but as per link : https://aws.amazon.com/blogs/machine-learning/amazon-transcribe-now-supports-real-time-transcriptions/ AWS Transcribe supports live transcription then why in that architecture Kinesis used ?

If any one know, please help me in understanding, Hope Amazon connect can ingest live audio to AWS Transcribe for live transcription.


回答1:


Amazon Kinesis Video Streams is the service that enables streaming voice data from Amazon Connect. Amazon Transcribe can ingest streams from any source for real-time transcription, but the only way to get that real-time data from Amazon Connect is via Kinesis. The launch announcement for real-time streams might help make this more clear:

With the customer voice stream feature, your customer audio is automatically sent to Amazon Kinesis Video Streams, where it can be accessed by the integrations that you allow. For example, you could integrate customer voice stream with real-time text transcription and sentiment analysis for immediate feedback on call quality, or use this feature with a 3rd party voice biometric product to authenticate the caller automatically without having to enter a password or confirm personal information.



来源:https://stackoverflow.com/questions/55188353/what-is-the-difference-between-aws-transcribe-streaming-transcription-feature

易学教程内所有资源均来自网络或用户发布的内容,如有违反法律规定的内容欢迎反馈
该文章没有解决你所遇到的问题?点击提问,说说你的问题,让更多的人一起探讨吧!