Is there any way to get word timings (a timeline) for audio generated by Google Text-to-Speech (SSML or plain text)?

佛祖请我去吃肉 2020-12-17 10:17

After creating audio with Google Text-to-Speech, I need to get the time of each word so I can align a translation with the audio output. I can see that it is done by the other
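One possible approach, sketched here under assumptions rather than confirmed by the post: to my knowledge the v1beta1 Cloud Text-to-Speech API can return timepoints for SSML `<mark>` tags, so placing a mark before each word would yield per-word start times. This applies to SSML input only, not plain text. The helper names `build_marked_ssml` and `align_words` and the `w0`, `w1`, ... mark naming are hypothetical, chosen for illustration. The API call itself is shown only in comments and should be checked against the current client library.

```python
from xml.sax.saxutils import escape


def build_marked_ssml(text):
    """Return (ssml, words): SSML with a <mark> tag before each word.

    Mark names "w0", "w1", ... are an illustrative convention, not an API rule.
    """
    words = text.split()
    parts = [f'<mark name="w{i}"/>{escape(word)}' for i, word in enumerate(words)]
    return "<speak>" + " ".join(parts) + "</speak>", words


def align_words(words, timepoints):
    """Pair each word with its start time in seconds.

    `timepoints` is an iterable of (mark_name, time_seconds) pairs.
    A word whose mark is missing from the response gets None.
    """
    starts = dict(timepoints)
    return [(word, starts.get(f"w{i}")) for i, word in enumerate(words)]


# Assumed usage with the google-cloud-texttospeech package (v1beta1), not run here:
#
#   from google.cloud import texttospeech_v1beta1 as tts
#   ssml, words = build_marked_ssml("hello big world")
#   request = tts.SynthesizeSpeechRequest(
#       input=tts.SynthesisInput(ssml=ssml),
#       voice=tts.VoiceSelectionParams(language_code="en-US"),
#       audio_config=tts.AudioConfig(audio_encoding=tts.AudioEncoding.MP3),
#       enable_time_pointing=[tts.SynthesizeSpeechRequest.TimepointType.SSML_MARK],
#   )
#   response = tts.TextToSpeechClient().synthesize_speech(request=request)
#   pairs = [(tp.mark_name, tp.time_seconds) for tp in response.timepoints]
#   timeline = align_words(words, pairs)
```

Each returned pair gives a word and the offset at which it starts in the audio, which could then be matched against the translated text for alignment.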
