In Africa we have hundreds of languages I would like to build a speech recognition engine to convert audio to text in some of the languages. Considering google STT does not