GOAL: Capture audio output from a browser tab and transcribe it in (almost) real time (i.e., without making frequent API calls and probably using dire