I have a recognizer and used its listen function to get the Float32Array from SpeechCommandRecognizerResult.spectrogram.data, then concatenat
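For reference, here is a minimal sketch of how the per-callback spectrogram arrays could be joined into a single Float32Array. The helper name concatFloat32 and the chunks array are hypothetical and not part of the speech-commands API. The commented listen call assumes the recognizer was created with speechCommands.create('BROWSER_FFT') and that includeSpectrogram is enabled. It may not match the exact setup in question.

```javascript
// Hypothetical helper (not part of the speech-commands API): joins several
// Float32Array chunks into one contiguous Float32Array, preserving order.
function concatFloat32(chunks) {
  // Total number of samples across all chunks.
  const total = chunks.reduce((sum, c) => sum + c.length, 0);
  const out = new Float32Array(total);
  let offset = 0;
  for (const chunk of chunks) {
    out.set(chunk, offset); // copy this chunk at the current write position
    offset += chunk.length;
  }
  return out;
}

// Sketch of the collection side (browser only, shown as comments because it
// needs a microphone and a loaded model):
//
//   const chunks = [];
//   await recognizer.listen(async (result) => {
//     // Copy the data defensively, assuming the buffer might be reused.
//     chunks.push(Float32Array.from(result.spectrogram.data));
//   }, { includeSpectrogram: true, probabilityThreshold: 0.75 });
//
//   const all = concatFloat32(chunks);
```

Each result.spectrogram also carries a frameSize, so the joined array can be read back as consecutive frames of that length if every chunk was captured with the same settings.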