I have a dataset in the following format, it\'s an array of arrays called filtered_data
["docmetaID", "pathwaySeqID", "srcstrID"