Biopython SeqIO to Pandas Dataframe
I have a FASTA file that can easily be parsed by SeqIO.parse . I am interested in extracting sequence ID's and sequence lengths. I used these lines to do it, but I feel it's waaaay too heavy (two iterations, conversions, etc.) from Bio import SeqIO import pandas as pd # parse sequence fasta file identifiers = [seq_record.id for seq_record in SeqIO.parse("sequence.fasta", "fasta")] lengths = [len(seq_record.seq) for seq_record in SeqIO.parse("sequence.fasta", "fasta")] #converting lists to pandas Series s1 = Series(identifiers, name='ID') s2 = Series(lengths, name='length') #Gathering Series