I have DataFrame called data with 477154 rows.
PDB_ID Chain Sequence Secstr 0 101M A GEWQLVLHVWAKVEA | HH