Read XML file to Pandas DataFrame

后端 未结 3 828
臣服心动
臣服心动 2020-12-11 20:30

Can someone please help convert the following XML file to Pandas dataframe:

3条回答
  •  庸人自扰
    2020-12-11 21:05

    Hello all I found another really easily way to solve those question. reference: https://www.youtube.com/watch?v=WVrg5-cjr5k

    import xml.etree.ElementTree as ET
    import pandas as pd
    import codecs
    
    ## open notebook and save your xml file to text.xml 
    with codecs.open('text.xml', 'r', encoding='utf8') as f:
        tt = f.read()
    
    
    def xml2df(xml_data):
        root = ET.XML(xml_data)
        all_records = []
        for i, child in enumerate(root):
            record = {}
            for sub_child in child:
                record[sub_child.tag] = sub_child.text
            all_records.append(record)
        return pd.DataFrame(all_records)
    
    
    df_xml1 = xml2df(tt)
    print(df_xml1)
    

    for better understanding of ET you can use underneath code to see what in side of your xml

    import xml.etree.ElementTree as ET
    import pandas as pd
    import codecs
    with codecs.open('text.xml', 'r', encoding='utf8') as f:
        tt = f.read()
    
    root = ET.XML(tt)
    
    print(type(root))
    print(root[0])
    for ele in root[0]:
        print(ele.tag + '////' + ele.text)
    
    print(root[0][0].tag)
    

    Once you finish running the program you can see the output underneath:

    C:\Users\username\Documents\pycode\Scripts\python.exe C:/Users/username/PycharmProjects/DestinationLight/try.py
          n35237      n32238     n44699
    0        1.0         3.0        nan
    1  7020000.0  10000000.0  4128000.0
    2    35237.0     32238.0    44699.0
    
    
    
    n35237////1.0
    n32238////3.0
    n44699////nan
    n35237
    
    Process finished with exit code 0
    

提交回复
热议问题