convert from string to pandas dataframe

。_饼干妹妹 提交于 2019-12-11 06:11:55

问题


I'm importing a csv file from AWS S3 in AWS Lambdawith below code:

file = s3.get_object(Bucket = bucket, Key = key)
rows = file['Body'].read().decode('utf-8').splitlines(False)

I'm getting input in below format :

data = "a,b,c,d,\"x,y\",e,f"

and I want output in below format:

>>>`>>> df
   0  1  2  3    4  5  6
0  a  b  c  d  x,y  e  f`

i have to split data based on ',' but if some thong is between " " they should remain as it is.

Or if you have any other solution for import csv file from s3 to lambda and converting in Data Frame, Please suggest


回答1:


use csv module

try this,

from csv import reader
import pandas as pd
data=["a,b,c,d,\"x,y\",e,f"]
df=pd.DataFrame( list(reader(data)))
print df

Output:

   0  1  2  3    4  5  6
0  a  b  c  d  x,y  e  f


来源:https://stackoverflow.com/questions/53278358/convert-from-string-to-pandas-dataframe

易学教程内所有资源均来自网络或用户发布的内容,如有违反法律规定的内容欢迎反馈
该文章没有解决你所遇到的问题?点击提问,说说你的问题,让更多的人一起探讨吧!