Pandas: how to convert a cell with multiple values to multiple rows?

感情迁移 submitted on 2019-12-03 04:33:25

Carrying on from the same idea, you could set a MultiIndex for df2 and then stack. For example:

>>> df2 = df.asn.str.split(',').apply(pd.Series)
>>> df2.index = df.set_index(['Name', 'count']).index
>>> df2.stack().reset_index(['Name', 'count'])
   Name  count     0
0  Org1      1  asn1
1  Org1      1  asn2
0  org2      2  asn3
0  org3      5  asn4
1  org3      5  asn5

You can then rename the column and set an index of your choosing.
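The whole approach, including the final rename, can be sketched end to end as follows. This is a minimal sketch under a few assumptions: the sample frame is rebuilt by hand from the data shown on this page, `str.split(..., expand=True)` is swapped in for `.apply(pd.Series)` (it produces the same wide layout), and `dropna()` is called explicitly in case the installed pandas version keeps the padding values when stacking.

```python
import pandas as pd

# Sample data rebuilt by hand from the example on this page (an assumption).
df = pd.DataFrame({
    "Name": ["Org1", "org2", "org3"],
    "asn": ["asn1,asn2", "asn3", "asn4,asn5"],
    "count": [1, 2, 5],
})

# One column per comma-separated value; shorter rows are padded with missing values.
df2 = df["asn"].str.split(",", expand=True)

# Carry Name and count along as a MultiIndex so they survive the stack.
df2.index = df.set_index(["Name", "count"]).index

# Wide to long, then drop any padding that is still present.
stacked = df2.stack().dropna()

out = (
    stacked.reset_index(["Name", "count"])  # index levels back to columns
    .rename(columns={0: "asn"})             # the unnamed value column is called 0
    .reset_index(drop=True)                 # plain 0..n-1 index
)

print(out)
```

Replacing the last `reset_index(drop=True)` with `set_index("Name")` (or any other column) gives an index of your choosing instead.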

As an alternative:

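import pandas as pd
from io import StringIO  # Python 3 location; the Python 2 StringIO module is gone

ctn = '''Name asn count
Org1 asn1,asn2 1
org2 asn3      2
org3 asn4,asn5 5'''

# Columns are separated by one or more whitespace characters.
df = pd.read_csv(StringIO(ctn), sep=r'\s+')
# Split each cell on commas, spread the pieces across columns, then stack them into rows.
# dropna() removes the NaN padding from rows with fewer values, in case stack() keeps it.
s = df['asn'].str.split(',').apply(pd.Series).stack().dropna()
# Drop the inner level added by stack() so the index lines up with df again.
s.index = s.index.droplevel(-1)
s.name = 'asn'
del df['asn']
df = df.join(s)

print(df)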

Result:

   Name  count   asn
0  Org1      1  asn1
0  Org1      1  asn2
1  org2      2  asn3
2  org3      5  asn4
2  org3      5  asn5
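
On pandas 0.25 or later, the same reshaping can also be done with `DataFrame.explode`. This is a different technique from the two answers above, shown here only as a sketch; the sample frame is again rebuilt by hand from the data on this page.

```python
import pandas as pd

# Sample data rebuilt by hand from the example on this page (an assumption).
df = pd.DataFrame({
    "Name": ["Org1", "org2", "org3"],
    "asn": ["asn1,asn2", "asn3", "asn4,asn5"],
    "count": [1, 2, 5],
})

# Turn each cell into a list, then emit one row per list element.
# The original row labels are repeated, as in the result shown above.
out = df.assign(asn=df["asn"].str.split(",")).explode("asn")

print(out)
```

Unlike the join-based version, the columns keep their original order (`Name`, `asn`, `count`); append `.reset_index(drop=True)` if the repeated row labels are not wanted.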