Pandas and Python Dataframes and Conditional Shift Function

邮差的信 提交于 2019-12-04 17:49:21

You can use groupby and shift():

import io
import pandas as pd

text = """SaleDate    Car
12/1/2016   Wrangler
12/2/2016   Camry
12/3/2016   Wrangler
12/7/2016   Prius
12/10/2016  Prius
12/12/2016  Wrangler"""

df = pd.read_csv(io.StringIO(text), delim_whitespace=True, parse_dates=[0])
df["lastSaleDate"] = df.groupby("Car").SaleDate.shift()

the output:

    SaleDate       Car lastSaleDate
0 2016-12-01  Wrangler          NaT
1 2016-12-02     Camry          NaT
2 2016-12-03  Wrangler   2016-12-01
3 2016-12-07     Prius          NaT
4 2016-12-10     Prius   2016-12-07
5 2016-12-12  Wrangler   2016-12-03

I'm basically copying HYRY's answer and modifying it slightly. If you like this solution. Choose HYRY's answer as your answer.

from StringIO import StringIO  # this is what I needed to do
import pandas as pd

text = """SaleDate    Car
12/1/2016   Wrangler
12/2/2016   Camry
12/3/2016   Wrangler
12/7/2016   Prius
12/10/2016  Prius
12/12/2016  Wrangler"""

df = pd.read_csv(StringIO(text), delim_whitespace=True, parse_dates=[0])

# what you already did
df['PriorSaleDate'] = df['SaleDate'].shift()

# what HYRY did
df["CarSpecificPriorSaleDate"] = df.groupby("Car").SaleDate.shift()

Looks like

Out[34]:
    SaleDate       Car PriorSaleDate CarSpecificPriorSaleDate
0 2016-12-01  Wrangler           NaT                      NaT
1 2016-12-02     Camry    2016-12-01                      NaT
2 2016-12-03  Wrangler    2016-12-02               2016-12-01
3 2016-12-07     Prius    2016-12-03                      NaT
4 2016-12-10     Prius    2016-12-07               2016-12-07
5 2016-12-12  Wrangler    2016-12-10               2016-12-03
易学教程内所有资源均来自网络或用户发布的内容,如有违反法律规定的内容欢迎反馈
该文章没有解决你所遇到的问题?点击提问,说说你的问题,让更多的人一起探讨吧!