multi-column factorize in pandas

长发绾君心 2020-12-28 09:35

The pandas factorize function assigns each unique value in a Series a sequential, 0-based index and returns, for every entry, the index of the value it holds.
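
For reference, the single-column behaviour described above can be sketched as follows. The sample values are made up purely for illustration and are not from the question.

```python
import pandas as pd

# Hypothetical sample data, chosen only to show the numbering
s = pd.Series(["a", "b", "a", "c", "b"])

# factorize returns a pair: integer codes per entry, and the unique values
codes, uniques = pd.factorize(s)

print(codes.tolist())   # [0, 1, 0, 2, 1]
print(list(uniques))    # ['a', 'b', 'c']
```

Codes are handed out in order of first appearance, so "a" gets 0, "b" gets 1 and "c" gets 2.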

4 Answers
  •  时光取名叫无心
    2020-12-28 09:46

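    import pandas as pd
    df = pd.DataFrame({'x': [1, 1, 2, 2, 1, 1], 'y': [1, 2, 2, 2, 2, 1]})
    # Collapse each (x, y) row into a tuple so every row becomes one hashable value
    tuples = df[['x', 'y']].apply(tuple, axis=1)
    # factorize returns (codes, uniques); the codes are the per-row IDs
    df['newID'] = pd.factorize(tuples)[0]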
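
The tuple-based answer above builds one Python tuple per row, which can be slow on large frames. A possible alternative, offered here as a sketch and not as part of the original answer, is to group on both columns and number the groups with ngroup(). It assumes that sort=False makes the groups get numbered in order of first appearance, which matches what factorize does. The column name newID_alt is made up for this example.

```python
import pandas as pd

df = pd.DataFrame({'x': [1, 1, 2, 2, 1, 1], 'y': [1, 2, 2, 2, 2, 1]})

# Reference result: the tuple + factorize approach from the answer
tuples = df[['x', 'y']].apply(tuple, axis=1)
df['newID'] = pd.factorize(tuples)[0]

# Alternative sketch: number each (x, y) group without building tuples.
# sort=False is assumed to keep groups in order of first appearance.
df['newID_alt'] = df.groupby(['x', 'y'], sort=False).ngroup()

print(df['newID'].tolist())      # [0, 1, 2, 2, 1, 0]
print(df['newID_alt'].tolist())
```

If the two columns ever disagree on your pandas version, the tuple-based newID is the one to trust, since it calls factorize directly.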
    
