basic groupby operations in Dask
问题 I am attempting to use Dask to handle a large file (50 gb). Typically, I would load it in memory and use Pandas. I want to groupby two columns "A", and "B", and whenever column "C" starts with a value, I want to repeat that value in that column for that particular group. In pandas, I would do the following: df['C'] = df.groupby(['A','B'])['C'].fillna(method = 'ffill') What would be the equivalent in Dask? Also, I am a little bit lost as to how to structure problems in Dask as opposed to in