So I have a dataframe (or series) where there are always 4 occurrences of each value in column 'A', like this:
df = pd.DataFrame([['foo'],
[
Or you can just repeat each value of column A the remaining number of times, i.e. its count in df minus its count in df_key, and append those rows to df_key.
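# remaining rows needed per key: count in df minus count in df_key
s = df.A.value_counts() - df_key.A.value_counts()
# repeat each key s times, append to df_key (B and C become NaN), then sort by A
pd.concat([df_key, pd.DataFrame({'A': s.index.repeat(s)})]).sort_values('A')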
Out[469]:
     A    B    C
2  bar  5.0  9.0
3  bar  2.0  4.0
4  bar  1.0  9.0
0  bar  NaN  NaN
0  foo  1.0  2.0
1  foo  3.0  4.0
1  foo  NaN  NaN
2  foo  NaN  NaN
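For anyone who wants to run this end to end, here is a self-contained sketch of the same idea. The contents of df and df_key are assumptions reconstructed from the output above (the original df definition is cut off), and the names missing, filler and result are only illustrative. It also guards two cases the short version does not handle: a key that is absent from df_key (the plain subtraction would give NaN) and a key that already has more rows in df_key than in df.

```python
import pandas as pd

# Assumed inputs, reconstructed from the output shown above.
# df: every key occurs exactly 4 times.
df = pd.DataFrame({'A': ['foo'] * 4 + ['bar'] * 4})
# df_key: the rows that already carry B and C values.
df_key = pd.DataFrame({
    'A': ['foo', 'foo', 'bar', 'bar', 'bar'],
    'B': [1, 3, 5, 2, 1],
    'C': [2, 4, 9, 4, 9],
})

# Rows still missing per key: occurrences in df minus occurrences in df_key.
# fill_value=0 treats a key missing from one side as having zero rows there;
# clip(lower=0) avoids negative repeat counts.
missing = (
    df['A'].value_counts()
    .sub(df_key['A'].value_counts(), fill_value=0)
    .clip(lower=0)
    .astype(int)
)

# One filler row per missing occurrence; only column A is populated.
filler = pd.DataFrame({'A': missing.index.repeat(missing.to_numpy())})

# Append the filler rows (B and C are NaN for them), group keys together,
# and renumber the index so it has no duplicates.
result = (
    pd.concat([df_key, filler], ignore_index=True)
    .sort_values('A', kind='stable')
    .reset_index(drop=True)
)

print(result)
```

Using a stable sort keeps the rows that already have values ahead of the NaN filler rows within each key, and resetting the index avoids the repeated labels (0, 1, 2) visible in the output above. Drop those two steps if you want the original index preserved.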