Given a dataframe with different categorical variables, how do I return a cross-tabulation with percentages instead of frequencies?
df = pd.DataFrame({\'A\' : [\
If you're looking for a percentage of the total, you can divide by the len of the df instead of the row sum:
pd.crosstab(df.A, df.B).apply(lambda r: r/len(df), axis=1)