Can you help me understand the difference between the statements mentioned below? Given that Survived
column contains binary data (0,1), they give different ans
sum() is for like 1+0 = 1. if data is 3 and 3 then it return 6.
count() return number of row. so it will return 2.
simple :)
count()
It's a string function, which count the total number of individual category. And that's the reason we put this in quotes while using it in aggregate function.
Ex
grouped = order_products_all.groupby("product_id")["reordered"].aggregate(
{'reorder_sum': sum,'reorder_total': 'count'}
).reset_index()
sum()
It's a integer function, which sum the number of unique category.
count()
will just count number of rows.
sum()
will sum the 1's and 0's.