Consider this simple example
import pandas as pd
df = pd.DataFrame({\'one\' : [1,2,3],
\'two\' : [1,0,0]})
df
Out[9]:
one two
0
I think you need []
for select column by column name what is general solution for selecting columns, because select by attributes have many exceptions:
- You can use this access only if the index element is a valid python identifier, e.g. s.1 is not allowed. See here for an explanation of valid identifiers.
- The attribute will not be available if it conflicts with an existing method name, e.g. s.min is not allowed.
- Similarly, the attribute will not be available if it conflicts with any of the following list: index, major_axis, minor_axis, items, labels.
- In any of these cases, standard indexing will still work, e.g. s['1'], s['min'], and s['index'] will access the corresponding element or column.
def megabug(df,mycol):
return df.groupby('one')[mycol].sum()
print (megabug(df, 'two'))
one
1 1
2 0
3 0
Name: two, dtype: int64