pandas 0.10.0 documentation

pandas.DataFrame.groupby

DataFrame.groupby(by=None, axis=0, level=None, as_index=True, sort=True, group_keys=True)

Group series using mapper (dict or key function, apply given function to group, return result as series) or by a series of columns

Parameters :

by : mapping function / list of functions, dict, Series, or tuple /

list of column names. Called on each element of the object index to determine the groups. If a dict or Series is passed, the Series or dict VALUES will be used to determine the groups

axis : int, default 0

level : int, level name, or sequence of such, default None

If the axis is a MultiIndex (hierarchical), group by a particular level or levels

as_index : boolean, default True

For aggregated output, return object with group labels as the index. Only relevant for DataFrame input. as_index=False is effectively “SQL-style” grouped output

sort : boolean, default True

Sort group keys. Get better performance by turning this off

group_keys : boolean, default True

When calling apply, add group keys to index to identify pieces

Returns :

GroupBy object :

Examples

# DataFrame result >>> data.groupby(func, axis=0).mean()

# DataFrame result >>> data.groupby([‘col1’, ‘col2’])[‘col3’].mean()

# DataFrame with hierarchical index >>> data.groupby([‘col1’, ‘col2’]).mean()