groupby

Tue Jan 04 2022 15:54:50 GMT+0000 (Coordinated Universal Time)

Saved by @ahoeweler

df.groupby(['col1'])['col2'].mean()  # or .count(), etc.

# several statistics at once, grouping by more than one column
df.groupby(['col1', 'col2'])[['col3', 'col4']].agg(['min', 'max', 'sum'])

# choose the aggregation per column with a dict (column -> function)
df.groupby('col1').agg({'col2': 'count'})

# use filter to keep only rows from groups with more than one row
df.groupby('col1').filter(lambda x: len(x) > 1)
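To try the calls above end to end, here is a minimal sketch on a small made-up DataFrame. The data values are hypothetical; only the column names `col1` and `col2` come from the snippet. The last line, `.size()`, is an addition not in the original and is one way to actually count rows per group.

```python
import pandas as pd

# Hypothetical sample data; column names mirror the snippet above.
df = pd.DataFrame({
    'col1': ['a', 'a', 'b', 'c', 'c', 'c'],
    'col2': [1, 3, 5, 2, 4, 6],
})

# Mean of col2 within each col1 group.
means = df.groupby('col1')['col2'].mean()

# Several statistics at once for one column.
stats = df.groupby('col1')['col2'].agg(['min', 'max', 'sum'])

# Per-column aggregation chosen with a dict.
counts = df.groupby('col1').agg({'col2': 'count'})

# Keep only rows whose group has more than one row (drops group 'b').
multi = df.groupby('col1').filter(lambda g: len(g) > 1)

# Row count per group (assumed alternative for counting rows).
sizes = df.groupby('col1').size()

print(means, stats, counts, multi, sizes, sep='\n\n')
```

Note that `filter` returns the original rows, not one row per group, so its result has the same columns as `df`.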