group by and aggregate multiple columns + pyspark code example
Example: Pyspark Aggregation on multiple columns
df.groupBy("year", "sex").agg(avg("percent"), count("*"))
df.groupBy("year", "sex").agg(avg("percent"), count("*"))