how to use agg with multiple columns in pyspark code example Example: Pyspark Aggregation on multiple columns df.groupBy("year", "sex").agg(avg("percent"), count("*"))