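PySpark: get groupBy keys (code example)

Example: PySpark aggregation on multiple columns

from pyspark.sql.functions import avg, count

# Group rows by year and sex, then compute the mean of "percent" and the row count for each group
df.groupBy("year", "sex").agg(avg("percent"), count("*"))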