Return a new RDD containing the distinct elements in this RDD. code example
Example: Return a new RDD containing the distinct elements in this RDD.
sorted(sc.parallelize([1, 1, 2, 3]).distinct().collect())
# [1, 2, 3]
sorted(sc.parallelize([1, 1, 2, 3]).distinct().collect())
# [1, 2, 3]