R equivalent of SELECT DISTINCT on two or more fields/variables
unique
works on data.frame
so unique(df[c("var1","var2")])
should be what you want.
Another option is distinct
from dplyr
package:
df %>% distinct(var1, var2) # or distinct(df, var1, var2)
Note:
For older versions of dplyr (< 0.5.0, 2016-06-24) distinct
required additional step
df %>% select(var1, var2) %>% distinct
(or oldish way distinct(select(df, var1, var2))
).
@Marek's answer is obviously correct, but may be outdated. The current dplyr
version (0.7.4) allows for an even simpler code:
Simply use:
df %>% distinct(var1, var2)
If you want to keep all columns, add
df %>% distinct(var1, var2, .keep_all = TRUE)