Correct syntax for mutate_if
I learned this trick from the purrr tutorial, and it also works in dplyr.
There are two ways to solve this problem:
First, define custom functions outside the pipe, and use it in mutate_if()
:
any_column_NA <- function(x){
any(is.na(x))
}
replace_NA_0 <- function(x){
if_else(is.na(x),0,x)
}
mtcars %>% mutate_if(any_column_NA,replace_NA_0)
Second, use the combination of ~
,.
or .x
.( .x
can be replaced with .
, but not any other character or symbol):
mtcars %>% mutate_if(~ any(is.na(.x)),~ if_else(is.na(.x),0,.x))
#This also works
mtcars %>% mutate_if(~ any(is.na(.)),~ if_else(is.na(.),0,.))
In your case, you can also use mutate_all()
:
mtcars %>% mutate_all(~ if_else(is.na(.x),0,.x))
Using ~
, we can define an anonymous function, while .x
or .
stands for the variable. In mutate_if()
case, .
or .x
is each column.
The "if" in mutate_if
refers to choosing columns, not rows. Eg mutate_if(data, is.numeric, ...)
means to carry out a transformation on all numeric columns in your dataset.
If you want to replace all NAs with zeros in numeric columns:
data %>% mutate_if(is.numeric, funs(ifelse(is.na(.), 0, .)))