A dataset based on cat_data where some values are randomly removed to simulate MCAR.
A data.frame with 1000 rows and 5 variables.
- X1
Categorical, 100 values set to NA (MCAR).
- X2
Categorical, 50 values set to NA (MCAR).
- X3
Categorical, 200 values set to NA (MCAR).
- Z
Categorical, no missing values.
- Y
Categorical, no missing values.
Details
The R code used to generate this dataset is as follows:
data(cat_data)
cat_data_mcar <- cat_data
n <- nrow(cat_data_mcar)
set.seed(1405)
cat_data_mcar$X1[sample(1:n, 100)] <- NA
cat_data_mcar$X2[sample(1:n, 50)] <- NA
cat_data_mcar$X3[sample(1:n, 200)] <- NA
Examples
data(cat_data_mcar)
head(cat_data_mcar)
#> X1 X2 X3 Z Y
#> 1 c b b c b
#> 2 d b a c c
#> 3 d d e b d
#> 4 e b b d d
#> 5 d c c c c
#> 6 d c c b d