R-miss-tastic

A resource website on missing values - Methods and references for managing missing data

Package:

naniar

Category:

Data Structures, Summaries, and Visualisations for Missing Data

Use-Cases:

Visualization of missing values, descriptive statistics, …

Popularity:

CRAN Downloads

Description:

Missing values are ubiquitous in data and need to be carefully explored and handled in the initial stages of analysis. In this vignette we describe the tools in the package naniar for exploring missing data structures with minimal deviation from the common workflows of ggplot and tidy data.

Last update:

CRAN Release

Datasets:
  • oceanbuoys
  • pedestrian
  • riskfactors
Further Information:
  • Tierney, N. J., & Cook, D. H. (2018). Expanding tidy data principles to facilitate missing data exploration, visualization and assessment of imputations. arXiv preprint arXiv:1809.02264. PDF (on arXiv)
  • Vignettes
  • Related visdat R-package
Input:

data.frame, vector

Example:
library(naniar)

data(airquality)

print("print data set with NAs")
print(head(airquality))

## Replace "NA" values with values 10% lower 
## than the minimum value in that variable.
## This is done by calling the geom_miss_point() function
ggplot2::ggplot(airquality, 
       			ggplot2::aes(x = Solar.R, 
           		y = Ozone)) + 
  geom_miss_point()

Here you can have a interactive look at the example:

https://rdrr.io/snippets/embedding/


Share