Skip to contents

Usage

outlierdetection(
  data,
  vars,
  method_category = "composite",
  univariate_methods = "zscore_robust",
  multivariate_methods = "mahalanobis",
  composite_threshold = 0.5,
  zscore_threshold = 3.29,
  iqr_multiplier = 1.7,
  confidence_level = 0.999,
  show_outlier_table = TRUE,
  show_method_comparison = TRUE,
  show_exclusion_summary = TRUE,
  show_visualization = TRUE,
  show_interpretation = TRUE
)

Arguments

data

The data as a data frame.

vars

Continuous variables to analyze for outliers. The module will detect outliers based on the selected variables using the chosen detection methods.

method_category

Category of outlier detection methods to use. CLINICAL GUIDANCE: Univariate methods analyze each variable separately (ideal for lab values, vital signs). Multivariate methods consider relationships between variables (useful for correlated biomarkers). Composite combines multiple approaches for robust detection (recommended for most clinical data). Examples: Use univariate for hemoglobin levels, multivariate for complete blood count panels.

univariate_methods

Specific univariate method for outlier detection when univariate category is selected.

multivariate_methods

Specific multivariate method for outlier detection when multivariate category is selected.

composite_threshold

Threshold for composite outlier score (0.1-1.0). Default 0.5 means observations classified as outliers by at least half of the methods are considered outliers.

zscore_threshold

Threshold for Z-score based methods. CLINICAL EXAMPLES: 3.0 = 99.7\ 3.29 = 99.9\ enzymes), 2.5 = 98.8\ Recommended: 3.29 for clinical quality control.

iqr_multiplier

Multiplier for IQR-based outlier detection. CLINICAL EXAMPLES: 1.5 = Tukey's standard (sensitive, may flag ~0.7\ data), 1.7 = conservative (recommended for clinical screening), 2.0 = very conservative (for critical biomarkers). Useful for non-normal distributions common in clinical data.

confidence_level

Confidence level for interval-based methods (ETI, HDI). Default 99.9\

show_outlier_tableDisplay a comprehensive table of outlier detection results including outlier scores, distances, and classification for each observation.

show_method_comparisonCompare results across different detection methods when using composite approach.

show_exclusion_summaryProvide summary of observations recommended for exclusion and impact analysis.

show_visualizationGenerate plots showing outlier detection results and distribution of outlier scores.

show_interpretationDisplay detailed interpretation of outlier detection results and methodological notes.

A results object containing:

results$todoa html
results$plotan image
results$outlier_tablea html
results$method_comparisona html
results$exclusion_summarya html
results$interpretationa html
Advanced outlier detection using multiple statistical methods from the easystats performance package. This module provides comprehensive outlier detection through univariate methods (Z-scores, IQR, confidence intervals), multivariate methods (Mahalanobis distance, MCD, OPTICS, LOF), and composite scoring across multiple algorithms. Complements existing data quality assessment modules with state-of-the-art outlier detection capabilities. Perfect for clinical research data quality control and preprocessing.