Dataset with 300 patients and very low disease prevalence (5%), representing rare disease screening scenarios where prevalence affects predictive values.
Format
A data frame with 300 rows and 3 variables:
- patient_id
Character: Patient identifier (PT001-PT300)
- rare_disease
Factor: "Disease" or "No_Disease" (5%/95% prevalence)
- biomarker
Numeric: Biomarker value (mean: 80 for disease, 45 for no disease)
Details
Low prevalence (5%) with good biomarker discrimination. Demonstrates impact of prevalence on positive and negative predictive values even with good sensitivity and specificity.
Examples
data(psychopdaROC_rare)
psychopdaROC(data = psychopdaROC_rare, dependentVars = "biomarker",
classVar = "rare_disease", positiveClass = "Disease")
#> Error in psychopdaROC(data = psychopdaROC_rare, dependentVars = "biomarker", classVar = "rare_disease", positiveClass = "Disease"): argument "refVar" is missing, with no default