Large dataset with 500 patients and multiple biomarkers for testing computational efficiency and performance with substantial sample sizes.
Format
A data frame with 500 rows and 8 variables:
- patient_id
Character: Patient identifier (PT0001-PT0500)
- disease_status
Factor: "Disease" or "No_Disease" (30%/70% prevalence)
- biomarker1
Numeric: First biomarker (mean: 75 for disease, 50 for no disease)
- biomarker2
Numeric: Second biomarker (mean: 68 for disease, 48 for no disease)
- age
Numeric: Patient age in years (mean 62, SD 13)
- sex
Factor: "Male" or "Female"
- site
Factor: Research site (Site_1 through Site_10)
- risk_category
Factor: "Low", "Intermediate", or "High"
Details
Large sample (n=500) with multiple biomarkers and stratification variables. Tests computational efficiency and stability of estimates with adequate sample sizes. Includes multi-site and risk stratification for subgroup analysis.
Examples
data(psychopdaROC_large)
psychopdaROC(data = psychopdaROC_large,
dependentVars = c("biomarker1", "biomarker2"),
classVar = "disease_status", positiveClass = "Disease")
#> Error in psychopdaROC(data = psychopdaROC_large, dependentVars = c("biomarker1", "biomarker2"), classVar = "disease_status", positiveClass = "Disease"): argument "refVar" is missing, with no default