
Generated cancer screening dataset of 250 patients with two biomarkers (PSA and CA125), intended for evaluating screening test performance at low disease prevalence (15%).

Usage

psychopdaROC_screening

Format

A data frame with 250 rows and 6 variables:

patient_id

Character: Patient identifier (PT001-PT250)

cancer

Factor: "Cancer" (15% of patients) or "No_Cancer" (85%)

psa_level

Numeric: PSA level (ng/mL), log-normal distribution

ca125

Numeric: CA125 level (U/mL), higher in cancer cases

age

Numeric: Patient age in years (mean 65, SD 10)

risk_factors

Factor: "None", "Family_History", or "Multiple"

Source

Generated test data for the ClinicoPath package

Details

Designed for evaluating screening test characteristics in settings where high sensitivity is prioritized. PSA levels are log-normally distributed (median: 12 ng/mL in cancer cases, 4 ng/mL in non-cancer cases). CA125 is normally distributed, with higher values in cancer cases (mean: 65 U/mL vs 25 U/mL).
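To make the distributional description concrete, the following base R sketch simulates data of a similar shape and picks a sensitivity-oriented PSA cutoff. It is an illustration only, not the code used to generate the packaged dataset: the seed, the `sdlog` of 0.5, the CA125 standard deviation of 10, the 90% sensitivity target, and the names `sim` and `sens_spec` are all assumptions introduced here.

```r
# Illustrative simulation only; NOT the generator used by ClinicoPath.
set.seed(42)  # assumed seed, for reproducibility

n <- 250
is_cancer <- runif(n) < 0.15  # roughly 15% prevalence

sim <- data.frame(
  cancer = factor(ifelse(is_cancer, "Cancer", "No_Cancer"),
                  levels = c("No_Cancer", "Cancer")),
  # The median of a log-normal is exp(meanlog), so meanlog = log(median).
  # sdlog = 0.5 is an assumed spread.
  psa_level = rlnorm(n, meanlog = log(ifelse(is_cancer, 12, 4)), sdlog = 0.5),
  # sd = 10 is an assumed spread.
  ca125 = rnorm(n, mean = ifelse(is_cancer, 65, 25), sd = 10)
)

# Sensitivity and specificity of the rule "marker >= cutoff means test positive".
sens_spec <- function(marker, truth, cutoff) {
  test_pos <- marker >= cutoff
  c(sensitivity = mean(test_pos[truth == "Cancer"]),
    specificity = mean(!test_pos[truth == "No_Cancer"]))
}

# Screening favours sensitivity: take the highest PSA cutoff that still
# detects at least 90% of the simulated cancers (90% is an arbitrary target).
cutoffs <- sort(unique(sim$psa_level))
sens <- vapply(
  cutoffs,
  function(k) sens_spec(sim$psa_level, sim$cancer, k)[["sensitivity"]],
  numeric(1)
)
best <- max(cutoffs[sens >= 0.90])

best
sens_spec(sim$psa_level, sim$cancer, best)
```

Raising the sensitivity target pushes the chosen cutoff lower and generally reduces specificity, which is the trade-off a screening-oriented analysis has to weigh.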

Examples

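data(psychopdaROC_screening)
# As written, this call fails because the `refVar` argument,
# which has no default, is not supplied.
psychopdaROC(data = psychopdaROC_screening,
             dependentVars = c("psa_level", "ca125"),
             classVar = "cancer", positiveClass = "Cancer",
             clinicalPreset = "screening")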
#> Error in psychopdaROC(data = psychopdaROC_screening, dependentVars = c("psa_level",     "ca125"), classVar = "cancer", positiveClass = "Cancer",     clinicalPreset = "screening"): argument "refVar" is missing, with no default