Decision Panel Optimization — .escapeVariableNames • ClinicoPath

Performs comprehensive univariate and stratified survival analysis comparing survival between groups. This analysis calculates person-time follow-up for each group and uses this to derive accurate survival estimates and incidence rates that account for varying follow-up durations across groups. The Cox proportional hazards model incorporates person-time by modeling the hazard function, which represents the instantaneous event rate per unit of person-time.

Key Features:

Kaplan-Meier survival curves with multiple plot types
Cox proportional hazards regression (univariate and stratified)
Median survival time with confidence intervals
Restricted Mean Survival Time (RMST) analysis
Person-time analysis with incidence rates
Competing risks and cause-specific survival
Landmark analysis for conditional survival
Proportional hazards assumption testing
Model residual diagnostics
Pairwise group comparisons with multiple testing correction

Statistical Methods:

Kaplan-Meier estimator for survival probabilities
Log-rank test for group comparisons
Cox proportional hazards model for risk assessment
Competing risks analysis using cumulative incidence functions
RMST for robust survival comparisons

Visualization Options:

Standard survival curves
Cumulative events and hazard plots
KMunicate-style plots for publication
Log-log plots for proportional hazards assessment
Residual diagnostic plots

Generates a Venn Diagram and an Upset diagram from selected categorical variables. This function converts specified variables to logical values based on a chosen "true" level. Two visual outputs are produced: a Venn diagram (via ggvenn) and an Upset plot (via UpSetR or ComplexUpset). Additionally, a summary table of "true" counts for each variable is provided.

ComplexUpset features include advanced styling, statistical annotations, custom sorting, and enhanced theming options for publication-ready figures.

Usage

.escapeVariableNames(var_names)

.escapeVariableNames(var_names)

.escapeVariableNames(var_names)

.escapeVariableNames(var_names)

Arguments

data: The data as a data frame
elapsedtime: Numeric variable containing survival time (time to event or censoring)
tint: Logical. Use dates to calculate survival time from diagnosis and follow-up dates
dxdate: Date variable for diagnosis date (when tint = TRUE)
fudate: Date variable for follow-up/event date (when tint = TRUE)
explanatory: Factor variable for group comparisons (e.g., treatment groups, risk categories)
outcome: Event indicator variable (binary: 0=censored, 1=event) or factor for multi-state outcomes
outcomeLevel: Event level when using factor outcome variable
dod: Dead of disease level (for competing risks analysis)
dooc: Dead of other causes level (for competing risks analysis)
awd: Alive with disease level (for competing risks analysis)
awod: Alive without disease level (for competing risks analysis)
analysistype: Type of survival analysis: "overall", "cause", or "compete"
cutp: Time points for survival probability estimation (comma-separated)
timetypedata: Date format in data: "ymd", "dmy", "mdy", etc.
timetypeoutput: Time unit for output: "days", "weeks", "months", "years"
uselandmark: Logical. Perform landmark analysis
landmark: Landmark time point for conditional survival analysis
pw: Logical. Perform pairwise group comparisons
padjustmethod: Multiple testing correction method for pairwise comparisons
ph_cox: Logical. Test proportional hazards assumption
stratified_cox: Logical. Use stratified Cox regression
strata_variable: Variable for Cox model stratification
rmst_analysis: Logical. Calculate Restricted Mean Survival Time
rmst_tau: Time horizon for RMST calculation (uses 75th percentile if NULL)
residual_diagnostics: Logical. Calculate and display model residuals
export_survival_data: Logical. Export survival estimates for external analysis
person_time: Logical. Calculate person-time metrics and incidence rates
time_intervals: Time intervals for stratified person-time analysis
rate_multiplier: Multiplier for incidence rates (e.g., 100 for per 100 person-years)
sc: Logical. Display survival curve plot
ce: Logical. Display cumulative events plot
ch: Logical. Display cumulative hazard plot
kmunicate: Logical. Display KMunicate-style plot
loglog: Logical. Display log-log plot for proportional hazards assessment
endplot: Maximum time for plot x-axis
ybegin_plot: Minimum value for plot y-axis
yend_plot: Maximum value for plot y-axis
byplot: Time interval for plot axis breaks
multievent: Logical. Use multiple event levels for competing risks
ci95: Logical. Display 95% confidence intervals on plots
risktable: Logical. Display risk table below survival curves
censored: Logical. Display censoring marks on survival curves
pplot: Logical. Display p-value on plots
medianline: Type of median survival line: "none", "h", "v", "hv"

Value

A comprehensive results object containing survival analysis outputs

The function produces a Venn diagram and an Upset diagram.

Details

Analysis Types:

Overall Survival: Time from study entry to death from any cause
Cause-Specific Survival: Time to death from specific cause (censoring other deaths)
Competing Risks: Accounts for multiple types of events that prevent observation of the primary outcome

Person-Time Analysis: Calculates incidence rates accounting for varying follow-up times. Particularly useful for:

Studies with differential loss to follow-up
Comparison of event rates across populations
Assessment of time-varying risk

Restricted Mean Survival Time (RMST): Alternative to median survival when survival curves don't reach 50% or for comparing survival over a specific time horizon. Represents the area under the survival curve up to a specified time point.

Model Diagnostics:

Proportional hazards assumption testing using Schoenfeld residuals
Martingale and deviance residuals for outlier detection
Log-log plots for visual assessment of proportional hazards

References

Klein JP, Moeschberger ML (2003). Survival Analysis: Techniques for Censored and Truncated Data. Springer.

Therneau TM, Grambsch PM (2000). Modeling Survival Data: Extending the Cox Model. Springer.

Royston P, Parmar MK (2013). Restricted mean survival time: an alternative to the hazard ratio for the design and analysis of randomized trials with a time-to-event outcome. BMC Medical Research Methodology 13:152.

Examples

# \donttest{
# Basic survival analysis
data("histopathologySurvival", package = "ClinicoPathJamoviModule")
#> Error in find.package(package, lib.loc, verbose = verbose): there is no package called ‘ClinicoPathJamoviModule’

# Standard survival analysis with median and survival probabilities
survival_result <- survival(
  data = histopathologySurvival,
  elapsedtime = "OverallSurvival_indays",
  outcome = "Outcome",
  outcomeLevel = "Dead",
  explanatory = "Grade",
  timetypeoutput = "months",
  cutp = "12, 36, 60",
  sc = TRUE,
  pw = TRUE
)
#> Error: object 'histopathologySurvival' not found

# Survival analysis with person-time metrics
survival_with_pt <- survival(
  data = histopathologySurvival,
  elapsedtime = "OverallSurvival_indays", 
  outcome = "Outcome",
  outcomeLevel = "Dead",
  explanatory = "Stage",
  person_time = TRUE,
  time_intervals = "365, 1095, 1825",
  rate_multiplier = 1000
)
#> Error: object 'histopathologySurvival' not found

# RMST analysis for non-proportional hazards
rmst_analysis <- survival(
  data = histopathologySurvival,
  elapsedtime = "OverallSurvival_indays",
  outcome = "Outcome", 
  outcomeLevel = "Dead",
  explanatory = "Treatment",
  rmst_analysis = TRUE,
  rmst_tau = 1095  # 3 years
)
#> Error: object 'histopathologySurvival' not found

# Competing risks analysis
competing_risks <- survival(
  data = cancer_data,
  elapsedtime = "survival_days",
  outcome = "death_cause",
  multievent = TRUE,
  dod = "Cancer",
  dooc = "Other",
  awd = "Alive_Disease",
  awod = "Alive_Free",
  analysistype = "compete",
  explanatory = "risk_group"
)
#> Error in survival(data = cancer_data, elapsedtime = "survival_days", outcome = "death_cause",     multievent = TRUE, dod = "Cancer", dooc = "Other", awd = "Alive_Disease",     awod = "Alive_Free", analysistype = "compete", explanatory = "risk_group"): argument "outcomeLevel" is missing, with no default

# Landmark analysis for conditional survival
landmark_survival <- survival(
  data = histopathologySurvival,
  elapsedtime = "OverallSurvival_indays",
  outcome = "Outcome",
  outcomeLevel = "Dead", 
  explanatory = "Grade",
  uselandmark = TRUE,
  landmark = 365  # 1-year conditional survival
)
#> Error: object 'histopathologySurvival' not found

# Date-based survival calculation
date_survival <- survival(
  data = clinical_data,
  tint = TRUE,
  dxdate = "diagnosis_date",
  fudate = "last_contact_date",
  timetypedata = "ymd",
  timetypeoutput = "months",
  outcome = "vital_status",
  outcomeLevel = "Dead",
  explanatory = "treatment_arm"
)
#> Error: object 'clinical_data' not found
# }

if (FALSE) { # \dontrun{
# Example 1: Basic 2-variable Venn diagram
data("mtcars")
mtcars$vs <- factor(mtcars$vs, levels = c(0, 1), labels = c("V-shaped", "Straight"))
mtcars$am <- factor(mtcars$am, levels = c(0, 1), labels = c("Automatic", "Manual"))

# Create Venn diagram showing overlap between V-shaped engines and Manual transmission
venn(data = mtcars, var1 = "vs", var1true = "V-shaped",
     var2 = "am", var2true = "Manual")

# Example 2: 3-variable Venn diagram with penguins data
library(palmerpenguins)
data("penguins")
penguins$large_bill <- factor(ifelse(penguins$bill_length_mm > 45, "Large", "Small"))
penguins$heavy_weight <- factor(ifelse(penguins$body_mass_g > 4000, "Heavy", "Light"))
penguins$adelie_species <- factor(ifelse(penguins$species == "Adelie", "Adelie", "Other"))

venn(data = penguins,
     var1 = "large_bill", var1true = "Large",
     var2 = "heavy_weight", var2true = "Heavy",
     var3 = "adelie_species", var3true = "Adelie")

# Example 3: Variable names with spaces and numbers (requires careful handling)
# jamovi GUI automatically handles most problematic names
# When calling directly in R, variable names with spaces/numbers need backticks:
# venn(data = mydata, var1 = "`Rater 1`", var1true = "Positive",
#      var2 = "`Score 2A`", var2true = "High")

# Note: Names like "Rater 1", "Score 2A", "Item 3B" may cause parsing issues
# at the jamovi interface level. Solutions:
# 1. Use jamovi GUI for variable selection (recommended)
# 2. Rename variables to avoid spaces + numbers: "Rater1", "Score2A", "Item3B"
# 3. In R console, use backticks: `Rater 1` or quote properly

# Example 4: Clinical biomarker analysis
data("biomarkers")  # Hypothetical clinical dataset
venn(data = biomarkers,
     var1 = "ER_positive", var1true = "Positive",
     var2 = "PR_positive", var2true = "Positive",
     var3 = "HER2_amplified", var3true = "Amplified",
     show_ggVennDiagram = TRUE,
     regionLabels = "both",
     clinicalSummary = TRUE)

# Example 5: Medical/Clinical comorbidity analysis
# Create sample clinical data
clinical_data <- data.frame(
  patient_id = 1:100,
  diabetes = sample(c("Yes", "No"), 100, replace = TRUE, prob = c(0.3, 0.7)),
  hypertension = sample(c("Yes", "No"), 100, replace = TRUE, prob = c(0.4, 0.6)),
  obesity = sample(c("Yes", "No"), 100, replace = TRUE, prob = c(0.25, 0.75))
)

# Analyze comorbidity patterns
venn(data = clinical_data,
     var1 = "diabetes", var1true = "Yes",
     var2 = "hypertension", var2true = "Yes",
     var3 = "obesity", var3true = "Yes")

# Example 4: Using ComplexUpset for advanced features
venn(data = clinical_data,
     var1 = "diabetes", var1true = "Yes",
     var2 = "hypertension", var2true = "Yes",
     var3 = "obesity", var3true = "Yes",
     show_complexUpset = TRUE,
     sortBy = "freq",
     minSize = 5,
     showAnnotations = TRUE)

# Example 5: Advanced customization using ggVennDiagram
venn(data = clinical_data,
     var1 = "diabetes", var1true = "Yes",
     var2 = "hypertension", var2true = "Yes",
     var3 = "obesity", var3true = "Yes",
     show_ggVennDiagram = TRUE,
     regionLabels = "both",
     colorPalette = "Set1",
     labelSize = 3.5,
     setNameSize = 4.5)

# Example 6: 5-variable Venn diagram using ggVennDiagram
# Add more clinical variables
clinical_data$smoking <- sample(c("Yes", "No"), 100, replace = TRUE, prob = c(0.2, 0.8))
clinical_data$family_history <- sample(c("Yes", "No"), 100, replace = TRUE, prob = c(0.35, 0.65))

venn(data = clinical_data,
     var1 = "diabetes", var1true = "Yes",
     var2 = "hypertension", var2true = "Yes",
     var3 = "obesity", var3true = "Yes",
     var4 = "smoking", var4true = "Yes",
     var5 = "family_history", var5true = "Yes",
     show_ggVennDiagram = TRUE,
     regionLabels = "percent",
     colorPalette = "viridis")
} # }