Sequential Testing Analysis: A Comprehensive Guide

Introduction

Sequential testing analysis is a powerful approach for optimizing diagnostic accuracy by combining multiple tests in systematic ways. This guide demonstrates how to use the Sequential Testing Analysis module to design, evaluate, and optimize diagnostic protocols across various clinical scenarios.

Key Learning Objectives

After reading this guide, you will understand:

The three main sequential testing strategies and when to use each
How test characteristics interact in different combinations
Clinical applications across medical specialties
Cost-effectiveness considerations in test sequencing
How to interpret and apply Fagan nomograms for sequential testing
Practical implementation in clinical decision-making

What Makes Sequential Testing Important

Sequential testing addresses fundamental challenges in clinical diagnosis:

Single tests are often insufficient for complex clinical decisions
Balancing sensitivity and specificity requires strategic test combinations
Cost-effectiveness demands optimal use of expensive confirmatory tests
Clinical workflows benefit from systematic diagnostic protocols
Patient outcomes improve with more accurate diagnostic strategies

Understanding Sequential Testing Strategies

The Three Core Strategies

1. Serial Positive Testing (Confirmation Strategy)

Approach: Perform second test only if first test is positive Result interpretation: Positive only if both tests are positive Effect: Maximizes specificity, reduces sensitivity

Best for: - Avoiding false positives when consequences are serious - Expensive or invasive confirmatory tests - High-stakes diagnoses (e.g., cancer, genetic diseases)

2. Serial Negative Testing (Exclusion Strategy)

Approach: Perform second test only if first test is negative Result interpretation: Positive if either test is positive Effect: Maximizes sensitivity, reduces specificity

Best for: - Cannot afford to miss cases - Screening for serious conditions - Complementary tests detecting different disease manifestations

3. Parallel Testing

Approach: Perform both tests on all subjects Result interpretation: Positive if either test is positive Effect: Improves sensitivity, reduces specificity

Best for: - Emergency situations requiring rapid diagnosis - Comprehensive evaluation protocols - When tests are quick and inexpensive

Mathematical Framework

Combined Sensitivity and Specificity

Serial Positive Strategy:

Combined Sensitivity = Se₁ × Se₂
Combined Specificity = Sp₁ + (1-Sp₁) × Sp₂

Serial Negative Strategy:

Combined Sensitivity = Se₁ + (1-Se₁) × Se₂
Combined Specificity = Sp₁ × Sp₂

Parallel Strategy:

Combined Sensitivity = Se₁ + Se₂ - (Se₁ × Se₂)
Combined Specificity = Sp₁ × Sp₂

Example Data and Clinical Scenarios

Loading the Example Datasets

The package includes comprehensive datasets with realistic clinical scenarios:

# Load all sequential testing datasets
data(sequential_testing_examples)
data(strategy_comparison) 
data(cost_effectiveness_examples)
data(teaching_examples)
data(common_test_combinations)

# Display overview of main clinical scenarios
kable(sequential_testing_examples[1:5, c("scenario", "clinical_setting", "strategy", "combined_sens", "combined_spec")], 
      caption = "Sample Sequential Testing Scenarios",
      digits = 3)

COVID-19 Testing: A Modern Example

Let’s examine how COVID-19 testing demonstrates sequential testing principles:

covid_data <- sequential_testing_examples[
  sequential_testing_examples$scenario == "COVID-19 Testing", ]

kable(covid_data[, c("clinical_setting", "prevalence", "test1_ppv", "combined_ppv", "strategy_benefit")],
      caption = "COVID-19 Testing: Impact of Sequential Testing",
      digits = 3)

Key Insights:

Community screening (2% prevalence): Rapid test alone has 26% PPV, combined strategy improves confidence
Contact tracing (15% prevalence): Higher prevalence improves PPV significantly
Symptomatic patients (40% prevalence): High prevalence leads to excellent combined PPV

This demonstrates how prevalence dramatically affects the value of sequential testing strategies.

Cancer Screening: The Classic Use Case

Cancer screening exemplifies serial positive testing:

cancer_data <- sequential_testing_examples[
  grepl("Cancer", sequential_testing_examples$scenario), ]

cancer_summary <- cancer_data %>%
  select(scenario, test1_name, test2_name, prevalence, test1_ppv, combined_ppv) %>%
  mutate(
    ppv_improvement = combined_ppv - test1_ppv,
    fold_improvement = combined_ppv / test1_ppv
  )

kable(cancer_summary,
      caption = "Cancer Screening: Serial Positive Strategy Benefits",
      digits = 3)

cancer_data %>%
  select(scenario, test1_ppv, combined_ppv) %>%
  pivot_longer(cols = c(test1_ppv, combined_ppv), 
               names_to = "test_stage", values_to = "ppv") %>%
  mutate(test_stage = ifelse(test_stage == "test1_ppv", "Screening Test", "Combined Strategy")) %>%
  ggplot(aes(x = scenario, y = ppv, fill = test_stage)) +
  geom_col(position = "dodge") +
  scale_y_continuous(labels = scales::percent) +
  labs(title = "Cancer Screening: PPV Improvement Through Sequential Testing",
       x = "Cancer Type", y = "Positive Predictive Value", fill = "Test Stage") +
  theme_minimal() +
  theme(axis.text.x = element_text(angle = 45, hjust = 1))

Strategy Comparison Analysis

Comparing All Three Strategies

# Filter for a specific test combination at different prevalences
strategy_subset <- strategy_comparison %>%
  filter(test1_sens == 0.90, test1_spec == 0.85, 
         test2_sens == 0.85, test2_spec == 0.95) %>%
  select(strategy, prevalence, combined_sens, combined_spec, combined_ppv, combined_npv)

kable(strategy_subset, 
      caption = "Strategy Comparison: Same Tests, Different Approaches",
      digits = 3)

strategy_subset %>%
  pivot_longer(cols = c(combined_ppv, combined_npv), 
               names_to = "metric", values_to = "value") %>%
  mutate(metric = ifelse(metric == "combined_ppv", "PPV", "NPV")) %>%
  ggplot(aes(x = prevalence, y = value, color = strategy, linetype = metric)) +
  geom_line(size = 1.2) +
  geom_point(size = 3) +
  scale_y_continuous(labels = scales::percent) +
  scale_x_continuous(labels = scales::percent) +
  labs(title = "Sequential Testing Strategies: Performance Across Prevalence",
       x = "Disease Prevalence", y = "Predictive Value", 
       color = "Strategy", linetype = "Metric") +
  theme_minimal()

Strategy Selection Guidelines

guidelines <- data.frame(
  Strategy = c("Serial Positive", "Serial Negative", "Parallel"),
  `Primary Goal` = c("Minimize false positives", "Minimize false negatives", "Comprehensive evaluation"),
  `Best When` = c("High-stakes diagnosis", "Cannot miss cases", "Emergency situations"),
  `Trade-off` = c("Lower sensitivity", "Lower specificity", "Higher cost"),
  `Example Use` = c("Cancer screening → biopsy", "Rare disease screening", "MI rule-out in ED"),
  check.names = FALSE
)

kable(guidelines, caption = "Strategy Selection Guidelines")

Clinical Applications by Specialty

Infectious Disease Testing

infectious_data <- sequential_testing_examples[
  sequential_testing_examples$scenario %in% c("COVID-19 Testing", "Tuberculosis Screening"), ]

infectious_summary <- infectious_data %>%
  group_by(scenario, strategy) %>%
  summarise(
    settings = n(),
    avg_sensitivity = mean(combined_sens),
    avg_specificity = mean(combined_spec),
    avg_ppv = mean(combined_ppv)
  )

kable(infectious_summary, 
      caption = "Infectious Disease Sequential Testing",
      digits = 3)

TB Screening Example

TB screening demonstrates how prevalence affects strategy choice:

tb_data <- sequential_testing_examples[
  sequential_testing_examples$scenario == "Tuberculosis Screening", ]

ggplot(tb_data, aes(x = prevalence, y = combined_ppv)) +
  geom_point(aes(color = clinical_setting), size = 4) +
  geom_smooth(method = "loess", se = FALSE) +
  scale_x_continuous(labels = scales::percent) +
  scale_y_continuous(labels = scales::percent) +
  labs(title = "TB Screening: PPV vs Prevalence in Different Populations",
       x = "TB Prevalence", y = "Combined PPV", color = "Population") +
  theme_minimal()

Emergency Medicine: Parallel Testing

emergency_data <- sequential_testing_examples[
  sequential_testing_examples$scenario == "Myocardial Infarction Rule-out", ]

kable(emergency_data[, c("clinical_setting", "combined_sens", "combined_npv", "clinical_impact")],
      caption = "Emergency Medicine: Parallel Testing for MI Rule-out",
      digits = 3)

Cardiology: Risk Stratification

cardiac_data <- sequential_testing_examples[
  sequential_testing_examples$scenario == "Coronary Artery Disease", ]

cardiac_analysis <- cardiac_data %>%
  mutate(
    risk_category = case_when(
      prevalence < 0.20 ~ "Low Risk",
      prevalence < 0.30 ~ "Intermediate Risk",
      TRUE ~ "High Risk"
    )
  )

ggplot(cardiac_analysis, aes(x = combined_sens, y = combined_spec, color = risk_category)) +
  geom_point(size = 4) +
  scale_x_continuous(labels = scales::percent) +
  scale_y_continuous(labels = scales::percent) +
  labs(title = "Cardiac Testing: Sensitivity-Specificity by Risk Category",
       x = "Combined Sensitivity", y = "Combined Specificity", color = "Risk Category") +
  theme_minimal()

Cost-Effectiveness Analysis

Economic Considerations

data(cost_effectiveness_examples)

# Calculate additional metrics
cost_analysis <- cost_effectiveness_examples %>%
  mutate(
    cost_per_person_serial = serial_total_cost / population_size,
    cost_per_person_parallel = parallel_total_cost / population_size,
    savings_per_person = cost_savings / population_size,
    savings_percent = (cost_savings / parallel_total_cost) * 100
  )

kable(cost_analysis[, c("scenario", "cost_per_person_serial", "cost_per_person_parallel", 
                       "savings_percent", "cost_per_case_found")],
      caption = "Cost-Effectiveness of Sequential Testing Strategies",
      digits = 2)

cost_comparison <- cost_effectiveness_examples %>%
  select(scenario, serial_total_cost, parallel_total_cost) %>%
  pivot_longer(cols = c(serial_total_cost, parallel_total_cost),
               names_to = "strategy", values_to = "total_cost") %>%
  mutate(strategy = ifelse(grepl("serial", strategy), "Serial Positive", "Parallel"))

ggplot(cost_comparison, aes(x = scenario, y = total_cost, fill = strategy)) +
  geom_col(position = "dodge") +
  scale_y_continuous(labels = scales::dollar) +
  labs(title = "Cost Comparison: Serial vs Parallel Testing",
       subtitle = "Based on 1,000 person population",
       x = "Clinical Scenario", y = "Total Testing Cost", fill = "Strategy") +
  theme_minimal() +
  theme(axis.text.x = element_text(angle = 45, hjust = 1))

Cost-Effectiveness Calculations

# Calculate incremental cost-effectiveness
cost_effectiveness_examples %>%
  mutate(
    diseased_population = population_size * prevalence,
    cases_found_serial = diseased_population * test1_sens * test2_sens,
    incremental_cost = serial_total_cost - (population_size * test1_cost),
    incremental_cases = cases_found_serial - (diseased_population * test1_sens),
    icer = ifelse(incremental_cases > 0, incremental_cost / incremental_cases, NA)
  ) %>%
  select(scenario, incremental_cost, incremental_cases, icer) %>%
  kable(caption = "Incremental Cost-Effectiveness of Adding Second Test",
        digits = 0)

Advanced Topics

Fagan Nomograms for Sequential Testing

Sequential Fagan nomograms help visualize how probability changes through multiple testing stages:

# Example: COVID-19 testing progression
covid_example <- sequential_testing_examples[1, ]

nomogram_data <- data.frame(
  Stage = c("Pre-test", "After Rapid Test (+)", "After RT-PCR (+)"),
  Probability = c(
    covid_example$prevalence,
    covid_example$test1_ppv,
    covid_example$combined_ppv
  ),
  Test = c("Clinical assessment", "Rapid antigen", "RT-PCR confirmation"),
  LR = c(NA, covid_example$test1_plr, covid_example$test2_plr)
)

kable(nomogram_data, 
      caption = "Probability Evolution in COVID-19 Sequential Testing",
      digits = 3)

Population Flow Analysis

Understanding how patients flow through sequential testing:

# Simulate population flow for breast cancer screening
pop_size <- 10000
prevalence <- 0.008
sens1 <- 0.80  # Mammography
spec1 <- 0.90
sens2 <- 0.95  # Biopsy
spec2 <- 0.98

# Calculate flow
diseased <- pop_size * prevalence
healthy <- pop_size - diseased

# After mammography
mammo_tp <- diseased * sens1
mammo_fp <- healthy * (1 - spec1)
mammo_pos <- mammo_tp + mammo_fp

# After biopsy (serial positive strategy)
biopsy_tp <- mammo_tp * sens2
biopsy_fp <- mammo_fp * (1 - spec2)

flow_data <- data.frame(
  Stage = c("Initial Population", "Mammography Positive", "Biopsy Positive"),
  Total = c(pop_size, mammo_pos, biopsy_tp + biopsy_fp),
  `True Positive` = c(diseased, mammo_tp, biopsy_tp),
  `False Positive` = c(0, mammo_fp, biopsy_fp),
  PPV = c(prevalence, mammo_tp/mammo_pos, biopsy_tp/(biopsy_tp + biopsy_fp))
)

kable(flow_data, 
      caption = "Population Flow: Breast Cancer Screening",
      digits = 3)

Test Independence vs Dependence

Most sequential testing analysis assumes test independence, but real-world tests may be correlated:

correlation_scenarios <- data.frame(
  Scenario = c("Independent tests", "Positively correlated", "Negatively correlated"),
  Description = c(
    "Tests detect different disease aspects",
    "Tests detect similar disease features",
    "Tests detect complementary aspects"
  ),
  `Expected Effect` = c(
    "Standard calculations apply",
    "Lower combined sensitivity/specificity than predicted",
    "Higher combined sensitivity/specificity than predicted"
  ),
  Example = c(
    "Troponin + ECG for MI",
    "Two imaging modalities",
    "Functional + anatomical tests"
  ),
  check.names = FALSE
)

kable(correlation_scenarios, caption = "Test Correlation Effects")

Using the Module in jamovi

Step-by-Step Guide

1. Basic Setup

Navigate to meddecide → Decision → Sequential Testing Analysis
Enter test characteristics for both tests:
- First Test: Name, sensitivity, specificity
- Second Test: Name, sensitivity, specificity
- Disease prevalence in your population

2. Strategy Selection

Choose the appropriate testing strategy: - Test all positives from first test (Serial positive) - Test all negatives from first test (Serial negative)
- Test all subjects with both tests (Parallel)

3. Display Options

Show Explanations: Detailed strategy explanations
Show Calculation Formulas: Step-by-step mathematics
Show Fagan Nomogram: Visual probability progression

4. Results Interpretation

Summary Table: Combined strategy performance
Individual Tests Table: Comparison of individual test performance
Population Flow Table: Patient flow through testing process

Input Validation and Warnings

The module includes intelligent validation:

Error Conditions

Test characteristics outside 0-1 range
Missing test names
Invalid prevalence values

Clinical Warnings

Unusual test characteristics (sensitivity/specificity <50% or >99%)
Extreme prevalence values
Strategy-test mismatches (e.g., low-specificity confirmatory test)
Poor test performance combinations

Strategy-Specific Guidance

Serial positive: Second test should have higher specificity
Serial negative: Second test should have higher sensitivity
Parallel: Tests should be complementary

Best Practices and Recommendations

For Clinical Practice

1. Strategy Selection

High-stakes diagnoses: Use serial positive for confirmation
Cannot miss cases: Use serial negative for comprehensive screening
Emergency situations: Use parallel for rapid, sensitive diagnosis

2. Test Selection

First test: Should be practical, accessible, cost-effective
Second test: Should complement first test’s weaknesses
Consider test correlation and independence assumptions

3. Prevalence Considerations

Low prevalence: PPV will be low even with good tests
High prevalence: NPV becomes less informative
Variable prevalence: Strategy effectiveness changes dramatically

For Research and Quality Improvement

1. Protocol Design

Model different strategies before implementation
Consider cost-effectiveness alongside diagnostic accuracy
Plan for different prevalence scenarios

2. Performance Monitoring

Track real-world performance vs predictions
Monitor for test correlation effects
Evaluate patient outcomes and satisfaction

3. Continuous Improvement

Regular review of test characteristics
Update protocols based on new evidence
Consider emerging diagnostic technologies

Case Studies

Case Study 1: Optimizing COVID-19 Testing Protocol

Challenge: Design testing protocol for hospital employee screening

Parameters: - Population: 5,000 employees - Expected prevalence: 3% - Available tests: Rapid antigen (Se=85%, Sp=95%), RT-PCR (Se=95%, Sp=99%) - Constraints: Cost, turnaround time, accuracy

Analysis:

# Calculate outcomes for different strategies
covid_strategies <- data.frame(
  Strategy = c("Rapid only", "PCR only", "Rapid → PCR (pos)", "Both parallel"),
  Sensitivity = c(0.85, 0.95, 0.85*0.95, 0.85 + 0.95 - 0.85*0.95),
  Specificity = c(0.95, 0.99, 0.95 + (1-0.95)*0.99, 0.95*0.99),
  Cost_per_person = c(25, 100, 25 + 0.08*100, 125),
  Turnaround_hours = c(0.5, 24, 24, 24)
) %>%
  mutate(
    PPV = (Sensitivity * 0.03) / (Sensitivity * 0.03 + (1-Specificity) * 0.97),
    NPV = (Specificity * 0.97) / (Specificity * 0.97 + (1-Sensitivity) * 0.03)
  )

kable(covid_strategies, caption = "COVID-19 Testing Strategy Comparison", digits = 3)

Recommendation: Serial positive strategy balances accuracy, cost, and operational feasibility.

Case Study 2: Breast Cancer Screening Optimization

Challenge: Optimize screening protocol for women aged 50-69

Considerations: - Cancer prevalence: 0.8% - Available tests: Digital mammography, 3D tomosynthesis, MRI, biopsy - Goals: Maximize cancer detection, minimize unnecessary procedures

# Compare different screening approaches
breast_strategies <- data.frame(
  Approach = c("2D mammography only", "3D tomosynthesis only", "2D → biopsy", "3D → biopsy", "2D + 3D → biopsy"),
  First_test_sens = c(0.80, 0.85, 0.80, 0.85, 0.90),
  First_test_spec = c(0.90, 0.92, 0.90, 0.92, 0.88),
  Combined_sens = c(0.80, 0.85, 0.76, 0.81, 0.86),
  Combined_spec = c(0.90, 0.92, 0.998, 0.998, 0.998),
  Biopsies_per_1000 = c(0, 0, 12, 10, 14)
) %>%
  mutate(
    Cancers_detected_per_1000 = Combined_sens * 8,
    PPV = (Combined_sens * 0.008) / (Combined_sens * 0.008 + (1-Combined_spec) * 0.992)
  )

kable(breast_strategies, caption = "Breast Cancer Screening Strategy Analysis", digits = 3)

Troubleshooting and Common Issues

Mathematical Issues

1. Division by Zero

Occurs with perfect test characteristics
Module handles gracefully with infinite likelihood ratios
Clinical interpretation: “Perfect” tests rarely exist in practice

2. Extreme Prevalence

Very low prevalence: PPV remains low despite good tests
Very high prevalence: NPV becomes uninformative
Solution: Consider prevalence-appropriate strategies

3. Test Correlation

Independence assumption may not hold
Real performance may differ from predictions
Solution: Validate with real-world data

Clinical Interpretation Challenges

1. Strategy Selection Confusion

Problem: Unclear which strategy to use
Solution: Focus on primary clinical goal (avoid false positives vs false negatives)

2. Prevalence Estimation

Problem: Uncertain disease prevalence
Solution: Use sensitivity analysis across prevalence ranges

3. Test Characteristic Uncertainty

Problem: Published test characteristics may not apply to your population
Solution: Use conservative estimates and local validation data

Advanced Applications

Markov Models for Sequential Testing

For complex diagnostic pathways with multiple decision points:

markov_states <- data.frame(
  State = c("Pre-test", "Test 1 +", "Test 1 -", "Test 2 +", "Test 2 -", "Diagnosed", "Not diagnosed"),
  Description = c(
    "Initial clinical presentation",
    "Positive first test result",
    "Negative first test result", 
    "Positive second test result",
    "Negative second test result",
    "Disease confirmed",
    "Disease ruled out"
  ),
  Transition_probabilities = c(
    "Based on test characteristics",
    "To Test 2 or final diagnosis",
    "To Test 2 or discharge",
    "To final diagnosis",
    "To discharge or additional testing",
    "Terminal state",
    "Terminal state"
  )
)

kable(markov_states, caption = "Markov Model States for Complex Sequential Testing")

Machine Learning Integration

Modern sequential testing can incorporate ML predictions:

ml_framework <- data.frame(
  Component = c("Risk Prediction", "Test Selection", "Result Integration", "Decision Support"),
  Traditional_Approach = c(
    "Fixed prevalence estimates",
    "Protocol-driven selection",
    "Rule-based interpretation",
    "Physician judgment"
  ),
  ML_Enhanced_Approach = c(
    "Personalized risk scores",
    "Adaptive test selection",
    "Probabilistic integration",
    "Evidence-based recommendations"
  ),
  Benefit = c(
    "More accurate pre-test probabilities",
    "Optimal test sequences",
    "Better diagnostic accuracy",
    "Reduced cognitive load"
  )
)

kable(ml_framework, caption = "Machine Learning Enhancement of Sequential Testing")

Conclusion

Sequential testing analysis provides a systematic framework for optimizing diagnostic accuracy through strategic test combinations. The Sequential Testing Analysis module offers comprehensive tools for:

Key Capabilities

Strategy Comparison: Evaluate serial positive, serial negative, and parallel approaches
Clinical Scenarios: Apply to real-world medical situations across specialties
Cost-Effectiveness: Balance diagnostic accuracy with economic considerations
Visual Analysis: Use Fagan nomograms and population flow diagrams
Quality Improvement: Design and optimize clinical protocols

Best Practices Summary

Match strategy to clinical goals: Confirmation vs exclusion vs comprehensive evaluation
Consider prevalence effects: Strategy effectiveness varies dramatically with disease prevalence
Plan for real-world constraints: Cost, time, availability, and patient factors
Validate assumptions: Test independence and population applicability
Monitor performance: Track outcomes and continuously improve protocols

Future Directions

Integration with electronic health records for automated risk assessment
Real-time adaptive testing protocols based on individual patient characteristics
Machine learning-enhanced test selection and interpretation
Cost-effectiveness modeling with health economic outcomes
Population-level optimization for public health screening programs

Sequential testing analysis represents a critical tool for evidence-based diagnostic decision-making, helping clinicians and healthcare systems optimize patient care while managing resources effectively.

This vignette was generated using ClinicoPath version 0.0.3.56 on 2025-07-13.

Optimizing Diagnostic Strategies Through Test Combinations

ClinicoPath

2025-07-13