A comprehensive simulated dataset for general medical research, including patient demographics, visit information, clinical measurements, lab results, and various health scores. The data are in long format, with one row per patient visit; the number of visits varies by patient.

Usage

data(medical_research_data)

Format

A data frame with 890 rows and 32 variables:

PatientID

Character. Unique patient identifier.

VisitNumber

Integer. The sequential number of the patient's visit.

VisitDate

Character. Date of the visit in YYYY-MM-DD format (e.g., "2022-01-01"). Convert with as.Date() before doing date arithmetic.

StudyCenter

Character. Identifier for the study center or hospital.

AgeGroup

Character. Categorized age group of the patient.

Gender

Character. Gender of the patient.

TreatmentGroup

Character. Assigned treatment group (e.g., "Treatment A", "Control").

DiagnosisPrimary

Character. Primary diagnosis for the patient.

DiseaseStage

Character. Stage of the primary disease.

ComorbidityCount

Integer. Number of comorbidities.

BMICategory

Character. Body Mass Index category (e.g., "Underweight", "Normal", "Overweight").

SystolicBP

Integer. Systolic blood pressure (mmHg).

DiastolicBP

Integer. Diastolic blood pressure (mmHg).

HeartRate

Integer. Heart rate (beats per minute).

Temperature

Numeric. Body temperature. The sample values (e.g., 36.6) are consistent with degrees Celsius.

RespiratoryRate

Integer. Respiratory rate (breaths per minute).

OxygenSaturation

Integer. Oxygen saturation (SpO2, percent).

HbA1c

Numeric. Glycated hemoglobin level (%).

Creatinine

Numeric. Creatinine level. The sample values (e.g., 0.74) are consistent with mg/dL.

eGFR

Integer. Estimated Glomerular Filtration Rate.

Hemoglobin

Numeric. Hemoglobin level (e.g., g/dL).

WBC

Numeric. White blood cell count. The sample values (e.g., 7.5) are consistent with thousands of cells per µL.

Platelets

Integer. Platelet count. The sample values (e.g., 238) are consistent with thousands of cells per µL.

TotalCholesterol

Integer. Total cholesterol level (e.g., mg/dL).

LDLCholesterol

Integer. LDL (low-density lipoprotein) cholesterol level.

HDLCholesterol

Integer. HDL (high-density lipoprotein) cholesterol level.

Triglycerides

Integer. Triglycerides level.

PainScore

Integer. Patient-reported pain score (e.g., 0-10 scale).

QualityOfLife

Integer. Quality of Life score from a standardized questionnaire.

FunctionalStatus

Integer. Functional status score.

MedicationAdherence

Integer. Medication adherence score or percentage.

ClinicalScore

Integer. A composite or overall clinical score.
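
Because VisitDate is stored as character, date arithmetic needs a conversion step first. The following is a minimal sketch of that conversion, using a small stand-in data frame copied from the first six rows of the head() output rather than the full dataset. The column name DaysSinceFirstVisit is an assumption introduced here for illustration and is not part of the dataset.

```r
# Stand-in for the first six rows of medical_research_data, copied from the
# head() output; the real dataset has 890 rows.
visits <- data.frame(
  PatientID   = c("PT001", "PT001", "PT001", "PT002", "PT002", "PT002"),
  VisitNumber = c(1L, 2L, 3L, 1L, 2L, 3L),
  VisitDate   = c("2022-01-01", "2022-02-09", "2022-04-12",
                  "2022-01-01", "2022-04-18", "2022-05-30"),
  stringsAsFactors = FALSE
)

# Convert the character column to Date (year-month-day).
visits$VisitDate <- as.Date(visits$VisitDate, format = "%Y-%m-%d")

# Days elapsed since each patient's first visit.
# as.numeric() on a Date gives days since 1970-01-01, so ave() can take the
# per-patient minimum on plain numbers.
# DaysSinceFirstVisit is a hypothetical column name, not part of the dataset.
day_number <- as.numeric(visits$VisitDate)
visits$DaysSinceFirstVisit <-
  day_number - ave(day_number, visits$PatientID, FUN = min)

print(visits)
```

The same two steps should apply unchanged to medical_research_data itself, assuming every VisitDate value follows the YYYY-MM-DD pattern seen in the sample rows.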

Examples

data(medical_research_data)
str(medical_research_data)
#> 'data.frame':	890 obs. of  32 variables:
#>  $ PatientID          : chr  "PT001" "PT001" "PT001" "PT002" ...
#>  $ VisitNumber        : int  1 2 3 1 2 3 1 2 3 4 ...
#>  $ VisitDate          : chr  "2022-01-01" "2022-02-09" "2022-04-12" "2022-01-01" ...
#>  $ StudyCenter        : chr  "Boston Medical Center" "Boston Medical Center" "Boston Medical Center" "Johns Hopkins" ...
#>  $ AgeGroup           : chr  "46-60" "46-60" "46-60" "46-60" ...
#>  $ Gender             : chr  "Male" "Male" "Male" "Female" ...
#>  $ TreatmentGroup     : chr  "Treatment A" "Treatment A" "Treatment A" "Treatment A+B" ...
#>  $ DiagnosisPrimary   : chr  "Diabetes Type 2" "Diabetes Type 2" "Diabetes Type 2" "Diabetes Type 2" ...
#>  $ DiseaseStage       : chr  "Stage II" "Stage II" "Stage II" "Stage II" ...
#>  $ ComorbidityCount   : int  3 3 3 3 3 3 2 2 2 2 ...
#>  $ BMICategory        : chr  "Normal" "Normal" "Normal" "Overweight" ...
#>  $ SystolicBP         : int  142 141 143 120 118 125 139 145 124 133 ...
#>  $ DiastolicBP        : int  111 103 111 96 94 94 101 109 89 106 ...
#>  $ HeartRate          : int  84 66 55 76 88 72 83 101 66 71 ...
#>  $ Temperature        : num  36.6 37.9 36.7 36.8 37.5 36.7 36 36.5 36.8 36.8 ...
#>  $ RespiratoryRate    : int  16 11 16 15 18 27 12 20 14 17 ...
#>  $ OxygenSaturation   : int  98 98 94 98 100 98 100 96 96 98 ...
#>  $ HbA1c              : num  NA 9.4 9 6.7 9.1 10.4 10.3 5.9 8.8 NA ...
#>  $ Creatinine         : num  0.74 0.6 0.46 0.73 1.43 1.08 1.08 0.63 0.97 NA ...
#>  $ eGFR               : int  89 85 92 89 80 81 84 NA 94 87 ...
#>  $ Hemoglobin         : num  14.1 11.5 NA 10.6 13.2 13.6 12.8 NA 14.2 13.8 ...
#>  $ WBC                : num  NA 10 9.1 7.5 5.7 NA NA NA 3.9 4.2 ...
#>  $ Platelets          : int  238 225 295 254 297 181 186 141 NA 249 ...
#>  $ TotalCholesterol   : int  130 NA 165 236 216 198 146 216 154 192 ...
#>  $ LDLCholesterol     : int  95 136 132 145 136 NA 113 148 146 84 ...
#>  $ HDLCholesterol     : int  41 62 50 36 64 NA 50 41 48 58 ...
#>  $ Triglycerides      : int  148 117 53 226 104 108 103 163 199 NA ...
#>  $ PainScore          : int  NA NA 1 4 7 2 4 4 3 NA ...
#>  $ QualityOfLife      : int  41 58 75 53 64 51 65 61 68 87 ...
#>  $ FunctionalStatus   : int  91 97 69 63 74 76 45 100 49 87 ...
#>  $ MedicationAdherence: int  88 100 97 100 87 88 73 100 69 60 ...
#>  $ ClinicalScore      : int  59 55 70 50 46 67 47 40 67 46 ...
head(medical_research_data)
#>   PatientID VisitNumber  VisitDate           StudyCenter AgeGroup Gender
#> 1     PT001           1 2022-01-01 Boston Medical Center    46-60   Male
#> 2     PT001           2 2022-02-09 Boston Medical Center    46-60   Male
#> 3     PT001           3 2022-04-12 Boston Medical Center    46-60   Male
#> 4     PT002           1 2022-01-01         Johns Hopkins    46-60 Female
#> 5     PT002           2 2022-04-18         Johns Hopkins    46-60 Female
#> 6     PT002           3 2022-05-30         Johns Hopkins    46-60 Female
#>   TreatmentGroup DiagnosisPrimary DiseaseStage ComorbidityCount BMICategory
#> 1    Treatment A  Diabetes Type 2     Stage II                3      Normal
#> 2    Treatment A  Diabetes Type 2     Stage II                3      Normal
#> 3    Treatment A  Diabetes Type 2     Stage II                3      Normal
#> 4  Treatment A+B  Diabetes Type 2     Stage II                3  Overweight
#> 5  Treatment A+B  Diabetes Type 2     Stage II                3  Overweight
#> 6  Treatment A+B  Diabetes Type 2     Stage II                3  Overweight
#>   SystolicBP DiastolicBP HeartRate Temperature RespiratoryRate OxygenSaturation
#> 1        142         111        84        36.6              16               98
#> 2        141         103        66        37.9              11               98
#> 3        143         111        55        36.7              16               94
#> 4        120          96        76        36.8              15               98
#> 5        118          94        88        37.5              18              100
#> 6        125          94        72        36.7              27               98
#>   HbA1c Creatinine eGFR Hemoglobin  WBC Platelets TotalCholesterol
#> 1    NA       0.74   89       14.1   NA       238              130
#> 2   9.4       0.60   85       11.5 10.0       225               NA
#> 3   9.0       0.46   92         NA  9.1       295              165
#> 4   6.7       0.73   89       10.6  7.5       254              236
#> 5   9.1       1.43   80       13.2  5.7       297              216
#> 6  10.4       1.08   81       13.6   NA       181              198
#>   LDLCholesterol HDLCholesterol Triglycerides PainScore QualityOfLife
#> 1             95             41           148        NA            41
#> 2            136             62           117        NA            58
#> 3            132             50            53         1            75
#> 4            145             36           226         4            53
#> 5            136             64           104         7            64
#> 6             NA             NA           108         2            51
#>   FunctionalStatus MedicationAdherence ClinicalScore
#> 1               91                  88            59
#> 2               97                 100            55
#> 3               69                  97            70
#> 4               63                 100            50
#> 5               74                  87            46
#> 6               76                  88            67
summary(medical_research_data$AgeGroup) # AgeGroup is character, so summary() reports only length, class, and mode
#>    Length     Class      Mode 
#>       890 character character 
hist(medical_research_data$SystolicBP)
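
The str() output above shows that character columns such as AgeGroup are not summarized usefully by summary(), and that several lab columns contain NA values. As a hedged sketch of how one might handle both, the code below tabulates a character column, counts missing values per column, and computes a group mean that skips NA. It runs on a stand-in data frame copied from the first six rows of the head() output; the object names (labs, age_counts, missing_per_column, mean_hba1c) are assumptions made for this illustration.

```r
# Stand-in for the first six rows of medical_research_data (selected columns),
# copied from the head() output above.
labs <- data.frame(
  TreatmentGroup = c(rep("Treatment A", 3), rep("Treatment A+B", 3)),
  AgeGroup       = rep("46-60", 6),
  SystolicBP     = c(142L, 141L, 143L, 120L, 118L, 125L),
  HbA1c          = c(NA, 9.4, 9.0, 6.7, 9.1, 10.4),
  WBC            = c(NA, 10.0, 9.1, 7.5, 5.7, NA),
  stringsAsFactors = FALSE
)

# table() gives counts per category, which summary() does not do for
# character vectors.
age_counts <- table(labs$AgeGroup)

# Number of missing values in each column.
missing_per_column <- colSums(is.na(labs))

# Mean HbA1c per treatment group, ignoring missing measurements.
mean_hba1c <- tapply(labs$HbA1c, labs$TreatmentGroup, mean, na.rm = TRUE)

print(age_counts)
print(missing_per_column)
print(mean_hba1c)
```

Passing na.rm = TRUE drops missing measurements silently, so it is worth checking the per-column NA counts first to see how many observations each mean is based on.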