A dataset containing simulated Immunohistochemistry (IHC) marker results for a set of samples. Each marker's expression is likely categorical (e.g., "Positive", "Negative", "Low", "High", or specific scoring).
Usage
data(ihc_test_data)
Format
A data frame with 100 rows and 9 variables:
- SampleID
Character. Unique identifier for each sample.
- ER
Character. Estrogen Receptor status or score.
- PR
Character. Progesterone Receptor status or score.
- HER2
Character. HER2/neu status or score.
- Ki67
Character. Ki-67 proliferation index, possibly as a category or percentage range.
- p53
Character. p53 protein expression status or score.
- CD3
Character. CD3 (T-cell marker) expression status or score.
- CD20
Character. CD20 (B-cell marker) expression status or score.
- CD45
Character. CD45 (leukocyte common antigen) expression status or score.
Examples
data(ihc_test_data)
str(ihc_test_data)
#> 'data.frame': 100 obs. of 9 variables:
#> $ SampleID: chr "S001" "S002" "S003" "S004" ...
#> $ ER : chr "1+" "3+" "2+" "-" ...
#> $ PR : chr "1+" "3+" "2+" "3+" ...
#> $ HER2 : chr "-" "3+" "1+" "1+" ...
#> $ Ki67 : chr "-" "1+" "-" "-" ...
#> $ p53 : chr "1+" "-" "1+" "2+" ...
#> $ CD3 : chr "+" "+" "+" "+" ...
#> $ CD20 : chr "+" "-" "+" "+" ...
#> $ CD45 : chr "-" "+" "+" "-" ...
head(ihc_test_data)
#> SampleID ER PR HER2 Ki67 p53 CD3 CD20 CD45
#> 1 S001 1+ 1+ - - 1+ + + -
#> 2 S002 3+ 3+ 3+ 1+ - + - +
#> 3 S003 2+ 2+ 1+ - 1+ + + +
#> 4 S004 - 3+ 1+ - 2+ + + -
#> 5 S005 - 1+ 1+ 2+ - + + +
#> 6 S006 1+ 3+ 2+ - 2+ + - +
table(ihc_test_data$ER)
#>
#> - 1+ 2+ 3+
#> 18 30 30 22
table(ihc_test_data$HER2, ihc_test_data$Ki67)
#>
#> - 1+ 2+
#> - 13 19 12
#> 1+ 8 12 7
#> 2+ 2 7 10
#> 3+ 1 3 6