A simulated dataset of immunohistochemistry (IHC) marker results for 100 samples. Every marker is stored as a character string. In the example output in Examples, the graded markers take values such as "-", "1+", "2+", and "3+", while CD3, CD20, and CD45 appear as "+" or "-".

Usage

data(ihc_test_data)

Format

A data frame with 100 rows and 9 variables:

SampleID

Character. Unique identifier for each sample.

ER

Character. Estrogen receptor (ER) score; the tabulation in Examples shows the values "-", "1+", "2+", and "3+".

PR

Character. Progesterone Receptor status or score.

HER2

Character. HER2/neu score; the cross-tabulation in Examples shows the values "-", "1+", "2+", and "3+".

Ki67

Character. Ki-67 proliferation marker, stored as a graded score rather than a percentage; the cross-tabulation in Examples shows the values "-", "1+", and "2+".

p53

Character. p53 protein expression status or score.

CD3

Character. CD3 (T-cell marker) expression status or score.

CD20

Character. CD20 (B-cell marker) expression status or score.

CD45

Character. CD45 (leukocyte common antigen) expression status or score.
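
Because every marker is stored as character, ordinal comparisons need an explicit level order. The following is a minimal sketch, assuming the graded scores run "-" < "1+" < "2+" < "3+" (an ordering inferred from the example output, not stated by the dataset). The data frame `ihc` is a hypothetical stand-in that rebuilds only the first six rows printed by `head(ihc_test_data)`, so the snippet runs without the package.

```r
# First six rows as printed by head(ihc_test_data), graded markers only.
# Rebuilt by hand so the sketch is self-contained.
ihc <- data.frame(
  SampleID = c("S001", "S002", "S003", "S004", "S005", "S006"),
  ER   = c("1+", "3+", "2+", "-",  "-",  "1+"),
  PR   = c("1+", "3+", "2+", "3+", "1+", "3+"),
  HER2 = c("-",  "3+", "1+", "1+", "1+", "2+"),
  Ki67 = c("-",  "1+", "-",  "-",  "2+", "-"),
  p53  = c("1+", "-",  "1+", "2+", "-",  "2+"),
  stringsAsFactors = FALSE
)

# Assumed ordering of the graded scores (an inference, not documented).
score_levels <- c("-", "1+", "2+", "3+")
graded <- c("ER", "PR", "HER2", "Ki67", "p53")

# Convert each graded marker to an ordered factor with the same levels.
ihc[graded] <- lapply(ihc[graded], factor,
                      levels = score_levels, ordered = TRUE)

# Ordered factors allow threshold comparisons such as "HER2 at least 2+".
her2_high <- ihc$SampleID[ihc$HER2 >= "2+"]
her2_high
```

With the full dataset, the same conversion could be applied to `ihc_test_data` directly. Using one shared set of levels also keeps unused categories (for example "3+" in Ki67) visible as zero counts in `table()`.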

Examples

data(ihc_test_data)
str(ihc_test_data)
#> 'data.frame':	100 obs. of  9 variables:
#>  $ SampleID: chr  "S001" "S002" "S003" "S004" ...
#>  $ ER      : chr  "1+" "3+" "2+" "-" ...
#>  $ PR      : chr  "1+" "3+" "2+" "3+" ...
#>  $ HER2    : chr  "-" "3+" "1+" "1+" ...
#>  $ Ki67    : chr  "-" "1+" "-" "-" ...
#>  $ p53     : chr  "1+" "-" "1+" "2+" ...
#>  $ CD3     : chr  "+" "+" "+" "+" ...
#>  $ CD20    : chr  "+" "-" "+" "+" ...
#>  $ CD45    : chr  "-" "+" "+" "-" ...
head(ihc_test_data)
#>   SampleID ER PR HER2 Ki67 p53 CD3 CD20 CD45
#> 1     S001 1+ 1+    -    -  1+   +    +    -
#> 2     S002 3+ 3+   3+   1+   -   +    -    +
#> 3     S003 2+ 2+   1+    -  1+   +    +    +
#> 4     S004  - 3+   1+    -  2+   +    +    -
#> 5     S005  - 1+   1+   2+   -   +    +    +
#> 6     S006 1+ 3+   2+    -  2+   +    -    +
table(ihc_test_data$ER)
#> 
#>  - 1+ 2+ 3+ 
#> 18 30 30 22 
table(ihc_test_data$HER2, ihc_test_data$Ki67)
#>     
#>       - 1+ 2+
#>   -  13 19 12
#>   1+  8 12  7
#>   2+  2  7 10
#>   3+  1  3  6
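
The tabulations above look at one or two markers at a time. As a rough sketch of summarising all markers in one pass, the snippet below counts non-negative results per column. The rule "anything other than "-" counts as positive" is an illustrative assumption, not a clinical cutoff, and `ihc` is a hypothetical stand-in holding only the six rows printed by `head()`.

```r
# First six rows as printed by head(ihc_test_data), rebuilt by hand.
ihc <- data.frame(
  SampleID = c("S001", "S002", "S003", "S004", "S005", "S006"),
  ER   = c("1+", "3+", "2+", "-",  "-",  "1+"),
  PR   = c("1+", "3+", "2+", "3+", "1+", "3+"),
  HER2 = c("-",  "3+", "1+", "1+", "1+", "2+"),
  Ki67 = c("-",  "1+", "-",  "-",  "2+", "-"),
  p53  = c("1+", "-",  "1+", "2+", "-",  "2+"),
  CD3  = c("+",  "+",  "+",  "+",  "+",  "+"),
  CD20 = c("+",  "-",  "+",  "+",  "+",  "-"),
  CD45 = c("-",  "+",  "+",  "-",  "+",  "+"),
  stringsAsFactors = FALSE
)

# Every column except the identifier is treated as a marker.
markers <- setdiff(names(ihc), "SampleID")

# Illustrative rule only: any value other than "-" is counted as positive.
n_pos <- vapply(ihc[markers], function(x) sum(x != "-"), integer(1))
n_pos
```

Replacing `ihc` with `ihc_test_data` should give the per-marker counts for all 100 samples under the same assumed rule.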