Skip to contents

Scientific data visualization using the grafify R package for easy and accessible plotting. Provides quick exploration graphs with few lines of code, color-blind friendly palettes, and integrated statistical analysis. Perfect for clinical research, experimental data, and scientific publications. Features scatter plots with error bars, distribution plots, before-after comparisons, and factorial designs with professional styling.

Usage

grafify(
  data,
  vars,
  groups = NULL,
  blocks = NULL,
  facet_var = NULL,
  plot_type = "scatterbar",
  x_var,
  y_var,
  error_type = "sd",
  summary_function = "mean",
  color_palette = "default",
  reverse_palette = FALSE,
  use_grafify_theme = TRUE,
  jitter_width = 0.3,
  transparency = 0.7,
  point_size = 3,
  line_size = 1,
  log_transform = "none",
  add_statistics = FALSE,
  stat_method = "anova_1way",
  posthoc_comparisons = FALSE,
  comparison_method = "pairwise",
  befafter_shape_var = NULL,
  befafter_id_var = NULL,
  show_individual_points = TRUE,
  show_summary_stats = TRUE,
  show_model_diagnostics = FALSE,
  export_data = FALSE,
  plot_width = 8,
  plot_height = 6,
  title_text = "",
  x_label = "",
  y_label = "",
  legend_position = "right",
  experimental_design = "crd",
  random_effects = NULL,
  alpha_level = 0.05
)

Arguments

data

The data as a data frame containing scientific or clinical variables for professional visualization with grafify.

vars

Continuous variables for plotting. Can be used as X, Y variables or multiple variables for distribution analysis.

groups

Categorical variable for grouping data in plots. Creates different colors, shapes, or panels for each group.

blocks

Variable for blocking or pairing observations. Used in randomized block designs or before-after comparisons.

facet_var

Variable for creating multiple panels (facets). Useful for complex experimental designs with multiple factors.

plot_type

Type of plot to create. Each type is optimized for different data structures and research questions.

x_var

Variable for X-axis. Can be continuous or categorical depending on the plot type selected.

y_var

Variable for Y-axis. Should be continuous for most plot types.

error_type

Type of error bars to display. SD shows data spread, SEM shows precision of mean, CI95 shows statistical confidence.

summary_function

Function used to summarize data for central tendency. Geometric mean is useful for log-scaled or ratio data.

color_palette

Color palette for the plot. All palettes are color-blind friendly and designed for scientific publications.

reverse_palette

Reverse the order of colors in the selected palette. Useful for adjusting color assignments to groups.

use_grafify_theme

Apply the grafify theme for classic-style scientific plots. Provides clean, publication-ready styling.

jitter_width

Amount of horizontal jittering for data points. Helps avoid overlapping points in categorical plots.

transparency

Transparency level for data points (alpha value). Lower values make points more transparent.

point_size

Size of data points in the plot.

line_size

Thickness of lines (error bars, connecting lines).

log_transform

Apply logarithmic transformation to axes. Useful for data spanning multiple orders of magnitude.

add_statistics

Perform and display statistical analysis on the plot. Integrates with grafify's built-in statistical functions.

stat_method

Statistical method to apply. Results will be displayed as text annotations or in separate output sections.

posthoc_comparisons

Perform post-hoc pairwise comparisons after ANOVA. Uses emmeans for estimated marginal means and contrasts.

comparison_method

Method for post-hoc comparisons. Pairwise compares all groups, vs reference compares to control group, trends test for patterns.

befafter_shape_var

Variable defining before/after or timepoint for paired data. Used in before-after plot types.

befafter_id_var

Variable identifying individual subjects or experimental units. Used to connect paired observations in before-after plots.

show_individual_points

Display individual data points on the plot. Recommended for transparency and data exploration.

show_summary_stats

Display summary statistics table with means, SD, SEM, and sample sizes.

show_model_diagnostics

Display model diagnostic plots (Q-Q plots, residuals) when statistical analysis is performed.

export_data

Export the processed data used for plotting. Includes summary statistics and model results.

plot_width

Width of the plot in inches for optimal display and export.

plot_height

Height of the plot in inches for optimal display and export.

title_text

Custom title for the plot. Leave empty for automatic title based on variables and analysis type.

x_label

Custom label for X-axis. Leave empty to use variable name.

y_label

Custom label for Y-axis. Leave empty to use variable name.

legend_position

Position of the plot legend.

experimental_design

Type of experimental design for appropriate statistical analysis. Affects model specification and interpretation.

random_effects

Variables to include as random effects in mixed models. Typically subjects, blocks, or experimental units.

alpha_level

Significance level for statistical tests and confidence intervals.

Value

A results object containing:

results$todoa html
results$main_plotan image
results$summary_statsa html
results$statistical_analysisa html
results$posthoc_resultsa html
results$diagnostic_plotsan image
results$qqplotan image
results$palette_previewan image
results$plot_interpretationa html
results$export_infoa html

Examples

# Example: Scientific scatter plot with error bars
data(clinical_data)
#> Warning: data set ‘clinical_data’ not found
grafify(
    data = clinical_data,
    vars = c("biomarker1", "biomarker2"),
    groups = "treatment_group",
    plot_type = "scatterbar",
    error_type = "sd",
    color_palette = "vibrant"
)
#> Error: object 'clinical_data' not found