Skip to contents

Calculate PRS using plink2

Usage

plink_prs(
  base_df,
  id_col,
  effect_allele_col,
  effect_size_col,
  multithread = TRUE,
  pfile_dir,
  plink_bin
)

Arguments

base_df

(dataframe) A dataframe containing at minimum variant ids, effect alleles, and corresponding effect sizes

id_col

(character) Name of column containing variant ids, corresponding to the variant ids in the plink files

effect_allele_col

(character) Name of column containing effect alleles

effect_size_col

(character) Name of column containing effect estimates

multithread

(logical) Should multi-threading be used to compute PRS across all pfiles in parallel (default = TRUE)

pfile_dir

(character) Directory containing the plink files for the target dataset. If the directory contains multiple files (for example, one per chromosome), the PRS can be calculated in parallel.

plink_bin

(character) Path to executable plink2 binary

Value

A tibble containing rows for each individual in the target dataset, and a column containing their PRS

Examples

if (FALSE) {
# load weights file into dataframe
weights_df <- fs::path_package("extdata", "PGS000667-PMBB.txt", package = "prsr") %>%
  vroom::vroom()

# calculate prs
prs_res <- plink_prs(
  base_df = weights_df,
  id_col = variant,
  effect_allele_col = effect_allele,
  effect_size_col = effect_weight_mg_dl,
  pfile_dir = "/project/PMBB/PMBB-Release-2020-2.0/Imputed/pgen/",
  plink_bin = "/scratch/Applications/PLINK/plink2_linux_x86_64_20211125/plink2a"
)
}