目前GWAS方向發了不少文獻,可是並無一個很完善的R包對這些文獻的數據進行彙總。html
接下來推薦的這個是最新發表的GWAS數據彙總R包。看了一下功能齊全,可是數據不是收錄的很齊全。
下面具體講一下。git
install.packages("remotes")
github
remotes::install_github("ramiromagno/gwasrapidd")
api
library(gwasrapidd)
測試
my_studies <- get_studies(efo_trait = 'autoimmune disease')
code
n(my_studies)
htm
my_studies@studies$study_id
blog
my_studies@publications$title
ci
open_in_pubmed(my_studies@publications$pubmed_id)
開發
my_associations <- get_associations(study_id = my_studies@studies$study_id)
dplyr::filter(my_associations@associations, pvalue < 1e-6) %>% # Filter by p-value
tidyr::drop_na(pvalue) %>%
dplyr::pull(association_id) -> association_ids # Extract column association_id
my_associations2 <- my_associations[association_ids]
n(my_associations2)
my_associations2@risk_alleles[c('variant_id', 'risk_allele', 'risk_frequency')] %>%
print(n = Inf)
s2 <- get_studies(variant_id = 'rs12752552')
測試了一下,優勢是減去了不少信息檢索的工做,缺點是這個包剛開發不久,信息還不夠齊全
更多詳細信息請看gwasrapidd
參考文獻:Magno R, Maia A T. gwasrapidd: an R package to query, download and wrangle GWAS Catalog data[J]. bioRxiv, 2019: 643940.