作生物信息,遺傳發育,分析數據的時候老是要narrow down分析範圍,高通量數據尤爲是基因表達,在龐大的confounder面前,縮小分析範圍是必須的,不然你會一直在混沌中游蕩。html
看一篇文章:2018 - Identification of genes associated with Hirschsprung disease, based on whole-genome sequence analysis, and potential effects on enteric nervous system development算法
Known Genes of ENS Development and Their Interactome
Genes with mutations reported to cause colonic aganglionosis in mutant mice according to the Mouse Genomics database were considered to be known ENS genes. ENS interactome was defined by genes encoding proteins that show protein–protein interaction with known ENS genes (see Supplementary Methods).數據庫
對於某個特定的疾病,咱們總想找到與它直接相關的基因,怎麼找呢?ide
MGI的數據庫裏就有這種信息,怎麼纔算直接相關呢?最具邏輯的就是突變數據,若是某個基因上的突變致使了某個疾病,那這個因果關聯就確定存在!!!設計
怎麼找呢?看官方教程:orm
How do I find mutations that cause a specific combination of phenotypes?htm
這個基因list可能包含的基因太少,那如何擴大呢?教程
另外一個數據庫就是PPI了,找出與直接基因有蛋白互做的其餘的基因,也就是以上方法中的interactome。ci
文章中庸的InWeb - A scored human protein–protein interaction network to catalyze genomic interpretationget
發在NM上了,整合了PPI數據,可是好像好久沒更新了,也沒仔細去看它是怎麼設計的算法。
目前我是直接用的STRING數據庫。