Computational Assessment of the Expression-modulating Potential for Noncoding Variants
非编码变体表达调节潜力的计算评估
ノンコーディング変異体の発現調節能の計算による評価
비코딩 변이체에 대한 발현 조절 가능성의 전산 평가
Evaluación computacional del potencial modulador de expresión para variantes no codificantes
Évaluation informatique du potentiel de modulation de l'expression pour les variantes non codantes
Вычислительная оценка возможности модуляции выражения для вариантов без кодирования
Fang-Yuan Shi 史方圆 ¹, Yu Wang 王宇 ¹, Dong Huang 黄东 ², Yu Liang ³, Nan Liang 梁楠 ¹, Xiao-Wei Chen 陈晓伟 ² ⁴, Ge Gao 高歌 ¹
¹ Biomedical Pioneering Innovation Center (BIOPIC), Beijing Advanced Innovation Center for Genomics (ICG), Center for Bioinformatics (CBI), and State Key Laboratory of Protein and Plant Gene Research at School of Life Sciences, Peking University, Beijing 100871, China
中国 北京 北京大学生物医学前沿创新中心 北京未来基因诊断高精尖创新中心 北京大学生物信息中心 北京大学生命科学学院 蛋白质与植物基因研究国家重点实验室
² State Key Laboratory of Membrane Biology, Institute of Molecular Medicine, Peking University, Beijing 100871, China
中国 北京 北京大学分子医学研究所 生物膜国家重点实验室
³ Human Aging Research Institute, School of Life Science, Nanchang University, Nanchang 330031, China
中国 南昌 南昌大学生命科学学院人类衰老研究所
⁴ Peking-Tsinghua Center for Life Sciences, Academy for Advanced Interdisciplinary Studies, Peking University, Beijing 100871, China
中国 北京 清华大学-北京大学生命科学联合中心 北京大学前沿交叉学科研究院
Genomics, Proteomics & Bioinformatics, 7 December 2021
Abstract
Large-scale genome-wide association studies (GWAS) and expression quantitative trait loci (eQTLs) studies have identified multiple noncoding variants associated with genetic diseases by affecting gene expression. However, pinpointing causal variants effectively and efficiently remains a serious challenge.
Here, we developed CARMEN, a novel algorithm to identify functional noncoding expression-modulating variants. Multiple evaluations demonstrated CARMEN’s superior performance over state-of-the-art tools. Applying CARMEN to GWAS and eQTLs datasets further pinpoints several causal variants other than reported lead single-nucleotide polymorphisms (SNPs). CARMEN scales well with the massive datasets and is available online as a web server at http://carmen.gao-lab.org.