We provide here detailed Description about the files outputted from the somatic mutation annotators via ANNOVAR and SnpEff.

    • *_annoTable.txt from the annotator via ANNOVAR
Column Names Description
Chr Chromosome number
Start Start position
End End position
Ref Reference base(s)
Alt Alternate non-reference alleles called on at least one of the samples
COSMIC ID COSMIC ID
Func.refGene Regions (e.g., exonic, intronic, non-coding RNA)) that one variant hits; please click here for details.
Gene.refGene Gene name associated with one variant
ExonicFunc.refGene Exonic variant function, e.g., nonsynonymous, synonymous, frameshift insertion.please click here for details.
AAChange.refGene Amino acid change. For example, SAMD11:NM_152486:exon10:c.T1027C:p.W343R stands for gene name, Known RefSeq accession, region, cDNA level change, protein level change.
SIFT_score SIFT score. See the dbNSFP information table for details.
SIFT_pred SIFT prediction. See the dbNSFP information table for details.
Polyphen2_HDIV_score Pholyphen2 score based on HDIV. See the dbNSFP information table for details.
Polyphen2_HDIV_pred Pholyphen2 prediction based on HDIV. See the dbNSFP information table for details.
Polyphen2_HVAR_score Polyphen2 score based on HVAR. See the dbNSFP information table for details.
Polyphen2_HVAR_pred Polyphen2 prediction based on HVAR. See the dbNSFP information table for details.
LRT_score LRT score. See the dbNSFP information table for details.
LRT_pred LRT prediction. See the dbNSFP information table for details.
MutationTaster_score MutationTaster score. See the dbNSFP information table for details.
MutationTaster_pred MutationTaster prediction. See the dbNSFP information table for details.
MutationAssessor_score MutationTaster score. See the dbNSFP information table for details.
MutationAssessor_pred MutationTaster prediction. See the dbNSFP information table for details.
FATHMM_score FATHMM score. See the dbNSFP information table for details.
FATHMM_pred FATHMM prediction. See the dbNSFP information table for details.
PROVEAN_score PROVEAN score<. See the dbNSFP information table for details./td>
PROVEAN_pred PROVEAN prediction. See the dbNSFP information table for details.
VEST3_score VEST V3 score. See the dbNSFP information table for details.
CADD_raw CADD raw score. See the dbNSFP information table for details.
CADD_phred CADD phred-like score. See the dbNSFP information table for details.
DANN_score DANN score. See the dbNSFP information table for details.
fathmm-MKL_coding_score fathmm-MKL score for one coding variant. See the dbNSFP information table for details.
fathmm-MKL_coding_pred fathmm-MKL prediction for one coding variant. See the dbNSFP information table for details.
MetaSVM_score MetaSVM score. See the dbNSFP information table for details.
MetaSVM_pred MetaSVM prediction. See the dbNSFP information table for details.
MetaLR_score MetaLR score. See the dbNSFP information table for details.
MetaLR_pred MetaLR prediction. See the dbNSFP information table for details.
integrated_fitCons_score fitCons score<. See the dbNSFP information table for details./td>
integrated_confidence_value confidence level. See the dbNSFP information table for details.
GERP++_RS GREP++ "rejected substitutions" (RS) score. See the dbNSFP information table for details.
phyloP7way_vertebrate Phylogenetic p-values for 7 vertebrate species. See the dbNSFP information table for details.
phyloP20way_mammalian Phylogenetic p-values for 20 mammalian species. See the dbNSFP information table for details.
phastCons7way_vertebrate PhastCons score for 7 vertebrate species. See the dbNSFP information table for details.
phastCons20way_mammalian phastCons p-values for 20 mammalian species. See the dbNSFP information table for details.
SiPhy_29way_logOdds SiPhy log odds score for 29 species. See the dbNSFP information table for details.
    • *_annoTable.txt from the annotator via SnpEff
Column Names Description
CHROM Chromosome number
POS Position
ID semi-colon separated list of unique identifiers where available. If this is a dbSNP variant it is encouraged to use the rs number(s).
REF Reference base(s)
ALT Alternate non-reference alleles called on at least one of the samples
EFFECT Functional consequences of one variant, e.g., missense_variant, synonymous_variant. please clickhere for details.
REGION Regions (e.g., exonic, intronic) that one variant hits
IMPACT Putative impact of the variant (e.g. HIGH, MODERATE or LOW impact).
GENE Gene name (usually HUGO)
GENEID Gene ID)
FEATURE The type of feature is in the next field (e.g. transcript, motif, miRNA, etc.)
FEATUREID Transcript ID (preferably using version number), Motif ID, miRNA, ChipSeq peak, Histone mark, depending on the annotation.
BIOTYPE Description on whether the transcript is {“Coding”, “Noncoding”}. Whenever possible, use ENSEMBL biotypes. .
HGVS_C Variant using HGVS notation (DNA level). For example, c.352A>G stands for A to G substitution of nucleotide 352. Click here for details.
HGVS_P Coding variant using HGVS notation (Protein level). For example, p.Ile118Val stands for Isoleucine at position number 66 substitution to Valine. p.Ile118Val can be also be represented by p.I118V using the 1-letter symbol here. Click here for details.
SIFT_score SIFT score. See the dbNSFP information table for details.
SIFT_pred SIFT prediction. See the dbNSFP information table for details.
Polyphen2_HDIV_score Pholyphen2 score based on HDIV. See the dbNSFP information table for details.
Polyphen2_HDIV_pred Pholyphen2 prediction based on HDIV. See the dbNSFP information table for details.
Polyphen2_HVAR_score Polyphen2 score based on HVAR. See the dbNSFP information table for details.
Polyphen2_HVAR_pred Polyphen2 prediction based on HVAR. See the dbNSFP information table for details.
LRT_score LRT score. See the dbNSFP information table for details.
LRT_pred LRT prediction. See the dbNSFP information table for details.
MutationTaster_score MutationTaster score. See the dbNSFP information table for details.
MutationTaster_pred MutationTaster prediction. See the dbNSFP information table for details.
MutationAssessor_score MutationAssessor score. See the dbNSFP information table for details.
MutationAssessor_pred MutationAssessor prediction. See the dbNSFP information table for details.
FATHMM_score FATHMM score. See the dbNSFP information table for details.
FATHMM_pred FATHMM prediction. See the dbNSFP information table for details.
PROVEAN_score PROVEAN score<. See the dbNSFP information table for details./td>
PROVEAN_pred PROVEAN prediction. See the dbNSFP information table for details.
VEST3_score VEST V3 score. See the dbNSFP information table for details.
CADD_raw CADD raw score. See the dbNSFP information table for details.
CADD_phred CADD phred-like score. See the dbNSFP information table for details.
MetaSVM_score MetaSVM score. See the dbNSFP information table for details.
MetaSVM_pred MetaSVM prediction. See the dbNSFP information table for details.
MetaLR_score MetaLR score. See the dbNSFP information table for details.
MetaLR_pred MetaLR prediction. See the dbNSFP information table for details.
GERP++_NR GREP++ conservation score. See the dbNSFP information table for details.
GERP++_RS GREP++ "rejected substitutions" (RS) score. See the dbNSFP information table for details.
phyloP100way_vertebrate Phylogenetic p-values for 100 vertebrate species. See the dbNSFP information table for details.
phastCons100way_vertebrate PhastCons score for 7 vertebrate species. See the dbNSFP information table for details.
SiPhy_29way_logOdds SiPhy log odds score for 29 species. See the dbNSFP information table for details.
    • *_genelist.txt from the annotators via ANNOVAR and SnpEff
Column Names Description
Gene Gene name associated with each variant; one gene name may correspond to several variants.
Mutations Amino acid change information. For example, SAMD11:NM_152486:exon10:c.T1027C:p.W343R stands for gene name, Known RefSeq accession, region, cDNA level change, protein level change..
    • dbNSFP Information
Columns of Annotations from dbNSFP Database Pediction Algorithm/Conservation Score Description Method Categorical Prediction Author(s)
SIFT_pred 
SIFT_score
SIFT Sort intolerated from tolerated P(An amino acid at a position is tolerated | The most frequentest amino acid being tolerated) D: Deleterious (sift<=0.05);
T: tolerated (sift>0.05)
Pauline Ng, Fred Hutchinson 
Cancer Research Center, Seattle, Washington
Polyphen2_HDIV_pred 
Polyphen2_HDIV_score
Polyphen v2 Polymorphism phenotyping v2 D: Probably damaging (>=0.957), 
P: possibly damaging (0.453<=pp2_hdiv<=0.956), 
B: benign (pp2_hdiv<=0.452)
Probablistic Classifier Training sets: HumDiv Havard Medical School/td>
Polyphen2_HVAR_pred
Polyphen2_HVAR_score
Polyphen v2 Polymorphism phenotyping v2 Machine learning Training sets: HumVar D: Probably damaging (>=0.957), 
P: possibly damaging (0.453<=pp2_hdiv<=0.956);
B: benign (pp2_hdiv<=0.452)
Shamil Sunyaev
Havard Medical School
LRT_pred 
LRT_score
LRT Likelihood ratio test LRT of H0: each codon evolves neutrally vs H1: the codon evovles under negative selection D: Deleterious; 
N: Neutral;
U: Unknown
Lower scores are more deleterious
Sung Chung, Justin Fay Washington University
MutationTaster_pred 
MutationTaster_score
MutationTaster Bayes Classifier A: (""disease_causing_automatic""); 
D: (""disease_causing""); 
N: (""polymorphism [probably harmless]""); 
P: (""polymorphism_automatic[known to be harmless]"
higher values are more deleterious"
  Markus Schuelke
the Charité - Universitätsmedizin Berlin
MutationAssessor_pred 
MutationAssessor_score
MutationAssessor Entropy of multiple sequence alighnment H: high; 
M: medium; 
L: low; 
N: neutral. 
H/M means functional and L/N means non-functional higher values are more deleterious
  Reva Boris
Computation Biology Center Memorial Sloan Kettering Cancer Center
FATHMM_pred 
FATHMM_score
FATHMM HMM Functional analysis through hidden markov model HMM D: Deleterious; 
T: Tolerated;
lower values are more deleterious
Shihab Hashem
University of Bristol, UK
PROVEAN_pred 
PROVEAN_score
  Protein Variation Effect Analyzer Clustering of homologus sequences D: Deleterious; 
N: Neutral
higher values are more deleterious
Choi Y J. Craig Venter Institute
VEST3_score VEST V3 Variant effect scoring tool Random forest classifier higher values are more deleterious Rachel Karchin John Hopkins University
CADD_raw CADD_phred CADD Combined annotation dependent depletion Linear kernel SVM higher values are more deleterious   Jay Shendure, Xiaohui Xie University of California - Irvine
DANN_score DANN Deleterious Annotation of genetic variants using Neural Networks Neural network higher values are more deleterious Jay Shendure, Xiaohui Xie
University of California - Irvine
fathmm-MKL_coding_pred FATHMM-MKL predicting the effects of both coding and non-coding variants using nucleotide-based HMMs Classifier based on multiple kernel learning D: Deleterious; 
T: Tolerated
Score >= 0.5: D; 
Score < 0.5: T
Shihab Hashem
University of Bristol, UK
MetaSVM_pred 
MetaSVM_score
MetaSVM Support vector machine D: Deleterious; T: Tolerated;
higher scores are more deleterious
  Coco Dong
USC Biostatiscs Department
MetaLR_pred 
MetaLR_score
MetaLR Logistic regression D: Deleterious; 
T: Tolerated; 
higher scores are more deleterious
  Coco Dong 
USC Biostatiscs Department
integrated_fitCons_score 
integrated_confidence_value
FitCons Fitness consequences of functional annotation Integrate functional assays like ChIP-Seq with conservation measure of transcription factor binding sites higher scores are more deleterious Abriza
Cold Spring Harbor Lab
GERP++_RS
GERP++_NR
Genome Evolutionary Rate Profiling ++ maximum likelihood estimation procedure higher scores are more deleterious   Eugne Davydov
Stanford University, CS Department
phyloP7way_vertebrate PhyloP Phylogentic p-values Phylogentic p-values calculated from a LRT, score-based test, GERP test Use 7 species higher scores are more deleterious Adam Siepel 
UCSC
phyloP20way_mammalian PhyloP Phylogentic p-values a phylogenetic hidden Markov model (phylo-HMM) Use 20 species higher scores are more deleterious Adam Siepel
UCSC
phastCons7way_vertebrate phastCons A phylogenetic hidden Markov model (phylo-HMM) Use 7 species higher scores are more deleterious   Adam Siepel
UCSC
phastCons20way_mammalian phastCons a phylogenetic hidden Markov model (phylo-HMM) Use 20 species higher scores are more deleterious   Adam Siepel
UCSC
SiPhy_29_way SiPhy Probablistic framework, HMM Use 29 species higher scores are more deleterious   Manual Garber
Broad Institute of MIT & Harvard

>

Questions? Kindly contact arraytools [at] emmes.com using the subject heading detailed information for outputted files from somatic mutation annotators.

原文链接地址:https://brb.nci.nih.gov/seqtools/colexpanno.html

Detailed Information for Outputted Files from Somatic Mutation Annotators(annovar 注释文件条目详细解释)的更多相关文章

  1. Debugging Information in Separate Files

    [Debugging Information in Separate Files] gdb allows you to put a program's debugging information in ...

  2. MCP|DYM|Quantitative mass spectrometry to interrogate proteomic heterogeneity in metastatic lung adenocarcinoma and validate a novel somatic mutation CDK12-G879V (利用定量质谱探究转移性肺腺瘤的蛋白质组异质性及验证新体细胞突变)

    文献名:Quantitative mass spectrometry to interrogate proteomic heterogeneity in metastatic lung adenoca ...

  3. somatic mutation体细胞变异检测文献分享--转载

    转载 :http://blog.sina.com.cn/s/blog_83f77c940102xuro.html Kalatskaya I, Trinh Q M, Spears M, et al. I ...

  4. The absolute uri: [http://java.sun.com/jsp/jstl/core] cannot be resolved in either web.xml or the jar files deployed with this application] with root cause异常处理及解释

    1.问题描述: 在web的jsp文件中想用jstl这个标准库,在运行的时候很自然的引用jar包如下: <dependency> <groupId>javax.servlet.j ...

  5. How to check type of files without extensions in python? 不通过文件扩展名,怎样知道文件类型?

    有一个命令 file 可以用 $ file fuck fuck.png: PNG image data, 1122 x 750, 8-bit colormap, non-interlaced pyth ...

  6. BlackArch-Tools

    BlackArch-Tools 简介 安装在ArchLinux之上添加存储库从blackarch存储库安装工具替代安装方法BlackArch Linux Complete Tools List 简介 ...

  7. xsltproc docbook 转 html

    /etc/xml/catalog <?xml version="1.0" encoding="UTF-8"?> <catalog xmlns= ...

  8. curl-手册

    Manual -- curl usage explained Related: Man Page FAQ LATEST VERSION   You always find news about wha ...

  9. 【ANT】Ant常用的内置task

    ant 例如: <target name="callProjectB"> <echo message="In projectA calling proj ...

随机推荐

  1. 使用react native制作的一款网络音乐播放器

    使用react native制作的一款网络音乐播放器 基于第三方库 react-native-video设计"react-native-video": "^1.0.0&q ...

  2. Spring+SpringMVC+MyBatis+easyUI整合优化篇(九)数据层优化-jdbc连接池简述、druid简介

    日常啰嗦 终于回到既定轨道上了,这一篇讲讲数据库连接池的相关知识,线程池以后有机会再结合项目单独写篇文章(自己给自己挖坑,不知道什么时候能填上),从这一篇文章开始到本阶段结束的文章都会围绕数据库和da ...

  3. UPS对电源故障的处理能力

    UPS对电源故障的处理能力 双变换在线式UPS由于其逆变器实时在线工作,因而能对所有的电源故障具有隔离和处理功能.由于目前电网情况发生了很大变化,真正的长时间断电只占所有电源故障的30%甚至更低,而非 ...

  4. 一、iOS中的事件可以分为3大类型

    触摸事件加速计事件远程控制事件 响应者对象在iOS中不是任何对象都能处理事件,只有继承了UIResponder的对象才能接收并处理事件.我们称之为"响应者对象" UIApplica ...

  5. 输入一个数字n 如果n为偶数则除以2,若为奇数则加1或者减1,直到n为1,求最少次数 写出一个函数

    题目: 输入一个数字n  如果n为偶数则除以2,若为奇数则加1或者减1,直到n为1,求最少次数  写出一个函数 首先,这道题肯定可以用动态规划来解, n为整数时,n的解为 n/2 的解加1 n为奇数时 ...

  6. CF #edu 11 C. Hard Process

    题目链接:http://codeforces.com/problemset/problem/660/C 大意是给一个01数组,至多可以将k个0变为1,问最后数组中最长能有多少个连续的1,并输出. 问题 ...

  7. effective c++ 思维导图

    历时两个多月的时间,终于把effective c++又复习了一遍,比较慢,看的是英文版,之前看的时候做过一些笔记,但不够详细,这次笔者是从头到尾的翻译了一遍,加了一些标题,先记录到word里面,然后发 ...

  8. webrtc学习笔记1(建立连接基本流程)

    最近在做一个基于webrtc的视频软件,以下是自己对于上层建立通话连接流程的基本理解,记录于此. 假设A和B要建立视频通话,A为房间创建端,B为加入房间端: 1.A通过http登录.获取其他服务器地址 ...

  9. Javascript一道面试题

    实现一个函数,运算结果可以满足如下预期结果: add(1)(2) // 3add(1, 2, 3)(10) // 16 add(1)(2)(3)(4)(5) // 15 function add () ...

  10. 学习《ASP.NET MVC5高级编程》——基架

    基架--代码生成的模板.我姑且这么去定义它,在我学习微软向编程之前从未听说过,比如php代码,大部分情况下是我用vim去手写而成,重复使用的代码需要复制粘贴,即使后来我在使用eclipse这样的IDE ...