We provide here detailed Description about the files outputted from the somatic mutation annotators via ANNOVAR and SnpEff.

    • *_annoTable.txt from the annotator via ANNOVAR
Column Names Description
Chr Chromosome number
Start Start position
End End position
Ref Reference base(s)
Alt Alternate non-reference alleles called on at least one of the samples
COSMIC ID COSMIC ID
Func.refGene Regions (e.g., exonic, intronic, non-coding RNA)) that one variant hits; please click here for details.
Gene.refGene Gene name associated with one variant
ExonicFunc.refGene Exonic variant function, e.g., nonsynonymous, synonymous, frameshift insertion.please click here for details.
AAChange.refGene Amino acid change. For example, SAMD11:NM_152486:exon10:c.T1027C:p.W343R stands for gene name, Known RefSeq accession, region, cDNA level change, protein level change.
SIFT_score SIFT score. See the dbNSFP information table for details.
SIFT_pred SIFT prediction. See the dbNSFP information table for details.
Polyphen2_HDIV_score Pholyphen2 score based on HDIV. See the dbNSFP information table for details.
Polyphen2_HDIV_pred Pholyphen2 prediction based on HDIV. See the dbNSFP information table for details.
Polyphen2_HVAR_score Polyphen2 score based on HVAR. See the dbNSFP information table for details.
Polyphen2_HVAR_pred Polyphen2 prediction based on HVAR. See the dbNSFP information table for details.
LRT_score LRT score. See the dbNSFP information table for details.
LRT_pred LRT prediction. See the dbNSFP information table for details.
MutationTaster_score MutationTaster score. See the dbNSFP information table for details.
MutationTaster_pred MutationTaster prediction. See the dbNSFP information table for details.
MutationAssessor_score MutationTaster score. See the dbNSFP information table for details.
MutationAssessor_pred MutationTaster prediction. See the dbNSFP information table for details.
FATHMM_score FATHMM score. See the dbNSFP information table for details.
FATHMM_pred FATHMM prediction. See the dbNSFP information table for details.
PROVEAN_score PROVEAN score<. See the dbNSFP information table for details./td>
PROVEAN_pred PROVEAN prediction. See the dbNSFP information table for details.
VEST3_score VEST V3 score. See the dbNSFP information table for details.
CADD_raw CADD raw score. See the dbNSFP information table for details.
CADD_phred CADD phred-like score. See the dbNSFP information table for details.
DANN_score DANN score. See the dbNSFP information table for details.
fathmm-MKL_coding_score fathmm-MKL score for one coding variant. See the dbNSFP information table for details.
fathmm-MKL_coding_pred fathmm-MKL prediction for one coding variant. See the dbNSFP information table for details.
MetaSVM_score MetaSVM score. See the dbNSFP information table for details.
MetaSVM_pred MetaSVM prediction. See the dbNSFP information table for details.
MetaLR_score MetaLR score. See the dbNSFP information table for details.
MetaLR_pred MetaLR prediction. See the dbNSFP information table for details.
integrated_fitCons_score fitCons score<. See the dbNSFP information table for details./td>
integrated_confidence_value confidence level. See the dbNSFP information table for details.
GERP++_RS GREP++ "rejected substitutions" (RS) score. See the dbNSFP information table for details.
phyloP7way_vertebrate Phylogenetic p-values for 7 vertebrate species. See the dbNSFP information table for details.
phyloP20way_mammalian Phylogenetic p-values for 20 mammalian species. See the dbNSFP information table for details.
phastCons7way_vertebrate PhastCons score for 7 vertebrate species. See the dbNSFP information table for details.
phastCons20way_mammalian phastCons p-values for 20 mammalian species. See the dbNSFP information table for details.
SiPhy_29way_logOdds SiPhy log odds score for 29 species. See the dbNSFP information table for details.
    • *_annoTable.txt from the annotator via SnpEff
Column Names Description
CHROM Chromosome number
POS Position
ID semi-colon separated list of unique identifiers where available. If this is a dbSNP variant it is encouraged to use the rs number(s).
REF Reference base(s)
ALT Alternate non-reference alleles called on at least one of the samples
EFFECT Functional consequences of one variant, e.g., missense_variant, synonymous_variant. please clickhere for details.
REGION Regions (e.g., exonic, intronic) that one variant hits
IMPACT Putative impact of the variant (e.g. HIGH, MODERATE or LOW impact).
GENE Gene name (usually HUGO)
GENEID Gene ID)
FEATURE The type of feature is in the next field (e.g. transcript, motif, miRNA, etc.)
FEATUREID Transcript ID (preferably using version number), Motif ID, miRNA, ChipSeq peak, Histone mark, depending on the annotation.
BIOTYPE Description on whether the transcript is {“Coding”, “Noncoding”}. Whenever possible, use ENSEMBL biotypes. .
HGVS_C Variant using HGVS notation (DNA level). For example, c.352A>G stands for A to G substitution of nucleotide 352. Click here for details.
HGVS_P Coding variant using HGVS notation (Protein level). For example, p.Ile118Val stands for Isoleucine at position number 66 substitution to Valine. p.Ile118Val can be also be represented by p.I118V using the 1-letter symbol here. Click here for details.
SIFT_score SIFT score. See the dbNSFP information table for details.
SIFT_pred SIFT prediction. See the dbNSFP information table for details.
Polyphen2_HDIV_score Pholyphen2 score based on HDIV. See the dbNSFP information table for details.
Polyphen2_HDIV_pred Pholyphen2 prediction based on HDIV. See the dbNSFP information table for details.
Polyphen2_HVAR_score Polyphen2 score based on HVAR. See the dbNSFP information table for details.
Polyphen2_HVAR_pred Polyphen2 prediction based on HVAR. See the dbNSFP information table for details.
LRT_score LRT score. See the dbNSFP information table for details.
LRT_pred LRT prediction. See the dbNSFP information table for details.
MutationTaster_score MutationTaster score. See the dbNSFP information table for details.
MutationTaster_pred MutationTaster prediction. See the dbNSFP information table for details.
MutationAssessor_score MutationAssessor score. See the dbNSFP information table for details.
MutationAssessor_pred MutationAssessor prediction. See the dbNSFP information table for details.
FATHMM_score FATHMM score. See the dbNSFP information table for details.
FATHMM_pred FATHMM prediction. See the dbNSFP information table for details.
PROVEAN_score PROVEAN score<. See the dbNSFP information table for details./td>
PROVEAN_pred PROVEAN prediction. See the dbNSFP information table for details.
VEST3_score VEST V3 score. See the dbNSFP information table for details.
CADD_raw CADD raw score. See the dbNSFP information table for details.
CADD_phred CADD phred-like score. See the dbNSFP information table for details.
MetaSVM_score MetaSVM score. See the dbNSFP information table for details.
MetaSVM_pred MetaSVM prediction. See the dbNSFP information table for details.
MetaLR_score MetaLR score. See the dbNSFP information table for details.
MetaLR_pred MetaLR prediction. See the dbNSFP information table for details.
GERP++_NR GREP++ conservation score. See the dbNSFP information table for details.
GERP++_RS GREP++ "rejected substitutions" (RS) score. See the dbNSFP information table for details.
phyloP100way_vertebrate Phylogenetic p-values for 100 vertebrate species. See the dbNSFP information table for details.
phastCons100way_vertebrate PhastCons score for 7 vertebrate species. See the dbNSFP information table for details.
SiPhy_29way_logOdds SiPhy log odds score for 29 species. See the dbNSFP information table for details.
    • *_genelist.txt from the annotators via ANNOVAR and SnpEff
Column Names Description
Gene Gene name associated with each variant; one gene name may correspond to several variants.
Mutations Amino acid change information. For example, SAMD11:NM_152486:exon10:c.T1027C:p.W343R stands for gene name, Known RefSeq accession, region, cDNA level change, protein level change..
    • dbNSFP Information
Columns of Annotations from dbNSFP Database Pediction Algorithm/Conservation Score Description Method Categorical Prediction Author(s)
SIFT_pred 
SIFT_score
SIFT Sort intolerated from tolerated P(An amino acid at a position is tolerated | The most frequentest amino acid being tolerated) D: Deleterious (sift<=0.05);
T: tolerated (sift>0.05)
Pauline Ng, Fred Hutchinson 
Cancer Research Center, Seattle, Washington
Polyphen2_HDIV_pred 
Polyphen2_HDIV_score
Polyphen v2 Polymorphism phenotyping v2 D: Probably damaging (>=0.957), 
P: possibly damaging (0.453<=pp2_hdiv<=0.956), 
B: benign (pp2_hdiv<=0.452)
Probablistic Classifier Training sets: HumDiv Havard Medical School/td>
Polyphen2_HVAR_pred
Polyphen2_HVAR_score
Polyphen v2 Polymorphism phenotyping v2 Machine learning Training sets: HumVar D: Probably damaging (>=0.957), 
P: possibly damaging (0.453<=pp2_hdiv<=0.956);
B: benign (pp2_hdiv<=0.452)
Shamil Sunyaev
Havard Medical School
LRT_pred 
LRT_score
LRT Likelihood ratio test LRT of H0: each codon evolves neutrally vs H1: the codon evovles under negative selection D: Deleterious; 
N: Neutral;
U: Unknown
Lower scores are more deleterious
Sung Chung, Justin Fay Washington University
MutationTaster_pred 
MutationTaster_score
MutationTaster Bayes Classifier A: (""disease_causing_automatic""); 
D: (""disease_causing""); 
N: (""polymorphism [probably harmless]""); 
P: (""polymorphism_automatic[known to be harmless]"
higher values are more deleterious"
  Markus Schuelke
the Charité - Universitätsmedizin Berlin
MutationAssessor_pred 
MutationAssessor_score
MutationAssessor Entropy of multiple sequence alighnment H: high; 
M: medium; 
L: low; 
N: neutral. 
H/M means functional and L/N means non-functional higher values are more deleterious
  Reva Boris
Computation Biology Center Memorial Sloan Kettering Cancer Center
FATHMM_pred 
FATHMM_score
FATHMM HMM Functional analysis through hidden markov model HMM D: Deleterious; 
T: Tolerated;
lower values are more deleterious
Shihab Hashem
University of Bristol, UK
PROVEAN_pred 
PROVEAN_score
  Protein Variation Effect Analyzer Clustering of homologus sequences D: Deleterious; 
N: Neutral
higher values are more deleterious
Choi Y J. Craig Venter Institute
VEST3_score VEST V3 Variant effect scoring tool Random forest classifier higher values are more deleterious Rachel Karchin John Hopkins University
CADD_raw CADD_phred CADD Combined annotation dependent depletion Linear kernel SVM higher values are more deleterious   Jay Shendure, Xiaohui Xie University of California - Irvine
DANN_score DANN Deleterious Annotation of genetic variants using Neural Networks Neural network higher values are more deleterious Jay Shendure, Xiaohui Xie
University of California - Irvine
fathmm-MKL_coding_pred FATHMM-MKL predicting the effects of both coding and non-coding variants using nucleotide-based HMMs Classifier based on multiple kernel learning D: Deleterious; 
T: Tolerated
Score >= 0.5: D; 
Score < 0.5: T
Shihab Hashem
University of Bristol, UK
MetaSVM_pred 
MetaSVM_score
MetaSVM Support vector machine D: Deleterious; T: Tolerated;
higher scores are more deleterious
  Coco Dong
USC Biostatiscs Department
MetaLR_pred 
MetaLR_score
MetaLR Logistic regression D: Deleterious; 
T: Tolerated; 
higher scores are more deleterious
  Coco Dong 
USC Biostatiscs Department
integrated_fitCons_score 
integrated_confidence_value
FitCons Fitness consequences of functional annotation Integrate functional assays like ChIP-Seq with conservation measure of transcription factor binding sites higher scores are more deleterious Abriza
Cold Spring Harbor Lab
GERP++_RS
GERP++_NR
Genome Evolutionary Rate Profiling ++ maximum likelihood estimation procedure higher scores are more deleterious   Eugne Davydov
Stanford University, CS Department
phyloP7way_vertebrate PhyloP Phylogentic p-values Phylogentic p-values calculated from a LRT, score-based test, GERP test Use 7 species higher scores are more deleterious Adam Siepel 
UCSC
phyloP20way_mammalian PhyloP Phylogentic p-values a phylogenetic hidden Markov model (phylo-HMM) Use 20 species higher scores are more deleterious Adam Siepel
UCSC
phastCons7way_vertebrate phastCons A phylogenetic hidden Markov model (phylo-HMM) Use 7 species higher scores are more deleterious   Adam Siepel
UCSC
phastCons20way_mammalian phastCons a phylogenetic hidden Markov model (phylo-HMM) Use 20 species higher scores are more deleterious   Adam Siepel
UCSC
SiPhy_29_way SiPhy Probablistic framework, HMM Use 29 species higher scores are more deleterious   Manual Garber
Broad Institute of MIT & Harvard

>

Questions? Kindly contact arraytools [at] emmes.com using the subject heading detailed information for outputted files from somatic mutation annotators.

原文链接地址:https://brb.nci.nih.gov/seqtools/colexpanno.html

Detailed Information for Outputted Files from Somatic Mutation Annotators(annovar 注释文件条目详细解释)的更多相关文章

  1. Debugging Information in Separate Files

    [Debugging Information in Separate Files] gdb allows you to put a program's debugging information in ...

  2. MCP|DYM|Quantitative mass spectrometry to interrogate proteomic heterogeneity in metastatic lung adenocarcinoma and validate a novel somatic mutation CDK12-G879V (利用定量质谱探究转移性肺腺瘤的蛋白质组异质性及验证新体细胞突变)

    文献名:Quantitative mass spectrometry to interrogate proteomic heterogeneity in metastatic lung adenoca ...

  3. somatic mutation体细胞变异检测文献分享--转载

    转载 :http://blog.sina.com.cn/s/blog_83f77c940102xuro.html Kalatskaya I, Trinh Q M, Spears M, et al. I ...

  4. The absolute uri: [http://java.sun.com/jsp/jstl/core] cannot be resolved in either web.xml or the jar files deployed with this application] with root cause异常处理及解释

    1.问题描述: 在web的jsp文件中想用jstl这个标准库,在运行的时候很自然的引用jar包如下: <dependency> <groupId>javax.servlet.j ...

  5. How to check type of files without extensions in python? 不通过文件扩展名,怎样知道文件类型?

    有一个命令 file 可以用 $ file fuck fuck.png: PNG image data, 1122 x 750, 8-bit colormap, non-interlaced pyth ...

  6. BlackArch-Tools

    BlackArch-Tools 简介 安装在ArchLinux之上添加存储库从blackarch存储库安装工具替代安装方法BlackArch Linux Complete Tools List 简介 ...

  7. xsltproc docbook 转 html

    /etc/xml/catalog <?xml version="1.0" encoding="UTF-8"?> <catalog xmlns= ...

  8. curl-手册

    Manual -- curl usage explained Related: Man Page FAQ LATEST VERSION   You always find news about wha ...

  9. 【ANT】Ant常用的内置task

    ant 例如: <target name="callProjectB"> <echo message="In projectA calling proj ...

随机推荐

  1. C#图解教程-方法参数笔记(上)

    一晃大学四年要过去了,期间乱点了很多技能点, 导致每一项技能都只是处于入门阶段.为了将C#作为我的主要技能,准备恶补相关姿势(知识),通过各种技术论坛的推荐,找到了<C#图解教程>这本书. ...

  2. 串口屏Modbus协议,串口屏的modbus协议资料,串口屏modbus通讯协议开发,串口屏之modbus协议使用技巧

    串口屏Modbus协议,串口屏的modbus协议资料,串口屏modbus通讯协议开发,串口屏之modbus协议使用技巧 本例程中用51单片机作为Modbus从机,从机的设备地址为2,从机有4个寄存器, ...

  3. angular ng-bind

    <body ng-app=""> <div ng-controller="firstController"> <input typ ...

  4. UITableView grouped样式使用探索

    UITableView的style有plain和grouped两种样式,两种样式各有不同的风格和功能,plain样式已经封装好了悬停功能,gouped样式则为我们在区头和区尾在实际项目开发中需要我们选 ...

  5. HDU 5008 求第k小子串

    本题要求第k小的distinct子串,可以根据height数组,二分出这个第k小子串所在后缀的位置信息.由于题目要求子串起始下标尽可能小.所以再在rank数组中,二分出与当前后缀LCP大于等于所求子串 ...

  6. Asp.NET MVC 之心跳/长连接

    0x01 在线用户类,我的用户唯一性由ID和类型识别(因为在不同的表里) public class UserIdentity : IEqualityComparer<UserIdentity&g ...

  7. Set ,List,ArrayList,LinkedList,Vectory,HashMap,Hashtable,HashSet,TreeSet,TreeSet

    Set与List区别: 两者都是接口,并继承Collection接口:List有序,允许重复:Set无序,不能重复: ArrayList与LinkList区别: ArrayList是动态数组,查询效率 ...

  8. 天方夜谈·数据结构·List

    在战场上杀不死的敌人,永远也别想打败他,他就像幽灵横亘在你失败的田地上. 大一下学期,接触到Java程序设计语言,时至今日,才越发觉得知识与技术的海洋是多么多么的浩瀚.......如果说编程语言的一个 ...

  9. bzoj4514 [Sdoi2016]数字配对

    Description 有 n 种数字,第 i 种数字是 ai.有 bi 个,权值是 ci. 若两个数字 ai.aj 满足,ai 是 aj 的倍数,且 ai/aj 是一个质数, 那么这两个数字可以配对 ...

  10. 《分布式Java应用之基础与实践》读书笔记四

    Java代码作为一门跨操作系统的语言,最终是运行在JVM中的,所以对于JVM的理解就变得非常重要了.整体上,我们可以从三个方面来深入理解JVM. Java代码的执行 内存管理 线程资源同步和交互机制 ...