Detailed Information for Outputted Files from Somatic Mutation Annotators(annovar 注释文件条目详细解释)
We provide here detailed Description about the files outputted from the somatic mutation annotators via ANNOVAR and SnpEff.
*_annoTable.txt
from the annotator via ANNOVAR
Column Names | Description |
---|---|
Chr | Chromosome number |
Start | Start position |
End | End position |
Ref | Reference base(s) |
Alt | Alternate non-reference alleles called on at least one of the samples |
COSMIC ID | COSMIC ID |
Func.refGene | Regions (e.g., exonic, intronic, non-coding RNA)) that one variant hits; please click here for details. |
Gene.refGene | Gene name associated with one variant |
ExonicFunc.refGene | Exonic variant function, e.g., nonsynonymous, synonymous, frameshift insertion.please click here for details. |
AAChange.refGene | Amino acid change. For example, SAMD11:NM_152486:exon10:c.T1027C:p.W343R stands for gene name, Known RefSeq accession, region, cDNA level change, protein level change. |
SIFT_score | SIFT score. See the dbNSFP information table for details. |
SIFT_pred | SIFT prediction. See the dbNSFP information table for details. |
Polyphen2_HDIV_score | Pholyphen2 score based on HDIV. See the dbNSFP information table for details. |
Polyphen2_HDIV_pred | Pholyphen2 prediction based on HDIV. See the dbNSFP information table for details. |
Polyphen2_HVAR_score | Polyphen2 score based on HVAR. See the dbNSFP information table for details. |
Polyphen2_HVAR_pred | Polyphen2 prediction based on HVAR. See the dbNSFP information table for details. |
LRT_score | LRT score. See the dbNSFP information table for details. |
LRT_pred | LRT prediction. See the dbNSFP information table for details. |
MutationTaster_score | MutationTaster score. See the dbNSFP information table for details. |
MutationTaster_pred | MutationTaster prediction. See the dbNSFP information table for details. |
MutationAssessor_score | MutationTaster score. See the dbNSFP information table for details. |
MutationAssessor_pred | MutationTaster prediction. See the dbNSFP information table for details. |
FATHMM_score | FATHMM score. See the dbNSFP information table for details. |
FATHMM_pred | FATHMM prediction. See the dbNSFP information table for details. |
PROVEAN_score | PROVEAN score<. See the dbNSFP information table for details./td> |
PROVEAN_pred | PROVEAN prediction. See the dbNSFP information table for details. |
VEST3_score | VEST V3 score. See the dbNSFP information table for details. |
CADD_raw | CADD raw score. See the dbNSFP information table for details. |
CADD_phred | CADD phred-like score. See the dbNSFP information table for details. |
DANN_score | DANN score. See the dbNSFP information table for details. |
fathmm-MKL_coding_score | fathmm-MKL score for one coding variant. See the dbNSFP information table for details. |
fathmm-MKL_coding_pred | fathmm-MKL prediction for one coding variant. See the dbNSFP information table for details. |
MetaSVM_score | MetaSVM score. See the dbNSFP information table for details. |
MetaSVM_pred | MetaSVM prediction. See the dbNSFP information table for details. |
MetaLR_score | MetaLR score. See the dbNSFP information table for details. |
MetaLR_pred | MetaLR prediction. See the dbNSFP information table for details. |
integrated_fitCons_score | fitCons score<. See the dbNSFP information table for details./td> |
integrated_confidence_value | confidence level. See the dbNSFP information table for details. |
GERP++_RS | GREP++ "rejected substitutions" (RS) score. See the dbNSFP information table for details. |
phyloP7way_vertebrate | Phylogenetic p-values for 7 vertebrate species. See the dbNSFP information table for details. |
phyloP20way_mammalian | Phylogenetic p-values for 20 mammalian species. See the dbNSFP information table for details. |
phastCons7way_vertebrate | PhastCons score for 7 vertebrate species. See the dbNSFP information table for details. |
phastCons20way_mammalian | phastCons p-values for 20 mammalian species. See the dbNSFP information table for details. |
SiPhy_29way_logOdds | SiPhy log odds score for 29 species. See the dbNSFP information table for details. |
*_annoTable.txt
from the annotator via SnpEff
Column Names | Description |
---|---|
CHROM | Chromosome number |
POS | Position |
ID | semi-colon separated list of unique identifiers where available. If this is a dbSNP variant it is encouraged to use the rs number(s). |
REF | Reference base(s) |
ALT | Alternate non-reference alleles called on at least one of the samples |
EFFECT | Functional consequences of one variant, e.g., missense_variant, synonymous_variant. please clickhere for details. |
REGION | Regions (e.g., exonic, intronic) that one variant hits |
IMPACT | Putative impact of the variant (e.g. HIGH, MODERATE or LOW impact). |
GENE | Gene name (usually HUGO) |
GENEID | Gene ID) |
FEATURE | The type of feature is in the next field (e.g. transcript, motif, miRNA, etc.) |
FEATUREID | Transcript ID (preferably using version number), Motif ID, miRNA, ChipSeq peak, Histone mark, depending on the annotation. |
BIOTYPE | Description on whether the transcript is {“Coding”, “Noncoding”}. Whenever possible, use ENSEMBL biotypes. . |
HGVS_C | Variant using HGVS notation (DNA level). For example, c.352A>G stands for A to G substitution of nucleotide 352. Click here for details. |
HGVS_P | Coding variant using HGVS notation (Protein level). For example, p.Ile118Val stands for Isoleucine at position number 66 substitution to Valine. p.Ile118Val can be also be represented by p.I118V using the 1-letter symbol here. Click here for details. |
SIFT_score | SIFT score. See the dbNSFP information table for details. |
SIFT_pred | SIFT prediction. See the dbNSFP information table for details. |
Polyphen2_HDIV_score | Pholyphen2 score based on HDIV. See the dbNSFP information table for details. |
Polyphen2_HDIV_pred | Pholyphen2 prediction based on HDIV. See the dbNSFP information table for details. |
Polyphen2_HVAR_score | Polyphen2 score based on HVAR. See the dbNSFP information table for details. |
Polyphen2_HVAR_pred | Polyphen2 prediction based on HVAR. See the dbNSFP information table for details. |
LRT_score | LRT score. See the dbNSFP information table for details. |
LRT_pred | LRT prediction. See the dbNSFP information table for details. |
MutationTaster_score | MutationTaster score. See the dbNSFP information table for details. |
MutationTaster_pred | MutationTaster prediction. See the dbNSFP information table for details. |
MutationAssessor_score | MutationAssessor score. See the dbNSFP information table for details. |
MutationAssessor_pred | MutationAssessor prediction. See the dbNSFP information table for details. |
FATHMM_score | FATHMM score. See the dbNSFP information table for details. |
FATHMM_pred | FATHMM prediction. See the dbNSFP information table for details. |
PROVEAN_score | PROVEAN score<. See the dbNSFP information table for details./td> |
PROVEAN_pred | PROVEAN prediction. See the dbNSFP information table for details. |
VEST3_score | VEST V3 score. See the dbNSFP information table for details. |
CADD_raw | CADD raw score. See the dbNSFP information table for details. |
CADD_phred | CADD phred-like score. See the dbNSFP information table for details. |
MetaSVM_score | MetaSVM score. See the dbNSFP information table for details. |
MetaSVM_pred | MetaSVM prediction. See the dbNSFP information table for details. |
MetaLR_score | MetaLR score. See the dbNSFP information table for details. |
MetaLR_pred | MetaLR prediction. See the dbNSFP information table for details. |
GERP++_NR | GREP++ conservation score. See the dbNSFP information table for details. |
GERP++_RS | GREP++ "rejected substitutions" (RS) score. See the dbNSFP information table for details. |
phyloP100way_vertebrate | Phylogenetic p-values for 100 vertebrate species. See the dbNSFP information table for details. |
phastCons100way_vertebrate | PhastCons score for 7 vertebrate species. See the dbNSFP information table for details. |
SiPhy_29way_logOdds | SiPhy log odds score for 29 species. See the dbNSFP information table for details. |
*_genelist.txt
from the annotators via ANNOVAR and SnpEff
Column Names | Description |
---|---|
Gene | Gene name associated with each variant; one gene name may correspond to several variants. |
Mutations | Amino acid change information. For example, SAMD11:NM_152486:exon10:c.T1027C:p.W343R stands for gene name, Known RefSeq accession, region, cDNA level change, protein level change.. |
- dbNSFP Information
Columns of Annotations from dbNSFP Database | Pediction Algorithm/Conservation Score | Description | Method | Categorical Prediction | Author(s) |
---|---|---|---|---|---|
SIFT_pred SIFT_score |
SIFT | Sort intolerated from tolerated | P(An amino acid at a position is tolerated | The most frequentest amino acid being tolerated) | D: Deleterious (sift<=0.05); T: tolerated (sift>0.05) |
Pauline Ng, Fred Hutchinson Cancer Research Center, Seattle, Washington |
Polyphen2_HDIV_pred Polyphen2_HDIV_score |
Polyphen v2 | Polymorphism phenotyping v2 | D: Probably damaging (>=0.957), P: possibly damaging (0.453<=pp2_hdiv<=0.956), B: benign (pp2_hdiv<=0.452) |
Probablistic Classifier Training sets: HumDiv | Havard Medical School/td> |
Polyphen2_HVAR_pred Polyphen2_HVAR_score |
Polyphen v2 | Polymorphism phenotyping v2 | Machine learning Training sets: HumVar | D: Probably damaging (>=0.957), P: possibly damaging (0.453<=pp2_hdiv<=0.956); B: benign (pp2_hdiv<=0.452) |
Shamil Sunyaev Havard Medical School |
LRT_pred LRT_score |
LRT | Likelihood ratio test | LRT of H0: each codon evolves neutrally vs H1: the codon evovles under negative selection | D: Deleterious; N: Neutral; U: Unknown Lower scores are more deleterious |
Sung Chung, Justin Fay Washington University |
MutationTaster_pred MutationTaster_score |
MutationTaster | Bayes Classifier | A: (""disease_causing_automatic""); D: (""disease_causing""); N: (""polymorphism [probably harmless]""); P: (""polymorphism_automatic[known to be harmless]" higher values are more deleterious" |
Markus Schuelke the Charité - Universitätsmedizin Berlin |
|
MutationAssessor_pred MutationAssessor_score |
MutationAssessor | Entropy of multiple sequence alighnment | H: high; M: medium; L: low; N: neutral. H/M means functional and L/N means non-functional higher values are more deleterious |
Reva Boris Computation Biology Center Memorial Sloan Kettering Cancer Center |
|
FATHMM_pred FATHMM_score |
FATHMM | HMM | Functional analysis through hidden markov model HMM | D: Deleterious; T: Tolerated; lower values are more deleterious |
Shihab Hashem University of Bristol, UK |
PROVEAN_pred PROVEAN_score |
Protein Variation Effect Analyzer | Clustering of homologus sequences | D: Deleterious; N: Neutral higher values are more deleterious |
Choi Y J. Craig Venter Institute | |
VEST3_score | VEST V3 | Variant effect scoring tool | Random forest classifier | higher values are more deleterious | Rachel Karchin John Hopkins University |
CADD_raw CADD_phred | CADD Combined annotation dependent depletion | Linear kernel SVM | higher values are more deleterious | Jay Shendure, Xiaohui Xie University of California - Irvine | |
DANN_score | DANN | Deleterious Annotation of genetic variants using Neural Networks | Neural network | higher values are more deleterious | Jay Shendure, Xiaohui Xie University of California - Irvine |
fathmm-MKL_coding_pred | FATHMM-MKL | predicting the effects of both coding and non-coding variants using nucleotide-based HMMs | Classifier based on multiple kernel learning | D: Deleterious; T: Tolerated Score >= 0.5: D; Score < 0.5: T |
Shihab Hashem University of Bristol, UK |
MetaSVM_pred MetaSVM_score |
MetaSVM | Support vector machine | D: Deleterious; T: Tolerated; higher scores are more deleterious |
Coco Dong USC Biostatiscs Department |
|
MetaLR_pred MetaLR_score |
MetaLR | Logistic regression | D: Deleterious; T: Tolerated; higher scores are more deleterious |
Coco Dong USC Biostatiscs Department |
|
integrated_fitCons_score integrated_confidence_value |
FitCons | Fitness consequences of functional annotation | Integrate functional assays like ChIP-Seq with conservation measure of transcription factor binding sites | higher scores are more deleterious | Abriza Cold Spring Harbor Lab |
GERP++_RS GERP++_NR |
Genome Evolutionary Rate Profiling ++ | maximum likelihood estimation procedure | higher scores are more deleterious | Eugne Davydov Stanford University, CS Department |
|
phyloP7way_vertebrate | PhyloP | Phylogentic p-values | Phylogentic p-values calculated from a LRT, score-based test, GERP test Use 7 species | higher scores are more deleterious | Adam Siepel UCSC |
phyloP20way_mammalian | PhyloP | Phylogentic p-values | a phylogenetic hidden Markov model (phylo-HMM) Use 20 species | higher scores are more deleterious | Adam Siepel UCSC |
phastCons7way_vertebrate | phastCons | A phylogenetic hidden Markov model (phylo-HMM) Use 7 species | higher scores are more deleterious | Adam Siepel UCSC |
|
phastCons20way_mammalian | phastCons | a phylogenetic hidden Markov model (phylo-HMM) Use 20 species | higher scores are more deleterious | Adam Siepel UCSC |
|
SiPhy_29_way | SiPhy | Probablistic framework, HMM Use 29 species | higher scores are more deleterious | Manual Garber Broad Institute of MIT & Harvard |
>
Questions? Kindly contact arraytools [at] emmes.com
using the subject heading detailed information for outputted files from somatic mutation annotators
.
原文链接地址:https://brb.nci.nih.gov/seqtools/colexpanno.html
Detailed Information for Outputted Files from Somatic Mutation Annotators(annovar 注释文件条目详细解释)的更多相关文章
- Debugging Information in Separate Files
[Debugging Information in Separate Files] gdb allows you to put a program's debugging information in ...
- MCP|DYM|Quantitative mass spectrometry to interrogate proteomic heterogeneity in metastatic lung adenocarcinoma and validate a novel somatic mutation CDK12-G879V (利用定量质谱探究转移性肺腺瘤的蛋白质组异质性及验证新体细胞突变)
文献名:Quantitative mass spectrometry to interrogate proteomic heterogeneity in metastatic lung adenoca ...
- somatic mutation体细胞变异检测文献分享--转载
转载 :http://blog.sina.com.cn/s/blog_83f77c940102xuro.html Kalatskaya I, Trinh Q M, Spears M, et al. I ...
- The absolute uri: [http://java.sun.com/jsp/jstl/core] cannot be resolved in either web.xml or the jar files deployed with this application] with root cause异常处理及解释
1.问题描述: 在web的jsp文件中想用jstl这个标准库,在运行的时候很自然的引用jar包如下: <dependency> <groupId>javax.servlet.j ...
- How to check type of files without extensions in python? 不通过文件扩展名,怎样知道文件类型?
有一个命令 file 可以用 $ file fuck fuck.png: PNG image data, 1122 x 750, 8-bit colormap, non-interlaced pyth ...
- BlackArch-Tools
BlackArch-Tools 简介 安装在ArchLinux之上添加存储库从blackarch存储库安装工具替代安装方法BlackArch Linux Complete Tools List 简介 ...
- xsltproc docbook 转 html
/etc/xml/catalog <?xml version="1.0" encoding="UTF-8"?> <catalog xmlns= ...
- curl-手册
Manual -- curl usage explained Related: Man Page FAQ LATEST VERSION You always find news about wha ...
- 【ANT】Ant常用的内置task
ant 例如: <target name="callProjectB"> <echo message="In projectA calling proj ...
随机推荐
- 使用react native制作的一款网络音乐播放器
使用react native制作的一款网络音乐播放器 基于第三方库 react-native-video设计"react-native-video": "^1.0.0&q ...
- Spring+SpringMVC+MyBatis+easyUI整合优化篇(九)数据层优化-jdbc连接池简述、druid简介
日常啰嗦 终于回到既定轨道上了,这一篇讲讲数据库连接池的相关知识,线程池以后有机会再结合项目单独写篇文章(自己给自己挖坑,不知道什么时候能填上),从这一篇文章开始到本阶段结束的文章都会围绕数据库和da ...
- UPS对电源故障的处理能力
UPS对电源故障的处理能力 双变换在线式UPS由于其逆变器实时在线工作,因而能对所有的电源故障具有隔离和处理功能.由于目前电网情况发生了很大变化,真正的长时间断电只占所有电源故障的30%甚至更低,而非 ...
- 一、iOS中的事件可以分为3大类型
触摸事件加速计事件远程控制事件 响应者对象在iOS中不是任何对象都能处理事件,只有继承了UIResponder的对象才能接收并处理事件.我们称之为"响应者对象" UIApplica ...
- 输入一个数字n 如果n为偶数则除以2,若为奇数则加1或者减1,直到n为1,求最少次数 写出一个函数
题目: 输入一个数字n 如果n为偶数则除以2,若为奇数则加1或者减1,直到n为1,求最少次数 写出一个函数 首先,这道题肯定可以用动态规划来解, n为整数时,n的解为 n/2 的解加1 n为奇数时 ...
- CF #edu 11 C. Hard Process
题目链接:http://codeforces.com/problemset/problem/660/C 大意是给一个01数组,至多可以将k个0变为1,问最后数组中最长能有多少个连续的1,并输出. 问题 ...
- effective c++ 思维导图
历时两个多月的时间,终于把effective c++又复习了一遍,比较慢,看的是英文版,之前看的时候做过一些笔记,但不够详细,这次笔者是从头到尾的翻译了一遍,加了一些标题,先记录到word里面,然后发 ...
- webrtc学习笔记1(建立连接基本流程)
最近在做一个基于webrtc的视频软件,以下是自己对于上层建立通话连接流程的基本理解,记录于此. 假设A和B要建立视频通话,A为房间创建端,B为加入房间端: 1.A通过http登录.获取其他服务器地址 ...
- Javascript一道面试题
实现一个函数,运算结果可以满足如下预期结果: add(1)(2) // 3add(1, 2, 3)(10) // 16 add(1)(2)(3)(4)(5) // 15 function add () ...
- 学习《ASP.NET MVC5高级编程》——基架
基架--代码生成的模板.我姑且这么去定义它,在我学习微软向编程之前从未听说过,比如php代码,大部分情况下是我用vim去手写而成,重复使用的代码需要复制粘贴,即使后来我在使用eclipse这样的IDE ...