1. What is KOBAS 3.0?


KOBAS (KEGG Orthology Based Annotation System) is a web server for gene/protein functional annotation (Annotation module) and functional set enrichment (Enrichment module). Given a set of genes or protein, it can determine whether a pathway, disease, and Gene Ontology(GO) term shows statistically significant. The last version of KOBAS, KOBAS 2.0, has abundant annotation information of gene sets from multiple databases covering pathways (KEGG PATHWAY, Reactome, Biocyc, Panther), diseases (KEGG DISEASE, OMIM, NHGRI GWAS Catalog), and GO terms, and more than 4,000 species are supported. Since KOBAS 2.0 is widely used by worldwide researchers, we update it to KOBAS 3.0, which supports more data formats as input and more accurate functional enrichment algorithms.

KOBAS 3.0 is composed by two function, Annotation and Enrichment, as follows:

1.1 Annotation

For Annotation module, it accepts gene/protein list as input, including IDs or sequences. And it generates annotations for each gene based on multiple databases about pathways, diseases, and Gene Ontology. That is, for each gene, you can find which pathways, diseases, and Gene Ontology are related to this gene.

1.2 Enrichment

Enrichment module gives you the answer of which pathways, diseases, and GO terms is statistically significant associated with the genes/proteins you just input.

For Enrichment module, there are two modules according to their differences in input format:

1.2.1 Gene list Enrichment

This module is called “Identify” in KOBAS 2.0. It accepts same input formats as Annotation module, and the results of Annotation module as input is also allowed (see details at 3.1). It is based on the first generation gene set enrichment method, a gene-level statistic called Overrepresentation Analysis(ORA), a simple and frequently used test based on the hypergeometric distribution. Many tools have applied this methods, such as DAVID. However, we support other distributions like binominal test, chi-square test, frequency list and 3 FDR correction methods, like Benjamini and Hochberg (1995), Benjamini and Yekutieli (2001), and QVALUE.

1.2.2 Exp-data Enrichment

This module is a new feature in KOBAS 3.0. Allowing the gene expression as input gives a big change for functional gene sets enrichment because it makes us be able to use set based second or net-based gene set enrichment method, which use the information of molecular measures where the ORA ignores. By considering the coordinated changes in gene expression, these methods account for dependence between genes in a pathway, which ORA does not.

This module has integrated 9 methods including set-based methods: Globaltest, GSEA, GSA PADOG, PLAGE, GAGE, SAFE and net-based methods: GANPA, CEPA.

Furthermore, to detect the enriched gene sets supported by multiple methods, Exp-data Enrichment module gives gene set enrichment score and probability of being enriched sets based on the results of 9 gene set enrichment(GSE) methods.

KOBAS的更多相关文章

  1. 使用GEO数据库来筛选差异表达基因,KOBAS进行KEGG注释分析

    前言 本文主要演示GEO数据库的一些工具,使用的数据是2015年在Nature Communications上发表的文章Regulation of autophagy and the ubiquiti ...

  2. KEGG and Gene Ontology Mapping in Bioinformatic Method

    使用KOBAS进行KEGG pathway和Gene Ontology分析 Article from Blog of Alfred-Feng http://blog.sina.com.cn/u/170 ...

  3. (转)基因芯片数据GO和KEGG功能分析

    随着人类基因组计划(Human Genome Project)即全部核苷酸测序的即将完成,人类基因组研究的重心逐渐进入后基因组时代(Postgenome Era),向基因的功能及基因的多样性倾斜.通过 ...

随机推荐

  1. distinct group by

    select num from test_test group by num; 比 select distinct(num) from test_test;  效率高 select count(dis ...

  2. [ilink32 Error] Fatal: Unable to open file 'RM_COMMON.OBJ'

    [ilink32 Error] Fatal: Unable to open file 'RM_COMMON.OBJ' [ilink32 Error] Fatal: Unable to open fil ...

  3. VB6 创建控制台应用程序

    ' 功能:为VB程序创建一个consolewindow.Private Declare Function AllocConsole Lib "kernel32" () As Lon ...

  4. AS3 - 数组元素乱序方法以及效率比较

    http://www.hangge.com/blog/cache/detail_453.html

  5. C# 通用方法

    一. /// <summary> /// 删除字符串中的中文 /// </summary> public static string Delete(string str) { ...

  6. taskset: 让进程运行在指定的CPU 上

    观察发现4核CPU,只有第1个核心(CPU#0)非常忙,其他都处于idle状态. 不了解Linux是如何调度的,但目前显然有优化的余地.除了处理正常任务,CPU#0还需要处理每秒网卡中断.因此,若能将 ...

  7. SQL语言类别

    SQL语言主要分为四大类:数据查询语言DQL,数据操纵语言DML, 数据定义语言DDL,数据控制语言DCL. DQL (data query language) DML(data manipulati ...

  8. TEXT 8 Ready, fire, aim

    TEXT 8 Ready, fire, aim 预备!开火!瞄准!! Feb 16th 2006 From The Economist print edition Foreword:A vice-pr ...

  9. Haskell语言学习笔记(56)Lens(3)

    手动计算(view, over, set, to, _1) view l = getConst . l Const over l f = runIdentity . l (Identity . f) ...

  10. MIUI添加内存调试工具:查看进程中的Bitmap信息

    Android开发中的内存管理一直是令人头痛的事情.其中占用内存最大的一般是Bitmap.   在上周五发布的MIUI开发版中,我添加了查看内存里Bitmap信息的功能.大家开发app的时候可以使用这 ...