https://www.biostars.org/p/15890/

 
 
71
 
5.9 years ago by
Washington University School of Medicine, St. Louis, USA

Does anyone know of a good source of human genes that act as a tumor suppressor or oncogene? The source can be a database or data mining approach that queries a more general database.

Some possible features of a 'good' source:

  • More than just a list of 'cancer genes'. One abstract with the name of the gene and the word cancer in it does not make it a convincing cancer gene. But a more sophisticated text-mining based approach would be acceptable
  • The gene will be annotated as a tumor suppressor or oncogene with additional information on how this classification is justified
  • Other relevant annotation such as whether the gene is involved in DNA repair, apoptosis, etc.
  • Up to date. A spreadsheet from 10 years ago is less useful than a routinely updated source.
  • Free and open source. Although if you know of commercial options please suggest them.

Here are some of the things I have found so far:

  • The Wikipedia entries above list categories of oncogenes .. but not all RTKs for example will necessarily act as an oncogene
  • The Gene Ontology used to have a term 'Tumor Suppressor' but this has been superseded by the term 'regulation of cell cycle'. A gene involved in regulation of the cell cycle may generally have the potential to function as a tumor suppressor but it would be nice to know which had been demonstrated to do so and how. A combination of GO terms and evidence codes might be acceptable if someone wishes to elaborate on this approach
  • UniProtKB has a keyword 'Proto-oncogene' associated with 560 genes and a keyword 'Tumor Suppressor' associated with 631 genes.
  • A compilation of cancer gene lists from the 'Bushman Lab'
  • The Sanger Cancer Gene Census
  • A more empirical approach might involve using patterns of somatic mutation across many cancers to identify likely tumor suppressors and oncogenes. Where tumor suppressors are expected to be characterized by loss-of-function mutations (copy number deletions, nonsense, or missense mutations spread across multiple sites in the gene) and oncogenes would be characterized by recurrent mutation sites (amplifications, mutation hotspots, gene fusions involving a particular gene partner, etc.). COSMIC is already working along these lines but if there are others, please post.
  • The Cancer HotSpots Resource performs an analysis that looks for recurrently mutated cancer hotspots by mining tumor sequence data. These hotspots can be indicative that a gene is an Oncogene.
  • A variety of older websites that list tumor suppressors: TSGDBTumor Gene Database, and TAG

Useful suggestions gathered from below (refer there for more details):

Sources you could mine to develop your own lists:

Organizations that are annotating druggable/actionable genes:

Some relevant posts:

ADD COMMENT • link •

Not following 

modified 11 months ago by Min • 60 • written 5.9 years ago by Malachi Griffith ♦ 16k

 
2

I feel like, right now, the best answer to this question is the Cancer Gene Census. They currently provide a TSV download of their complete list of 567 genes with nearly all being indicated as oncogene and/or tumor suppressor (TSG).

ADD REPLY • linkwritten 3 months ago by Obi Griffith ♦ 16k
 
1

I doubt that you have missed a useful resource given your fairly extensive groundwork. Interesting questions though, particular if there is no good solution to the problem yet.

ADD REPLY • linkwritten 5.9 years ago by Spitshine • 620
 
1

haha I compiled that "Bushman Lab" list back in 2005. NCI was very disorganized - there was no Cancer Gene Index or lists to speak of. Today I think it would almost be easier to assemble a list of "non-cancer genes", since there are so many passenger mutations that can occur.

ADD REPLY • linkwritten 5.9 years ago by Jeremy Leipzig ♦ 17k
 
1

Yeah the passenger mutation comment is a good one. This is why I am interested in more context. As you say, virtually every gene cited in more than 5 papers is a 'cancer gene' and the remainder just aren't a cancer gene yet. What we need is more information on how each gene is related to cancer initiation, progression, metastasis, response to treatment, etc., etc.

ADD REPLY • linkwritten 5.9 years ago by Malachi Griffith ♦ 16k
 
6
 
4.8 years ago by
Travis • 2.7k
USA

I realize this is an old thread but still a very relevant question. The following resource ticks most boxes for tumor suppressor genes. It is a recent (NAR) published database resource, complete with literature evidence, downloadable data and a number of nice web-based resources like the ability to view Kegg pathways with tumor suppressors highlighted.

TSGene: http://bioinfo.mc.vanderbilt.edu/TSGene/

ADD COMMENT • linkmodified 4.8 years ago • written 4.8 years ago by Travis • 2.7k
 
4
 
5.9 years ago by
Sean Davis ♦ 23k
National Institutes of Health, Bethesda, MD

An additional resource is the NCI Cancer Gene Index.

ADD COMMENT • linkwritten 5.9 years ago by Sean Davis ♦ 23k
 

Looks promising. From the website: "The goal of the Cancer Gene Index is to further translational cancer research by providing a high quality data resource consisting of genes that have been experimentally associated with human cancer diseases and/or pharmacological compounds, the evidence of these associations, and relevant annotations on the data. This extremely valuable resource was created through a unique process that coupled automated linguistic text analysis of millions of MEDLINE abstracts with manual validation and annotation of the extracted data by expert human curators."

ADD REPLY • linkwritten 5.9 years ago by Malachi Griffith ♦ 16k
 
4
 
5.8 years ago by
Obi Griffith ♦ 16k
Washington University, St Louis, USA

The Cancer Genes database is produced by MSKCC and has a nice interface with which you can do a very simple query and get a list of 873 tumor suppressor genes and 495 oncogenes with associated gene IDs and GO categories. But it does not meet your criteria for stringency as tumor suppressors are determined by a simple term query of Entrez Gene.

What about text-mining something like OMIM or even the rapidly improving gene pages in Wikipedia. This might get you better quality than going straight at the literature because a lot of manual/expert curation has already been put in. And, if you have a big over-representation of the term "Tumor Suppressor" in an OMIM record or the "Role in Disease" section of a gene's wikipage its probably a decent candidate.

See this OMIM search with genes sorted by relevance.
http://omim.org/search?index=entry&start=1&limit=10&search=tumor+suppressor&sort=score+desc%2C+prefix_sort+desc

Its just too bad that these manually curated resources don't also have more controlled data entry mechanisms where the curators could indicate that it was a tumor suppressor and provide evidence through a controlled vocabulary. Maybe someone is aware of such efforts?

ADD COMMENT • linkmodified 5.8 years ago • written 5.8 years ago by Obi Griffith ♦ 16k
 

Unfortunately, it seems the Cancer Genes database has gone dark. I have tried several times in recent weeks and always get a 404 error. While searching for its new home I did find this list of cancer genes lists which is used as a gene ranker for GBM. I'm not sure how current it is. I feel like the you might also be able to get at cancer gene lists through the cBio Portal somehow but have not figured out how yet.

ADD REPLY • linkmodified 3.7 years ago • written 3.7 years ago by Obi Griffith ♦ 16k
 

The 404 points you at this old paper version:http://nar.oxfordjournals.org/content/35/suppl_1/D721.full

ADD REPLY • linkwritten 11 months ago by Malachi Griffith ♦ 16k
 
3
 
3.7 years ago by
Obi Griffith ♦ 16k
Washington University, St Louis, USA

Another option is the Network of Cancer Genes. According to the site, NCG is a:

Manually curated list of 2,000 protein-coding cancer genes and 64 OncomiRs. Cancer genes are genes with a driver role in the onset of human cancer upon mutations of their sequence and/or amplifications of their genomic locus. 537 are known cancer genes from the Cancer Gene Census and from a list of genes that undergo cancer-specific amplifications. Their involvement in cancer is documented in the literature. 1463 are candidate cancer genes that derive from the manual curation of 77 whole genome or whole exome cancer-resequencing screenings. Their involvement in cancer is inferred with various statistical methods.

ADD COMMENT • linkmodified 3.7 years ago • written 3.7 years ago by Obi Griffith ♦ 16k
 
2
 
5.7 years ago by
Jiri Voller • 20
 

NCI Cancer Gene Index looks great, seems that a lot of effort was invested into curation of mined cancer related sentences. But the problem is that the project ended in 2009. In fact I got here, when I was looking for its successors....

ADD COMMENT • linkwritten 5.7 years ago by Jiri Voller • 20
 
2
 
4.7 years ago by
henryvuong • 700
USA

There is a nice table of oncogenes and tumor suppressor genes from this publication (Supplementary material) Vogelstein, B., Papadopoulos, N., Velculescu, V., Zhou, S., Diaz, L. & Kinzler, K. Cancer genome landscapes. Science (New York, N.Y.) 339, 1546–58 (2013).http://www.sciencemag.org/content/339/6127/1546.full

ADD COMMENT • linkwritten 4.7 years ago by henryvuong • 700
 
2
 
11 months ago by
Min • 60
 

http://ongene.bioinfo-minzhao.org/ The literature based oncogene list, the same author of TSGene.

ADD COMMENT • linkwritten 11 months ago by Min • 60
 
1
 
5.9 years ago by
Boston, MA USA

The NCI Cancer Gene Index is good +1. This was developed with software from Biomax and you can find information about the collaboration between NCI and Biomax as well as details on the dataset here.

ADD COMMENT • linkwritten 5.9 years ago by Larry_Parnell ♦ 16k
 
1

Anyone knows how to get this list of tumour suppressor genes from NCI Cancer gene index? I've been reading and trying but the interface of searching is not really firendly

ADD REPLY • link

Question: Database Of Tumor Suppressors And/Or Oncogenes的更多相关文章

  1. Mol Cell Proteomics. |彭建祥| 人胃肠道间质瘤亚群蛋白质组图谱

    大家好,本周分享的是发表在Molecular & Cellular Proteomics 上的一篇关于人胃肠道间质瘤亚群蛋白质组图谱的文章,题目是Proteomic maps of human ...

  2. sns社区架构设计案例分享(二)

    源码下载地址:http://www.jinhusns.com/Products/Download/?type=xcj 五. 架构使用说明 > 缓存 > 使用说明 > (一)基础类库介 ...

  3. DataBase -- Employees Earning More Than Their Managers My Submissions Question

    Question: The Employee table holds all employees including their managers. Every employee has an Id, ...

  4. mysql-databaseython 3.4.0 with MySQL database

    Phttp://shttp://stackoverflow.com/questions/23376103/python-3-4-0-with-mysql-databasetackoverflow.co ...

  5. most queries (more than 90 percent) never hit the database at all but only touch the cache layer

    https://gigaom.com/2011/12/06/facebook-shares-some-secrets-on-making-mysql-scale/ Facebook shares so ...

  6. How do you build a database?

    在reddit上看到的一篇讲解数据库实现的文章,非常有意思,在这里记录一下. 回答者technical_guy: Its a great question, and deserves a long a ...

  7. Why GUID primary keys are a database’s worst nightmare

    http://csharptest.net/1250/why-guid-primary-keys-are-a-databases-worst-nightmare/ When you ask most ...

  8. P6 EPPM Manual Installation Guide (Oracle Database)

    P6 EPPM Manual Installation Guide (Oracle Database) P6 EPPM Manual Installation Guide (Oracle Databa ...

  9. P6 Professional Installation and Configuration Guide (Microsoft SQL Server Database) 16 R1

    P6 Professional Installation and Configuration Guide (Microsoft SQL Server Database) 16 R1       May ...

随机推荐

  1. 实用的IOS应用程序框架

    实用的IOS应用程序框架 目录 概述 概述

  2. 【BZOJ3943】[Usaco2015 Feb]SuperBull 最大生成树

    [BZOJ3943][Usaco2015 Feb]SuperBull Description Bessie and her friends are playing hoofball in the an ...

  3. php 自带的过滤函数和转义函数

    函数名 释义 介绍 htmlspecialchars 将与.单双引号.大于和小于号化成HTML格式 &转成&"转成"' 转成'<转成<>转成> ...

  4. Redis 缓存穿透,缓存击穿,缓存雪崩的解决方案分析

    设计一个缓存系统,不得不要考虑的问题就是:缓存穿透.缓存击穿与失效时的雪崩效应. 一.什么样的数据适合缓存? 分析一个数据是否适合缓存,我们要从访问频率.读写比例.数据一致性等要求去分析.  二.什么 ...

  5. arpa/inet.h所引起的Segmentation fault及网络编程常见的头文件

    最近在学习Linux网络编程方面的知识,感觉还是有些困难.主要是对协议过程的理解,还有socket的API的理解不够深刻.今天复习编写了一个TCP的服务端和客户端的程序实现client.c从命令行参数 ...

  6. 由SOAP说开去 - - 谈谈WebServices、RMI、RPC、SOA、REST、XML、JSON

    引子: 关于SOAP其实我一直模模糊糊不太理解,这种模模糊糊的感觉表述起来是这样: 在使用web服务时(功能接口),本来我就可以通过安卓中固有的http类(使用http协议),来发送http请求,并且 ...

  7. SQL Server查看库、表占用空间大小

    转自:https://blog.csdn.net/yenange/article/details/50493580 查询数据文件与日志文件占用情况,查看数据大小,查看库大小 1. 查看数据文件占用(权 ...

  8. mysql 数据操作 多表查询 多表连接查询 内连接

    内连接:只连接匹配的行 只取两张表共同的部分,相当于利用where 过滤条件从笛卡尔积结果中筛选出了正确的结果 select * from 左表 inner join 要连接的表 on 条件 #dep ...

  9. rtcp多媒体控制协议应用

    rtcp package send/recv demo main.c #include <stdio.h> #include <rtp.h> #include <rtcp ...

  10. Deep Learning(2)

    二.Deep Learning的基本思想和方法 实际生活中,人们为了解决一个问题,如对象的分类(对象可是是文档.图像等),首先必须做的事情是如何来表达一个对象,即必须抽取一些特征来表示一个对象,如文本 ...