tophat输出结果junction.bed
tophat输出结果junction.bed
| BED format |
|
BED format provides a flexible way to define the data lines that are displayed in an annotation track. BED lines have three required fields and nine additional optional fields. The number of fields per line must be consistent throughout any single set of data in an annotation track. The order of the optional fields is binding: lower-numbered fields must always be populated if higher-numbered fields are used. If your data set is BED-like, but it is very large and you would like to keep it on your own server, you should use the bigBed data format. The first three required BED fields are:
The 9 additional optional BED fields are:
Example: track name=pairedReads description="Clone Paired Reads" useScore=1 Example: browser position chr7:127471196-127495720 Example: browser position chr7:127471196-127495720 |
tophat输出结果junction.bed中定义的junction就是中间是intron两边是exon的这种序列结构, JUNC00000001包含两个exon和中间的intron
cat junctions.bed
track name=junctions description="TopHat junctions"
test_chromosome 177 400 JUNC00000001 61 + 177 400 255,0,0 2 73,50 0,173
test_chromosome 350 550 JUNC00000002 45 + 350 550 255,0,0 2 50,50 0,150

exon1: 177-250(display), 177-249(actual)
intron: 250-350(display), 250-349(actual)
exon2: 350-400(display), 350-399(actual)
chrom(test_chromosome): The name of the chromosome.
这里就是test_chromosome
chromStart(177): The starting position of the feature in the
chromosome or scaffold. The first base in a chromosome is numbered
0.
chromosome上feature(在这里就是junction)的起始位置
chromEnd(400): The ending position of the feature in the chromosome
or scaffold. The chromEnd base is not included in the display of
the feature. For example, the first 100 bases of a chromosome are
defined as chromStart=0, chromEnd=100, and span the bases numbered
0-99.
对应的终止位置
name(JUNC00000001): Defines the name of the BED line.
score(61): A score between 0 and 1000.
strand(+): Defines the strand: either '+' or '-'.
thickStart(177): The starting position at which the feature is
drawn thickly (for example, the start codon in gene
displays).
thickEnd(400): The ending position at which the feature is drawn
thickly (for example, the stop codon in gene displays).
itemRgb(255,0,0): An RGB value of the form R,G,B (e.g. 255,0,0). If
the track line itemRgb attribute is set to "On", this RBG value
will determine the display color of the data contained in this BED
line. NOTE: It is recommended that a simple color scheme (eight
colors or less) be used with this attribute to avoid overwhelming
the color resources of the Genome Browser and your Internet
browser.
blockCount(2): The number of blocks (exons) in the BED line.
外显子个数,2个
blockSizes(73,50): A comma-separated list of the block sizes. The
number of items in this list should correspond to
blockCount.外显子大小,两个数字用逗号隔开
blockStarts(0,173): A comma-separated list of block starts. All of
the blockStart positions should be calculated relative to
chromStart. The number of items in this list should correspond to
blockCount.
外显子起始位置,不从chromosome的起始开始,而是从chromStart开始数,也就是junction的起始位置开始计算,第一个碱基是0
用bed_to_junc转换后得到.juncs其中只保留了junction中intron的部分
cat new_list.juncs
test_chromosome 249 350 +
test_chromosome 399 500 +
<chrom> <left> <right> <+/->
http://genome.ucsc.edu/FAQ/FAQformat.html#format1
http://www.ensembl.org/info/website/upload/bed.html
http://zhongguozhanying.blog.163.com/blog/static/21618709520135484450746/
http://blog.sina.com.cn/s/blog_7cffd14001012rtd.html
https://github.com/WormBase/wormbase-pipeline/blob/master/scripts/align_RNASeq.pl
tophat输出结果junction.bed的更多相关文章
- 使用Tophat+cufflinks分析差异表达
使用Tophat+cufflinks分析差异表达 2017-06-15 19:09:43 522 0 0 使用TopHat+Cufflinks的流程图 序列的比对是RNA分析 ...
- RNA-seq连特异性
RNA-seq连特异性 Oct 15, 2015 The strandness of RNA-seq analysis 前段时间一直在研究关于illumina TrueSeq stranded RNA ...
- tophat的用法
概述:tophat是以bowtie2为核心的一款比对软件. tophat工作分两步: 1.将reads用bowtie比对到参考基因组上. 2.将unmapped-reads打断成更小的fragment ...
- TopHat
What is TopHat? TopHat is a program that aligns RNA-Seq reads to a genome in order to identify exon- ...
- RNA-seq差异表达基因分析之TopHat篇
RNA-seq差异表达基因分析之TopHat篇 发表于2012 年 10 月 23 日 TopHat是基于Bowtie的将RNA-Seq数据mapping到参考基因组上,从而鉴定可变剪切(exon-e ...
- poj 1041(字典序输出欧拉回路)
John's trip Time Limit: 1000MS Memory Limit: 65536K Total Submissions: 8641 Accepted: 2893 Spe ...
- 链终止法|边合成边测序|Bowtie|TopHat|Cufflinks|RPKM|FASTX-Toolkit|fastaQC|基因芯片|桥式扩增|
生物信息学 Sanger采用链终止法进行测序 带有荧光基团的ddXTP+其他四种普通的脱氧核苷酸放入同一个培养皿中,例如带有荧光基团的ddATP+普通的脱氧核苷酸A.T.C.G放入同一个培养皿,以此类 ...
- mapreduce多文件输出的两方法
mapreduce多文件输出的两方法 package duogemap; import java.io.IOException; import org.apache.hadoop.conf ...
- Android Studio 多个编译环境配置 多渠道打包 APK输出配置
看完这篇你学到什么: 熟悉gradle的构建配置 熟悉代码构建环境的目录结构,你知道的不仅仅是只有src/main 开发.生成环境等等环境可以任意切换打包 多渠道打包 APK输出文件配置 需求 一般我 ...
随机推荐
- 用PHP生成随机数的函数
转自:http://www.jbxue.com/article/5034.html 介绍:在早期的php中生成一个随机字符串时,总是先创建一个字符池,然后用一个循环和mt_rand()或rand()生 ...
- Think PHP 提示验证码输入错误
最近遇到一个项目中用的是Thinkphp这个框架开发的,其中在登录这块有验证码这个功能,其实这个功能是TP自带的,其中主要方法是buildImageVerify,位于ThinkPHP\Extend\L ...
- ubuntu安装QQ目前最完善的方法!(亲测,成功)
wine qq 2012 for linux Ubuntu 64位兼容(12月21日末日版) 由 smile » 2011-04-07 9:08 +-------------------------- ...
- QQ空间g_tk加密算法PHP版
QQ空间g_tk加密算法PHP版 //G_tk计算 function getGTK($skey){ $hash = 5381; for($i=0;$i<strlen($skey);++$i){ ...
- MSP430F4152串口操作
/**********************************************************************/ /* 名称:串口通讯 功能:将接到的数据组后原封不 ...
- 【Java】Linux下安装配置Oracle JDK 1.7版本
1 环境 Vmware虚拟机中的Ubuntu 12.04 32位系统 2具体安装步骤 ①下载最新的jdk包 注意jdk区分32位版本和64位版本,要与Ubuntu兼容才行 下载地址 http://ww ...
- c 递归函数浅析
所谓递归,简而言之就是应用程序自身调用自身,以实现层次数据结构的查询和访问. 递归的使用可以使代码更简洁清晰,可读性更好(对于初学者到不见得),但由于递归需要系统堆栈,所以空间消耗要比非递归代码要大很 ...
- Linux VM 设置静态ip地址上网
因为是路由器共享上网,VM每次都是通过DHCP方式自动获取ip地址,连接Linux VM时ip地址经常变,很麻烦.现在把VM设置静态ip的方法总结一下,以免以后忘了. 1. VM上网方式设置为桥接. ...
- IIS7 无法访问请求的页面,因为该页的相关配置数据无效
HTTP 错误 500.19 - Internal Server Error 无法访问请求的页面,因为该页的相关配置数据无效. 解决方案: C:\Windows\Microsoft.NET\Frame ...
- Xcode文档下载与安装路径
https://developer.apple.com/library/downloads/docset-index.dvtdownloadableindex ~/Library/Developer/ ...