bedtools: a tool you will use every day
Detailed documentation: http://bedtools.readthedocs.org/en/latest/
Collectively, the bedtools utilities are a Swiss-army knife of tools for a wide range of genomics analysis tasks. The most widely used tools enable genome arithmetic: that is, set theory on the genome. For example, bedtools allows one to intersect, merge, count, complement, and shuffle genomic intervals from multiple files in widely used genomic file formats such as BAM, BED, GFF/GTF, and VCF. While each individual tool is designed to do a relatively simple task (e.g., intersect two interval files), quite sophisticated analyses can be conducted by combining multiple bedtools operations on the UNIX command line.
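As a rough illustration of combining several bedtools operations on the command line, the sketch below sorts and merges one interval file and then intersects the result with a second file. The two BED files and their coordinates are made up for this example and are not from the original post; the script skips the bedtools step if bedtools is not on the PATH.

```shell
#!/usr/bin/env bash
# Sketch: chaining bedtools operations on hypothetical toy data.
set -euo pipefail

workdir=$(mktemp -d)

# Two tiny, made-up BED files (tab-separated: chrom, start, end).
printf 'chr1\t100\t200\nchr1\t180\t260\nchr1\t300\t400\n' > "$workdir/a.bed"
printf 'chr1\t150\t350\n' > "$workdir/b.bed"

if command -v bedtools >/dev/null 2>&1; then
    # Sort A, merge its overlapping intervals, then keep only the
    # portions of the merged intervals that overlap B.
    bedtools sort -i "$workdir/a.bed" \
        | bedtools merge -i - \
        | bedtools intersect -a - -b "$workdir/b.bed"
else
    echo "bedtools not found on PATH; skipping" >&2
fi
```

Each tool reads the previous tool's output from standard input (the `-` argument), so no intermediate files are needed.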
Summary of available tools.
bedtools supports a wide range of operations for interrogating and manipulating genomic features. The table below summarizes the tools available in the suite.
| Utility | Description |
|---|---|
| annotate | Annotate coverage of features from multiple files. |
| bamtobed | Convert BAM alignments to BED (& other) formats. |
| bamtofastq | Convert BAM records to FASTQ records. |
| bed12tobed6 | Breaks BED12 intervals into discrete BED6 intervals. |
| bedpetobam | Convert BEDPE intervals to BAM records. |
| bedtobam | Convert intervals to BAM records. |
| closest | Find the closest, potentially non-overlapping interval. |
| cluster | Cluster (but don’t merge) overlapping/nearby intervals. |
| complement | Extract intervals _not_ represented by an interval file. |
| coverage | Compute the coverage over defined intervals. |
| expand | Replicate lines based on lists of values in columns. |
| flank | Create new intervals from the flanks of existing intervals. |
| genomecov | Compute the coverage over an entire genome. |
| getfasta | Use intervals to extract sequences from a FASTA file. |
| groupby | Group by common cols. & summarize oth. cols. (~ SQL “groupBy”) |
| igv | Create an IGV snapshot batch script. |
| intersect | Find overlapping intervals in various ways. |
| jaccard | Calculate the Jaccard statistic b/w two sets of intervals. |
| links | Create a HTML page of links to UCSC locations. |
| makewindows | Make interval “windows” across a genome. |
| map | Apply a function to a column for each overlapping interval. |
| maskfasta | Use intervals to mask sequences from a FASTA file. |
| merge | Combine overlapping/nearby intervals into a single interval. |
| multicov | Counts coverage from multiple BAMs at specific intervals. |
| multiinter | Identifies common intervals among multiple interval files. |
| nuc | Profile the nucleotide content of intervals in a FASTA file. |
| overlap | Computes the amount of overlap from two intervals. |
| pairtobed | Find pairs that overlap intervals in various ways. |
| pairtopair | Find pairs that overlap other pairs in various ways. |
| random | Generate random intervals in a genome. |
| reldist | Calculate the distribution of relative distances b/w two files. |
| shuffle | Randomly redistribute intervals in a genome. |
| slop | Adjust the size of intervals. |
| sort | Order the intervals in a file. |
| subtract | Remove intervals based on overlaps b/w two files. |
| tag | Tag BAM alignments based on overlaps with interval files. |
| unionbedg | Combines coverage intervals from multiple BEDGRAPH files. |
| window | Find overlapping intervals within a window around an interval. |
Installation: yum install BEDTools
1. Convert a BAM file (TopHat output) to FASTQ
First, merge the accepted_hits.bam and unmapped.bam files produced by the alignment:
samtools merge RC6-1_ATTCCT_L005.bam accepted_hits.bam unmapped.bam
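This produces the merged file RC6-1_ATTCCT_L005.bam.
Sort this BAM file by read name. The command below uses the samtools 0.1.18 syntax, in which the last argument is an output prefix; newer samtools releases take the output file via -o instead: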
samtools_0.1.18 sort -n RC6-1_ATTCCT_L005.bam RC6-1_ATTCCT_L005.sorted
This produces RC6-1_ATTCCT_L005.sorted.bam.
Finally, convert it with bedtools:
bedtools bamtofastq -i RC6-1_ATTCCT_L005.sorted.bam -fq RC6-1_ATTCCT_L005_R1.fastq -fq2 RC6-1_ATTCCT_L005_R2.fastq
This yields the paired-end FASTQ files.
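The three steps above can be collected into one script. This is a minimal sketch, not a definitive pipeline: the `run` helper, the `bam_to_fastq` function, and the `SAMPLE` and `DRY_RUN` variables are hypothetical names introduced here. By default the script only prints the commands; set `DRY_RUN=0` to execute them. The sort step keeps the legacy samtools 0.1.x prefix syntax used in this post.

```shell
#!/usr/bin/env bash
# Sketch: merge, name-sort, and convert TopHat BAM output to paired FASTQ.
set -euo pipefail

sample="${SAMPLE:-RC6-1_ATTCCT_L005}"   # sample prefix used in this post
DRY_RUN="${DRY_RUN:-1}"                 # 1 = only print the commands

run() {
    # Print the command; execute it only when DRY_RUN is 0.
    echo "+ $*"
    if [ "$DRY_RUN" = "0" ]; then
        "$@"
    fi
}

bam_to_fastq() {
    # Step 1: merge mapped and unmapped reads into one BAM file.
    run samtools merge "${sample}.bam" accepted_hits.bam unmapped.bam
    # Step 2: sort by read name (legacy syntax: last argument is a prefix).
    run samtools sort -n "${sample}.bam" "${sample}.sorted"
    # Step 3: write mate 1 and mate 2 to separate FASTQ files.
    run bedtools bamtofastq -i "${sample}.sorted.bam" \
        -fq "${sample}_R1.fastq" -fq2 "${sample}_R2.fastq"
}

bam_to_fastq
```

Sorting by read name before the conversion matters because bamtofastq with -fq2 expects the two mates of each pair to be adjacent in the BAM file.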