bedtools 每天都会用到的工具
详细的使用说明:http://bedtools.readthedocs.org/en/latest/
Collectively, the bedtools utilities are a swiss-army knife of tools for a wide-range of genomics analysis tasks. The most widely-used tools enable genome arithmetic: that is, set theory on the genome. For example, bedtools allows one to intersect, merge, count, complement, and shuffle genomic intervals from multiple files in widely-used genomic file formats such as BAM, BED, GFF/GTF, VCF. While each individual tool is designed to do a relatively simple task (e.g., intersect two interval files), quite sophisticated analyses can be conducted by combining multiple bedtools operations on the UNIX command line.
Summary of available tools.
bedtools support a wide range of operations for interrogating and manipulating genomic features. The table below summarizes the tools available in the suite.
| Utility | Description |
|---|---|
| annotate | Annotate coverage of features from multiple files. |
| bamtobed | Convert BAM alignments to BED (& other) formats. |
| bamtofastq | Convert BAM records to FASTQ records. |
| bed12tobed6 | Breaks BED12 intervals into discrete BED6 intervals. |
| bedpetobam | Convert BEDPE intervals to BAM records. |
| bedtobam | Convert intervals to BAM records. |
| closest | Find the closest, potentially non-overlapping interval. |
| cluster | Cluster (but don’t merge) overlapping/nearby intervals. |
| complement | Extract intervals _not_ represented by an interval file. |
| coverage | Compute the coverage over defined intervals. |
| expand | Replicate lines based on lists of values in columns. |
| flank | Create new intervals from the flanks of existing intervals. |
| genomecov | Compute the coverage over an entire genome. |
| getfasta | Use intervals to extract sequences from a FASTA file. |
| groupby | Group by common cols. & summarize oth. cols. (~ SQL “groupBy”) |
| igv | Create an IGV snapshot batch script. |
| intersect | Find overlapping intervals in various ways. |
| jaccard | Calculate the Jaccard statistic b/w two sets of intervals. |
| links | Create a HTML page of links to UCSC locations. |
| makewindows | Make interval “windows” across a genome. |
| map | Apply a function to a column for each overlapping interval. |
| maskfasta | Use intervals to mask sequences from a FASTA file. |
| merge | Combine overlapping/nearby intervals into a single interval. |
| multicov | Counts coverage from multiple BAMs at specific intervals. |
| multiinter | Identifies common intervals among multiple interval files. |
| nuc | Profile the nucleotide content of intervals in a FASTA file. |
| overlap | Computes the amount of overlap from two intervals. |
| pairtobed | Find pairs that overlap intervals in various ways. |
| pairtopair | Find pairs that overlap other pairs in various ways. |
| random | Generate random intervals in a genome. |
| reldist | Calculate the distribution of relative distances b/w two files. |
| shuffle | Randomly redistribute intervals in a genome. |
| slop | Adjust the size of intervals. |
| sort | Order the intervals in a file. |
| subtract | Remove intervals based on overlaps b/w two files. |
| tag | Tag BAM alignments based on overlaps with interval files. |
| unionbedg | Combines coverage intervals from multiple BEDGRAPH files. |
| window |
Find overlapping intervals within a window around an interval. |
安装: yum install BEDTools
1, 将bam文件(tophat得到的结果)转化为fastq
先将比对得到的accepted_hits.bam和unmapped.bam合并
samtools merge RC6-1_ATTCCT_L005.bam accepted_hits.bam unmapped.bam
得到合并后的RC6-1_ATTCCT_L005.bam文件
将该bam文件按照reads名称排序:
samtools_0.1.18 sort -n RC6-1_ATTCCT_L005.bam RC6-1_ATTCCT_L005.sorted
得到RC6-1_ATTCCT_L005.sorted.bam文件
最后用bedtools转化
bedtools bamtofastq -i RC6-1_ATTCCT_L005.sorted.bam -fq RC6-1_ATTCCT_L005_R1.fastq -fq2 RC6-1_ATTCCT_L005_R2.fastq
得到双端的fastq文件。
bedtools 每天都会用到的工具的更多相关文章
- 价值1400美元的CEH(道德黑客)认证培训课程长啥样?(3)工具集
美元的CEH(道德黑客)认证培训课程长啥样?(3)工具集 这是我收到的CEH官方发来的邮件,参加CEH认证培训原价为1424.25刀,可以给我便宜到1282刀.只有一个感觉,心在流血.站在这价值120 ...
- JMeter 的调式工具
任何的编程工具都会相应的调式工具,JMeter的调式 工具主要有五种: 1.查看结果树:含请求信息.响应信息等 2.HTTP 镜像服务器:HTTP Mirror Server用于查看请求信息 3.De ...
- 教你用Windows自带工具给优盘/移动硬盘添加密码
教你用Windows自带工具给优盘/移动硬盘添加密码 本文中优盘,移动硬盘和分区操作方式一样,为方便描述,下文将只说优盘 优盘成了很多人每天都会用到的工具,有时候自己优盘会存着一些不希望别人看到的文件 ...
- 轻量级ORM工具Simple.Data
今天推举的这篇文章,本意不是要推举文章的内容,而是据此介绍一下Simple.Data这个很有意思的类ORM工具. 现在大家在.NET开发中如果需要进行数据访问,那么基本都会使用一些ORM工具,比如微软 ...
- 使用redux-devtools工具
在vue中型项目开发的过程中,一般都是要用到vuex这个状态管理工具的,这样可以方便我们管理全局的状态,同时,为了在开发的过程中,更加方便地实时查看到state状态,我们会使用 vue-devtool ...
- Linux常用网络工具:路由扫描之traceroute
之前两篇<Linux常用网络工具:fping主机扫描>和<Linux常用网络工具:hping高级主机扫描>都是关于主机扫描的,本篇介绍Linux下常用的路由扫描工具tracer ...
- 拍拍贷投资工具|拍拍贷投标工具|PPD投标工具|PPD投资工具介绍
我们先来分析一下现在市场上在PPD投资的途径: 其他解决方案 1.在网站或者手机客户端手动投标 这种方法对于非常小额的资金是可以的,稍微多一点就会发现不可行,目前PPD手动刷新出来的标几乎都是你刚刷新 ...
- iOS包管理工具Cocoapods的安装与使用
转自:http://www.sxt.cn/u/10014/blog/6448 在我们开发移动应用的时候,一般都会使用到第三方工具,而由于第三方类库的种类繁多,我们在项目中进行管理也会相对麻烦,所以此时 ...
- PMP-番外篇-PMP工具与技术目录
########################################################### 这里先总结所有工具和技术,让大家有一个整体的概念. 也可以当作一个工具和技术查询 ...
随机推荐
- sharepoint2010无法创建网站集
出现以上错误,查看IIS中有关Sharepoint的网站中的“身份验证”中ASP.Net模拟是否为禁用,如果为禁用,请启用即可.
- System.Web.Optimization找不到引用
在程序包管理控制程序中录入:Install-Package Microsoft.AspNet.Web.Optimization,安装即可.
- spring mvc上传图片
1.需要commons-fileupload.jar commons-io.jar 2.需要在springmvc.xml中 配置存放静态资源的路径,对图片等静态资源放行 <mvc:resourc ...
- linux 磁盘管理以及维护
Linux系统中,进行频繁的读写操作,容易发送只读.以及磁盘损坏等故障.下文为其解决方案: 1.如何界定磁盘已经存在故障 方法一(界定将如下内容另存为Repair.sh然后执行即可): #!/bin/ ...
- 配置coffeeScript
1.安装好node.js后 在系统环境变量自动会设置好: 我安装在D:\Program Files文件夹中 也安装好了npm(node packges manager) 2.系统会自动配置np ...
- eclipse关联tomcat并且部署java web应用程序
http://www.ibm.com/developerworks/cn/opensource/os-eclipse-tomcat/
- 一个app中保持程序全屏的方法。
public void toggleFullscreen(boolean fullScreen) { //fullScreen为true时全屏 WindowManager.LayoutParams a ...
- WEBService动态调用代码
BasicHttpBinding bind = new BasicHttpBinding(); bind.MaxReceivedMessageSize = int.MaxValue; Endpoint ...
- CSS最常用和实用的技巧
1.重置浏览器的字体大小重置浏览器的默认值 ,然后重设浏览器的字体大小你可以使用雅虎的用户界面重置的CSS方案 ,如果你不想下载9MB的文件,代码如下: body,div,dl,dt,dd,ul,ol ...
- Oracle实现自增方式:序列+触发器
Oracle不能像MySQL那样设置主键自增,Oracle用 <序列+触发器>的方式使数据表的一列或多列实现自增 序列sequence+触发器trigger:实现数据表S_DEPART中的 ...