Extracting info from VCF files
R, Bioconductor
filterVcf: Extract Variants of Interest from a Large VCF File (Paul Shannon)
We demonstrate three methods:  filtering by genomic region,  filtering on attributes of
each specific variant call, and intersecting with known regions of interest (exons, splice
sites, regulatory regions, etc.).
http://www.bioconductor.org/packages/release/bioc/vignettes/VariantAnnotation/inst/doc/filterVcf.pdf
Java
SelectVariants -- Select a subset of variants from a larger callset ( GATK SelectVariants )
Often, a VCF containing many samples and/or variants will need to be subset in order to facilitate certain analyses (e.g. comparing and contrasting cases vs. controls; extracting variant or non-variant loci that meet certain requirements, displaying just a few samples in a browser like IGV, etc.). SelectVariants can be used for this purpose.
Biostars
Question: How To Split Multiple Samples In Vcf File Generated By Gatk?
I did variant calling using BWA + PiCard + GATK and have just got the filtered VCF files from GATK. In the process of running GATK, I used list of inputs (11 samples) and for most steps, I had only one output file for each step. Now, I got two VCF files (one for SNPs and the other is for indels), each of which contains 11 samples. I can see the names of the 11 samples in the header of vcf files, and each sample seems to have one column of data. So I am wondering how to split each VCF files into individual sample vcf files?
https://www.biostars.org/p/78929/
bcftools
for file in *.vcf*; do
for sample in `bcftools view -h $file | grep "^#CHROM" | cut -f10-`; do
bcftools view -c1 -Oz -s $sample -o ${file/.vcf*/.$sample.vcf.gz} $file
done
done
https://www.biostars.org/p/12535/#115691
vcf-subset
vcf-subset -c S1 bigfile.vcf > S1.vcf
https://www.biostars.org/p/78929/
http://campagnelab.org/software/goby/reference-documentation/modes/vcf-subset/
REF:
http://samtools.github.io/hts-specs/VCFv4.2.pdf
Extracting info from VCF files的更多相关文章
- 将vcf文件转化为plink格式并且保持phasing状态
		VCFtools can convert VCF files into formats convenient for use in other programs. One such example i ... 
- 【Bcftools】合并不同sample的vcf文件,通过bcftools
		通过GATK calling出来的SNP如果使用UnifiedGenotype获得的SNP文件是分sample的,但是如果使用vcftools或者ANGSD则需要Vcf文件是multi-sample的 ... 
- iCloud无法导入vCard问题。fix the error when import vcard/vcf to icloud.
		问题描述:当登录icloud.com,进入通讯录的时候,导入VCF格式的联系人的时候会报错.如图: 1.从outlook的联系人中选一个联系人,导出联系人卡片-vCard文件 (如果是塞班手机,可以用 ... 
- Linux command line exercises for NGS data processing
		by Umer Zeeshan Ijaz The purpose of this tutorial is to introduce students to the frequently used to ... 
- Awesome C/C++
		Awesome C/C++ A curated list of awesome C/C++ frameworks, libraries, resources, and shiny things. In ... 
- autodock 结果pdb的生成
		Is there a way to save a protein-ligand complex as a PDB file in AutoDock? I have completed my docki ... 
- Gumshoe - Microsoft Code Coverage Test Toolset
		Gumshoe - Microsoft Code Coverage Test Toolset 2014-07-17 What is Gumshoe? How to instrument a binar ... 
- C/C++ 框架,类库,资源集合
		很棒的 C/C++ 框架,类库,资源集合. Awesome C/C++ Standard Libraries Frameworks Artificial Intelligence Asynchrono ... 
- awesome cpp
		https://github.com/fffaraz/awesome-cpp Awesome C/C++ A curated list of awesome C/C++ frameworks, lib ... 
随机推荐
- IOS控件:计算文字长度(UITextField,UILabel对象 和 IBAction)
			#import <UIKit/UIKit.h> // UIViewController类为程序提供了基本的视图管理模块 @interface NavControllerViewContro ... 
- 红黑树C++实现
			1 /* 2 * rbtree.h 3 * 1. 每个节点是红色或者黑色 4 * 2. 根节点是黑色 5 * 3. 每个叶子节点是黑色(该叶子节点就空的节点) 6 * 4. 如果一个节点是红色,则它的 ... 
- 分布式服务框架dubbo入门实例
			dubbo是一个分布式的服务架构,可直接用于生产环境作为SOA服务框架. 官网首页:http://dubbo.io/ ,官方用户指南 http://dubbo.io/User+Guide-zh.htm ... 
- 理解PHP中会话控制
			如果以前没有接触过建站或网络编程,只是从头开始学PHP,以及用PHP来建立动态站点,那么会话(SESSION)对于初学者就有点难理解.那么到底什么是会话呢?理解一个概念需要从它产生的背景或问题出发,所 ... 
- ES6学习之Babel的正确安装姿势
			开始学习ES6,写点东西放上博客^_^ 本文介绍Babel6.x的安装过程~ 首先呢,可以使用Babel在线转换 https://babeljs.io/repl/ 然后进入主题:安装Babel(命令行 ... 
- 第二课——解析mysqldump命令和mysqlbinlog命令+innodb和Myisam存储引擎简介
			环境说明 mysql版本:Percona-Server-5.6.30 IP:10.7.15.167 端口:3306 安装目录:/httx/run/mysql 数据目录:/httx/run/mysql/ ... 
- A4纸网页打印中对应像素的设定和换算
			最近开发项目时遇到了网页打印的问题,这是问题之二,打印宽度设置 在公制长度单位与屏幕分辨率进行换算时,必须用到一个DPI(Dot PerInch)指标. 经过我仔细的测试,发现了网页打印中,默认采用 ... 
- C#HTML与UBB(纯文本)之间的转换
			private string HtmlToUBB(string _Html) { _Html = Regex.Replace(_Html,"<br ... 
- django博客项目1.环境搭建
			安装 Python Windows 下安装 Python 非常简单,去 Python 官方网站找到 Python 3 的下载地址,根据你的系统选择 32 位或者 64 位的安装包,下载好后双击安装即可 ... 
- Oracle 11g数据库详解(2)
			FAILED_LOGIN_ATTEMPTS 用于指定连续登陆失败的最大次数 达到最大次数后,用户会被锁定,登陆时提示ORA-28000 UNLIMITED为不限制 精确无误差 是 实时 PASSWOR ... 
