Created by Dennis C Wylie, last modified on Jun 29, 2015

Machine learning methods (including clustering, dimensionality reduction, classification and regression modeling, resampling techniques, etc.), ANOVA modeling, and empirical Bayes analysis.

Unsupervised Analysis

Unsupervised methods provide exploratory data analysis useful for getting a big picture view: can provide valuable QC information and can help to both assess expected trends and identify unexpected patterns in your data.

  • Deliverables:

    • Plots in png and pdf format
    • Results from any additional algorithms applied may be provided in tab-delimited or excel formatted tables as appropriate
  • Tools Used:

    • Hierarchical Clustering: both of genes and and samples.
    • Principal Components Analysis: PCA biplot of data after centering both on the gene and sample axes (and optionally scaling of gene axis if desired).
    • Other methods: (e.g., k-means clustering, self-organized maps, multidimensional scaling, etc.) available if desired

Empirical Bayes Differential Expression Analysis

RNAseq experiments yield simultaneous measurements of many intrinsically similar variables (gene expression levels) but with often limited sample sizes. Empirical Bayes methods provide a statistical approach designed just for such situations which "borrow strength" across genes to increase statistical power and decrease false discovery.

  • Deliverables:

    • Tables of model parameters, p-values, and FDR q-values (in tab-delimited and excel format)

    • Boxplots (stratified by sample group) and pairs plots of top genes provided in png and pdf format

  • Tools Used:

    • Limma: applies empirical Bayes methods in the construction of linear models (e.g, t-tests, ANOVA) for a large variety of experimental designs. Originally designed for microarray data analysis, Limma's developers have substantially extended its functionality into the realm of RNAseq as well.

Supervised Analysis

Many methods available for classification and regression as appropriate to your analysis. Model performance may be assessed using standard metrics evaluated under cross-validation or using independent test sets if available. Analysis will be conducted using R and/or Python scripts.

  • Deliverables:

    • Tables of results (in tab-delimited and excel formats)
    • plots in png and pdf format
    • R and/or Python source files
    • binary, JSON, or XML representations of R or Python objects can be made available if desired
    • further reports in the form of slides or text documents may be provided in standard formats (pdf, doc, ppt) if desired
  • Methods Available:

    • Diagonal linear discriminant analysis (DLDA, a form of linear naive Bayes classification)
    • Linear and quadratic discriminant analysis
    • Logistic regression including L1/lasso and/or L2/ridge regularization if desired
    • Partial least squares (PLS) discriminant analysis and regression
    • k-nearest neighbors (KNN)
    • Support vector machines (SVM)
    • Decision tree ensembles (Random Forests or AdaBoost).
    • Other methods are available on request.
 

7、RNAseq Downstream Analysis的更多相关文章

  1. 6、RNA-Seq Analysis Pipeline

    Created by Dhivya Arasappan, last modified by Dennis C Wylie on Nov 08, 2015 This pipeline uses an a ...

  2. 转录组分析综述A survey of best practices for RNA-seq data analysis

    转录组分析综述 转录组 文献解读 Trinity cufflinks 转录组研究综述文章解读 今天介绍下小编最近阅读的关于RNA-seq分析的文章,文章发在Genome Biology 上的A sur ...

  3. A survey of best practices for RNA-seq data analysis RNA-seq数据分析指南

    A survey of best practices for RNA-seq data analysis RNA-seq数据分析指南 内容 前言 各位同学/老师,大家好,现在由我给大家讲讲我的文献阅读 ...

  4. Power BI 与 Azure Analysis Services 的数据关联:1、建立 Azure Analysis Services服务

    Power BI 与 Azure  Analysis Services 的数据关联:1.建立  Azure  Analysis Services服务

  5. 10、RNA-seq for DE analysis training(Mapping to assign reads to genes)

    1.Goal of mapping 1)We want to assign reads to genes they were derived from 2)The result of the mapp ...

  6. single-cell RNA-seq 工具大全

    [怪毛匠子-整理] awesome-single-cell List of software packages (and the people developing these methods) fo ...

  7. 7、sraToolkit安装使用

    参考:http://blog.csdn.net/Cs_mary/article/details/78378552        ###prefetch 参数解释 https://www.ncbi.nl ...

  8. 玩转大数据:深入浅出大数据挖掘技术(Apriori算法、Tanagra工具、决策树)

    一.本课程是怎么样的一门课程(全面介绍) 1.1.课程的背景           “大数据”作为时下最火热的IT行业的词汇,随之而来的数据仓库.数据分析.数据挖掘等等围绕大数据的商业价值的利用逐渐成为 ...

  9. loadrunner入门篇-Analysis 分析器

    analysis简介 分析器就是对测试结果数据进行分析的组件,它是LR三大组件之一,保存着大量用来分析性能测试结果的数据图,但并不一定要对每个视图进行分析,可以根据实际情况选择相关的数据视图进行分析, ...

随机推荐

  1. 【转载】xtrabackup原理及实施

    转载于:http://www.baidu-ops.com/2013/05/26/xtrabackup/ xtrabackup是基于InnoDB存储引擎灾难恢复的.它复制InnoDB的数据文件,尽管数据 ...

  2. 斯坦福机器学习视频笔记 Week2 多元线性回归 Linear Regression with Multiple Variables

    相比于week1中讨论的单变量的线性回归,多元线性回归更具有一般性,应用范围也更大,更贴近实际. Multiple Features 上面就是接上次的例子,将房价预测问题进行扩充,添加多个特征(fea ...

  3. 解决COMODO Internet Security更新慢或失败的问题

    方法一: COMODO 默认更新服务器只有一个:http://download.comodo.com/ 我电信网络更新速度非常慢: 实在没有办法了,找到了几个更新服务器: http://eu1.dow ...

  4. nginx Unit 服务器

    转自: https://github.com/nginx/unit/pull/18/ 感谢: https://www.v2ex.com/t/389528 English 简体中文 繁體中文 NGINX ...

  5. LINQ 学习路程 -- 查询操作 Skip SkipWhile Take TakeWhile

    Method Description Skip 跳过序列中指定数量元素,然后返回剩余序列 SkipWhile 只要满足条件,就跳过序列中的元素,然后返回剩余函数 Take 从序列的开头返回指定数量的连 ...

  6. castle windsor学习-------Container Events 容器的事件

    所有的事件是实现IKernelEvents 接口,已容器的Kernel属性暴露出来 1. AddedAsChildKernel 当前的容器添加子容器或其他容器时触发 2. RemovedAsChild ...

  7. 每个程序员都应该了解的 CPU 高速缓存

    每个程序员都应该了解的 CPU 高速缓存 英文原文:Memory part 2: CPU caches 来源:oschina [编者按:这是Ulrich Drepper写“程序员都该知道存储器”的第二 ...

  8. matlab点云处理函数

    1. pcread: 输入文件名,返回pointCloud类(用于存储点云).eg: pcloud = pcread(“filename.ply”) 2. pcshow: 输入pointCloud类, ...

  9. 如何在node.js中使用neo4j

    本章中你将会学到如何在node.js中使用neo4j图形数据库. 当你想存储或者查询和数据紧密关联的数据的时候,图形数据库很有用. neo4j是一个可有效存储,处理和查询你数据模型中紧密相连的元素的数 ...

  10. QWidget上下文菜单处理函数

    QWidget类是Qt中所有可视化部件的基类,其很多函数都是虚函数,能被子类重写以表现不同形式和功能,今天来学习一下QWdiget的事件处理函数. 事件是鼠标.键盘或系统因其自身某些状态发生改变而引起 ...