Monthly Archives: April 2015


04.29.15

EBSeq-HMM: A Bayesian approach for identifying gene-expression changes in ordered RNA-seq experiments

Ning Leng1,2, Yuan Li<sup)1, Brian E. Mcintosh2, Bao Kim Nguyen2, Bret Duffin2, Shulan Tian2, James A. Thomson2,3,4, Colin Dewey5, Ron Stewart2 and Christina Kendziorski5,*

– Author Affiliations

1Department of Statistics, University of Wisconsin, Madison, WI
2Regenerative Biology, Morgridge Institute for Research, Madison, WI
3Department of Cell and Regenerative Biology, University of Wisconsin School of Medicine and Public Health, Madison, WI
4Department of Molecular, Cellular, and Developmental Biology, University of California, Santa Barbara, CA
5Department of Biostatistics and Medical Informatics, University of Wisconsin, Madison, WI
*To whom correspondence should be addressed. Christina Kendziorski, E-mail: kendzior@biostat.wisc.edu

  • Received October 14, 2014.
  • Revision received February 23, 2015.
  • Accepted March 30, 2015.

Abstract

Motivation: With improvements in next-generation sequencing technologies and reductions in price, ordered RNA-seq experiments are becoming common. Of primary interest in these experiments is identifying genes that are changing over time or space, for example, and then characterizing the specific expression changes. A number of robust statistical methods are available to identify genes showing differential expression among multiple conditions, but most assume conditions are exchangeable and thereby sacrifice power and precision when applied to ordered data.

Results: We propose an empirical Bayes mixture modeling approach called EBSeq-HMM. In EBSeq-HMM, an auto-regressive hidden Markov model is implemented to accommodate dependence in gene expression across ordered conditions. As demonstrated in simulation and case studies, the output proves useful in identifying differentially expressed genes and in specifying gene-specific expression paths. EBSeq-HMM may also be used for inference regarding isoform expression.

Availability: An R package containing examples and sample datasets is available at Bioconductor.

Contact: kendzior@biostat.wisc.edu

This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com


04.15.15

Integrative analysis of 111 reference human epigenomes

Roadmap Epigenomics Consortium, Anshul Kundaje, et al.

Abstract:
The reference human genome sequence set the stage for studies of genetic variation and its association with human disease, but epigenomic studies lack a similar reference. To address this need, the NIH Roadmap Epigenomics Consortium generated the largest collection so far of human epigenomes for primary cells and tissues. Here we describe the integrative analysis of 111 reference human epigenomes generated as part of the programme, profiled for histone modification patterns, DNA accessibility, DNA methylation and RNA expression. We establish global maps of regulatory elements, define regulatory modules of coordinated activity, and their likely activators and repressors. We show that disease- and trait-associated genetic variants are enriched in tissue-specific epigenomic marks, revealing biologically relevant cell types for diverse human traits, and providing a resource for interpreting the molecular basis of human disease. Our results demonstrate the central role of epigenomic information for understanding gene regulation, cellular differentiation and human disease.