Wigwams: identifying gene modules co-regulated across multiple biological conditions
- Krzysztof Polanski1,†,
- Johanna Rhodes1,†‡,
- Claire Hill2,
- Peijun Zhang2,
- Dafyd J. Jenkins1,
- Steven J. Kiddle1,§,
- Aleksey Jironkin1,
- Jim Beynon1,2,
- Vicky Buchanan-Wollaston1,2,
- Sascha Ott1 and
- Katherine J. Denby1,2,*
+ Author Affiliations
- ↵*To whom correspondence should be addressed.
- Received September 17, 2013.
- Revision received December 12, 2013.
- Accepted December 13, 2013.
Motivation: Identification of modules of co-regulated genes is a crucial first step towards dissecting the regulatory circuitry underlying biological processes. Co-regulated genes are likely to reveal themselves by showing tight co-expression, e.g. high correlation of expression profiles across multiple time series datasets. However, numbers of up- or downregulated genes are often large, making it difficult to discriminate between dependent co-expression resulting from co-regulation and independent co-expression. Furthermore, modules of co-regulated genes may only show tight co-expression across a subset of the time series, i.e. show condition-dependent regulation.
Results: Wigwams is a simple and efficient method to identify gene modules showing evidence for co-regulation in multiple time series of gene expression data. Wigwams analyzes similarities of gene expression patterns within each time series (condition) and directly tests the dependence or independence of these across different conditions. The expression pattern of each gene in each subset of conditions is tested statistically as a potential signature of a condition-dependent regulatory mechanism regulating multiple genes. Wigwams does not require particular time points and can process datasets that are on different time scales. Differential expression relative to control conditions can be taken into account. The output is succinct and non-redundant, enabling gene network reconstruction to be focused on those gene modules and combinations of conditions that show evidence for shared regulatory mechanisms. Wigwams was run using six Arabidopsis time series expression datasets, producing a set of biologically significant modules spanning different combinations of conditions.
Availability and implementation: A Matlab implementation of Wigwams, complete with graphical user interfaces and documentation, is available at: warwick.ac.uk/wigwams.
Supplementary Data: Supplementary data are available at Bioinformatics online.