PMID:17214507

From EcoliWiki
Jump to: navigation, search
Citation

Faith, JJ, Hayete, B, Thaden, JT, Mogno, I, Wierzbowski, J, Cottarel, G, Kasif, S, Collins, JJ and Gardner, TS (2007) Large-scale mapping and validation of Escherichia coli transcriptional regulation from a compendium of expression profiles. PLoS Biol. 5:e8

Abstract

Machine learning approaches offer the potential to systematically identify transcriptional regulatory interactions from a compendium of microarray expression profiles. However, experimental validation of the performance of these methods at the genome scale has remained elusive. Here we assess the global performance of four existing classes of inference algorithms using 445 Escherichia coli Affymetrix arrays and 3,216 known E. coli regulatory interactions from RegulonDB. We also developed and applied the context likelihood of relatedness (CLR) algorithm, a novel extension of the relevance networks class of algorithms. CLR demonstrates an average precision gain of 36% relative to the next-best performing algorithm. At a 60% true positive rate, CLR identifies 1,079 regulatory interactions, of which 338 were in the previously known network and 741 were novel predictions. We tested the predicted interactions for three transcription factors with chromatin immunoprecipitation, confirming 21 novel interactions and verifying our RegulonDB-based performance estimates. CLR also identified a regulatory link providing central metabolic control of iron transport, which we confirmed with real-time quantitative PCR. The compendium of expression data compiled in this study, coupled with RegulonDB, provides a valuable model system for further improvement of network inference algorithms using experimental data.

Links

PubMed PMC1764438 Online version:10.1371/journal.pbio.0050008

Keywords

Algorithms; Biological Transport; Escherichia coli/genetics; Escherichia coli/metabolism; Gene Expression Profiling; Gene Expression Regulation, Bacterial; Gene Regulatory Networks; Iron/metabolism; Oligonucleotide Array Sequence Analysis; Operon/genetics; Reproducibility of Results; Transcription, Genetic/genetics

Significance

You can help EcoliWiki by summarizing why this paper is useful

Useful Materials and Methods

You can help Ecoliwiki by describing the useful materials (strains, plasmids, antibodies, etc) described in this paper.

For overexpression experiments, genes were cloned into vector pBADx53.

One set of microarrays from this paper looked at transcription after treatment with norfloxacin (75 ng/ml) in a strain where the hlpA gene was overexpressed. There are two E. coli genes that have hlpA as a synonym: skp and lpxD. The Skp protein was called "histone-like protein A" when first isolated, so this may be the gene amplified by Faith et al.

Annotations

<protect><annotationlinks/></protect>


<protect>

Accessions SMD Category SMD Subcategory SMD status

</protect>


EcoliWiki Links

Add links to pages that link here (e.g. gene, product, method pages)

See also

References

See Help:References for how to manage references in EcoliWiki.