Quantitative Genome-Wide Genetic Interaction Screens Reveal Global Epistatic Relationships of Protein Complexes in

Download PDF České info

Large-scale proteomic analyses in Escherichia coli have documented the composition and physical relationships of multiprotein complexes, but not their functional organization into biological pathways and processes. Conversely, genetic interaction (GI) screens can provide insights into the biological role(s) of individual gene and higher order associations. Combining the information from both approaches should elucidate how complexes and pathways intersect functionally at a systems level. However, such integrative analysis has been hindered due to the lack of relevant GI data. Here we present a systematic, unbiased, and quantitative synthetic genetic array screen in E. coli describing the genetic dependencies and functional cross-talk among over 600,000 digenic mutant combinations. Combining this epistasis information with putative functional modules derived from previous proteomic data and genomic context-based methods revealed unexpected associations, including new components required for the biogenesis of iron-sulphur and ribosome integrity, and the interplay between molecular chaperones and proteases. We find that functionally-linked genes co-conserved among γ-proteobacteria are far more likely to have correlated GI profiles than genes with divergent patterns of evolution. Overall, examining bacterial GIs in the context of protein complexes provides avenues for a deeper mechanistic understanding of core microbial systems.

Published in the journal: . PLoS Genet 10(2): e32767. doi:10.1371/journal.pgen.1004120
Category: Research Article
doi: https://doi.org/10.1371/journal.pgen.1004120

Summary

Introduction

A key feature of the molecular organization of microbes is the tendency of functionally-linked proteins to associate as components of macromolecular complexes, operons, or other biological groupings. As a consequence, the gene products present in a bacterial cell are organized into functional modules, which in turn mediate the major cellular pathways and processes that support bacterial cell growth, proliferation, and adaptation [1]–[3]. Identifying the pairwise functional relationships between genes can reveal these modules, and elucidate the molecular systems that underlie the functional organization of a microbial cell. While chromosomal associations informative about gene functional relationships can be inferred computationally using genomic context (GC)-based methods [4], [5], knowledge of the composition and connectivity of multiprotein complexes and their organization into pathways requires experimentation, and such information remains incomplete even in one of the most tractable and well-studied, prokaryotic model-organisms, Escherichia coli [1], [6].

Physical interactions can be mapped with high-confidence based on the affinity-purification of chromosomally-tagged proteins in combination with mass spectrometry (APMS), which aims to isolate and identify endogenous protein complexes. Analogous to the tandem affinity purification (i.e., TAP tag) method developed for yeast [7]–[9], we developed an efficient sequential peptide affinity purification procedure for E. coli [2], [10] and used it to decipher the global physical organization of a bacterial cell [2], [10]–[12]. Our protein-protein interaction (PPI) map allows for the prediction of protein functions for previously uncharacterized components of soluble macromolecular complexes that co-purify with functionally annotated subunits, via ‘guilt-by-association’ [2], [10]. We further integrated our proteomic data with comparative genomic inferences to define a more comprehensive network of functional interactions covering most of E. coli's cytosolic proteome [2], [3]. Nevertheless, these maps do not fully capture the global systems organization of complexes within biological pathways or processes.

To this end, we and others have developed high-throughput genetic screening methods to systematically map epistasis relationships (i.e., genetic interactions, abbreviated as GIs hereafter) between bacterial gene pairs [13]–[16]. Biochemical pathways and networks are often robust [17], such that most bacterial genes produce no discernible phenotype when singly deleted or mutated [18]. Indeed, only ∼300 of E. coli's 4,145 protein-coding genes are essential under standard laboratory conditions [19]. However, examining the fitness of double mutants can reveal functional dependencies. Hence, our quantitative E. coli synthetic genetic array (eSGA) technology, which simplifies the systematic generation and phenotypic scoring of large numbers of double mutants created by mating collections of engineered E. coli strains en masse [13], [16], can reveal the functional relationships of previously uncharacterized gene products [1], [6]. For example, loss of two non-essential genes, which functionally compensate or buffer each other, may show an aggravating (synthetic sick or lethal, or SSL) GI if the combination of mutations critically impairs a process essential for cell growth or viability. Conversely, ‘alleviating’ (i.e., buffering or suppression) GIs can occur between two genes encoding subunits of the same protein complex, where inactivation of either one alone annihilates complex activity, such that loss of the second component confers no additional defect. Indeed, the global patterns of aggravating and alleviating interactions measured by large-scale GI screens have been used to decipher the functional organization of biological pathways and protein complexes in yeast [20]–[23].

Here, to study the global organization of the E. coli interactome, we employ our eSGA approach in an unbiased manner by performing 163 functionally diverse query genes. The resulting filtered GI network was then combined with existing PPI data and GC-derived interactions to reveal pathway-level crosstalk between disparate protein complexes, and specific biological roles of uncharacterized bacterial gene products.

Results

Target gene selection for an unbiased GI survey

Since fully comprehensive screens are not yet practicable, we selected a diverse, minimally-redundant set of broadly representative ‘query’ genes for our screens (see Protocol S1). After generating selectable mutants in a hyper-recombinant Hfr-Cavalli (Hfr C) ‘donor’ strain background marked with a chloramphenicol-resistance cassette (Cm^R), the corresponding deletion alleles were transferred by conjugation into a near genome-wide mutant collection of F -⁠ ‘recipient’ mutant strains, arrayed in duplicate at 384-colony density. This collection, contains 3,968 non-essential single gene deletions in which the open reading frame was replaced and marked by a kanamycin resistance (Kan^R) cassette (i.e., the Keio collection) [19], and 149 hypomorphic mutant strains [13], [16], in which a Kan^R marker was integrated into the 3′-UTR to alter transcript abundance or stability [13] (Figure 1A, Protocol S2).

**Fig. 1. Target selection and eSGA screen pipeline.**

In total, a set of 163 query ‘donor’ genes with evidence of expression and whose products had high physical interaction degree were selected for screening (Protocol S1). These included 93 genes linked to core bacterial processes (Figure 1B), such as metabolism, cell envelope biogenesis, transcription, protein synthesis and chromosomal replication and repair, and 25 genes of unknown function (Table S1). Since accurate quantitation of epistasis depends on reliable estimations of mutant fitness [24], we performed two independent replicate screens such that each donor-recipient mutant gene pair was tested eight times to account for experimental variation (see Protocol S2). Following genetic transfer, the double mutants were selected on rich medium (Luria Broth) containing both marker drugs (Kan+Cm). After outgrowth for 36 hrs at 32°C, the plates were imaged digitally. Colony growth was quantified using a data processing strategy originally devised for yeast SGA analysis [24], [25], to correct for possible batch and plate position effects, and the different intrinsic growth rates of the single mutants [26]. We also eliminated from consideration pairs of closely-linked loci that potentially suffer from reduced recombination efficiency due to linkage suppression [24], [25]. Overall replicate screen reproducibility was high (r = 0.7; Figure 2A), similar to that reported for other high-quality GI studies [16], [24], [27].

Functional properties of the global <i>E. coli</i> GI network. — **Fig. 2. Functional properties of the global *E. coli* GI network.**

Generating a genome-wide network of high-confidence GIs

We used a multiplicative model to calculate epistasis (S) scores [21], [22], [28], determining both the strength and confidence of putative GIs based on differences between the observed growth of the digenic mutants and the expected growth rates. The null hypothesis assumes independent fitness defects for non-interacting gene pairs -⁠ that is, if two alleles are functionally unrelated (i.e., independent), their joint fitness defects should combine in a multiplicative (i.e., non-synergistic) manner, as was done previously for yeast [25], [29]. Conversely, S-scores deviating significantly from expectation represent candidates for functional associations (i.e., genes working together in a pathway to perform a specialized cellular activity) [29].

The S-scores calculated for ∼600,000 digenic mutant combinations tested showed a normal distribution centered on zero (i.e., neutral) (Figure 2B), consistent with the expectation that GIs are relatively rare, with the fitness of most double mutants (i.e., functionally unrelated) typically equal to the product of individual single mutant growth defects [1], [30]. To rigorously define GIs, as with our previous studies [13], [16], we applied stringent statistical thresholds corresponding to two standard deviations (|Z-score|≥2; P≤0.05) of the score distribution to define significant outliers (Protocol S2). After filtering, the network encompassed GI with S-scores of −3 or lower (25,239 in total) that indicate aggravating (i.e., SSL) relationships, and GIs with S-scores of +3 or higher (17,466) representing alleviating relationships (Figure 2B, Table S2), which occasionally (but rarely) reflect suppression of an impaired growth phenotype conferred by a single allele.

Like other biological networks [24], [31], the filtered GI network had a modular connectivity structure (average clustering coefficient = 0.23, Figure S1A), wherein the majority of the genes have few GIs compared to a small number (n = 25) of highly connected (edge ≥640) ‘hubs’ (Figure S1B). As was reported for yeast [27], [32], [33], essential E. coli genes tend to be more highly connected in the network compared to non-essential genes, both in terms of GI degree (Figure S1C, Protocol S3) and overall network betweenness (i.e., a graph centrality measure reflecting the proportion of shortest paths between pairs of nodes that go through a particular gene) (Figure S1D, Protocol S3). Essential subunits of annotated protein complexes are also significantly enriched (p = 2.2×10⁻¹⁶) in aggravating interactions with each other, compared to pairs of components within non-essential complexes (Figure 2C, Protocol S3), suggesting that as in yeast [34], essential bacterial protein complexes occupy a central position within the E. coli GI network, just as they do in the E. coli PPI network [10].

External benchmarking

Comparison of the filtered GI network against a reference set of manually curated GIs extracted from the literature showed high (∼75%) agreement, which is significant (p-value ≤10⁻⁴) by random sampling null models (Figure 2D, Table S3, Protocol S3). For instance, our network captured the synthetic lethality reported between mutants of the chaperones, cbpA and dnaJ [35], and between the exonucleases recB and recJ, and recB and components of the RecFOR DNA repair complex, which jointly function in RecA-mediated recombination [36].

As the number of interactions in the literature curated reference set was quite limited, we examined if the interacting gene pairs were enriched for functional relatedness using a battery of different metrics (see Protocol S4). For example, inspection of the GI network revealed a slight but significant (p = 1.2×10⁻⁴³) tendency for E. coli genes encoding subunits of the same protein complexes to display correlated patterns of GIs as compared to randomly selected protein pairs (Figure 2E). Likewise, the components of the membrane-associated ferric enterobactin permease complex, FepD and FepG [37], [38], showed highly correlated (r_fepD,fepG = 0.5; Figure 2E) GI patterns, consistent with their co-operative role in transporting iron-bound siderophores into the cytoplasm [39].

Indeed, by every other measure examined, including functional associations predicted by GC methods (p = 2.2×10⁻¹¹⁸) [2], mRNA co-expression (p = 3.3×10⁻⁹³) [40], and phenomic (i.e., chemical genetic, p = 4.8×10⁻¹⁴) profiles [41]; we found that pairs of genes showing similar connectivity patterns in the GI network tended to be more highly correlated (i.e., as measured by Pearson Correlation Co-efficient (PCC) scores) (Figure S2A–C, Protocol S4). Similarly, genes present within the same operon in E. coli [42] had significantly (p = 6.1×10⁻²⁵²) more positively correlated genetic profiles than random pairs of genes (Figure 2F), and this correlation was likely not due to polarity effects as the last and the first genes within each operon were, on average, just as likely to be positively correlated as the first and the middle genes (Figure S2D); intuitively, however the last gene cannot possibly underlie the GI phenotypes for every operon (Protocol S5). An illustrative example is the highly similar (r_{tusC, tusD} = 0.8) GI patterns of the two gene products, tusCD, encoded by the sulfur mediator operon, tusBCDE (Figure 2F), consistent with their joint role in coordinating sulfur transfer [43]. Taken together, the benchmarking underscored the reliability and coverage of our screen data, indicating that the filtered GI network is informative about biological relationships at the level of individual gene pairs, multiprotein complexes, and pathways.

Probing functional neighborhoods in GI networks by monochromaticity

To identify broader functional groupings (i.e., modules or interconnected gene sets), we sorted the genes according to their biological process annotations, and examined the extent to which their corresponding high-confidence GI (|S-score≥3|; P≤0.05) tended towards alleviating or aggravating GI (Figure 3A), using a “monochromatic” score that has been previously used to unveil the modularity of yeast GI networks [44], [45]. While discrete clusters were clearly identified (Figure 3B and 3C) from the GI spanning the constituent genes within bioprocesses with high alleviating or aggravating monochromatic scores, several of these bioprocesses displayed extensive inter-connectivity, suggestive of biological cross-talk (Table S4, Protocol S6). For example, alleviating interactions bridge the cell envelope machinery (e.g., alr, dadX, aer) to phospholipid biosynthesis (clsB, pgpA, ugpA, ugpB, cdh) (Figure 3B), consistent with their close coupling during membrane formation and integrity [16], [46].

**Fig. 3. Monochromaticity of GIs among bacterial bioprocesses.**

Conversely, other process combinations were preferentially enriched for aggravating relationships (Figure 3C). For example, strong SSL associations were observed between the homologous recombination machinery (recABC) and DNA polymerases [polIII (dnaNQ, holAC); polIV (dinB)], whose coordination is critical for genomic integrity [47]. Likewise, sulfur-relay systems [yccK (tusE), yheLMN complex (tusBCD)], which channel sulfur from various trafficking pathways to 2-thiouridine [43], showed aggravating interactions with downstream iron-sulfur (Fe-S) cluster scaffold assembly factors (e.g., ydhD, gntY) (Figure 3C). Similarly, the ferric (Fe³⁺) enterobactin transporter system (e.g., fepBCDG complex, fepA, fepE) showed strong SSL links with the CSD (cysteine sulfinate desulfinase) sulfur transfer apparatus (e.g., csdAEL) (Figure 3C), implying overlap in iron homeostasis.

Functional insights revealed by unexpected epistatic pathway relationships

Since the global patterns of GI measured by eSGA reflect biological relationships, we examined our GI network specifically to delineate novel functional roles for bacterial genes of unclear biological significance. Clustering the GIs resulting from the monochromatic analysis (Protocol S6) implicated orphan genes lacking annotations to specific pathways. For instance, seven unannotated genes (ynjABCDEFI) were grouped together with particular components (e.g., sufCDS, ydhD) of the “Suf” Fe-S cluster assembly machinery (Figure 3C), consistent with a recent report that YnjE is a sulfur transferase required for molybdopterin biosynthesis [48].

Another illustrative example is a modular sub-network consisting of RavA (Regulatory ATPase variant A), a AAA+ ATPase of the MoxR protein family whose physiological function is uncertain, and its binding partner, ViaA (von Willebrand factor A domain interacting AAA+ ATPase) [49], which also exhibited strong aggravating connections with the Fe-S cluster assembly apparatus (Figure 4A). Consistent with predicted epistasis, ravA viaA Fe-S triple deletion strains showed virtually identical GIs (i.e., SSL) as ravA Fe-S or viaA Fe-S double mutants (Figure 4A), which were confirmed independently by liquid culture growth assays (Figure 4B, Protocol S7; representative ravA viaA hscA triple mutant shown).

**Fig. 4. RavA and ViaA linked to Fe-S assembly.**

To further examine the link with Fe-S assembly, we exploited the observations that, at sub-lethal dosages, bactericidal drugs such as aminoglycosides (e.g., streptomycin, gentamycin) cause cell death via mechanisms that are dependent on Fe-S clusters [50]–[53], and that the uptake of aminoglycosides are directly influenced by the Isc pathway of Fe-S cluster biogenesis [54]. As a result, strains deficient in Fe-S assembly show decreased drug sensitivity [52], [54]. We therefore tested the influence of ravA and viaA on Fe-S biogenesis in strains over-expressing the isc assembly machinery (iscRSUA-hscBA-fdx-iscX) on a multicopy plasmid (pRKISC) [55] upon challenge with the aminoglycoside, kanamycin. Notably, the presence of kanamycin impaired wild-type, but not ravA viaA double mutants (Figure 4C, Protocol S8).

Consistent with this, ravA and viaA also showed GIs with cofactors required for Fe-S cluster formation, including genes involved in the biosynthesis of L-cysteine (e.g., the serine acetyltransferase complex, cysEK; hemoprotein subunit of sulfite reductase, cysIJ) from which precursor sulfur is extracted (Figure 4A). The fact that cysteine biosynthetic genes become essential despite the presence of rich media suggests a defect in cysteine transport in the cysB mutant strain (Figure 4D, Protocol S7). Thus, defects in the de novo biosynthesis of cysteine, coupled with impaired import, likely decrease the pool of cysteine available for Fe-S biogenesis and related sulfur transfer reactions by this pathway, which is mirrored as an aggravating phenotype. Since the uptake and assimilation of inorganic sulfurs by cysteine biosynthesis genes in bacteria requires the CysAUWP ABC transporter complex [56]–[58], while organic sulfurs are imported by other ABC transporters [59], we challenged strains over-expressing ravA with inorganic (e.g., SO₄²⁻ and S₂O₃²⁻) and organic [taurine, 2-(4-pyridyl)-ethanesulfonate (PESF), and cysteine (i.e., Cys-S-S-Cys)] sulfur compounds (Figure 4E, Protocol S9). Unlike wild-type E. coli, ravA over-expressing strains showed increased sensitivity to inorganic, but not organic sulfurs (Figure 4E, Protocol S9), seemingly due to perturbation of the normal RavA-ViaA stoichiometry necessary for normal cell function. Taken together, a direct or indirect impact of RavA/ViaA on bacterial sulfur transport is consistent with our GI data, reflecting the tight integration of these systems.

Since the growth assays confirmed participation of ravA and viaA in Fe-S assembly (Figure 4B and 4C), we performed co-immunoprecipitation (co-IP) experiments to determine whether these two proteins interact physically with the Fe-S cluster (Isc) assembly proteins, with which they showed strong aggravating interactions (Figure 4A). Indeed, endogenous affinity-tagged Isc proteins specifically and efficiently co-precipitated native RavA and ViaA (Figure 4F, Protocol S10), implying joint participation in cellular iron homeostasis through physical associations. Most notably, the fact that ravA-viaA mutants displayed a strong aggravating phenotype between the subunits of Isc complex supports the idea that these two overlooked processes function redundantly to tightly regulate cellular iron levels required for the maintenance of cell viability. That is while deletion of subunits of either protein complex shows a similar effect as loss of the entire complex, mutations in both complexes (i.e., RavA-ViaA and Isc simultaneously perturbed) result in SSL phenotypes due to system failure.

Another example of functional insights resulting from this GI analysis involves a sub-network (Figure 5A) of aggravating GIs connecting the late ribosome biogenesis factor, rsgA, with both the ribosome and an unannotated gene, yaiF, which, while not essential in E. coli, is predicted to belong to a protein family of acetyl-transferases that are widely conserved among microbes (Table S5). Although the co-IP experiments showed no physical association between YaiF and RsgA in E. coli solubilized cell extracts (data not shown), as with the GI dataset, analysis of previously published large-scale phenomics (i.e., chemical genetic profiling) data [41] showed that a mutant strain lacking yaiF is hyper-sensitive to antibiotics (macrolide, tetracycline, amino-glycoside) targeting protein synthesis (Figure 5B). Similarly, we found that the mutant strain lacking yaiF or rsgA was sensitive to tetracycline, whereas the yaiF rsgA double mutant exhibited increased drug sensitivity (Figure 5C, Protocol S7), suggesting participation of YaiF in translation.

**Fig. 5. YaiF linked to ribosome biogenesis.**

To evaluate this link further, we examined ribosome profiles in yaiF deletion mutants. Unlike rsgA, the ribosome profile of yaiF mutant from the log-phase culture was nearly wild-type (Figure 5D, Protocol S11), consistent with the previous finding where loss of known protein synthesis gene products, including the ribosome modulation factor, rmf [60], resulted in near wild-type profiles. However, in contrast to wild-type cells, yaiF or rsgA mutants exhibited translational defects, including mistranslation as indicated by higher read-through of out-of-frame amber (UAG) and opal (UGA) nonsense codon alleles and miscoding of +1 and −1 frame-shift mutations in a β-galactosidase reporter [61] (Figure 5E, Protocol S11). Strikingly, these defects were exacerbated when both yaiF and rsgA were deleted (Figure 5E, Protocol S11), consistent with our genetic data.

Moreover, strains lacking yaiF delayed the production of mature 16S rRNA, resulting in the accumulation of late unprocessed 17S rRNA molecules (Figure 5F, Protocol S11) in a similar manner to the mutant strain lacking rsgA [62], [63]. This effect was specific as overexpression of yaiF or rsgA in trans fully rescued the 17S rRNA defects in the respective deletion strains (data not shown), indicating the involvement of YaiF in bacterial protein synthesis. However, further experiments are warranted to delineate how YaiF affects RNA processing and ribosome biogenesis, potentially in a pathway relating to RsgA.

Genetic networks showcase the systems coupling supporting protein homeostasis

Molecular chaperones often have numerous binding partners, as they typically participate in the folding, assembly, transport, and stability of multiple client proteins involved in distinct processes [64], [65]. Previous systems-wide analyses of physical and genetic interactions involving chaperones in yeast has revealed an extensive interplay of inter-chaperone interactions that mediate protein homeostasis in eukaryotes [66]. Since earlier studies in E. coli have largely focused on reductionist biochemical analyses of single or closely related chaperones in isolation, the extent of functional connectivity between bacterial chaperones and their cofactors and substrates has not been explored systematically [67]. We address this gap by examining the global epistatic relationships of 22 general, widely conserved bacterial chaperones and ATP-dependent proteases, including ribosome-associated trigger factor (tig), and members of the Hsp40 (cbpA, djlA, dnaJ, hscB), Hsp70 (dnaK, hscA, hscC, yegD), Hsp90 (htpG), Hsp100 (clpA, clpB, clpX, hslU), small HSPs (hsp33, ibpA, ibpB), and ATP-dependent proteases (clpP, ftsH, hslV, lon) (Figure 6A, Table S1).

**Fig. 6. Functional crosstalk among chaperones and proteases.**

By applying the same strict filtering criteria (|S-score≥3|; P≤0.05) as previously, a network of 3,816 high-confidence GIs involving one or more of these factors (Table S2), revealed functional redundancy and cross talk between these determinants of protein stability. For example, a sub-network of alleviating GIs (Figure 3B) connected the ATP-dependent molecular chaperone, clpX, and its serine protease, clpP, with other known and putative chaperones/co-factors, such as the ATP-dependent protease (hslV), small heat shock proteins (HSPs) (ibpA), and hsp100 (clpA), presumably reflecting functional cooperation in substrate recognition and degradation [68]–[70].

While the number of GIs identified per chaperone varied significantly, ranging from 6 (e.g., hsp33) to well over 600 (e.g., cbpA), with chaperones localized in the cytosol showing the highest connectivity (Figure 6A, Table S6), many non-chaperone genes in this sub-network interacted preferentially with a single chaperone, consistent with a specific role in protein folding (Table S7). For example, while the dnaJ chaperone paralog cbpA showed strong aggravating interactions with over 200 non-chaperones, the NAD-dependent malate dehydrogenase, sfcA only interacted with Hsp70 chaperone, dnaK. In contrast to most soluble proteins, the outer membrane porin, ompA, interacted with 10 different chaperones (Table S8), reflecting the multiphasic nature of membrane protein, secretion, transport, and assembly.

Functional dependencies among chaperone systems

As each gene in the GI network possesses a GI profile, or signature, describing its functional interactions with other tested genes, the biological roles of incompletely characterized components can be inferred based on their GI profile correlation with annotated genes [6], [16], [21] (Table S9). To filter high-confidence correlations, we chose a PCC cut-off score (≥0.3) that captured roughly 18% (438 of 2,385) of the correlated gene pairs mapping to well-annotated EcoCyc complexes or pathways (Figure S3A, Protocol S12).

As implied by the GI network, the correlated GI profiles showed strong functional coordination among distinct chaperone systems (Figure 6B, Table S9). An illustrative example is the highly correlated (r_{ibpA, ibpB}>0.5) interaction profiles of two small HSPs, ibpA and ibpB, which prevent irreversible protein aggregation due to high temperature [71], [72] (Figure 6B). Likewise, a strong correlation was observed between the ATP-dependent protein unfoldases, clpX and clpA (Figure 6B), consistent with their documented cooperation in maintaining client protein stability [73].

To gain insight into the prevalence of functional dependencies between protein complexes and chaperones, we next assessed the degree to which protein complexes were enriched with aggravating or alleviating interactions involving chaperones. We observed that roughly half of all putative soluble protein complexes showed significant (p-value ≤0.05) enrichment for alleviating interactions involving one or more of the 18 chaperone containing protein complexes compiled from our own large-scale proteomics survey [2] and the EcoCyc database (Figure 6C, Protocol S13). Large complexes related to general metabolism and envelope biogenesis interacted with multiple chaperones (Table S10). Chaperone-related complexes shared many non-chaperone interactors, as evidenced by high Jaccard similarity indices, suggesting functional cooperation in complex formation or maintenance (Figure 6D, Table S10, Protocol S13). Strikingly, ATP-dependent proteases, such as clpP interacted strongly with members of the small HSPs and Hsp100 families (Figure 6E, Table S10), consistent with previously reported interplay in protein folding and quality control [74], [75]. Likewise, GIs connected members of the Hsp100 and Hsp70 families (Figure 6E), likely reflecting Hsp100's role in rescuing protein aggregates caused by defects in Hsp70-mediated protein folding [76]. As well, members of the Hsp40 and Hsp90 systems showed strong genetic crosstalk (Figure 6E), consistent with current models of system dependencies between these chaperones [77].

Functional modules enriched for GIs

Despite the scope of the screens, the experimentally mapped GI network of E. coli is sparse. To glean additional insights into the functional organization of bacterial processes, we combined our GI data with alternate evidence of functional associations, such as physical interaction information and GC-based inferences, analogous to integrative studies reported in yeast [20], [23], [78]. In particular, we examined a previously published set of 316 putative E. coli functional modules [2], [3], encompassing protein complexes and 43% (1,784) of all 4,145 known protein-coding genes in E. coli (Table S11), probing for significant enrichment of GIs between modules.

Although only ∼5% (104) of these components were screened as query mutants by eSGA, we observed significant enrichment of GIs between certain functional groupings, or modules, either as protein complexes or overlapping pathways (Figure S3B). After applying stringent permutation testing (Protocol S14), we identified 302 significant enrichments (p-value ≤0.05), of which the vast majority (99%) occurred between different modules (Figure S3C, Table S12). As reported for yeast [20], [22], aggravating GIs were far more prevalent than alleviating interactions between modules (Figure S3D).

The preponderance of GIs between modules provided an opportunity to explore the nature of functional crosstalk between biological systems (Figure S4A, Table S13). For example, the Suf Fe-S cluster biosynthetic module, members of the DNA polymerase module involved in proofreading and correcting replication errors via exonuclease activity, and components of the Psp (phage shock protein) system, mediating cellular responses to envelope instability and maintaining respiratory chains in E. coli, showed a remarkably high degree of interconnectivity (Figure S4B).

In addition to previously noted strong aggravating GIs with the functionally equivalent Isc Fe-S system (encoded by iscRSUA-hscBA) [13], particularly evident (Figure S4B) from the Suf module (sufABCDSE) were aggravating crosstalk with the vitamin B₁₂ transport system, which participates in the E. coli response to reactive oxygen species [79]. Fe-S clusters play important roles in sensing redox/oxidative stress and iron homeostasis [80], and their breakdown can lead to accumulation of reactive oxygen species that triggers an adaptive response [81]. Structural similarity between certain components (e.g., btuD vs. sufC) [82] is also suggestive of functional dependency.

Functional coupling was also evident between the Psp (phage shock protein) and cell-envelope associated modules, such as Sap (sensitive to antimicrobial peptides), Mgl (β-methylgalactoside transporter), Mdt (multidrug resistance exporter) and Nar (Nitrate reductase) transporters, as well as with members of purine salvage pathway (Figure S4B), consistent with joint participation in respiration, maintenance of proton-motive force, and envelope integrity [83]–[86].

Conversely, alleviating interactions were preferentially detected among different module pairs, such as between the small heat shock chaperones (e.g., ibpAB) and multidrug efflux transporters (acrAB-tolC) (Figure S4B), possibly reflecting the active secretion of toxic protein degradation products [87]. Genes encoding members of the AAA+ family of proteases such as clpA-clpP and hslV-clpP, exhibited strong alleviating interactions with the hslV-ftsH protein quality control factors [69], suggesting they work in union (Table S12). On an individual component level, alleviating interactions often occurred between structurally similar proteins, such as the energy-dependent proteases hslV and clpP underlying a common mechanism in protein degradation [88].

Evolutionary conservation of bacterial complexes and pathways

Given that a large proportion of E. coli genes are conserved among a majority of bacteria, particularly among closely related γ-proteobacterial species [2], [10], we investigated the evolutionary significance of the putative functional associations detected by eSGA in E. coli by examining co-conservation of orthologs among other sequenced prokaryotes. Phylogenetic profiles were created by retrieving orthologous groups across a total of 233 fully sequenced γ-proteobacterial genomes (29 closely-related E. coli serotypes, 64 enterobacterial and 140 γ-proteobacterial species) from the eggNOG database [89] (Table S14). These profiles were used to derive mutual information (MI) scores based on the degree of similarity in the pattern of co-conservation of a given pair of genes (Protocol S15). We focused on gene pairs having correlated GI profiles in E. coli with a PCC score of ≥0.3, which favored interactions among components of the same complex and pathway (Figure S3). Consistent with biological expectation, co-conserved subunits of E. coli modules tended to possess highly correlated GI profiles on average compared to those belonging to different (i.e., between) complexes or pathways (Figure 7A and 7B).

**Fig. 7. Correlated GI profiles of co-conserved genes and modules.**

Applying an MI score cut-off ≥0.2, representing a probability of co-conservation more significant than expected by random chance (Figure S5A and S5B), revealed several functionally highly correlated (r≥0.5) co-conserved clusters in γ-proteobacterial species (Table S15, Figure S5B). These included essential E. coli factors functioning in core bacterial bioprocesses such as envelope biogenesis, gluconeogenesis, and RNA/DNA/protein synthesis, which were all highly inter-connected by GIs (Figure 7C).

Furthermore, this analysis revealed varying degrees of functional correlation (i.e., at greater or less than 50% conservation) between several large, co-conserved, but non-essential bacterial protein complexes. For example, orthologues of the substrate (e.g., Sap and Fep ABC transporters) and proton (e.g., periplasmic nitrate reductase) transporter complexes, as well as the sulfur relay heterohexameric TusBCD machinery (Figure 7D), were all evolutionarily co-conserved, consistent with their broad functional importance across γ-proteobacterial species. Surprisingly, however, some subunits of highly co-conserved complexes and pathways had notable differences in their GI profiles. For example, two partly redundant, non-essential, highly conserved lysyl-tRNA synthetases of E. coli, lysU and lysS, each capable of sustaining protein synthesis [90], [91], were functionally anti-correlated with other tRNA synthetases (e.g., thrS, tyrS) (Figure 7E). This suggests opposing functions in support of translation, consistent with previous reports of distinct functions for these genes [90], [91]. Likewise, anti-correlated GI profiles were observed among subunits of the flagellum complex, which were largely found in closely-related E. coli serotypes and enterobacterial species, but which lacked orthologs among other γ-proteobacteria (Figure 7F), suggesting specialized roles in flagellum assembly [92].

Since co-conservation and correlated GI profiles reflect shared functionality [93], [94], we were able to delineate specific biological relationships. For example, the co-conserved components of the ferric enterobactin ABC transporter (e.g., fepBCD) and enterobactin synthetases (e.g., entBE) (Figure S5C) showed highly correlated GI profiles, consistent with their joint participation in iron homeostasis [95], [96]. Likewise, significant correlation was observed among the subunits of the sulfur transfer mediator (e.g., tusBCD) and the thiamin (e.g., thiCDEFM) biosynthesis machinery (Figure S5C), both of which participate in thiamin production [97], [98].

Discussion

The vast majority (>90%) of E. coli's genes are dispensable for viability under standard laboratory culture conditions [19]. Unbiased interaction screens are increasingly being used to characterize the biological organization of E. coli [1], [2], [13], [14], [16]. Yet despite being one of the most heavily studied bacteria, nearly one-third of E. coli's genes currently lack experiment-based functional annotations [1]. While proteomics and GC approaches are valuable for understanding how bacterial gene products associate into discrete biological entities (i.e., protein complexes) [2], [3], [99], they often fail to reveal higher order (i.e., pathway-level) functional relationships and process cross-talk that underlie genetic redundancy, impeding systems-level modeling [100], [101].

Genetic screens have long been appreciated as a powerful means for probing biological relationships in bacteria, but historically these studies have been focused on individual genes, complexes, or pathways in isolation [1], [6], [16]. Recent technical advances, including the development of high-throughput methods such as eSGA, GIANT-coli, and Tn-Seq [13]–[16], now permit the systematic mapping of epistatic dependencies.

In the present study, we have markedly expanded on previous initial surveys of the bacterial GI space [13], [16], achieving a scope for a prokaryote that begins to approach that reported for yeast [24], [102]. Our current GI map, although still sparse, encompasses virtually the entire E. coli proteome. Given the functional information contained within the recorded GI patterns, this map, despite being incomplete, represents a substantial resource for mechanistic prediction. In this study, based solely on our GI data, we were able to discover novel components and unexpected connections in well-studied pathways essential for bacterial fitness such as the association of RavA and ViaA with Fe-S and cysteine assembly, and the implication of the previously uncharacterized component YaiF in maintaining ribosomal integrity, especially in preserving translational fidelity and protein synthesis. The GI map also provides insights into the global architecture of convergent and compensatory pathway crosstalk that contributes to the overall robustness of bacterial processes. To facilitate mechanistic exploration at both levels, we report all high-confidence interactions in a dedicated open web-portal (http://ecoli.med.utoronto.ca/esga), allowing examination of both individual pair-wise gene interactions and broader connectivity among bacterial complexes and biological processes.

Integrative analyses have been documented extensively in yeast [20], [21], [45], [103], however the lack of unbiased GI data has hindered such analysis in bacteria. By combining the eSGA data from this study with previously reported E. coli functional modules derived by physical interaction mapping and GC [2], [3], we found unexpected relationships between certain complexes and pathways. For example, by illuminating how chaperones cooperate within a bacterial cell, we revealed unforeseen functional dependencies, suggesting an overarching surveillance network maintains protein homeostasis in bacteria.

Despite deriving meaningful biological information by expanding the scope of GI data, the current network still remains sparse, as only ∼10% (∼600 K out of ∼8 million) of all possible digenic mutant E. coli gene pairs were evaluated by eSGA to date. Hence, we have likely missed important patterns of connectivity that potentially biases our global inferences, leading to an underestimation of the extent of process crosstalk. However, our integrated approach revealed several novel functional associations between functional modules with significant enrichment in inter-module GIs, revealing various pathways and complexes that participate in related biological processes. This present shortfall will be overcome as the coverage of available GI data improves over the coming years and will provide a greater understanding of the functional organization of the bacterial cell.

The ability to extrapolate the epistatic connectivity diagram of E. coli to other microbial species lacking experimental information provides a conceptual framework for exploring bacterial evolution across different lifestyles and phylogenetically diverse microbiomes [104]. Our preliminary exploration of the co-conservation of genes and functional modules with correlated GI profiles among γ-proteobacteria illustrates the potential to outline possible adaptations, such as connectivity between iron-import and sulfonation in the biogenesis of thiamin utilization, which are linked to bacterial pathogenesis of enteric bacteria [97], [105], [106]. Thus, epistatic interactions can describe how sequence evolution in bacterial species drives functional specialization, environmental adaptations, and, potentially, speciation.

Materials and Methods

Bacterial strains used in this study are listed in Table S16 and Protocol S16. Procedures used for the compilation of donor query targets for eSGA, strain construction, eSGA screens, computational processing epistatic interaction data to derive high confidence GI scores, the analysis of GI network properties, monochromatic analysis, computing correlation scores using GI profiles, enrichment of GI associations within and between functional modules, evolutionary conservation, phenotypic assays, as well as other relevant methods are described in detail in Supplementary Information. Network graphs were generated using Cytoscape (ver. 2.8.2), and the heat-maps were generated using in-house JAVA scripts or MATLAB.

Supporting Information

Zdroje

1. BabuM, MussoG, Diaz-MejiaJJ, ButlandG, GreenblattJF, et al. (2009) Systems-level approaches for identifying and analyzing genetic interaction networks in Escherichia coli and extensions to other prokaryotes. Mol Biosyst 12 : 1439–1455.

2. HuP, JangaSC, BabuM, Diaz-MejiaJJ, ButlandG (2009) Global functional atlas of Escherichia coli encompassing previously uncharacterized proteins. PLoS Biol 7: e1000096.

3. Peregrin-AlvarezJM, XiongX, SuC, ParkinsonJ (2009) The Modular Organization of Protein Interactions in Escherichia coli. PLoS Comput Biol 5: e1000523.

4. Moreno-HagelsiebG, Collado-VidesJ (2002) A powerful non-homology method for the prediction of operons in prokaryotes. Bioinformatics 18 Suppl 1: S329–336.

5. SalgadoH, Moreno-HagelsiebG, SmithTF, Collado-VidesJ (2000) Operons in Escherichia coli: genomic analyses and predictions. Proc Natl Acad Sci U S A 97 : 6652–6657.

6. GagarinovaA, EmiliA (2012) Genome-scale genetic manipulation methods for exploring bacterial molecular biology. Mol Biosyst 8 : 1626–1638.

7. BabuM, VlasblomJ, PuS, GuoX, GrahamC, et al. (2012) Interaction landscape of membrane-protein complexes in Saccharomyces cerevisiae. Nature 489 : 585–589.

8. GavinAC, AloyP, GrandiP, KrauseR, BoescheM, et al. (2006) Proteome survey reveals modularity of the yeast cell machinery. Nature 631–636.

9. KroganNJ, CagneyG, YuH, ZhongG, GuoX, et al. (2006) Global landscape of protein complexes in the yeast Saccharomyces cerevisiae. Nature 440 : 637–643.

10. ButlandG, Peregrín-AlvarezJM, LiJ, YangW, YangX, et al. (2005) Interaction network containing conserved and essential protein complexes in Escherichia coli. Nature 433 : 531–537.

11. ArifuzzamanM, MaedaM, ItohA, NishikataK, TakitaC, et al. (2006) Large-scale identification of protein-protein interaction of Escherichia coli K-12. Genome Res 16 : 686–691.

12. Díaz-MejíaJJ, BabuM, EmiliA (2009) Computational and experimental approaches to chart the Escherichia coli cell-envelope-associated proteome and interactome. FEMS Microbiol Rev 33 : 66–97.

13. ButlandG, BabuM, Díaz-MejíaJJ, BohdanaF, PhanseS, et al. (2008) eSGA: E. coli synthetic genetic array analysis. Nat Methods 5 : 789–795.

14. TypasA, NicholsRJ, SiegeleDA, ShalesM, CollinsSR, et al. (2008) High-throughput, quantitative analyses of genetic interactions in E. coli. Nat Methods 5 : 781–787.

15. van OpijnenT, BodiKL, CamilliA (2009) Tn-seq: high-throughput parallel sequencing for fitness and genetic interaction studies in microorganisms. Nat Methods 6 : 767–772.

16. BabuM, Díaz-MejíaJJ, VlasblomJ, GagarinovaA, PhanseS, et al. (2011) Genetic interaction maps in Escherichia coli reveal functional crosstalk among cell envelope biogenesis pathways. PLoS Genet 7: e1002377.

17. BarabasiAL, OltvaiZN (2004) Network biology: understanding the cell's functional organization. Nat Rev Genet 5 : 101–113.

18. RileyM, AbeT, ArnaudMB, BerlynMK, BlattnerFR, et al. (2006) Escherichia coli K-12: a cooperatively developed annotation snapshot–2005. Nucleic Acids Res 34 : 1–9.

19. BabaT, AraT, HasegawaM, TakaiY, OkumuraY, et al. (2006) Construction of Escherichia coli K-12 in-frame, single-gene knockout mutants: the Keio collection. Mol Syst Biol 2 : 2006.0008.

20. BandyopadhyayS, KelleyR, KroganNJ, IdekerT (2008) Functional maps of protein complexes from quantitative genetic interaction data. PLoS Comput Biol 4: e1000065.

21. BeltraoP, CagneyG, KroganNJ (2010) Quantitative genetic interactions reveal biological modularity. Cell 141 : 739–745.

22. BooneC, BusseyH, AndrewsBJ (2007) Exploring genetic interactions and networks with yeast. Nat Rev Genet 8 : 437–449.

23. UlitskyI, ShlomiT, KupiecM, ShamirR (2008) From E-MAPs to module maps: dissecting quantitative genetic interactions using physical interactions. Mol Syst Biol 4 : 209.

24. CostanzoM, BaryshnikovaA, BellayJ, KimY, SpearED, et al. (2010) The genetic landscape of a cell. Science 327 : 425–431.

25. CollinsSR, SchuldinerM, KroganNJ, WeissmanJS (2006) A strategy for extracting and analyzing large-scale quantitative epistatic interaction data. Genome Biol 7: R63.

26. BaryshnikovaA, CostanzoM, KimY, DingH, KohJ, et al. (2010) Quantitative analysis of fitness and genetic interactions in yeast on a genome scale. Nat Methods 7 : 1017–1024.

27. SchuldinerM, CollinsSR, ThompsonNJ, DenicV, BhamidipatiA, et al. (2005) Exploration of the function and organization of the yeast early secretory pathway through an epistatic miniarray profile. Cell 123 : 507–519.

28. ManiR, St OngeRP, HartmanJLt, GiaeverG, RothFP (2008) Defining genetic interaction. Proc Natl Acad Sci U S A 105 : 3461–3466.

29. DixonSJ, CostanzoM, BaryshnikovaA, AndrewsB, BooneC (2009) Systematic mapping of genetic interaction networks. Annu Rev Genet 43 : 601–625.

30. CostanzoM, BaryshnikovaA, MyersCL, AndrewsB, BooneC (2011) Charting the genetic interaction map of a cell. Curr Opin Biotechnol 22 : 66–74.

31. BellayJ, AtluriG, SingTL, ToufighiK, CostanzoM, et al. (2011) Putting genetic interactions in context through a global modular decomposition. Genome Res 21 : 1375–1387.

32. CollinsSR, MillerKM, MaasNL, RoguevA, FillinghamJ, et al. (2007) Functional dissection of protein complexes involved in yeast chromosome biology using a genetic interaction map. Nature 446 : 806–810.

33. DavierwalaAP, HaynesJ, LiZ, BrostRL, RobinsonMD, et al. (2005) The synthetic genetic interaction spectrum of essential genes. Nat Genet 37 : 1147–1152.

34. WilmesGM, BergkesselM, BandyopadhyayS, ShalesM, BrabergH, et al. (2008) A genetic interaction map of RNA-processing factors reveals links between Sem1/Dss1-containing complexes and mRNA export and splicing. Mol Cell 32 : 735–746.

35. UeguchiC, ShiozawaT, KakedaM, YamadaH, MizunoT (1995) A study of the double mutation of dnaJ and cbpA, whose gene products function as molecular chaperones in Escherichia coli. J Bacteriol 177 : 3894–3896.

36. CourcelleJ, Carswell-CrumptonC, HanawaltPC (1997) recF and recR are required for the resumption of replication at DNA replication forks in Escherichia coli. Proc Natl Acad Sci U S A 94 : 3714–3719.

37. ChenaultSS, EarhartCF (1991) Organization of genes encoding membrane proteins of the Escherichia coli ferrienterobactin permease. Mol Microbiol 5 : 1405–1413.

38. SheaCM, McIntoshMA (1991) Nucleotide sequence and genetic organization of the ferric enterobactin transport system: homology to other periplasmic binding protein-dependent systems in Escherichia coli. Mol Microbiol 5 : 1415–1428.

39. OrchardSS, RostronJE, SegallAM (2012) Escherichia coli enterobactin synthesis and uptake mutants are hypersensitive to an antimicrobial peptide that limits the availability of iron in addition to blocking Holliday junction resolution. Microbiology 158 : 547–559.

40. FaithJJ, DriscollME, FusaroVA, CosgroveEJ, HayeteB, et al. (2008) Many Microbe Microarrays Database: uniformly normalized Affymetrix compendia with structured experimental metadata. Nucleic Acids Res 36: D866–870.

41. NicholsRJ, SenS, ChooYJ, BeltraoP, ZietekM, et al. (2011) Phenotypic landscape of a bacterial cell. Cell 144 : 143–156.

42. PriceMN, ArkinAP, AlmEJ (2006) The life-cycle of operons. PLoS Genet 2: e96.

43. IkeuchiY, ShigiN, KatoJ, NishimuraA, SuzukiT (2006) Mechanistic insights into sulfur relay by multiple sulfur mediators involved in thiouridine biosynthesis at tRNA wobble positions. Mol Cell 21 : 97–108.

44. SegrèD, DelunaA, ChurchGM, KishonyR (2005) Modular epistasis in yeast metabolism. Nat Genet 37 : 77–83.

45. MichautM, BaryshnikovaA, CostanzoM, MyersCL, AndrewsBJ, et al. (2011) Protein complexes are central in the yeast genetic landscape. PLoS Comput Biol 7: e1001092.

46. SilhavyTJ, KahneD, WalkerS (2010) The bacterial cell envelope. Cold Spring Harb Perspect Biol 2: a000414.

47. DelmasS, MaticI (2006) Interplay between replication and recombination in Escherichia coli: impact of the alternative DNA polymerases. Proc Natl Acad Sci U S A 103 : 4564–4569.

48. DahlJU, UrbanA, BolteA, SriyabhayaP, DonahueJL, et al. (2011) The identification of a novel protein involved in molybdenum cofactor biosynthesis in Escherichia coli. J Biol Chem 286 : 35801–35812.

49. SniderJ, HouryWA (2006) MoxR AAA+ ATPases: a novel family of molecular chaperones? J Struct Biol 156 : 200–209.

50. BrillasE, SiresI, OturanMA (2009) Electro-Fenton process and related electrochemical technologies based on Fenton's reaction chemistry. Chem Rev 109 : 6570–6631.

51. DjamanO, OuttenFW, ImlayJA (2004) Repair of oxidized iron-sulfur clusters in Escherichia coli. J Biol Chem 279 : 44590–44599.

52. KohanskiMA, DwyerDJ, HayeteB, LawrenceCA, CollinsJJ (2007) A common mechanism of cellular death induced by bactericidal antibiotics. Cell 130 : 797–810.

53. SchwartzCJ, DjamanO, ImlayJA, KileyPJ (2000) The cysteine desulfurase, IscS, has a major role in in vivo Fe-S cluster formation in Escherichia coli. Proc Natl Acad Sci U S A 97 : 9009–9014.

54. EzratyB, VergnesA, BanzhafM, DuvergerY, HuguenotA, et al. (2013) Fe-S cluster biosynthesis controls uptake of aminoglycosides in a ROS-less death pathway. Science 340 : 1583–1587.

55. TakahashiY, NakamuraM (1999) Functional assignment of the ORF2-iscS-iscU-iscA-hscB-hscA-fdx-ORF3 gene cluster involved in the assembly of Fe-S clusters in Escherichia coli. J Biochem 126 : 917–926.

56. WooffE, MichellSL, GordonSV, ChambersMA, BardarovS, et al. (2002) Functional genomics reveals the sole sulphate transporter of the Mycobacterium tuberculosis complex and its relevance to the acquisition of sulphur in vivo. Mol Microbiol 43 : 653–663.

57. LithgowJK, HayhurstEJ, CohenG, AharonowitzY, FosterSJ (2004) Role of a cysteine synthase in Staphylococcus aureus. J Bacteriol 186 : 1579–1590.

58. BykowskiT, van der PloegJR, Iwanicka-NowickaR, HryniewiczMM (2002) The switch from inorganic to organic sulphur assimilation in Escherichia coli: adenosine 5′-phosphosulphate (APS) as a signalling molecule for sulphate excess. Mol Microbiol 43 : 1347–1358.

59. EichhornE, van der PloegJR, LeisingerT (2000) Deletion analysis of the Escherichia coli taurine and alkanesulfonate transport systems. J Bacteriol 182 : 2687–2695.

60. WadaA, MikkolaR, KurlandCG, IshihamaA (2000) Growth phase-coupled changes of the ribosome profile in natural isolates and laboratory strains of Escherichia coli. J Bacteriol 182 : 2893–2899.

61. O'ConnorM, GoringerHU, DahlbergAE (1992) A ribosomal ambiguity mutation in the 530 loop of E. coli 16S rRNA. Nucleic Acids Res 20 : 4221–4227.

62. JomaaA, StewartG, Martin-BenitoJ, ZielkeR, CampbellTL, et al. (2011) Understanding ribosome assembly: the structure of in vivo assembled immature 30S subunits revealed by cryo-electron microscopy. RNA 17 : 697–709.

63. LeongV, KentM, JomaaA, OrtegaJ (2013) Escherichia coli rimM and yjeQ null strains accumulate immature 30S subunits of similar structure and protein complement. RNA 19 : 789–802.

64. HartlFU, Hayer-HartlM (2009) Converging concepts of protein folding in vitro and in vivo. Nat Struct Mol Biol 16 : 574–581.

65. WongP, HouryWA (2004) Chaperone networks in bacteria: analysis of protein homeostasis in minimal cells. J Struct Biol 146 : 79–89.

66. GongY, KakiharaY, KroganN, GreenblattJ, EmiliA, et al. (2009) An atlas of chaperone-protein interactions in Saccharomyces cerevisiae: implications to protein folding pathways in the cell. Mol Syst Biol 5 : 275.

67. KumarM, SourjikV (2012) Physical map and dynamics of the chaperone network in Escherichia coli. Mol Microbiol 84 : 736–747.

68. BakerTA, SauerRT (2012) ClpXP, an ATP-powered unfolding and protein-degradation machine. Biochim Biophys Acta 1823 : 15–28.

69. SauerRT, BakerTA (2011) AAA+ proteases: ATP-fueled machines of protein destruction. Annu Rev Biochem 80 : 587–612.

70. ReidBG, FentonWA, HorwichAL, Weber-BanEU (2001) ClpA mediates directional translocation of substrate proteins into the ClpP protease. Proc Natl Acad Sci U S A 98 : 3768–3772.

71. RatajczakE, ZietkiewiczS, LiberekK (2009) Distinct activities of Escherichia coli small heat shock proteins IbpA and IbpB promote efficient protein disaggregation. J Mol Biol 386 : 178–189.

72. StrozeckaJ, ChruscielE, GornaE, SzymanskaA, ZietkiewiczS, et al. (2012) Importance of N -⁠ and C-terminal regions of IbpA, Escherichia coli small heat shock protein, for chaperone function and oligomerization. J Biol Chem 287 : 2843–2853.

73. SinghSK, RozyckiJ, OrtegaJ, IshikawaT, LoJ, et al. (2001) Functional domains of the ClpA and ClpX molecular chaperones identified by limited proteolysis and deletion analysis. J Biol Chem 276 : 29420–29429.

74. ThomasJG, BaneyxF (1998) Roles of the Escherichia coli small heat shock proteins IbpA and IbpB in thermal stress management: comparison with ClpA, ClpB, and HtpG In vivo. J Bacteriol 180 : 5165–5172.

75. KirsteinJ, MoliereN, DouganDA, TurgayK (2009) Adapting the machine: adaptor proteins for Hsp100/Clp and AAA+ proteases. Nat Rev Microbiol 7 : 589–599.

76. SaibilH (2000) Molecular chaperones: containers and surfaces for folding, stabilising or unfolding proteins. Curr Opin Struct Biol 10 : 251–258.

77. GenestO, HoskinsJR, CambergJL, DoyleSM, WicknerS (2011) Heat shock protein 90 from Escherichia coli collaborates with the DnaK chaperone system in client protein remodeling. Proc Natl Acad Sci U S A 108 : 8206–8211.

78. UlitskyI, ShamirR (2007) Identification of functional modules using network topology and high-throughput data. BMC Syst Biol 1 : 8.

79. ArenasFA, DiazWA, LealCA, Perez-DonosoJM, ImlayJA, et al. (2010) The Escherichia coli btuE gene, encodes a glutathione peroxidase that is induced under oxidative stress conditions. Biochem Biophys Res Commun 398 : 690–694.

80. Ayala-CastroC, SainiA, OuttenFW (2008) Fe-S cluster assembly pathways in bacteria. Microbiol Mol Biol Rev 72 : 110–125.

81. PyB, MoreauPL, BarrasF (2011) Fe-S clusters, fragile sentinels of the cell. Curr Opin Microbiol 14 : 218–223.

82. KitaokaS, WadaK, HasegawaY, MinamiY, FukuyamaK, et al. (2006) Crystal structure of Escherichia coli SufC, an ABC-type ATPase component of the SUF iron-sulfur cluster assembly machinery. FEBS Lett 580 : 137–143.

83. HarmsC, DomotoY, CelikC, RaheE, StumpeS, et al. (2001) Identification of the ABC protein SapD as the subunit that confers ATP dependence to the K+-uptake systems Trk(H) and Trk(G) from Escherichia coli K-12. Microbiology 147 : 2991–3003.

84. JolyN, EnglC, JovanovicG, HuvetM, ToniT, et al. (2010) Managing membrane stress: the phage shock protein (Psp) response, from molecular mechanisms to physiology. FEMS Microbiol Rev 34 : 797–827.

85. SamantS, LeeH, GhassemiM, ChenJ, CookJL, et al. (2008) Nucleotide biosynthesis is critical for growth of bacteria in human blood. PLoS Pathog 4: e37.

86. BerteroMG, RotheryRA, PalakM, HouC, LimD, et al. (2003) Insights into the respiratory electron transfer pathway from the structure of nitrate reductase A. Nat Struct Biol 10 : 681–687.

87. MessaoudiN, GautierV, KthiriF, LelandaisG, MihoubM, et al. (2013) Global Stress Response in a Prokaryotic Model of DJ-1-Associated Parkinsonism. J Bacteriol 195 : 1167–1178.

88. WangJ, HartlingJA, FlanaganJM (1998) Crystal structure determination of Escherichia coli ClpP starting from an EM-derived mask. J Struct Biol 124 : 151–163.

89. PowellS, SzklarczykD, TrachanaK, RothA, KuhnM, et al. (2012) eggNOG v3.0: orthologous groups covering 1133 organisms at 41 different taxonomic ranges. Nucleic Acids Res 40: D284–289.

90. ClarkRL, NeidhardtFC (1990) Roles of the two lysyl-tRNA synthetases of Escherichia coli: analysis of nucleotide sequences and mutant behavior. J Bacteriol 172 : 3237–3243.

91. OnestiS, MillerAD, BrickP (1995) The crystal structure of the lysyl-tRNA synthetase (LysU) from Escherichia coli. Structure 3 : 163–176.

92. ChilcottGS, HughesKT (2000) Coupling of flagellar gene expression to flagellar assembly in Salmonella enterica serovar typhimurium and Escherichia coli. Microbiol Mol Biol Rev 64 : 694–708.

93. RoguevA, BandyopadhyayS, ZofallM, ZhangK, FischerT, et al. (2008) Conservation and rewiring of functional modules revealed by an epistasis map in fission yeast. Science 322 : 405–410.

94. DixonSJ, FedyshynY, KohJL, PrasadTS, ChahwanC, et al. (2008) Significant conservation of synthetic lethal genetic interaction networks between distantly related eukaryotes. Proc Natl Acad Sci U S A 105 : 16653–16658.

95. FischbachMA, LinH, LiuDR, WalshCT (2006) How pathogenic bacteria evade mammalian sabotage in the battle for iron. Nat Chem Biol 2 : 132–138.

96. MiethkeM, MarahielMA (2007) Siderophore-based iron acquisition and pathogen control. Microbiol Mol Biol Rev 71 : 413–451.

97. DuQ, WangH, XieJ (2011) Thiamin (vitamin B1) biosynthesis and regulation: a rich source of antimicrobial drug targets? Int J Biol Sci 7 : 41–52.

98. JurgensonCT, BegleyTP, EalickSE (2009) The structural and biochemical foundations of thiamin biosynthesis. Annu Rev Biochem 78 : 569–603.

99. CampillosM, von MeringC, JensenLJ, BorkP (2006) Identification and analysis of evolutionarily cohesive functional modules in protein networks. Genome Res 16 : 374–382.

100. BalchWE, MorimotoRI, DillinA, KellyJW (2008) Adapting proteostasis for disease intervention. Science 319 : 916–919.

101. CalloniG, ChenT, SchermannSM, ChangHC, GenevauxP, et al. (2012) DnaK functions as a central hub in the E. coli chaperone network. Cell Rep 1 : 251–264.

102. TongAH, EvangelistaM, ParsonsAB, XuH, BaderGD, et al. (2001) Systematic genetic analysis with ordered arrays of yeast deletion mutants. Science 294 : 2364–2368.

103. MichautM, BaderGD (2012) Multiple genetic interaction experiments provide complementary information useful for gene function prediction. PLoS Comput Biol 8: e1002559.

104. BorensteinE (2012) Computational systems biology and in silico modeling of the human microbiome. Brief Bioinform 13 : 769–780.

105. GarciaEC, BrumbaughAR, MobleyHL (2011) Redundancy and specificity of Escherichia coli iron acquisition systems during urinary tract infection. Infect Immun 79 : 1225–1235.

106. PappasCJ, IyerR, PetzkeMM, CaimanoMJ, RadolfJD, et al. (2011) Borrelia burgdorferi requires glycerol for maximum fitness during the tick phase of the enzootic cycle. PLoS Pathog 7: e1002102.

107. LiZ, PanditS, DeutscherMP (1999) RNase G (CafA protein) and RNase E are both required for the 5′ maturation of 16S ribosomal RNA. EMBO J 18 : 2878–2885.