Molecular basis of the biogenesis of a protein organelle for ethanolamine utilization
Abstract
Many pathogenic bacteria use proteinaceous ethanolamine utilization microcompartments (Eut BMCs) to catabolize ethanolamine. This ability gives pathogens a competitive edge over commensal microbiota, which can drive virulence in the inflamed gut. Despite such a critical function, the molecular mechanisms underlying the synthesis of Eut BMCs remain elusive. We report a systematic study for dissecting the molecular basis underlying Eut BMC assembly in Salmonella. We determined the functions of individual constituent proteins in the structure and function of Eut BMCs and demonstrated that EutQ is essential for cargo encapsulation and Eut BMC formation through specific association with the shell and cargo enzymes. We found that Eut proteins can self-assemble to form cargo and shell aggregates independently in vivo and that the biogenesis of Eut BMCs follows a “shell-initiated” pathway. Cargo enzymes exhibit dynamic liquid-like organization within the Eut BMC. Our findings provide mechanistic insights into the structure and assembly of the Eut BMC that serves as a paradigm for membraneless organelles.
Affiliations: Institute of Systems, Molecular and Integrative Biology, University of Liverpool, Crown Street, Liverpool L69 7ZB, UK.; MOE Key Laboratory of Evolution and Marine Biodiversity, State Key Laboratory of Marine Food Processing and Safety Control & College of Marine Life Sciences, Ocean University of China, Qingdao 266003, China.; Hubei Key Laboratory of Agricultural Bioinformatics, College of Informatics, Huazhong Agricultural University, Wuhan 430070, China.; Centre for Cell Imaging, University of Liverpool, Crown Street, Liverpool L69 7ZB, UK.; Institute of Infection, Veterinary & Ecological Sciences, University of Liverpool, Crown Street, Liverpool L69 7ZB, UK.
License: Copyright © 2025 The Authors, some rights reserved; exclusive licensee American Association for the Advancement of Science. No claim to original U.S. Government Works. Distributed under a Creative Commons Attribution License 4.0 (CC BY). CC BY 4.0 This is an open-access article distributed under the terms of the Creative Commons Attribution license, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
Article links: DOI: 10.1126/sciadv.adx9774 | PubMed: 41032589 | PMC: PMC12487888
Relevance: Moderate: mentioned 3+ times in text
Full text: PDF (15.6 MB)
INTRODUCTION
Ethanolamine (EA) is an abundant nutrient in the mammalian gastrointestinal (GI) tract, a membrane-rich environment due to the constant turnover of the mucosal epithelium and the resident microbiota (ref. R1–ref. R3). Diverse gut bacteria, including those from pathogenic genera such as Salmonella, Clostridium, Klebsiella, Listeria, Escherichia, and Enterococcus, can use EA as a sole source of carbon and/or nitrogen (ref. R4–ref. R6). The ability to metabolize EA provides a competitive growth advantage to pathogens over commensal microbiota. Moreover, EA plays a role in regulating the virulence of these pathogens (ref. R7, ref. R8) and EA catabolism is closely associated with virulence in the inflamed gut (ref. R9–ref. R13).
In many pathogenic species, such as Salmonella enterica serovar Typhimurium (S. Typhimurium), EA utilization is mediated by a group of proteins that are encoded by 17 genes clustered in the eut operon (ref. R14–ref. R16). These proteins, known as EutS/P/Q/T/D/M/N/E/J/G/H/A/B/C/L/K/R, include catalytic and structural protein components that self-assemble to form a polyhedral organelle, called the EA utilization microcompartment (Eut BMCs; Fig. 1, A and B). The Eut BMCs belong to a family of bacterial microcompartments (BMCs) that are widespread in the bacterial kingdom, which perform a variety of functions, including CO2 fixation, infection, and microbial ecology (ref. R16–ref. R21). However, despite their important functions in bacterial pathogenesis and host-pathogen interactions, our understanding of how the various shell proteins and cargo enzymes interact and self-assemble in vivo to form functional Eut BMCs is still limited.

Here, we perform a systematic study on the molecular mechanisms that underlie the organization and biogenesis of Eut BMCs in S. Typhimurium strain LT2. Using genetic modification, super-resolution fluorescence imaging, electron microscopy (EM), and growth assays, we delineate the roles of individual Eut proteins in Eut BMC assembly and functionality. Combining nuclear magnetic resonance (NMR) spectroscopy, AlphaFold predictions (ref. R22), and biochemical studies, we elucidate how the N-terminal domain of EutQ mediates the physical binding between the shell and cargos to ensure encapsulation. We found that Eut proteins independently self-assemble to form cargo and shell aggregates. Our results establish that the in vivo biogenesis of Eut BMCs follows a distinct “shell-initiated” assembly pathway, in which the assembly of shells takes place before the assembly of cargos and at the cell poles. We further demonstrate that the Eut BMC cargo core has liquid-like features, highlighting the essential role of phase separation in constructing and maintaining self-assembling protein organelles in pathogenic bacteria. Our study addresses long-standing questions surrounding Eut BMC biogenesis and the self-assembly mechanisms of shell and cargos into functional protein organelles. These findings could guide the development of therapeutic interventions for diseases caused by pathogens.
RESULTS
Subcellular localization of Eut proteins and protein organization of the Eut BMC
To examine the subcellular location and assembly of Eut proteins in S. Typhimurium, we labeled target Eut proteins with fluorescent proteins using a pBAD/Myc-His vector with the arabinose-inducible ParaBAD promoter (ref. R23) (Fig. 1C, fig. S1, and tables S1 and S2). For in vivo super-resolution fluorescence imaging, selected Eut shell or cargo proteins tagged with superfolder green fluorescent protein (sfGFP) or mCherry were expressed under the control of the ParaBAD promoter, along with the expression of the endogenous eut operon, which was activated in the presence of EA and cobalamin (vitamin B12) (ref. R24).
In the absence of EA and B12, no endogenous Eut proteins were expressed in S. Typhimurium. In the absence of Eut BMC formation, the individual vector-expressed Eut proteins, including the shell proteins EutM/K/L, cargo proteins EutG/B, as well as EutQ/P/J and EutR, were dispersed throughout the cytoplasm (Fig. 1C and fig. S2). One exception is the cargo protein EutE, which formed an aggregate at the cell pole. In addition, the multiple enzyme complex EutBC (sfGFP was fused at the EutC C terminus) appeared to aggregate at a single pole of the cell (fig. S3). Both EutE and EutC have short peptides at their N termini, which act as endogenous encapsulation peptides (EPs) to target cargo enzymes into the Eut BMC (ref. R25, ref. R26). The observed polar aggregation of EutE and EutBC was likely mediated by their EPs, consistent with the findings from 1,2-propanediol utilization microcompartments (Pdu BMCs) (ref. R23).
Growth of S. Typhimurium in the presence of EA and B12 led to expression of endogenous Eut proteins and formation of Eut BMCs (Fig. 1A). Fluorescence imaging revealed that the Eut proteins (shell proteins EutM/K/L and cargo proteins EutE/G/B/Q) exhibited a patchy distribution, representing the characteristic localization pattern of Eut BMCs in vivo (Fig. 1, C and D). By contrast, EutP, EutJ, and EutR were evenly distributed throughout the cytoplasm, indicating that these fused proteins (or likely in the native context) are not incorporated within Eut BMCs. EutQ and EutP were assumed to function as acetate kinases (ref. R27). Our results show that only EutQ is encapsulated within the Eut BMC, possibly catalyzing the formation of acetate and adenosine triphosphate (ATP) from acetyl phosphate generated from EutD (see further analysis below). Our data lead us to propose a model for the protein organization of the Eut BMC (Fig. 1E).
In addition to the canonical 17-gene eut operon, recent studies have indicated 19 adjacent genes that show conservation across the eut loci in various Salmonella strains (ref. R16, ref. R28). To investigate whether these gene products are involved in Eut BMC biogenesis and function, we performed mass spectrometry–based proteomics on S. Typhimurium LT2 cells grown without or with EA induction at 15, 30, 60, and 300 min (fig. S4). Our results reveal that EA induction stimulated the production of all 17 Eut proteins to different levels at a variety of time points, with increases ranging from 11- to 63-fold after 300 min. We then analyzed the expression levels of 17 of the 19 ancillary proteins (fig. S4), with MaeB and TalA proteins showing the highest abundance. Following EA induction (300 min), AmiA and YpfG proteins exhibited two- and fivefold increases, respectively. These results suggest a correlation between these ancillary proteins and Eut BMC biogenesis and assembly. To test this, we determined the cellular locations of the four proteins relative to Eut BMC in vivo localization using fluorescence tagging. Our results showed that MaeB, TalA, and AmiA were distributed evenly in the cytoplasm, whereas YpfG was located in the cell membrane (fig. S5). We conclude that the four proteins were not physically incorporated into Eut BMCs. Their precise roles in Eut BMC biogenesis and function merit further investigation.
Roles of individual Eut proteins in Eut BMC assembly
To decipher the roles of individual protein components in the Eut BMC assembly, we generated a series of mutants that lacked individual Eut proteins via scarless deletion of the corresponding eut genes (ΔeutS, ΔeutM, ΔeutN, ΔeutL, ΔeutK, ΔeutQ, and ΔeutJ) (fig. S6A). The dual-labeling plasmid pBAD-eutM-mCherry::eutB-sfGFP was then transformed into these deletion mutants to visualize the assembly of Eut shell proteins (indicated by EutM) and cargos (indicated by EutB).
In the absence of EutL or EutQ, EutB-sfGFP aggregated into a large punctum at a cell pole, whereas EutM-mCherry formed multiple clustered assemblies in the cytoplasm, suggesting spatial separation of shell and cargo proteins (Fig. 2, A and B). Thin-section EM of the Salmonella strains with and without fluorescent labels verified the observations from fluorescence imaging (fig. S7). We conclude that both EutL and EutQ play essential roles in mediating association between the shell and cargo proteins of Eut BMCs. The growth rates of the ΔeutL and ΔeutQ strains were reduced compared with the wild-type (WT) strain (Fig. 2G and fig. S8), possibly due to the release of toxic acetaldehyde, the impact of cytosolic redox state on cargo enzymatic functions, or improper enzyme stoichiometry within the formed polar aggregates. The ΔeutQ grew even slower than ΔeutL, consistent with EutQ acting as an acetate kinase during EA degradation (ref. R27). Expression of EutL or EutQ in the ΔeutL or ΔeutQ strain, respectively, partially rescued the assembly of Eut BMCs, evidenced by colocalization of EutM-mCherry and EutB-sfGFP and growth assays (fig. S9). We found that complementation of EutL resulted in various aggregates mostly at cell poles, whereas complementation of EutQ led to formation of rodlike structures (fig. S9A). These results suggest that precise levels of EutL/EutQ are required to define the assembly of native Eut BMCs.

For the ΔeutM strain, we used the pBAD-eutL-mCherry::eutB-sfGFP to monitor the assembly of shell proteins (represented by EutL) and cargos (represented by EutB) (Fig. 2C). In the absence of EutM, EutB-sfGFP formed a large punctum at a single polar position within the cell, whereas EutL-mCherry was evenly distributed throughout the cytosol (Fig. 2C and fig. S7A). The ΔeutM strain exhibited a growth defect on EA compared to the WT (Fig. 2G). These results indicate that EutM may initiate the assembly of the Eut BMC shell.
In the ΔeutN mutant, Eut assemblies were elongated (Fig. 2D and fig. S7A), suggesting that EutN proteins play an important role in shaping the Eut BMC structure. This is consistent with the functions of shell vertex pentameric proteins in other BMCs, including CsoS4 in the α-carboxysome (ref. R29), CcmL in the β-carboxysome (ref. R30), and PduN in the Pdu BMC (ref. R23, ref. R31). EutN is the only CsoS4/CcmL homolog in S. enterica, and no other pentamers have been identified in Eut BMCs (ref. R32). EutN was characterized to be pentameric in solution (ref. R33), whereas crystallization analysis showed that EutN was a homohexamer (ref. R30, ref. R34), suggesting the quaternary structural flexibility of the shell proteins under varying environments.
In ΔeutK, Eut BMCs were predominantly located at cell poles (Fig. 2E and fig. S7A) with the polar Eut BMC assemblies being more static than the highly dynamic Eut BMCs in the WT (movies S1 and S2). We conclude that EutK plays a role in determining the spatial distribution of Eut BMCs in S. Typhimurium. The ΔeutK strain exhibited a slower growth rate than the WT on EA (Fig. 2G and fig. S8). Complementation experiments showed that both Eut BMC formation and cell growth recovered when EutK was expressed in ΔeutK (fig. S9).
In the absence of EutS, EutM-mCherry and EutB-sfGFP colocalized into fluorescent foci (Fig. 2F), resembling the typical Eut BMC distribution in WT cells (Fig. 1C). Thin-section EM confirmed the formation of Eut BMC in ΔeutS (fig. S7A). The ΔeutS strain exhibited a similar growth profile as the WT using EA as the sole carbon source (Fig. 2G and fig. S8), consistent with previous observations (ref. R35). These results suggest that EutS is not essential for the assembly of Eut BMCs. Similarly, the deletion of EutJ, a putative chaperone-like protein (ref. R36) or an MreB/FtsZ homolog (ref. R28), did not remarkably affect Eut BMC assembly or the growth of S. Typhimurium (fig. S10).
To sum up, our systematic analyses revealed the essential roles of EutM, EutN, EutL, EutK, and EutQ for determining the structural integrity and growth of Eut BMCs.
EutQ governs the encapsulation of the enzymatic core to the shell
Our results showed that, in the absence of EutL, EutQ, and EutM, the cargo enzymes were spatially separated from the shell proteins; cargos formed one aggregate at the cell pole, whereas shell proteins appeared as multiple aggregates (ΔeutL and ΔeutQ) or were evenly distributed (ΔeutM) in cells (Fig. 2, A to C). To investigate the protein composition of these self-assemblies, we determined the locations of Eut proteins in the ΔeutQ, ΔeutL, and ΔeutM strains grown upon EA and B12.
In ΔeutQ and ΔeutL, the shell protein EutK was dissociated from the cargo EutB but colocalized with the shell protein EutM to form multiple aggregates (Fig. 2H). In ΔeutM, EutK was freely distributed in the cytoplasm (Fig. 2H), which fits with the observations for EutL (Fig. 2C), confirming that EutM drives shell formation. Furthermore, in ΔeutQ, ΔeutL, and ΔeutM, the cargo proteins EutE and EutG colocalized with EutB, forming a single polar aggregate (Fig. 2I). These findings indicate that the lack of EutQ, EutL, or EutM could cause shell-cargo disassociation, resulting in the formation of a cargo aggregate at the cell pole coupled with multiple shell aggregates or the dispersion of shell proteins within cells (Fig. 2, A to C and H to J).
Our results suggest that interactions between EutQ and EutL or/and EutM may be essential for Eut BMC cargo-shell association. To test this hypothesis, we examined the locations of EutL and EutM in ΔeutQ and the distribution of EutQ in ΔeutL and ΔeutM. In the absence of EutQ (ΔeutQ), EutL and EutM assembled together to form shell aggregates (Fig. 2H) but disassociated with cargos (Fig. 2, B and H). In ΔeutL and ΔeutM, EutQ formed a punctum at the cell pole that colocalized with cargos (Fig. 2I). We conclude that EutQ forms a strong association with cargos while interacting with the shell proteins EutM and EutL, which is essential for cargo encapsulation. The findings from ΔeutL show that shell assemblies disassociated with cargos in the presence of EutM (Fig. 2, H and J), suggesting that the interactions between EutQ and EutM were insufficient for cargo encapsulation. A previous study has indicated that EutQ was required for Eut BMCs assembly and its interaction with EutM through pull-down assays (ref. R37). Our microscale thermophoresis (MST) analysis further revealed mutual interactions between EutQ, EutM, and EutL (Fig. 2K and fig. S11). We propose that the binding of EutQ to the shell is facilitated by the shell proteins EutL and EutM and that EutQ specifically attaches at the interaction interface between EutM and EutL.
Molecular mechanisms of the EutQ-mediated shell-cargo association
Size exclusion chromatography coupled with multiangle light scattering analysis revealed that EutQ (~25 kDa per monomer) forms an ~50-kDa dimer and that dimerization was mediated by the contacts between two EutQ C termini (fig. S12). This is consistent with the crystal structure of EutQ C terminus (Protein Data Bank ID: 2PYT) (ref. R38). AlphaFold prediction of the entire EutQ structure shows that the EutQ N-terminal region (EutQ1–99) has four α helices (H1 to H4) and a long flexible unstructured region linking H2 and H3. The C-terminal region (EutQ100–229) consists of mainly β strands (Fig. 3A and fig. S13, A and B), representing a cupin barrel fold domain (Pfam: PF06249) that is widespread and conserved among bacteria.

To determine the function of EutQ in Eut BMC assembly, we generated two mutants that lacked the EutQ N terminus (ΔeutQ9–99; note that eutQ1–9 overlaps with the upstream gene eutP) or the C terminus (ΔeutQ100–229). Deletion of EutQ9–99 caused disassociation of the shell and cargos and formation of a polar cargo aggregate along with numerous shell assemblies (Fig. 3B and fig. S13, C and E), in line with the observations from the deletion of full-length EutQ (Fig. 2 and fig. S7). In the absence of EutQ9–99 (ΔeutQ9–99), the EutQ C terminus exhibited an even distribution throughout the cytosol and did not assemble with either the shells or the cargos (Fig. 3B). The EutQ N terminus (EutQ1–99) was sufficient to target sfGFP to Eut BMC, both in the presence and absence of EutQ9–99 (Fig. 3C). By contrast, Eut BMCs were formed in ΔeutQ100–229 (Fig. 3B and fig. S13, D and F). These results demonstrate that the EutQ N terminus plays a critical role in mediating the shell-cargo association, whereas the EutQ C terminus is not essential for Eut BMC assembly.
Both the ΔeutQ9–99 and ΔeutQ mutants had reduced growth rates compared to the WT (Fig. 3D and fig. S14). We propose that the absence of EutQ coordinating function upon loss of the full protein (ΔeutQ) or the N-terminal helices involved in interactions with cargo and shell proteins (ΔeutQ9–99) adversely affects bacterial growth under EA utilization. Both EutQ and EutP have been suggested to have acetate kinase activity, facilitating the conversion of acetate and ATP into acetyl phosphate and vice versa; however, EutQ is required for Salmonella growth, but EutP is not (ref. R27). Likewise, EutQ was found to be 10 times more abundant than EutP in Escherichia coli (ref. R39). A recent study assumed that both EutP and EutQ are cytosolic, highlighting that their subcellular localization requires further investigation (ref. R39). Our results demonstrate that EutQ is incorporated within the Eut BMC, whereas EutP is most likely cytosolic (Fig. 1D). We found that ΔeutP had a similar growth rate to the WT and, unexpectedly, that ΔeutPQ showed slightly faster growth than ΔeutQ (fig. S14). This presumably reflects the presence of the housekeeping acetate kinase (AckA) in the cytosol, which is encoded by the ackA gene outside the eut operon (ref. R40, ref. R41). In ΔeutQ9–99, formation of acetate can be catalyzed by EutQ100–229 that contains the C-terminal catalytic domain of EutQ, ensuring a faster growth of ΔeutQ9–99 than ΔeutQ (Fig. 3D and fig. S14). Moreover, ΔeutQ100–229 cannot grow on EA as the sole carbon source. In ΔeutQ100–229, Eut BMCs were still formed, but the access of acetyl phosphate to the external acetate kinase was limited by shell encapsulation, resulting in a slower growth rate of ΔeutQ100–229 compared to other strains (Fig. 3D and fig. S14). The growth of ΔeutQ100–229 was restored by expression of full-length EutQ (Fig. 3D and fig. S14). However, expression EutQ100–229 in ΔeutQ100–229 did not restore growth, suggesting that EutQ100–229 alone could not be encapsulated into the Eut BMC without the EutQ N terminus.
The encapsulation of EutQ suggests its role in the localized acetyl phosphate metabolism, enabling ATP generation within Eut BMCs. Because the large adenosine diphosphate (ADP) and ATP molecules may be difficult to pass through the shell pores, this localized ATP pool may serve as a local energy source for EutBC reactivation by EutAT (ref. R42). In this context, the presence of an acetate kinase within the BMC core may be essential for sustaining enzymatic activity and maintaining efficient metabolic flux within the BMC lumen.
To determine how the N terminus of EutQ mediates shell-cargo association, we deleted the H1 to H4 α helices individually (ΔeutQ9–21, ΔeutQ22–41, ΔeutQ62–79, and ΔeutQ80–99) and examined the subcellular locations of shell and cargo proteins in these mutants. Deletion of each α helix led to the disassociation of the shell and cargos (Fig. 3E), indicating that all four α helices are essential for Eut BMC formation. The AlphaFold-predicted EutQ structure and EutQ1–99-EutC1–20 interactions (figs. S13A and S15) led us to propose that H1/H2 interacts with the shell and that H3/H4 is responsible for anchoring the EPs of cargos. To test this hypothesis, we deleted the H1/H2 (ΔeutQ9–41), H3/H4 (ΔeutQ62–99), and the flexible region (ΔeutQ42–61) individually. The shell and cargo assemblies were spatially separated in all three mutants (Fig. 3F and fig. S16), suggesting that the three regions were crucial for shell-cargo association. Furthermore, EutQΔ9–41 and EutQΔ42–61 aggregated with the cargo cores at the cell poles in ΔeutQ9–41and ΔeutQ42–61, whereas EutQΔ62–99 dissociated with the cargos and colocalized with the shell assemblies in ΔeutQ62–99 (Fig. 3F and fig. S17).
Together, our results demonstrate that the EutQ N-terminal H1/H2 interacts with the shell whereas H3/H4 associates with cargos. We propose that the interactions between H3/H4 and cargos were stronger than those between H1/H2 and the shell because the EutQ that lacked the flexible region (EutQΔ42–61) colocalized with cargos rather than shells (Fig. 3F, middle bottom). This finding indicates that the flexible region plays an important role in shell-cargo binding and potential interactions with the shell.
We then examined the roles of endogenous cargo-derived EPs by studying their interactions with EutQ. Deletion of the N terminus of EutC (ΔeutC2–19) caused cytosolic distribution of EutB, whereas EutQ, the cargo protein EutE, and the shell protein EutM could still form assemblies individually (Fig. 3G). These results suggest that the EutC N terminus was specifically responsible for the assembly and encapsulation of EutBC cargo complexes. Likewise, removal of the native EP of EutE (ΔeutE2–21) resulted in an even distribution of EutEΔ2–21 throughout the cytosol, whereas the subcellular locations of other Eut proteins were not affected (Fig. 3H), identifying a role for the native EP in targeting EutE to the Eut BMC. Growth assays of the EP deletion mutants showed no growth on EA as the sole carbon source after 50 hours (Fig. 3D and fig. S14), indicating that, without the EPs, EutBC and EutE could not be incorporated into Eut BMCs. The location of another cargo protein, EutG, was not affected in both ΔeutC2–19 and ΔeutE2–21 (fig. S18). EutG has an N-terminal extension with an unknown function (fig. S19A). Our results reveal that, although this extension played a role in mediating the incorporation of EutG into Eut BMCs (fig. S19B), it was insufficient to independently target sfGFP to Eut BMCs (fig. S19C). The N terminus of EutG lacks canonical EP features (fig. S19D), and EutG proteins are likely incorporated into Eut BMCs by “piggybacking” with other cargo enzymes (ref. R25, ref. R42). Collectively, our results elucidate the mechanisms by which the EutQ N terminus mediates the assembly of the Eut shell and cargos (Fig. 3I).
To determine how EutQ binds cargo EPs, we examined the protein-protein interactions using isothermal titration calorimetry (ITC) and NMR spectroscopy. The ITC results revealed strong binding between EutC1–20 and EutQ [dissociation constant (Kd) = 6.35 μM] or between EutC1–20 and the N-terminal region of EutQ, EutQ1–99, fused with a soluble fusion partner, the immunoglobulin binding domain of streptococcal protein G (GB1) (ref. R43) (Kd = 7.60 μM) (Fig. 4, A and B, and fig. S20). A binding stoichiometry of 2 for the EutQ interactions suggests that EutQ functions as a dimer. This is in contrast to EutQ1–99-GB1, where the binding stoichiometry to the same EutC peptide is 1, consistent with the monomeric state of EutQ1–99. No binding was detected between EutC1–20 and EutQ100–229 (Fig. 4C) or for GB1 alone or for the incubation buffer (fig. S21). Two-dimensional (2D) heteronuclear single-quantum correlation (HSQC) NMR identified interactions between EutQ1–99 and EutC1–20, indicated by chemical shift perturbations (CSPs) of selected residues of 15N-labeled EutQ1–99 in the presence of unlabeled EutC1–20 (Fig. 4D). Complete backbone assignment of EutQ1–99-GB1 was hampered by severe resonance overlap and/or line broadening due to the propensity of the H3/H4 region to aggregate; specifically, resonances from the residues 66 to 77, 82, and 85 to 98 within H3 and H4 regions could not be assigned (Fig. 4E and fig. S22). The absence of CSPs of residues in H1 and H2 (which were all assigned) confirmed that this region of EutQ is not involved in interactions with EutC1–20. Assigned resonances showing CSPs include amino acids 63 to 65, 78 to 81, 83, 84, and 99; these are residues found in the linker region and within H3/H4 of EutQ (Fig. 4F). These NMR results confirmed the in vivo findings (Fig. 3) and the AlphaFold predictions described above (fig. S15). Furthermore, molecular dynamics (MD) simulations revealed that the amino acids I6, V10, M14, and V13 of EutC1–20 lie on one side of the peptide helices and interact with the conserved amino acids V93, L89, E64, R67, and Q75 of EutQ (figs. S23 and S24). Point mutation of EutQ (figs. S25 and S26) showed that the mutants V93A and L89A, with the alternations to hydrophobic residues, completely eliminated the binding of EutQ to EutC1–20. The Q75A mutant exhibited a higher Kd (Kd = 21.5 μM) compared to the WT EutQ, indicating reduced binding affinity. These results suggest that the EutQ-EutC1–20 interaction involves both hydrophobic and hydrophilic amino acids, with hydrophobic interactions playing a dominant role in complex stabilization.

We investigated the binding between EutQ and EutE1–20, the EP of the distinct EutE cargo protein, using ITC and NMR (fig. S27). The EutE binding characteristics, including the binding site and thermodynamics, resembled those observed for EutC1–20 binding, albeit with a notable decrease in affinity. EutC1–20 and EutE1–20 helical peptides show 90% homology with 40% identity (Fig. 4G). The difference between the affinities of EutC and EutE to EutQ could be explained by slight variations in the hydrophilic interactions between the two peptides and EutQ. Together, our results suggest that the EutC1–20 and EutE1–20 peptides interact with EutQ to mediate cargo encapsulation.
Comparative genomic analysis and protein sequence alignment showed that Eut proteins are highly conserved in sequence among Gram-negative bacteria (figs. S28 and S29). EutQ homologs are also known as components of another type of metabolosomes, the glycyl radical enzyme-associated microcompartments (GRMs) (ref. R18), implying that the EutQ-mediated assembly mechanisms of Eut BMCs we have found in S. Typhimurium may serve as a general principle for the assembly of BMCs in a variety of bacterial species. In contrast, the Eut proteins from Gram-positive bacteria exhibit low sequence similarity compared to S. Typhimurium counterparts (fig. S28). Notably, the EutQ proteins of Gram-positive species like Enterococcus faecalis and Listeria monocytogenes lack the first three α helices (fig. S29), which, according to our results, are crucial for BMC assembly in S. Typhimurium. This difference strongly indicates that the assembly mechanisms of Eut BMCs may vary substantially between Gram-negative and Gram-positive bacteria. Instead of one universal and conserved assembly pathway, our results highlight that Eut BMC assembly follows various mechanisms across different bacterial species.
Eut BMC undergoes a shell-initiated assembly pathway
To delineate the de novo biogenesis of Eut BMCs, we conducted time-lapse live-cell fluorescence imaging to monitor the initial in vivo assembly of cargo and shell proteins. We expressed EutM-mCherry/EutB-sfGFP, EutK-mCherry/EutB-sfGFP, or EutB-mCherry/EutM-sfGFP in S. Typhimurium cells. After adding EA and B12 metabolites to induce expression of native Eut proteins, the first fluorescent spot of Eut shell proteins (indicated by EutM-mCherry, EutK-mCherry, or EutM-sfGFP) emerged from the cytosolic distribution fluorescence background after ~30 min of induction. For comparison, Salmonella cells growing on solid media under the microscope exhibited localization patterns similar to those seen in liquid media (Fig. 5A and fig. S30), indicating that the observed localization patterns accurately reflect the authentic in vivo organization of Eut BMCs. Intriguingly, the assembly of shell proteins always appeared before the assembly of cargo proteins (Fig. 5A and fig. S31, A and B). We observed that the first shell assembly tended to occur at the cell pole and along the cell membrane (Fig. 5B). The time interval between the first shell and cargo assembly was 2.4 ± 2.2 min (means ± SD, n = 48) (Fig. 5D); in 62.5% of cells, this interval was less than 3 min.

Following initiation of assembly, the fluorescence intensity of shell and cargo assemblies continued to increase and reached a steady state at ~6 to 7 min (Fig. 5C and fig. S32). Consistent with this, during the early stage (60 min) after EA induction, our proteomic analysis revealed a greater increase in the level of shell proteins (for example, EutM) compared to cargo proteins (such as EutBC) (fig. S4). Together, the results strongly support our proposed “shell-initiated” pathway, where Eut BMC biogenesis is initiated with the assembly of shell proteins followed by the nascent, partially formed shell assemblies serving as scaffolds for recruiting cargo enzymes and incorporating other shell proteins, ultimately leading to the formation of entire Eut BMCs. This pathway is distinct from a combination of the “shell first” and “cargo first” pathways of Pdu BMCs (ref. R23) (fig. S31, C and D).
Liquid-like association of cargo enzymes within the Eut BMC
The Eut BMC sequesters a series of cargo protein complexes within the shell. These cargo enzymes can oligomerize via interactions mediated by native EPs, whereas enzymes that lack EPs “piggyback” onto the enzymes containing EPs to achieve encapsulation. To understand the organizational properties of Eut BMCs, we determined the diffusion dynamics of the shell (indicated by EutM) and cargos (indicated by EutB) of Eut BMCs by fluorescence recovery after photobleaching (FRAP). Elongated structures of Eut BMCs formed in the ΔeutN strain were used for FRAP experiments and analysis (Fig. 5E and fig. S33). We found that cargo proteins showed a higher mobile fraction (58.5 ± 4.5%, n = 20) than shell proteins (12.8 ± 3.0%, n = 20) (Fig. 5F); the half-life of recovery of the cargos was 3.85 ± 2.0 min (n = 20). We conclude that the cargo proteins form a liquid-like condensate within the Eut BMC, consistent with the findings for α- and β-carboxysomes and for Pdu BMCs of S. Typhimurium (ref. R23, ref. R44, ref. R45). These findings suggest that oligomerization and condensation of cargo proteins are a widespread phenomenon that occurs within a diverse range of BMCs, which plays a crucial role in BMC assembly and in sustaining their structures and functions.
DISCUSSION
Eut BMCs are self-assembling proteinaceous organelles that mediate EA catabolism. Previous attempts using protein crystallization, growth analysis, and synthetic biology had only provided fragmented knowledge about the composition and function of Eut BMCs. The mechanism by which various protein components connect and self-assemble to create a complete and functioning entity remains unclear. Here, we have elucidated the organizational and assembly mechanisms of Eut BMCs in S. Typhimurium in unprecedented detail and propose a model for the biogenesis pathway of Eut BMCs (Fig. 6).

The assembly of the Eut BMC shell, triggered by EutM, initially occurs at the cell pole. Next, EutQ interacts with the interface of EutM and EutL to provide the binding sites for endogenous EPs of cargo enzymes, enabling cargo encapsulation. The binding mode between EutQ and EutM/L aligns with a model proposed by Aussignargues et al., in which EPs interact with an epitope created by the interactions of multiple shell proteins (ref. R42). It should be noted that tagging fluorescent proteins may potentially cause artifacts of BMC assembly. To minimize possible interference, low levels of fluorescently tagged Eut proteins were expressed from a pBAD plasmid without induction and were incorporated, together with nontagged Eut proteins, into Eut BMCs. To corroborate our fluorescence imaging observations, we conducted a comprehensive set of control experiments and comparative analyses and used complementary techniques, including thin-section EM, mass spectrometry, and growth assays.
Our data reveal that the Eut BMC undertakes the shell-initiated assembly pathway, distinct from the assembly pathways of Pdu BMCs in Salmonella (concomitant assembly of shell and cargo proteins; fig. S31) (ref. R23), and β-carboxysomes in cyanobacteria using conventional fluorescence imaging and EM (“cargo first” pathway) (ref. R46, ref. R47). The biogenesis of Eut BMC structures begins with the initial assembly of shell proteins. These newly forming, incomplete shell structures then serve as structural frameworks, facilitating the attraction of cargo enzymes and the integration of additional shell components during subsequent maturation stages. This stepwise process ultimately leads to the formation of fully intact and functional Eut BMCs. In Salmonella, the distinct assembly pathways of Eut BMCs and Pdu BMCs might reflect differences in the availability of the particular shell and cargo proteins. Our proteomic analysis reveals a greater fold increase in Eut shell proteins compared to cargo proteins following EA induction for 60 min (fig. S4). In addition, previous transcriptomic analysis revealed higher expression levels of genes encoding Eut shell proteins than those encoding Eut cargos, whereas genes encoding Salmonella Pdu shell and cargo proteins were expressed at similar levels under anaerobic growth conditions (ref. R48). Moreover, the assembly of Eut BMC cargos occurs separately from the assembly of its shell as shell and cargo aggregates were spatially partitioned in the ΔeutQ and ΔeutL mutants (Fig. 2J). The independent assembly of shell and cargo proteins resembles the observations in Pdu BMCs (ref. R23). Together, our findings indicate that the shell and cargo assembly pathways of different metabolosomes are completely independent, and the temporal sequence of assembly of shell and cargo proteins varies considerably.
We found that the shell proteins, EutM, EutN, EutL, and EutK, were essential for the formation and function of Eut BMCs, but EutS was not required. We can draw parallels with the second Salmonella-encoded metabolosome, the Pdu BMC. PduU, the EutS homolog within the Pdu BMC, is not required for the assembly and function of Pdu BMCs (ref. R38, ref. R49, ref. R50). EutS appears to be well conserved among the isolates presented in fig. S33, suggesting that it may be under selective pressure and could play a role under more physiologically relevant conditions. EutM can self-assemble to form large protein filaments (ref. R51), which resemble the nanotube structures formed by the homologous PduA and PduJ proteins (ref. R52–ref. R54). On the basis of the fact that PduA and PduJ are the two major canonical shell proteins of Pdu BMCs (ref. R49), we propose that EutM is a major building component of the Eut shell. Our results also show that the pseudohexameric homotrimer EutL (ref. R55) is essential for the shell-cargo association of Eut BMCs (Fig. 2J). This is consistent with the function of its homolog, PduB, in the assembly of Pdu BMCs (ref. R23). EutK plays a role in the in vivo distribution of Eut BMCs, reminiscent of the role of PduK (with a disordered C-terminal extension) in determining the subcellular locations of Pdu BMCs (ref. R49). The C-terminal tail of EutK has a positively charged domain, which could potentially bind to nucleotides (ref. R56). This implies that EutK may function in a manner analogous to the ParB protein or associate with ParB, which plays a crucial role in chromosome segregation (ref. R57), to mediate Eut BMC positioning in Salmonella. Similarly, the McdAB system mediates the partitioning of carboxysomes in cyanobacteria and some proteobacteria (ref. R58, ref. R59). The molecular mechanism by which EutK governs the partitioning of Eut BMCs and interactions with nucleotides remains to be determined. Overall, our data indicate that Eut shell proteins perform similar functions to their counterparts in Pdu BMCs during the assembly of metabolosomes, suggesting that the specific functions of shell proteins we have identified may extend to a range of catabolic BMCs found in various bacterial hosts.
Our results demonstrate both the structural and catalytic roles of EutQ. The N terminus of EutQ links the shell and cargos of Eut BMCs, comparable to CcmN, CsoS2, and PduB1–37, the linkers proteins that mediate the binding of the shell and cargo enzymes of β-carboxysomes, α-carboxysomes, and Pdu BMCs, respectively (ref. R23, ref. R45, ref. R46). Recent studies have revealed that the interactions between CsoS2 C terminus and shell proteins occurred at the interface of shell proteins (ref. R29, ref. R60), in good agreement with our findings that EutQ could interact with the shell proteins EutL and EutM (Fig. 3I), whereas the N terminus of CsoS2 binds the cargo enzyme Rubisco in α-carboxysomes (ref. R45, ref. R61, ref. R62). We have shown that the H1/H2 region of EutQ N terminus specifically binds to the shell, whereas its H3/H4 region associates with cargo. Moreover, the C terminus of EutQ demonstrates acetate kinase activity and contains a cupin barrel fold domain. It is worth noting that many cupin proteins, including (S)-ureidoglycine aminohydrolase (ref. R63), are not associated with BMCs. We have found that the N terminus of EutQ is critical for the integration of the EutQ protein into Eut BMCs. Overall, EutQ serves as a multifunctional protein that integrates both structural and enzymatic roles within the BMC system.
In summary, our extensive live-cell super-resolution fluorescence imaging, structural, and biochemical studies provide unprecedented insights into the organization and biogenesis of Eut BMCs. We dissected the molecular mechanisms of the stepwise protein assembly and the hierarchical cooperation that governs the formation of structurally defined and functional Eut BMCs. This breakthrough advances our knowledge of catabolic BMC assembly and provides a molecular roadmap for reprogramming and modulating BMCs to improve the efficiency of metabolic reactions. The future identification of specific binding sites for BMC shell and cargo protein interactions could pave the way for the development of therapeutic interventions for a variety of infectious diseases that target the mammalian GI tract.
MATERIALS AND METHODS
Bacterial strains and growth conditions
The bacterial strains used in this study derived from S. enterica subsp. enterica serovar Typhimurium LT2 (S. Typhimurium LT2) (ref. R64). The rich medium was LB broth (Lennox) medium [tryptone (10 g liter−1), yeast extract (5 g liter−1), and sodium chloride (5 g liter−1)], and the minimal medium was no-carbon-E (NCE) medium supplemented with 1 mM MgSO4, 0.5% succinate, and 30 mM EA and 200 nM vitamin B12 (if applicable). The LB plate was prepared with agar (15 g liter−1), and the minimal medium plate was prepared with 2% (w/v) low–melting point agarose. Antibiotics were added to media as required at the following final concentrations: ampicillin at 100 μg ml−1, kanamycin at 50 μg ml−1, and gentamicin at 20 μg ml−1 in ddH2O, chloramphenicol at 25 μg ml−1 in ethanol, and tetracycline at 25 μg ml−1 in methanol.
Intracellular visualization of fluorescently tagged Eut proteins was conducted following the growth of the plasmid-carrying strains in minimal medium under aerobic conditions. An overnight LB culture was inoculated 1:100 in 100 μl of minimal medium in a 2-ml Eppendorf tube in the absence of EA and B12 shaken horizontally at 220 rpm overnight. Unless otherwise specified, 1 μl of this culture was subinoculated to 100 μl of minimal medium in a 2-ml Eppendorf tube, both in the absence and in the presence of EA and B12, shaken aerobically at 37°C for 5 hours. For birth event detection, 5 μl of the subinoculated culture (in minimal medium without EA and B12) was dropped onto the minimal medium plate in the presence of EA and B12 and incubated aerobically at 37°C for time-lapse imaging.
Construction of chromosomal mutations
All mutants in this study were generated by a scarless genome editing technique developed previously using the pEMG and pSW-2 plasmids (ref. R65). A pair of DNA fragments (~800 base pairs) flanking the chromosome regions of interest (ROIs) were firstly polymerase chain reaction (PCR) amplified and integrated into the linear pEMG suicide plasmid (digested by EcoRI and BamHI) by Gibson assembly (NEBuilder HiFi DNA Assembly kit) (ref. R66). The pEMG-derivative plasmids were mobilized from E. coli S17-1 λ pir to S. Typhimurium by conjugation. S. Typhimurium transconjugants that have integrated pEMG-derivative plasmids were selected on M9 minimal medium plates supplemented with kanamycin and 0.2% glucose. pSW-2, extracted from S. Typhimurium LT2, was subsequently transformed into the transconjugants by electroporation. Then, transformants were selected on LB plates with gentamicin and 1 mM m-toluate was added. Colonies were screened for kanamycin resistance and sensitive clones were verified by PCR. pSW-2 was lastly cured from the resulting strains by two passages in LB without adding gentamicin. Please note that, for the deletion of N terminus of EutQ, the first nine amino acids were kept due to an overlap of coding genes between eutP and eutQ. Strains and plasmids used in this study are listed in table S1. A complete list of primers can be found in table S2. All mutants were verified by PCR and DNA sequencing of PCR-amplified genomic DNA sequencing (fig. S6).
Construction of vectors
pBAD/Myc-His was used as a backbone for generation of fluorescence tagging vectors. pBAD/Myc-His was first digested by NcoI and HindIII. Then, PCR-amplified DNA fragments of individual eut genes, mCherry, and sfGFP were integrated into the linear vector by Gibson assembly (ref. R66). pXG10-SF containing a constitutive PLtetO-1 promoter was used as a backbone for complementation experiments (ref. R67). The PCR-amplified coding sequences of EutK/L/Q were inserted into linear pXG10-SF (obtained by PCR cloning) via Gibson assembly. Plasmids and primers used in this study were listed in tables S1 and S2, respectively. All constructed vectors were verified by PCR and plasmid sequencing.
Bacterial growth assays
The S. Typhimurium strains of interest were firstly inoculated from isolated colonies from LB plates into 5 ml of LB in 30-ml universal glass vials and then incubated at 37°C overnight with shaking at 220 rpm. The overnight cells were washed three times and then resuspended in M9 medium (supplemented with 2 mM MgSO4, 100 μM CaCl2, 30 mM EA, and 200 nM B12) to an optical density at 600 nm (OD600) of 0.01. A 250-μl culture was growing aerobically at 37°C in a 96-well microplate reader with shaking at 225 rpm, using the System Duetz technology platform (Growth Profiler 960, EnzyScreen). The readings of OD600 were taken every 30 min. At least three biological replicates of each growth curve were collected. Lag time was calculated by time to reach OD600 = 0.2. Nonlinear regression was used for calculation by the Gompertz growth model.
Transmission electron microscopy
The structures in the S. Typhimurium WT and mutant strains were visualized using thin-section EM. An overnight LB culture was inoculated 1:100 in 5 ml of minimal medium in a 30-ml universal glass vial in the absence of EA and B12 shaken at 220 rpm overnight at 37°C. A 500-μl culture was subinoculated to 50 ml of minimal medium in a 250-ml flask in the presence of EA and B12, shaken aerobically at 37°C for 5 hours. Bacterial samples were pelleted at 6000g for 10 min and fixed twice with 0.05 M sodium cacodylate buffer (pH 7.2) containing 2.5% glutaraldehyde using 100 W for 1 min (P1). Samples were embedded in 4% agarose and then stained three times with 2% osmium tetroxide and 3% potassium ferrocyanide using 100 W for 20 s (P2). The reduced osmium stain was then set using a 1% thiocarbohydrazide solution for 10 min. The second osmium stain was applied using P2 with 2% osmium tetroxide. The sample was made electron dense with 2% uranyl acetate incubated at 4°C overnight. Dehydration was run with a series of increasing alcohol concentrations (30 to 100%) before cell embedding in medium resin. Thin sections of 70 nm were cut with a diamond knife. Images were recorded by a Gatan Rio 16 camera, DigitalMicrograph software, and an FEI 120-kV Tecnai G2 Spirit BioTWIN transmission electron microscope.
Fluorescence microscopy
The bacterial cells were imaged using a ZEISS Elyra 7 with Lattice SIM2 super-resolution microscope. Cells were pelleted and washed twice with phosphate-buffered saline (PBS) buffer, followed by fixing with 4% paraformaldehyde solution (prepared with PBS buffer) and incubated at room temperature for 15 min before imaging. Images were captured in Lattice SIM mode using an SBS LP 560 dual camera beam splitter and a Plan-Apochromat 63×/1.4 oil DIC M27 objective. Two tracks (one for sfGFP imaging excited at 488 nm and the other one for mCherry imaging excited at 561 nm) detected by two cameras were used to avoid fluorescence cross-talk. Two cameras were always aligned using the microscope internal calibration pattern before imaging. Images were captured as 1280 x 1280 pixels at 16 bits. Super-resolution images were obtained by processing under SIM2 by Zen software. Additional imaging and processing parameters of light microscopy are listed in table S3.
To monitor the birth event and the progression of the shell and the cargo of Eut BMCs, 5 μl of bacterial culture in minimal medium in the absence of EA and B12 was dropped onto the minimal medium plate in the presence of EA and B12 and left to dry at 37°C. The agarose plate with cell patches was cut out and attached to a 35-mm glass-bottom dish and covered by a 0.17-mm glass coverslip. The images were taken at 0 min when cells just started to grow on the minimal medium plate in the presence of EA and B12 and at either 1- or 3-min intervals to track the assembly of Eut BMCs. Represented images were shown at different time points when important events occurred. The temperature was controlled at 37°C during imaging. All images were captured from at least three biological replicates. A custom Fiji (ref. R68) pipeline was developed to analyze multichannel fluorescence microscopy image stacks for quantitative and classification purposes. The time series images were preprocessed with bleach correction using an exponential decay fit method (ref. R69), and sequential time frames were registered to correct for stage jitter (ref. R70). Individual bacteria were cropped from the registered images and resaved for further analysis. A developed macro then generates a maximum intensity projection (MIP) and sums the signal of both channels. ROIs are identified using a global threshold (ref. R71) followed by connected component analysis (Analyze Particles, size 0.5 μm2-Infinity) and measured across all frames for both channels. The key intensity metrics—including maximum, mean, SD, and kurtosis—are extracted. The resulting measurements are compiled into structured CSV files, which serve as the primary input for downstream analysis in the KNIME Analytics Platform (version 4.7.8) (ref. R72). A secondary input consists of an Excel file, which includes manually determined frames for each channel where the first microcompartment appears. These two inputs are merged using the file name as a unique identifier. Subsequently, the data are split by channel (magenta and green), and a random forest classifier is trained on each dataset to categorize bacteria as exhibiting either homogeneous fluorescence or punctate signals. The training uses the four extracted intensity-based measurements and the identified first microcompartment appearance frame as input. The trained models are saved for use in classifying new data of similar type. The results are further processed to determine the time frame of the first microcompartment appearance for each channel and input file. The complete workflow, including the training data and trained models, is publicly available on the KNIME Community Hub (https://hub.knime.com/marie_held/spaces/Public/~FiLNvaHabPidBPpt/). Last, the extracted first frames of MC appearance are used to normalize and align the time series maximum intensity data to the penultimate frame preceding microcompartment appearance.
FRAP experiments were performed on a Zeiss LSM780 confocal microscopy as described previously (ref. R23). Five microliters of bacterial cells that grew in minimal medium in the presence of EA and B12 for 5 hours was dropped onto the minimal medium plate and allowed to dry before sandwiching with a 35-mm glass-bottom dish and a 0.17-mm glass coverslip. 100% laser power was applied to bleach a cross-sectional line across the center of the cell. Images were captured every 1 min for 60 min. Fluorescence profiles along the long axis of the cell were obtained by ImageJ software and normalized to the same total fluorescence. The mobile proportion (M) was given by
Experimental fluorescence recovery curves were obtained by plotting the fluorescence profile values at the center of the bleached area against time. Then, fluorescence recovery curves were fitted to a single exponential function, given by
Protein expression and isolation
The EPs EutC1–20 and EutE1–20 were synthesized by GenicBio. Synthetic gene for EutQ (residues 1 to 229; UniProt code: Q9ZFV5) was cloned into pET14b modified with a TEV protease site located before the protein gene sequence. The EutQ N-terminal domain fused with a soluble fusion partner, the immunoglobulin binding domain of streptococcal protein G (GB1) at its C terminus (EutQ1–99-GB1), EutQ C-terminal domain (EutQ100–229), and GB1 were cloned into the pETM11 expression vector. Hexahistidine Ni2+ affinity tag (His-tag) was positioned between EutQ1–99 and GB1. For MST, the eutM, eutQ and eutL genes of S. Typhimurium LT2 were inserted into pET-22b (+) with either a C-terminal his tag or a strep tag. Different pairs of primers containing restriction enzyme sites (Nde I and Xho I) and gene-specific sequences were used to amplify the gene fragment, and the gene fragment was ligated to the expression vector pET-22b (+) using a seamless cloning kit. The vector backbone and primers for Gibson assembly cloning are listed in tables S1 and S2.
Heterologous expression of recombinant EutQ, EutQ1–99-GB1, EutQ100–229, and GB1 were achieved by transforming BL21 (DE3) star cells with the relevant expression plasmid. Cells were plated and incubated overnight on LB agar containing either ampicillin (pET14b) or kanamycin (pETM11) at 37°C in a static incubator. Starting E. coli culture were prepared by inoculating 1 ml of LB broth containing the relevant antibiotics was inoculated with one colony picked from the prepared plates and incubating at 210 rpm for 5 hours at 37°C. For protein expression in minimal media, another starting E. coli culture were prepared by inoculating 20 ml of M9 media with 100 μl of primary culture in the presence of the relevant antibiotics and incubated overnight at 210 rpm and 37°C. M9 protein expression media for preparing the 15N and 13C isotopically labeled proteins were prepared with either isotopically labeled 15NH4Cl for 2D NMR or with both 15NH4Cl and 13C-glucose for 3D NMR studies. Expression culture was then made by adding 20 ml of starting culture into 1 liter of M9 media containing the relevant antibiotics and incubated at 37°C and 210 rpm until the OD600 of 0.7 was reached.
Protein expression was induced with 0.8 mM isopropyl-β-d-thiogalactopyranoside (IPTG). The cell culture was incubated overnight at 18°C and 180 rpm, and cells were harvested by centrifugation at 5000 rpm for 30 min at 4°C. Cell pellets of 2-liter culture were resuspended to 5% (w/v) in buffer A [50 mM Na2HPO4, 300 mM NaCl, 1 mM dithiothreitol (DTT), and 10% glycerol (pH 7.2)] supplemented with DNase1 (15 μg ml−1), stood in ice for 20 min, and lysed by one pass through the cell disruptor (Constant Systems, UK) at 19 kpsi (131 MPa) at 4°C. The lysate was centrifuged at 18,000 rpm and 4°C, and the supernatant was collected and passed through a preequilibrated 5-ml FF His-trap column (Cytiva, USA). The column was washed with 5 column volumes (CV) of 5% buffer B (buffer A + 500 mM imidazole) and eluted by a gradient increase in buffer B. Protein samples were further purified by size exclusion chromatography using an S200 Superdex 16/600 GE Healthcare column in a buffer containing 50 mM Na2HPO4,150 mM NaCl, and 10% glycerol (pH 7.2). These were aliquoted and stored at −80°C. Experimental protein samples were buffer exchanged into 20 mM Na2HPO4 and 20 mM NaCl (pH 6.5).
For MST, the gene expression plasmid construction was carried out in E. coli strain BL21(DE3) at 37°C in LB medium with ampicillin (100 μg ml−1). The BL21(DE3) constructs were cultured overnight at 37°C in LB medium with ampicillin (100 μg ml−1) to an OD600 of 0.8 to 1.0. EutQ and EutL expression were induced at 25°C, and EutM expression was induced at 37°C for 12 to 16 hours with 0.4 mM IPTG. The proteins were first purified with Ni2+-NTA resin (GE Healthcare, USA) or Strep-tag XT (IBA Lifescience, Germany), and then gel filtration chromatography was performed on a Superose 6 column or a Superdex G200 column with fast protein liquid chromatography (AKTA purifier 10, GE Healthcare, USA).
Proteomic analysis
Overnight LB cultures from four single colonies of S. Typhimurium LT2 were inoculated 1:100 in 50 ml of minimal medium in a 250-ml flask in the absence of EA and B12 shaken at 220 rpm overnight at 37°C. Five milliliters of overnight minimal medium cultures was pelleted, washed, and subinoculated to 100 ml of minimal medium in a 500-ml flask in the presence of EA and B12, shaken aerobically at 37°C for 15, 30, 60, and 300 min. Control samples are S. Typhimurium growing in the same condition but without EA and B12 for 0 and 300 min. Bacterial samples (24 samples in total) were pelleted at 6000g for 10 min and stored at −80°C for further analysis.
Samples were lysed in 250 μl of lysis buffer [1% SDS, 1% IGEPAL, 1% sodium deoxycholate, 125 mM NaCl, 5 mM EDTA, and 100 mM Tris buffer (pH 8)] and then boiled at 80°C for 10 min and sonicated using a probe sonicator at 30% amplitude for six cycles of 10-s on and 20-s off. Protein quantification was conducted by BCA assay. Fifty micrograms of protein was diluted to a total volume of 80 μl using 50 mM ammonium bicarbonate (AmBic). Cysteine reduction was performed by adding 5 μl of a DTT solution (11.1 mg/ml; to a final concentration of 4 mM) and incubating at 60°C for 10 min with 600 rpm. Subsequent alkylation was performed by adding 5 μl of a iodoacetamide (46.6 mg/ml; to a final concentration of 14 mM) and incubating for 30 min in the dark. Sixty nanograms of SP3 beads was added followed by 212 μl of ethanol to a total volume of 80% ethanol to bind the protein to the beads. The beads were mixed for 30 min at 1500 rpm. Samples were placed onto a magnetic rack and left to settle for 5 min. The eluent was removed. Samples were washed six times with 200 μl of ethanol, leaving the sample to stand for 5 min before removing the wash solution. In-solution digestion was carried out in 45 μl of 50 mM AmBic with 5 μl of trypsin (0.2 μg/μl) in 50 mM acetic acid and incubated at 37°C overnight with 1500 rpm. Samples were placed onto a magnetic rack and left to settle for 5 min. The solution was removed and retained. Samples were analyzed using an Evosep One liquid chromatography system (Evosep Biosystems, Odense, Denmark) coupled online to a TIMS ToF HT mass spectrometer (Bruker) using the built-in 15 SPD (samples per day) method. Four survey loading analysis was done to determine final loading. Samples were diluted with 0.1% formic acid. The proteomics data have been deposited to the ProteomeXchange Consortium via the PRIDE partner repository with the dataset identifier PXD059040.
NMR spectroscopy
Uniformly 15N labeled recombinant EutQ1–99-GB1 used was diluted to a final concentration of 100 μM 600-μl NMR buffer [20 mM Na2HPO4 (pH 6.5) and 20 mM NaCl] supplemented with 10% (v/v) D2O. Binding studies was done by mixing 15N-labeled EutQ1–99-GB1 or EutQ with unlabeled peptides (either EutC1–20 or EutE1–20) in a 1:5 molar ratio of EutQ:peptide. All 15N-H HSQC NMR experiments were acquired at 298 K on Bruker Avance III 700-MHz spectrometer equipped with TCI cryoprobe [1H, 15N, 13C] with frequency locked on 2H2O. Water suppression was achieved by excitation sculpting. Protein backbone resonance assignment of uniformly 15N, 13C-labeled 1 mM EutQ1–99-GB1 was achieved using 25% nonuniform sampling of three pairs of 3D triple-resonance experiments: CBCA(CO)NH/CBCANH, HBHA(CO)NH/HBHANH, and HNCO/HN(CA)CO. Data analysis was performed with CcpNMR Analysis version 3.1.1.
Microscale thermophoresis
MST assays were performed as previously described (ref. R73). Dyes were incubated with target proteins at a final concentration of 0.25 μM for 30 min at room temperature to label His-tagged target proteins. After mixing target proteins and twofold increased concentrations of nonlabeled legend proteins, the samples were drawn into 16 glass capillaries and were measured using a Monolith NT.115 instrument (Nano Temper Technologies GmbH, Munich, Germany) at 25°C with 60% excitation power and medium light-emitting diode power. For MST analysis of EutM and EutL, the EutM-his and EutL-strep samples were 0.25 and 24 μM, respectively. The signal-to-noise ratio of MST analysis was 23.00. For MST analysis of EutM-his and EutQ-strep, the EutM and EutQ samples were 0.25 and 108 μM, respectively. The signal-to-noise ratio was 28.15. For MST analysis of EutQ-his and EutL-strep, the EutQ and EutL samples were 0.25 and 24 μM, respectively. The signal-to-noise ratio was 23.05. Please note that 24 μM EutL was used as higher concentrations resulted in protein precipitation. The Kd was calculated and fitted using the Nano Temper Analysis software.
Isothermal titration calorimetry
Malvern Microcal Peaq Automated fitted with its proprietary software (MicroCal PEAQ_ITC Automated Control Software) was used for the experiment. The instrument was set for 19 injections at 25°C, whereas reference power, initial delay, and stir speed were set for 6 μcal/s, 60 s, and 750 rpm, respectively. The cell and syringe contained 20 μM protein (EutQ/EutQ1–99-GB1/EutQ100–229/GB1) and 200 to 400 μM EPs, respectively. Peptide to buffer titration was used as the internal control. Both the protein and ligand were diluted in buffers containing 20 mM Na2HPO4 and 20 mM NaCl (pH 6.5). The start injection volume was 0.4 μl, whereas subsequent injections were 2 μl. MicroCal PEAQ-ITC Analysis Software was used for data evaluation.
MD simulations
To investigate the interactions between Helix3 and Helix4 of EutQ1–99 and EutC1–20, MD simulations were performed. To avoid substantial conformational influences from Helix1 and Helix2 of EutQ1–99, which might affect the binding site between Helix3/Helix4 and EutC1–20, only Helix3/Helix4 and EutC1–20 were modeled and simulated during the process. The MD simulations were performed using the Amber22 software package. The protein force field leaprc.protein.ff19SB was applied, and TIP3P water molecules were added within a 15-Å distance from the protein, creating a cubic box with dimensions of 69 Å by 78 Å by 64 Å. Sodium (Na+) and chloride (Cl−) ions were added to neutralize the system to a concentration of 150 mM, resulting in a solvated system containing 27,344 atoms.
Position restraints of 2 kcal/mol per square angstrom were applied to the backbone atoms of the protein, and energy minimization was conducted for 2000 steps (1000 steps of steepest descent followed by 1000 steps of conjugate gradient). Then, the restraints were removed, and another 2500 steps (1000 steps of steepest descent followed by 1500 steps of conjugate gradient) of energy minimization were performed. Afterward, under control of the Langevin thermostat, the system was gradually heated from 0 to 310 K over 500 ps, with position restraints of 2 kcal/mol per square angstrom applied to the backbone atoms. This was followed by an unrestrained equilibrium simulation for 5 ns under the NVT ensemble [constant number of particles (N), volume (V), and temperature (T)]. Last, a production MD simulation for 500 ns was performed under the NPT [constant number of particles (N), pressure (P), and temperature (T)] ensemble, where the Berendsen barostat maintained the pressure at 1 atm, and a 10-Å cutoff was applied for nonbonded interactions. Electrostatic interactions were calculated using the particle mesh Ewald method. All covalent bonds involving hydrogen atoms were constrained using the SHAKE algorithm.
Trajectory analysis was performed using the cpptraj module in AMBER22. Binding free energies between Helix3/Helix4 of EutQ1–99 and EutC1–20 were calculated using the Molecular Mechanics/Generalized Born Surface Area (MM/GBSA) method, and entropy was computed using the NMODE module.
Bioinformatics analysis
According to previous studies, bacteria from 10 genera were reported to contain the eut operon: Citrobacter, Clostridium, Enterococcus, Escherichia, Klebsiella, Listeria, Proteus, Salmonella, Shigella, and Yersinia (ref. R74, ref. R75). To screen the bacterial species that produce the Eut BMCs, the representative genomes of bacterial species from the 10 genera were downloaded from the RefSeq database (ftp://ftp.ncbi.nlm.nih.gov/genomes/). A protein BLAST database was made from the translated CDS of the representative genomes. The protein sequences of the eut operon from S. Typhimurium LT2 (RefSeq ID: ASM694v2) were queried against the BLAST database using BLASTp v2.5.0+ (ref. R76). According to the presence of the essential proteins including EutB, EutL, and EutM, 30 representative isolates were found to contain the Eut BMCs (data S1).
To infer the phylogenetic relationship of the 30 bacterial isolates, the core genes of the isolates were extracted using BUSCO v5.2.5 with the “bacteria_odb10” database. From the single-copy orthologs detected by BUSCO, 113 proteins were existing in all the genomes. The orthologs were extracted and then aligned using Mafft v7.475 (ref. R77). The alignments were then merged using Seqkit v0.15.0 and trimmed with trimAl (ref. R78). Fasttree v2.1.10 was used to infer the phylogenetic tree using the JTT + Gamma model. The tree was mid-rooted. The identity of Eut proteins summarized from the BLASTp result was visualized with the tree in iTOL (ref. R79). The sequences of EutQ, EutC, and EutE proteins were aligned by ESPript 3 (ref. R80).
The structural predictions of EutQ and EutG were generated by AlphaFold 3 (ref. R81). The sequence logo of EutQ was generated by WebLogo 3 (ref. R82, ref. R83). The alignment was generated using the EutQ sequence queried against the NCBI BLAST refseq-select-prot database. Sequences with similarity greater than 80% were selected, resulting in a total of 44 sequences.
References
- The phosphoglyceride composition of Gram-negative bacteria and the changes in composition during growth.. Biochim. Biophys. Acta, 1969. [PubMed]
- 2G. B. Ansell, J. N. Hawthorne, R. M. C. Dawson, Form and Function of Phospholipids (Elseiver, 1973).
- Periplasmic glycerophosphodiester phosphodiesterase of Escherichia coli, a new enzyme of the glp regulon.. J. Biol. Chem., 1983. [PubMed]
- Ethanolamine utilization in bacterial pathogens: Roles and regulation.. Nat. Rev. Microbiol., 2010. [PubMed]
- Evidence for the B12-dependent enzyme ethanolamine deaminase in Salmonella.. Nature, 1975. [PubMed]
- Enterohaemorrhagic Escherichia coli gains a competitive advantage by using ethanolamine as a nitrogen source in the bovine intestinal content.. Environ. Microbiol., 2011. [PubMed]
- Ethanolamine signaling promotes Salmonella niche recognition and adaptation during infection.. PLOS Pathog., 2015. [PubMed]
- Ethanolamine controls expression of genes encoding components involved in interkingdom signaling and virulence in enterohemorrhagic Escherichia coli O157:H7.. mBio, 2012. [PubMed]
- Bacterial microcompartment-mediated ethanolamine metabolism in Escherichia coli urinary tract infection.. Infect. Immun., 2019. [PubMed]
- Inflammation associated ethanolamine facilitates infection by Crohn’s disease-linked adherent-invasive Escherichia coli.. EBioMedicine, 2019. [PubMed]
- Intestinal inflammation allows Salmonella to use ethanolamine to compete with the microbiota.. Proc. Natl. Acad. Sci. U.S.A., 2011. [PubMed]
- Ethanolamine utilization contributes to proliferation of Salmonella enterica serovar Typhimurium in food and in nematodes.. Appl. Environ. Microbiol., 2011. [PubMed]
- Identification of Listeria monocytogenes genes contributing to intracellular replication by expression profiling and mutant screening.. J. Bacteriol., 2006. [PubMed]
- The 17-gene ethanolamine (eut) operon of Salmonella typhimurium encodes five homologues of carboxysome shell proteins.. J. Bacteriol., 1999. [PubMed]
- Comparative genomics of ethanolamine utilization.. J. Bacteriol., 2009. [PubMed]
- A catalog of the diversity and ubiquity of bacterial microcompartments.. Nat. Commun., 2021. [PubMed]
- Diverse bacterial microcompartment organelles.. Microbiol. Mol. Biol. Rev., 2014. [PubMed]
- A taxonomy of bacterial microcompartment loci constructed by a novel scoring method.. PLOS Comput. Biol., 2014. [PubMed]
- Protein stoichiometry, structural plasticity and regulation of bacterial microcompartments.. Curr. Opin. Microbiol., 2021. [PubMed]
- Bacterial metabolosomes: New insights into their structure and bioengineering.. J. Microbial. Biotechnol., 2021
- Advances in the bacterial organelles for CO2 fixation.. Trends Microbiol., 2022. [PubMed]
- Highly accurate protein structure prediction with AlphaFold.. Nature, 2021. [PubMed]
- Biogenesis of a bacterial metabolosome for propanediol utilization.. Nat. Commun., 2022. [PubMed]
- Autogenous regulation of ethanolamine utilization by a transcriptional activator of the eut operon in Salmonella typhimurium.. J. Bacteriol., 1992. [PubMed]
- Engineered protein nano-compartments for targeted enzyme localization.. PLOS ONE, 2012. [PubMed]
- Encapsulation of multiple cargo proteins within recombinant Eut nanocompartments.. Appl. Microbiol. Biotechnol., 2016. [PubMed]
- The EutQ and EutP proteins are novel acetate kinases involved in ethanolamine catabolism: Physiological implications for the function of the ethanolamine metabolosome in Salmonella enterica.. Mol. Microbiol., 2016. [PubMed]
- BMC Caller: A webtool to identify and analyze bacterial microcompartment types in sequence data.. Biol. Direct, 2022. [PubMed]
- Intrinsically disordered CsoS2 acts as a general molecular thread for α-carboxysome shell assembly.. Nat. Commun., 2023. [PubMed]
- Atomic-level models of the bacterial carboxysome shell.. Science, 2008. [PubMed]
- Vertex protein PduN tunes encapsulated pathway performance by dictating bacterial metabolosome morphology.. Nat. Commun., 2022. [PubMed]
- The pentameric vertex proteins are necessary for the icosahedral carboxysome shell to function as a CO2 leakage barrier.. PLOS ONE, 2009. [PubMed]
- Bacterial microcompartment shells of diverse functional types possess pentameric vertex proteins.. Protein Sci., 2013. [PubMed]
- Functional insights from structural genomics.. J. Struct. Funct. Genomics, 2007. [PubMed]
- Conserving a volatile metabolite: A role for carboxysome-like organelles in Salmonella enterica.. J. Bacteriol., 2006. [PubMed]
- Ethanolamine utilization in Salmonella typhimurium: Nucleotide sequence, protein expression, and mutational analysis of the cchA cchB eutE eutJ eutG eutH gene cluster.. J. Bacteriol., 1995. [PubMed]
- Engineering formation of multiple recombinant Eut protein nanocompartments in E. coli.. Sci. Rep., 2016. [PubMed]
- Structural insight into the Clostridium difficile ethanolamine utilisation microcompartment.. PLOS ONE, 2012. [PubMed]
- Integrative in vivo analysis of the ethanolamine utilization bacterial microcompartment in Escherichia coli.. mSystems, 2024. [PubMed]
- Structural and mechanistic investigations on Salmonella typhimurium acetate kinase (AckA): Identification of a putative ligand binding pocket at the dimeric interface.. BMC Struct. Biol., 2012. [PubMed]
- The alternative electron acceptor tetrathionate supports B12-dependent anaerobic growth of Salmonella enterica serovar Typhimurium on ethanolamine or 1,2-propanediol.. J. Bacteriol., 2001. [PubMed]
- Bacterial microcompartment assembly: The key role of encapsulation peptides.. Commun. Integr. Biol., 2015. [PubMed]
- A rapid method to attain isotope labeled small soluble peptides for NMR studies.. J. Biomol. NMR, 2003. [PubMed]
- Rubisco condensate formation by CcmM in β-carboxysome biogenesis.. Nature, 2019. [PubMed]
- Multivalent interactions between CsoS2 and Rubisco mediate α-carboxysome formation.. Nat. Struct. Mol. Biol., 2020. [PubMed]
- Biogenesis of a bacterial organelle: The carboxysome assembly pathway.. Cell, 2013. [PubMed]
- The bacterial carbon-fixing organelle is formed by shell envelopment of preassembled cargo.. PLOS ONE, 2013. [PubMed]
- An infection-relevant transcriptomic compendium for Salmonella enterica Serovar Typhimurium.. Cell Host Microbe, 2013. [PubMed]
- Decoding the stoichiometric composition and organisation of bacterial metabolosomes.. Nat. Commun., 2020. [PubMed]
- Genetic analysis of the protein shell of the microcompartments involved in coenzyme B12-dependent 1,2-propanediol degradation by Salmonella.. J. Bacteriol., 2011. [PubMed]
- Programming an orthogonal self-assembling protein cascade based on reactive peptide–protein pairs for in vitro enzymatic trehalose production.. J. Agric. Food Chem., 2022. [PubMed]
- Structural insights into higher order assembly and function of the bacterial microcompartment protein PduA.. J. Biol. Chem., 2014. [PubMed]
- Self-assembling shell proteins PduA and PduJ have essential and redundant roles in bacterial microcompartment assembly.. J. Mol. Biol., 2021. [PubMed]
- Structural insight into the mechanisms of transport across the Salmonella enterica Pdu microcompartment shell.. J. Biol. Chem., 2010. [PubMed]
- Structure of a bacterial microcompartment shell protein bound to a cobalamin cofactor.. Acta Crystallogr. F. Struct. Biol. Commun., 2014. [PubMed]
- Structure and mechanisms of a protein-based organelle in Escherichia coli.. Science, 2010. [PubMed]
- Multiple ParA/MinD ATPases coordinate the positioning of disparate cargos in a bacterial cell.. Nat. Commun., 2023. [PubMed]
- The McdAB system positions α-carboxysomes in proteobacteria.. Mol. Microbiol., 2021. [PubMed]
- Origin and evolution of carboxysome positioning systems in cyanobacteria.. Mol. Biol. Evol., 2020. [PubMed]
- Molecular principles of the assembly and construction of a carboxysome shell.. Sci. Adv., 2024. [PubMed]
- Structure and assembly of cargo Rubisco in two native α-carboxysomes.. Nat. Commun., 2022. [PubMed]
- Rubisco forms a lattice inside alpha-carboxysomes.. Nat. Commun., 2022. [PubMed]
- Structural and functional insights into (S)-ureidoglycine aminohydrolase, key enzyme of purine catabolism in Arabidopsis thaliana.. J. Biol. Chem., 2012. [PubMed]
- Complete genome sequence of Salmonella enterica serovar Typhimurium LT2.. Nature, 2001. [PubMed]
- Engineering multiple genomic deletions in Gram-negative bacteria: Analysis of the multi-resistant antibiotic profile of Pseudomonas putida KT2440.. Environ. Microbiol., 2011. [PubMed]
- Enzymatic assembly of DNA molecules up to several hundred kilobases.. Nat. Methods, 2009. [PubMed]
- Superfolder GFP reporters validate diverse new mRNA targets of the classic porin regulator, MicF RNA.. Mol. Microbiol., 2012. [PubMed]
- Fiji: An open-source platform for biological-image analysis.. Nat. Methods, 2012. [PubMed]
- Bleach correction ImageJ plugin for compensating the photobleaching of time-lapse sequences.. F1000Res., 2020. [PubMed]
- Software for bead-based registration of selective plane illumination microscopy data.. Nat. Methods, 2010. [PubMed]
- Image thresholding by minimizing the measures of fuzziness.. Pattern Recognit., 1995
- KNIME—The Konstanz information miner: Version 2.0 and beyond.. SIGKDD Explor., 2009
- Molecular interaction studies using microscale thermophoresis.. Assay Drug Dev. Technol., 2011. [PubMed]
- Genomic diversity of Salmonella enterica—The UoWUCC 10K genomes project.. Wellcome Open Res., 2020. [PubMed]
- Ethanolamine activates a sensor histidine kinase regulating its utilization in Enterococcus faecalis.. J. Bacteriol., 2008. [PubMed]
- BLAST+: Architecture and applications.. BMC Bioinformatics, 2009. [PubMed]
- MAFFT: A novel method for rapid multiple sequence alignment based on fast Fourier transform.. Nucleic Acids Res., 2002. [PubMed]
- trimAl: A tool for automated alignment trimming in large-scale phylogenetic analyses.. Bioinformatics, 2009. [PubMed]
- Interactive Tree Of Life (iTOL) v4: Recent updates and new developments.. Nucleic Acids Res., 2019. [PubMed]
- Deciphering key features in protein structures with the new ENDscript server.. Nucleic Acids Res., 2014. [PubMed]
- Accurate structure prediction of biomolecular interactions with AlphaFold 3.. Nature, 2024. [PubMed]
- Sequence logos: A new way to display consensus sequences.. Nucleic Acids Res., 1990. [PubMed]
- WebLogo: A sequence logo generator.. Genome Res., 2004. [PubMed]
