Decoding Order and Disorder in Proteins by NMR Spectroscopy
Abstract
Proteins often have a complex architecture, consisting of both globular ordered domains and intrinsically disordered regions (IDRs). These multidomain proteins pose challenges for traditional structural biology techniques. One major difficulty arises from the dynamic and flexible nature of IDRs, which lack a stable three-dimensional structure. Indeed, this feature further complicates the application of traditional structural biology techniques. Characterizing these systems is typically simplified by isolating individual domains, which can provide valuable insights into the structure and function of specific regions. However, this approach overlooks the interactions and regulatory mechanisms that occur between domains. To capture the full functional and structural complexity of multidomain proteins, it is crucial to study larger constructs. In this study, we focused on the CREB binding protein (CBP), a pivotal protein involved in numerous cellular processes. CBP is characterized by its modular structure, featuring alternating globular domains and IDRs. We specifically examined the TAZ4 construct, encompassing the TAZ2 globular domain and the ID4 flexible linker region. To characterize this multidomain system, we designed NMR experiments that take advantage of the dynamic differences between the two domains to obtain 2D and 3D spectra enabling the selection of the signals based on their nuclear relaxation properties. These experiments allowed the sequence-specific assignment of the TAZ4 construct to be extended revealing a crosstalk between the disordered region and the globular domain.
License: © 2025 The Authors. Published by American Chemical Society CC BY 4.0 Permits the broadest form of re-use including for commercial purposes, provided that author attribution and integrity are maintained (https://creativecommons.org/licenses/by/4.0/).
Article links: DOI: 10.1021/jacs.4c14959 | PubMed: 40223218 | PMC: PMC12022988
Relevance: Moderate: mentioned 3+ times in text
Full text: PDF (3.4 MB)
Introduction
CBP and its paralog E1A-binding protein (p300) are modular transcriptional coactivators that increase gene expression by binding to transcription factors that contain DNA binding domains and, in turn, can control the rate of transcription of genetic information. CBP activity is found in almost all known cellular functions, such as the decision to grow, differentiate, or lead to cell death by apoptosis;1−3 in order to fulfill the variety of these processes, consistent with their complexity, many intracellular factors, including transcription factors, nuclear receptors, and other coactivators, have been found to interact with CBP. The domain architecture of its 2,442 residues is reported in Figure A. There are seven domains that are able to fold independently, and four of them require zinc binding to stabilize their tertiary structures: the transcriptional-adaptor zinc-finger-1 (TAZ1) domain,4 the plant homeodomain (PHD),5 a zinc-binding domain near the dystrophin WW domain (ZZ),6 and the transcriptional-adaptor zinc-finger-2 (TAZ2) domain;7 the other folded domains are KID-binding domain (KIX),8 the bromodomain,9 and the histone acetyltransferase domain (HAT).10 The nuclear-receptor coactivator-binding domain (NCBD) is intrinsically disordered but undergoes synergistic folding on forming a complex with its partners.11 Furthermore, there are five disordered regions of different length, denoted ID (which stands for “Intrinsically Disordered”) followed by the sequential number of the disordered region in the primary structure, which account for about 60% of the sequence. The ID regions (IDRs) have always been defined just as flexible linkers of proteins that connect functional regions (globular domains), providing their appropriate spatial separation. They are now emerging as possible functional motifs themselves involved in binding with partners.12,13 In light of the increasing interest in the investigation of intrinsically disordered proteins, in recent years three IDRs of CBP, namely IDR3,12 IDR414 and IDR5,15 have been investigated by NMR spectroscopy to determine their structural and dynamic features. Moreover, their interactions with potential partners/interactors have been characterized by NMR and other ancillary techniques such as X-ray scattering and mass spectrometry, demonstrating that the IDRs provide additional opportunities for CBP to orchestrate its function.12,13,15 With a high incidence of Pro (16%), Gln (15%), Ser (11%), Gly (9%), and Ala (9%) residues, these IDRs clearly exhibit the compositional bias typical of disordered regions.16−19 ID4, the 207-residue long linker placed between the TAZ2 domain and the NCBD domain, is the CBP region with the highest percentage of Pro residues (22%); its biased sequence properties, together with other peculiar characteristics of IDRs contributed to render the sequence-specific assignment and the structural and dynamic characterization of ID4 quite challenging. This was recently achieved combining 1H and 13C direct detected NMR experiments,14 and the results showed that ID4 is highly flexible except for two regions encompassing residues 1852–1876 and 1952–1979, which have a high propensity to sample an α-helical conformation. The first helical region in the disordered fragment can be seen as the extension in the wild-type protein of the TAZ2 domain, whose structure has a peculiar fold composed of four alpha helices (α1, α2, α3, and α4) that are arranged in a helical bundle.7 The two partially populated helices of ID4 will thus be referred to as α5 and α6. Three zinc ions are coordinated by the TAZ2 domain, each bound to one histidine and three cysteine ligands in HCCC-type motifs.7 The zinc ions play a crucial role in stabilizing the TAZ2 helical bundle and maintaining its structure. They also contribute to the formation of the TAZ2 binding surface, which is essential for protein–protein interactions. We would like now to extend our understanding of the interplay between TAZ2 and ID4 by studying a novel construct comprising both regions, counting 295 amino acids (TAZ4). To this end novel NMR tools are presented to address the spectral complexity deriving from the simultaneous presence of ordered and disordered domains in the protein construct of interest.

Results
The two domains of this construct exhibit distinct structural and dynamic features that are reflected on the NMR observables. The 2D HN spectrum of TAZ4 reported in Figure B shows two sets of signals: narrow and intense peaks with 1H resonance frequencies between 7.8 and 8.7 ppm (deriving from the ID4 region), and broader peaks that fall in a much wider range of chemical shifts (deriving from the TAZ2 domain). The spectrum is very crowded and many peaks of TAZ2 are hidden by the more intense ID4 peaks.
The resonances of the amino acids in the disordered region can be selectively observed with a 13C-detected 2D CON experiment, such as the one reported in Figure C. Indeed, the long coherence transfer delays (tuned to 1/21JCN), intrinsic to the experiment, act as a transverse relaxation filter: signals of the residues in the globular domain are characterized by much faster transverse relaxation with respect to those of the highly flexible disordered regions and thus their intensity is greatly reduced, up to complete disappearance of the cross peaks. CON spectra thus reveal in a very clean way signals of the intrinsically disordered regions also when part of more complex protein constructs.20 Furthermore, the cross peaks in the spectrum are well resolved thanks to the narrow line widths (especially those of 15N) and to the long acquisition and evolution times used to build the direct and indirect dimensions, respectively.
In contrast, the selection of the signals of the globular domain, that should be accomplished through the 2D HN spectrum, is more challenging, particularly in the spectral region where signals from the highly flexible disordered region are clustered. However, it is possible to enhance the readability of the spectrum by taking advantage of the different relaxation properties of the nuclei of the two domains (TAZ2 and ID4). The implementation of transverse relaxation filters would select signals of the highly flexible region at the expense of those of the globular domain. Conversely, it is necessary to design a novel approach that enables the opposite selection to discriminate the signals of residues belonging to the more ordered regions with respect to the signals of the disordered ones. To this end we focused on 1H longitudinal relaxation aiming to exploit one of the properties that take advantage of the increased local correlation time typical of globular ordered domains: longitudinal relaxation enhancement upon selective perturbation of a subset of signals (amide protons in the present case).
A Relaxation Filter Based on Longitudinal Relaxation Enhancement (LRE)
Figure A shows a series of 1D experiments in which a band selective inversion recovery block on amide protons has been implemented at the beginning of the pulse sequence (Figures B and S1). Perturbation of a selected subset of nuclear spins promotes longitudinal relaxation enhancement (LRE) in proteins, which increases with increasing local correlation time under conditions of low solvent exchange.21 Fine tuning of the recovery delay (d7) enables the discrimination of the TAZ2 1H signals, which recover faster and result in positive signals, from the ID4 signals that exhibit a slower recovery and are still negative as shown in Figure C. This relaxation filter can be implemented in all the pulse sequences needed for the characterization of TAZ4. In Figure A,B, the 2D 1H–15N HSQC inversion recovery spectra (2D IR-HN) acquired with different recovery delays are shown. The pulse sequence is described in detail in Figure S2. This experiment was used to acquire residue-resolved inversion recovery profiles for the amide protons under band selective inversion. As an example, two inversion recovery profiles of representative residues in the TAZ2 and ID4 regions of the construct (T52 and V154) are shown in Figure C (the results obtained in this way for the whole TAZ4 construct are shown in Figure S3). A very different behavior is observed for amide protons in the globular domain (fast longitudinal recovery), with respect to those of the disordered region (slow longitudinal recovery in the experimental conditions used in the present work in which the contribution from the solvent exchange is modest). This property is exploited to encode a difference in sign of the observed cross peaks by appropriate selection of the recovery delay. In the spectrum acquired with the recovery delay of 150 ms the signals deriving from the TAZ2 domain are positive, while the vast majority of those deriving from the ID4 domain are negative. Some fast-relaxing peaks that were difficult to observe in the conventional 2D HN spectrum are now clearly visible (Figure S4). The implementation of the IR filter results in a reduction in intrinsic experimental sensitivity. In general the selection of the most appropriate delay is guided by maximizing the intensities of the cross peaks of residues in the globular domain; the cross peaks originating from residues in disordered regions are generally very intense, therefore even pronounced reductions in intensity caused by the IR filter can be tolerated, as shown in a few examples in Figure S4. It is worth noting that the spectrum reported in Figure B acquired on the TAZ4 construct looks superficially similar to the overlay of the two different spectra reported in Figure B acquired on TAZ4 and ID4; importantly the former spectrum encodes the information on which domain the signals originate from, while still being part of the same protein construct. A further step was made by introducing the inversion recovery block before more complex experiments such as 3D HNCO and 3D HNCA experiments (3D IR-HNCO and 3D IR-HNCA respectively), using the BEST variant.22 The 3D IR-HNCO spreads the cross peaks in an additional indirect dimension (13C’) and can be used to resolve possible overlaps in the 2D IR-HN spectra still encoding the sign of cross peaks linked to local motional properties. The 3D IR-HNCA, a useful experiment for the sequence specific resonance assignment, is shown as an example in Figure D,E by reporting two 1HN–13Cα slices of the 3D spectrum extracted at 123.4 and 122.8 ppm in the 15N dimension. The pairs of signals associated with each 1HN resonance (1HNi) allow to identify the resonances of two neighboring 13Cα nuclear spins (13Cαi, 13Cαi-1) in the sequence providing information for sequence specific assignment; their sign, encoded through the IR block inserted at the beginning of the pulse sequence, provides clean information about the local motional properties facilitating the sequence specific assignment process. In this version of the experiment, the sign of the signals constitutes additional information that suggests the domain of origin of the residue from which the peaks derive. The pulse sequences of all the inversion recovery experiments can be found as part of the Supporting Information. These experiments effectively complemented a set of NMR spectra, including both 1H-detected 3D HNCO,23−25 3D HN(CA)CO,25,26 3D CBCANH,27 3D CBCA(CO)NH,28,29 3D (H)N(CA)NNH,30 3D (H)N(COCA)NNH,31−33 and 13C-detected 2D CACO,34 2D CBCACO,34 2D CCCO,35 and a 3D CBCACON,36 to obtain the sequence specific assignment of the TAZ4 resonances, including the Zn-ligands. Indeed, the Zn(II)-bound cysteine residues could be recognized based on the typical chemical shift of the 13Cβ signals while the Zn(II)-bound histidine residues could be recognized by the typical pattern in a long-range 2D 1H-15NHSQC spectrum (Figure S5), that allows us to detect 2JHN correlations as indicated in the figure. This spectrum shows that three histidine residues are bound to the Zn(II) ions through the Nε2 while four histidine residues are free. Other than the long-range 2D 1H-15N HSQC (Figure S5), we acquired a 2D 1H-13C TROSY37 and 2D (HB)CB(CGCD)HD38 to obtain the assignment of the histidine residues side chains.


TAZ4 Construct Characterization
The dynamic properties of the TAZ4 construct were assessed determining the 15N relaxation rates and the heteronuclear 1H–15N NOEs (Figure A). 15N R1 values show an average higher value for the disordered region. 15N R2 and 1H–15N NOE values have very distinct patterns: for instance, the TAZ2 domain generally shows higher values than the ID4 domain. This is due to the higher flexibility of the IDR with respect to the globular region. The 15N relaxation values confirm the presence of the partially folded helices observed by Piai and coworkers for isolated ID4,14 also in this construct. These regions (highlighted in cyan in Figure A) are characterized by 15N R2 and 1H–15N NOE that are higher than the average values for the ID4 domain, two observations that indicate higher local rigidity.

Focusing on the TAZ2 domain, the NOE values are coherent with the structural and dynamic properties of a globular domain. The N-terminus of TAZ4 constitutes an exception, with smaller 1H–15N NOEs and 15N R2 values suggesting that being at the very beginning of the sequence these residues could be more flexible and less buried in the core of the globular domain as often observed for N-terminal and C-terminal residues within a protein construct. The rotational correlation time (τR) obtained experimentally for TAZ2 in the TAZ4 construct is 11.6 ns, a value considerably larger of that expected for a small globular protein.39,40 For comparison we calculated the predicted τR value for TAZ2 by using HYDRONMR,41 obtaining a value of 7 ns. These data suggest that the presence of ID4 influences the mobility of the TAZ2 domain, either through exchange contributions to R2 or through reduced and more anisotropic tumbling by the adjacent disordered domain. The τR value of the TAZ2 domain in the TAZ4 construct is still smaller than the one predicted for a protein as large as TAZ4 (with calculated τR values well above 15 ns), meaning that this domain still possesses some rotational freedom.
Changes of chemical shift of amide protons with temperature can give insights on the presence of intramolecular hydrogen bonds.421HN chemical shift differences at various temperatures were divided by the difference in temperature to obtain the temperature coefficient reported in Figure B as a function of the residue number. The horizontal black dashed line at −4.6 ppb/K represents the threshold above which the presence of H-bond is predicted with a probability of 85%.42 In the TAZ2 domain there is a modulation of the values of temperature coefficients that is expected from a fold constituted by α-helices connected through several external loop regions. This modulation derives from the different length of the hydrogen bonds that the amide protons form with surrounding residues.42 On the other hand, the disordered regions show uniformly negative values of temperature coefficients (below −4.6 ppb/K) that indicate the absence of intramolecular hydrogen bonds. The regions 98–117 and 194–221 show values that are above the threshold. These results, in line with what was observed through 15N relaxation rates, confirm the presence of the two partially formed helices α5 and α6. To establish the extent of solvent exchange at the residue level we performed the CLEANEX43 experiment with four values of mixing time: 10, 20, 50, and 100 ms. As mentioned above when discussing the 15N relaxation rates, there is a clear difference between the two domains of the protein. Amide protons of the globular domain poorly exchange with the solvent protons, except those of residues in the N-terminal region and in the exposed loops (Figure A). Amide protons of intrinsically disordered regions, on the other hand, are well exposed to the solvent, as described by the large exchange rates measured with the CLEANEX experiment, with lower effects observed for the two partially populated α-helices (Figure B).

Are the Two Domains Independent?
The presence of two domains within the same construct could lead to changes in their structural and dynamic properties with respect to the isolated ones. To test whether this is the case for TAZ4 we acquired a 2D HN experiment of ID4 in the same conditions as TAZ4 to compare the chemical shifts of the isolated domain to that of the ID4 domain within the TAZ4 construct. Figure A shows the superimposition of the two spectra. Generally, the disordered residues show minor perturbations, while measurable changes are observed for the residues belonging to the partially populated helices (Figure S6A); intensity changes are reported in Figure S6B. Specifically, the peaks of the residues belonging to α5 broaden and shift, while the peaks of the residues of α6 shift with a less pronounced broadening. Many residues of the TAZ2 domain within the TAZ4 construct are also perturbed by the presence of ID4 (the four mutated cysteine residues as well as the contiguous residues were not considered in the comparison made with respect to the available chemical shift data BMRB entry 4789).7 In Figure B the structure color code is based on the magnitude of the chemical shift changes, increasing from white to red. Notably, the residues that change the most are those of α2 and α3 residues facing α4, and those at the beginning of α5. These changes, partly expected considering that this is the region connecting the two isolated domains, are quite large and extend more than just a few residues, indicating a substantial change in this region. On the bottom of the panel, the chemical shift differences are plotted against the residue number for the entire TAZ4 construct.

We calculated the Secondary Structure Propensity (SSP) for both TAZ4 and ID4 to investigate possible changes in the propensity to sample the helical conformation (Figure S7). In addition, we compared the 15N relaxation properties of TAZ4 (Figure S8) with those already published for isolated ID4.14 The results show that while α5 seems more stabilized by the presence of the folded domain; for α6, both the SSP and the 15N relaxation properties do not change much and are not correlated with a different propensity to form an α-helix. These observations suggest that the chemical shift variations are due to interactions between the two domains, involving amino acids belonging to the second helix of ID4. For comparison CLEANEX experiments were repeated on the isolated ID4 domain in the same experimental conditions (Figure S6C,D): interestingly larger solvent exchange effects are monitored in absence of the TAZ2 domain.
How Far Do Different Structural Models Reproduce NMR Observations?
Describing a complex protein construct as the one investigated in the present case is not an easy task, but it is important to visualize the results of experimental investigations through meaningful structural models. To this end, we compared our experimental results with the ensemble of conformers obtained through two different computational approaches: IDPConformerGenerator (IDPConfGen)44 and AlphaFold 3 (AF3)45 (Figure ). IDPConfGen allows us to visualize the conformational space sampled by TAZ4. For the calculation, the Local Disordered Region Sampling (LDRS) module46 was used. This module allows the modeling of the backbone of disordered regions that are next to a globular domain. The algorithm generated the ID4 conformers by taking the dihedral angles from a custom subset of PDB structures, sampling both loops and α-helices. Then, it connected these conformers to the C-terminus of TAZ2 without modifying its structure. IDPConfGen nicely shows the disorder (and high flexibility) of the ID4 domain in the TAZ4 construct, an aspect in agreement with our NMR experimental data (Figure A).

AF3 models the 3D structure starting from the sequence, building the whole construct. In these models, the TAZ2 domain is very similar to the X-ray structure of p300-TAZ247 (RMSD = 1.64 Å). α5 is modeled as fully populated and fused with α4. α6 is predicted to be fully populated too. In the crystal structure obtained by Miller and coworkers47 of the TAZ2 domain of p300, which shows an identity of 89.5% and a similarity of 94.7% with the corresponding domain of CBP (Figure S9), the fourth helix of the construct is extended to contain the first 23 residues of the ID4 region. Dyson and Wright48 proposed that this elongated helix is stabilized in the crystals by lattice contacts. The data reported in the present work suggests that in solution and in the presence of the entire ID4 region, this long helix is populated, even if the occurrence of conformational exchange makes the relevant NMR signals difficult to detect. Interestingly, AF3 predicts the existence of an interaction between α6 and the TAZ2 domain (Figure B), as suggested by the NMR data. We repeated the calculations considering TAZ2 and ID4 as separated domains and the software predicts the interactions also in this case, with different poses for the ID4 helix (Figure C). The lack of a well-defined interaction pocket suggested by AF3 is consistent with the chemical shift perturbations shown in Figure B. The picture that appears from this analysis is of a highly dynamic construct that samples a large conformational space, with the presence of a transient crosstalk between the folded domain and the disordered region.
Discussion
On IDR/TAZ2 Interactions
In the literature, several studies have explored the interactions between the TAZ2 domain and the activation domains of various disordered proteins (Figure D). TAZ2 has been shown to interact with the transactivation domains of p53,49 which is involved in tumor suppression and cellular stress responses. Additionally, TAZ2 interacts with the first activation domain of E2A,50 which is involved in the transcriptional activation during lymphocyte development, and with E1A oncoprotein,51 which is involved in the activation of the viral gene expression and in the deregulation of cellular processes. The interaction with signal transducer and activator of transcription 1 (STAT1) has also been reported.52 The interactions with all these partners typically involve the hydrophobic region localized at the interface of the first three α-helices of TAZ2. Even though these disordered proteins do not share a sequence homology, all of them fold upon binding when interacting with TAZ2, forming amphipathic helices. The cleft present on TAZ2 shows chemical shift changes when part of the TAZ4 construct, suggesting the presence of transient interactions with α6 that is constituted by both hydrophobic and negatively charged residues. The latter could thus play a role in protecting this interaction site until stronger binders, such as the examples mentioned above, are present. IDRs acting as regulatory effectors with autoinhibitory and/or allosteric mechanisms have been reported in literature.53,54 For instance, it has been shown that IDRs are involved in fine-tuning of enzymatic activity55 and DNA binding affinity56 through intramolecular interactions. In this framework, ID4 could modulate the interactions of TAZ2 with its protein partners. An example of regulation by competitive binding between two disordered protein regions has been described for the TAZ1 domain of CBP.57 A similar mechanism could exist for TAZ2, for which only proteins with a higher affinity than ID4 can successfully interact with the folded domain. Interactions involving disordered proteins occurs with different mechanisms58 including folding upon binding and the formation of fuzzy complexes.59 Our data suggest that the secondary structure properties of ID4 does not change substantially upon interaction with TAZ2, in contrast to what happens for the TAZ2 partners mentioned above. Further investigation is needed to characterize the regulatory mechanism that we suggest exists for ID4 in the context of the TAZ4 construct of CBP.
A Novel NMR Approach to Study Highly Flexible Multidomain Proteins
The complexity of NMR spectra of highly flexible multidomain proteins can be overcome by “filtering/editing” different subsets of signals on the basis of their NMR properties. In general, it is not difficult to select NMR signals originating from highly flexible regions, characterized by lower transverse relaxation rates with respect to NMR signals originating from globular domains. The smaller relaxation rates allow one to select disordered regions just by including a transverse relaxation filter that suppresses faster relaxing signals, such as those belonging to the globular domain itself. An elegant experiment that can be included in this category is the CON experiment (Figure C), in which the two coherence transfer elements naturally included in the pulse sequence can act as a filter. Thus, the CON experiment provides highly resolved NMR spectra that reveal in a clean way signals originating from the highly flexible regions also when they are part of more complex protein constructs (Figure C).20 The method proposed here, instead, exploits 1H longitudinal relaxation enhancement (LRE) through band-selective inversion recovery of amide protons21,60−62 to facilitate detection of the signals deriving from nuclei of residues in more structured portions of a protein. In our construct the most structured residues and the highly flexible ones are segregated in two different domains, even if some residues belonging to the TAZ2 domain are flexible (e.g., the loops) and some residues of the disordered region are quite rigid (e.g., the partially folded helices). This allows the almost complete selection of signals of the globular domain at the expense of those of the disordered region. The contribution to the 1H LRE deriving from 1H–1H cross-relaxation increases with increase of local effective correlation time and thus favors globular domains with respect to highly flexible disordered ones, in the experimental conditions used in the present work. Indeed, the exchange of amide protons with water is hampered at pH 5.5, thus minimizing the most efficient contribution to longitudinal recovery of amide protons in IDRs.61 Moreover, the active use of LRE permits a reduction of the experimental time. Specifically, when filtering/editing on the basis of longitudinal relaxation, long recovery delays are necessary to ensure complete recovery of the magnetization to the equilibrium state. These delays become shorter when exploiting longitudinal relaxation enhancement techniques. The approach, once tested through 1D experiments to assess its feasibility, was used to determine the inversion recovery properties of the signals deriving from the different protein domains (Figures B and S3). This provided a residue-specific measurement of the recovery profile of amide protons confirming the validity of the approach. Selection of a specific inversion recovery delay for which the polarization of the signals of the disordered domain still has negative values while those of nuclear spins in the globular domain are positive, allowed us to readily identify in simple 2D NMR spectra the signals with different relaxation properties and thus associate them to a specific protein domain. In some cases, it may be beneficial to record multiple data sets tailored to different sets of signals. Tests performed at different magnetic fields clearly show that the method is generally applicable with minimal adjustments (Figure S10). Should the interest shift mainly to the study of globular domains in multidomain protein constructs the delay can also be optimized to reduce as much as possible the intensities of signals of IDRs, a solution that also has the advantage of reducing T1 noise in the spectra. The method is easily adaptable to any multidimensional pulse sequence, such as the 3D HNCO and HNCA described here. The resulting 3D spectra have been shown to be critical in obtaining the sequence-specific resonance assignment of TAZ4, resulting in the sequence specific assignment of 90% of the expected signals, with only the first residues of ID4 missing.
It is worth comparing the approach presented here with other possible strategies useful for distinguishing ordered and disordered protein regions. The heteronuclear 1H–15N NOE experiment provides the first observables that allow to discriminate between regions of proteins characterized by very different local dynamics, as discussed above for TAZ4 and shown in Figures and S11. However, its low sensitivity makes it challenging to be implemented into more complex NMR experiments, such as 3D triple resonance experiments necessary for assignment purposes. The other observables that provide complementary information to discriminate between globular and disordered regions are transverse relaxation rates (15N R2, Figure ) as well as solvent exchange processes measured with experiments like the CLEANEX used for TAZ4 (Figure ). These observables can be easily used to select signals originating from disordered regions at the expense of those of the globular domain. The orthogonal selection, i.e., the selection of the signals of the globular domain, could in principle be obtained through difference spectra, that is subtracting signals of the disordered regions from spectra containing all signals. However, the sharpness of the signals of residues in disordered regions often prevents a clean selection of the broader signals of the globular one. A different strategy to discriminate between ordered and disordered protein regions, consists in varying two factors, temperature and pH, that strongly influence exchange processes with solvent protons. Solvent exposed residues undergo relevant changes under varying the external conditions (T, pH), often leading to signal broadening beyond detection, whereas residues within globular cores remain largely unaffected. On these grounds, increasing temperature and pH can be used to suppress signals from solvent exposed protein regions while maintaining signals from globular cores.63 The 2D HN NMR spectra of TAZ4, acquired under different temperature and pH conditions as shown in Figure S12, clearly demonstrate this trend. As expected, the signals from the disordered regions are reduced in intensity, although not completely. However, external loops of the globular domain also exhibit signal broadening beyond detection with increasing pH as shown in the insets of Figure S12. Moreover, the dependence of chemical shifts on these two parameters (T, pH) complicates the unambiguous identification of resonances under different conditions, often necessitating additional sequence-specific resonance assignment experiments to confirm assignments in different conditions (T, pH). This is particularly challenging when focusing on complex systems as multidomain proteins. The method proposed here provides a tool that not only avoids the need for reassignment of resonances but also aids in this process by distinguishing cross-peaks based on their sign, which depends on the dynamic properties of the residue which it originates from.
A comment is due on the effect of temperature and pH on the longitudinal relaxation enhancement filter proposed. It is well-established that in conditions in which exchange processes with the solvent are modest, the major contributions to longitudinal relaxation enhancement derives from 1H–1H dipolar cross relaxation rates, which in turn depend on the local correlation time, the underlying rational of the proposed approach. However, as solvent exchange processes become more effective, these also contribute to longitudinal relaxation enhancement and in extreme conditions they may become so efficient to reverse the situation: solvent exposed amides may become the signals that experience faster recovery with respect to signals of nonexchangeable protons. This could be in principle an orthogonal approach to achieve discrimination between two sets of signals. However, often many 1HN signals are broadened beyond detection in these conditions reducing the information content. Therefore, the initial screening to maximize 1HN signal detectability for the investigation of an heterogeneous protein also leads to optimal conditions for the implementation of the longitudinal relaxation enhancement filter proposed here. Since the two most relevant parameters that modulate solvent exchange rates (temperature and pH) can be independently changed, it is generally possible to find optimal conditions (Figure S13).
Longitudinal relaxation filters have been used since the early days of NMR, for example to investigate paramagnetic systems in which the nuclear spins in the proximity of the paramagnetic center experience additional contributions to relaxation with respect to the diamagnetic ones (both selective and nonselective inversion recovery experiments were used).64,65 In the present case we opted for a novel approach based on exploiting longitudinal relaxation enhancement effects to capitalize on the different relaxation behavior of ordered and highly flexible protein regions. The differences in local dynamics have been also used in MAS-solid-state NMR experiments to distinguish more rigid, ordered protein regions from highly flexible disordered ones by exploiting either cross-polarization or scalar couplings in NMR experiments.66,67
Recently, a T1-filtered DEER EPR pulse sequence has been proposed to deconvolute the distance distribution obtained from a mixture of monomeric and oligomeric proteins.68 This experiment presents an electron inversion recovery pulse followed by a variable recovery delay prior to the DEER sequence. The peak intensities in the distance distribution depend on the duration of the variable delay, thus it is possible to assign them to the different species present in the sample relying on their apparent T1.
Conclusions
In summary, this work used NMR spectroscopy to explore the multidomain protein TAZ4, with the primary goal of developing an experimental strategy for studying multidomain proteins. The designed approach provided critical information regarding the protein’s structure and dynamics, as well as key information for sequential assignment of this heterogeneous construct. The chemical shift perturbations observed when comparing the entire construct to the individual domains in isolation demonstrate the existence of an interaction between the two sections of the construct, and several other NMR observables aid in describing the intricate nature of these interplays. The methodology developed in this study holds potential for broader application in the investigation of other biologically relevant multidomain proteins, but also advanced our knowledge of TAZ4 properties, providing the groundwork for further exploration into the realm of CBP.
References
- R. H. Goodman, S. Smolik. CBP/P300 in cell growth, transformation, and development.. Genes Dev, 2000. [DOI | PubMed]
- K. J. McManus, M. J. Hendzel. CBP, a transcriptional coactivator and acetyltransferase.. Biochem. Cell Biol., 2001. [DOI | PubMed]
- A. Giordano, M. L. Avantaggiati. P300 and CBP: Partners for life and death.. J. Cell. Physiol., 1999. [DOI | PubMed]
- R. N. De Guzman, J. M. Wojciak, M. A. Martinez-Yamout, H. J. Dyson, P. E. Wright. CBP/P300 TAZ1 domain forms a structured scaffold for ligand binding.. Biochemistry, 2005. [DOI | PubMed]
- A. N. Plotnikov, S. Yang, T. J. Zhou, E. Rusinova, A. Frasca, M.-M. Zhou. Structural insights into acetylated-histone H4 recognition by the bromodomain-PHD finger module of human transcriptional coactivator CBP.. Structure, 2014. [DOI | PubMed]
- G. B. Legge, M. A. Martinez-Yamout, D. M. Hambly, T. Trinh, B. M. Lee, H. J. Dyson, P. E. Wright. ZZ domain of CBP: An unusual zinc finger fold in a protein interaction module.. J. Mol. Biol., 2004. [DOI | PubMed]
- R. N. De Guzman, H. Y. Liu, M. Martinez-Yamout, H. J. Dyson, P. E. Wright. Solution structure of the TAZ2 (CH3) domain of the transcriptional adaptor protein CBP.. J. Mol. Biol., 2000. [DOI | PubMed]
- F. Wang, C. B. Marshall, K. Yamamoto, G.-Y. Li, G. M. C. Gasmi-Seabrook, H. Okada, T. W. Mak, M. Ikura. Structures of KIX domain of CBP in complex with two FOXO3a transactivation domains reveal promiscuity and plasticity in coactivator recruitment.. Proc. Natl. Acad. Sci. U. S. A., 2012. [DOI | PubMed]
- P. Filippakopoulos, S. Picaud, M. Mangos, T. Keates, J.-P. Lambert, D. Barsyte-Lovejoy, I. Felletar, R. Volkmer, S. Müller, T. Pawson, A.-C. Gingras, C. H. Arrowsmith, S. Knapp. Histone Recognition and large-scale structural analysis of the human bromodomain family.. Cell, 2012. [DOI | PubMed]
- X. Liu, L. Wang, K. Zhao, P. R. Thompson, Y. Hwang, R. Marmorstein, P. A. Cole. The structural basis of protein acetylation by the P300/CBP transcriptional coactivator.. Nature, 2008. [DOI | PubMed]
- M. Kjaergaard, K. Teilum, F. M. Poulsen. Conformational selection in the molten globule state of the nuclear coactivator binding domain of CBP.. Proc. Natl. Acad. Sci. U. S. A., 2010. [DOI | PubMed]
- S. Contreras-Martos, A. Piai, S. Kosol, M. Varadi, A. Bekesi, P. Lebrun, A. N. Volkov, K. Gevaert, R. Pierattelli, I. C. Felli, P. Tompa. Linking functions: An additional role for an intrinsically disordered linker domain in the transcriptional coactivator CBP.. Sci. Rep., 2017. [DOI | PubMed]
- M. G. Murrali, I. C. Felli, R. Pierattelli. Adenoviral E1A exploits flexibility and disorder to target cellular proteins.. Biomolecules, 2020. [DOI | PubMed]
- A. Piai, E. O. Calçada, T. Tarenzi, A. del Grande, M. Varadi, P. Tompa, I. C. Felli, R. Pierattelli. Just a flexible linker? The structural and dynamic properties of CBP-ID4 revealed by NMR spectroscopy.. Biophys. J., 2016. [DOI | PubMed]
- S. Kosol, S. Contreras-Martos, A. Piai, M. Varadi, T. Lazar, A. Bekesi, P. Lebrun, I. C. Felli, R. Pierattelli, P. Tompa. Interaction between the scaffold proteins CBP by IQGAP1 provides an interface between gene expression and cytoskeletal activity.. Sci. Rep., 2020. [DOI | PubMed]
- J. Habchi, P. Tompa, S. Longhi, V. N. Uversky. Introducing protein intrinsic disorder.. Chem. Rev., 2014. [DOI | PubMed]
- A. K. Dunker, J. D. Lawson, C. J. Brown, R. M. Williams, P. Romero, J. S. Oh, C. J. Oldfield, A. M. Campen, C. M. Ratliff, K. W. Hipps, J. Ausio, M. S. Nissen, R. Reeves, C. Kang, C. R. Kissinger, R. W. Bailey, M. D. Griswold, W. Chiu, E. C. Garner, Z. Obradovic. Intrinsically disordered protein.. J. Mol. Graphics Modell., 2001. [DOI]
- J. C. Hansen, X. Lu, E. D. Ross, R. W. Woody. Intrinsic protein disorder, amino acid composition, and histone terminal domains.. J. Biol. Chem., 2006. [DOI | PubMed]
- V. N. Uversky. Introduction to intrinsically disordered proteins (IDPs).. Chem. Rev., 2014. [DOI | PubMed]
- M. Schiavina, L. Pontoriero, V. N. Uversky, I. C. Felli, R. Pierattelli. The highly flexible disordered regions of the SARS-CoV-2 nucleocapsid N protein within the 1–248 residue construct: Sequence-specific resonance assignments through NMR.. Biomol. NMR Assign., 2021. [DOI | PubMed]
- B. Brutscher, Z. Solyom, M. Mobli, J. C. Hoch. Chapter 1. Polarization-enhanced fast-pulsing techniques. In Fast NMR data acquisition: Beyond the Fourier transform., 2017. [DOI]
- P. Schanda, H. Van Melckebeke, B. Brutscher. Speeding up three-dimensional protein NMR experiments to a few minutes.. J. Am. Chem. Soc., 2006. [DOI | PubMed]
- S. Grzesiek, A. Bax. Improved 3D triple-resonance NMR techniques applied to a 31 KDa protein.. J. Magn. Reson., 1992. [DOI]
- J. Schleucher, M. Sattler, C. Griesinger. Coherence selection by gradients without signal attenuation: Application to the three-dimensional HNCO experiment.. Angew. Chem., Int. Ed., 1993. [DOI]
- L. E. Kay, G. Y. Xu, T. Yamazaki. Enhanced-sensitivity triple-resonance spectroscopy with minimal H2O saturation.. J. Magn. Reson., 1994. [DOI]
- R. T. Clubb, V. Thanabal, G. Wagner. A Constant-time three-dimensional triple-resonance pulse scheme to correlate intraresidue 1HN, 15N, and 13C′chemical shifts in 15N 13C-labelled proteins.. J. Magn. Reson., 1992. [DOI]
- S. Grzesiek, A. Bax. An efficient experiment for sequential backbone assignment of medium-sized isotopically enriched proteins.. J. Magn. Reson., 1992. [DOI]
- S. Grzesiek, A. Bax. Amino acid type determination in the sequential assignment procedure of uniformly 13C/15N-enriched proteins.. J. Biomol. NMR, 1993. [DOI]
- D. R. Muhandiram, L. E. Kay. Gradient-enhanced triple-resonance three-dimensional NMR experiments with improved sensitivity.. J. Magn. Reson., 1994. [DOI]
- R. Weisemann, H. Rüterjans, W. Bermel. 3D triple-resonance NMR techniques for the sequential assignment of NH and 15N resonances in 15N- and 13C-labelled proteins.. J. Biomol. NMR, 1993. [DOI | PubMed]
- C. Bracken, A. G. Palmer, J. Cavanagh. (H)N(COCA)NH and HN(COCA)NH experiments for 1H-15N backbone assignments in 13C/15N-labeled proteins.. J. Biomol. NMR, 1997. [DOI | PubMed]
- N. S. Bhavesh, S. C. Panchal, R. V. Hosur. An efficient high-throughput resonance assignment procedure for structural genomics and protein folding research by NMR.. Biochemistry, 2001. [DOI | PubMed]
- Z.-Y. J. Sun, D. P. Frueh, P. Selenko, J. C. Hoch, G. Wagner. Fast assignment of 15N-HSQC peaks using high-resolution 3D HNcocaNH experiments with non-uniform sampling.. J. Biomol. NMR, 2005. [DOI | PubMed]
- W. Bermel, I. Bertini, L. Duma, I. C. Felli, L. Emsley, R. Pierattelli, P. R. Vasos. Complete assignment of heteronuclear protein resonances by protonless NMR spectroscopy.. Angew. Chem., Int. Ed., 2005. [DOI]
- W. Bermel, I. Bertini, I. C. Felli, R. Kümmerle, R. Pierattelli. Novel 13C Direct Detection Experiments, Including Extension to the Third Dimension, to Perform the Complete Assignment of Proteins.. J. Magn. Reson., 2006. [DOI | PubMed]
- W. Bermel, I. Bertini, V. Csizmok, I. C. Felli, R. Pierattelli, P. Tompa. H-start for exclusively heteronuclear NMR spectroscopy: The case of intrinsically disordered proteins.. J. Magn. Reson., 2009. [DOI | PubMed]
- K. Pervushin, R. Riek, G. Wider, K. Wüthrich. Transverse relaxation-optimized spectroscopy (TROSY) for NMR studies of aromatic spin systems in 13 C-labeled proteins.. J. Am. Chem. Soc., 1998. [DOI]
- N. E. Christou, B. Brutscher. BEST and SOFAST experiments for resonance assignment of histidine and tyrosine side chains in 13C/15N labeled proteins.. J. Biomol. NMR, 2018. [DOI | PubMed]
- N. Tjandra, S. E. Feller, R. W. Pastor, A. Bax. Rotational diffusion anisotropy of human ubiquitin from 15N NMR relaxation.. J. Am. Chem. Soc., 1995. [DOI]
- N. Tjandra, P. Wingfield, S. Stahl, A. Bax. Anisotropic rotational diffusion of perdeuterated HIV protease from 15N NMR relaxation measurements at two magnetic fields.. J. Biomol. NMR, 1996. [DOI | PubMed]
- J. García de la Torre, M. L. Huertas, B. Carrasco. HYDRONMR: Prediction of NMR relaxation of globular proteins from atomic-level structures and hydrodynamic calculations.. J. Magn. Reson., 2000. [DOI | PubMed]
- T. Cierpicki, J. Otlewski. Amide proton temperature coefficients as hydrogen bond indicators in proteins.. J. Biol. NMR, 2001. [DOI]
- T.-L. Hwang, P. C. M. van Zijl, S. Mori. Accurate quantitation of water–amide proton exchange rates using the phase-modulated CLEAN chemical EXchange (CLEANEX-PM) approach with a fast-HSQC (FHSQC) detection scheme.. J. Biomol. NMR, 1998. [DOI | PubMed]
- J. M. C. Teixeira, Z. H. Liu, A. Namini, J. Li, R. M. Vernon, M. Krzeminski, A. A. Shamandy, O. Zhang, M. Haghighatlari, L. Yu, T. Head-Gordon, J. D. Forman-Kay. IDPConformerGenerator: A flexible software suite for sampling the conformational space of disordered protein states.. J. Phys. Chem. A, 2022. [DOI | PubMed]
- J. Abramson, J. Adler, J. Dunger, R. Evans, T. Green, A. Pritzel, O. Ronneberger, L. Willmore, A. J. Ballard, J. Bambrick, S. W. Bodenstein, D. A. Evans, C.-C. Hung, M. O’Neill, D. Reiman, K. Tunyasuvunakool, Z. Wu, A. Žemgulytė, E. Arvaniti, C. Beattie, O. Bertolli, A. Bridgland, A. Cherepanov, M. Congreve, A. I. Cowen-Rivers, A. Cowie, M. Figurnov, F. B. Fuchs, H. Gladman, R. Jain, Y. A. Khan, C. M. R. Low, K. Perlin, A. Potapenko, P. Savy, S. Singh, A. Stecula, A. Thillaisundaram, C. Tong, S. Yakneen, E. D. Zhong, M. Zielinski, A. Žídek, V. Bapst, P. Kohli, M. Jaderberg, D. Hassabis, J. M. Jumper. Accurate structure prediction of biomolecular interactions with AlphaFold 3.. Nature, 2024. [DOI | PubMed]
- Z. H. Liu, J. M. C. Teixeira, O. Zhang, T. E. Tsangaris, J. Li, C. C. Gradinaru, T. Head-Gordon, J. D. Forman-Kay. Local disordered region sampling (LDRS) for ensemble modeling of proteins with experimentally undetermined or low confidence prediction segments.. Bioinformatics, 2023. [DOI | PubMed]
- M. Miller, Z. Dauter, S. Cherry, J. E. Tropea, A. Wlodawer. Structure of the Taz2 domain of P300: Insights into ligand binding.. Acta Crystallogr., Sect. D: Biol. Crystallogr., 2009. [DOI | PubMed]
- H. J. Dyson, P. E. Wright. Role of intrinsic protein disorder in the function and interactions of the transcriptional coactivators CREB-binding protein (CBP) and P300.. J. Biol. Chem., 2016. [DOI | PubMed]
- A. S. Krois, J. C. Ferreon, M. A. Martinez-Yamout, H. J. Dyson, P. E. Wright. Recognition of the disordered P53 transactivation domain by the transcriptional adapter zinc finger domains of CREB-binding protein.. Proc. Natl. Acad. Sci. U. S. A., 2016. [DOI | PubMed]
- M. R. Lochhead, A. D. Brown, A. C. Kirlin, S. Chitayat, K. Munro, J. E. Findlay, G. S. Baillie, D. P. LeBrun, D. N. Langelaan, S. P. Smith. Structural insights into TAZ2 domain–mediated CBP/P300 recruitment by transactivation domain 1 of the lymphopoietic transcription factor E2A.. J. Biol. Chem., 2020. [DOI | PubMed]
- J. C. Ferreon, M. A. Martinez-Yamout, H. J. Dyson, P. E. Wright. Structural basis for subversion of cellular control mechanisms by the adenoviral E1A oncoprotein.. Proc. Natl. Acad. Sci. U. S. A., 2009. [DOI | PubMed]
- J. M. Wojciak, M. A. Martinez-Yamout, H. J. Dyson, P. E. Wright. Structural basis for recruitment of CBP/P300 coactivators by STAT1 and STAT2 transactivation domains.. EMBO J., 2009. [DOI | PubMed]
- T. Trudeau, R. Nassar, A. Cumberworth, E. T. C. Wong, G. Woollard, J. Gsponer. Structure and intrinsic disorder in protein autoinhibition.. Structure, 2013. [DOI | PubMed]
- P. Tompa. Multisteric regulation by structural disorder in modular signaling proteins: An extension of the concept of allostery.. Chem. Rev., 2014. [DOI | PubMed]
- M. Arbesú, M. Pons. Integrating disorder in globular multidomain proteins: Fuzzy sensors and the role of SH3 domains.. Arch. Biochem. Biophys., 2019. [DOI | PubMed]
- M. A. Pufall, G. M. Lee, M. L. Nelson, H.-S. Kang, A. Velyvis, L. E. Kay, L. P. McIntosh, B. J. Graves. Variable control of Ets-1 DNA binding by multiple phosphates in an unstructured region.. Science, 2005. [DOI | PubMed]
- R. B. Berlow, H. J. Dyson, P. E. Wright. Hypersensitive termination of the hypoxic response by a disordered protein switch.. Nature, 2017. [DOI | PubMed]
- A. S. Holehouse, B. B. Kragelund. The molecular basis for cellular function of intrinsically disordered protein regions.. Nat. Rev. Mol. Cell Biol., 2024. [DOI | PubMed]
- M. Fuxreiter, P. Tompa, M. Fuxreiter, P. Tompa. Fuzzy complexes: A more stochastic view of protein function. In. Fuzziness: Structural disorder in protein complexes,, 2012. [DOI]
- P. Schanda. Fast-pulsing longitudinal relaxation optimized techniques: Enriching the toolbox of fast biomolecular NMR Spectroscopy.. Prog. Nucl. Magn. Reson. Spectrosc., 2009. [DOI]
- T. Hošek, S. Gil-Caballero, R. Pierattelli, B. Brutscher, I. C. Felli. Longitudinal relaxation properties of 1HN and 1Hα determined by direct-detected 13C NMR experiments to study intrinsically disordered proteins (IDPs).. J. Magn. Reson., 2015. [DOI | PubMed]
- P. Schanda, B. Brutscher. Very fast two-dimensional NMR spectroscopy for real-time investigation of dynamic events in proteins on the time scale of seconds.. J. Am. Chem. Soc., 2005. [DOI | PubMed]
- E. O. Calçada, I. C. Felli, T. Hošek, R. Pierattelli. The heterogeneous structural behavior of E7 from HPV16 revealed by NMR spectroscopy.. ChemBiochem, 2013. [DOI | PubMed]
- T. Inubushi, E. D. Becker. Efficient detection of paramagnetically shifted NMR resonances by optimizing the WEFT pulse sequence.. J. Magn. Reson., 1983. [DOI]
- L. Banci, I. Bertini, C. Luchinat. Nuclear and electron relaxation. The magnetic nucleus-unpaired electron coupling in solution., 1993. [DOI]
- I. Matlahov, P. C. A. van der Wel. Hidden motions and motion-induced invisibility: Dynamics-based spectral editing in solid-state NMR.. Methods, 2018. [DOI | PubMed]
- K. Aebischer, M. Ernst. INEPT and CP transfer efficiencies of dynamic systems in MAS solid-state NMR.. J. Magn. Reson., 2024. [DOI | PubMed]
- T. Schmidt, N. Kubatova, G. M. Clore. Deconvoluting monomer- and dimer-specific distance distributions between spin labels in a monomer/dimer mixture using T1 -Edited DEER EPR spectroscopy.. J. Am. Chem. Soc., 2024. [DOI | PubMed]
