Extreme conformational diversity in human telomeric DNA J. Y. Lee*†, Burak Okumus†‡, D. S. Kim*, and Taekjip Ha‡§¶储 *Department of Physics, Seoul National University, Seoul 151-742, Korea; and ‡Center for Biophysics and Computational Biology, §Department of Physics, and ¶Howard Hughes Medical Institute, University of Illinois at Urbana–Champaign, Urbana, IL 61801 Edited by Kiyoshi Mizuuchi, National Institutes of Health, Bethesda, MD, and approved October 28, 2005 (received for review July 20, 2005)

DNA with tandem repeats of guanines folds into G-quadruplexes made of a stack of G-quartets. In vitro, G-quadruplex formation inhibits telomere extension, and POT1 binding to the singlestranded telomeric DNA enhances telomerase activity by disrupting the G-quadruplex structure, highlighting the potential importance of the G-quadruplex structure in regulating telomere length in vivo. We have used single-molecule spectroscopy to probe the dynamics of human telomeric DNA. Three conformations were observed in potassium solution, one unfolded and two folded, and each conformation could be further divided into two species, long-lived and short-lived, based on lifetimes of minutes vs. seconds. Vesicle encapsulation studies suggest that the total of six states detected here is intrinsic to the DNA. Folding was severely hindered by replacing a single guanine, showing only the shortlived species. The long-lived folded states are dominant in physiologically relevant conditions and probably correspond to the parallel and antiparallel G-quadruplexes seen in high-resolution structural studies. Although rare under these conditions, the shortlived species determine the overall dynamics because they bridge the different long-lived species. We propose that these previously unobserved transient states represent the early and late intermediates toward the formation of stable G-quadruplexes. The major compaction occurs between the early and late intermediates, and it is possible that local rearrangements are sufficient in locking the late intermediates into the stably folded forms. The extremely diverse conformations of the human telomeric DNA may have mechanistic implications for the proteins and drugs that recognize G-rich sequences. FRET 兩 G-quadruplex 兩 single molecule 兩 telomere 兩 vesicle encapsulation

E

ukaryotic chromosomes have a novel nucleoprotein structure at both ends called a telomere. Telomeres are indispensable for protection against genome degradation and have implications in cellular aging and cancer (1–3). In many organisms, a telomeric DNA is composed of tandem repeats of short DNA sequences. In vitro, this guanine-rich region forms G-quadruplex (4), which blocks the binding of telomerase (5). The potential importance of Gquadruplex structure was further implied by the recent demonstration in vitro that the human POT1, which binds to the singlestranded form of human telomeric DNA (6), facilitates the telomerase activity by disrupting G-quadruplex (7). Human disease helicases, BLM and WRN, also were shown to disrupt Gquadruplexes in vitro (8, 9). Because extension of the telomere is critical for generation and growth of cancer cells, the human telomeric G-quadruplex is considered a target for anticancer agents. Structure-based design of chemotherapeutic drugs would require an understanding of a range of conformations that can be adopted by the G-rich sequences, and indeed it was shown to be possible to design molecules that are specific to different types of G-quadruplex structures (10). Although there is no demonstration yet that the G-quadruplex forms in the cell for human telomeric DNA, recent studies of the telomeres of Stylonychia macronuclei indicated that G-quadruplexes can indeed form in vivo (11), and their formation is controlled by telomere end binding proteins in vivo (12). In addition, intracellular formation of the G-quadruplex 18938 –18943 兩 PNAS 兩 December 27, 2005 兩 vol. 102 兩 no. 52

was observed in the nontemplate G-rich sequence during transcription (13). Our goal here is to understand the dynamic structural properties of a G-quadruplex forming DNA using the unique capabilities of single-molecule spectroscopy (14–16) and thereby provide a firm basis for elucidating the interaction between the telomeric DNA and associated proteins. Crystal structures showed that the human telomeric DNA can form a parallel G-quadruplex structure in the presence of K⫹ ions (17). In the parallel structure, all four guaninecontributing edges run in the same direction. In contrast, the previously determined NMR structure in Na⫹ solution was in the antiparallel form (18). More recently, it was shown by NMR studies that parallel and antiparallel forms are in dynamic equilibrium at 100 mM K⫹ (19). Earlier single-molecule fluorescence resonance energy transfer (FRET) measurements of freely diffusing human telomeric DNA molecules detected two populations of differing FRET values (20). The two forms did not interconvert during the time a molecule spends in the excitation volume of a confocal microscope (⬍1 ms), and both conformations could be made into duplex structures in minutes by means of hybridization to a complementary strand, suggesting that the dynamic interconversion between different conformations may occur in the time window between 1 ms and a few minutes. To probe the dynamic properties of the telomeric DNA, we performed single-molecule FRET experiments on a similar construct (Fig. 1A) but with an added biotin for specific tethering to a quartz surface via biotin– streptavidin linker to extend the observation time. Folding of the DNA into the compact G-quadruplex structure is expected to yield smaller average distance between the donor (tetramethylrhodamine) and the acceptor (Cy5), hence to display higher FRET than the unfolded form (21). We observed interconversion between three different FRET values, which represent an unfolded and two folded (likely parallel and antiparallel) conformations. Each of the three is further divided into the long- and short-lived species, giving rise to a total of six states. We also observed exchanges between the long- and shortlived species. Kinetic analysis of interconversion pathways, vesicle encapsulation experiments, and mutant studies indicate that the remarkable conformational diversity is intrinsic to the DNA. We propose a model for the identities of these states and their reaction pathways and discuss biological implications on how the functions of telomere-specific proteins and drugs that target telomeric DNA could be modulated by the complex behaviors of the DNA as observed here. Materials and Methods DNA Preparation. DNA oligonucleotides were purchased from Integrated DNA Technologies (Coralville, IA). G-quadruplex Conflict of interest statement: No conflicts declared. This paper was submitted directly (Track II) to the PNAS office. Freely available online through the PNAS open access option. Abbreviations: LU, long-lived unfolded; LF1, long-lived folded 1; LF2, long-lived folded 2; SU, short-lived unfolded; SF1, short-lived folded 1; SF2, short-lived folded 2. †J.Y.L. 储To

and B.O. contributed equally to this work.

whom correspondence should be addressed: E-mail: [email protected].

© 2005 by The National Academy of Sciences of the USA

www.pnas.org兾cgi兾doi兾10.1073兾pnas.0506144102

strand sequence is 5⬘-Cy5-(GGGTTA)3 GGG AGA GGT AAA AGG ATA ATG GCC ACG GTG CG-3⬘-biotin. For the mutant studies, (GGGTTA)3 GGG was replaced by (GGGTTA)2 GTGTTAGGG, where the underline denotes the change from G to T. Complementary stem strand sequence is 5⬘-CGC ACC GTG GCC ATT ATC CTT (amino-C6 dT)TA CCT CT-3⬘. The G-quadruplex strand includes the human telomeric repeat motif (bold-face type) at the end of which the acceptor dye (Cy5) is attached. Its 3⬘ end is modified by biotin for immobilization. The complementary stem strand is labeled with tetramethylrhodamine internally via amino-modified C6 dT (20). DNA was annealed with 2:1 mixture of the stem and the G-quadruplex strands by heating the mixture to 95°C and then cooling down to room temperature slowly. The overall configuration is shown in Fig. 1A. Single-Molecule and Bulk Experiments. Total internal reflection

fluorescence microscopy was used for single-molecule FRET imaging (22, 23) with 532-nm excitation and a back-illuminated electron-multiplying charge-coupled device camera (iXON, Andor Technology, South Windsor, CT). The quartz surface of the sample cell was successively treated with biotinylated BSA and streptavidin, and 50–200 pM of annealed DNA molecules was added. The same protocol has been used for single-molecule FRET studies of RNA and DNA without any surface-induced perturbations (22, 24). Two sets of data were obtained, one with 0.1-s integration time and 160-s duration and the other with 0.9-s integration time and 900-s duration. Imaging buffer [10 mM Tris䡠HCl, pH 7.5兾0.4% (wt/wt) glucose兾1% 2-mercaptoethanol兾0.1 g/ml glucose oxidase (Sigma)兾 0.02 mg/ml catalase (Sigma)] was used as the base buffer, and specified concentrations of KCl or NaCl were added. Sample temperature was controlled by using a water-circulating temperature controller (Neslab Instruments, Portsmouth, NH) and measured by using a thermocouple with an uncertainty of ⬇1°C (25). Measurements without the temperature control were performed at Lee et al.

room temperature (22°C). Single-molecule FRET efficiency was approximated by using 1兾(1 ⫹ ID兾IA) where IA and ID are the fluorescence intensities of the acceptor and the donor, respectively, after background subtraction and crosstalk correction (26). The negative values of the FRET efficiency are due to background subtraction from very low signal in the acceptor channel, giving rise to negative acceptor intensity. FRET efficiency in the singlemolecule histograms is obtained by averaging the first 10 data points of each molecule’s time trace. For the bulk kinetic data, we used the Cary Eclipse fluorometer (Varian) and excitation wavelength of 532 nm, and K⫹ concentration was increased by manually adding K⫹. Results One Unfolded and Two Folded Conformations. Single-molecule FRET efficiency histograms are shown in Fig. 1B for various K⫹ concentrations. The zero FRET peak is due to inactive or missing acceptors and henceforth is ignored. At 2 mM K⫹, three additional peaks are apparent (Fig. 1C), one at a high FRET efficiency (⬇0.8), another at an intermediate FRET efficiency (⬇0.6), and the third at a low FRET efficiency (⬇0.3). As K⫹ concentration increases, the low FRET state decreases in population. Depletion of the low FRET state also was observed with addition of Na⫹ only, but higher ion concentrations were required (⬇50% in the low FRET state at 30 mM Na⫹ vs. 2 mM for K⫹; see Fig. 6A, which is published as supporting information on the PNAS web site). Because monovalent cations are needed to induce the stable formation of Gquadruplex with K⫹ being more effective than Na⫹, we assign the low FRET state to the unfolded conformation (disordered singlestranded overhang) and the other two states to folded conformations (G-quadruplex structures). Only the low FRET peak was apparent without monovalent ions, consistent with this assignment. Because the buffering agent, Tris, may act as a cation and the low FRET peak could still be due to a particular folded conformation, PNAS 兩 December 27, 2005 兩 vol. 102 兩 no. 52 兩 18939

BIOPHYSICS

Fig. 1. Probing potassium-dependent conformations of human telomeric DNA with FRET. (A) Schematic diagram of DNA construct. The G-quadruplex strand and the complementary stem strand were annealed to form a duplex stem. The quadruplex is depicted in the antiparallel conformation, and the rectangles represent the guanines (yellow in syn and red in anti conformations). Biotin is used to immobilize the DNA to a streptavidin-coated quartz surface, and FRET between tetramethylrhodamine (donor) and Cy5 (acceptor) was measured. (B) Single-molecule FRET histograms as a function of K⫹ concentration. (C) An expanded view of FRET histogram at 2 mM K⫹ showing three nonzero FRET peaks, named U (unfolded), F1, and F2 (folded). Also shown is the fit by four Gaussians. (D) Single-molecule FRET histograms as a function of K⫹ concentration for a mutant sequence as indicated above the graph. (E) Time traces (0.1-s integration time) of donor and acceptor intensities and corresponding FRET from a single molecule at 2 mM K⫹ exhibit interconversion between U, F1, and F2. A transition to zero FRET at 65 s is due to Cy5 blinking. Note that the transition from F2 to F1 occurred through U.

Fig. 2. Extreme conformational diversity revealed by single-molecule dynamics. (A) Representative FRET time traces of single molecules at 2 mM K⫹ and room temperature (0.9-s integration time). Although some molecules show a single conformation for hundreds of seconds (bottom two traces, acceptor photobleaching events at 810 and 850 s), others show rapid fluctuations between U, F1, and F2 (third trace from the top). The top two traces show a mixed behavior in which switching events between the long- and short-lived species are observed. Segments of long- and short-lived species are marked by solid and dotted lines, respectively. (B) Fraction remaining in F2 after a given time. The result is fitted by a double-exponential decay. (C) An example FRET time trace (0.1-s integration time) of a mutant sequence as indicated above the graph. The acceptor was photobleached at 115 s. (D) Bulk FRET efficiency, calculated as the ratio of the fluorescence intensity at the acceptor emission peak wavelength divided by the sum of the fluorescence intensities of the donor and the acceptor at their emission peak wavelengths, was measured as a function of time after adding 2 mM K⫹ at t ⫽ 0. The result is fitted by a double exponential.

we tested the neutral Hepes buffer (10 mM, pH 7.4) but observed only the low FRET state in the absence of cations (data not shown). We will use U to denote the unfolded conformation and F1 and F2 to denote the folded conformations with intermediate and high FRET values, respectively. We also studied a mutant sequence for which a guanine in the middle quartet is replaced by a thymine. As anticipated, Gquadruplex formation was significantly disrupted. Folded conformations were observed only at K⫹ concentration of ⬎100 mM (Fig. 1D), and no folding was observed with Na⫹ even at 1 M concentration (Fig. 6B). With Na⫹, the low FRET peak shifts to higher values as the ion concentration increases (Fig. 6) likely due to reduced electrostatic repulsion between phosphate groups of the DNA backbone that allows more compact conformations of unfolded ssDNA (27). A similar effect was observed up to 100 mM K⫹ for the mutant (Fig. 1D). The results below will be on the wild-type sequence unless specified otherwise. Transitions between all three FRET states (U, F1, and F2) could be observed within single molecules, most frequently in 2 mM K⫹ (Fig. 1E); 97%, 1,395 of 1,436, of transitions between F1 and F2 passed through U. This result is expected because a direct transition between two folded conformations without unfolding is highly unlikely considering the structural complexity of G-quadruplexes. This observation also makes it very unlikely that F1 and F2 are different only in the dye properties. Even though fluorescent labeling does have some effects in that it slightly increased the stability of the G-quadruplex in 100 mM K⫹ (20), distinct conformations seen here are not likely to be caused by mere changes in dyes because interconversion between them requires G-quadruplex unfolding. At low cation concentrations, raising temperature would readily 18940 兩 www.pnas.org兾cgi兾doi兾10.1073兾pnas.0506144102

disrupt G-quadruplexes and favor U. At 2 mM K⫹, U indeed becomes dominant as the temperature increases (see Fig. 7A, which is published as supporting information on the PNAS web site). At 100 mM K⫹, U does not appear significantly even at 45°C, but F1 becomes populated at the expense of F2 as the temperature increases (Fig. 7B). The dominance of the parallel over antiparallel conformations at elevated temperatures was observed in NMR studies of bimolecular G-quadruplex of human telomeric sequence (19). If the same is true for the unimolecular construct studied here, F1 would correspond to the parallel conformation and F2 to the antiparallel conformation. Circular dichroism measurements of the same DNA sequence and labeling also suggested that at room temperature and at 100 mM K⫹ the structure is mostly antiparallel (20), further supporting our assignment. Thus, our data suggest that the human telomeric DNA in physiological conditions (37°C, 140 mM K⫹) (28) is primarily folded with comparable populations of parallel and antiparallel conformations. Long- and Short-Lived Species. Fig. 2A shows five representative time

traces of FRET efficiency at 2 mM K⫹ (0.9-s integration time and lower excitation powers were used to extend observation time). From such long time traces (⬎15 min), two types of dynamic behavior are observed. Some molecules maintain their particular conformation for a long time, whereas others show rapid fluctuations between different conformations. Switching between these two distinct behaviors also was observed for some molecules. To confirm that this dichotomy is statistically significant, we analyzed the dwell times of each conformation. Fig. 2B shows the fraction of molecules remaining in F2 after time t because the molecule is observed to be in that conformation. The biphasic behavior is apparent and the data are well fitted by a double exponential decay Lee et al.

Fig. 3. Conformational distribution of various states. The relative populations of the folded (■) vs. unfolded (E) conformations (Top), antiparallel (■) vs. parallel (E) conformations (Middle), and long-lived (■) vs. short-lived (E) species (Bottom). The results are shown for various K⫹ concentrations at room temperature (Left), and for various temperatures at 2 mM K⫹ (Center) and 100 mM K⫹ (Right).

Conformational Diversity Is Intrinsic. As a further test, we used the vesicle encapsulation technique (29, 30) that confines each molecule in a volume smaller than the diffraction limit. This technique eliminates the possibility of surface artifacts while achieving a long observation time by immobilizing the vesicles. We encapsulated the same G-quadruplex construct, but lacking biotin, in small unilamellar vesicles (200-nm diameter), which in turn were tethered to a supported bilayer (see Fig. 9, which is published as supporting information on the PNAS web site). Control experiments showed that nonspecific binding of DNA to the membrane is undetectable. Vesicle-encapsulated molecules in 2 mM K⫹ showed all of the six states as well as interconversions between them (Fig. 9). Therefore, it is highly likely that the extreme conformational diversity observed here is not caused by surface tethering, but is intrinsic to the DNA. As an additional test, we performed bulk kinetic measurements in which 80 nM solution of the DNA was prepared in the imaging buffer without K⫹, and then FRET efficiency was measured as a function of time after addition of 2 mM K⫹ (Fig. 2D). Biphasic behavior was observed with lifetimes of 253 (⫾20) and 8.8 (⫾0.3) s, consistent with the existence of multiple states with markedly different lifetimes. We also note that the long-lived states were not observed for the mutant missing one guanine (Fig. 2C), suggesting that the two distinct lifetimes observed from the wild type are due Lee et al.

to the intrinsic variations in the conformational stability instead of being artifacts caused by fluorescent labeling. Equilibrium Effect of Kⴙ and Temperature. Fig. 3 shows how populations shift between different states as K⫹ concentration and temperature are varied. The six states are appropriately grouped in this presentation. For example, the folded population includes all four folded states (LF1, LF2, SF1, and SF2). All results were obtained by analyzing the time traces as described (see Supporting Text, which is published as supporting information on the PNAS web site). At room temperature, increasing K⫹ concentration induces folding (Fig. 3A). F1 is maximally populated at 2 mM K⫹ and is replaced by F2 as K⫹ concentration is further increased (Fig. 3B). The long-lived species is dominant at both low and high K⫹ concentrations and is less occupied at intermediate concentrations because of the appearance of the short-lived species (Fig. 3C). Overall, the DNA is primarily in the long-lived unfolded state (LU) at low K⫹ concentrations, fluctuates among the short-lived conformations at intermediate K⫹ concentrations, and finally folds primarily into the long-lived antiparallel state (LF2) when K⫹ concentration is further increased. Thus, the short-lived species appears to be a transient form on the pathway from LU to LF2. At 2 mM K⫹, the populations of U and short-lived species increase as the temperature increases to 37°C (Fig. 3 D–F). Upon further increase in temperature, LU dominates. At 100 mM K⫹, the molecules remain stably folded even at the highest temperature (Fig. 3G), but a shift in population from F2 to F1 was observed (Fig. 3H) as has been shown in the histograms (Fig. 7B). Switching Between Long- and Short-Lived Species. Next, we exam-

ined the transition rates between various states. Fig. 4 A and B shows how often the molecule switches between the long-lived (LU and LF) and short-lived (SX, X ⫽ U, F1, or F2) species. This process is slow, taking hundreds of seconds, but a significant number of molecules showed switching during observation which allowed us to extract the switching rates (see Supporting Text). Direct transitions between any two long-lived species are very rare (2%; 12 of 679 traces), whereas much more frequent transitions were observed between the long- and short-lived species (31%; 210 of 679 traces). PNAS 兩 December 27, 2005 兩 vol. 102 兩 no. 52 兩 18941

BIOPHYSICS

with decay times of 188 (⫾ 2) and 20 (⫾ 0.1) s, suggesting at least two different time scales for unfolding of F2. Similar biphasic behavior was observed for U and F1 (see Fig. 8, which is published as supporting information on the PNAS web site). Hence, we further classify the molecular conformations according to the dwell times: long-lived species if the dwell time is ⬎100 s and short-lived species if the dwell time is ⬍100 s. In total, we observed six distinct states: long-lived states (LU, LF1, and LF2) and short-lived states (SU, SF , and SF ) denoted by the superscripts. Although we assign the 1 2 long-lived folded states, LF1 and LF2, observed in high K⫹ concentrations to the parallel and antiparallel conformations as seen in high-resolution structural studies, the short-lived folded conformations, SF1 and SF2, are likely to be different in microscopic details from their long-lived counterparts, although their global folds may be similar.

overall dynamics because they bridge the long-lived states, prompting us to probe the detailed dynamics within the shortlived species. Transitions within the short-lived species are an order of magnitude faster than those between long- and shortlived species (Fig. 4 C and D). As K⫹ concentration is raised, the unfolding rate decreases relatively slowly, whereas the folding rate increases steeply (Fig. 4C). This asymmetry suggests the ion uptake is similar for the folded state and the transition state so that their free energies vary by a similar amount as ion concentration is varied (31). In contrast, the temperature increase affects the unfolding rate much more than the folding rate (Fig. 4D). Weakly temperature-dependent folding rate indicates that the enthalpic barrier is small and the folding barrier is largely entropic. Therefore, the transition state for folding must be much better ordered than the unfolded state. Overall, this analysis suggests that the folding transition state within the short-lived species is significantly ordered and has similar cation binding characteristics as the folded state. Fig. 4. Conformation transition rates between various states. (A and B) The rates of transition between the long- and short-lived species as a function of K⫹ concentration at room temperature (A) and as a function of temperature at 2 mM K⫹ (B) (E, transition from SF to LU; 䊐, transition from SU to LF; {, transition from LU to SF; and ‚, transition from LF to SU). Superscripts S and L denote short- and long-lived respectively. (C and D) The rates of folding (■) and unfolding (E) within the short-lived species as a function of K⫹ concentration at room temperature (C) and as a function of temperature at 2 mM K⫹ (D).

Thus, it is probable that interconversion between long-lived species require a visit to SX. Increasing K⫹ accelerates transitions from SX to LF and decelerates transitions from SX to LU (Fig. 4A). Higher temperature accelerates transitions from SX to LU and decelerates transitions from SX to LF (Fig. 4B). In contrast to the large effects on the rates of leaving SX, neither K⫹ concentration nor temperature has much influence on the rates of entering SX (Fig. 4 A and B). Folding and Unfolding Rates Within the Short-Lived Species. Data shown above suggest that the short-lived species determine the

Discussion Proposed Model for Conformational Diversity. Our single-molecule

measurements revealed the existence of distinct species that differ in kinetic stability and indicated that the human telomeric sequence possesses at least six different conformational states. Below, we propose a model for the identities of the six states and the reaction pathways that connect them (Fig. 5). The long-lived folded states, LF1 and LF2, were dominant in high K⫹ concentrations and were assigned to the parallel and antiparallel structures, respectively, based on comparison with NMR studies of temperature dependence (19). The crystal structure with K⫹ showed that a cation is placed at the cavity between two adjacent planes of G-quartets (17) with a total of two cations observed for the human telomeric sequence composed of three G-quartets. It is likely that LF1 and LF2 correspond to such well defined structures with stably bound cations between G-quartets. The long-lived unfolded state, LU, was highly populated at low K⫹ concentrations and displayed the characteristic salt dependence of a disordered ssDNA.

Fig. 5. Proposed model for conformational identities and reaction pathways. The cartoon for LU denotes a disordered conformation of the ssDNA, and LF1 and LF are depicted as the parallel and antiparallel G-quadruplex structures, respectively, holding K⫹ ions between G-quartets. The short-lived species (SX) include 2 SU, SF , and SF . The solid arrows represent the proposed reaction paths, and the dashed arrows point toward the states that are favored upon increasing K⫹ 1 2 concentration or temperature. 18942 兩 www.pnas.org兾cgi兾doi兾10.1073兾pnas.0506144102

Lee et al.

be a semiordered structure, or an intermediate toward folding, that may be specific to the G-rich sequence even though it cannot be distinguished from the fully disordered conformation of LU based on the FRET values alone. Likewise, SF1 and SF2 are much less stable than their long-lived counterparts. One possibility is that their reduced stability stems from incomplete cation binding between G-quartets. Although SF1 and SF2 must differ from LF1 and LF in microscopic detail, their similar FRET values suggest that 2 they may have the same global folds. If this assumption is indeed the case, we may speculate that SF1 and SF2 are the obligatory late folding intermediates for LF1 and LF2, respectively, as sketched in Fig. 5. Likewise, we propose that SU is the obligatory early folding intermediate between LU and folded conformations. In this model, local rearrangements of the late intermediates (SF1 and SF2), would be sufficient in locking the molecule into the stably folded Gquadruplex structures (LF1 and LF2). The major compaction occurs between SU and SF, that is, within the short-lived species (SX). We also observed that SX are required for transitions between the different long-lived species and that the temperature and K⫹ concentration primarily affect the rates of leaving SX but not the rates of arriving at SX. Therefore, despite being the minority species in the physiological solution conditions, the properties of the short-lived species are highly relevant in determining the overall structural properties of G-quadruplexes. Biological Significance. In vivo, interactions with proteins and other small molecules would certainly modify the equilibrium properties we observed from DNA only. If a protein recognizes the singlestranded telomeric DNA, as is the case for POT1 (6), the protein binding would have to wait until the G-quadruplex is unfolded. Because of the coexistence of both long- and short-lived folded species, only a subset of the DNA would bind the proteins quickly, whereas the rest of the DNA molecules would remain folded for much longer. Another possibility is that even the short-lived unfolded state, SU, may not be recognized by the protein. If so, multiple visits to SU may be necessary until an excursion to the fully disordered LU at which point the protein binds. Further singlemolecule studies may be able to clarify these situations. Proteins and drugs that bind to the folded G-quadruplex structures also would be affected by extremely diverse conformations seen here. We observed both fast (a few seconds) and slow (a few 1. 2. 3. 4. 5. 6. 7. 8. 9. 10. 11. 12. 13. 14. 15. 16. 17. 18.

Neidle, S. & Parkinson, G. (2002) Nat. Rev. Drug Discov. 1, 383–393. Neidle, S. & Parkinson, G. N. (2003) Curr. Opin. Struct. Biol. 13, 275–283. Zakian, V. A. (1995) Science 270, 1601–1607. Sen, D. & Gilbert, W. (1988) Nature 334, 364–366. Zahler, A. M., Williamson, J. R., Cech, T. R. & Prescott, D. M. (1991) Nature 350, 718–720. Lei, M., Podell, E. R. & Cech, T. R. (2004) Nat. Struct. Mol. Biol. 11, 1223–1229. Zaug, A. J., Podell, E. R. & Cech, T. R. (2005) Proc. Natl. Acad. Sci. USA 102, 10864–10869. Sun, H., Karow, J. K., Hickson, I. D. & Maizels, N. (1998) J. Biol. Chem. 273, 27587–27592. Fry, M. & Loeb, L. A. (1999) J. Biol. Chem. 274, 12797–12802. Rezler, E. M., Seenisamy, J., Bashyam, S., Kim, M. Y., White, E., Wilson, W. D. & Hurley, L. H. (2005) J. Am. Chem. Soc. 127, 9439–9447. Schaffitzel, C., Berger, I., Postberg, J., Hanes, J., Lipps, H. J. & Pluckthun, A. (2001) Proc. Natl. Acad. Sci. USA 98, 8572–8577. Paeschke, K., Simonsson, T., Postberg, J., Rhodes, D. & Lipps, H. J. (2005) Nat. Struct. Mol. Biol. 12, 847–854. Duquette, M. L., Handa, P., Vincent, J. A., Taylor, A. F. & Maizels, N. (2004) Genes Dev. 18, 1618–1629. Moerner, W. E. & Orrit, M. (1999) Science 283, 1670–1676. Weiss, S. (1999) Science 283, 1676–1683. Ha, T. (2004) Biochemistry 43, 4055–4063. Parkinson, G. N., Lee, M. P. & Neidle, S. (2002) Nature 417, 876–880. Wang, Y. & Patel, D. J. (1993) Structure (London) 1, 263–282.

Lee et al.

minutes) conformational exchanges, and any agent that recognizes only one structure [for example, parallel over antiparallel (11)] would act quickly on a subset of molecules, whereas the remaining populations would need to undergo conformational exchange before the agent can be effective. The conformational equilibrium can be further altered by the helicases that actively disrupt Gquadruplex structure (8, 9). The human telomere ends with 3⬘ overhang with ⬇100–200 nt (32), corresponding to 5–10 G-quadruplex forming units. Our studies used a 5⬘ overhang with a minimum sequence needed for G-quadruplex formation and should be relevant to any identical segments within the long 3⬘ overhang as long as one is not specifically concerned with a protein that recognizes the junction between the 3⬘ tail and duplex DNA. Because any four successive telomeric repeats can form a G-quadruplex, there can be one, two, and three telomeric repeats in the single-stranded form between adjacent G-quadruplexes. It is possible these single-stranded regions, which will be present even in G-quadruplex stabilizing conditions, can be the initiating sites for the action of telomereassociated proteins. Conclusions It is now well established that G-rich DNA can form a wide range of structures, depending on the sequence and conditions (10, 19, 33, 34). Our single-molecule investigation of G-quadruplexes derived from human telomeric DNA detected six distinct conformational states, which differ in static spectroscopic signals and兾or in their dynamic characteristics. The rates of inter- and intra-species transitions show that the short-lived species play an important role in the overall dynamics. The highly dynamic, short-lived species has a highly ordered transition state, which is stabilized by the binding of additional cations. This work reveals the extremely diverse conformations of G-quadruplex in its naked form and therefore establishes a basis for future biophysical studies of enzymes that interact with G-rich sequences. We thank Sean A. McKinney for writing the data acquisition program, Min Ah Seo for helping with the illustration of the proposed model depicted in Fig. 5, and Chirlmin Joo and Sungchul Hohng for experimental help. This work was supported by National Science Foundation Grant PHY-0134916, National Institutes of Health Grant GM065367兾 GM074526, and by a Cottrell Scholar award of Research Corporation. In Korea, this work was supported by the Korea Science and Engineering Foundation, the Ministry of Science and Technology, and the Ministry of Commerce, Industry, and Energy. 19. Phan, A. T. & Patel, D. J. (2003) J. Am. Chem. Soc. 125, 15021–15027. 20. Ying, L., Green, J. J., Li, H., Klenerman, D. & Balasubramanian, S. (2003) Proc. Natl. Acad. Sci. USA 100, 14629–14634. 21. Selvin, P. R. (2000) Nat. Struct. Biol. 7, 730–734. 22. Zhuang, X. W., Bartley, L. E., Babcock, H. P., Russell, R., Ha, T. J., Herschlag, D. & Chu, S. (2000) Science 288, 2048–2051. 23. Ha, T., Rasnik, I., Cheng, W., Babcock, H. P., Gauss, G., Lohman, T. M. & Chu, S. (2002) Nature 419, 638–641. 24. McKinney, S. A., Freeman, A. D., Lilley, D. M. & Ha, T. (2005) Proc. Natl. Acad. Sci. USA 102, 5715–5720. 25. McKinney, S. A., Declais, A. C., Lilley, D. M. J. & Ha, T. (2003) Nat. Struct. Biol. 10, 93–97. 26. Ha, T. (2001) Methods 25, 78–86. 27. Murphy, M. C., Rasnik, I., Cheng, W., Lohman, T. M. & Ha, T. (2004) Biophys. J. 86, 2530–2537. 28. Guyton, A. C. (1991) Textbook of Medical Physiology (Saunders, Philadelphia). 29. Boukobza, E., Sonnenfeld, A. & Haran, G. (2001) J. Phys. Chem. B 105, 12165–12170. 30. Okumus, B., Wilson, T. J., Lilley, D. M. J. & Ha, T. (2004) Biophys. J. 87, 2798–2806. 31. Bokinsky, G., Rueda, D., Misra, V. K., Rhodes, M. M., Gordus, A., Babcock, H. P., Walter, N. G. & Zhuang, X. (2003) Proc. Natl. Acad. Sci. USA 100, 9302–9307. 32. Makarov, V. L., Hirose, Y. & Langmore, J. P. (1997) Cell 88, 657–666. 33. Hazel, P., Huppert, J., Balasubramanian, S. & Neidle, S. (2004) J. Am. Chem. Soc. 126, 16405–16415. 34. Qi, J. & Shafer, R. H. (2005) Nucleic Acids Res. 33, 3185–3192.

PNAS 兩 December 27, 2005 兩 vol. 102 兩 no. 52 兩 18943

BIOPHYSICS

The microscopic nature of the short-lived species (SU, SF1, and SF ) is much less clear. SU is definitely different from LU in its much 2 shorter lifetime and is much more readily foldable. Thus, SU may

Extreme conformational diversity in human telomeric DNA

Dec 27, 2005 - yet that the G-quadruplex forms in the cell for human telomeric. DNA, recent studies of the ... Conflict of interest statement: No conflicts declared. This paper was ... ATT ATC CTT (amino-C6 dT)TA CCT CT-3. The G-quadruplex ...

552KB Sizes 6 Downloads 204 Views

Recommend Documents

Conformational diversity and protein evolution – a 60 ...
binding kinetics provided the first data indicating pre- existing isomers in .... duplication and mutation could then provide the increased genetic diversity to drive ...

MITOCHONDRIAL DNA MUTATIONS IN HUMAN DISEASE.pdf ...
MITOCHONDRIAL DNA MUTATIONS IN HUMAN DISEASE.pdf. MITOCHONDRIAL DNA MUTATIONS IN HUMAN DISEASE.pdf. Open. Extract. Open with.

Mitochondrial DNA in Ancient Human Populations of Europe
Seo et al., 1998. 96. East Asia. Japanese. Tanaka et al., 2004. 96. East Asia. Japanese. Horai et al., 1990. 111. East Asia. Japanese. Mabuchi et al., 2006. 124.

Mitochondrial DNA in Ancient Human Populations of Europe
A thesis submitted for the degree of Doctor of Philosophy at The ...... Abbreviations: A, adenine; aDNA, ancient DNA; B.C. , Before Christ; bp, base pair;.

Loss of Telomeric DNA during Aging of Normal and ...
The rate of telomere loss was calculated from the decrease in mean TRF length, as a function of donor .... old normal donors and from young and old DS pa-.