We now report results that should prompt a revival of the combinatorial approach to foldamers. We discovered a new macrocycle that folds into a complex but precisely defined secondary and tertiary structure, constructing itself out of 15 identical peptide-nucleobase subunits. Folding of the structure drives its synthesis in 95% yield from a mixture of interconverting molecules of different ring size in a one-step process. Single-crystal X-ray crystallography and NMR analyses reveal a new folding pattern based on an intricate network of hydrogen bonds and π-stacking interactions featuring long-range interactions.

Dynamic combinatorial chemistry (20,21) has been suggested as a useful selection tool for accessing such new folded structures. In brief, in a dynamic combinatorial library (DCL) building blocks react with each other to give rise to a mixture of oligomeric compounds that continuously exchange these building blocks between them. When a specific library member is able to form efficient intramolecular noncovalent interactions, inducing it to fold, this compound should be more stable than other library members that are unable to engage in such interactions. Hence, the library composition should shift in favor of the foldamer. Indeed, several groups have reported folding-driven changes in library compositions. (22−29) However, until now this approach has failed to deliver fundamentally new folding motifs. This lack of success is surprising, given that dynamic combinatorial methods have proven successful in the discovery of new and often unexpected host–guest systems, (30,31) interlocked structures, (31,32) and self-replicating molecules. (33) All of these systems rely, like foldamers, on noncovalent interactions as the prime selection criterion.

The approach taken to accessing new synthetic foldamers has until now relied almost exclusively on design, followed by multistep synthesis. An impressive new range of backbones have been developed that fold into a variety of well-defined architectures, including secondary structures such as helices and sheets. (19) Yet, the design approach tends to be based on relatively simple and small-range assembly motifs, primarily driven by interactions between residues close to each other in the oligomeric chain of monomer units. Foldamers that rely on long-range interactions (between residues further apart in the oligomeric chain, as observed in the folding of proteins and nucleic acids) have remained difficult to access due to a lack of reliable design rules. Hence, alternative approaches are needed for accessing fundamentally new classes of foldamers that rely on long-range interactions.

The chemistry of life relies on biopolymers (proteins, nucleic acids) folding into specific conformations that dictate their properties. It is generally believed that the complex folded structures encountered in biology are the result of millions of years of evolution. Much of the research on synthetic foldamers (1−13) is driven by the desire to bypass evolution, go beyond the constraints of using only nature’s building blocks, and directly access structures that fold like proteins, but are based on completely synthetic subunits. The ultimate goal is achieving new and sophisticated functions that require the molecular complexity of extended folded structures. This goal is still largely out of reach and foldamers able to exhibit specific function have remained rare, (14−16) due to the huge challenge of obtaining new modes of folding in designed proteins, (17,18) molecules that mimic peptides or nucleic acids, (5,8,9) and in completely abiotic molecules. (1−4,6,7,10−13)

Results and Discussion ARTICLE SECTIONS Jump To

1, which contains an amino-acid and a nucleobase subunit, both key structural elements involved in the folding of proteins and nucleic acids, respectively. We reasoned that the presence of these two types of subunits, which are central in nature’s folded macromolecules, should maximize the chances of accessing foldamers in our dynamic libraries. At the same time, and unlike nature, having both subunits present within a single building should give rise to fundamentally new foldamers. Building block 1 was synthesized in six steps as outlined in the 1 are dominated by small trimeric and tetrameric macrocycles (the dimer is too strained to form in significant quantities). Indeed, a small DCL made from a 50 μM solution of building block 1 in borate buffer (50 mM, pH 8.2) consisted mostly of cyclic tetramer (top trace in 1, as evident from mass spectrometric analysis of the relevant peak in the UPLC chromatogram ( We designed chimeric building block, which contains an amino-acid and a nucleobase subunit, both key structural elements involved in the folding of proteins and nucleic acids, respectively. We reasoned that the presence of these two types of subunits, which are central in nature’s folded macromolecules, should maximize the chances of accessing foldamers in our dynamic libraries. At the same time, and unlike nature, having both subunits present within a single building should give rise to fundamentally new foldamers. Building blockwas synthesized in six steps as outlined in the Supporting Information . The building block is equipped with two thiol groups, which, upon exposure to atmospheric oxygen, oxidize to give rise to an equilibrium mixture of different macrocyclic disulfides, (30) which interconvert through thiol–disulfide exchange ( Figure 1 a). Ordinarily, in the absence of any noncovalent interactions, DCLs of building blocks with the benzenedithiol core ofare dominated by small trimeric and tetrameric macrocycles (the dimer is too strained to form in significant quantities). Indeed, a small DCL made from a 50 μM solution of building blockin borate buffer (50 mM, pH 8.2) consisted mostly of cyclic tetramer (top trace in Figure 1 b; composition monitored by UPLC). However, repeating the same experiment at higher building block concentration (500 μM) led to the emergence of an unusually large macrocycle, consisting of 15 subunits of, as evident from mass spectrometric analysis of the relevant peak in the UPLC chromatogram ( Figure S20 ). Increasing the concentration of the building block or addition of NaCl (1.0 M) to the mixture enhanced the formation of the 15mer, accounting for up to 95% of the library material ( Figure 1 b, trace 4). Experiments with other salts revealed similar effects ( Figure S4 ), indicating that the exact nature of the salt is not important and that the formation of the 15mer benefits from a high ionic strength, suggesting that the salt acts to reduce charge repulsion within the 15mer. The 15mer is most likely the thermodynamic product, as partial reduction followed by reoxidation led to the almost quantitative re-formation of the 15mer ( Figure S6 ). Experiments in the absence of salt but in the presence of 50% acetone (bottom trace in Figure 1 b) or other cosolvents ( Figure S5 ) gave only negligible amounts of 15mer, suggesting that hydrophobic interactions are important in stabilizing this compound. These observations, together with the fact that the 15mer is only poorly retained on the UPLC column, suggest that the compound adopts a folded structure. This hypothesis was confirmed when we characterized its structure using tandem mass spectrometry, circular dichroism (CD) spectroscopy, X-ray crystallography, and NMR spectroscopy.

Figure 1 Figure 1. Selective formation of the 15mer. (a) Dynamic combinatorial library of amino-acid nucleobase building block 1. Arrows indicate two sets of π-stacks observed between phenyls and adenines. The remaining three stacks are not shown for clarity. (b) UPLC chromatograms (absorption at 254 nm) showing library compositions after 14 days of stirring (in 50 mM borate buffer, pH = 8.2) at a building block concentration of (1) 0.050 mM; (2) 0.50 mM; (3) 2.0 mM; (4) 0.50 mM in the presence of 1.0 M NaCl; and (5) 0.50 mM in the presence of 50% acetone. (c) CD spectrum of monomer 1 (dotted line) and 15mer (solid line) in water at 298 K.

The 15mer was isolated by preparative HPLC (isolated yield: 90%) and then analyzed by means of MALDI TOF/TOF. The results ( Figure S7 ) show that the 15mer fragments into smaller components, ranging from tetramer up to 12mer, indicating that the 15mer is a single macrocycle and not a system of interlocked rings (a catenane).

1 (1, located on the amino-acid residue. The CD spectrum of the 15mer showed an intense positive band at 260 nm, which can be attributed to the absorption of the aromatic dithiol (34) and the terminal adenine. Importantly, the CD signals of the 15mer are dramatically enhanced compared to those of monomer 1 Figure c), which suggests that the aromatic rings reside in a well-defined chiral environment, even though these rings are relatively remote from the chiral center in building block, located on the amino-acid residue.

i, i+2, and i+4 residues, while the capping adenines belong to the i–3 and the i+7 residues. Except for the adenines recruited for crystal packing (disrupting the otherwise 5-fold symmetry of the structure), this arrangement gets repeated five times in an interdigitated fashion (the second stack is indicated by a dotted line in i, i+2, and i+4), five hydrogen bonds are observed between the NH and CO groups ( Detailed insights into the structure of the 15mer were obtained from single-crystal X-ray diffraction data. Crystals of the 15mer were prepared by slow diffusion of acetone into an aqueous solution of the 15mer. The crystal structure ( Figure 2 ) confirms that the 15mer is a single giant macrocycle (35) linked together by 15 disulfide bonds to give a 75-atom ring. Figure 1 a shows the extended conformation of this ring for clarity, which, in reality, is collapsed into a compact but intricately folded structure. Figure 2 e shows the 75-atom ring in its highly twisted conformation. Overall, the structure is characterized by a hydrophobic core and presents its hydrophilic groups on its surface, similar to what is observed in folded proteins. The most notable structural motif is the stacking of aromatic rings. Five stacks of three phenyl rings can be identified (shown in Figure 2 f), capped with an adenine ring at the top and bottom (except where these adenines are recruited for crystal packing), as shown for one of these stacks in Figure 2 g. The distances between the three phenyl rings in these stacks are in the range of 3.43–3.45 Å, while the distances between the phenyl and adenine rings are somewhat larger (3.46–3.49 Å), but all in the range typical for π-stacking. (36) Interestingly, the rings that end up in the same stack are spaced far apart in the extended structure; as indicated in Figure 1 a, a stack is composed of the phenyl rings from the+2, and+4 residues, while the capping adenines belong to the–3 and the+7 residues. Except for the adenines recruited for crystal packing (disrupting the otherwise 5-fold symmetry of the structure), this arrangement gets repeated five times in an interdigitated fashion (the second stack is indicated by a dotted line in Figure 1 a; the other stacks are not shown for clarity). The five stacks of rings are arranged in a tiled fashion, as shown in Figure 2 f. Thus, not only can secondary structure elements be identified (corresponding to the stacks of rings), but since these stacks adopt well-defined orientations with respect to each other, also tertiary structure is present, which has only very recently been achieved in designed foldamers. (37) Apart from π-stacking interactions, also hydrogen bonds play an important role in stabilizing this folded structure. Between the building blocks that constitute the π-stack (residues+2, and+4), five hydrogen bonds are observed between the NH and CO groups ( Figure 2 h).

Figure 2 Figure 2. X-ray crystal structure of the 15mer. (a) Top view of the central cavity of the macrocycle. (b) Side view. (c) Top and (d) side view of the 15mer in space-filling representation. (e) The ring of disulfide bonds connecting the phenyl rings. (f) Core part of the foldamer, showing five stacks of three phenyl rings connected by disulfide bonds (two of the five sets of stacked phenyls are indicated by arrows in the extended structure shown in Figure 1a). (g) Top view of the 15mer highlighting one stack of three core phenyl rings and two adenines on the top and bottom of the stack. (h) Set of intermolecular hydrogen bonds formed between the three building blocks that constitute the stack of three phenyl rings. Solvent molecules, hydrogen atoms, and disorders are omitted for clarity. C atoms are shown in gray, N in purple, O in red, and S in yellow; except in panel f the C atoms of the macrocycle core are shown in light blue.

1H NMR spectrum ( 2 O, 298 K) is consistent with the X-ray crystal structure. The presence of sharp signals is in agreement with a highly ordered compact structure. In the spectrum the monomeric unit 1 appears in three separate sets of signals, indicative of a C 5 symmetry for the 15mer in solution. The signals of the protons of the phenyl rings are distributed over a wide range of chemical shifts (from 5.8 to 7.6 ppm). The large upfield shift of two of these (to 6.6 and 5.8 ppm, respectively, versus 7.2 and 7.3 ppm in monomer 1) indicates that some of them are adjacent to the face of an aromatic ring. 2D-NMR studies enabled the complete assignment of the spectrum of the 15mer as well as the assignment of several through-space interactions that are consistent with the X-ray crystal structure ( The solution-phaseH NMR spectrum ( Figure 3 a) of the isolated 15mer (DO, 298 K) is consistent with the X-ray crystal structure. The presence of sharp signals is in agreement with a highly ordered compact structure. In the spectrum the monomeric unitappears in three separate sets of signals, indicative of asymmetry for the 15mer in solution. The signals of the protons of the phenyl rings are distributed over a wide range of chemical shifts (from 5.8 to 7.6 ppm). The large upfield shift of two of these (to 6.6 and 5.8 ppm, respectively, versus 7.2 and 7.3 ppm in monomer) indicates that some of them are adjacent to the face of an aromatic ring. 2D-NMR studies enabled the complete assignment of the spectrum of the 15mer as well as the assignment of several through-space interactions that are consistent with the X-ray crystal structure ( Figure 3 b). These data indicate that, also in solution, the 15mer adopts a folded structure that is very similar to that observed in the X-ray crystal structure.

Figure 3 Figure 3. NMR and CD characterization of the 15mer. (a) 1H NMR spectrum of the 15mer (500 MHz, D 2 O, 298 K), with signal assignments corresponding to the labeling shown in Figure 1a. (b) Repeating unit of the 15mer with observed NOEs. (c) Changes in the Cotton effect intensities at specified wavelengths in the CD spectra of a DMF solution of the 15mer observed upon cooling from 373 K to 248 K.

1H NMR experiments on aqueous solutions of the 15mer were performed, but no significant spectral changes were observed, even at temperatures up to 353 K (d 7 . At room temperature sharp signals corresponding to the folded 15mer were still observed ( We then attempted to unfold the 15mer. Temperature-dependentH NMR experiments on aqueous solutions of the 15mer were performed, but no significant spectral changes were observed, even at temperatures up to 353 K ( Figure S12 ). Acetone was added (which disrupts the process of formation of the 15mer; vide supra), but the NMR spectra remained essentially unchanged ( Figure S13 ), indicating that the 15mer, once formed, does not readily unfold. Unfolding was eventually achieved by dissolving the 15mer in DMF-. At room temperature sharp signals corresponding to the folded 15mer were still observed ( Figure S14 ), but these broadened significantly when the temperature was increased to 363 K, suggesting (at least partial) unfolding. At 373 K one set of relatively sharp signals was observed indicating a 15-fold symmetry of the system and, thus, complete unfolding of the macrocycle. The original spectrum was retained upon cooling the sample to room temperature. The unfolding–refolding process was further analyzed by variable-temperature CD spectroscopy in dimethylformamide (DMF) ( Figure S17 ). Reversible attenuation and enhancement of the CD signals were observed by heating and cooling the DMF solution ( Figure 3 c).