
ABSTRACT The entropy loss due to the formation of one or multiple loops in circular and linear DNA chains is calculated from a scaling approach in the limit of long chain segments. The analytical results allow us to obtain a fast estimate for the entropy loss for a given configuration. Numerical values obtained for some examples suggest that the entropy loss encountered in loop closure in typical genetic switches may become a relevant factor in comparison to both k^sub B^T and typical bond energies in biopolymers, which has to be overcome by the released bond energy between the looping contact sites.
INTRODUCTION
Gene expression in all organisms comprises the transcription of a certain gene on the DNA into messenger RNA through RNA polymerase starting from the promoter site, and its subsequent translation into a protein. The initiation of the transcription at a specific gene underlies a subtle cooperative scheme of transcription factors, which in turn is determined by a given set of boundary conditions such as the concentration of the transcription factors. Transcription factors often act cooperatively, and they are known to interact with each other over distances of several thousand basepairs (bp). This interaction is effected through DNA looping (Alberts et al., 1994; Blackwood and Kadonaga, 1998; Bolsover et al., 2001; Ptashne and Gunn, 2002; Revet et al., 1999; Snustad and Simmons, 2003), compare Fig. 1.
A typical example for DNA looping is found in the genetic switch which determines whether the replication of bacteriophage [lambda] in Escherichia coli follows either the lysogenic or the lytic pathway (Ptashne, 1992; Ptashne and Gunn, 2002; Snustad and Simmons, 2003). A key component of this [lambda]-switch is the [lambda]-repressor which activates the expression of a gene that encodes the production of the [lambda]-repressor itself. [lambda]-repressor can bind to the three operator sites O^sub R^ which overlap the two promoter sites of the switch. [lambda]-repressor binds cooperatively as a dimer, and typically under stable lysogenic conditions two such dimers on O^sub R^ form a tetramer, the next higher order of cooperativity, which is the main factor for the stability of the [lambda]-switch against noise (Aurell and Sneppen, 2002; Aurell et al., 2002; Metzler, 2001). However, [lambda]-repressor can also bind to the very similar operator O^sub L^, which is located roughly 2300 bp away and not part of the [lambda]-switch. It has been found that the two [lambda]-repressor tetramers at O^sub L^ and O^sub R^ synergistically form an octamer through DNA looping. This higher-ordered oligomerization enhances the performance of the switch considerably (Amouyal et al., 1998; Bell and Lewis, 2001; Bell et al., 2000; Blackwood and Kadonaga, 1998; Ptashne and Gunn, 2002; Revet et al., 1999; Semsey et al., 2002; Xu and Hoover, 2001). The specific binding along the tetramer-tetramer interface has recently been revealed through crystallographic structure determination (Bell and Lewis, 2001; Bell et al., 2000). Similar realizations of DNA looping also occur in linear DNA, naturally in the form of telomeres or in vitro in engineered DNA (De Bruin et al., 2001; Griffith et al., 1999; Zaman et al., 2002, compare Fig. 1). Multiple looping in large DNA molecules around a locus can be observed in vivo and can be induced in vitro by introducing of specific binding zones on the DNA, which leads to a considerable reduction of the gyration radius of the molecule such that it can be more easily transferred into, for example, mammalian cells (Montigny et al., 2001).
DNA looping often involves large loop sizes of several thousand bp. Therefore, the formation of these loops causes a non-negligible entropy loss which has to be overcome by the binding energy released at the bond formation on loop closure. In the present study, we quantify this entropy loss for such long DNA loops, taking into account self-avoiding effects due to both the monomer-monomer interaction within the loop and the additional effects due to the higher order contact points (vertices) at the loop closure site. The resulting numbers for typical systems suggest that the entropy loss is a relevant factor in the formation of DNA loops in comparison to the thermal energy and typical bond energies found in DNA nucleotides and other biopolymers, and it gives a lower bound for the bond-forming energy required to stabilize the loop.
Entropy loss due to loop formation was studied for the case of disconnected loops by Schellmann (1955), Flory (1956), and Semlyen (1997). In their seminal article, Poland and Scheraga (1965), and Wang and Uhlenbeck (1955), considered coupled Gaussian loops. To our knowledge the full effect of self-avoidance in the DNA looping network has not been considered before. Hereby, the contributions of non-trivial vertices turns out to be a relevant factor, and for multiple looping with a common locus actually become the dominating contribution. The analytical results presented here are derived from a scaling approach for general polymer networks and provide the advantage that, on their basis, estimates for the entropy loss in a given DNA system can be computed in a straightforward manner. It should also be noted that the additional vertex effects studied herein may be crucial in the analytical treatment of the DNA looping dynamics, as the higher-order self-interaction at such vertices poses an additional barrier in the loop closure process (Lee et al., 1976; Merlitz et al., 1998). Our results for long DNA with large loops complement the investigations of the bending and twisting energies in small DNA plasmids (Coleman et al., 2000; Tobias et al., 2000). In the case of intermediate-sized DNA segments, both approaches may be combined.
In what follows, we calculate the scaling results for the entropy loss on looping for the three different cases: 1), looping in a circular DNA; 2), looping in a linear DNA; and 3), multiple looping in a circular DNA. In the Appendix, the general expressions for calculating the system entropy of an arbitrary polymer network are compiled so that the entropy loss for different configurations can be calculated according to the general procedure developed below.
LOOPING IN A CIRCULAR DNA CHAIN
As stated before, we consider the limit in which each segment of the looped DNA, e.g., both subloops created in the circular DNA upon looping, are long in comparison to the persistence length l^sub p^ of the double-stranded DNA chain. (It can be assumed that this criterion is fulfilled if the segment contains more than ten persistence lengths.) In this long chain limit, we can neglect energetic effects due to bending or twisting, such that we treat the DNA as a flexible self-avoiding polymer. Therefore, we can employ results for the configuration number of a general polymer network, which we briefly review in the Appendix.
Before looping, the free energy of the circular DNA of total length L is given by
F^sub circ^ = H^sub 0^ - TS^sub circ^, (1)
where H^sub 0^ combines all binding enthalpies in the macromolecule and the entropy S^sub circ^ = K^sub B^ ln [omega]^sub circ^ is determined by the number of configurations (see De Gennes, 1979; see also the Appendix, this article)
[omega]^sub circ^ = A^sub circ^ [mu]^sup L^L^sup -3[nu]^, (2)
of a simply connected ring polymer of length L in units of the monomer length. The latter can be estimated by the persistence length l^sub p^ of the polymer (~500 [Angstrom] for double-stranded DNA corresponding to 100 bp; see Marko and Siggia, 1996). In Eq. 2, A^sub circ^ is a nonuniversal amplitude, [mu] is the support dependent connectivity constant, and [nu] [Asymptotically = to] 0.588 (Guida and Zinn-Justin, 1998) is the Flory exponent. Thus, S^sub circ^ has the form
S^sub circ^ = K^sub B^(ln A^sub circ^ + L ln [mu] - 3[nu] ln L). (3)
On looping, as sketched in Fig. 1 to the left, the circular DNA is divided into two subloops of lengths l and L - l by creation of a vertex at which four legs of the chain are bound together. For a self-avoiding chain, the number of configurations of the resulting figure-eight shape (Metzler et al., 2002a,b) is not simply the product of the configuration numbers of the two created loops, but has the more complicated form (Duplantier, 1986, 1989; Ohno and Binder, 1988; Schafer et al, 1992; see also the Appendix, this article),
In this expression, A^sub 8^ is a nonuniversal amplitude, Y^sub 8^ is a universal scaling function, and [sigma]^sub 4^ [Asymptotically = to] -0.48 is a universal exponent associated with the vertex with four outgoing legs. Note that in the Gaussian chain limit, the exponents [sigma]^sub N^ vanish; as we are going to show, the inclusion of the additional effects due to the higher order vertex formation reflected by nonzero values for [sigma]^sub N^ are non-negligible. Given the entropy S^sub 8^ = K^sub B^ ln [omega]^sub 8^ of the figure-eight configuration, the entropy loss suffered from creating this configuration out of the original circular DNA amounts to |S^sub 8^ - S^sub circ|. To proceed, we now evaluate the scaling function Y^sub 8^(x) in some special cases, and calculate typical numbers for the required entropy loss compensation. Two limiting cases can be distinguished.
Limiting case 1
If one of the loop sizes is much smaller than the other (l [much less than] L - l, say), the big loop of size L - l will essentially behave like a free circular chain so that its contribution to [omega]^sub 8^ will scale like a regular ring polymer, i.e., like (L - l)^sup -3[nu]^. Consequently, we find the behavior Y^sub 8^(x) = ax^sup -3[nu]+[sigma]4^ for x [much less than] 1, where a is a universal amplitude, and therefore (Hanke and Metzler, 2002; Metzler et al., 2002a,b),
[omega]^sub 8^ = A^sub 8^a[mu]^sup L^(L - l)^sup -3[nu]^ l^sup -3[nu]+[sigma]4^. (5)
In this case, the free energy difference between the initial circular and the looped states becomes
[Delta]F = [Delta]H^sub bond^ - T(S^sub 8^ - S^sub circ^), (6)
where [Delta]H^sub bond^ is the binding enthalpy at the loop closure site. The formation of the looping bond has to release a higher enthalpy than what is lost in entropy, i.e., [Delta]H^sub bond^ < T(S^sub 8^ - S^sub circ^). Collecting the different expressions, we thus find the condition
In this expression (and in similar expressions below), the first term in the square brackets is nonuniversal and depends on details of the model (for a Gaussian random walk on a cubic lattice, ln(A^sub 8^a/A^sub circ^) = ln(A^sub 8^Y^sub 8^(1)/A^sub circ^) = ln[3/(2[pi])] [Asymptotically = to] -0.74, is small compared with the other contributions in Eqs. 7 and 9; for the cases discussed below, we give the corresponding numbers in Table 1), whereas the remaining contributions are universal (apart from the fact that L is measured in units of the nonuniversal monomer length).
To get an estimate for the magnitude of the entropy loss, consider the case of the [lambda]-repressor loop in E. coli. With the size of the entire DNA of ~3.5 x 10^sup 3^ kbp and the looping branch of ~2.3 kbp, the two loops correspond to 3.5 x 10^sup 4^ and 23 monomers, respectively (each monomer corresponds to a persistence length l^sub p^ of 100 bp; see above). Neglecting the nonuniversal first term in brackets in Eq. 7, these numbers produce
[Delta]H^sub bond^ < - 7.0 k^sub B^T = - 17.5 kJ/mol = -4.2 kcal/mol;
(8)
here and in the following examples, we choose T = 300 K and make use of the gas constant, R = 8.31 JK^sup -1^ mol^sup -1^, and the conversion factor 1 cal = 4.2 J (Abramowitz and Stegun, 1972). Eq. 8 gives a considerable minimal value for the required bond energy between the two looping sites. For comparison, the typical free energy for basepair formation in DNA is 8 kcal/mol for AT pairs and 13 kcal/mol for GC pairs (Breslauer et al., 1986). Thus, even for the relatively small loop of 23 monomers, the required enthalpy release is non-negligible. Note that the relative contribution stemming from the [sigma]^sub 4^ term in Eq. 7 amounts to ~20% of the required enthalpy. In the Conclusion section, we compare the results obtained in the text with the corresponding ones in the Gaussian chain limit, demonstrating the significant increase of |[Delta]H^sub bond^| if self-avoidance is taken into consideration (see Table 1).
Limiting case 2
If the two created loops are of comparable size, i.e., x = l / (L - l) [approximate] 1, the corresponding value of the scaling function Y^sub 8^(x) is a finite number. For example, for l = L/2 one finds
In a modified DNA with two loops of 2.3 kbp each, one finds a bo nd enthalpy requirement of
[Delta]H^sub bond^ < - 5.8 k^sub B^T = - 14.5 kJ/mol = - 3.4 kcal/mol,
(10)
where we again neglect the nonuniversal first term in the square brackets. If both loops are of size 2 x 10^sup 3^ kbp each, the required bond enthalpy would increase to [Delta]H^sub bond^ < - 12.5 kcal/mol.
LOOPING IN A LINEAR DNA CHAIN
A linear chain of length L can assume
[omega]^sub lin^ = A^sub lin^ [mu]^sup L^L^sup [gamma]-1^ (11)
distinct configurations, where A^sub lin^ is a nonuniversal amplitude and [gamma] [Asymptotically = to] 1.16 is a universal exponent (Caracciolo et al., 1998; Guida and Zinn-Justin, 1998). If looping occurs and produces the A-shape in Fig. 1 to the right, the configuration number is modified to
where l is the size of the loop, and l^sub 1^ and l^sub 2^ are the sizes of the two loose end-segments, respectively.
We distinguish four different cases belonging to two groups: the configuration with l^sub 1^ [approximate] l^sub 2^, and the telomere configuration for which l^sub 1^ = 0 (or l^sub 2^ = 0).
Case 1
If l^sub 1^ = l^sub 2^, we find
where W^sub A^(x) = Y^sub A^(x,1). If furthermore l [much less than] L - l, an analogous reasoning as in the limiting case 1 of the previous section, leads to
MULTIPLE LOOPING IN A CIRCULAR DNA CHAIN
Assume that m potential connector points are distributed evenly along a circular DNA chain of total length L. If these condense to form a common locus, a number m of loops of equal size are created which are held together at this locus, as sketched in Fig. 2 (Montigny et al., 2001). This creates, in the scaling limit, a high-order vertex where 2m legs are joined. The procedure for the configuration number for this locus configuration yields
where the universal exponent [sigma]^sub 2m^ is associated with a vertex with 2m outgoing legs (see the Appendix). It should be noted that this result holds true only if the size of the locus is much smaller than the sizes of the created loops (Metzler et al., 2002a,b).
Due to the assumption that all m loops are of the same size, we immediately arrive at
The absolute value of [sigma]^sub 2m^ increases rapidly with increasing m, and can be determined from Pade or Pade-Borel analysis as shown in Schafer et al. (1992). We list the topological exponents up to order 8 in the Appendix. Taking a circular chain of 3500 kbp and m = 4, and neglecting the (nonuniversal) first term in the square brackets, we find that the entropy loss is fairly high (using [sigma]^sub 8^ = -2.4),
[Delta]H^sub bond^ < - 67.4 k^sub B^T = -168 kJ/mol = -40.1 kcal/mol.
(29)
In this case, the contribution due to the [sigma]^sub 8^ term is as large as 50% of the total entropy loss.
CONCLUSIONS
We have presented an analytical method to estimate the entropy loss in different scenarios of DNA looping in the limit of long segments. This approach takes explicitly the self-avoidance and interacting nature of the formed loops and other segments into account, and considers the additional effect of vertex formation, i.e., the effective interaction between different segments at the point where they are joined. This is possible via the scaling theory for arbitrary polymer networks derived by Duplantier. The obtained numbers do not vary much, due to the logarithmic dependence on the segment sizes. However, they are all non-negligible, and therefore have to be compensated by the released bond energy on formation of the DNA loop. We noted that the entropy loss is of the same order or close to the bond melting energy required for splitting an AT or GC bond, i.e., a considerable amount. Moreover, it is to be expected that the vertex effect increases the characteristic bond formation times in analytical approaches which are based on the free energy.
In Table 1 we summarize the obtained estimates for the minimum bond energy [Delta]H^sub bond^ in the present self-avoiding model, and compare them with the corresponding results in the Gaussian limit. The differences are significant, clearly showing the much higher entropy loss in the self-avoiding case. In Table 1 we also show estimates for the corresponding nonuniversal contributions, which we neglected in the explicit calculation of [Delta]H^sub bond^.
Our calculations are valid in the long chain limit. In units of the monomer size of a typical DNA double-helix persistence length ...^^^sub p^ ~ 100 bp, a minimum number of at least 10 monomers is expected to be required to consider a segment in the final structure flexible. For shorter segments, additional effects due to bending and twisting energy are expected to become relevant. As the mentioned examples document, there are numerous systems, both in vivo and in vitro, in which the flexibility condition is easily fulfilled, and in which our estimation method for the entropy loss becomes fully applicable. The persistence length of single-stranded DNA and RNA is much shorter, typically taken to be of the order ...^sub p^ ~ 8 bases. Thus, in single strand looping experiments the expected entropy loss will be considerably larger.
[Sidebar]
Biophysical Journal Volume 85 July 2003 97-107
[Reference]
REFERENCES
Abramowitz, M., and I. Stegun. 1972. Handbook of Mathematical Functions. Dover, New York.
Alberts, B., K. Roberts, D. Bray, J. Lewis, M. Raff, and J. D. Watson. 1994. The molecular biology of the cell. Garland, New York.
Amouyal, M., N. Perez, and S. Rolland. 1998. Adjacent cooperation of proteins on DNA are not representative of long-distance interactions. C. R. Acad. Sci. Paris. 321:877-881.
Aurell, E., and K. Sneppen. 2002. Epigenetics as a first exit problem. Phys. Rev. Lett. 88:048101.
Aurell, E., S. Brown, J. Johanson, and K. Sneppen. 2002. Stability puzzles in phage lambda. Phys. Rev. E. 65:051914.
Bell, C. E., and M. Lewis. 2001. Crystal structure of the lambda repressor C-terminal domain octamer. J. Mol. Biol. 314:1127-1136.
Bell, C. E., P. Frescura, A. Hochschild, and M. Lewis. 2000. Crystal structure of the lambda repressor C-terminal domain provides a model for cooperative operator binding. Cell. 101:801-811.
Blackwood, E. M., and J. T. Kadonaga. 1998. Going the distance: a current view of enhancer action. Science. 281:60-63.
Bolsover, S. R., J. S. Hyams, S. Jones, E. A. Shephard, and H. A. White. 2001. From Genes to Cells. Wiley, New York.
Breslauer, K. J., R. Frank, H. Blocker, and L. A. Marky. 1986. Predicting DNA duplex stability from the base sequence. Proc. Natl. Acad. Sci. USA. 83:3746-3750.
Caracciolo, S., M. S. Causo, and A. Pelissetto. 1998. High-precision determination of the critical exponent gamma for self-avoiding walks. Phys. Rev. E. 57:R1215-R1218.
Coleman, B. D., D. Swigon, and I. Tobias. 2000. Elastic stability of DNA configurations. II. Supercoiled plasmids with self-contact. Phys. Rev. E. 61:759-770.
De Bruin, D., Z. Zaman, R. A. Liberatore, and M. Ptashne. 2001. Telomere looping permits gene activation by a downstream UAS in yeast. Nature. 409:109-113.
De Gennes, P.-G. 1979. Scaling concepts in polymer physics. Cornell University Press, Ithaca, New York.
Duplantier, B. 1986. Polymer network of fixed topology: renormalization, exact critical exponent gamma in two dimensions, and d = 4-epsilon. Phys. Rev. Lett. 57:941-945.
Duplantier, B. 1989. Statistical mechanics of polymer networks of any topology. J. Stat. Phys. 54:581-680.
Flory, P. J. 1956. Theory of elastic mechanisms in fibrous proteins. J. Am. Chem. Soc. 78:5222-5235.
Griffith, J. D., L. Comeau, S. Rosenfield, R. M. Stansel, A. Bianchi, H. Moss, and T. de Lange. 1999. Mammalian telomeres end in a large duplex loop. Cell. 97:503-514.
Guida, R., and J. Zinn-Justin. 1998. Critical exponents of the N-vector model. J. Phys. A. 31:8103-8121.
Hanke, A., and R. Metzler. 2002. Towards the molecular workshop: entropy-driven designer molecules, entropy activation, and nanomechanical devices. Chem. Phys. Lett. 359:22-26.
Lee, C. S., R. E. Pyeritz, and C. A. Thomas, Jr. 1976. Kinetic studies on cyclization of T7 and T1 DNAs. Biopolymers. 15:671-686.
Marko, J. F., and E. D. Siggia. 1996. Stretching DNA. Macromolecules. 28:8759-8770.
Merlitz, H., K. Rippe, K. V. Klenin, and J. Langowski. 1998. Looping dynamics of linear DNA molecules and the effect of DNA curvature: a study by Brownian dynamics simulation. Biophys. J. 74:773-779.
Metzler, R. 2001. The future is noisy: on the role of spatial fluctuations and the merits of cooperativity in genetic switching. Phys. Rev. Lett. 87:068103.
Metzler, R., A. Hanke, P. G. Dommersnes, Y. Kantor, and M. Kardar. 2002a. Equilibrium shapes of flat knots. Phys. Rev. Lett. 88:188101.
Metzler, R., A. Hanke, P. G. Dommersnes, Y. Kantor, and M. Kardar. 2002b. Tightness of slip-linked polymer chains. Phys. Rev. E. 65:061103.
Montigny, W. J., C. R. Houchens, S. Illenye, J. Gilbert, E. Coonrod, Y.-C. Chang, and N. H. Heintz. 2001. Condensation by DNA looping facilitates transfer of large DNA molecules into mammalian cells. Nucleic Acids Res. 29:1982-1988.
Ohno, K., and K. Binder. 1988. Scaling theory of star polymers and general polymer networks in bulk and semi-infinite good solvents. J. Phys. (Paris). 49:1329-1351.
Poland, D. C., and H. A. Scheraga. 1965. Statistical mechanics of non-covalent bonds in polyamino acids. VIII. Covalent loops in protein. Biopolymers. 3:379-399.
Ptashne, M., and A. Gunn. 2002. Genes and signals. Cold Spring Harbor Laboratory Press, Cold Spring Harbor.
Ptashne, M. 1992. A Genetic Switch. Cell Press and Blackwell Science, Cambridge, Massachusetts.
Reevet, B., B. van Wilcken-Bergmann, H. Bessert, A. Barker, and B. Muller-Hill. 1999. Four dimers of lambda repressor bound to two suitably spaced pairs of lambda operators form octamers and DNA loops over large distances. Curr. Biol. 9:151-154.
Schafer, L., C. van Ferber, U. Lehr, and B. Duplantier. 1992. Renormalization of polymer networks and stars. Nucl. Phys. B. 374:473-495.
Schellmann, J. A. editor. 1955. The stability of hydrogen-bonded peptide structures in aqueous solution. C. R. Trav. Lab. Carlsberg Ser. Chim. 29:230-259.
Semlyen, J. A. 1997. Large Ring Molecules. John Wiley and Sons, Chichester, West Sussex, England.
Semsey, S., M. Geanacopoulos, D. E. A. Lewis, and S. Adhya. 2002. Operator-bound Ga1R dimers close DNA loops by direct interaction: tetramerization and inducer binding. EMBO J. 21:4349-4356.
Snustad, D. P., and M. J. Simmons. 2003. Principles of Genetics. John Wiley and Sons, New York.
Tobias, I., D. Swigon, and B. D. Coleman. 2000. Elastic stability of DNA configurations. I. General theory. Phys. Rev. E. 61:747-758.
Wang, M. C., and G. E. Uhlenbeck. 1955. On the theory of the Brownian motion II. Rev. Mod. Phys. 17:323-342.
Xu, H., and T. R. Hoover. 2001. Transcriptional regulation at a distance in bacteria. Curr. Op. Microbiol. 4:138-144.
Zaman, Z., C. Heid, and M. Ptashne. 2002. Telomere looping permits repression 'at a distance' in yeast. Curr. Biol. 12:930-933.
[Author Affiliation]
Andreas Hanke*[dagger] and Ralf Metzler[double dagger][sec]
*Institute for Theoretical Physics, University of Stuttgart, Stuttgart, Germany; [dagger]Department of Physics-Theoretical Physics, University of Oxford, Oxford, United Kingdom; [double dagger]NORDITA-Nordic Institute for Theoretical Physics, Copenhagen, Denmark; and [sec]Department of Physics, Massachusetts Institute of Technology, Cambridge, Massachusetts USA
[Author Affiliation]
Submitted November 25, 2002, and accepted for publication March 12, 2003.
Address reprint requests to Ralf Metzler, E-mail: metz@nordita.dk.
(C) 2003 by the Biophysical Society
0006-3495/03/07/167/07 $2.00