CHAPTER 5: BINDING

We have studied macromolecule structure. Now it is time to impart function to these molecules. It is simple to imagine that before these molecules can perform a function, they must interact with specific molecule(s) or ligand(s) in their environment. In fact, binding and subsequent release of a ligand might be the sole function of the macromolecule (example myoglobin binding oxygen). Binding is the first step necessary for a biological response (with the exception of visual transduction in which photo-induced isomerization of rhodopsin initiates the response). To understanding binding, we must consider the equilbria involved, how binding is affected by ligand and macromolecule concentration, and how to experimentally analyze and interpret binding data and binding curves.

DERIVATIONS OF BINDING EQUATIONS AND GRAPHS

These derivations can be made and interpreted using simple principles from General Chemistry, which you reviewed and strengthened in Analytical Chemistry, with some slight differences. Biochemists rarely talk about equilibrium or association constants, but rather their reciprocals - the dissociation constants, Kd. For the reactions M + L <====> ML, where M is free macromolecule, L is free ligand, and ML is macromolecule-ligand complex (which is held together by intermolecular forces, not covalent forces), the Kd is given by
[M]_eq[L]_eq/[ML]_eq.

Figure: M is free macromolecule, L is free ligand, and ML is macromolecule-ligand complex

Notice the unit of Kd is molarity, M. The lower the Kd (i.e. the higher the [ML] at any given M and L, the tighter the binding. The higher the Kd, the looser the binding. Kd's for biological molecules are finely tuned to their environments. They vary from about 1 mM (weak interactions) for some enzyme-substrate complex, to pM - fM levels. Example of very tight, non-covalent interactions are for the avidin (an egg protein)-biotin (a vitamin) complex and thrombin (enzyme initiating clotting)-hirudin (a leech salivary protein) complex.

M = macromolecule; L = ligand

For a simple equilibrium M + L <--> ML

where M = free macromolecule, L = free ligand, and ML = bound M and L (a complex)

3 equations can be written:

Equation 1 - Dissociation constant: K_d = ([M]eq[L]eq)/[ML]eq = ([M][L])/[ML] (units of molarity)

Equation 2 - Mass Balance of M: M_o = M + ML

Equation 3 - Mass Balance of L: L_o = L + ML

We would like to derive equations which give ML as a function of known or measurable values. The Kd equations shows that ML depends on free M and free L. From Equations 1-3, two different and equally valid equations can be derived for two different cases.

Case 1: used either when you can readily measure free L or when experimental conditions are such the L_o >> M_o, which is often encountered. Under these latter conditions, free L = L_o, which you know without measuring it, simply by knowing how much total ligand was added to the system.
Case 2 (more general): used when you don't know free L or haven't measured it, and you just wish to calculate how much ML is present at equilibrium. These conditions imply that L_o is not >> M_o. (If L_o >> Mo, we would know free L = L_o.)

EXPERIMENTAL CASE 1: USE THIS FORM OF THE EQUATION WHEN L IS MEASURABLE OR WHEN L_o >> M_o(i.e. L= L_o)

Equation 4 - Substitute 2 into 1: K_d = ([M][L])/[ML] = [Mo-ML][L])/[ML]

(ML)K_d = (M_o)L - (ML)L

(ML)K_d + (ML)L = (M_o)L

(ML)(Kd+L) = (M_o)L

Equation 5: ML = MoL/(Kd + L)

This equation is ALWAYS TRUE for the chemical equation written above. L is the free ligand concentration at equilibrium. If L_o >> M_o, then the equations simplifies to:

Equation 6: ML = MoLo/(Kd + L), whose graph you should draw below.

Dividing Equation 5 by M_o gives the fractional saturation of the macromolecule M, where

Equation 7: Y = q = [ML]/Mo = L/(Kd + L)

where Y can vary from 0 (when L = 0) to 1 (when L >> Kd)

Draw a graph of Y vs L or L_{o
GRAPH IS A HYPERBOLA}

Equations 5/6 can be understood best by examining three cases:

ML = MoL/(Kd + L)

Case 1: L = 0

ML = 0

Case 2: L = K_d

ML = MoL/(L + L)= MoL/2L = Mo/2

which indicates that M is half saturated. In fact the operational definition of K_dis the ligand concentration at which the M is half saturated.

Case 3: L >> K_d

ML = M_o

EXPERIMENTAL CASE 2 (more general): USE THIS FORM OF THE EQUATION WHEN FREE L IS NOT KNOWN (such as when L_o is not >> M_o)OR YOU WISH TO CALCULATE ML FROM JUST L_o, M_o AND K_D

Equation 8 - Substitute 2 AND 3 into 1: K_d = ([M][L])/[ML] = [Mo-ML][Lo-ML]/[ML]

(ML)K_d = (M_o- ML)(L_o- ML)

(ML)K_d = (M_o)(L_o) - (ML)(L_o) - (ML)(M_o) + (ML)² or

Equation 9: (ML)² - (L_o + M_o +K_d)(ML) + (M_o)(L_o) = 0, which is of the form

ax² + bx + c = 0, where

a = 1

b = - (L_o + M_o +K_d)

c = (M_o)(L_o)

which are all constants, and

x = {-b +/- (b² - 4ac)^1/2}/2a or

Equation 10: ML = {-(Lo+Mo+Kd) +/- ((Lo+Mo+Kd)² - 4MoLo)^1/2}/2

A graph of ML calculated from this formula vs free L (or Lo if L_o >> M_o) give a

A HYPERBOLA

In the derivations, we came up with two equations for ML:

one (Equation 5) using mass conservation on M, which gave: ML = MoL/[Kd +L]
one (Equation 10) using mass conservation on M and L, which gave ML = quadratic equation as function of Mo, Lo, and Kd: ML = {-(Lo+Mo+Kd) +/- ((Lo+Mo+Kd)² - 4MoLo)^1/2}/2

Both equations are valid. In the first you must known free L which is often Lo if Mo << Lo. In the second, you don't need to know free M or L at all. At a given Lo and Mo, you can calculate ML, which should be the same ML you get from the first equation if you know free L.

Equations 5 and 10 are useful in several circumstances. They can be used to

calculate the concentration of ML if Kd, M_o, and L (for equation 5) or if Kd, M_o, and L_o (for equation 10) are known. This is analogous to the use of the Henderson-Hasselbach equation to calculate the protonation state (HA) and hence charge state of an acid at various pH values. In the former case we are measuring the concentration of bound ligand (ML) and in the later case, the concentration of bound protons (HA).
calculate Kd if ML, Kd, M_o, and L (for equation 5) or if ML, M_o, and L_o (for equation 10) are known. Techniques to extract the Kd from binding data will be discussed in the next chapter section.

INTERPRETATION OF BINDING EQUATIONS AND GRAPHS

It is important to get a mathematical understanding of the binding equations and graphs. It is equally important to get an intuitive understanding of their properties. Just as we used the +/- 2 rule in determining at a glance the charge state of an acid, you need to be able to determine the extent of binding (how much of M is bound with L) given their relative concentrations and the Kd. The usual situation is that the [M_o] is << [L_o]. What happens to the binding curves for M + L <===> ML if the Kd gets progressively lower? Intuitively, you should expect that binding will increase, especially as L gets greater. The curves below should help you develop the intuition you need with respect to binding equilibria.

Fig: ML vs L at Varying Kd's

Fig: ML vs L at Even Lower Kd's

Fig: ML vs L at a Very Low Kd!

Fig: Comparison of Covalent Binding of Protons vs Noncovalent Binding of Ligand

HOW STRONG IS BINDING - THE CONTINUUM

Binding affinities give us a way to measure the relative strength of binding between two substances. But how "tight" is tight binding? Weak binding? Let us exam that issue by considering a binding continuum. Consider two substances, A and B that might interact. Over what range of strengths can they actually bind to each other? It would helpful to set up the extremes of the binding continuum. At one end is no binding at all. At the other end, consider two things that bind covalently. We have discussed how Kd reflects binding strength. Remember, Kd = 1/Keq. Also, we know that Keq is related to DG^o, by the equations: DG ^o = - R T ln Keq, or RT ln Kd. Given these simple equations, you should be able to interconvert between Keq, Kd, and DG ^o. (Keep your units straight.).

NO INTERACTION: One end of the binding continuum represents no interaction. Let's assume that Keq is tiny (Kd large), for example Keq ~ 2.4 10^-72. Plugging this into the equation DG ^o = - R T ln Keq, where R = 2.00 cal/mol^.K, and T is about 300K, the DG^o~ +100 kcal/mol. That is, if we add A + B, there is no drive to form AB. If AB did form, then it would immediately fall apart.

COVALENT INTERACTION: At the other end of the continuum consider the interaction of 1H atom with another to form H₂. From a general chemistry book we can get DG^o_form. Using General Chem. thermodynamics, we can calculate DG^o for H-H formation. (DG^o = SG^o_{form
prod.} - SG^o_{form react.}⁾ Doing this gives a value of -97 kcal/mol.

INTERMEDIATE REVERSIBLE INTERACTIONS: Consider the interaction of a protein, the lambda repressor (R), with a small oligonucleotide to which it binds tightly (called the operator DNA, O). This is an example of a biologically tight, but reversible interaction. R can bind to many short oligonucleotides due to electrostatic interactions and H bonds from the positively charged protein to the negatively charge nucleic acid backbone. The tight binding interaction, however, involves oligonucleotides of specific base sequence. Hence we can distinguish between tight binding, which usually involves specific DNA sequence and weak binding which involves nonspecific sequences. Likewise, we will speak of specific and nonspecific binding. R and O, which bind with a Kd of 1 pM, is an example of specific binding, while R and nonspecific DNA (D), which bind mostly through electrostatic interactions with a Kd of 1 mM, is an example of nonspecific binding.

RATE CONSTANTS FOR ASSOCIATION AND DISSOCIATION: When the reaction
M + L <-===> ML is at equilibrium, the rate of the forward reaction is equal to the rate of the reverse reaction. From General Chemistry, the forward reaction is biomolecular and second order. Hence the v_f, the rate in the forward direction is proportional to [M][L], or
v_f = k_f [M][L], where k_f is the rate constant in the forward direction. The rate of the reverse reaction, v_r is first order, proportional to [ML], and is given by v_r = k_r [ML], where k_r is the rate constant for the reverse reaction. Notice that the units of k_f are M^-1s^-1, while units of k_r are s^-1. At equilibrium, v_f = v_r, or k_f [M][L] = k_r [ML]. Rearranging the equation gives [ML]/[M][L]= k_f/ k_r= Keq. Hence Keq is given by the ratio of rate constants. For tight binding interactions, Keq >> 1, Kd << 1, and k_f is very large (in the order of 10^8-9) and k_r must be very small (10^-2- 10^-4 s^-1).

To get a more intuitive understanding of Kd's, it is often easier to think about the rate constants which contribute to binding and dissociation. Let us assume that k_r is the rate constant which describes the dissociation reaction. It is often times called k_off. It can be shown mathematically that the rate at which two simple object associate depends on their radius and effective molecular weight. The maximal rate at which they will associate is the maximal rate at which diffusion will lead them together. Let us assume that the rate at which M and L associate is diffusion limited. The theoretical k_on is about 10⁸M^-1s^-1. Knowing this, the Kd and the fact that k_on/ k_off= Keq = 1/Kd, we can calculate k_off, which remember is a first order rate constant._.

We can also determine k _off experimentally. Imagine the following example. Adjust the concentrations of M and L such that M_o << L_o and L_o>> Kd. Under these conditions of ligand excess, M is entirely in the bound from, ML. Now at t = 0, dilute the solution so that L_o << Kd. The only process that will occur here is dissociation, since negligible association can occur given the new condition. If you can measure the biological activity of ML, then you could measure the rate of disappearance of ML with time, and get k_off. Alternatively, if you could measure the biological activity of M, the rate at which activity returns will give you k_off.

Now you will remember from General Chemistry that from a first order rate constant, the half-life of the reaction can be calculated by the expression: k = 0.693/t_1/2. Hence given k_off, you can determine the t_1/2 for the associated species existence. That is, how long will a complex of ML last before it dissociates? Given DG^o or Kd, and assuming a k_on(10⁸), you should be able to calculate k_off and t_1/2.Or, you could be able to determine k_off experimentally, and then calculate t_1/2. Applying these principles, you can calculate the parameters below.

Calculated k_off and t_1/2for binary complexes assuming diffusion-controlled k_on

Complex	K_D(M)	k_off (s-1)	t _½
H₂	1 x 10^-71	1 x 10^-63	2 x 10⁵⁵ yr
R_tV_{3 :}R_t^'L₃^(a)	10^-17	1 x 10^-9	2 yr
Avidin:biotin	10^-15	1 x 10^-7	80 days
thrombin:hirudin^(b)	5 x10^-14	5 x 10^-6	2 days
lac_rep:DNA_oper^(c)	1 x 10^-13	1 x 10^-5	0.8 days
Zif268:DNA^(d)	10^-11	1 x 10^-3	700 s
GroEL:r-lactalbumin^(e)	10^-9	0.1	7 s
TBP:TATA^(f)	2 x 10^-9	2 x 10^-1	3 s
TBP:TBP	4 x 10^-9	4 x 10^-1	2 s
LDH (pig): NADH^(g)	7.1 x 10^-7(j)	7.1 x 10¹	10 ms
profilin: CaATP-G-actin	1.2 x 10^-6	1.2 x 10²	6 ms
TBP: DNA_nonspec^(h)	5 x 10^-6	5 x 10²	1 ms
TCR⁽ⁱ⁾: cyto C peptide	7X10^-5	7X10³	100 ms
lac_rep:DNA_nonspec^(h)	1 x 10^-4	1 X10⁴	70 ms
uridine-3P: RNase	1.4 x 10^-4(j)	1.4X10^-4	50 ms
Creatine Kinase: ADP	8.2 x 10^-4 (j)	8.2X10⁴	10 ms
Acetylcholine:Esterase	1.2 x 10^-3	1.2 x 10⁵	6 ms
no interaction	4 x 10⁷³	4 x 10⁸¹	-

Trivalent Vancomycin derivative RtV3 + Trivalent D-Ala-D-Ala deriv, Rt'L3'
Hirudin is a potent thrombin inhibitor from leach saliva
lac _rep is the E. Coli lac operon repressor protein, and DNA_operis the specific DNA binding region in the E. Coli genome that binds to the repressor
Zif268 is a mouse zinc-finger binding protein
GroEL is a chaperone protein; r-lactalbumin is the reduced form of lactalbumin
TBP is the TATA Binding Protein which binds to the TATA box consensus sequence
LDH is lactate dehydrogenase
DNA_nonspec is DNA which does not contain the specific DNA sequence region involved in specific
binding to a DNA binding protein
TCR is the T-cell receptor
calculated from equation: K_D = k_off/k_on.

What is usually measured is Kd and/or k_off (if the k_offis reasonable). This analysis is very simplified. Electrostatic forces and other orientation factors may significantly change k_on, while conformational changes in the complex may prevent ready unbinding of the bound ligand, dramatically altering k_off.

EXPERIMENTAL DETERMINATION OF Kd

It is often important to determine the Kd for a ML complex, since given that number and the concentrations of M and L in the system, we can predict if M is bound or not under physiological conditions. Again, this is important since whether M is bound or free will govern its activity. The trick in determining Kd is to determine ML and L at equilibrium. How can we differentiate free from bound ligand? The following techniques allow such a differentiation.

TECHNIQUES THAT REQUIRE SEPARATION OF BOUND FROM FREE LIGAND - Care must be given to ensure that the equilibrium of M + L <==> ML is not shifted during the separation technique.

gel filration chromatography - Add M to a given concentration of L. Then chromatograph the mixture on gel filtration column eluting with the free ligand at the same concentration. The ML complex will elute first and can be quantitated . If you measure the free ligand coming off the column, it will be constant after the ML elutes with the exception of a single dip near where the free L would elute if the column was eluted without free L in the buffer solution. This dip represents the amount of ligand bound by M.
membrane filtration - Add M to radiolableled L, equilibrate, and then filter through a filter which binds M and ML. For instance, a nitrocellulose membrane binds proteins irreversibly. Determine the amount of radiolabeled L on the membrane which equals [ML].
precipitation - Add a precipitating agent like ammonium sulfate, which precipitates proteins and hence both M and ML. Determine the amount of ML.

TECHNIQUES THAT DO NOT REQUIRE SEPARATION OF BOUND FROM FREE LIGAND

equilibirum dialysis - Place M in a dialysis bag and dialyse against a solution containing a ligand whose concentration can be determined using radioisotopic or spectroscopic techniques. At equilibrium, determine free L by sampling the solution surrounding the bag. By mass balance, determine the amount of bound ligand, which for a 1:1 stoichiometry gives ML. Repeat at many different ligand concentrations
.
spectroscopy - Find a ligand whose absorbance or fluorescence spectra changes when bound to M. Alternatively, monitor a group on M whose absorbance or fluorescence spectra changes when bound to L.

Surface Plasmon Resonance

A newer technique to measure binding is called surface plasmon resonance (SPR) using a sensor chip consisting of a 50 nm layer of gold on a glass surface. A carbohydrate matrix is then added to the gold surface. To the CHO matrix is attached through covalent chemistry a macromolecle which contains a binding site of a ligand. The binding site on the macromolecule must not be perturbed to any significant extent. A liquid containing the ligand is flowed over the binding surface.

The detection system consists of a light beam that passes through a prism on top of the glass layer. The light is totally reflected but another component of the wave called an evanescent wave, passes into the gold layer, where it can excite the Au electrons. If the correct wavelength and angle is chosen, a resonant wave of excited electrons (plasmon resonance) is produced at the gold surface, decreasing the total intensity of the reflected wave. The angle of the SPR is sensitive to the layers attached to the gold. Binding and dissociation of ligand is sufficient to change the SPR angle, as seen in the figure below. (Science, 295, pg 2103 (2002))

Fig: Surface Plasmon Resonance

Biachore: Surface Plasmon Resonance | Animation of Technique

Amazing Binding Affinity of Metals to Protein: An Example

An incredibly tight binding interaction has recently been reported for the binding of Cu¹⁺ to the CueR protein from E. Coli. Cu¹⁺ ions are usually kept to a very low concentration in cells as a mechanism to prevent toxicity. Yet some enzymes require Cu. Free copper ions must be present in the cell to allow binding to appropriate sites in proteins. How are these competing concerns regulated in the cell? The total Cu concentration in E. Coli is about 10 mM, which, given the small size of the bacterium, represents about 10,000 copper ions per cell.

Cells have evolved many mechanisms to control and deliver Cu ions. Copper ions can be delivered to target proteins by copper chaperones (analogs of the chaperone proteins which guide protein folding). CueR in E. Coli appears to regulate the copper-induced expression of genes involved in copper biochemistry (including an enzyme that oxidizes Cu¹⁺ to Cu²⁺ which is less toxic). One particular gene that is upregulated is copA. CueR increases transcription of copA in the presence of Cu, Ag, and Au (coinage metal) ions. Changela et al. developed an in vitro assay which determined the extent of expression of CueR regulated genes, under a variety of ion types and concentrations. In the assay, purified CueR was added to a gene construct containing the promoter for copA. Initially they found that transcription was always on even in the presence of a ligand, glutathione, which binds Cu¹⁺ avidly and should keep free Cu¹⁺ levels very low. They switched to an even tighter binding Cu¹⁺ coordinator, cyanide (CN^-), to reduce the free Cu¹⁺ levels to even lower levels. Extremely high levels of CN^- (millimolar) stopped transcriptional activation, but if excess Cu¹⁺ was added, activation ensued, suggesting that copper binding to the protein was reversible. At 1 mM CN^-, transcription increased with addition of copper ions up to a TOTAL Cu¹⁺ concentration of 60 mm. Under these condition, the free Cu¹⁺ concentrations were much less. Half-maximal activation occurred at a TOTAL Cu¹⁺ concentration of 0.7 mM. Similar activation was observed by Ag¹⁺ and Au¹⁺, but not by Zn and Hg ions, showing the specificity for monovalent cations over divalent cations.

Knowing the pKa of HCN, stability constants for Cu¹⁺:CN^- complexes, and CN- concentrations, Changela et al produced a series of solutions buffered in FREE Cu¹⁺ that extended from 10^-18 to 10^-23 M (pH 8.0). (For example, the log of the binding constant b, logb, for the Cu¹⁺ + 2CN^- <==> [Cu(CN)₂]^-is 21.7. You solved problems such as this involving linked equilibrium if you have taken analytical chemistry.) The free Cu1⁺ concentration at half-maximal activation of gene reporter transcription, a measure of the dissociation constant, Kd, was approximately 1 x 10^-21 M (zeptomolar)! Now assume that the volume of the contents of an E. Coli cell is 1.5 x 10^-15 L. If there were only one ion of Cu¹⁺ in the cell, it would have a concentration of 10^-9 M. The values suggest that there is no free Cu¹⁺ ions in the cell, and that only 1 Cu⁺¹ ion in the cell is enough to ensure its binding to CueR and subsequent transcriptional activation of copA.

Molecular Basis of High Affinity (Low Kd) Binding Interactions

What differentiates how and low affinity binding at the molecular level? Do high affinity interactions have lots of intramolecular H-bonds, salt bridges, van der Waals interactions, or are hydrophobic interactions most important? Recently, the crystal structures of a variety of antibody-protein complexes were determined in order to study the basis of affinity maturation of antibody molecules. It is well know that antibodies elicited on exposure to a foreign molecule (antigen) are initially of lower affinity than antibodies released later in the immune response. An incredible number of different antibodies can be made by antibody-producing B cells due to genetic mechanisms (combining different variable regions of antibody genes through splicing, imprecise splicing, and hypermutation of critical nucleotides in the genes of antigen binding regions of antibodies). Clones of antibody-producing cells with higher affinity are selected through binding and clonal expansion of these cells. Investigators studied the crystal structure of 4 different antibodies which bound to the same site (epitope) on the protein antigen lysozyme. Increased affinity was correlated with increased buried apolar surface area and not with increased numbers of H bonds or salt bridges.

Table: Characteristics of Antibody:Hen Egg Lysozyme Complexes(HEL)

Antibody	H26-HEL	H63-HEL	H10-HEL	H8-HEL
K_d (nM)	7.14	3.60	0.313	0.200
Intermolecular Interactions
H bonds	24	25	20	23
VDW contacts	159	144	134	153
salt bridges	1	1	1	1
Buried Surface Area
DA_SURF(A²)	1,812	1,825	1,824	1,872
DA_SURF-polar(A²)	1,149	1,101	1,075	1,052
DA_SURF-apolar(A²)	663	724	749	820

Li,Y. et al. Nature: Structural Biology. 6, pg 484 (2003)

Online Lit: Antibodies with Infinite Affinity. Chmura et al. PNAS. 98, pg 8480 (1998)

Docking

The quantitative methods described above do not elucidate the mechanism of binding. Computer programs have been developed that allow the docking of a ligand (small molecule or even another protein) to another protein. The automatic docking of flexible ligands to proteins can be modeled using Autodock. AutoDock contains the following programs:

AutoDock performs the docking of the ligand to a set of grids describing the target protein;
AutoGrid pre-calculates these grids;
AutoTors sets up which bonds will treated as rotatable in the ligand.

Autodock 4 is set to be released in mid 2003, hopefully in time to incorporate into class projects.

Molecular Docking Web

The Crowded Cell

Most binding studies are performed in vitro with dilution concentrations of both macromolecule and ligand. Are these conditions illustrious of conditions inside a cell? The answer is no! Cells are very crowded with organelles, macromolecular complexes, and cytoskeletal components which provide internal architecture to the cell, etc. Total macromolecule concentration in the cell has been estimated to be as high as 400 g/1L = 400 g/1000 mL = 0.4 g/mL = 400 mg/mL. Try to dissolve a water soluble protein like albumin to those concentrations! From 5 to 40% of the entire cellular volume is occupied with large molecules, and at the upper range, very little space exists for other large macromolecules.