Cation-π interactions in structural biology
Abstract
Cation-π interactions in protein structures are identified and evaluated by using an energy-based criterion for selecting significant sidechain pairs. Cation-π interactions are found to be common among structures in the Protein Data Bank, and it is clearly demonstrated that, when a cationic sidechain (Lys or Arg) is near an aromatic sidechain (Phe, Tyr, or Trp), the geometry is biased toward one that would experience a favorable cation-π interaction. The sidechain of Arg is more likely than that of Lys to be in a cation-π interaction. Among the aromatics, a strong bias toward Trp is clear, such that over one-fourth of all tryptophans in the data bank experience an energetically significant cation-π interaction.
The three-dimensional structure of a protein is determined by a delicate balance of weak interactions. Hydrogen bonds, salt bridges, and the hydrophobic effect all play roles in folding a protein and establishing its final structure. In addition, the cation-π interaction (1–3) is increasingly recognized as an important noncovalent binding interaction relevant to structural biology. Theoretical and experimental studies have shown that cation-π interactions can be quite strong, both in the gas phase and in aqueous media. A number of studies have established a role for cation-π interactions in biological recognition, especially in the binding of acetylcholine (4, 5). Here we present a detailed analysis of the extent and nature of cation-π interactions that are intrinsic to a protein’s structure and likely contribute to protein stability. We find that energetically significant cation-π interactions are common in proteins—a “typical” protein will contain several. We also have documented some significant preferences for certain amino acid pairs as partners in a cation-π interaction.
Important early work indicated a role for cation-π interactions in protein structures. Following work by Levitt and Perutz (6–8) suggesting a hydrogen bond between aromatic and amino groups, Burley and Petsko identified the “amino aromatic” interaction (9), in which NH-containing groups tend to be positioned near aromatic rings within proteins. It is now appreciated that the interaction of a cationic group with an aromatic—a cation-π interaction—is much more favorable than an analogous interaction involving a neutral amine (10, 11). Important subsequent studies by Thornton (12–17) modified the Burley and Petsko analysis, especially with regard to the amino-aromatic “hydrogen bond.” In addition, explicit studies of Arg interacting with aromatic residues have been reported by Flocco and Mowbray (18) and by Thornton (14), and other efforts to search the Protein Data Bank (PDB) for cation-π interactions between ligands and proteins have been reported (19, 20).
Previous protein database searches relied on geometric definitions of sidechain interactions, focusing on when a cationic sidechain displayed a certain distance/angle relationship to an aromatic sidechain. The different geometries of Lys vs. Arg and Trp vs. Phe/Tyr can make such comparisons problematical. In addition, not all cation-aromatic contacts represent energetically favorable cation-π interactions. Unlike ion pairs, for which any close contact will be energetically favorable, a cation interaction with an aromatic can be attractive or repulsive. The electrostatic potential surfaces of the aromatics, which control such distinctions (1), can be complex, and it is difficult to clearly distinguish attractive from repulsive cation-aromatic contacts using geometric criteria alone. To circumvent this problem, and to put the diverse array of potential cation-π interactions on a more nearly equal footing, we have chosen to use energy-based, rather than geometry-based, criteria in this study. Our goals in this study are twofold. First, we wish to develop meaningful statistics for cation-π interactions for structures within the PDB (21). Second, we wish to develop a simple, unambiguous protocol for identifying cation-π interactions that can be easily applied by other workers.
Within a protein, cation-π interactions can occur between the cationic sidechains of either lysine (Lys, K) or arginine (Arg, R) and the aromatic sidechains of phenylalanine (Phe, F), tyrosine (Tyr, Y) or tryptophan (Trp, W). Because histidine can participate in cation-π interactions as either a cation or as a π-system, depending on its protonation state, we do not consider histidine in this study. We assume Lys and Arg are always protonated.
Acknowledgments
We thank Dr. Scott Silverman for fruitful discussions. J.P.G. thanks the Eastman Kodak Corporation for generous fellowship support. This work was supported by the National Institutes of Health (Grant NS34407).
ABBREVIATIONS
| PDB | Protein Data Bank |
| OPLS | optimized potentials for liquid simulations |
Footnotes
The capture program can be obtained from the authors by e-mail request.
References
- 1. Dougherty D A. Science. 1996;271:163–168.[PubMed]
- 2. Ma J C, Dougherty D A. Chem Rev. 1997;97:1303–1324.[PubMed]
- 3. Scrutton N S, Raine A R C. Biochem J. 1996;319:1–8.
- 4. Sussman J L, Harel M, Frolow F, Oefner C, Goldman A, Toker L, Silman I. Science. 1991;253:872–879.[PubMed]
- 5. Zhong W, Gallivan J P, Zhang Y, Li L, Lester H A, Dougherty D A. Proc Natl Acad Sci USA. 1998;95:12088–12093.
- 6. Perutz M F, Fermi G, Abraham D J, Poyart C, Bursaux E. J Am Chem Soc. 1986;108:1064–1078.[PubMed]
- 7. Levitt M, Perutz M F. J Mol Biol. 1988;201:751–754.[PubMed]
- 8. Perutz M F. Philos Trans R Soc London A. 1993;345:105–112.[PubMed]
- 9. Burley S K, Petsko G A. FEBS Lett. 1986;203:139–143.[PubMed]
- 10. Deakyne C A, Meot-Ner M. J Am Chem Soc. 1985;107:474–479.[PubMed]
- 11. Rodham D A, Suzuki S, Suenram R D, Lovas F J, Dasgupta S, Goddard W A, III, Blake G A. Nature (London) 1993;362:735–737.[PubMed]
- 12. Singh J, Thornton J M. J Mol Biol. 1990;211:595–615.[PubMed]
- 13. Mitchell J B, Nandi C L, Thornton J M, Prince S L, Singh J, Snarey M. J Chem Soc Faraday Trans. 1993;89:2619–2630.[PubMed]
- 14. Mitchell J B O, Nandi C L, McDonald I K, Thornton J M, Price S L. J Mol Biol. 1994;239:315–331.[PubMed]
- 15. Nandi C L, Singh J, Thornton J M. Protein Eng. 1993;6:247–259.[PubMed]
- 16. Mitchell J B O, Laskowski R A, Thornton J M. Proteins. 1997;29:370–380.[PubMed]
- 17. Singh J, Thornton J M Atlas of Protein Side-Chain Interactions. 1 and 2. Oxford: IRL; 1992. [PubMed][Google Scholar]
- 18. Flocco M M, Mowbray S L. J Mol Biol. 1994;235:709–717.[PubMed]
- 19. Hendlich M. Acta Crystallogr D. 1998;54:1178–1182.[PubMed]
- 20. Wouters J. Protein Sci. 1998;7:2472–2475.
- 21. Abola E E, Bernstein F C, Bryant S H, Koetzle T F, Weng J In: Protein Data Bank. Abola E E, Bernstein F C, Bryant S H, Koetzle T F, Weng J, editors. Bonn: Data Commission of the International Union of Crystallography; 1987. pp. 107–132. [PubMed][Google Scholar]
- 22. Jorgensen W L, Tirado-Rives J. J Am Chem Soc. 1988;110:1657–1666.[PubMed]
- 23. Jorgensen W L, Maxwell D S, Tirado-Rives J. J Am Chem Soc. 1996;118:11225–11236.[PubMed]
- 24. Frisch M J, Trucks G W, Schlegel H B, Gill P M W, Johnson B G, Robb M A, Cheeseman J R, Keith T A, Petersson G A, Montgomery J A, et al Gaussian 94 (Revision D.3) Pittsburgh: Gaussian; 1995. [PubMed][Google Scholar]
- 25. Boys S F, Bernardi F. Mol Phys. 1970;19:533–566.[PubMed]
- 26. Kim K S, Lee J Y, Lee S J, Ha T-K, Kim D H. J Am Chem Soc. 1994;116:7399–7400.[PubMed]
- 27. Pullman A, Berthier G, Savinelli R. J Am Chem Soc. 1998;120:8553–8554.[PubMed]
- 28. Kumpf R A, Dougherty D A. Science. 1993;261:1708–1710.[PubMed]
- 29. Caldwell J W, Kollman P A. J Am Chem Soc. 1995;117:4177–4178.[PubMed]
- 30. Donini O, Weaver D F. J Comput Chem. 1998;19:1515–1525.[PubMed]
- 31. Mecozzi S, West A P, Jr, Dougherty D A. J Am Chem Soc. 1996;118:2307–2308.[PubMed]
- 32. Mecozzi S, West A P, Jr, Dougherty D A. Proc Natl Acad Sci USA. 1996;93:10566–10571.
- 33. Hobohm U, Scharf M, Schneider R, Sander C. Protein Sci. 1992;1:409–417.
- 34. Hobohm U, Sander C. Protein Sci. 1994;3:522–524.
- 35. Duffy E M, Kowalczyk P J, Jorgensen W L. J Am Chem Soc. 1993;115:9271–9275.[PubMed]
- 36. Kraulis P J. J Appl Crystallogr. 1991;24:946–950.[PubMed]
- 37. Merritt E A, Bacon D J. Methods Enzymol. 1997;277:505–524.[PubMed]





