Sequence logos: a new way to display consensus sequences.
Journal: 1990/December - Nucleic Acids Research
ISSN: 0305-1048
PUBMED: 2172928
Abstract:
A graphical method is presented for displaying the patterns in a set of aligned sequences. The characters representing the sequence are stacked on top of each other for each position in the aligned sequences. The height of each letter is made proportional to its frequency, and the letters are sorted so the most common one is on top. The height of the entire stack is then adjusted to signify the information content of the sequences at that position. From these 'sequence logos', one can determine not only the consensus sequence but also the relative frequency of bases and the information content (measured in bits) at every position in a site or sequence. The logo displays both significant residues and subtle sequence patterns.
Relations:
Content
Citations
(1K+)
References
(11)
Conditions
(1)
Chemicals
(2)
Organisms
(4)
Processes
(4)
Affiliates
(1)
Similar articles
Articles by the same authors
Discussion board
Nucleic Acids Res 18(20): 6097-6100

Sequence logos: a new way to display consensus sequences.

Abstract

A graphical method is presented for displaying the patterns in a set of aligned sequences. The characters representing the sequence are stacked on top of each other for each position in the aligned sequences. The height of each letter is made proportional to its frequency, and the letters are sorted so the most common one is on top. The height of the entire stack is then adjusted to signify the information content of the sequences at that position. From these 'sequence logos', one can determine not only the consensus sequence but also the relative frequency of bases and the information content (measured in bits) at every position in a site or sequence. The logo displays both significant residues and subtle sequence patterns.

Full text

Full text is available as a scanned copy of the original print version. Get a printable copy (PDF file) of the complete article (635K), or click on a page image below to browse page by page. Links to PubMed are also available for Selected References.

Selected References

These references are in PubMed. This may not be the complete list of references from this article.
  • Gold L, Pribnow D, Schneider T, Shinedling S, Singer BS, Stormo G. Translational initiation in prokaryotes. Annu Rev Microbiol. 1981;35:365–403. [PubMed] [Google Scholar]
  • Sadler JR, Waterman MS, Smith TF. Regulatory pattern identification in nucleic acid sequences. Nucleic Acids Res. 1983 Apr 11;11(7):2221–2231.[PMC free article] [PubMed] [Google Scholar]
  • Stormo GD. Consensus patterns in DNA. Methods Enzymol. 1990;183:211–221. [PubMed] [Google Scholar]
  • Schneider TD, Stormo GD, Gold L, Ehrenfeucht A. Information content of binding sites on nucleotide sequences. J Mol Biol. 1986 Apr 5;188(3):415–431. [PubMed] [Google Scholar]
  • Fields C. Information content of Caenorhabditis elegans splice site sequences varies with intron length. Nucleic Acids Res. 1990 Mar 25;18(6):1509–1512.[PMC free article] [PubMed] [Google Scholar]
  • Schneider TD, Stormo GD. Excess information at bacteriophage T7 genomic promoters detected by a random cloning technique. Nucleic Acids Res. 1989 Jan 25;17(2):659–674.[PMC free article] [PubMed] [Google Scholar]
  • Hein J. Unified approach to alignment and phylogenies. Methods Enzymol. 1990;183:626–645. [PubMed] [Google Scholar]
  • Schneider TD, Stormo GD, Haemer JS, Gold L. A design for computer nucleic-acid-sequence storage, retrieval, and manipulation. Nucleic Acids Res. 1982 May 11;10(9):3013–3024.[PMC free article] [PubMed] [Google Scholar]
  • Schneider TD, Stormo GD, Yarus MA, Gold L. Delila system tools. Nucleic Acids Res. 1984 Jan 11;12(1 Pt 1):129–140.[PMC free article] [PubMed] [Google Scholar]
  • Shine J, Dalgarno L. The 3'-terminal sequence of Escherichia coli 16S ribosomal RNA: complementarity to nonsense triplets and ribosome binding sites. Proc Natl Acad Sci U S A. 1974 Apr;71(4):1342–1346.[PMC free article] [PubMed] [Google Scholar]
  • Bashford D, Chothia C, Lesk AM. Determinants of a protein fold. Unique features of the globin amino acid sequences. J Mol Biol. 1987 Jul 5;196(1):199–216. [PubMed] [Google Scholar]
National Cancer Institute, Frederick Cancer Research and Development Center, MD 21701.
National Cancer Institute, Frederick Cancer Research and Development Center, MD 21701.
Abstract
A graphical method is presented for displaying the patterns in a set of aligned sequences. The characters representing the sequence are stacked on top of each other for each position in the aligned sequences. The height of each letter is made proportional to its frequency, and the letters are sorted so the most common one is on top. The height of the entire stack is then adjusted to signify the information content of the sequences at that position. From these 'sequence logos', one can determine not only the consensus sequence but also the relative frequency of bases and the information content (measured in bits) at every position in a site or sequence. The logo displays both significant residues and subtle sequence patterns.
Collaboration tool especially designed for Life Science professionals.Drag-and-drop any entity to your messages.