Note to users. If you're seeing this message, it means that your browser cannot find this page's style/presentation instructions -- or possibly that you are using a browser that does not support current Web standards. Find out more about why this message is appearing, and what you can do to make your experience of our site the best it can be.
Career Basics

Site Tools

  • AAAS
  • Subscribe
  • Feedback

Site Search

Search Advanced

Science 3 July 1992:
Vol. 257. no. 5066, pp. 39 - 49
DOI: 10.1126/science.1621093

Articles

Science, Vol 257, Issue 5066, 39-49
Copyright © 1992 by American Association for the Advancement of Science


articles

Chance and statistical significance in protein and DNA sequence analysis

S Karlin and V Brendel

Department of Mathematics, Stanford University, CA 94305.

Statistical approaches help in the determination of significant configurations in protein and nucleic acid sequence data. Three recent statistical methods are discussed: (i) score-based sequence analysis that provides a means for characterizing anomalies in local sequence text and for evaluating sequence comparisons; (ii) quantile distributions of amino acid usage that reveal general compositional biases in proteins and evolutionary relations; and (iii) r-scan statistics that can be applied to the analysis of spacings of sequence markers.


THIS ARTICLE HAS BEEN CITED BY OTHER ARTICLES:
De novo search for non-coding RNA genes in the AT-rich genome of Dictyostelium discoideum: Performance of Markov-dependent genome feature scoring.
P. Larsson, A. Hinas, D. H. Ardell, L. A. Kirsebom, A. Virtanen, and F. Soderbom (2008)
Genome Res. 18, 888-899
   Abstract »    Full Text »    PDF »
AIMIE: a web-based environment for detection and interpretation of significant sequence motifs in prokaryotic genomes.
J. Mrazek, S. Xie, X. Guo, and A. Srivastava (2008)
Bioinformatics 24, 1041-1048
   Abstract »    Full Text »    PDF »
An Exact Nonparametric Method for Inferring Mosaic Structure in Sequence Triplets.
M. F. Boni, D. Posada, and M. W. Feldman (2007)
Genetics 176, 1035-1047
   Abstract »    Full Text »    PDF »
ClusterDraw web server: a tool to identify and visualize clusters of binding motifs for transcription factors.
D. Papatsenko (2007)
Bioinformatics 23, 1032-1034
   Abstract »    Full Text »    PDF »
UV-Targeted Dinucleotides Are Not Depleted in Light-Exposed Prokaryotic Genomes.
L. Palmeira, L. Gueguen, and J. R. Lobry (2006)
Mol. Biol. Evol. 23, 2214-2219
   Abstract »    Full Text »    PDF »
Statistical significance in biological sequence analysis.
A. Yu. Mitrophanov and M. Borodovsky (2006)
Brief Bioinform 7, 2-24
Colloquium Perspective: Statistical signals in bioinformatics.
S. Karlin (2005)
PNAS 102, 13355-13362
   Abstract »    Full Text »    PDF »
The replication-related organization of bacterial genomes.
E. P. C. Rocha (2004)
Microbiology 150, 1609-1627
   Abstract »    Full Text »    PDF »
Intimate Evolution of Proteins: PROTEOME ATOMIC CONTENT CORRELATES WITH GENOME BASE COMPOSITION.
P. Baudouin-Cornu, K. Schuerer, P. Marliere, and D. Thomas (2004)
J. Biol. Chem. 279, 5421-5428
   Abstract »    Full Text »    PDF »
Molecular Evolution of Protein Atomic Composition.
P. Baudouin-Cornu, Y. Surdin-Kerjan, P. Marliere, and D. Thomas (2001)
Science 293, 297-300
   Abstract »    Full Text »
Highly expressed and alien genes of the Synechocystis genome.
J. Mrazek, D. Bhaya, A. R. Grossman, and S. Karlin (2001)
Nucleic Acids Res. 29, 1590-1601
   Abstract »    Full Text »    PDF »
SELEX_DB: an activated database on selected randomized DNA/RNA sequences addressed to genomic sequence annotation.
J. V. Ponomarenko, G. V. Orlova, M. P. Ponomarenko, S. V. Lavryushev, A. S. Frolov, S. V. Zybova, and N. A. Kolchanov (2000)
Nucleic Acids Res. 28, 205-208
   Abstract »    Full Text »    PDF »
Analysis of the Interaction of the Novel RNA Polymerase II (pol II) Subunit hsRPB4 with Its Partner hsRPB7 and with pol II.
V. Khazak, J. Estojak, H. Cho, J. Majors, G. Sonoda, J. R. Testa, and E. A. Golemis (1998)
Mol. Cell. Biol. 18, 1935-1945
   Abstract »    Full Text »
Analysis of DNA sequences.
B. Weir (1993)
Statistical Methods in Medical Research 2, 225-239
   Abstract »    PDF »
Amplifying DNA with arbitrary oligonucleotide primers..
G Caetano-Anolles (1993)
Genome Res. 3, 85-94
   PDF »
Patchiness and correlations in DNA sequences.
S Karlin and V Brendel (1993)
Science 259, 677-680
   Abstract »    PDF »
Genes, pseudogenes, and Alu sequence organization across human chromosomes 21 and 22.
C. Chen, A. J. Gentles, J. Jurka, and S. Karlin (2002)
PNAS 99, 2930-2935
   Abstract »    Full Text »    PDF »
Scan statistics to scan markers for susceptibility genes.
J. Hoh and J. Ott (2000)
PNAS 97, 9615-9617
   Abstract »    Full Text »    PDF »



ADVERTISEMENT
Click Me!

ADVERTISEMENT
Click Me!

To Advertise     Find Products


Science. ISSN 0036-8075 (print), 1095-9203 (online)