NTRS - NASA Technical Reports Server

Numerical classification of coding sequences
DNA sequences coding for protein may be represented by counts of nucleotides or codons. A complete reading frame may be abbreviated by its base count, e.g. A76C158G121T74, or with the corresponding codon table, e.g. (AAA)0(AAC)1(AAG)9 ... (TTT)0. We propose that these numerical designations be used to augment current methods of sequence annotation. Because base counts and codon tables do not require revision as knowledge of function evolves, they are well-suited to act as cross-references, for example to identify redundant GenBank entries. These descriptors may be compared, in place of DNA sequences, to extract homologous genes from large databases. This approach permits rapid searching with good selectivity.
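The two descriptors named in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the function names are hypothetical, the input is assumed to be an in-frame coding sequence over A, C, G, T, and the sum of absolute differences used to compare codon tables is an assumed measure, since the abstract does not say how descriptors are compared.

```python
from collections import Counter
from itertools import product

BASES = "ACGT"
# All 64 codons in alphabetical order, AAA ... TTT.
CODONS = ["".join(p) for p in product(BASES, repeat=3)]


def base_count_label(seq):
    """Return a base-count label in the A..C..G..T.. style, e.g. 'A5C0G2T2'."""
    counts = Counter(seq.upper())
    return "".join(f"{b}{counts.get(b, 0)}" for b in BASES)


def codon_table(seq):
    """Count codons in frame 1; a trailing partial codon is ignored."""
    seq = seq.upper()
    usable = len(seq) - len(seq) % 3
    counts = Counter(seq[i:i + 3] for i in range(0, usable, 3))
    # Codons containing symbols other than A, C, G, T are not reported.
    return {c: counts.get(c, 0) for c in CODONS}


def codon_label(table):
    """Render a codon table as '(AAA)0(AAC)1 ... (TTT)0' without spaces."""
    return "".join(f"({c}){n}" for c, n in table.items())


def codon_distance(t1, t2):
    """Assumed comparison: sum of absolute codon-count differences."""
    return sum(abs(t1[c] - t2[c]) for c in CODONS)


if __name__ == "__main__":
    a = "ATGAAGTAA"  # toy reading frame: ATG AAG TAA
    b = "ATGAAATAA"  # same frame with AAG replaced by AAA
    print(base_count_label(a))
    print(codon_distance(codon_table(a), codon_table(b)))
```

Because both descriptors are fixed-length summaries, two database entries with identical labels are candidates for redundancy, and entries with a small distance are candidates for homology. Either case would still need confirmation against the sequences themselves.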
Document ID
20050000741
Acquisition Source
Legacy CDMS
Document Type
Reprint (Version printed in journal)
Authors
Collins, D. W.
(University of California Berkeley 94720)
Liu, C. C.
Jukes, T. H.
Date Acquired
August 22, 2013
Publication Date
March 25, 1992
Publication Information
Publication: Nucleic acids research
Volume: 20
Issue: 6
ISSN: 0305-1048
Subject Category
Exobiology
Funding Number(s)
CONTRACT_GRANT: HG00312
Distribution Limits
Public
Copyright
Other
Keywords
Non-NASA Center
NASA Discipline Exobiology

Available Downloads

There are no available downloads for this record.