HOMD User Documentation     
4-5] FASTA format description
A sequence in FASTA format begins with a single-line description, followed by lines of sequence data. The description line is distinguished from the sequence data by a greater-than (">") symbol in the first column. It is recommended that all lines of text be shorter than 80 characters in length. An example sequence in FASTA format is:
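As a minimal sketch of how such a record can be read and written, the Python below parses FASTA text into (description, sequence) pairs and re-wraps sequence lines so they stay under the recommended 80 characters. The record `example_seq_1` and the function names `parse_fasta` and `format_fasta` are hypothetical, invented for illustration; they are not part of HOMD or BLAST.

```python
from typing import Iterator, Tuple


def parse_fasta(text: str) -> Iterator[Tuple[str, str]]:
    """Yield (description, sequence) pairs from FASTA-formatted text."""
    description = None
    chunks = []
    for line in text.splitlines():
        line = line.rstrip()
        if not line:
            continue  # ignore blank lines
        if line.startswith(">"):
            # A ">" in the first column starts a new record.
            if description is not None:
                yield description, "".join(chunks)
            description = line[1:].strip()
            chunks = []
        elif description is not None:
            chunks.append(line)
    if description is not None:
        yield description, "".join(chunks)


def format_fasta(description: str, sequence: str, width: int = 70) -> str:
    """Write one record, wrapping sequence lines at `width` characters."""
    lines = [">" + description]
    lines += [sequence[i:i + width] for i in range(0, len(sequence), width)]
    return "\n".join(lines) + "\n"


# Hypothetical record, invented for illustration only.
example = """>example_seq_1 hypothetical description
ACGTACGTAC
GGTTAACC
"""
records = list(parse_fasta(example))
```

Note that `format_fasta` only wraps the sequence lines; keeping the description line short is left to the caller.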
Sequences are expected to be represented in the standard IUB/IUPAC amino acid and nucleic acid codes, with these exceptions: lower-case letters are accepted and are mapped into upper-case; a single hyphen or dash can be used to represent a gap of indeterminate length; and in amino acid sequences, U and * are acceptable letters (see below). Before submitting a request, any numerical digits in the query sequence should either be removed or replaced by appropriate letter codes (e.g., N for unknown nucleic acid residue or X for unknown amino acid residue).
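The clean-up described above can be sketched in a few lines of Python. This is only one possible approach, assuming that digits and whitespace should be dropped rather than replaced by N or X; the function name `clean_query` is hypothetical.

```python
def clean_query(raw: str) -> str:
    """Upper-case the residues and drop numerical digits and whitespace."""
    return "".join(
        ch.upper() for ch in raw if not ch.isdigit() and not ch.isspace()
    )


# A query pasted with position numbers, as often seen in flat-file records.
cleaned = clean_query("1 acgt nnac 61\n")
```

Hyphens are left untouched, so a gap of indeterminate length survives the clean-up.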
The nucleic acid codes supported are:
A --> adenosine           M --> A C (amino)
C --> cytidine            S --> G C (strong)
G --> guanine             W --> A T (weak)
T --> thymidine           B --> G T C
U --> uridine             D --> G A T
R --> G A (purine)        H --> A C T
Y --> T C (pyrimidine)    V --> G C A
K --> G T (keto)          N --> A G C T (any)
                          -  gap of indeterminate length
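The nucleic acid codes can be captured as a lookup table, which makes it easy to check a query before submitting it. The sketch below is illustrative only: the names `IUPAC_NUCLEOTIDES`, `invalid_nucleotides`, and `could_match` are hypothetical, and the mapping simply transcribes the codes listed above.

```python
# Each code maps to the set of bases it can stand for.
IUPAC_NUCLEOTIDES = {
    "A": "A", "C": "C", "G": "G", "T": "T", "U": "U",
    "R": "GA", "Y": "TC", "K": "GT", "M": "AC", "S": "GC", "W": "AT",
    "B": "GTC", "D": "GAT", "H": "ACT", "V": "GCA", "N": "AGCT",
}
GAP = "-"  # gap of indeterminate length


def invalid_nucleotides(sequence: str) -> list:
    """Return the sorted characters that are not accepted nucleic acid codes."""
    allowed = set(IUPAC_NUCLEOTIDES) | {GAP}
    return sorted({ch for ch in sequence.upper() if ch not in allowed})


def could_match(code: str, base: str) -> bool:
    """True if the (possibly ambiguous) code can represent the given base."""
    return base.upper() in IUPAC_NUCLEOTIDES[code.upper()]
```

For example, `could_match("R", "a")` is true because R stands for G or A, while `invalid_nucleotides("ACGTXZ1")` reports the digit and the two letters that are not nucleic acid codes.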
For those programs that use amino acid query sequences (BLASTP and TBLASTN), the accepted amino acid codes are:
A  alanine                         P  proline
B  aspartate or asparagine         Q  glutamine
C  cysteine                        R  arginine
D  aspartate                       S  serine
E  glutamate                       T  threonine
F  phenylalanine                   U  selenocysteine
G  glycine                         V  valine
H  histidine                       W  tryptophan
I  isoleucine                      Y  tyrosine
K  lysine                          Z  glutamate or glutamine
L  leucine                         X  any
M  methionine                      *  translation stop
N  asparagine                      -  gap of indeterminate length
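A protein query can be checked against this list in the same spirit. The following sketch reports the position of every character that is not an accepted amino acid code; `AMINO_ACID_CODES` and `find_invalid_residues` are hypothetical names, and the accepted set is transcribed from the table above (so J and O, which the table does not list, are treated as invalid here).

```python
# Letters listed in the table above, plus translation stop and gap.
AMINO_ACID_CODES = set("ABCDEFGHIKLMNPQRSTUVWXYZ") | {"*", "-"}


def find_invalid_residues(sequence: str) -> list:
    """Return (1-based position, character) for each unaccepted character."""
    return [
        (position, ch)
        for position, ch in enumerate(sequence.upper(), start=1)
        if ch not in AMINO_ACID_CODES
    ]
```

An empty result means every character in the query is one of the accepted codes; lower-case input is upper-cased first, matching the rule that lower-case letters are accepted.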
Article last modified on 2014-04-08 11:00:54 by lyang; viewed 2754 times; Category: User Documentation; Topic: Tools & Downloa
Copyright 2007-2022 The Forsyth Institute