In bioinformatics and biochemistry—where collecting and analyzing complex biological data is a central focus—long character strings are often encoded in a format called FASTA.
In this post, we’ll provide a quick overview of the format and its uses.
Nucleic Acid Codes:
A = adenosine
C = cytidine
G = guanine
T = thymidine
N = A/G/C/T (any)
U = uridine
R = G/A (purine)
Y = T/C (pyrimidine)
K = G/T (keto)
M = A/C (amino)
S = G/C (strong)
W = A/T (weak)
B = G/T/C
D = G/A/T
H = A/C/T
V = G/C/A
Accepted Amino Acid Codes:
A = alanine
B = aspartate or asparagine
C = cystine
D = aspartate
E = glutamate
F = phenylalanine
G = glycine
H = histidine
I = isoleucine
K = lysine
L = leucine
M = methionine
N = asparagine
P = proline
Q = glutamine
R = arginine
S = serine
T = threonine
U = selenocysteine
V = valine
W = tryptophan
Y = tyrosine
Z = glutamate or glutamine
X = any
A single dash or hyphen (-) can be used to represent a gap of indeterminate length and an asterisk (*) can be used to represent a translation stop.
Here are three examples of how FASTA format looks:
>seq1
KYRTWEEFTRAAEKLYQADPMKVRVVLKYRHCDGNLCIKVTDDVVCLLYRTDQAQDVKKIEKFHSQLMRLMELKVTDNKECLKFKTDQAQEAKKMEKLNNIFFTLM
>MCHU - Calmodulin - Human, rabbit, bovine, rat, and chicken
ADQLTEEQIAEFKEAFSLFDKDGDGTITTKELGTVMRSLGQNPTEAELQDMINEVDADGNGTIDFPEFLTMMARKMKDTDSEEEIREAFRVFDKDGNGYISAAELRHVMTNLGEKLTDEEVDEMIREADIDGDGQVNYEEFVQMMTAK*
>U06486.1 Human Wilms' tumor (WT1) gene, 5' region, partial cds CAGTGTCTTGTAGAATCTTCAGTGTCTTGATAATAATTTTAAAAGCTTCTGAGTGGAGACGACGCAAAGTCAAGCAGCAAAGGTGGCCTGGGAGGCAAGCGGAGGGCTCAAGTGCCGCATCTTTACCCTCAGGGTCTCCTGCGCCTACGGGATGCGCATTCCCAAGAAGTGCGCCCTTCGAGTAA
One of the oldest recognized formats in bioinformatics, FASTA format is still widely used in sequence retrieval due to its simplicity and flexibility. Indeed, the format is considered an almost universal standard in the bioinformatics field of research.
In our ongoing effort to help make researchers' lives easier, the Gadgeteers here at Research Solutions have incorporated FASTA Format into a number of our lab analysis and productivity apps or Gadgets, including:
With lab productivity Gadgets, you can save up to 50% of your research time by automating routine tasks – including those that involve FASTA format.
If you haven't done so already, we invite you to sign up for a free account and put these Gadgets to the test! We think you'll be pleased. And of course, we welcome any feedback you may have.