TEA-FASTA Files

The TEA-FASTA format follows the standard FASTA format, but contains tea-sequences and carries additional information in each header. Each header ends with ‘|H=x’, where x is the average normalised Shannon entropy of the tea-sequence, which can be interpreted as a measure of uncertainty. Sequences with H<0.25 are more likely to be well described and lead to better search results (see the pre-print for more details). The same threshold is applied per residue: residues with H>0.25 are represented as lowercase characters in tea-sequences.
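
As an illustration of the header convention, the following is a minimal Python sketch of how a TEA-FASTA file might be read. It is not part of any official tooling: the names parse_tea_fasta, TeaRecord and uncertain_fraction are hypothetical, the sample identifiers and sequence letters are made up, and the code assumes that ‘|H=x’ is the final field of every header line.

```python
from typing import Iterator, List, NamedTuple


class TeaRecord(NamedTuple):
    header: str     # header text without the leading '>' and the '|H=x' suffix
    entropy: float  # x, parsed from the '|H=x' suffix
    sequence: str   # tea-sequence with its original upper/lowercase preserved


def _make_record(header: str, chunks: List[str]) -> TeaRecord:
    # Assumption: '|H=x' is the last field, so split on its final occurrence.
    name, sep, value = header.rpartition("|H=")
    if not sep:
        raise ValueError(f"header does not end with '|H=x': {header!r}")
    return TeaRecord(name, float(value), "".join(chunks))


def parse_tea_fasta(text: str) -> Iterator[TeaRecord]:
    """Yield one TeaRecord per '>' header found in the given text."""
    header = None
    chunks: List[str] = []
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            if header is not None:
                yield _make_record(header, chunks)
            header, chunks = line[1:], []
        elif header is not None:
            chunks.append(line)
    if header is not None:
        yield _make_record(header, chunks)


def uncertain_fraction(sequence: str) -> float:
    """Fraction of residues written in lowercase, i.e. flagged with H>0.25."""
    if not sequence:
        return 0.0
    return sum(ch.islower() for ch in sequence) / len(sequence)


# Made-up sample data; the letters are placeholders, not real tea-sequences.
SAMPLE = ">seq1|H=0.12\nACDEfgHK\n>seq2|H=0.40\nacdEF\n"

records = list(parse_tea_fasta(SAMPLE))
# Keep only sequences below the H<0.25 threshold mentioned in the text.
confident = [r for r in records if r.entropy < 0.25]
```

In this sketch, confident keeps only the records whose header entropy is below 0.25, and uncertain_fraction gives a rough per-sequence view of how many residues were written in lowercase.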