Calculate residue counts, dinucleotide frequencies, and grouped totals for DNA sequences.
Paste one or more raw or FASTA-formatted sequences (limit 500,000,000 characters). Non-DNA characters are stripped prior to counting.
Paste one or more DNA sequences in FASTA format or as raw text. Non-DNA characters are automatically stripped before analysis (limit 500 million characters).
Click execute to calculate residue counts, dinucleotide frequencies, GC content, and grouped totals. The tool uses legacy SMS patterns for precise compatibility.
Results include raw counts and percentages for each nucleotide, dinucleotide pair (e.g., AA, AT, GC), and grouped categories like purines vs pyrimidines.
Raw counts are calculated by applying each legacy SMS pattern (including dinucleotide lookaheads) to the cleaned sequence.
Percentages divide the counts by length + 1 - k, where k matches the pattern’s adjustment value, reproducing legacy results.
Group rows (e.g. g,c or degeneracy sets) combine bases via the same regex patterns used by the legacy DNA Stats tool.