DNA Stats

Calculate residue counts, dinucleotide frequencies, and grouped totals for DNA sequences.

Tool Configuration
Configure the parameters for DNA Stats

Paste one or more raw or FASTA-formatted sequences (limit 500,000,000 characters). Non-DNA characters are stripped prior to counting.

1

Input DNA Sequences

Paste one or more DNA sequences in FASTA format or as raw text. Non-DNA characters are automatically stripped before analysis (limit 500 million characters).

2

Execute Statistical Analysis

Click execute to calculate residue counts, dinucleotide frequencies, GC content, and grouped totals. The tool uses legacy SMS patterns for precise compatibility.

3

Interpret Results

Results include raw counts and percentages for each nucleotide, dinucleotide pair (e.g., AA, AT, GC), and grouped categories like purines vs pyrimidines.

How to Interpret
Understanding the counts and percentages

Counts

Raw counts are calculated by applying each legacy SMS pattern (including dinucleotide lookaheads) to the cleaned sequence.

Percentages

Percentages divide the counts by length + 1 - k, where k matches the pattern’s adjustment value, reproducing legacy results.

Grouped Totals

Group rows (e.g. g,c or degeneracy sets) combine bases via the same regex patterns used by the legacy DNA Stats tool.