Remove non-DNA characters from text to make sequences suitable for other applications
Paste the text containing DNA sequence. Input limit is 500,000,000 characters.
Paste DNA sequence text that may contain formatting characters, numbers, or whitespace.
Select which characters to remove (e.g., non-GATCN, whitespace, digits, or IUPAC ambiguity codes).
Set replacement character for removed bases and choose case conversion (uppercase, lowercase, or preserve).
Click "Execute Tool" to filter the sequence. Download the cleaned sequence or copy it for use in other tools.
Remove digits, spaces, and formatting characters from sequences copied from publications or databases.
Standardize sequences to use only valid IUPAC nucleotide codes for downstream analysis.
Convert sequences to uppercase or lowercase to meet specific tool requirements.
Remove or replace 'U' characters to convert RNA sequences to DNA format.