v1.2 Research-Safeπ No signup requiredπ₯οΈ All processing in your browser
BioFileKit
Validate, clean, analyze, and convert FASTA files β all in your browser. Built for bioinformatics researchers who need accuracy, transparency, and zero data uploads.
Sequences
8
Type
Mixed
Warnings
4
Errors
0
Avg GC
43.15%
Duplicates
None
FASTA Input
Paste or edit sequence data below
sample.fasta
Validation Mode
Cleaning Options
Line width:60
Sequence Report(8 records)
| ID | Description | Detected | Confidence | Expected | Length | GC% | Invalid | Status | Message |
|---|---|---|---|---|---|---|---|---|---|
| sequence_1 | β | DNA | High | Auto | 20 | 50% | 0 | Clean | No issues detected |
| sequence_2 | β | DNA | High | Auto | 18 | 50% | 0 | Clean | No issues detected |
| sequence_with_lowercase | β | DNA | High | Auto | 39 | 48.72% | 0 | Warning | Lowercase letters found; cleaning will convert to uppercase |
| mixed_case_sequence | β | Ambiguous nucleotide | Medium | Auto | 20 | 20% | 0 | Warning | Lowercase letters found; cleaning will convert to uppercase; Ambiguous bases found |
| sequence_with_spaces_and_invalid | β | Invalid | Low | Auto | 24 | 33.33% | 0 | Warning | Spaces found; Contains DNA-like pattern with characters not valid for DNA/RNA: X, Y |
| long_dna_sequence | β | DNA | High | Auto | 240 | 50% | 0 | Clean | No issues detected |
| protein_sequence_example | β | Protein | Medium | Auto | 142 | NA | 0 | Warning | Protein sequence detected; GC skipped |
| rna_sequence | β | RNA | High | Auto | 40 | 50% | 0 | Clean | No issues detected |
β οΈ 6 Warning(s)
- β’ Sequence 'sequence_with_lowercase': Lowercase letters found; cleaning will convert to uppercase
- β’ Sequence 'mixed_case_sequence': Lowercase letters found; cleaning will convert to uppercase
- β’ Sequence 'mixed_case_sequence': Ambiguous bases found
- β’ Sequence 'sequence_with_spaces_and_invalid': Spaces found
- β’ Sequence 'sequence_with_spaces_and_invalid': Contains DNA-like pattern with characters not valid for DNA/RNA: X, Y
- β¦and 1 more
π§Ή Cleaning Summary
Modifiedβ οΈ Modified output. Cleaned sequences have been altered from the original file. Always verify the changes before using in research workflows.
46
Uppercased
7
Spaces
0
Invalid
Converted 46 lowercase bases to uppercase
Removed 7 spaces/whitespace characters
π Statistics
543
Total bases
18
Min
240
Max
68.75
Mean
35
Median
142
N50
πΎ Export
π Per-Record Issues
| ID | Severity | Issue | Message |
|---|---|---|---|
| sequence_1 | Clean | None | No issues detected |
| sequence_2 | Clean | None | No issues detected |
| sequence_with_lowercase | Warning | Lowercase letters | Lowercase letters found; cleaning will convert to uppercase |
| mixed_case_sequence | Warning | Lowercase letters | Lowercase letters found; cleaning will convert to uppercase |
| mixed_case_sequence | Warning | Ambiguous base | Ambiguous bases found |
| sequence_with_spaces_and_invalid | Warning | Whitespace | Spaces found |
| sequence_with_spaces_and_invalid | Warning | Invalid DNA-like character | Contains DNA-like pattern with characters not valid for DNA/RNA: X, Y |
| long_dna_sequence | Clean | None | No issues detected |
| protein_sequence_example | Warning | GC skipped | Protein sequence detected; GC skipped |
| rna_sequence | Clean | None | No issues detected |
BioFileKit v1.2 Research-Safe β All sequence data is processed locally in your browser. No signup. No login. No data uploads.Built for bioinformatics researchers who need accuracy, transparency, and privacy.