BioFileKit
FASTA validation, cleaning, statistics, and CSV export
v1.2 Research-Safe · 🔒 No signup required · 🖥️ All processing in your browser

Validate, clean, analyze, and convert FASTA files, all in your browser. Built for bioinformatics researchers who need accuracy, transparency, and zero data uploads.

Sequences: 8
Type: Mixed
Warnings: 4
Errors: 0
Avg GC: 43.15%
Duplicates: None

FASTA Input

Paste or edit sequence data below

sample.fasta

Validation Mode

Cleaning Options

Line width: 60

Sequence Report (8 records)

ID | Description | Detected | Confidence | Expected | Length | GC% | Invalid | Status | Message
sequence_1 | - | DNA | High | Auto | 20 | 50% | 0 | Clean | No issues detected
sequence_2 | - | DNA | High | Auto | 18 | 50% | 0 | Clean | No issues detected
sequence_with_lowercase | - | DNA | High | Auto | 39 | 48.72% | 0 | Warning | Lowercase letters found; cleaning will convert to uppercase
mixed_case_sequence | - | Ambiguous nucleotide | Medium | Auto | 20 | 20% | 0 | Warning | Lowercase letters found; cleaning will convert to uppercase; Ambiguous bases found
sequence_with_spaces_and_invalid | - | Invalid | Low | Auto | 24 | 33.33% | 0 | Warning | Spaces found; Contains DNA-like pattern with characters not valid for DNA/RNA: X, Y
long_dna_sequence | - | DNA | High | Auto | 240 | 50% | 0 | Clean | No issues detected
protein_sequence_example | - | Protein | Medium | Auto | 142 | NA | 0 | Warning | Protein sequence detected; GC skipped
rna_sequence | - | RNA | High | Auto | 40 | 50% | 0 | Clean | No issues detected
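
The GC% column and the Avg GC figure can be reproduced with a short calculation. The following is a minimal Python sketch, not BioFileKit's own code: `gc_percent` and `mean_gc` are hypothetical names. It assumes that GC% is the share of G and C among all characters of the whitespace-stripped, uppercased sequence, and that Avg GC is the unweighted mean over the records that have a GC value (the protein record is skipped). Under those assumptions, the seven per-record values in the report average to the 43.15% shown in the summary.

```python
def gc_percent(seq: str) -> float:
    """Percentage of G and C characters in a sequence (case-insensitive)."""
    cleaned = "".join(seq.split()).upper()  # drop whitespace, normalise case
    if not cleaned:
        return 0.0
    gc = sum(1 for base in cleaned if base in "GC")
    return round(100.0 * gc / len(cleaned), 2)


def mean_gc(values: list[float]) -> float:
    """Unweighted mean of per-record GC percentages (assumed averaging rule)."""
    return round(sum(values) / len(values), 2)


# Per-record GC% values copied from the report; the protein record (NA) is left out.
reported = [50, 50, 48.72, 20, 33.33, 50, 50]
print(mean_gc(reported))        # 43.15
print(gc_percent("acgt ACGT"))  # 50.0
```

A length-weighted average would give a different figure, so the averaging rule matters when comparing the summary against other tools.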

⚠️ 6 Warning(s)

  • Sequence 'sequence_with_lowercase': Lowercase letters found; cleaning will convert to uppercase
  • Sequence 'mixed_case_sequence': Lowercase letters found; cleaning will convert to uppercase
  • Sequence 'mixed_case_sequence': Ambiguous bases found
  • Sequence 'sequence_with_spaces_and_invalid': Spaces found
  • Sequence 'sequence_with_spaces_and_invalid': Contains DNA-like pattern with characters not valid for DNA/RNA: X, Y
  • …and 1 more

🧹 Cleaning Summary

Modified
⚠️ Modified output. Cleaned sequences have been altered from the original file. Always verify the changes before using in research workflows.
Uppercased: 46
Spaces: 7
Invalid: 0
Converted 46 lowercase bases to uppercase
Removed 7 spaces/whitespace characters
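
The two cleaning steps reported above (uppercasing and whitespace removal) can be sketched as a single pass that also counts each change. This is an illustrative Python sketch under stated assumptions, not the tool's implementation: `CleanResult` and `clean_sequence` are hypothetical names, and characters that are invalid for the detected alphabet are assumed to be left untouched.

```python
from dataclasses import dataclass


@dataclass
class CleanResult:
    sequence: str
    uppercased: int          # lowercase letters converted to uppercase
    whitespace_removed: int  # spaces, tabs, and other whitespace dropped


def clean_sequence(raw: str) -> CleanResult:
    """Strip whitespace and uppercase one sequence, counting each change."""
    kept = []
    uppercased = 0
    whitespace_removed = 0
    for ch in raw:
        if ch.isspace():
            whitespace_removed += 1
            continue
        if ch.islower():
            uppercased += 1
        kept.append(ch.upper())
    return CleanResult("".join(kept), uppercased, whitespace_removed)


result = clean_sequence("acg TTa c")
print(result.sequence, result.uppercased, result.whitespace_removed)
# ACGTTAC 5 2
```

Summing the two counters over all records would give file-level totals like the ones in the summary. Keeping the original string alongside the cleaned one makes it easy to verify the changes before using the output.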

📊 Statistics

Total bases: 543
Min: 18
Max: 240
Mean: 68.75
Median: 35
N50: 142
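
Total bases, Min, Max, and N50 can be checked against the Length column of the sequence report. The sketch below uses the common definition of N50: sort the lengths longest first and return the length at which the running total reaches half of all bases. The function name `n50` is hypothetical, and the sketch is not taken from BioFileKit. Mean and Median are left out because the page does not say whether they are computed from lengths before or after whitespace removal.

```python
def n50(lengths: list[int]) -> int:
    """Length at which the running total, longest first, reaches half of all bases."""
    total = sum(lengths)
    running = 0
    for length in sorted(lengths, reverse=True):
        running += length
        if running * 2 >= total:
            return length
    return 0  # empty input


# Lengths copied from the Length column of the sequence report.
lengths = [20, 18, 39, 20, 24, 240, 142, 40]
print(sum(lengths), min(lengths), max(lengths), n50(lengths))
# 543 18 240 142
```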

💾 Export

📋 Per-Record Issues

ID | Severity | Issue | Message
sequence_1 | Clean | None | No issues detected
sequence_2 | Clean | None | No issues detected
sequence_with_lowercase | Warning | Lowercase letters | Lowercase letters found; cleaning will convert to uppercase
mixed_case_sequence | Warning | Lowercase letters | Lowercase letters found; cleaning will convert to uppercase
mixed_case_sequence | Warning | Ambiguous base | Ambiguous bases found
sequence_with_spaces_and_invalid | Warning | Whitespace | Spaces found
sequence_with_spaces_and_invalid | Warning | Invalid DNA-like character | Contains DNA-like pattern with characters not valid for DNA/RNA: X, Y
long_dna_sequence | Clean | None | No issues detected
protein_sequence_example | Warning | GC skipped | Protein sequence detected; GC skipped
rna_sequence | Clean | None | No issues detected

BioFileKit v1.2 Research-Safe. All sequence data is processed locally in your browser. No signup. No login. No data uploads. Built for bioinformatics researchers who need accuracy, transparency, and privacy.