Skip to content

seqkit

seqkit is a cross-platform toolkit for FASTA/FASTQ file manipulation. Liatir uses the stats subcommand to compute sequence statistics — read/sequence counts, length distribution, GC content, N50, and quality rates.

Details

PropertyValue
TypeNative tool
Binaryseqkit
Subcommandstats

Installation

seqkit must be installed and available in your system PATH.

bash
brew install seqkit
bash
conda install -c bioconda seqkit
bash
sudo apt install seqkit

Accepted inputs

ExtensionDescription
.fasta, .fa, .fnaUncompressed FASTA
.fasta.gz, .fa.gzGzip-compressed FASTA
.fastq, .fqUncompressed FASTQ
.fastq.gz, .fq.gzGzip-compressed FASTQ

seqkit handles both compressed and uncompressed formats natively without piping through gunzip first.

Running seqkit stats

  1. Navigate to Tools → seqkit.
  2. Select a FASTA or FASTQ file from your Data library.
  3. Optionally enable extended stats to include N50, Q20, Q30, and GC content (adds a small overhead for large files).
  4. Click Run stats.

Output metrics

Basic stats (always included)

MetricDescription
SequencesTotal number of sequences (reads for FASTQ, entries for FASTA)
Total lengthSum of all sequence lengths
Min lengthShortest sequence
Avg lengthMean sequence length
Max lengthLongest sequence

Extended stats (-a flag)

MetricDescription
N50Half of total assembly length is contained in sequences ≥ N50. Primary assembly quality metric.
GC contentPercentage of guanine + cytosine bases
Q20 rateFraction of bases with Phred quality ≥ 20 (FASTQ only)
Q30 rateFraction of bases with Phred quality ≥ 30 (FASTQ only)

N50 for assemblies

N50 is a key metric for genome assemblies. A higher N50 means larger contigs. For chromosome-level assemblies N50 ≈ chromosome length; for fragmented assemblies N50 may be a few hundred Kb. A low N50 in a FASTA intended to be a complete genome suggests the assembly is highly fragmented.

Liatir — powerful bioinformatics on your machine.

By using this app, you agree to our Privacy Policy and Terms of Service.