sample

Commands under this group are meant to be used as a part of an NGS data processing pipeline, to generate read statistics at different stages (trimming, pre-processing, alignment etc) of the pipeline, and finally combine them into a single tsv file for all the samples in the pipeline, for the final report.

sample-stats

Aggregate read statistics from various steps in the workflow for a single sample into a json file.

Options

option	description	required
–sample-name	Sample name	✓
–raw-reads	Path to seqkit stats output for raw reads	✓
–first-trim	Path to seqkit stats output for reads after first trimming step	✓
–second-trim	Path to seqkit stats output for reads after second trimming step	✗
–rRNA-mapped	Path to seqkit stats output for reads mapped to rRNA reference	✗
–rRNA-free	Path to seqkit stats output for reads not mapped to rRNA reference	✗
–align-stats	Path to bam statistics json file generated using STAR command	✓
–dedup-stats	Path to bam statistics json file after UMI deduplication generated using STAR command	✗
–kraken2-report	Path to kraken2 report file	✗
–out-json	Output json file for collected statistics	✓

Usage

ngs-statter sample-stats --sample-name SAMPLE_NAME --raw-reads path/to/raw_reads_stats.txt --first-trim path/to/first_trim_stats.txt --align-stats path/to/align_stats.json --out-json path/to/sample_stats.json

compile-stats

Aggregate read statistics for multiple samples (generated using sample-stats command) into a single tsv file for the final report.

Options

option	description	required	default value
–output	Output file name for all sample statistics	✓

Usage

ngs-statter compile-stats --output path/to/compiled_stats.tsv path/to/sample1_stats.json path/to/sample2_stats.json