Skip to content

Summary

Downloadable tables of QualiBact thresholds and reference distributions across every species, derived from each species' preferred QC scheme. The quality criteria table below is the primary dataset; the other tables describe the reference distributions those thresholds were fit from. For programmatic access see the API endpoints section.

Primary dataset

Quality criteria

Per-species, per-metric quality thresholds. Each row gives four boundaries: Fail below and Fail above are the published cut-offs that downstream tools consume; Warn below and Warn above are the tighter inner boundaries derived from the reference distribution. A value inside the warn range passes, between warn and fail it warns, and outside fail it fails. This is the same data the per-species pages render.

Showing first 10 of 2656 rows. Search or sort to filter, or expand below.
Species
Scheme
Engine flag
Metric
Fail below
Warn below
Warn above
Fail above
Achromobacter xylosoxidansqualibact-v1.0warnGenome_Size6,200,0006,251,8107,359,1467,400,000
Achromobacter xylosoxidansqualibact-v1.0warnGC_Content65.7765.77968.265768.27
Achromobacter xylosoxidansqualibact-v1.0warnTotal_Coding_Sequences5,6005,6996,8706,900
Achromobacter xylosoxidansqualibact-v1.0warnCompleteness_Specific9191.1
Achromobacter xylosoxidansqualibact-v1.0warnContamination9.2410
Achromobacter xylosoxidansqualibact-v1.0warnN5033,00033,327
Achromobacter xylosoxidansqualibact-v1.0warnno_of_contigs447.22450
Achromobacter xylosoxidansqualibact-v1.0warnlongest
Achromobacter xylosoxidansqualibact-v1.1warnGenome_Size5,900,0005,940,0747,359,1467,400,000
Achromobacter xylosoxidansqualibact-v1.1warnGC_Content6565.77968.265769

Summary statistics

Per-metric distribution statistics for each species (preferred QC scheme only). The preview below shows the most useful columns; the full CSV also includes RefSeq comparison statistics (KS test, Wasserstein distance) and pre-filter bounds. Useful for building your own thresholds or diagnosing how a candidate assembly sits within the reference distribution.

Showing first 10 of 2656 rows. Search or sort to filter, or expand below.
Species
Metric
n
Mean
Median
SD
Q1
Q3
RefSeq n
Achromobacter xylosoxidansN50286243,388144,313200,29298,766373,7130
Achromobacter xylosoxidansnumber286116.849710479.999460141.750
Achromobacter xylosoxidanslongest286665,754458,029484,258303,881949,2790
Achromobacter xylosoxidansGC_Content28667.607767.61410.168467.500367.734328
Achromobacter xylosoxidansCompleteness_Specific28699.99991000.001210010028
Achromobacter xylosoxidansContamination2860.31590.180.44460.10.3428
Achromobacter xylosoxidansTotal_Coding_Sequences2866,0816,080201.18715,9046,24128
Achromobacter xylosoxidansGenome_Size2866,634,2776,615,944189,6456,479,4916,790,04128
Achromobacter xylosoxidansN50387323,578151,927320,67667,362503,5740
Achromobacter xylosoxidansnumber387100.95877885.5554311500

Species count summary

Per species: how many reference genomes contributed to the thresholds (split between RefSeq and other sources), the engine's quality flag for that reference dataset, and how the published thresholds partition the broader All-The-Bacteria dataset into PASS / WARN / FAIL tiers. Use this to sanity-check thresholds for species with small reference sets or non-PASS engine flags.

Showing first 10 of 317 rows. Search or sort to filter, or expand below.
Species
Scheme
Engine flag
Total genomes
Other sources
RefSeq
PASS
WARN
FAIL
Achromobacter xylosoxidansqualibact-v1.1warn41538728
Acinetobacter baumanniiqualibact-v1.0info27,26226,2471,01523,8283,2301,103
Acinetobacter baylyiqualibact-v1.0warn2452423228918
Acinetobacter johnsoniiqualibact-v1.0error112872580146
Acinetobacter nosocomialisqualibact-v1.0warn356321352973019
Acinetobacter pittiiqualibact-v1.0info674592825506524
Acinetobacter radioresistensqualibact-v1.0error9790784106
Acinetobacter ursingiiqualibact-v1.0warn115107893179
Actinobacillus lignieresiiqualibact-v1.0warn48248114453632
Aerococcus urinaequalibact-v1.0error10095591813

API endpoints

QualiBact ships a small set of versioned static endpoints under /api/v2/ so that downstream tools can fetch thresholds without scraping HTML. These files are regenerated on every site build from the per-species manifests. Third-party comparison schemes that QualiBact does not publish itself (currently EnteroBase) live under /api/v2/external/ in the same shape — kept separate so the canonical /api/v2/ contract only carries QualiBact-published thresholds.

EndpointWhat it is
/api/v2/thresholds.csvLong-form CSV: one row per (species, scheme, metric) with fail/warn lower & upper bounds. The source for the table above.
/api/v2/thresholds.jsonSame data as the CSV but JSON-keyed and grouped, easier for programmatic consumption.
/api/v2/thresholds.xlsxSame rows as the CSV, packaged as an Excel workbook for direct opening in spreadsheet software.
/api/v2/index.jsonManifest: schema version, generation timestamp, list of available species and the endpoint paths.
/static/species/{species}/{scheme}/manifest.jsonPer-species, per-scheme bundle: thresholds, distribution summary, plot URLs, and warning flags. One file per pair — the same one the per-species pages read.

External / third-party comparison schemes

Same shape as the canonical endpoints above, but for schemes QualiBact does not publish itself. Currently carries EnteroBase (enterobase-v2.3) rows.

EndpointWhat it is
/api/v2/external/thresholds.csvLong-form CSV of third-party comparison thresholds, mirroring the canonical CSV shape.
/api/v2/external/thresholds.jsonSame data as the external CSV, JSON-keyed and grouped by species & scheme for programmatic consumption.
/api/v2/external/thresholds.xlsxExternal thresholds packaged as an Excel workbook for direct opening in spreadsheet software.
/api/v2/external/index.jsonManifest for the external endpoints: schema version, generation timestamp, species count, and the list of species covered.

The path prefix v2 means the schema will not break compatibly. A schema change that does break consumers gets a new prefix (v3) and v2 stays in place until usage drops.