Campylobacter fetus is a Gram-negative, microaerophilic, spiral-shaped rod of the family Campylobacteraceae, primarily known as a veterinary pathogen causing infertility and abortion in cattle and sheep. In humans, C. fetus subsp. fetus can cause systemic infections including bacteraemia, endocarditis, and meningitis, particularly in immunocompromised or elderly individuals. Its genome encodes a surface-layer protein (S-layer) system that undergoes antigenic variation, allowing the organism to evade host immune responses and establish persistent infections.
Derived from 549 genomes: 30 from RefSeq and 519 from other sources. For the derivation pipeline and the PASS / WARN / FAIL verdict model, see the methods page for qualibact-v1.1.
Applied to the full All-The-Bacteria dataset, these thresholds place 495 genomes at PASS, 39 at WARN, and 40 at FAIL (574 assessed in total). The per-tier genome lists can be downloaded below in .csv.gz format; the FAIL list also records the reason each assembly was rejected.
QualiBact qualibact-v1.1 thresholds for Campylobacter fetus, refined from v1.0 based on the expert-feedback survey (Nov 2025).
GC content lower bound loosened from ~33.0% to 32% on the recommendation of the Campylobacter fetus subgroup. C. fetus subsp. venerealis carries a large accessory genome that can pull the GC content of otherwise high-quality assemblies below the v1.0 lower bound. Setting the threshold at 32% retains those venerealis isolates while remaining tighter than the broader genus-level limit.
All other thresholds inherit unchanged from v1.0. See the methods page for qualibact-v1.1 for the full pipeline.
Threshold values and rationale for Campylobacter fetus (qualibact-v1.1) contributed by:
C. fetus subgroup of the Nov 2025 expert-feedback survey.
This table summarises the distribution of each metric, including standard deviation, mean, median, and percentiles.
A combined summary table across all species is available on the summary page.
| Metric | Distribution | n | Mean | SD | Min | Q1 | Median | Q3 | Max |
|---|---|---|---|---|---|---|---|---|---|
| N50 | non-normal | 519 | 213,936 | 80,681 | 48,267 | 144,373 | 225,677 | 290,826 | 347,940 |
| no_of_contigs | non-normal | 519 | 50.01 | 19.57 | 25 | 36 | 44 | 59 | 125 |
| longest | non-normal | 519 | 440,830 | 183,557 | 103,036 | 262,483 | 369,125 | 619,251 | 743,299 |
| GC_Content | non-normal | 519 | 33.19 | 0.07 | 32.96 | 33.14 | 33.23 | 33.24 | 33.3 |
| Completeness_Specific | non-normal | 519 | 99.97 | 0.02 | 99.92 | 99.96 | 99.97 | 99.98 | 99.99 |
| Contamination | non-normal | 519 | 0.3 | 0.41 | 0.04 | 0.07 | 0.08 | 0.36 | 1.95 |
| Total_Coding_Sequences | non-normal | 519 | 1,851 | 105.8 | 1,722 | 1,762 | 1,823 | 1,945 | 2,121 |
| Genome_Size | non-normal | 519 | 1,804,966 | 74,224 | 1,720,307 | 1,747,105 | 1,768,010 | 1,874,171 | 1,980,564 |
Full statistics including KS test vs RefSeq and Wasserstein distance are in the downloadable summary.csv.
Derived from 549 genomes including 30 RefSeq references
Both Fail and Warn bands shown as the published rounded values — easier to cite and consistent across the species page, CSV downloads, and downstream QC tools.
| Metric | Fail below | Warn below | Warn above | Fail above |
|---|---|---|---|---|
| Genome_Size | 1,700,000 | 1,700,000 | 2,000,000 | 2,200,000 |
| GC_Content | 32 | 33 | 33.3 | 33.5 |
| Total_Coding_Sequences | 1,700 | 1,700 | 2,100 | 2,400 |
| Completeness_Specific | 91 | 99 | - | - |
| Contamination | - | - | 2 | 8 |
| N50 | 59,000 | 82,000 | - | - |
| no_of_contigs | - | - | 110 | 120 |
| longest | - | - | - | - |
How to read this: a value between the two warn columns is typical for this species and passes QC. A value between a warn column and the corresponding fail column is borderline — worth a manual look but not an outright failure. A value outside the fail columns is unusual enough to fail QC.
The published rounded thresholds (the values in the table above) were applied to the full AllTheBacteria-2024-08 set for this species. Each row carries the per-metric verdict and, where applicable, the reason a genome was demoted to WARN or FAIL. Files are gzipped CSV.
This plot shows the relationship between the number of coding sequences (CDS) and genome size — how the number of genes scales with assembly length. The relationship should be roughly linear: as genome size increases, the number of coding sequences should rise proportionally. A secondary trend line or non-linear behaviour can indicate either bona fide sub-populations within the retained genomes (e.g. distinct sub-clades) or residual contamination that survived filtering.
Histogram comparing SRA to RefSeq; each bar shows genome density across value ranges to highlight shifts, peaks, or outliers.
QQ (quantile-quantile) plot comparing SRA and RefSeq. Points along the diagonal follow the expected distribution; deviations indicate skew, outliers, or other systematic differences.
A table of complete RefSeq genomes for Campylobacter fetus used to calibrate this scheme. The file includes accessions, some sample information, genome size, GC content, and other key metrics.
These plots show genomes before and after filtering to highlight the outliers removed:
The filtered distribution shown here may not exactly match the published thresholds because additional rounding and curator adjustments are applied on top.