Klebsiella quasipneumoniae is a Gram-negative, facultatively anaerobic, encapsulated rod in the family Enterobacteriaceae, closely related to Klebsiella pneumoniae and separated from it through genomic analyses. It comprises two subspecies (subsp. quasipneumoniae and subsp. similipneumoniae) and is isolated from clinical specimens including blood, urine, and respiratory samples. K. quasipneumoniae can harbour extended-spectrum beta-lactamase and carbapenemase genes, though it is generally less frequently associated with multidrug resistance and hypervirulence than K. pneumoniae.
These tables summarise the distribution of each metric, including the standard deviation, mean, median, and percentiles.
| Metric | Lower bound | Upper bound |
|---|---|---|
| Genome_Size | 1,900,000 | 13,300,000 |
| GC_Content | 49.00 | 59.00 |
| Total_Coding_Sequences | 1,800 | 14,600 |
| Completeness_Specific | 38.00 | - |
| Contamination | - | 98.00 |
| no_of_contigs | - | 3,460 |
| N50 | 4,000 | - |
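
As a minimal sketch of how the bounds in this table might be applied to a single assembly, the snippet below checks a dictionary of metrics against them. The function name `failed_metrics`, the reading of `-` as "no bound", the choice to treat bounds as inclusive, and the example values are all assumptions made for illustration; they are not taken from the scheme itself.

```python
# Bounds copied from the table; None stands for "-" (no bound given).
QC_BOUNDS = {
    "Genome_Size": (1_900_000, 13_300_000),
    "GC_Content": (49.00, 59.00),
    "Total_Coding_Sequences": (1_800, 14_600),
    "Completeness_Specific": (38.00, None),
    "Contamination": (None, 98.00),
    "no_of_contigs": (None, 3_460),
    "N50": (4_000, None),
}


def failed_metrics(metrics, bounds=QC_BOUNDS):
    """Return the names of metrics that are missing or outside their bounds.

    Assumption: bounds are inclusive, so a value equal to a bound passes.
    """
    failures = []
    for name, (lower, upper) in bounds.items():
        value = metrics.get(name)
        if value is None:
            # A metric that was never computed cannot be shown to pass.
            failures.append(name)
        elif lower is not None and value < lower:
            failures.append(name)
        elif upper is not None and value > upper:
            failures.append(name)
    return failures


# Hypothetical assembly; these values are invented for the example.
example = {
    "Genome_Size": 5_400_000,
    "GC_Content": 57.8,
    "Total_Coding_Sequences": 5_100,
    "Completeness_Specific": 99.1,
    "Contamination": 0.4,
    "no_of_contigs": 85,
    "N50": 250_000,
}
print(failed_metrics(example))
```

Returning the list of failing metric names, rather than a single pass/fail flag, makes it easier to see which threshold excluded a given genome.
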
This plot shows the relationship between the number of coding sequences (CDS) and genome size, to visualise how the two correlate. The relationship should be linear: as genome size increases, the number of coding sequences should also increase. Any secondary trend lines or non-linear behaviour indicate either bona fide separate populations within the retained genomes or some remaining contaminant.
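
One way to check the expected linear trend numerically is to fit a straight line to CDS count against genome size and flag genomes that sit far from it. The sketch below uses ordinary least squares with a relative-residual cut-off; the function names, the 20% tolerance, and the example data are illustrative assumptions, and this is not necessarily how the plot or the filtering was produced.

```python
from statistics import mean


def fit_line(xs, ys):
    """Ordinary least-squares fit of y = slope * x + intercept."""
    x_bar, y_bar = mean(xs), mean(ys)
    sxx = sum((x - x_bar) ** 2 for x in xs)
    if sxx == 0:
        raise ValueError("all genome sizes are identical; slope is undefined")
    sxy = sum((x - x_bar) * (y - y_bar) for x, y in zip(xs, ys))
    slope = sxy / sxx
    return slope, y_bar - slope * x_bar


def flag_off_trend(genome_sizes, cds_counts, max_rel_residual=0.20):
    """Return indices of genomes whose CDS count deviates from the fitted
    line by more than max_rel_residual of the fitted value.

    The 0.20 default is an arbitrary illustrative tolerance.
    """
    slope, intercept = fit_line(genome_sizes, cds_counts)
    flagged = []
    for i, (size, cds) in enumerate(zip(genome_sizes, cds_counts)):
        expected = slope * size + intercept
        if abs(cds - expected) > max_rel_residual * abs(expected):
            flagged.append(i)
    return flagged


# Invented data: five genomes on a common trend and one with excess CDS.
sizes = [5_000_000, 5_200_000, 5_400_000, 5_600_000, 5_800_000, 5_500_000]
cds = [4_700, 4_900, 5_100, 5_300, 5_500, 8_000]
print(flag_off_trend(sizes, cds))
```

Because a least-squares fit is itself pulled towards outliers, a robust fit or a per-population fit may be more appropriate when a secondary trend line is clearly present.
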
Histogram comparing SRA to RefSeq; each bar shows genome density across value ranges to highlight shifts, peaks, or outliers.
A table of complete RefSeq genomes for Klebsiella quasipneumoniae used to calibrate this scheme. The file includes accessions, some sample information, genome size, GC content, and other key metrics.
These plots show genomes before and after filtering to highlight the outliers removed. Left: Heatmap of all genomes in the dataset. Middle: A representative sample of genomes, with anomalies highlighted (purple). Right: The distribution after filtering. Additional adjustments and rounding may have been applied, so the distribution here may not entirely match the final suggested metrics.