QC Scheme: REFSEQ-QC-v1 for Candida auris

For detailed methods on how these thresholds were calculated, please see Methods. The suggested thresholds are in the table below. These thresholds are based on 59 genomes from RefSeq and 0 genomes from other sources.

Summary tables

These tables summarise the distribution of each metric, including the standard deviation, mean, median, and percentiles.

Suggested thresholds for Candida auris (REFSEQ-QC-v1)

Metric                  Lower bound  Upper bound
Genome_Size             12,000,000   12,400,000
GC_Content              45.00        46.00
Total_Coding_Sequences  10,300       12,000
Completeness_Specific   100.0        -
Contamination           -            93.00
no_of_contigs           -            110.0
N50                     435,000      -
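As an illustration of how these thresholds might be applied, the sketch below checks one genome's metrics against the bounds in the table. The function name `failed_metrics`, the dictionary layout, and the example values are hypothetical, and treating the bounds as inclusive is an assumption; the page does not say whether a value equal to a bound passes.

```python
from typing import Dict, List, Optional, Tuple

# (lower bound, upper bound) copied from the table above.
# None means the table gives no bound on that side ("-").
THRESHOLDS: Dict[str, Tuple[Optional[float], Optional[float]]] = {
    "Genome_Size": (12_000_000, 12_400_000),
    "GC_Content": (45.00, 46.00),
    "Total_Coding_Sequences": (10_300, 12_000),
    "Completeness_Specific": (100.0, None),
    "Contamination": (None, 93.00),
    "no_of_contigs": (None, 110.0),
    "N50": (435_000, None),
}


def failed_metrics(metrics: Dict[str, float]) -> List[str]:
    """Return one message per metric that is missing or outside its bounds.

    Assumption: bounds are inclusive, so a value equal to a bound passes.
    """
    failures: List[str] = []
    for name, (lower, upper) in THRESHOLDS.items():
        value = metrics.get(name)
        if value is None:
            failures.append(f"{name}: missing")
        elif lower is not None and value < lower:
            failures.append(f"{name}: {value} is below {lower}")
        elif upper is not None and value > upper:
            failures.append(f"{name}: {value} is above {upper}")
    return failures


# Hypothetical assembly metrics, for illustration only.
example = {
    "Genome_Size": 12_350_000,
    "GC_Content": 45.1,
    "Total_Coding_Sequences": 10_900,
    "Completeness_Specific": 100.0,
    "Contamination": 1.2,
    "no_of_contigs": 150,
    "N50": 600_000,
}
for message in failed_metrics(example):
    print(message)
```

An empty list means the genome falls inside every suggested bound.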

CDS vs Genome Size

This plot shows the relationship between the number of coding sequences (CDS) and genome size. The relationship should be linear: as genome size increases, the number of coding sequences should also increase. Secondary trend lines or non-linear behaviour indicate either genuinely separate populations within the retained genomes or residual contamination.
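One way to check the expected linear trend numerically is to fit a straight line and look for genomes that sit far from it. The sketch below uses an ordinary least-squares fit and a residual cut-off; both function names and the cut-off `k = 3.0` are assumptions chosen for illustration, not the method used to build this scheme.

```python
from statistics import mean, pstdev
from typing import List, Sequence, Tuple


def fit_line(xs: Sequence[float], ys: Sequence[float]) -> Tuple[float, float]:
    """Ordinary least-squares fit; returns (slope, intercept)."""
    mx, my = mean(xs), mean(ys)
    sxx = sum((x - mx) ** 2 for x in xs)
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    slope = sxy / sxx
    return slope, my - slope * mx


def residual_outliers(
    genome_sizes: Sequence[float], cds_counts: Sequence[float], k: float = 3.0
) -> List[int]:
    """Indices of genomes whose CDS count lies more than k residual
    standard deviations from the fitted line (k is an arbitrary choice)."""
    slope, intercept = fit_line(genome_sizes, cds_counts)
    residuals = [
        y - (slope * x + intercept) for x, y in zip(genome_sizes, cds_counts)
    ]
    spread = pstdev(residuals)
    if spread == 0:
        return []  # every point lies exactly on the line
    return [i for i, r in enumerate(residuals) if abs(r) > k * spread]
```

Genomes flagged this way are candidates for closer inspection. A second cluster of points with its own trend would suggest a separate population rather than isolated outliers, and a single fitted line would not capture that.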

RefSeq distributions

Genome Size (RefSeq)
Histogram (SRA vs RefSeq)

Histogram comparing SRA to RefSeq; each bar shows genome density across value ranges to highlight shifts, peaks, or outliers.

QQ plot (SRA vs RefSeq)

QQ (quantile-quantile) plot comparing SRA and RefSeq. Points along the diagonal follow the expected distribution; deviations indicate skew, outliers, or other systematic differences.
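To make the construction of a QQ plot concrete, the sketch below pairs matching quantiles from two samples; each pair is one point on the plot. The function name `qq_points`, the number of quantiles, and the `inclusive` interpolation method are assumptions for illustration, and the plots on this page may have been produced differently.

```python
from statistics import quantiles
from typing import List, Sequence, Tuple


def qq_points(
    sra_values: Sequence[float], refseq_values: Sequence[float], n: int = 20
) -> List[Tuple[float, float]]:
    """Pair the n - 1 cut points that split each sample into n equal parts.

    Each returned (sra_quantile, refseq_quantile) tuple is one QQ-plot point.
    The two samples may differ in size, but each needs at least two values.
    """
    sra_q = quantiles(sra_values, n=n, method="inclusive")
    refseq_q = quantiles(refseq_values, n=n, method="inclusive")
    return list(zip(sra_q, refseq_q))
```

If the two samples follow the same distribution, the pairs fall close to the diagonal where both coordinates are equal. A constant offset between the coordinates points to a shift in location, and divergence only at the ends points to differences in the tails.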

Table of included RefSeq - Complete genomes

A table of complete RefSeq genomes for Candida auris used to calibrate this scheme. The file includes accessions, some sample information, genome size, GC content, and other key metrics.