|
Introduction

MBioSEQ Ridom Typer implements GAMBIT (Genomic Approximation Method for Bacterial Identification and Tracking; citation), a bioinformatics tool with a curated database that uses genome assembly k‑mer comparisons to rapidly and accurately identify bacterial species. It uses taxon-specific predicted taxonomic rank estimations. GAMBIT does not identify isolates down to the subspecies level and subsumes all Mycobacterium tuberculosis complex species as M. tuberculosis.

GAMBIT works by hashing genome assemblies into targeted k‑mer signatures and comparing them against a reference database. Crucially, it applies species‑specific thresholds to decide whether a match is reliable. Instead of using a single universal cutoff, GAMBIT determines for each species the minimum similarity score (based on shared k‑mers) that must be reached to confidently assign an isolate to that species. This design reduces false positives and accounts for natural genomic diversity across different taxa. As a result, GAMBIT can deliver rapid species identification with an accuracy comparable to average nucleotide identity (ANI) methods, but at a fraction of the computational cost. Furthermore, it is more discriminatory than ANI. Further details can be found in the documentation.

MBioSEQ Ridom Typer uses the Theiagen database v2.0.1 (citation), which is based on GTDB r214.1 (April 28th, 2023), as the reference database for GAMBIT.

If a sample is processed in a pipeline and has a GAMBIT task entry, then the Genus and Species fields are derived from the Predicted Name (see Pipeline Script).

Task Entry Overview

The task entry overview shows the GAMBIT results for the sample.

Result Fields

The task entry stores the following result fields for each sample:
The Result tab of the Sample Overview only shows the field Predicted Name. This field is also written to the Procedure Statistics. When a comparison table is created for a project that contains the GAMBIT Bacterial Species ID Task Template, this field is automatically added to the comparison table, replacing the "Top Species Match" field from Mash. If the task template is explicitly selected when creating a comparison table, all result fields are added to the table.

Run times
|