TY - JOUR
T1 - Genome-based taxonomic classification
AU - Paul, Bobby
AU - Dixit, Gunjan
AU - Murali, Thokur Sreepathy
AU - Satyamoorthy, Kapaettu
AU - Hao, W.
N1 - Publisher Copyright: © 2019 Published by NRC Research Press.
PY - 2019/1/1
Y1 - 2019/1/1
N2 - Bacterial populations are routinely characterized based on microscopic examination, colony formation, and biochemical tests. However, in the recent past, bacterial identification, classification, and nomenclature have been strongly influenced by genome sequence information. Advances in bioinformatics and growth in genome databases have placed genome-based metadata analysis in the hands of researchers who will require taxonomic experience to resolve intricacies. To achieve this, different tools are now available to quantitatively measure genome relatedness within members of the same species, and genome-wide average nucleotide identity (gANI) is one such reliable tool to measure genome similarity. A genome assembly with a gANI score of <95% at the intraspecies level is generally considered indicative of a separate species. In this study, we have analysed 300 whole-genome sequences belonging to 26 different bacterial species available in the NCBI Genome database and calculated their similarity at the intraspecies level based on gANI score. At the intraspecies level, nine bacterial species showed less than 90% gANI and more than 10% of unaligned regions. We suggest the appropriate use of available bioinformatics resources after genome assembly to arrive at the proper bacterial identification, classification, and nomenclature to avoid erroneous species assignments and disparity due to diversity at the intraspecies level.
AB - Bacterial populations are routinely characterized based on microscopic examination, colony formation, and biochemical tests. However, in the recent past, bacterial identification, classification, and nomenclature have been strongly influenced by genome sequence information. Advances in bioinformatics and growth in genome databases have placed genome-based metadata analysis in the hands of researchers who will require taxonomic experience to resolve intricacies. To achieve this, different tools are now available to quantitatively measure genome relatedness within members of the same species, and genome-wide average nucleotide identity (gANI) is one such reliable tool to measure genome similarity. A genome assembly with a gANI score of <95% at the intraspecies level is generally considered indicative of a separate species. In this study, we have analysed 300 whole-genome sequences belonging to 26 different bacterial species available in the NCBI Genome database and calculated their similarity at the intraspecies level based on gANI score. At the intraspecies level, nine bacterial species showed less than 90% gANI and more than 10% of unaligned regions. We suggest the appropriate use of available bioinformatics resources after genome assembly to arrive at the proper bacterial identification, classification, and nomenclature to avoid erroneous species assignments and disparity due to diversity at the intraspecies level.
UR - https://www.scopus.com/pages/publications/85062710926
UR - https://www.scopus.com/inward/citedby.url?scp=85062710926&partnerID=8YFLogxK
U2 - 10.1139/gen-2018-0072
DO - 10.1139/gen-2018-0072
M3 - Article
AN - SCOPUS:85062710926
SN - 0831-2796
VL - 62
SP - 45
EP - 52
JO - Genome
JF - Genome
IS - 2
ER -