TY - JOUR
T1 - Comparative analysis of the diversity of trinucleotide repeats in bacterial genomes
AU - Paul, Bobby
AU - Siddaramappa, Shivakumara
N1 - Publisher Copyright:
© 2024 The Author(s).
PY - 2024/8
Y1 - 2024/8
N2 - The human gut is the most favorable niche for microbial populations, and few studies have explored the possibilities of horizontal gene transfer between host and pathogen. Trinucleotide repeat (TNR) expansion in humans can cause more than 40 neurodegenerative diseases. Further, TNRs are a type of microsatellite that resides on coding regions can contribute to the synthesis of homopolymeric amino acids. Hence, the present study aims to estimate the occurrence and diversity of TNRs in bacterial genomes available in the NCBI Genome database. Genome-wide analyses revealed that several bacterial genomes contain different types of uninterrupted TNRs. It was found that TNRs are abundant in the genomes of Alcaligenes faecalis, Mycoplasma gallisepticum, Mycoplasma genitalium, Sorangium cellulosum, and Thermus thermophilus. Interestingly, the genome of Bacillus thuringiensis strain YBT-1518 contained 169 uninterrupted ATT repeats. The genome of Leclercia adecarboxylata had 46 uninterrupted CAG repeats, which potentially translate into polyglutamine. In some instances, the TNRs were present in genes that potentially encode essential functions. Similar occurrences in human genes are known to cause genetic disorders. Further analysis of the occurrence of TNRs in bacterial genomes is likely to provide a better understanding of mismatch repair, genetic disorders, host-pathogen interaction, and homopolymeric amino acids.
AB - The human gut is the most favorable niche for microbial populations, and few studies have explored the possibilities of horizontal gene transfer between host and pathogen. Trinucleotide repeat (TNR) expansion in humans can cause more than 40 neurodegenerative diseases. Further, TNRs are a type of microsatellite that resides on coding regions can contribute to the synthesis of homopolymeric amino acids. Hence, the present study aims to estimate the occurrence and diversity of TNRs in bacterial genomes available in the NCBI Genome database. Genome-wide analyses revealed that several bacterial genomes contain different types of uninterrupted TNRs. It was found that TNRs are abundant in the genomes of Alcaligenes faecalis, Mycoplasma gallisepticum, Mycoplasma genitalium, Sorangium cellulosum, and Thermus thermophilus. Interestingly, the genome of Bacillus thuringiensis strain YBT-1518 contained 169 uninterrupted ATT repeats. The genome of Leclercia adecarboxylata had 46 uninterrupted CAG repeats, which potentially translate into polyglutamine. In some instances, the TNRs were present in genes that potentially encode essential functions. Similar occurrences in human genes are known to cause genetic disorders. Further analysis of the occurrence of TNRs in bacterial genomes is likely to provide a better understanding of mismatch repair, genetic disorders, host-pathogen interaction, and homopolymeric amino acids.
UR - https://www.scopus.com/pages/publications/85200424231
UR - https://www.scopus.com/pages/publications/85200424231#tab=citedBy
U2 - 10.1139/gen-2023-0097
DO - 10.1139/gen-2023-0097
M3 - Article
C2 - 38593473
AN - SCOPUS:85200424231
SN - 0831-2796
VL - 67
SP - 281
EP - 291
JO - Genome
JF - Genome
IS - 8
ER -