List of some pangenomics tools. After publication, some addresses were inactivated. To obtain unavailable tools, please to contact the corresponding author from the references. More than 40 pangenomic tools are now available in online platforms or for local applications [12].
Abstract
This review summarizes the most important reports about Pseudomonas aeruginosa pangenome. Pan-genomics has tackled some fundamental concerns in pathogenic bacteria. PATRIC and other databases, store more than 9000 P. aeruginosa genomes. This data mining is an opportunity to develop discoveries related to antibiotic resistance, virulence, pathogenicity, fitness, and evolution, among others. Observing the different pangenomes of P. aeruginosa, it is concluded that this species has an open pangenome, and its accessory genome is larger than the central genome. HGT is one important source for P. aeruginosa genome. In recent years various authors developed P. aeruginosa pangenomes, from works with five genomes to more than 1300 genomes. This last work analyzed 54,272 genes, and they found a short and tiny core genome (only 665 genes). Other research with lesser strains or genomes identified a core genome bigger, almost 20% of the pangenome. Nevertheless, the total work proves that the accessory plus unique genome is larger than the core genome in P. aeruginosa.
Keywords
- pangenome
- pan-genome
- pseudomonas aeruginosa
- bacterias
- antibiotic resistome
1. Introduction
Pan-genomics/pangenomics is an innovative tool to explain pan-genome/pangenome construction throughout the species, which is resolved with comparative genomics, among others. Pangenome is divided into two main classes: the core and accessory genomes. Thousands of unknown bacteria and microorganisms are exposed to natural and manufactured antibiotics, toxins, and harmful compounds, to the highest and lowest temperatures, extreme pHs, and other species competitors. Pangenomics is too a powerful approach to identifying those thousands of involved genes. Virulent genes, phenotypes, and environmental expressed genes from horizontal transference (HGT) derive from the core or accessory genome. These latest concepts are a challenge as a new point of view to face
Different authors and web-published pangenomes’
In the pangenome, the core or central genome is the total of genes common to all the examined and analyzed genomes from a genome pool of a given species. Likewise, this represents the genes present in the overall strains from one species. On the other hand, accessory, variable or flexible genome (for some authors, dispensable genome) represents the genes that do not present in all strains of one species [8], http://www.metagenomics.wiki/pdf/definition/pan-genomeb [9, 10]. Those terms are the key in pangenomics to reach a significant pangenome into some species. Moreover, the next sections will see the robustness degree or gene orthology analysis to have a core and accessory genome.
Concomitantly, multiple pangenomic tools were developed and tested over the last ten years. Of course, we can start from genomics annotation if we do not want to use the available databases since classical pangenomics use only annotated genes and orthology analysis. We will mention some of them because it is sure the high amount of pangenomics tools available today. Furthermore, now database strategy, currently developed to perform via a personal server [11], there are now online resources that allow to quickly build own pangenome analysis see Table 1.
Tool | Webpage | Reference |
---|---|---|
Panseq | https://lfz.corefacility.ca/panseq/page/pan.html ; https://github.com/chadlaing/Panseq ; https://help.rc.ufl.edu/doc/Panseq ; https://opensourcelibs.com/lib/panseq | [13] |
PanCGHweb | http://bamics2.cmbi.ru.nl/websoftware/pancgh/pancgh_start.php; https://maayanlab.cloud/datasets2tools/landing/tool/PanCGHweb | [14] |
PGAT | http://nwrce.org/pgat/ ; http://tools.uwgenomics.org/pgat/index.html | [15] |
Spine | http://vfsmspineagent.fsm.northwestern.edu/cgi-bin/spine.cgi http://spineagent.fsm.northwestern.edu/cgi-bin/spine.cgi | [16] |
PGAweb | http://pgaweb.vlcc.cn/analyze ; stopped in English availability, only in Chinese language, http://www.cngrid.org/yhfw/zhsq1/# | [17] |
PanX | https://pangenome.org/ | [7] |
2. What is a pangenome?
The pangenomic technique requires at least two or multiple genomes of bacteria, archaea, fungi, or any eukaryote. This tool can provide the broadest resolution of genetic variation, among them: pathogenicity islands, virulence genes, mobile elements, transposons, horizontal gene transfer, pathogenicity islands, orthologous shared and syntheny, plasticity, evolution, and others. The development of pangenomics has promoted advances in many fields, like bioinformatics and computational biology, comparative genomics, molecular medicine, molecular epidemiology, agronomy and foods, and many more [18]. In the beginning, one advantage of pangenomics was that experimental data have shown for some species that new genes are being discovered even after sequencing several strain genomes [19]. Given that, the number of unique genes is vast, the pangenome of a bacterial species might be orders of magnitude larger than any single genome, as predicted by [20], more than 14 years ago.
The connection between the core and accessory genome put together the close and open pangenome inside the species. Open pangenome increases when new genomes are added, contrary to a close when a new genome added does not increase the size of the pangenome [10]. Table 2 summarizes the terms applied to pangenome and pangenomics. It is worth mentioning that the term coined “unique genome”, is related to solitary genes not shared among strains.
Terminology | Description |
---|---|
Pangenome or Pan-genome | It is the gene collection from a genomes group of species or strain group. Also, the complete gene set of all strains of a species, including genes present in all strains (core genome) plus genes present only in some strains of a species (variable or accessory genome). |
Core or Central genome | It is the maximum of genes common to all the examined genomes of a given species. Likewise, represent the genes present in the overall strains from one species. |
Accessory or Variable genome | The variable or accessory genome (flexible, dispensable genome) refers to genes not available in all strains of a species. These include genes present in two or more strains, or even unique genes from a single strain (for some authors). For example, genes for specific strain adaptation such as antibiotic resistance. Not uniques, but not in the core genome. |
Unique genome | A set of genes present in only one strain, that is never shared with other strains. Singleton is the name assigned to this kind of gene. Some authors classify singletons as accessory genomes others prefer to differentiate them. |
Open pangenome | It is known as the pangenome increasing when a new genome is added to the pangenome. The number of genes of the pangenome increases with the number of additionally sequenced strains. |
Closed pangenome | Finished pangenome is when there is no change when new genomes are added, after some sequenced strains, additional strains do not provide new genes to the species pangenome. |
3. From pangenomics to pangenome
The term pan-genome, currently written today as pangenome was first used by Sigaux [21] in cancer research to describe a public database containing an assessment of genome and transcriptome alterations in types of tumors, tissues and experimental models. Later, Tettelin et al. [22], using bacterial genomes, defined a microbial pangenome as the combination of a core genome, carrying genes present in overall strains. And a dispensable genome (also described as a flexible or accessory genome) is composed of genes absent from one or more strains [23]. A generalization of such representation could contain not only the genes (transposons, promotors, other mobile elements such as HGT, microRNAs, etc., but also other variations present in the collection of genomes.
The study of bacterial pangenome has many applications in clinical microbiology, also to study resistant genes or HGTs, virulence, and pathogenicity, and finally to classify species and to know its evolutions and clonal dispersion inside de environments see Figure 1. The pangenomics offer a wealth of information about human-associated bacterial species (12).
4. Could pangenome redefine bacteria species?
In his excellent chapter of Bobay [24], about the prokaryotic species concept, supported by the pangenome study, takes into account the approaches from Dobzhansky [25] and Mayr [26]. Speciation mechanism is involved in microorganisms, directly or indirectly, since the sustained interruption of gene flow between populations does not engage in sexual reproduction
Pangenomics is a fundamental tool for studying the entire repertoire of gene families in the genomes of pathogenic bacterial clades such as
The application of pangenomics in
5. Core and accessory genome of P. aeruginosa
As previously mentioned, different
It will most likely be necessary to adjust the core and accessory genome measurements for
6. The open pangenome of P. aeruginosa pathogen
As we defined in Table 2, an open pangenome increases when a new genome is added to the pangenome. In other words, the open pangenome is contrary to the closed pangenome, so there are changes when new genomes are added, and new strains provide new genes to the species pangenome.
The
7. The size of P. aeruginosa pangenome
There is a large variability in the size
Now, thinking about again the reported pangenomes: Sharma et al. [3] 5 genomes, Fischer et al. [5] 100 genomes, Ding et al. [7] 153 genomes, Mosquera-Rendón et al. [4] 181 genomes, and Freschi et al. [6] used 1311 genomes. It is clear that Freschi et al.’s study has the maximum number of samples and likely the best pangenome results for the
Taking together the data extracted from Ding et al. [7], shown in Figure 2 for
8. The pangenome of MDR P. aeruginosa
From 2006 multidrug-resistant (MDR
The growth of MDR
As was mentioned previously, Freschi et al. [6], who developed the biggest pangenome of
9. Antibiotic resistome
The antibiotic resistome of some organisms is the set of antibiotic-resistant genes used to resist the high concentration of some antibiotics [46]. Perhaps, the
The opened pangenome of
10. The HGT in P. aeruginosa pangenome
In pangenomes of Freschi et al. [6] they study the role of HGTs in the evolution of the
11. The future of pangenomics for P. aeruginosa
In conclusion and perspective, we can state that pangenomics had been an excellent and useful tool to decipher the
Acknowledgments
We thank the University of Antioquia for their financial support to the project CODI 2017-15753.
Conflicts of interest
The authors declare that they have no conflicts of interest regarding the publication of this paper
References
- 1.
Tettelin H, Medini D. The Pangenome: Diversity, Dynamics and Evolution of Genomes. Springer; 2020. pp. 307 - 2.
Barh D, Soares SC, Tiwari S, Azevedo V. Pan-genomics: Applications, Challenges, and Future Prospects. Elsevier; 2020. pp. 474 - 3.
Sharma A, Sangwan N, Negi V, Kohli P, Khurana JP, Rao DLN, et al. Pan-genome dynamics of Pseudomonas gene complements enriched across hexachlorocyclohexane dumpsite. BMC Genomics. 2015;16 (1):1-19 - 4.
Mosquera-Rendón J, Rada-Bravo AM, Cárdenas-Brito S, Corredor M, Restrepo-Pineda E, Benítez-Páez A. Pangenome-wide and molecular evolution analyses of the Pseudomonas aeruginosa species. BMC Genomics. 2016; 17 (1):1-14 - 5.
Fischer S, Klockgether J, Morán Losada P, Chouvarine P, Cramer N, Davenport CF, et al. Intraclonal genome diversity of the major Pseudomonas aeruginosa clones C and PA 14. Environmental Microbiology Reports. 2016;8 (2):227-234 - 6.
Freschi L, Vincent AT, Jeukens J, Emond-Rheault JG, Kukavica-Ibrulj I, Dupont MJ, et al. The Pseudomonas aeruginosa pan-genome provides new insights on its population structure, horizontal gene transfer, and pathogenicity. Genome Biology and Evolution. 2019;11 (1):109-120 - 7.
Ding W, Baumdicker F, Neher RA. panX: Pan-genome analysis and exploration. Nucleic Acids Research. 2018; 46 (1):e5-e5 - 8.
Rouli L, Merhej V, Fournier PE, Raoult D. The bacterial pangenome as a new tool for analysing pathogenic bacteria. New Microbes and New Infections. 2015; 7 :72-85 - 9.
Paralanov V, Lu J, Duffy LB, Crabb DM, Shrivastava S, Methé BA, et al. Comparative genome analysis of 19 Ureaplasma urealyticum and Ureaplasma parvumstrains. BMC Microbiology. 2012; 12 (1):1-20 - 10.
Corredor M, Muñoz-Gómez A. Pan-genomics of multidrug-resistant human pathogenic bacteria and their resistome. Pan-genomics: Applications, Challenges, and Future Prospects. 2020; 2020 :203-235 - 11.
Xiao J, Zhang Z, Wu J, Yu J. A brief review of software tools for pangenomics. Genomics, Proteomics & Bioinformatics. 2015; 13 (1):73-76 - 12.
Anani H, Zgheib R, Hasni I, Raoult D, Fournier PE. Interest of bacterial pangenome analyses in clinical microbiology. Microbial Pathogenesis. 2020; 149 :104275 - 13.
Laing C, Buchanan C, Taboada EN, Zhang Y, Kropinski A, Villegas A, et al. Pan-genome sequence analysis using Panseq: An online tool for the rapid analysis of core and accessory genomic regions. BMC Bioinformatics. 2010; 11 (1):1-14 - 14.
Bayjanov JR, Siezen RJ, van Hijum SA. PanCGHweb: A web tool for genotype calling in pangenome CGH data. Bioinformatics. 2010; 26 (9):1256-1257 - 15.
Brittnacher MJ, Fong C, Hayden HS, Jacobs MA, Radey M, Rohmer L. PGAT: A multistrain analysis resource for microbial genomes. Bioinformatics. 2011; 27 (17):2429-2430 - 16.
Ozer EA, Allen JP, Hauser AR. Characterization of the core and accessory genomes of Pseudomonas aeruginosa using bioinformatic tools Spine and AGEnt. BMC Genomics. 2014; 15 (1):1-17 - 17.
Chen X, Zhang Y, Zhang Z, Zhao Y, Sun C, Yang M, et al. PGAweb: A web server for bacterial pan-genome analysis. Frontiers in Microbiology. 2018; 9 :1910 - 18.
Jaiswal AK, Tiwari S, Tavares GC, da Silva WM, de Castro Oliveira L, Ibraim IC, et al. Pan-omics Focused to Crick’s central dogma. In: Pan-genomics: Applications, Challenges, and Future Prospects. Editors: Barh D, Soares SC, Tiwari S, Azevedo V; 2020. pp. 1-4. Elsevier; pp. 472 - 19.
Medini D, Donati C, Tettelin H, Masignani V, Rappuoli R. The microbial pan-genome. Current Opinion in Genetics & Development. 2005; 15 (6):589-594 - 20.
Tettelin H, Riley D, Cattuto C, Medini D. Comparative genomics: The bacterial pan-genome. Current Opinion in Microbiology. 2008; 11 (5):472-477 - 21.
Sigaux F. Cancer genome or the development of molecular portraits of tumors. Bulletin de L'academie Nationale de Medecine. 2000; 184 (7):1441-1447 - 22.
Tettelin H, Masignani V, Cieslewicz MJ, Donati C, Medini D, Ward NL, et al. Genome analysis of multiple pathogenic isolates of Streptococcus agalactiae: Implications for the microbial “pan-genome”. Proceedings of the National Academy of Sciences. 2005; 102 (39):13950-13955 - 23.
Computational pan-genomics: Status, promises and challenges. Briefings in Bioinformatics. 2018; 19 (1):118-135 - 24.
Bobay LM. The prokaryotic species concept and challenges. The Pangenome. 2020:21-49 - 25.
Dobzhansky T. A critique of the species concept in biology. Philosophy of Science. 1935; 2 (3):344-355 - 26.
Mayr E. Systematics and the Origin of Species. New York: Columbia University Press; 1942, 334 - 27.
Hilker R, Munder A, Klockgether J, Losada PM, Tümmler B. Interclonal gradient of virulence in the Pseudomonas aeruginosa pangenome from disease and environment. Environmental Microbiology. 2014; 17 :29-46. DOI: 10.1111/1462-2920.12606 - 28.
Wang C, Ye Q, Jiang A, Zhang J, Shang Y, Li F, et al. Pseudomonas aeruginosa detection using conventional PCR and quantitative real-time PCR based on species-specific novel gene targets identified by pangenome analysis. Frontiers in Microbiology. 2022:13 - 29.
Baloyi IT, Adeosun IJ, Yusuf AA, Cosa S. In silico and in vitro screening of antipathogenic properties of Melianthus comosus (Vahl) against Pseudomonas aeruginosa. Antibiotics. 2021; 10 :1-23. DOI: 10.3390/antibiotics10060679 - 30.
Hilker R, Munder A, Klockgether J, Losada PM, Chouvarine P, Cramer N, et al. Interclonal gradient of virulence in the P seudomonas aeruginosa pangenome from disease and environment. Environmental Microbiology. 2015; 17 (1):29-46 - 31.
Garrity GM, Bell JA, Lilburn T. Pseudomonadales Orla-Jensen 1921, 270 AL. In: Bergey’s Manual® of Systematic Bacteriology. Boston, MA: Springer; 2005. pp. 323-442 - 32.
Yahr TL, Parsek MR. Pseudomonas aeruginosa. In: The Prokaryotes: Vol. 6: Proteobacteria: Gamma Subclass. Editor: Martin Dworking. Springer; 2006. pp. 704-713 - 33.
Woese CR, Fox GE. Phylogenetic structure of the prokaryotic domain: The primary kingdoms. Proceedings of the National Academy of Sciences of the United States of America. 1977; 74 (11):5088-5090 - 34.
Woese CR, Gutell R, Gupta RAMESH, Noller HF. Detailed analysis of the higher-order structure of 16S-like ribosomal ribonucleic acids. Microbiological Reviews. 1983; 47 (4):621 - 35.
Woese C, Kandler O, Wheelis M. Towards a natural system of organisms: Proposal for the domains Archaea, Bacteria, and Eucarya. Proceedings of the National Academy Science USA. 1990; 87 (12) - 36.
Komatsoulis GA, Waterman MS. A new computational method for detection of chimeric 16S rRNA artifacts generated by PCR amplification from mixed bacterial populations. Applied and Environmental Microbiology. 1997; 63 (6):2338-2346 - 37.
Winand R, Bogaerts B, Hoffman S, Lefevre L, Delvoye M, Van Braekel J, et al. Targeting the 16s rRNA gene for bacterial identification in complex mixed samples: Comparative evaluation of second (illumina) and third (oxford nanopore technologies) generation sequencing technologies. International Journal of Molecular Sciences. 2019; 21 (1):298 - 38.
Steven B, Hesse C, Soghigian J, Gallegos-Graves LV, Dunbar J. Simulated rRNA/DNA ratios show potential to misclassify active populations as dormant. Applied and Environmental Microbiology. 2017; 83 (11):e00696-e00617 - 39.
Ozer EA, Nnah E, Didelot X, Whitaker RJ, Hauser AR. The population structure of Pseudomonas aeruginosa is characterized by genetic isolation of exoU+ and exoS+ lineages. Genome Biology and Evolution. 2019; 11 (7):1780-1796 - 40.
Georgiades K, Raoult D. Defining pathogenic bacterial species in the genomic era. Frontiers in Microbiology. 2011; 1 :151 - 41.
Diene SM, Merhej V, Henry M, El Filali A, Roux V, Robert C, et al. The rhizome of the multidrug-resistant Enterobacter aerogenes genome reveals how new “killer bugs” are created because of a sympatric lifestyle. Molecular Biology and Evolution. 2013; 30 (2):369-383 - 42.
Klockgether J, Cramer N, Wiehlmann L, Davenport CF, Tümmler B. Pseudomonas aeruginosa genomic structure and diversity. Frontiers in Microbiology. 2011; 2 :150 - 43.
Botelho J, Tüffers L, Fuss J, Buchholz F, Utpatel C, Klockgether J, Schulenburg H. Phylogroup-specific variation shapes pangenome dynamics in Pseudomonas aeruginosa . BioRxiv; 2022. pp. 1-33 - 44.
Falagas ME, Koletsi PK, Bliziotis IA. The diversity of definitions of multidrug-resistant (MDR) and pandrug-resistant (PDR) Acinetobacter baumannii and Pseudomonas aeruginosa. Journal of Medical Microbiology. 2006; 55 (12):1619-1629 - 45.
Oliver A, Mulet X, López-Causapé C, Juan C. The increasing threat of Pseudomonas aeruginosa high-risk clones. Drug Resistance Updates. 2015; 21 :41-59 - 46.
Wright G. The antibiotic resitome: The nexus of chemical and genetic diversity. Nature Reviews. Microbiology. 2007; 5 :175-186 - 47.
D’Acosta D et al. Samplig the antibiotic resitome. Science. 2006; 311 :374-377 - 48.
Gillings MR. Evolutionary consequences of antibiotic use for the resistome, mobilome and microbial pangenome. Frontiers in Microbiology. 2013; 4 :4 - 49.
Jaillard M, van Belkum A, Cady KC, Creely D, Shortridge D, Blanc B, et al. Correlation between phenotypic antibiotic susceptibility and the resistome in Pseudomonas aeruginosa. International Journal of Antimicrobial Agents. 2017; 50 (2):210-218