Open access peer-reviewed chapter

Phylogeny of Three Palmwine Yeasts Genera

By Ogueri Nwaiwu

Submitted: March 1st 2018Reviewed: July 5th 2018Published: November 5th 2018

DOI: 10.5772/intechopen.79958

Downloaded: 594


Sequences from three palm wine yeast genera namely Saccharomyces cerevisiae, Pichia kudriavzevii, and Candida ethanolica were analyzed to establish their phylogenetic relationships, geographical origin, and food matrix source of their close relatives. Up to 600 sequences present in yeasts representing close relatives of palm wine yeasts were examined. Pyhlogenetic trees constructed showed polyphyletic relationships in C. ethanolica whereas close relatives of S. cerevisiae and P. kudriavzevii showed little divergence. Sequence data for both Elaeis sp. and Raphia sp. palm trees showed that highest number of palm wine yeasts relatives sequence submissions to the Genbank were from China and beverages were mainly the sources of close relatives of S. cerevisiae and P. kudriavzevii whereas C. ethanolica closest relatives were from various non-food sources. Overall relatives of palm wine yeasts were not specific to any particular food or fermentation mix. The guanine-cytosine (G+C) content in P. kudriavzevii (57–58%) and C. ethanolica (56–57%) was higher than that of S. cerevisiae (47.3–51%). This suggests that the P. kudriavzevii and C. ethanolica have a higher recombination rate than S. cerevisiae strains analyzed. The data may help to understand palm wine yeast conservation and the diverse food matrixes and geographical origins where their close relatives exist.


  • yeasts
  • phylogeny
  • Saccharomyces cerevisiae
  • Pichia kudriavzevii
  • Candida ethanolica

1. Introduction

Palm wine is a traditional drink consumed mainly in sub-Saharan Africa, parts of Asia, and South America. It is obtained from fermentation of saps of different palm trees. Palm wine is sourced from palm trees and they grow throughout tropical and subtropical regions with just a few species found in temperate regions possibly due to freeze intolerance of seedlings [1]. The method of obtaining the drink by tapping has been described in many reports [2] and the palm sap varies according to palm trees found in different geographical location. Yeasts are the main organisms implicated in the fermentation of the drink and they exist as natural flora on palm trees. Irrespective of the palm tree source, a common feature of the drink is that it goes sour within 24 h unless it is subjected to cold storage. The two trees from which palm wine is mostly tapped in Nigeria are Raphia hookeriand Elaeis guineensis. There is a debate on the possible origin or source of these palm trees. The tree Raphia hookeriis known as the wine palm and is the most widespread familiar Raphia palm in fresh water swamps of west and central Africa [3]. Many local varieties exist in the tropical rain forest of Nigeria and it is also grown in India, Malaysia, and Singapore [4]. The E. guineensisoil palm variety is more widely found around the world. A report pointed out that E. guineensispalm tree originated in the tropical rain forest region of West Africa and can be found in Cameroon, Côte d’Ivoire, Ghana, Liberia, Nigeria, Sierra Leone, Togo Angola, and the Congo [5]. It is believed in the report that during the fourteenth to seventeenth centuries, some palm fruits were taken to the Americas and from there to the Far East where it thrived. Yeast are known to reflect human history [6] hence it is possible the yeast strains found in palm wine were introduced to new regions via the plant materials introduced in those locations.

Although it is known that yeasts have been used for food and beverage fermentations [7] hundreds of years ago and domestication is believed to have been initiated before the discovery of microbes [8], the extent of genetic diversity is still under study around the world. Recent reports have shown that non-Saccharomycesyeasts have different oenological properties to those of S. cerevisiae[9]. Other reports emphasize that even though biochemical and genomic studies of S. cerevisiaehave helped our understanding of yeasts, the other lesser known yeast species have not been fully exploited [10]. More understanding of S. cerevisiaeand non-S. cerevisiaeyeasts in palm wine is needed [11] in order to get more information on the capabilities of yeasts present in the drink or to probe for novel species [12]. To generate more information, molecular characterization has been used by many investigators and this has led to proper identification of new yeast strains in the drink. The diversity of yeasts from palm wine has not had much in-depth investigation and reports that show evolutionary trees which are the basic structures necessary to establish the relationships among organisms [13] are few in literature. This chapter examines evolutionary relationships of palm wine yeasts and their close relatives based on 26S rRNA sequence data and aims to shed more light on the diversity of yeasts found in the drink.


2. Methodology

2.1. Ribosomal ribonucleic acid genes partial sequence data

In a previous study [2], partial 26S rRNA gene sequences from 18 palm wine yeast isolates were deposited under accession numbers (HG452325-42). The sequences from three yeasts genera identified in that study namely S. cerevisiae, P. kudriavzevii, and C. ethanolicafrom Elaeissp. and Raphiasp. palm trees were selected and used to carry out new updated searches in this report. For Elaeissp., the sequence accession numbers used were HG425336, HG425328, and HG425333 whereas HG425332, HG425338, and HG425335 were used for the Raphiasp. palm tree. The current versions of the selected six sequences mentioned above were used separately for an updated search in the Genbank database. The searches were optimized for highly similar sequences and the first 100 sequences from relatives of each yeast species with the highest percent identity were marked to make a shortlist of up to 600 sequences. These sequences were examined for the features listed at the time of submission after which the countries of origin and sources were noted. Sources were classified as beverage, food, or non-food sources.

2.2. Construction of phylogenetic trees

Phylogenetic trees were constructed from the shortlisted sequences by using the molecular evolutionary genetic analysis (MEGA, version 7) computer software [14]. The software allowed a seamless transfer of the sequences from Genbank. Using the multiple sequence comparison by log expectation (MUSCLE) reported by Edgar [15], multiple sequence alignments (MSA) were constructed with the software. The evolutionary history was inferred by using the maximum likelihood method based on the Tamura-Nei model [16]. The tree with the highest log likelihood was chosen. Initial trees for the heuristic search were obtained using the maximum composite likelihood approach. Trees were drawn to scale, with branch lengths measured in the number of substitutions per site. All positions containing gaps and missing data were eliminated. The nucleic acid composition of the sequences was calculated automatically by switching to the nucleic acids estimation mode of the software after which the G+C content of the sequences were calculated manually from the arginine, guanine, cytosine, and thiamine percentage distribution displayed. The MAS tool MUSCLE used assumes an equality of substitution rates among sites and takes into account differences in transitional, transversional rates, and G+C-content bias [17]. For brevity, only 20 sequences from the initial 100 relatives obtained are shown in the trees with the reference sequence.

The complete list of 600 sequences analyzed showing sources and countries of origin is available in the public repository figshare [18].

3. Results and discussion

3.1. Evolutionary relationships of palm wine yeasts and their relatives

Yeasts facilitate several industrial food fermentation processes, which often consist of a desired specific strain [19]. This may be why domestication is believed to be the main driver of specific yeast prevalence in a geographical location. The understanding of the ecological basis of yeast diversity in nature remains fragmented and cross-kingdom competition has been proposed as a method to generate industrially useful yeast strains with new metabolic traits [20]. Palm wine yeasts are yet to enjoy significant diversity study hence a look at their relatives will enable more information to be generated.

In the last decade, there has been increase in submissions of palm wine yeast sequences based on 26S rRNA genes mainly due to quality checks by academic journals. The identification of new strains is accompanied by performing a search with the basic local alignment search tool [21] followed by submission of DNA sequences to the GenBank. According to Benson et al. [22], GenBank is a comprehensive database that contains publicly available nucleotide sequences for up to 370,000 formally described species. It is common knowledge that these submissions which contain a lot of information are generated mainly through submissions from investigators around the world. Each sequence data received is curated by the GenBank annotation staff to ensure that it is free from errors after which accession numbers are assigned.

All the sequences used in this study were the first versions submitted by investigators. The maximum likelihood method was preferred for the trees constructed because it is computationally intense and all possible trees are considered. Also the method can be useful for widely divergent groups or other difficult situations [23].

3.2. Candida ethanolica

The yeast C. ethanolicais not widely reported in palm wine. It has been reported as a non-conventional yeast which may present massive resource of yeast biodiversity for industrial applications because it has been found to be adapted to some of the stress factors present in harsh environmental [24]. In that report, it was found that C. ethanolicatolerated up to 7% v/v ethanol. This could be useful information for new palm wine drink development especially now that there is increasing interest in non-Saccharomycesyeasts with peculiar features able to replace or accompany S. cerevisiaeduring specific industrial fermentations [25].

The C. ethanolicastrain from Raphiasp. (Figure 1) and Elaeissp. (Figure 2) palm wine showed close relationships with other Candidaspecies. The relatives of Raphiasp. palm wine that emanated from the same node (Figure 1) came from diverse sources. The flanking close relatives (KY283163 and DQ466540) of C. ethanolica(HG425332) were isolated from composite microbial powders for aquaculture in China [26] and composite cocoa fermentation in Ghana [27]. Other close relatives included species from the genus Pichia.The P. deserticolastrain (KM005182) from the same node as the reference strain was from aerobic deterioration of total mixed ration silage in China [28]. For Elaeissp. (Figure 2) palm wine, close relatives to C. ethanolica(HG425336) strain were from a laboratory culture collection with unidentified source [29] and a tannin tolerant yeasts associated with naturally fermented Miangleaves in Thailand [30]. A close P. deserticolastrain of unstated source in GenBank was from a large characterization study [31].

Figure 1.

Phylogenetic analysis ofCandida ethanolica(HG425332-underlined) fromRaphiasp. palm wine. The tree is drawn to scale, with branch lengths measured in the number of substitutions per site.

Figure 2.

Phylogenetic analysis ofCandida ethanolica(HG425336-underlined) fromElaeissp. palm wine. The tree is drawn to scale, with branch lengths measured in the number of substitutions per site.

In both Elaeissp. and Raphiasp. palm wine, several monophyletic groups were formed with other Pichiaspecies namely P. deserticola, P. Manshuricaand P. galeiformiswhich indicate polyphyletic relationships. The polyphyletic nature of Pichiahas been demonstrated by Kurtzman and Robnett [29] in the analysis of gene sequences that included all known ascomycetous yeasts. Apart from possible similar conserved regions, previous nomenclature at the time of submission of the sequences may also be the reason why Pichiaspecies of different genus were observed as close relatives of C. ethanolicafrom Elaeissp. and Raphiasp. palm trees.

It has been reported that ascomycetic fungi submitted to the database previously have been assigned names based on their life stages [32, 33]. For example, it was shown that the name for the fungi Candida kruseiis based on the anamorphic stage whereas its telemorph stage name is Pichia kudriavzevii. It also has an older name Issatchenkia orientalis. The whole Candidaspecies consists of up to 850 organisms, which can be distantly related [34]. Hence in order to avoid the confusion, the International Botanical Congress in Melbourne in July 2011, made a change in the international code of nomenclature for fungi and adopted the principle of one fungus can only have one name and ended the system of permitting separate names to be used for anamorphs [35]. The report emphasized that this validated all legitimate names proposed for a species, regardless of what stage they were typed and can serve as the correct name for that species.

3.3. Sachharomyces cerevisiae

The yeast S. cerevisiaeis generally known to be the most used microorganism in the food and drink manufacturing sector. The organism is the dominant yeast species isolated from many studies on palm wine. However, it is unclear whether S. cerevisiaeas a species occurs naturally or exists solely as a domesticated species [36]. S. cerevisiaestrains are genetically diverse, largely as a result of human efforts to develop strains specifically adapted to various fermentation processes. These adaptive pressures from various ecological niches may generate behavioral differences among these strains [37]. In a review [8], it was suggested that domestication in Saccharomyces, is most pronounced in beer strains, because they live in their industrial niche always and allow only limited genetic admixture with wild stocks and minimal contact with natural environments. Due to this restriction, it was pointed out that beer yeast genomes show complex patterns of domestication and divergence, making both ale (S. cerevisiae) and lager (S. pastorianus) strains ideal models to study domestication.

The relatives of palm wine S. cerevisiaewas not distributed among many species or different genus observed for Candida species. Two nodes were observed for the S. cerevisiaetrees constructed for Elaeissp. (Figure 3) and Raphiasp. (Figure 4). The yeast strain isolated from Elaeissp. (Figure 3) was in a different branch from most of its relative whereas it was vice versa for the palm wine yeast from Raphiasp. (Figure 4) palm wine. As observed for Candida species, isolation of S. cerevisiaespecies was from different sources. The close relatives flanking the palm wine strain from Elaeissp. palm wine (HG425328, Figure 3) with accession numbers KU862639 and MF966566 were isolated from grape surface [38] and pear sough dough [39] whereas the close relatives of Raphiasp. palm wine (HG425338, Figure 4) with accession numbers GU080046 and HM191669 were isolated from must of spontaneous fermentation [40] and grape juice used to brew Musalais,a beverage made from compressed grapes [41].

Figure 3.

Phylogenetic analysis ofS. cerevisiae(HG425328-underlined) fromElaeissp. palm wine. The tree is drawn to scale, with branch lengths measured in the number of substitutions per site.

Figure 4.

Phylogenetic analysis ofS. cerevisiae(HG425338-underlined) fromRaphiasp. palm wine. The tree is drawn to scale, with branch lengths measured in the number of substitutions per site.

It is believed that 99% of yeasts is still unknown [42], and S. cerevisiaefermentation could be specific to a particular substrate, hence more studies of S. cerevisiaefrom different palm trees will be beneficial. The genus Saccharomyceswas previously divided into two groups namely Saccharomyces sensu strictoand Saccharomyces sensu latoand the sensu stricto strains are mostly associated with the fermentation industry [43]. The S. cerevisiaein this study are sensu stricto. Comparative genomics analysis of S. cerevisiaeand closely related species has contributed to our understanding of how new species emerge and has shed light on various mechanisms that contribute to reproductive isolation [44]. This knowledge can be applied to palm wine yeasts to ascertain how they differ from well characterized yeasts.

3.4. Pichia kudriavzevii

From recent molecular studies of yeasts present in palm wine, the yeast species Pichia kudriavzeviihas emerged as a prevalent non-Saccharomycesyeast species in the drink. The genus has shown probiotic potentials [45] and multistress-tolerance [46]. It is worth looking closely at this genus because it has been shown that some P. kudriavzeviistrains can produce higher quantities of ethanol from lignocellulosic biomass than conventional cells of S. cerevisiaeat 45°C [47].

The tree constructed for P. kudriavzeviishowed the least divergence when compared to S. cerevisiaeor Candidapalm wine yeast relatives. All the relatives and the Elaeissp. palm wine strain (HG425333) originated from one node and formed separate taxonomic units (Figure 5). In contrast, the P. kudriavzevii(HG425335) from Raphiasp. palm wine formed a separate clade and did not lie on the same branch with the relatives (Figure 6). This indicates intraspecies diversity and confirms findings reported previously [11]. In that study, intraspecies diversity was suggested because P. kudriavzevii(HG425335) from Raphiasp. palm wine formed a separate clade with palm wine isolates from Mexico instead of isolates from the same geographical location.

Figure 5.

Phylogenetic analysis ofP. kudriavzevii(HG425333-underlined) fromElaeissp. palm wine. The tree is drawn to scale, with branch lengths measured in the number of substitutions per site.

Figure 6.

Phylogenetic analysis ofP. kudriavzevii(HG425335-underlined) fromRaphiasp. palm wine. The tree is drawn to scale, with branch lengths measured in the number of substitutions per site.

The information contained in the sequence submission of close relatives of P. kudriavzeviistrains also shows different sources of isolation. The strains close to the yeast from Elaeissp. palm wine (HG425333, Figure 5) with accession numbers KY283159 and KM234455 show that isolation was from composite microbial powders for aquaculture [21] and naturally fermented cashew apple juice [48] whereas a close relative of Raphiasp. palm wine (HG425335, Figure 6) with accession number KU167717 was isolated from activated sludge from textile dyeing [49].

3.5. Geographical origin and sources of palm wine yeast relatives

After ascertaining the sources of very close relatives from the phylogenetic trees constructed, the shortlisted 600 sequences from the aforementioned yeast genera were further examined and the information found was used to group the isolates according to country of isolation, food, beverage, and non-edible source.

3.5.1. Isolates submitted by country of origin and source

Overall, sequences examined for the aforementioned yeasts genera were submitted from 38 countries [18] and the top 6 countries is presented in this report. Sequence data for both Elaeissp. (Figure 7) and Raphiasp. (Figure 8) palm trees show that highest number of submissions to the Genbank database was from China. The top three countries from which palm wine yeast relatives originated were the same for both palm tree species. This suggests that a large number of palm wine yeasts may have common ancestors with yeasts found in China. The origins or sources of palm wine yeasts relatives were spread across beverages, food, and non-food sources. The prevalence of S. cerevisiae, P. kudriavzevii, and C. ethanolicafrom these sources is shown for Elaeissp. palm tree (Figure 9) and Raphiasp. palm tree (Figure 10). In both palm wine from Elaeisand Raphiapalm trees, yeasts relatives of S. cerevisiaeand P. kudriavzeviispecies were isolated mainly from beverage sources whereas relatives representing C. ethanolicaspecies were isolated from non-food sources. The sources of isolation revealed that the closest relatives of palm wine yeasts were from various sources and not specific to any particular food or fermentation mix.

Figure 7.

Top six countries from which sequences of palm wine yeast relatives ofElaeissp. palm tree were submitted to the GenBank.

Figure 8.

Top six countries from which sequences of palm wine yeast relatives ofRaphiasp. palm tree were submitted to the GenBank.

Figure 9.

Distribution of palm wine yeast relatives with reference to yeasts fromElaeissp. palm wine according to beverage (), food (∎), and non-food (⊞) sources.

Figure 10.

Distribution of palm wine yeast relatives with reference to yeasts fromRaphiasp. palm wine according to beverage (), food (∎), and non-food (⊞) sources.

A report [50] found that laboratory estimates of optimum growth temperature could be used to predict global distributions of free-living microbes. Also, it was pointed out that population genetic analyses show that the genetic diversity of S. cerevisiaeis high in the tropics and subtropics of China [51, 52]. It was suggested that without further sampling in tropical and subtropical regions, it is not possible to differentiate whether the higher diversity of S. cerevisiaein Asia reflects a greater habitat area or an Asian origin for S. cerevisiae. It would be beneficial to carry out further studies in order to establish if palm wine yeasts were taken from Africa to Asia or vice versa. The diversity could also be high in temperate regions because a study examined S. cerevisiaeand S. paradoxusin northeast America and uncovered a large diversity of yeasts [53]. Up to 24 yeast isolates could not be assigned to any known species and it was suggested that the yeasts identified may be of taxonomic, medical, or biotechnological importance.

3.6. G+C composition of palm wine yeast relatives

The G+C composition is a well known evolutionary property of eukaryotes, archaea, and bacteria. There are suggestions by Chen et al. [54], that concordance between proteomic architecture and the genetic code is related closely to genomic G+C content and phylogeny. It has been suggested that yeasts with higher G+C content have a higher recombination rate [55] and recombination is believed to be suppressed around centromeres [56]. The data in Table 1 present the average nucleotide composition and G+C content of partial sequences of 26S rRNA genes analyzed. It shows concentration of arginine, guanine and thiamine, and cytosine concentration in S. cerevisiae, P. kudriavzevii, or C. ethanolicaobtained from the aforementioned palm trees. Data were obtained after measuring nucleotide frequencies (%) in 100 sequences of strains relative to each palm wine yeast species listed. It was observed that the G+C content in P. kudriavzeviiand C. ethanolicawas higher than that of S. cerevisiae. This suggests that the P. kudriavzeviiand C. ethanolicahave a higher recombination rate than S. cerevisiaestrains analyzed in this report. The G+C range observed is within the reported average genomic G+C-content range (13–75%) among species [57]. It was also found to be within range of G+C content (38.3–52.9%) of the MATlocus reported [58] in different Saccharomycetaceaespecies.

Yeast speciesT/UCAGG+C
1. S. cerevisiae-R26.316.626.530.747.3
2. S. cerevisiae-E26.720.222.730.451.0
3. P. kudriavzevii-R20.021.922.635.557.0
4. P. kudriavzevii-E19.822.222.635.558.0
5. C. ethanolica-R21.121.421.835.756.0
6. C. ethanolica-E20.921.421.935.857.0

Table 1.

Average nucleotide composition and G+C content obtained from yeasts from Raphiasp. (R) or Elaeissp. (E) palm wine and their relatives after measuring nucleotide frequencies (%) in 100 sequences relative to each yeast species shown.

Nucleotide concentration was obtained after analysis with MEGA 7.0 software.

T/U, thiamine/uracil; C, cytosine; A, arginine; G, guanine.

Further studies are required because G+C-content is associated with multiple biases of different nature during down stream operations and these biases may include sequencing technologies, biological, and methodological reasons [57]. Another factor that could affect the G+C content is that some yeasts like Lachancea kluyverishow an intriguing compositional heterogeneity in that a region of the chromosome has an average G+C content of 52.9% which is significantly higher than the 40.4% global G+C content of the rest of the genome [58].

4. Conclusions

Sequence data are useful for comparing palm wine yeasts from different trees. Data show the countries where the relatives of palm wine yeasts are dominant and may be useful for evolution and species migration studies. Palm wine yeast relatives may originate from beverage, food, and non-edible source. The G+C nucleotide data present insights on changes which may have occurred in conserved regions of some isolates over time. Comparing sequences with the highly conserved regions of the 26S rRNA genes gives an immediate picture of the lineage of palm wine yeasts and their relatives. It can also provide a foundation to select candidates for whole genome sequencing for comparision in future.


The free use of MEGA 7.0 is appreciated. Software can be accessed at

© 2018 The Author(s). Licensee IntechOpen. This chapter is distributed under the terms of the Creative Commons Attribution 3.0 License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

How to cite and reference

Link to this chapter Copy to clipboard

Cite this chapter Copy to clipboard

Ogueri Nwaiwu (November 5th 2018). Phylogeny of Three Palmwine Yeasts Genera, Recent Advances in Phylogenetics, Zubaida Yousaf, IntechOpen, DOI: 10.5772/intechopen.79958. Available from:

chapter statistics

594total chapter downloads

More statistics for editors and authors

Login to your personal dashboard for more detailed statistics on your publications.

Access personal reporting

Related Content

This Book

Next chapter

Importance of Molecular and Phylogenetic Analyses for Identification of Basidiomycetes

By Samina Sarwar, Qudsia Firdous and Abdul Nasir Khalid

Related Book

First chapter

Introductory Chapter: Plant Ecology

By Zubaida Yousaf and Habiba Ramazan

We are IntechOpen, the world's leading publisher of Open Access books. Built by scientists, for scientists. Our readership spans scientists, professors, researchers, librarians, and students, as well as business professionals. We share our knowledge and peer-reveiwed research papers with libraries, scientific and engineering societies, and also work with corporate R&D departments and government entities.

More About Us