Open access peer-reviewed chapter

Computational Identification of the Plausible Molecular Vaccine Candidates of Multidrug-Resistant Salmonella enterica

By Rohit Mishra, Yong Chiang Tan, Amr Adel Ahmed Abd El-Aal and Chandrajit Lahiri

Submitted: October 29th 2020Reviewed: January 6th 2021Published: April 23rd 2021

DOI: 10.5772/intechopen.95856

Downloaded: 120


Salmonella enterica serovars are responsible for the life-threatening, fatal, invasive diseases that are common in children and young adults. According to the most recent estimates, globally, there are approximately 11–20 million cases of morbidity and between 128,000 and 161,000 mortality per year. The high incidence rates of diseases like typhoid, caused by the serovars Typhi and Paratyphi, and gastroenteritis, caused by the non-typhoidal Salmonellae, have become worse, with the ever-increasing pathogenic strains being resistant to fluoroquinolones or almost even the third generation cephalosporins, such as ciprofloxacin and ceftriaxone. With vaccination still being one of the chosen methods of eradicating this disease, identification of candidate proteins, to be utilized for effective molecular vaccines, has probably remained a challenging issue. In our study here, we portray the usage of computational tools to analyze and predict potential vaccine candidate(s) for the multi-drug resistant serovars of S. enterica.


  • typhoid
  • Salmonella Typhi
  • multidrug resistance
  • computational identification
  • vaccine candidates

1. Introduction

With a current worldwide prevalence of around twenty-seven million cases [1, 2] and hundreds of thousands of deaths every year [2, 3], salmonellosis remains the second most common food/water-borne illness. It constitutes a disease caused due to the systemic infection of human and animal hosts by the facultatively anaerobic, Gram-negative rod-shaped bacterial species of Salmonella entericafrom the family Enterobacteriaceae. Clinically, several serologic variants (serovars) of S. entericaexist, which differ with respect to their different antigenic variation in lipopolysaccharide and flagella [4, 5]. They include Typhi and Paratyphi A, besides the non-typhoidal serotypes like Typhimurium and Enteriditis [4]. Among these, the enteric fever termed typhoid, caused by S.Typhi and Paratyphi, is typically a more severe illness than those caused by other non-typhoidal serovars [5].

Being contagious in nature, salmonellosis, like typhoid, can spread through feces, water and the hands of those caring for the sick while, for non-typhoidal serovars, through the consumption of raw or undercooked contaminated food of animal origin such as meat, poultry, eggs and milk by humans [1, 6, 7]. Salmonellosis begins with ingestion of a dose for the bacterium enough to broach the first-line host defenses and colonize the gastrointestinal tract. The onset symptoms for typhoid are usually accompanied with fever, headache, myalgia, anorexia and sometimes diarrhea or constipation [6, 7], moving onto remittent fever, with a stepwise increment in the daily peak temperature, reaching 40°C by the end of the first week [6]. Slow recovery after 3–4 weeks is the normal case, though, for untreated patients with complications, major fatalities occur due to intestinal hemorrhage or perforation [6, 7].

Drugs available for the treatments are mostly ineffective due to the resistance developed with the emergence of multidrug-resistance (MDR) Salmonellastrains [8]. These new strains are ineffective to the older generations of drugs including ampicillin, chloramphenicol, ciprofloxacin, trimethoprim as well as co-trimoxazole and their derivatives, thereby necessitating the newer classes of cephalosporins and quinolone derivatives to be greatly explored to combat such MDR threats [1, 8]. Moreover, dating as early as the 1890s, whole-cell vaccines with parenteral administration of killed suspensions of S.Typhi [9] has several problems having: a) high-reactivity with 20–25% fever and 40–50% local reactions, b) moderate efficacy with protection rates of 51–88% insufficient to halt disease transmission in endemic area and c) logistical and safety problems having the need for needles and two doses. Approaches with recent vaccines, like, single-dose Typhim Vi® containing purified Vi capsular polysaccharide, or, the live attenuated vaccine S.Typhi Ty21a (Vivotif®), confer around 50% protection in adults, and very poor immunogenicity among young children, without any license for under two years old, besides being considered to be expensive for low-middle income areas [10, 11]. Thus, the urgency, for new and specific vaccines and/or drugs to combat the disease, is evident and indeed, proteins of the pathogen-specific biochemical and biosynthetic pathways, involved in the virulence of S.Typhi, has already begun to be targeted with a view to developing novel vaccines/drugs.

While the two afore-mentioned vaccines are for S.Typhi, those for other serovars including Paratyphi, Typhimurium and Enteritidis were largely unavailable until some few years back [11]. Of late, efforts to confer protective immunity for serovars of Typhimurium has been reported with the lppAand lppBBraun lipoprotein genes with and without the msbBgene, encoding an acetyltransferase enzyme required for modification of the lipid A of lipopolysaccharide [12]. Other candidate genes proposed for effective vaccines for different serovars include rpoS, phoPQ, ssaV, htrA[13], besides the proteins of SseBI, OmpACDFL and SopB being used as antigens in other vaccination studies [14]. Such recombinant attenuated Salmonellavaccines (RASV) are considered to be same or more effective than the whole wild-type strains [15]. RASV can persistently colonize internal lymphoid tissues to produce recombinant antigens having their maximum abilities to elicit mucosal and systemic antibody along with those of the cell mediated immune responses [15]. Thus, development of such recombinant vaccines is considered to be the cost-effective and most promising strategy against the pressing antibiotic resistance threats. In this regard, several strategies have been adopted in other drug resistant bacteria including reverse vaccinology through comparative genome analysis and in vitroproteomics [16, 17]. These become especially effective keeping in mind the new and emerging threats of multidrug resistance strains of Salmonella. Such strains might possibly arise form immune selection leading to antigen sequence variability followed by a down-regulation of the target antigens, thereby conferring poor “cross-protective efficacy” as reported for MDR Acinetobacter baumannii[18]. Therefore, identification of new and effective vaccine candidates is, probably, the current need of the hour.

With an availability of different virulent proteins, reported from different experimental verification and predictive databases, selection of the most plausible vaccine candidates can be confusing. To cater to the need of simplifying this complex problem of selection, graph theoretical analysis of the interacting networks of such virulent proteins, involved in the disease scenario, might be poised to be quite useful. Such virulent protein interaction networks (PIN) can be utilized to find out the most central or sought-after proteins for such cases [19]. Ideally, the centrality of any biological networks is efficiently analyzed through global parameters like betweenness, closeness, degree and eigen-vector centralities, referred to as the BC, CC, DC and EC, respectively [19, 20, 21]. Among them, BC has been regarded to be efficient enough to impart central character of a network above CC and DC for long until EC gained some prominence and can be quite effective as reported through recent studies [22, 23, 24, 25].

In this study, we proposed the vaccine candidates for Salmonellaserovars (Figure 1) as explained in the next section. Essentially, we utilized the four different centrality measures for analyzing three different virulent PINs denoted as VVaDK, VFDF and VFDX. Among the top 20 rankers of each of the different centralities, the unanimously present unique candidates were finally collected for further downstream analyses. These shortlisted candidate virulent proteins were rigorously analyzed through different bioinformatic tools to determine their antigenic and allergenic potential besides revealing the epitopes for efficient vaccines or molecular crevices for good drug targets.

Figure 1.

Graphical summary of the methods adopted in vaccine candidates and druggability prediction. This comprises a network-based approach to identify the key players inSalmonellavirulent proteome coupled with downstream predictions of vaccine candidates and druggable pockets among the top rankers.


2. Approach

2.1 Dataset collection

We have initiated our study with the proteins collected for Salmonella entericaserovar Typhimurium str. LT2 (NCBI txid: 99287) on the 19th of December 2020. They were retrieved from two different sources namely, the National Center for Biotechnology Information (NCBI) and the Virulence Factor Database (VFDB) [26]. From NCBI, protein datasets were collected through literature search using various keywords such as Virulence, Virulence Factor, Virulence Protein, Drug(s), Vaccine(s) and Key. Some of these keywords, having essentially the same meaning, were used to get more hits and to avoid missing of any possible candidates thereby reducing the false-negative hits. Finally, all the candidates of the lists were merged, and duplicates were removed to yield 120 proteins to be considered for further analysis. They were termed as VVaDK for easy reference, where V stands for Virulence, Va represents Vaccine(s), D means Drug(s) and K denotes Key. Moreover, two types of candidates’ lists were retrieved from VFDB. They comprised the Full dataset which covers all the proteins (261) related to unknown and predicted VFs of S.Typhimurium and were referred as VFDF. Additionally, 117 experimentally verified candidates were retrieved for S.Typhimurium and termed as VFDX.

All the afore-mentioned proteins for the different categories of VVaDK, VFDF and VFDX were fed as queries to the biological meta-database of protein interaction, STRING version 11.0 [27] to retrieve all the possible interactions of a particular protein [date and time of access: Dec 22, 2020, from 17 hours IST onwards]. Detailed protein links file under the accession number 90371 in STRING v11 was used to collect all the interactions of the whole genome proteins of S.Typhimurium. In each case, a database dictated default medium confidence value of 0.4, for the combined scores from different parameters of interaction, was used. Accordingly, the total number of protein interactions obtained were 138, 3501 and 2464 for VVaDK, VFDF and VFDX listed candidates, respectively.

2.2 Interactome construction

The protein interaction data for all individual sets for VVaDK, VFDF and VFDX, having medium confidence values, were imported into Cytoscape version 3.8.2 [28] to integrate and build the respective interactomes of protein interactions. Care was taken to remove duplicate and bidirectional interactions from each dataset. In essence, such interactome of proteins or the protein interaction network (PIN) has been constructed as an undirected graph, G = (V, E), consisting of E edges and a finite set of V vertices (or nodes) where, edge, e = (u, v), is connected to two vertices u and v. Each vertex/node in our PIN represents a protein. The number of connections/interactions/associations/links, a protein has with other proteins, reflects its degree, d [29].

2.3 Network analysis

All the constructed 3 PINs have been viewed by Cytoscape v 3.8.2 in the form of interactomes of aforementioned interconnected proteins. They were subsequently analyzed through the integrated java plugin CytoNCA version 2.1.6 [30] to compute values for BC, CC, DC and EC as the four different global network centrality parameters. The different parametric combined scores from STRING were considered as edge weights for computing the CytoNCA scores of the 4 centrality parameters. Upon sorting these 4 measures from largest to smallest, top 20 proteins for each of the categories of centrality were picked to create Venn diagrams using Venny 2.0 [31] for finding the common proteins from each of the measures. This resulted in 12, 10, 7 proteins from VVaDK, VFDF and VFDX, respectively. Among these 29 candidates, 9 duplicates were removed to yield a total of 20 proteins. Through a BLASTp alignment, these Typhimurium proteins were unanimously found in the serovars of Typhi and Paratyphi, and thus, considered for further analyses.

2.4 Vaccine and/or drug candidature prediction

2.4.1 Basic analysis

The 20 shortlisted protein candidates from VVaDK, VFDF and VFDX PIN analysis were subjected to further analyses for predicting the plausible vaccine and/or drug candidates. All such proteins were explored for their molecular weight calculation, cellular localization, signal peptide prediction followed by antigenicity prediction. ProtParam was used to find the molecular weight and number of amino acids [32] and cellular localization was analyzed by PSORTb v3.0.2 [33]. Location of signal peptides was predicted using the server called SignalP 4.1 [34]. Lipoprotein signal peptides were predicted using the LipoP 1.0 [35]. Finally, Vaxijen was used to predict the possible antigenicity of the proteins [36].

2.4.2 Mapping of available 3D structures in PDB

For the top ranked proteins, the respective crystallized protein 3D structures available in Protein Data Bank (PDB) were retrieved (Table 1). The seleno-methionine in PDB structures were changed back into methionine using Dock Prep in Chimera [37].

ProteinStructural Information
PDB IDChain IDStructure CoverageResolution
SsrB2JPCA133–193 (First 19 N-terminal amino acids missing)NMR

Table 1.

PDB structure availability among top rankers.

SsaQ, SsaD, InvE, HilA, BcfD, SicA, SsaJ, SscA, DD95_23890, DD95_21695, DD95_16310, and DD95_14775 have no structures available in PDB.

2.4.3 B-cell epitope prediction

Unlike viral pathogens, most bacterial pathogens are not intracellular parasites, especially Salmonella. Thus, the humoral immune response, which involves B cells and antibodies, will be of great focus in this study. Herein, BepiPred v2.0 and DiscoTope v2.0 were utilized in predicting linear and discontinuous B-cell epitopes, respectively [38, 39]. For BepiPred, the default threshold score of 0.5 was applied for epitope recognition. For DiscoTope, the propensity score radius was 22 Angstrom, upper half sphere radius was 14 Angstrom, window size was 1, and alpha was 0.115. An in-house script (DiscoTope2ChimeraAttr) has been utilized to convert DiscoTope result into Chimera attributes for visualization in 3D, with a default threshold DiscoTope score of −3.7 [40]. These analyses were done to pinpoint the specific immunogenic regions within the full-length proteins. Thus, the immunogenically insignificant regions can be trimmed out, resulting in shorter peptides which can confer higher specificity and ease the peptide synthesis process.

2.4.4 Allergenicity prediction

The ability of proposed immunogen to potentially evoke allergic reactions can usually fail clinical trials due to the severe adverse effects arising upon vaccination. Herein, we utilized AllerCatPro, AlgPred2, and AllergenFP v1.0 to predict possible allergic reactions raised by the query proteins, which were the top rankers in this case. For AlgPred2, the hybrid algorithm was selected and the default threshold value of 0.3 was selected. AllerCatPro predicts allergenicity by comparing the protein structural and sequential information to known allergens [41]. Besides, the hybrid algorithm of AlgPred2.0 utilizes the random forest, BLAST, and MERCI algorithms to predict the allergenicity of the query proteins [42]. Moreover, the allergenicity prediction of AllergenFP v1.0 utilizes an alignment-independent fingerprint-based approach [43].

2.4.5 Druggable pocket prediction

P2Rank was being utilized to predict the presence of druggable pockets in the available 3D structures of proteins [44]. P2Rank utilizes a template-independent machine learning algorithm in predicting potential ligand-binding sites on the query proteins. Herein, the topmost ranked predicted pockets were selected for further analyses. Thus, besides being utilized in vaccination, the potential druggability of the top rankers can be discovered.

2.4.6 Detecting human counterparts

Peptide vaccines that contain regions of high sequence similarity to human proteome counterparts can lead to ineffective vaccination due to recognition as “self” by the immune system, which can result in low antigenicity or adverse effects that arise from potential self-reactivity. Thus, the top rankers were screened for human counterparts via sequence alignment approach using BLASTp against non-redundant proteins (nr) database with Homo sapiensas the specified organism [45].


3. Interactome analyses of three virulent PINs

Three different interactomes of virulent proteins of Salmonellawere built using the method described above. The first of them comprised those available through literature search using different keywords comprising Virulence, Virulence Factor, Virulence Protein, Drug(s), Vaccine(s) and Key. This was named as VVaDK. The other two PINs were made of the full and experimentally verified datasets of virulent proteins from Salmonella, listed in VFDB and were named as VFDF and VFDX, respectively. The four centrality measures were applied for analyzing each of these PINs and twenty top rankers from each of the measures were initially segregated. Among them, the proteins present unanimously for all the measures were noted as 12, 10 and 7 for VVaDK, VFDF and VFDX, respectively, and a removal of duplicates from them finally yielded 20 candidates for further downstream analysis.

Our unique way of streamlining the candidates is based upon the following facts. Under pathological conditions, the virulent proteins are expected to be working in unison to render the final disease phenotype. Thus, their connectivity could be perceived in terms of the said PINs. Among these proteins, some can be master regulators and connecting to others more frequently thereby having higher order of connectivity. This renders them degree centrality (DC). Alternatively, there could be different types of such regulators for carrying out different sub-functions of the main disease phenotype and they form the bridge between the other proteins. These could impart the betweenness centrality (BC) of such proteins. Moreover, among such conglomerate of different proteins, certain numbers could connect to others faster to sequentially carry out their function, leading to a concept of closeness for them and having higher closeness centrality (CC). Furthermore, certain proteins could be more important to render the final disease phenotype and they are only connected to other important proteins to carry out their functions. These could bring out their character of eigen vector centrality (EC). Finally, from the top-ranking proteins of all these centrality measures, those, appearing unanimously, are expected to play a major role in virulence and could be segregated to scan for further analysis. These are 20 unique virulent proteins, mostly belonging to the SalmonellaPathogenicity Islands (SPI) from three different PIN analyses and reflected in Figure 1 and Table 2. These are discussed in the next section.

ProteinProtParamPsortBSignalP & LipoPTMpredVaxijen
# amino acidsMolecular
# of TM HelicesPositionScore (Orientation)
SptP54360047.68E0.5192A1477–496570 (o-i)
SsaQ32236009.35C0.3857NA1186–209611 (i-o)
SpaO30333793.74C0.5073A162–86600 (o-i)
PrgH39244459.53C0.5122A1142–1632551 (o-i)
2293 (i-o)
2875 (o-i)
SsaD40344849.66CM0.4319A1119–1352978 (o-i)
634 (o-i)
612 (i-o)
HilA55363040.96C0.3985N1340–361523 (o-i)
1342 (o-i)
582 (i-o)
0.371N15–251725 (i-0)
1371 (i-o)
2963 (o-i)
0.5132A1208–2252812 (i-o)
641 (i-o)
610 (o-i)
513 (i-o)
558 (o-i)
504 (i-o)
692 (o-I)
512 (i-o)
1827 (o-i)
2340 (i-o)
2738 (o-i)
1048 (i-o)
14716757.61U0.4706A1127–1441622 (o-i)

Table 2.

Basic screening of plausible vaccine candidates.

TM: Transmembrane. For Localization, E: Extracellular, C: Cytoplasmic, CM: Cytoplasmic Membrane, OM: Outer Membrane, U: Unknown. For TMpred status, A: Antigen, N: Non-antigen.


4. Features of the twenty virulent proteins

All the virulent proteins from different serovars of Salmonellaare discussed here, with their characteristic features along with a note on their existing vaccine potential.

SptPis one of the most important SPI-1 Type III Secretion System (T3SS) effector proteins which facilitates the bacterial translocation and survival into the host non-phagocytic cells by inhibition of the extracellular-regulated kinase (ERK) mitogen-activated protein kinase (MAP) pathways [46]. It requires SicP as a chaperone protein for its secretion and stabilization [46]. Moreover, SptP is directly responsible for the reversal of the actin cytoskeletal changes in the host cells by acting as a GTPase-activating protein (GAP) for Rac-1 and Cdc42. In fact, the efficacy of sptPdeletion mutation of S.Enteriditis has been shown to be effective for live attenuated vaccine (LAV) in chickens [47].

SsaQis a member of FliN/YscQ/Spa33/HrcQ family of both T3SS and flagellum proteins [48]. The gene ssaQis encoded in the ssaMVNOPQoperon within the SPI-2 and transcribes to two products namely, SsaQL of 322 residues and SsaQS of 106 residues. SsaQS acts as a chaperone-like protein for SsaQL and optimize its function. SsaQ interact with SsaK and SsaN to form the C-ring complex, which have a crucial role in secretion by acting as a cytoplasmic sorting platform at the base of T3SS as well as rotation and direction switching of the flagella [49].

SpaOis a major invasion factor of S. enterica spp.and the core component of the sorting platform in S.Typhimurium. SpaO is comprised of 303 residues of two translated products with SpaOS (the shorter product) encompassing the last 101 amino acids of SpaOL (full length protein) [50]. It is a highly conserved element in T3SS that shares similarity with limited residues with flagellar C-ring substructure [51]. In fact, SpaO, along with H1a, has been suggested to be promising new vaccine candidates to prevent typhoid fever caused by S.Paratyphi A infection [52].

PrgHis a 55 kDa protein encoded within prgHIJKoperon in the SPI-1. All the genes of prgoperon are essential for the formation of T3SS needle complex (NC) and known to share sequence similarity with the flagellar protein, FliF [53]. PrgH inserts in the inner membrane by its hydrophobic domain where it forms the MS-ring of the flagellar basal body as well as provides the structural foundation required for prgKoligomerization for further assembly of the NC [53].

SicAis a wide acting chaperone protein (18 KDa) which aids in the secretion process of all T3SS proteins through the invasion of host cells. Accordingly, it is encoded upstream to the Sip/SspABCDoperon in SPI-1. SipB and SipC proteins are responsible for the translocon formation in the host cell membrane to facilitate the injection of Type III effector proteins into the host cell to manipulate it [54]. Moreover, SicA is essential for the expression of the most virulence genes that encode T3SS effector proteins and is identified as a co-regulator with InvF for SigDEand SptP[55].

HilAis a member of the OmpR/ToxR regulator protein family and the central activator of SPI-1 genes, belonging to T3SS. The hilAgene is encoded within SPI-1 and is the key factor in SPI-1-T3SS regulation, starting from the expression of downstream genes sicAand invFto ultimate regulation of the effector genes sipAand sipB[56]. The upregulation of hilAresults in the high expression of all genes encoded within the SPI-1 which are necessary for the invasion of epithelial cells. Moreover, the expression of hilAis controlled by many different activators and suppressors in response to specific environmental changes during invasion of the host cells, such as, temperature, bile, fatty acids, osmolarity, pH, oxygen concentrations and growth state [57]. Additionally, certain studies considered HilA as a promising drug target to inhibit the activity of T3SS without affecting the growth of Salmonella[58].

SiiEis the largest protein in Salmonellaproteome, with the size of 595 kDa. It consists of 53 repetitive bacterial immunoglobulin domains, each containing several conserved residues [59]. The protein helps to contact the host cell membrane and positions the SPI T3SS, to initiate the translocation of effector proteins. A study states that SalmonellaSiiE-mediated entry of enterocytes via the apical route requires transmembrane mucin MUC1 [60]. Moreover, it is shown that, siiEis required for the prevention of efficient humoral immune response against the pathogen and it induces the high tires of specific Salmonella-specific IgG [61].

PrgKis a component from the inner membrane of SalmonellaSPI-1 T3SS basal body, in its N-terminus. It possess the canonical lipoproteins which acts as anchor for the hydrophilic proteins onto the surface of the bacterial cell membranes [62]. In addition, C-terminus of PrgK is found in the cytoplasm which confirms that the protein traverses the inner membrane. A study observed reduced fever in swine which were vaccinated with prgKgene attenuated S.Typhimurium in comparison with mock-vaccinated swine [63].

SscAis a chaperone protein of about 18 KDa size. It is an independent α-helical protein, that consists of eight α-helices and repeated large tetratricopeptide domain from 36 to 137 amino acids. SscA is a virulence factor which encodes the chaperonin of SseC and the translocon is involved during the adaptation and survival to desiccation [64]. A huge effect of the gene expression level of sscA,has been noted on treatment of the samples with ciprofloxacin [65].

SsaJis a core encoding component of the T3SS. It is required for SpvB, in-order to induce the actin depolymerization, especially inside the human macrophages. Salmonelladepends on SsaJ effector protein as it prevents the interaction of NADPH oxidase subunit Cytb558 with the Salmonellacontaining vesicle (SCV) thereby helping to avoid the oxidative burst [66]. An in vivostudy, conducted with the peptide of SsaJ, however, showed its inability to provide antigen specific immunity when compared with the other chosen peptides [67].

SctCis a layer of outer membrane anchor forming two distinct outer rings namely, OR1 and OR2. It is homologous to a protein of Type II Secretion System (T2SS) which requires pilotin lipoprotein for its optimal assembly and localization [68]. SctC serves as a midline between the inner and outer membrane, with evidence showing that the translocation of foreign antigens can induce potent immune response against pathogens [69].

SsrBis responsible for the survival and replication of Salmonellain the host cell and plays an important role in the transcription of multiple genes of SPI-2. SsrB has been claimed as one of the most important factors for Salmonella’s virulence by the fact that, a mutated ssrB, resulted in reduced ability of colonization on comparing with the wild type [70]. Moreover, one alteration in the gene ssrB, preferentially silencing the acquired DNA, can have a high contribution towards low transcription in the virulence factors of Salmonella[71].

BcfDis a fimbrial protein and part of the operon Bcf [72]. BcfD is a surface molecule, which helps in the adherence through specific receptors on the host cell. This step of adhesion is considered to be an important course during infection as it allows bacteria to initiate the colonization [73]. A research shows that the knockout of this gene influenced in the low adhesion capacity of Salmonellato the host cell [74].

InvE, encoded within SPI-1, is a protein located in the cell membrane and said to be essential for the translocation of Salmonellaproteins into the host cells by regulating the functions of the Sip protein translocases [75]. An investigation of finding the region of InvE, as the T3SS regulator protein, indicates that it may have two functional domains which are responsible for regulating the secretion of translocases as N-terminal secretion signal and C-terminal regulatory domain [76]. An in-vivostudy conducted with the BALB/c mice, showed less pathogenicity when it is injected with the mutated invEgene Salmonellaon comparing with the wild strain [77].

SipBis one of the effector proteins of SPI-1 T3SS which facilitates the entry of Salmonellainto the host cell. It is also called as an invasion protein as it initiates the bacterial entry process. It forms a complex along with the SipC to assemble into plasma membrane-integral structure which mediates the effectors delivery [78]. It also affects the membrane fluidity and bacterial osmotolerance and hence a small alteration of this gene will pave a huge way to prevent Salmonellaentry into the host cell [79]. In fact, a study evaluating the effect of sipBdeleted mutants, showed significant decrease in the virulence of sipBmutants when compared with the wild-type strains [80].

SsaDis an important cellular component which is responsible for the virulence of Salmonella.It is found to be in the transmembrane of the bacteria. The gene ssaDencodes for the proteins related to the basal body, cytoplasmic rings and export apparatus and it is also involved in the ATPase complex, regulation and translocation of T3SS [81]. A study shows that there is an important defect in the intercellular survival with the mutant ssaDstrains on comparing with the wild-type Salmonella[82].

DD95_23890refers to the computationally predicted protein, mapping to the autotransporter adhesin BigA protein. The BigA protein in Salmonellahas recently been identified via automated genome annotation in 2015. Thus, studies on this protein has been scarce. Inferring from its homolog in Brucella, the cell surface BigA protein promotes adhesion of bacteria on host epithelial cells [83, 84]. The adhesive properties of the BigA protein can be established by binding onto the cell adhesion molecules on the host epithelial cellular surface [85].

DD95_21695maps to the RING-type E3 ubiquitin transferase (SspH2) protein. The SspH2 protein aids in Salmonellapathogenicity by conferring anti-inflammatory properties, hence delaying the host immune response in reaction to bacterial invasion [86]. Moreover, the ability of SspH2 to ubiquitinate host NOD1 protein, through an essential interaction with host SGT1 protein, can result in NOD1-mediated IL-8 secretion in host [87].

DD95_16310maps to the SalmonellaTorS histidine kinase sensor. The TorS protein comprises the two-component systems along with the TorT response regulator [88]. Upon stimulation by Trimethylamine-N-oxide, TorS, along with TorT, carry out osmoregulation and protect the cellular proteins against low-pH induced denaturation in urea [88].

DD95_14775refers to the putative transcriptional regulator marT_1in Salmonella. The MarT protein mainly regulates the expression of MisL autotransporter protein, which is a fibronectin-binding protein that is involved in the cell adhesive properties of Salmonella[89]. Moreover, MarT has also been reported to regulate the expression of genes related to bacterial biofilm formation [90].


5. Initial screening of the candidate proteins

All the twenty proteins were screened to ascertain their potential for plausible candidatures as vaccines (Table 2). Proteins were localized in extracellular matrix (3), cytoplasm (7), cytoplasmic membrane (3) and outer membranes (2), besides some of them being predicted with unknown cellular location (5). Of these, surface/outer membrane proteins and vesicles have been deployed for prospective vaccinations against bacterial pathogens [91, 92, 93, 94]. Again, extracellular proteins have been potentiated as drugs for prospects against disease management, albeit,in a different scenario [95, 96]. Our results predict the proteins namely, SptP, SipB, SsaD, PrgK and TorS to be potentially antigenic except InvE, SctC and SspH2. Notably, the five proteins of unknown location, namely, BcfD, SsaJ, BigA, SiiE, and MarT_1 are all potentially antigenic. Of the two signal peptides BcfD and SctC, the latter was predicted to be non-antigenic while SsaJ and PrgK belongs to another category of signal peptides (lipoproteins) with good antigenic potential. Of these, SsaJ has been predicted with two transmembrane (TM) spanning helices and poses itself a good candidate for vaccines. Other candidates with more TM helices are BigA (5), SiiE (3) and TorS (3). Furthermore, a BLASTp alignment of these 20 proteins revealed SptP and SspH2 to have 40–50% similarity for 101 and 106 hits, respectively, against human counterparts, thereby completely ruling out their candidature as potential vaccines.


6. Selection of potential vaccine candidates

The 20 top ranked proteins were further screened for B cell epitopes. Therein, InvE, SsrB, SicA, and SscA were omitted from being considered as vaccine candidates due to the absence of predicted epitopes that fall within the normal range of peptide length (Table 3). Moreover, in allergenicity prediction, HilA, BcfD, SicA, BigA, SiiE, and MarT_1 were predicted to be potential allergens (Table 4), and thus, were excluded from consideration as well. Hence, we report SptP, SsaQ, SpaO, PrgH, SipB, SsaD, SctC, SsaJ, PrgK, SspH2, and TorS to be potentially utilized as B cell epitopes. Moreover, in discontinuous B cell epitope prediction, the localizations of the highly antigenic regions were illustrated in 3D (Figure 2). For successful vaccination, these regions should be prioritized and retained as much as possible due to their important roles in antigenicity.

ProteinStartEndPeptideLengthAverage Score

Table 3.

BepiPred v2.0 prediction of linear B-cell epitopes.

Only predicted peptides of length between 15 to 25 amino acids were selected [97]. SiiE protein were omitted from prediction because of its overly huge sequence.

ProteinAllerCatProAlgPred2AllergenFP v1.0
Hybrid ScorePrediction
SptPNo Hits0.04Non-AllergenProbable Non-Allergen
SsaQNo Hits0.08Non-AllergenProbable Non-Allergen
SpaONo Hits0.03Non-AllergenProbable Non-Allergen
PrgHNo Hits0.03Non-AllergenProbable Non-Allergen
SipBNo Hits0.24Non-AllergenProbable Non-Allergen
SsaDNo Hits0.09Non-AllergenProbable Non-Allergen
InvENo Hits0.02Non-AllergenProbable Non-Allergen
HilANo Hits0.54AllergenProbable Non-Allergen
BcfDNo Hits0.85AllergenProbable Non-Allergen
SsrBNo Hits−0.43Non-AllergenProbable Non-Allergen
SctCNo Hits0.09Non-AllergenProbable Non-Allergen
SicANo Hits0.31AllergenProbable Allergen
SsaJNo Hits−0.45Non-AllergenProbable Non-Allergen
SscANo Hits0.18Non-AllergenProbable Non-Allergen
PrgKNo Hits−0.48Non-AllergenProbable Non-Allergen
BigANo Hits0.75AllergenProbable Non-Allergen
SiiENo Hits0.86AllergenN/A
SspH2No Hits−0.48Non-AllergenProbable Non-Allergen
TorSNo Hits0.02Non-AllergenProbable Non-Allergen
MarT_1No Hits0.55AllergenProbable Non-Allergen

Table 4.

Allergenicity assessment through different predictive tools. Potential allergens are in bold case.

For AllergenFP v1.0, N/A refers to Not Available because of overly large protein size.

Figure 2.

DiscoTope v2.0 prediction of discontinuous B-cell epitopes. The residues are colored according to their respective DiscoTope scores (red: high, white: threshold of −3.7 and blue: low).


7. Potential druggable proteins

Besides potential vaccine candidates, we have conducted predictions on the druggability and druggable sites of the 20 top ranked proteins which have their 3D crystallized structures available in PDB. Eventually, the localization of the top ranked druggable pockets of SptP, SipB, SctC, SpaO, SsrB, PrgK, PrgH, and SiiE were illustrated in 3D (Figure 3). This can help future research in structure-aided drug discovery, by designing drugs specific for the druggable pockets to suppress the virulence of Salmonella.

Figure 3.

P2Rank predicted druggable pockets colored in light green. For SpaO, chain A is in blue, while B is in red. Residues contributing to druggability are tabulated.


8. Conclusions

The study depicted here essentially delineates a schematic approach of shortlisting the most probable virulent proteins as potential vaccine and/or drug candidates from the proteome of Salmonellaspp. It starts with the building of the theoretical PIN comprising the known and predicted virulent proteins followed by the graph theoretical parametric analyses for identifying a probable set of them. These were further screened through different essential tools enabling the prediction of cellular localisation, signal peptides, transmembrane helices, antigenicity, epitopes, allergenicity and molecular crevices besides comparing with any human homologs. A thorough analysis revealed SsaJ and PrgK to come to the forefront among those already known to be virulent. PrgK even has nice druggable pocket to be targeted through potential drugs. Our approach can pave the way for screening such effective molecular vaccines and/or drug targets for such pathogens. Newer candidates, however, could be unraveled through other effective methods.



The authors wish to acknowledge the support of Sunway University, Malaysia for the provision of computational facilities.


Conflict of interest

The authors declare no conflict of interest.

© 2021 The Author(s). Licensee IntechOpen. This chapter is distributed under the terms of the Creative Commons Attribution 3.0 License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

How to cite and reference

Link to this chapter Copy to clipboard

Cite this chapter Copy to clipboard

Rohit Mishra, Yong Chiang Tan, Amr Adel Ahmed Abd El-Aal and Chandrajit Lahiri (April 23rd 2021). Computational Identification of the Plausible Molecular Vaccine Candidates of Multidrug-Resistant <em>Salmonella enterica</em>, Salmonella spp. - A Global Challenge, Alexandre Lamas, Patricia Regal and Carlos Manuel Franco, IntechOpen, DOI: 10.5772/intechopen.95856. Available from:

chapter statistics

120total chapter downloads

More statistics for editors and authors

Login to your personal dashboard for more detailed statistics on your publications.

Access personal reporting

Related Content

This Book

Next chapter

Non-Typhoidal Salmonellosis: A Major Concern for Poultry Industry

By Mamta Pandey and Emmagouni Sharath Kumar Goud

Related Book

First chapter

Introductory Chapter: The Contribution of Cohort Studies to Health Sciences

By René Mauricio Barría

We are IntechOpen, the world's leading publisher of Open Access books. Built by scientists, for scientists. Our readership spans scientists, professors, researchers, librarians, and students, as well as business professionals. We share our knowledge and peer-reveiwed research papers with libraries, scientific and engineering societies, and also work with corporate R&D departments and government entities.

More About Us