Open access peer-reviewed chapter - ONLINE FIRST

The Diversity of Parvovirus Telomeres

Written By

Marianne Laugel, Emilie Lecomte, Eduard Ayuso, Oumeya Adjali, Mathieu Mével and Magalie Penaud-Budloo

Submitted: October 30th, 2021 Reviewed: January 14th, 2022 Published: April 27th, 2022

DOI: 10.5772/intechopen.102684

IntechOpen
Recent Advances in Canine Medicine Edited by Carlos Eduardo Fonseca-Alves



Abstract

Parvoviridae are small viruses composed of a 4–6 kb linear single-stranded DNA protected by an icosahedral capsid. The viral genes coding for non-structural (NS), capsid, and accessory proteins are flanked by intriguing sequences, namely the telomeres. Telomeres are essential for parvovirus genome replication, encapsidation, and integration. Similar (homotelomeric) or different (heterotelomeric) at the two ends, they all contain imperfect palindromes that fold into hairpin structures. Up to 550 nucleotides in length, they harbor a wide variety of motifs and structures known to be recognized by host cell factors. Our study aims to comprehensively analyze parvovirus ends to better understand the role of these particular sequences in the virus life cycle. Forty Parvoviridae terminal repeats (TRs) were publicly available in databases. The folding and specific DNA secondary structures, such as G4 and triplex, were systematically analyzed. A principal component analysis was carried out on the prediction data to determine the variables that characterize parvovirus groups. A special focus is placed on the inverted terminal repeats (ITRs) of the adeno-associated virus (AAV), a member of the genus Dependoparvovirus used as a vector for gene therapy. This chapter highlights the diversity of the Parvoviridae telomeres regarding shape and secondary structures, providing information that could be relevant for studies of virus-host interactions.

Keywords

  • parvovirus
  • telomeres
  • DNA folding
  • DNA secondary structure
  • adeno-associated virus

1. Introduction

Many linear DNA viruses possess terminal repeats (TRs) known to be critical for viral genome stability and propagation [1]. A parallel can be drawn with human chromosome telomeres, which are composed of GC-rich repeat sequences of 5–10 nucleotides. In cells, telomeres are critical to maintain the linear structure of the chromosomes. They can adopt specific secondary structures, such as G-quadruplexes (G4), providing structural characteristics for protein binding and genomic stability. In addition, cellular telomeres play a role in transcription regulation, chromatin compaction, subcellular localization, and chromosome segregation.

Similarly, Parvoviridae TRs have been demonstrated to be essential to several steps of the virus cycle. They vary in shape and in size, from approximately 100 to 550 nucleotides [2]. Due to the presence of palindromic repeats, TRs can fold into T-, I-, J-, Y-, or U-shaped structures or simple hairpin-like structures. Up to now, the global shapes have been named without any consensus; for example, the Y-shape is also called “rabbit ears.”

Viral genomes in some genera (Ambi- and Itera-densovirus; Ave-, Dependo- and Erythro-parvovirus) are homotelomeric, meaning that both termini are similar but inverted, whereas in other genera (Brevi- and Hepan-densovirus; Amdo-, Boca- and Proto-parvovirus) the 5′ and 3′ ends of the linear genome differ and are therefore called heterotelomeric. The strand polarity packaged in viral capsids may be related to the dissimilarity between the left and right TRs. Indeed, most of the heterotelomeric parvoviruses encapsidate only one strand polarity, mainly negative. This preference may be due to inefficient nicking during replication or to an incomplete packaging signal at one TR [2]. For example, the minute virus of mice (MVM), a virus of the subfamily Parvovirinae and genus Protoparvovirus, harbors a Y-shaped left end and a longer U-shaped structure at its right end. After replication and ori resolution, the single-stranded DNA of minus polarity is preferentially displaced from the left TR and encapsidated. For parvoviruses that package both polarities, the proportion can range from 1 to 50% and may be influenced by the host cell in which the virus is produced [3].

Parvoviral TRs are involved in many steps of the virus life cycle. They contain most of the cis-acting information required for genome replication and encapsidation, including tetranucleotide repeats that serve as binding sites for NS1 (Rep) oligomers, a resolution site necessary for the completion of the DNA strand copy, and a packaging signal. Recognized as DNA double-strand breaks (DSB) in the host cell, TRs can trigger a DNA damage response (DDR), leading to the circularization and concatemerization of the viral genomes either by non-homologous end-joining (NHEJ) or homologous recombination (HR) [4]. Finally, transcription regulation elements are contained in the genome ends. For example, the MVM TRs contain both symmetric and asymmetric binding sites for transcription factors that modulate expression from the adjacent P4 promoter [5], and the Acheta domestica densovirus TRs contain a TATA box used for transcription initiation of the NS gene on one side and of the VP gene on the other side [6].

The secondary structures and motif composition of the TRs, and their role in the virus cycle, have been under-examined. In this study, DNA secondary structures of the Parvoviridae TRs, including non-canonical secondary structures, have been predicted. We show a high diversity of parvovirus telomere characteristics, even within a genus. This chapter may provide significant knowledge for Parvoviridae classification and interaction with host cells.


2. The intriguing shape diversity of parvovirus telomeres

2.1 Size, GC content, and shape of parvovirus ends

Although Parvoviridae genomes have been extensively studied, in particular for phylogenetic and evolutionary analyses, the sequences and characteristics of their telomeres have not been clearly described. Therefore, we analyzed the Parvoviridae TR sequences publicly available in the NCBI GenBank database. Complete Parvoviridae genomes were downloaded. At least one representative virus per genus was selected based on its notoriety and on the information available on the International Committee on Taxonomy of Viruses (ICTV) website. TRs were annotated following the GenBank annotation or information available in the literature (Table S1). TR sequences of homotelomeric genomes were then verified by aligning the 5′ and 3′ ends. Sequences differing in length and showing no homology (with no common palindromic regions between 5′ and 3′ ends) were discarded from the data set. Finally, the presence of palindromic regions was verified with RNAfold (method described later) [7]. A total of 40 Parvoviridae 5′ and 3′ TR sequences were extracted for further analysis. Among those are 17 Densovirinae, 22 Parvovirinae, and 1 unclassified Parvoviridae telomeres.
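The verification of homotelomeric ends can be illustrated with a minimal sketch, assuming a simple ungapped comparison between the 5′ TR and the reverse complement of the 3′ TR. The function names are hypothetical and this is not the alignment procedure used in the study; a real alignment tool would also handle gaps and length differences.

```python
# Illustrative sketch only: hypothetical helper names, ungapped comparison.
_COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence."""
    return seq.translate(_COMPLEMENT)[::-1]


def inverted_repeat_identity(five_prime_tr: str, three_prime_tr: str) -> float:
    """Fraction of positions at which the 5' TR matches the reverse
    complement of the 3' TR. Returns 0.0 when the lengths differ,
    because no gapped alignment is attempted here."""
    left = five_prime_tr.upper()
    right = reverse_complement(three_prime_tr).upper()
    if not left or len(left) != len(right):
        return 0.0
    matches = sum(a == b for a, b in zip(left, right))
    return matches / len(left)
```

A value close to 1.0 would be expected for two ends that are inverted copies of each other, whereas heterotelomeric ends would score low or be rejected on length.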

First, the length of each TR was determined and listed in Table S1. Interestingly, TR length varies within a single genus (Table 1), for example, going from 122 nucleotides for the PcDV to 550 nucleotides for the GmDV in the same genus Ambidensovirus. Second, the percentage of GC was calculated for each parvovirus TR. Figure 1 highlights the GC content diversity of TRs between parvoviruses. The minimum and maximum GC contents were observed for the left TRs of AalDV2 and AAV2, with 32.4% and 69.7%, respectively. Within the genus Ambidensovirus, the percentage of GC ranges from 35% to 58.5%, with the lowest GC content being attributed to the 5′ TR of the PcDV (Figure 1b). Comparatively, telomeres of the human chromosomes contain 50–55% of G and C bases, whereas the whole human genome contains 40.9% GC on average [8].
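The GC percentage reported here is a simple ratio, which the following minimal sketch makes explicit. The study used the APE program; the function below is a hypothetical stand-in that ignores any character other than A, C, G, and T.

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C among the unambiguous bases of a sequence."""
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        return 0.0
    gc = sum(b in "GC" for b in bases)
    return 100.0 * gc / len(bases)
```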

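| Subfamily | Genus | 5′ TR length | 3′ TR length |
|---|---|---|---|
| Parvovirinae | Bocaparvovirus | [140–161] | [161–200] |
| Parvovirinae | Dependoparvovirus | [141–455] | [141–455] |
| Parvovirinae | Erythroparvovirus | [94–383] | [94–383] |
| Densovirinae | Ambidensovirus | [122–550] | [122–550] |
| Densovirinae | Brevidensovirus | [98–182] | [134–165] |
| Densovirinae | Iteradensovirus | [101–271] | [101–271] |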

Table 1.

Minimal and maximal lengths (in nucleotides) of five-prime and three-prime terminal repeats within genera of the Parvoviridae family.

Figure 1.

GC content of five-prime terminal repeats of the Parvoviridae family: Differences inside the Parvovirinae (a) and the Densovirinae (b) subfamilies. The telomere sequences were downloaded from the NCBI GenBank database (see Table S1 for accession numbers). GC content was calculated using the APE program.

To visualize the general shape and the secondary structures of the viral TRs, the folding of each parvovirus TR was predicted by the RNAfold program using parameters of the Turner model for single-stranded RNA and DNA and the Matthews model for double-stranded DNA [7]. Additionally, the mFold program was used in DNA mode to corroborate the predictions [9]. The most thermodynamically stable structures, or minimum free energy (MFE) structures, obtained on the RNAfold web server were used to propose a classification of the TRs (Figure 2). Four groups were constituted according to the number of hairpin loops at their extremity and named H1 (previously named U- and I-shapes in the literature), H2 (corresponding to J-, Y-, and T-shapes), H3, and H4 (Figure 2, Table S1). This classification, based on the number of terminal hairpin loops after folding and on additional structural characteristics, may be more informative and precise than the global shape. Moreover, this nomenclature is applicable to all parvovirus TRs. Interestingly, TR sequences and shapes differ within a genus. For example, among Ambidensovirus, CpDV and DicDV 5′ TRs are both classified in the H1 group although they share only 43% sequence homology. In the genus Bocaparvovirus, HBoV1 and BPV1 left ends are 62% homologous in sequence but form a terminal H1 and H2 shape, respectively. Phylogenetic and evolutionary analyses of Parvoviridae have been constructed on the basis of NS1 protein homology. Telomeres have never been considered as a classification criterion.
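As a rough illustration of the H1 to H4 grouping, the sketch below counts hairpin loops in a dot-bracket string, the notation in which RNAfold reports MFE structures. It is only an approximation of the proposed classification: it counts every hairpin loop in the structure rather than only those at the TR extremity, and it ignores the additional structural characteristics mentioned above. The function names are hypothetical.

```python
import re


def count_hairpin_loops(dot_bracket: str) -> int:
    """Count hairpin loops: a base pair enclosing only unpaired positions,
    i.e. '(' followed by one or more '.' and then ')'."""
    return len(re.findall(r"\(\.+\)", dot_bracket))


def hairpin_group(dot_bracket: str) -> str:
    """Map the hairpin-loop count to a group label H1..H4 (assumed rule)."""
    n = count_hairpin_loops(dot_bracket)
    return f"H{n}" if 1 <= n <= 4 else "Unclassified"
```

For instance, a single stem-loop such as `((((....))))` yields one hairpin loop, while a T-shaped structure with two small arms closed by a common stem yields two.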

Figure 2.

Examples of five-prime terminal repeat folding among the Parvoviridae family. The most thermodynamically stable structures or minimum free energy (MFE) structures were obtained using the RNAfold program (RNA mode: http://rna.tbi.univie.ac.at/cgi-bin/RNAWebSuite/RNAfold.cgi). Four groups (H1 to H4) were generated according to the number of hairpin loops (in blue) found at the five-prime TR extremity. The following DNA secondary structures were identified and counted: Stems (green), multiloops (red), interior loops (yellow), and hairpin loops (blue).

2.2 Comprehensive analysis of DNA secondary structural elements

The global analysis of the parvovirus TRs has highlighted their broad diversity, even within the same genus. To study the TR divergence, an in-depth prediction of the secondary structures was performed, followed by a principal component analysis (PCA). Secondary structure elements (Figure 2) and non-B form DNA structures were included as variables in the PCA.

Non-canonical structures are likely to be recognized by cellular proteins and thus to be essential in virus-host interactions. For example, a recent study reported that special structures in DNA, such as quadruplex structures, can preferentially bind to IFI16 and trigger more potent type I IFN responses than those produced by the same sequence in dsDNA [10]. Such structures are intrinsic to many viral genomes, such as those of EBV and HPV [11]. Rich in GC, viral telomeres may also contain non-B DNA structures, such as G-quadruplexes (G4) or triplexes.

Therefore, putative G4 and triplexes were searched for in all the parvoviral TRs. G4 are non-canonical DNA secondary structures formed by G-rich sequences (Figure 3a). Present in human telomeres, they are suggested to participate in chromosome stability maintenance [12]. G4 have also been shown to be present and to play major roles in almost all virus families [13]. G4 have been described in some parvovirus telomeres [14] but have not been systematically predicted in all parvovirus ends. G4 were predicted using the online tool QGRS Mapper [15] with the following search parameters: QGRS max length 45, min G-group size 2, loop size from 0 to 12. These criteria are deliberately stringent to increase the relevance of the G4 prediction. Three values were collected: the raw number of predicted G4, with and without overlaps, and the QGRS max score, which rewards the G4 that are more likely to form. The erythroparvovirus B19V contains four G4 without overlaps, which represents the maximum number of these non-B motifs for a parvovirus TR. Including overlaps, CeDV and SifDV TRs harbor the highest number of putative tetraplex DNA structures, with 296 G4. Of note, the two brevidensoviruses lack any predicted G4 in their ends. No correlation exists between the length of the TR and the number of predicted G4 (data not shown).
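The search parameters above describe a motif of four G-groups separated by three loops. The following sketch shows how such candidate motifs could be located without overlaps using a regular expression. It is a simplified illustration under stated assumptions, not the QGRS Mapper algorithm: the function name is hypothetical, all G-groups have a fixed size, any additional rules of the tool are ignored, and no G-score is computed.

```python
import re


def find_g4_candidates(seq: str, g_size: int = 2, max_loop: int = 12,
                       max_len: int = 45):
    """Return non-overlapping (start, motif) candidates of the form
    G{g} loop G{g} loop G{g} loop G{g}, with loops of 0..max_loop bases."""
    run = "G" * g_size
    loop = rf"[ACGT]{{0,{max_loop}}}?"  # lazy: prefer the shortest loops
    pattern = re.compile(f"{run}{loop}{run}{loop}{run}{loop}{run}")
    hits = []
    for match in pattern.finditer(seq.upper()):
        if len(match.group()) <= max_len:
            hits.append((match.start(), match.group()))
    return hits
```

Counting motifs "with overlaps", as done in the chapter, would require enumerating every valid combination of G-groups and loops rather than the leftmost non-overlapping matches returned here.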

Figure 3.

Two non-B secondary structures: G-quadruplex (G4) and triplex. (a) 3D representation of a theoretical G4. (b) 3D representation of a G4 in the parvovirus B19 five-prime telomere and correspondence to the G tetrads sequence (c). (d) Triplex theoretical 2D representation. (e) 3D representation of a triplex of the five-prime bovine AAV terminal repeat and (f) correspondence to the sequence. (b) and (e) figures were obtained using the Jmol software (Jmol: An open-source Java viewer for chemical structures in 3D. http://www.jmol.org/).

In parallel, triplexes are important non-B form DNA structures for protein recognition, such as for the binding of the p53 factor [16]. Triplexes can form at homopurine:homopyrimidine sequences with mirror symmetry (Figure 3d). The triplex package of the R program was used to predict the existence of intramolecular triplex DNA structures in parvovirus TRs [17]. Only two triplexes were found, both in the ends of the dependoparvovirus BAAV.

Finally, a PCA was performed on the forty left TRs with the following variables: length, GC content, shape, max G-score, raw number of G4 with overlaps, and secondary structure elements (hairpin loops, interior loops, junction loops, and stems) collected from the RNAfold analysis. The R package FactoMineR was used [18]. The main PCA variables are the stems and loops (Figure 4a). The “hairpin loops” criterion is one of the most important elements allowing the division of parvoviruses into groups, hence the relevance of our proposed classification in shapes H1 to H4. Clustering was subsequently performed on the three most informative dimensions, corresponding to more than 70% of the cumulative variance. Five clusters were obtained (Figure 4b).
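The choice of the number of dimensions retained for clustering follows from the cumulative share of variance carried by the principal components. As a minimal sketch, assuming the eigenvalues of the PCA are already available (for example from the FactoMineR output), the smallest number of dimensions exceeding a given threshold can be found as follows. The function name and the eigenvalues in the usage note are hypothetical and are not results from this study.

```python
def dimensions_for_variance(eigenvalues, threshold=0.70):
    """Return (k, cumulative_share): the smallest number k of leading
    components whose cumulative share of variance exceeds the threshold."""
    total = sum(eigenvalues)
    if total <= 0:
        raise ValueError("eigenvalues must sum to a positive value")
    cumulative = 0.0
    ordered = sorted(eigenvalues, reverse=True)
    for k, value in enumerate(ordered, start=1):
        cumulative += value / total
        if cumulative > threshold:
            return k, cumulative
    return len(ordered), cumulative
```

With made-up eigenvalues such as `[4.0, 2.5, 1.5, 1.0, 0.6, 0.4]`, the first two components carry 65% of the variance and the first three carry 80%, so three dimensions would be retained.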

Figure 4.

Principal component analysis (PCA) conducted on the five-prime telomere repeats data set. The PCA was conducted using the R software and the Factoshiny package. (a) Contribution graph of each variable. (b) Clustering of the 40 parvovirus TRs. (c) Correspondence of each parvovirus in the 5 clusters.

Cluster 1, composed of individuals such as HBoV1 and AAV2, one of the most famous parvoviruses in the gene therapy field, is characterized by a high value for the variable “GC content.” Parvovirus B19 belongs to cluster 2, a group characterized by high values for the G4 scores and TR length. Cluster 3 mainly depends on the shape class. Individuals in cluster 4 hold a similar number of multiloops and hairpin loops. Finally, viruses in cluster 5 share many common DNA structure features (stems, interior loops, hairpin loops, multiloops, and length). Clusters do not perfectly correlate with the phylogenetic classification (Figure 4c). However, we observed that cluster 1 is only composed of Dependoparvovirus and Bocaparvovirus, and that cluster 5 contains two Ambidensovirus, GmDV and PsiDV. Interestingly, the latter cluster differs markedly from the other groups.


3. Focus on adeno-associated virus (AAV) inverted terminal repeats (ITR)

The use of vectors derived from the adeno-associated virus (AAV) for gene delivery has met with growing success in the treatment of a variety of human diseases [19]. Nevertheless, the scientific community has recently faced tragic toxicity of AAV vectors administered intravenously at high doses in several clinical trials [20]. AAV vectors are generated by inserting a recombinant genome, usually flanked by AAV2 inverted terminal repeats (ITRs), in an AAV capsid. The recent side effects observed in human trials have raised the question of DNA sensing, in particular ITR detection and subsequent cellular responses [21]. Considering the importance of providing new knowledge in this field, a special focus on the AAV2 ITR was included in our study.

The homotelomeric AAV2 possesses two identical ITRs, each 145 nucleotides long. The first 125 bases contain three palindromic sequences allowing the ITR to form a T-shaped structure composed of two small inverted repeat sequences (BB′ and CC′) and a larger repeated sequence (AA′) (Figure 5b). According to our analysis, the AAV2 ITR belongs to group H2 (Figure 5c). A fourth proximal region called D remains single-stranded unless it is annealed to the strand of opposite polarity or, in an intramolecular manner, to the D′ region in 3′. Each ITR can be found in two alternative configurations termed “flip” and “flop,” distinguishable by the BB′-CC′ orientation (Figure 5a), as a direct result of the replication mechanism. There are nomenclature inconsistencies in the literature. Here, ITR regions are named based on Lusby and Berns’ publication [22] and ordered as follows: ABB′CC′A′D from 5′ to 3′.

Figure 5.

Inverted terminal repeats of the adeno-associated virus (AAV) serotype 2. (a) Scheme of five-prime and three-prime ITRs of the wild-type AAV serotype 2. (b) Two-dimensional drawing of the five-prime ITR in flip configuration. (c) Predictive folding of the five-prime AAV2 ITR using RNAfold. The color code is the same as for Figure 2 (http://rna.tbi.univie.ac.at/cgi-bin/RNAWebSuite/RNAfold.cgi). (d) Putative human transcription factor binding sites in AAV2 ITR. ITR regions are named based on Lusby and Berns’ publication [22].

For historical reasons and for the sake of convenience, most AAV vectors contain the ITRs of AAV serotype 2, the sole viral sequences required for the replication and packaging of the recombinant genome in AAV capsids. Additional functions of the AAV2 ITR have been described, such as promoter activity [23] and a role in virus persistence, either through genome integration [24] or through recombination to form monomeric or concatemeric episomes [25].

Strikingly, the GC content of the AAV2 ITR corresponds to the highest score of all studied parvoviral telomeres (69.7%). No predicted G4 was found using stringent parameters, unlike Satkunanathan et al., who described 18 QGRS inside the AAV2 ITR sequences [14]. In addition, no potential triplex structure was found. According to the PCA (Figure 4), AAV2 belongs to cluster 1 with several other Dependoparvoviruses.

Putative binding sites for human transcription factors (TF) have already been described in the AAV2 ITR [26, 27]. We completed this work by analyzing human TF binding sites in the ITRs of AAV serotypes 1 to 7 using the Alggen-Promo tool with 0% sequence dissimilarity (Table 2) [28, 29]. Five human TF sites were found in AAV ITRs: C/EBPbeta, Pax-5, YY1, AP-2alphaA, and GR-alpha. C/EBPbeta was found in most of the AAV serotypes and, unlike the other TFs identified, is mainly involved in immune responses. In our study, GR-alpha was found only in the AAV5 ITR. The AAV2 ITR contains two predicted TF binding sites, one for C/EBPbeta and the other for YY1 (Figure 5d). YY1 participates in the initial steps of replication by binding to the p5 promoter region of AAV [30, 31].

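| Serotype | Accession number | C/EBPbeta [T00581] | Pax-5 [T00070] | YY1 [T00915] | AP-2alphaA [T00035] | GR-alpha [T00337] |
|---|---|---|---|---|---|---|
| 1 | NC_002077 | 2 | 2 | 1 | 1 | 0 |
| 2 | NC_001401 | 1 | 0 | 1 | 0 | 0 |
| 3A | JB292182.1 | 1 | 2 | 2 | 0 | 0 |
| 3B | AF028705.1 | 1 | 2 | 1 | 0 | 0 |
| 4 | NC_001829 | 1 | 0 | 1 | 0 | 0 |
| 5 | NC_006152.1 | 2 | 0 | 0 | 0 | 2 |
| 6 | AF028704 | 1 | 0 | 1 | 0 | 0 |
| 7 | NC_006260 | 0 | 2 | 1 | 1 | 0 |
| AAV_CHC1017 | MK139265.1 | 1 | 0 | 1 | 0 | 0 |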

Table 2.

Putative recognition sites of human transcription factors in inverted terminal repeats of adeno-associated virus serotypes. The values are the numbers of predicted sites for each transcription factor. Predictions were made using the Alggen-Promo tool with 0% sequence dissimilarity (http://alggen.lsi.upc.es/cgi-bin/promo_v3/promo/promoinit.cgi?dirDB=TF_8.3).


4. Conclusion

The genomic and structural diversity of parvoviruses is today classified by phylogenetic analysis, which shows an expected separation between parvoviruses and densoviruses. However, the robustness of this classification is relative, suggesting that the introduction of new sequences could change our perception of their evolutionary history [32]. The diversity of sequences, structures, and genomic organizations of parvoviruses suggests evolutionary histories that are probably more complex than those illustrated by current phylogenies. These observations led us to analyze and characterize the intriguing terminal sequences present in all parvoviruses, namely the telomeres.

This chapter highlights the diversity of Parvoviridae telomeres through a complete analysis of the folding and secondary structures of the terminal ends. Their length, GC content, and global shape vary even within a genus, between phylogenetically closely related viruses. Evolution also led to heterotelomeric viruses with completely different left and right extremities. This diversity suggests that these particular structures are of high importance. Yet, the factors involved in the selective pressure on TRs are unknown. Cotmore and Tattersall suggested a link between the resolution mechanism, the strand polarity, and the TR conformation [4, 33], while Tijssen et al. suggested that the significant differences in size and secondary structure of genome ends between genera might reflect a dependence on specific cellular factors necessary for replication and encapsidation [34]. Consistently, we hypothesize that TRs may have evolved according to their interactions with the viral replicase, helper virus co-factors, and/or host cell proteins. Based on the integration of predicted DNA secondary structures in a PCA, new groups were made that are distinct from the ICTV phylogenetic classification built from the NS replicase sequences.

Additionally, the significance of specific secondary structures in the parvovirus life cycle and their relation with the strand polarity of the packaged linear genome are interesting topics deserving further investigation. The MVM, canine parvovirus (CPV), BPV1 (Parvovirinae), and the AalDV2 (Densovirinae) encapsidate only or predominantly negative-strand polarity genomes and possess heterotelomeric TRs [35, 36]. On the contrary, the homotelomeric AAV2 encapsidates both strand polarities at the same level. By having different shapes and different secondary structure elements, the TRs may directly impact the polarity of the encapsidated strand.

Finally, a special emphasis was put on the ITRs of the adeno-associated virus serotype 2, taking into consideration its importance in the field of gene transfer using viral vectors. Particular motifs and secondary structures within AAV ITRs may have a significant impact on gene transfer efficiency. Indeed, it has already been demonstrated that AAV2 ITRs are detected by cellular factors belonging to the NHEJ and HR DNA damage pathways [37]. The viral telomeres may also be recognized by DNA sensors, which subsequently could restrict AAV vector transduction or activate innate immune responses [21]. Consistent with this hypothesis, a variety of cellular proteins have been shown to interact with the AAV2 ITR, such as nucleophosmin (NPM1), a protein involved in ribosome biogenesis and nucleolar transport of basic proteins. Notably, NPM1 binds preferentially to G4. The restriction factor FKBP52 in its phosphorylated form also binds to the ITR in the D region, inhibits second-strand synthesis, and consequently decreases transgene expression [38]. Thus, ITR recognition by cellular factors is central to understanding the extent of subsequent responses to the rAAV DNA, which can negatively impact therapeutic gene expression and cause potential safety concerns for patients. Using stringent parameters, no putative G4 or triplex was found in the AAV2 ITR, contrary to a previous study [14]. The formation of these non-conventional DNA motifs highly depends on the adjacent sequences as well as on pH and ion concentration conditions, and thus must be confirmed experimentally.


Acknowledgments

The authors would like to thank Judith Penzes for the initial discussion about phylogeny.

This work was supported by Nantes University Hospital.


Conflict of interest

The authors declare no conflict of interest.


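| Subfamily | Genus | Virus name | Abbreviation | Accession number | 5′ TR length | 5′ TR shape | 3′ TR length | 3′ TR shape | Reference used for TR annotation |
|---|---|---|---|---|---|---|---|---|---|
| Parvovirinae | Aveparvovirus | Chicken parvovirus ABU-P1 | ChiPV | GU214704.1 | 206 | H4 | 206 | H4 | [39] |
| Parvovirinae | Bocaparvovirus | Bovine parvovirus-1 strain Abinanti | BPV1 | NC038895 | 161 | H2 | 161 | H1 | [40] |
| Parvovirinae | Bocaparvovirus | Human bocavirus 1 | HBoV1 | JQ923422 | 140 | H1 | 200 | H1 | [41] |
| Parvovirinae | Dependoparvovirus | Adeno-associated virus 2 | AAV-2 | NC_001401 | 145 | H2 | 145 | H2 | [42] |
| Parvovirinae | Dependoparvovirus | Adeno-associated virus 5 | AAV-5 | NC_006152.1 | 167 | H2 | 167 | H2 | [43] |
| Parvovirinae | Dependoparvovirus | Adeno-associated virus isolate MHH-05-2015 | AAV-MHH | NC040671 | 174 | H2 | 174 | H2 | [44] |
| Parvovirinae | Dependoparvovirus | Adeno-associated virus-Go.1 (caprine) | AAV-Go1 | DQ335246 | 167 | H2 | 167 | H2 | [45] |
| Parvovirinae | Dependoparvovirus | Avian adeno-associated virus ATCC VR-865 | AAAV | NC004828 | 142 | H2 | 142 | H2 | [46] |
| Parvovirinae | Dependoparvovirus | Avian adeno-associated virus strain DA-1 | AAV-DA1 | NC006263 | 142 | H2 | 143 | H2 | [47] |
| Parvovirinae | Dependoparvovirus | Avian adeno-associated virus strain YZ-1 | AAV-YZ1 | GQ368252 | 141 | H2 | 141 | H2 | [48] |
| Parvovirinae | Dependoparvovirus | Bearded dragon parvovirus | BDPV | NC027429 | 257 | H2 | 257 | H2 | [49] |
| Parvovirinae | Dependoparvovirus | Bovine AAV | BAAV | NC005889 | 172 | H2 | 172 | H2 | [50] |
| Parvovirinae | Dependoparvovirus | Duck parvovirus strain FJM3 | DuPV-FJM3 | KR075690 | 359 | H1 | 359 | H1 | [51] |
| Parvovirinae | Dependoparvovirus | Duck parvovirus strain M8 | DuPV-M8 | KR029614 | 387 | H1 | 387 | H1 | [52] |
| Parvovirinae | Dependoparvovirus | Duck parvovirus strain NMZJD110 | DuPV-NMZJ | KR075691.1 | 415 | H1 | 415 | H1 | [52] |
| Parvovirinae | Dependoparvovirus | Goose parvovirus | GPV | U25749 | 444 | H1 | 444 | H1 | [52] |
| Parvovirinae | Dependoparvovirus | Muscovy duck parvovirus FM | DPV | NC_006147.2 | 457 | H1 | 455 | H1 | [51] |
| Parvovirinae | Dependoparvovirus | Muscovy duck parvovirus YY | MudPV-YY | KU844281 | 452 | H1 | 452 | H1 | [51] |
| Parvovirinae | Dependoparvovirus | Serpentine adeno-associated virus | SAAV | NC006148 | 154 | H2 | 154 | H2 | [53] |
| Parvovirinae | Erythroparvovirus | B19 virus isolate J35 | B19V | AY386330.1 | 383 | H1 | 383 | H1 | [54] |
| Parvovirinae | Erythroparvovirus | Simian adeno-associated virus | SPV | KT984498 | 94 | H3 | 95 | H3 | [55] |
| Parvovirinae | Unclassified | Porcine partetravirus strain FMV10-1437266 | PoPTV | NC022104.1 | 210 | H2 | 210 | H2 | [56] |
| Densovirinae | Ambidensovirus | Acheta domestica densovirus | AaDV | HQ827781 | 144 | H1 | 144 | H1 | [57] |
| Densovirinae | Ambidensovirus | Blattella germanica densovirus 1 | BgDV1 | NC005041 | 217 | H1 | 216 | H1 | [58] |
| Densovirinae | Ambidensovirus | Culex pipiens densovirus | CpDV | NC012685 | 285 | H1 | 285 | Unclassified | [59] |
| Densovirinae | Ambidensovirus | Diaphorina citri densovirus | DicDV | NC030296.1 | 210 | H1 | 210 | H1 | [60] |
| Densovirinae | Ambidensovirus | Galleria mellonella densovirus | GmDV | NC_004286 | 550 | H2 | 550 | H2 | [61] |
| Densovirinae | Ambidensovirus | Planococcus citri densovirus | PcDV | NC004289.1 | 122 | H2 | 122 | Unclassified | [62] |
| Densovirinae | Ambidensovirus | Pseudoplusia includens densovirus | PsiDV | NC019492.1 | 540 | H2 | 540 | H2 | [63] |
| Densovirinae | Brevidensovirus | Aedes albopictus densovirus 2 | AalDV2 | NC004285.1 | 82 | H2 | 134 | H2 | [64] |
| Densovirinae | Brevidensovirus | Anopheles gambiae densonucleosis virus | AgDV | NC_011317.1 | 98 | H2 | 165 | H2 | [65] |
| Densovirinae | Iteradensovirus | Bombyx mori densovirus 1 | BmDV1 | NC003346.1 | 230 | H1 | 230 | H1 | [66] |
| Densovirinae | Iteradensovirus | Casphalia extranea densovirus | CeDV | NC004288.1 | 230 | H1 | 230 | H1 | [67] |
| Densovirinae | Iteradensovirus | Danaus plexippus plexippus iteravirus isolate Granby | DapDV | NC023842 | 239 | H1 | 239 | H1 | [68] |
| Densovirinae | Iteradensovirus | Dendrolimus punctatus densovirus | DpDV | NC006555.1 | 200 | H2 | 200 | H2 | [69] |
| Densovirinae | Iteradensovirus | Helicoverpa armigera densovirus | HaDV | NC015718 | 101 | H4 | 101 | H1 | [70] |
| Densovirinae | Iteradensovirus | Papilio polyxenes densovirus | PpDV | NC018450.1 | 271 | H1 | 271 | H1 | [71] |
| Densovirinae | Iteradensovirus | Sibine fusca densovirus | SifDV | NC018399.1 | 230 | H1 | 230 | H1 | [72] |
| Densovirinae | Unassigned | Acheta domesticus mini ambidensovirus isolate Kalamazoo | AdMADV | NC022564.1 | 199 | H2 | 199 | H2 | [73] |
| Unclassified Parvoviridae | | Mouse kidney parvovirus strain Centenary Institute | MokPV | NC040843.1 | 145 | H1 | 118 | H1 | [74] |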

Table S1.

List of the forty Parvoviridae terminal repeats analyzed in the study.

The phylogenetic classification used here refers to the most up-to-date classification from the International Committee on Taxonomy of Viruses (ICTV), published in 2020. Abbreviations were taken from the literature; when none existed, they were created by taking the first letters of the virus name. The TR shapes were annotated according to the classification proposed in this chapter. The reference used for TR annotation does not always match the first citation of the virus.

References

  1. 1. Deng Z, Wang Z, Lieberman PM. Telomeres and viruses: Common themes of genome maintenance. Frontiers in Oncology. 2012;2:201
  2. 2. Pénzes JJ, Söderlund-Venermo M, Canuti M, Eis-Hübinger AM, Hughes J, Cotmore SF, et al. Reorganizing the family Parvoviridae: A revised taxonomy independent of the canonical approach based on host association. Archives of Virology. 2020;165(9):2133-2146
  3. 3. King AMQ, Adams MJ, Carstens EB, Lefkowitz EJ. Family - Parvoviridae. In: Virus Taxonomy. San Diego: Elsevier; 2012. pp. 405-425
  4. 4. Cotmore SF, Tattersall P. Parvovirus diversity and DNA damage responses. Cold Spring Harbor Perspectives in Biology. 2013;5(2)
  5. 5. Faisst S, Perros M, Deleu L, Spruyt N, Rommelaere J. Mapping of upstream regulatory elements in the P4 promoter of parvovirus minute virus of mice. Virology. 1994;202(1):466-470
  6. 6. Pham HT, Yu Q, Bergoin M, Tijssen P. A novel Ambisense Densovirus, Acheta domesticus Mini Ambidensovirus, from crickets. Genome Announcements. 2013;1(6)
  7. 7. Lorenz R, Bernhart SH, Höner zu Siederdissen C, Tafer H, Flamm C, Stadler PF, et al. ViennaRNA Package 2.0. Algorithms for Molecular Biology. 2011;6(1):26
  8. 8. Piovesan A, Pelleri MC, Antonaros F, Strippoli P, Caracausi M, Vitale L. On the length, weight and GC content of the human genome. BMC Research Notes. 2019;12(1):106
  9. 9. Zuker M. Mfold web server for nucleic acid folding and hybridization prediction. Nucleic Acids Research. 2003;31(13):3406-3415
  10. 10. Hároníková L, Coufal J, Kejnovská I, Jagelská EB, Fojta M, Dvořáková P, et al. IFI16 preferentially binds to DNA with Quadruplex structure and enhances DNA Quadruplex formation. PLoS One. 2016;11(6):e0157156
  11. 11. Ma Z, Ni G, Damania B. Innate sensing of DNA virus genomes. Annual Review of Virology. 2018;5(1):341-362
  12. 12. Huppert JL. Structure, location and interactions of G-quadruplexes. The FEBS Journal. 2010;277(17):3452-3458
  13. 13. Ruggiero E, Richter SN. Viral G-quadruplexes: New frontiers in virus pathogenesis and antiviral therapy. Annual Reports in Medicinal Chemistry. 2020;54:101-131
  14. 14. Satkunanathan S, Thorpe R, Zhao Y. The function of DNA binding protein nucleophosmin in AAV replication. Virology. 2017;510:46-54
  15. 15. Kikin O, D’Antonio L, Bagga PS. QGRS mapper: A web-based server for predicting G-quadruplexes in nucleotide sequences. Nucleic Acids Research. 2006;34(suppl_2):W676-W682
  16. 16. Brázdová M, Tichý V, Helma R, Bažantová P, Polášková A, Krejčí A, et al. p53 specifically binds triplex DNA In vitro and in cells. PLoS One. 2016;11(12):e0167439
  17. 17. Lexa M, Martínek T, Burgetová I, Kopeček D, Brázdová M. A dynamic programming algorithm for identification of triplex-forming sequences. Bioinformatics. 2011;27(18):2510-2517
  18. 18. Lê S, Josse J, Husson F. FactoMineR: An R package for multivariate analysis. Journal of Statistical Software. 2008;25(1):1-18
  19. 19. Samulski RJ, Muzyczka N. AAV-mediated gene therapy for research and therapeutic purposes. Annual Review of Virology. 2014;1(1):427-451
  20. 20. Wilson JM, Flotte TR. Moving forward after two deaths in a gene therapy trial of Myotubular myopathy. Human Gene Therapy. 2020;31(13–14):695-696
  21. 21. Muhuri M, Maeda Y, Ma H, Ram S, Fitzgerald KA, Tai PWL, et al. Overcoming innate immune barriers that impede AAV gene therapy vectors. The Journal of Clinical Investigation. 2021;131(1)
  22. 22. Lusby E, Fife KH. Nucleotide sequence of the inverted terminal repetition in adeno-associated virus DNA. Journal of Virology. 1980;34:8
  23. 23. Earley LF, Conatser LM, Lue VM, Dobbins AL, Li C, Hirsch ML, et al. Adeno-associated virus serotype-specific inverted terminal repeat sequence role in vector transgene expression. Human Gene Therapy. 2020;31(3–4):151-162
  24. 24. Samulski RJ, Zhu X, Xiao X, Brook JD, Housman DE, Epstein N, et al. Targeted integration of adeno-associated virus (AAV) into human chromosome 19. The EMBO Journal. 1991;10(12):3941-3950
  25. 25. Duan D, Sharma P, Yang J, Yue Y, Dudus L, Zhang Y, et al. Circular intermediates of recombinant adeno-associated virus have defined structural characteristics responsible for long-term episomal persistence in muscle tissue. Journal of Virology. 1998;72:10
  26. 26. Ling C, Wang Y, Lu Y, Wang L, Jayandharan GR, Aslanidi GV, et al. Enhanced transgene expression from recombinant single-stranded D-sequence-substituted adeno-associated virus vectors in human cell lines in vitro and in murine hepatocytes in vivo. Journal of Virology. 2015;89(2):952-961
  27. 27. Julien L, Chassagne J, Peccate C, Lorain S, Piétri-Rouxel F, Danos O, et al. RFX1 and RFX3 transcription factors interact with the D sequence of adeno-associated virus inverted terminal repeat and regulate AAV transduction. Scientific Reports. 2018;8(1):210
  28. 28. Messeguer X, Escudero R, Farré D, Núñez O, Martínez J, Albà MM. PROMO: Detection of known transcription regulatory elements using species-tailored searches. Bioinformatics. 2002;18(2):333-334
  29. 29. Farré D, Roset R, Huerta M, Adsuara JE, Roselló L, Albà MM, et al. Identification of patterns in biological sequences at the ALGGEN server: PROMO and MALGEN. Nucleic Acids Research. 2003;31(13):3651-3653
  30. 30. Houbaviy HB, Burley SK. Thermodynamic analysis of the interaction between YY1 and the AAV P5 promoter initiator element. Chemistry & Biology. 2001;8(2):179-187
  31. 31. Murphy M, Gomos-Klein J, Stankic M, Falck-Pedersen E. Adeno-associated virus type 2 p5 promoter: A rep-regulated DNA switch element functioning in transcription, replication, and site-specific integration. Journal of Virology. 2007;81(8):3721-3730
  32. 32. Grenet A-SG, Salasc F, Francois S, Mutuel D, Dupressoir T, Multeau C, et al. Les densovirus : une « massive attaque » chez les arthropodes. Virologie. 2015;19(1):19-31
  33. 33. Cotmore SF, Tattersall P. Parvoviruses: Small does not mean simple. Annual Review of Virology. 2014;1(1):517-537
  34. 34. Tijssen P, Pénzes JJ, Yu Q, Pham HT, Bergoin M. Diversity of small, single-stranded DNA viruses of invertebrates and their chaotic evolutionary past. Journal of Invertebrate Pathology. 2016;140:83-96
  35. 35. Cotmore SF, Tattersall P. Genome packaging sense is controlled by the efficiency of the nick site in the right-end replication origin of parvoviruses minute virus of mice and LuIII. Journal of Virology. 2005;79(4):2287-2300
  36. 36. Yu Y, Zhang J, Wang J, Xi J, Zhang X, Li P, et al. Naturally-occurring right terminal hairpin mutations in three genotypes of canine parvovirus (CPV-2a, CPV-2b and CPV-2c) have no effect on their growth characteristics. Virus Research. 2019;261:31-36
  37. 37. Adachi K, Nakai H. The Role of DNA Repair Pathways in Adeno-Associated Virus Infection and Viral Genome Replication / Recombination / Integration. DNA Repair and Human Health. 2011. Available from: https://www.intechopen.com/books/dna-repair-and-human-health/the-role-of-dna-repair-pathways-in-adeno-associated-virus-infection-and-viral-genome-replication-rec
  38. 38. Qing K, Hansen J, Weigel-Kelley KA, Tan M, Zhou S, Srivastava A. Adeno-associated virus type 2-mediated gene transfer: Role of cellular FKBP52 protein in transgene expression. Journal of Virology. 2001;75(19):8968-8976
  39. 39. Day JM, Zsak L. Determination and Analysis of the Full-Length Chicken Parvovirus Genome. Virology. 2010;399(1):59-64. DOI: 10.1016/j.virol.2009.12.027
  40. 40. Qiu J, Cheng F, Pintel D. Molecular Characterization of Caprine Adeno-Associated Virus (AAV-Go.1) Reveals Striking Similarity to Human AAV5. Virology. 2006;356(1):208-216. DOI: 10.1016/j.virol.2006.07.024
  41. 41. Shen W et al. Identification and Functional Analysis of Novel Nonstructural Proteins of Human Bocavirus 1, edited by M. J. Imperiale. Journal of Virology. 2015;89(19):10097-10109. DOI: 10.1128/JVI.01374-15
  42. 42. Lusby E, Fife KH. Nucleotide Sequence of the Inverted Terminal Repetition in Adeno-Associated Virus DNA. Journal of Virology. 1980;34:8
  43. 43. Chiorini JA et al. Cloning and Characterization of Adeno-Associated Virus Type 5. Journal of Virology. 1999;73(2):1309-1319. DOI: 10.1128/JVI.73.2.1309-1319.1999
  44. 44. Su XN et al. Isolation and Genetic Characterization of a Novel Adeno-Associated Virus from Muscovy Ducks in China. Poultry Science. 2017;96(11):3867-3871. DOI: 10.3382/ps/pex235
  45. 45. Qiu J, Cheng F, Pintel DJ. Expression Profiles of Bovine Adeno-Associated Virus and Avian Adeno-Associated Virus Display Significant Similarity to That of Adeno-Associated Virus Type 5. Journal of Virology. 2006;80(11):5482-5493. DOI: 10.1128/JVI.02735-05
  46. 46. Bossis I, Chiorini JA. Cloning of an Avian Adeno-Associated Virus (AAAV) and Generation of Recombinant AAAV Particles. Journal of Virology. 2003;77(12):6799-6810. DOI: 10.1128/JVI.77.12.6799-6810.2003
  47. 47. Estevez C, Villegas P. Sequence Analysis, Viral Rescue from Infectious Clones and Generation of Recombinant Virions of the Avian Adeno-Associated Virus. Virus Research. 2004;105(2):195-208. DOI: 10.1016/j.virusres.2004.05.010
  48. 48. Wang J et al. Molecular Characterization and Phylogenetic Analysis of an Avian Adeno-Associated Virus Originating from a Chicken in China. Archives of Virology. 2011;156(1):71-77. DOI: 10.1007/s00705-010-0822-x
  49. 49. Pénzes JJ et al. Novel parvoviruses in reptiles and genome sequence of a lizard parvovirus shed light on Dependoparvovirus genus evolution. Journal of General Virology. 2015;96(9):2769-2779. DOI: 10.1099/vir.0.000215
  50. 50. Schmidt M et al. Cloning and Characterization of a Bovine Adeno-Associated Virus. Journal of Virology. 2004;78(12):6509-6516. DOI: 10.1128/JVI.78.12.6509-6516.2004
  51. 51. Zádori Z et al. Analysis of the complete nucleotide sequences of goose and Muscovy duck parvoviruses indicates common ancestral origin with adeno-associated virus 2. Virology. 1995;212(2):562-573. DOI: 10.1006/viro.1995.1514
  52. 52. Wang J et al. Molecular characterization of a novel Muscovy duck parvovirus isolate: evidence of recombination between classical MDPV and goose parvovirus strains. BMC Veterinary Research. 2017;13. DOI: 10.1186/s12917-017-1238-6
  53. 53. Farkas SL. A parvovirus isolated from royal python (Python regius) is a member of the genus Dependovirus. Journal of General Virology. 2004;85(3):555-561. DOI: 10.1099/vir.0.19616-0
  54. 54. Zhi N et al. Construction and sequencing of an infectious clone of the human parvovirus B19. Virology. 2004;318(1):142-152. DOI: 10.1016/j.virol.2003.09.011
  55. 55. Kapusinszky B et al. Case-control comparison of enteric viromes in captive rhesus macaques with acute or idiopathic chronic diarrhea. Journal of Virology. 2017;91(18). DOI: 10.1128/JVI.00952-17
  56. 56. Bellehumeur C et al. High-throughput sequencing revealed the presence of an unforeseen parvovirus species in Canadian swine: The porcine partetravirus. The Canadian Veterinary Journal. 2013;54(8):787-789
  57. 57. Szelei J et al. Susceptibility of North-American and European crickets to Acheta domesticus densovirus (AdDNV) and associated epizootics. Journal of Invertebrate Pathology. 2011;106(3):394-399. DOI: 10.1016/j.jip.2010.12.009
  58. 58. Mukha DV et al. Characterization of a new densovirus infecting the German cockroach, Blattella germanica. Journal of General Virology. 2006;87(6):1567-1575. DOI: 10.1099/vir.0.81638-0
  59. 59. Baquerizo-Audiot E et al. Structure and expression strategy of the genome of Culex pipiens densovirus, a mosquito densovirus with an ambisense organization. Journal of Virology. 2009;83(13):6863-6873. DOI: 10.1128/JVI.00524-09
  60. 60. Nigg JC, Nouri S, Falk BW. Complete genome sequence of a putative densovirus of the Asian citrus psyllid, Diaphorina citri. Genome Announcements. 2016;4(4). DOI: 10.1128/genomeA.00589-16
  61. 61. Tijssen P et al. Organization and expression strategy of the ambisense genome of densonucleosis virus of Galleria mellonella. Journal of Virology. 2003;77(19):10357-10365. DOI: 10.1128/JVI.77.19.10357-10365.2003
  62. 62. Thao ML et al. Genetic characterization of a putative densovirus from the mealybug Planococcus citri. Current Microbiology. 2001;43(6):457-458. DOI: 10.1007/s002840010339
  63. 63. Huynh OTH et al. Pseudoplusia includens densovirus genome organization and expression strategy. Journal of Virology. 2012;86(23):13127-13128. DOI: 10.1128/JVI.02462-12
  64. 64. Boublik Y, Jousset F-X, Bergoin M. Complete nucleotide sequence and genomic organization of the Aedes albopictus parvovirus (AaPV) pathogenic for Aedes aegypti larvae. Virology. 1994;200(2):752-763. DOI: 10.1006/viro.1994.1239
  65. 65. Ren X, Hoiczyk E, Rasgon JL. Viral paratransgenesis in the malaria vector Anopheles gambiae. PLoS Pathogens. 2008;4(8). DOI: 10.1371/journal.ppat.1000135
  66. 66. Bando H et al. Genome organization of the densovirus from Bombyx mori (BmDNV-1) and enzyme activity of its capsid. Journal of General Virology. 2001;82(11):2821-2825. DOI: 10.1099/0022-1317-82-11-2821
  67. 67. Fédière G et al. Genome organization of Casphalia extranea densovirus, a new iteravirus. Virology. 2002;292(2):299-308. DOI: 10.1006/viro.2001.1257
  68. 68. Yu Q, Tijssen P. Iteradensovirus from the monarch butterfly, Danaus plexippus plexippus. Genome Announcements. 2014;2(2). DOI: 10.1128/genomeA.00321-14
  69. 69. Wang J et al. Nucleotide sequence and genomic organization of a newly isolated densovirus infecting Dendrolimus punctatus. Journal of General Virology. 2005;86(8):2169-2173. DOI: 10.1099/vir.0.80898-0
  70. 70. Pengjun X et al. Complete genome sequence of a monosense densovirus infecting the cotton bollworm, Helicoverpa armigera. Journal of Virology. 2012;86(19):10909. DOI: 10.1128/JVI.01912-12
  71. 71. Qian Y et al. Papilio polyxenes densovirus has an iteravirus-like genome organization. Journal of Virology. 2012;86(17):9534-9535. DOI: 10.1128/JVI.01368-12
  72. 72. Qian Y et al. Iteravirus-like genome organization of a densovirus from Sibine fusca Stoll. Journal of Virology. 2012;86(16):8897-8898. DOI: 10.1128/JVI.01267-12
  73. 73. Pham HT et al. A novel ambisense densovirus, Acheta domesticus mini ambidensovirus, from crickets. Genome Announcements. 2013;1(6). DOI: 10.1128/genomeA.00914-13
  74. 74. Roediger B et al. An atypical parvovirus driving chronic tubulointerstitial nephropathy and kidney fibrosis. Cell. 2018;175(2):530-543.e24. DOI: 10.1016/j.cell.2018.08.013
