The early history of life on Earth likely included a stage in which life existed as self-replicating protocells with single-stranded RNA (ssRNA) genomes. In this RNA world, genome damage from a variety of sources (spontaneous hydrolysis, UV, etc.) would have been a problem for survival. Selection pressure for dealing with genome damage would have led to adaptive strategies for mitigating the damage. In today’s world, RNA viruses with ssRNA genomes are common, and these viruses similarly need to cope with genome damage. Thus ssRNA viruses can serve as models for understanding the early evolution of genome repair. As the ssRNA protocells in the early RNA world evolved, the RNA genome likely gave rise, through a series of evolutionary stages, to the double-stranded DNA (dsDNA) genome. In ssRNA to dsDNA evolution, genome repair processes also likely evolved to accommodate this transition. Some of the basic features of ssRNA genome repair appear to have been retained in descendants with dsDNA genomes. In particular, a type of strand-switching recombination occurs when ssRNA replication is blocked by a damage in the template strand. Elements of this process appear to have a central role in recombinational repair processes during meiosis and mitosis of descendant dsDNA organisms.
- RNA world
- RNA virus
- recombination repair
- synthesis-dependent strand annealing (SDSA)
- DNA repair
- genome damage
- single-stranded RNA
Protocellular organisms may have come into existence 2.5 to 3.5 billion years ago [1, 2]. Woese  proposed that the genomes of the early protocellular forms of life were individual strands of RNA rather than DNA, and that these RNA strands were present as separate genome segments, rather than being linked together end-to-end as is generally the case for genes in DNA. The idea that, during an early period in the evolution of life, genetic information was stored and transmitted solely by RNA molecules has come to be known as the “RNA world hypothesis.” This hypothesis is currently being tested by many investigators. Of particular significance, Horning and Joyce  have demonstrated that the replication of genetic information and its conversion into functional molecules can be accomplished with RNA in the complete absence of protein. RNA molecules with catalytic activity are called ribozymes. An RNA ribozyme developed by Horning and Joyce can act as an RNA polymerase to replicate RNA .
Persistence and replication of even the simplest forms of RNA life must have depended on preserving the information content of the RNA genome from damage (a form of informational noise). Damage to the RNA genome likely occurred in a variety of ways including spontaneous hydrolysis, exposure to UV light and exposure to reactive chemicals. Natural selection would have acted to promote the evolution of RNA sequences that allowed solutions to this problem of informational noise. While free living organisms with ssRNA genomes are unknown in today’s world, viruses with ssRNA genomes are currently common. The present day ssRNA viruses also need to cope with informational noise in the form of damage to their RNA genome. Therefore, such ssRNA viruses can serve as models for understanding the adaptive solutions that early ssRNA protocells may have developed for coping with genome damage. Numerous ssRNA viruses have been shown to be capable of exchanging sequence information between individual genomes within an infected cell . This information exchange, or genetic recombination, can occur by reassortment of genome segments or during genome replication by a process of strand-switching to form a progeny genome with information from two parental genomes. The process of strand-switching is often referred to as “copy-choice” recombination. The term “copy-choice” embodies the idea of template-switching during genome replication, although the term was introduced before the DNA/RNA nature of genetic information was understood. Lederberg  and Bernstein  were among the first to explicitly propose copy-choice mechanisms of recombination. The two recombination processes, segment reassortment and copy-choice, allow the formation of an undamaged progeny genome even when one or both parental genomes contain damage. In the sense that both segment reassortment and copy-choice restore genetic sequence information that is damaged in the parental genomes, these are informational repair processes. Although information is restored in progeny, the parental genomes may retain their physical damage. Thus when “repair” is discussed at the level of ssRNA organisms it is the genetic information content of damaged parental genomes that is restored or “repaired” during formation of the progeny genome.
As the early protocells with RNA genomes evolved they likely went through a series of adaptive transitions that eventually led to the double-stranded DNA (dsDNA) genome. The archaea are a group of prokaryotes with a dsDNA genome that likely evolved prior to the emergence of eukaryotes. These organisms are capable of a process, genetic transformation, during which cells exchange DNA to repair DNA double-strand breaks via homologous recombination . In eukaryotes, during meiosis and mitosis, most recombination events occur by a repair process termed “synthesis-dependent strand annealing” (SDSA)  that is basically a form of copy-choice recombination (see Section 6.1.). In addition, single-strand damages that block the movement of the DNA polymerase during replication can be repaired by a mechanism that includes copy-choice recombination [12, 13]. Thus strand-switching copy-choice mechanisms that likely emerged in early ssRNA protocells appear to have evolved into fundamental processes for maintaining the information content of dsDNA genomes.
While the capability for recombinational repair is retained as a major mechanism for dealing with DNA damages, organisms with a dsDNA genome, including humans, have also evolved other repair processes that take advantage of the duplex nature of the DNA genome . For such organisms, damages in one strand can be repaired by removal of the damaged section and its replacement by copying information from the other strand, as occurs in the well-studied processes of mismatch repair, nucleotide excision repair and base excision repair . Other processes for dealing with DNA damages in organisms with DNA genomes include direct reversal of UV photolesions and alkylated bases, repair of DNA crosslinks by Fanconi anemia proteins, and a mechanism for tolerating damages termed translesion synthesis .
The aim of this review is to outline how genome repair processes emerged in the earliest evolved protocells that likely had RNA genomes, and how these processes further evolved in the transition from the RNA world to the DNA world.
2. Genome repair in the RNA world
Since the actual sequence of evolutionary adaptive events in the RNA world that gave rise to genome repair occurred in organisms that are probably long extinct, and it is unlikely that events at the nucleic acid level are preserved in the fossil record, the sequence of evolutionary events proposed here is necessarily speculative. However, the proposed evolutionary sequence is based on the established activities of extant RNA viruses. These activities are reviewed in sections 2.1, 2.2, 3 and 4. Thus it is assumed, as discussed by Bernstein et al. (pgs. 342-345) , that the adaptations that extant RNA viruses use to repair genome damage can illuminate how early life in the RNA world also coped with genome damage.
In early protocellular organisms the genome is thought to have consisted of ssRNAs (genes) that formed folded structures with catalyic activity (ribozymes) . If two or more such ssRNAs were present in a protocell they presumably functioned interdependently to promote the viability and reproduction of the protocell. A key ribozyme in early protocellular organisms would likely have been a polymerase that could catalyze RNA replication . A persistent problem for early protocellular organisms would probably have been damage to their ssRNA genomes. The damaging stresses on protocellular organisms likely would have included hydrolytic reactions, exposure to UV light and interaction with reactive chemicals in the environment. For example, Sagan  analyzed the flux of solar UV light that penetrated the earth’s primitive reducing atmosphere. His analysis indicated that unprotected microorganisms of the type existing today would receive a mean lethal dose at 2600 angstroms within 0.3 seconds and that this vulnerability could have posed a major problem during the early evolution of life. A protocell that has only one copy of each ssRNA (a haploid protocell) would be very vulnerable to damage, since damage to even one base in a ssRNA sequence might be lethal to the protocell by either blocking replication of the ssRNA or interfering with an essential ssRNA ribozyme function .
One possible adaptation for dealing with genome damage would be to maintain two or more copies of each ssRNA gene in each protocell, yielding a diploid or polyploid state. Genome redundancy would allow replacement of a damaged gene by an additional replication of an undamaged homologous gene. However, for a simple protocellular organism, the proportion of available resource budgeted to the maintenance of two or more genomes would have been a large portion of its total resource budget. When resources are limited, the protocell’s reproductive rate would likely be inversely related to ploidy number. The fitness of the protocell would be diminished by the costs of genome redundancy. Coping with damage to the ssRNA genome while minimizing the costs of genome redundancy would likely have been a fundamental problem in the early evolution of cellular life .
When the costs of maintaining genome redundancy verses the costs of genome damage were balanced against each other in a cost–benefit analysis, it was found that under a wide range of conditions the selected strategy would be for each protocell to be haploid, but to periodically fuse with another haploid protocell to form a transient diploid . This strategy allows the haploid state to be retained to maximize reproductive rate, while the periodic fusions would allow otherwise lethally damaged protocells to be mutually reactivated. Reactivation can occur if at least one undamaged copy of each ssRNA gene is present in the transient diploid and this leads to production of a viable progeny protocell. In order for two (rather than just one) viable progeny protocells to be produced, an extra replication of the gene(s) homologous to damaged gene(s) would have to occur before division of the fused diploid protocell. The process of recovering from potentially lethal damage in one ssRNA genome by reassorting information with another homologous ssRNA genome can be regarded as a primitive form of genome repair [8, 18]. This proposed cycle for coping with genome damage, although hypothetical, is based on the way that ssRNA viruses with segmented RNA genomes deal with genome damage as discussed below in Section 2.1.
The events that contributed to the evolution of genomic repair in ssRNA protocells can also be viewed as an early stage in the evolution of sexual reproduction since these events include the coming together of two genomes from separate parents to generate progeny genomes containing shared genetic information .
2.1 Recombination in influenza virus and hantavirus
Influenza virus (Family
Upon infection, influenza virus induces a host response involving increased production of reactive oxygen species, and this can damage the virus genome . Consider two individual viruses each with a lethal damage in its genome. If either of these viruses infects a host cell the infection aborts and no progeny viruses are produced. However, if these two damaged viruses infect the same host cell, the multiple infection may lead to reactivation (production of viable progeny). This phenomenon is known as “multiplicity reactivation” and is thought to reflect acts of recombination that allow an undamaged genome to be reconstituted from damaged ones . Multiplicity reactivation has been demonstrated in influenza virus infections after induction of RNA damage by UV-irradiation  and ionizing radiation . In these studies, recombination by reassortment of genome segments likely played a role in the observed multiplicity reactivation.
Recombination by reassortment is a simple way of restoring an undamaged genome from multiple lethally damaged genomes and thus is a primitive form of genomic repair. Lehman  has reviewed evidence supporting the view that recombination is an evolutionary development as ancient as the origins of life.
In addition to the role of recombination in genome repair, recombination also has a role in viral evolution by generating new genetic combinations that can be tested by natural selection. An infrequent new genetic combination may be selectively advantageous. However, RNA is very vulnerable to damage. Because of the reactivity of the oxygen and nitrogen atoms of the nucleobases , RNA molecules are especially susceptible to certain types of chemical damage from sources such as reactive oxygen species, UV light, and alkylating agents; and the oxygen atoms of the ribose and the phosphodiester backbone are also vulnerable to chemical damage . In early protocells, repair of RNA genome damage likely provided a considerable and immediate selective advantage while new recombinant genetic combinations may have been adaptively beneficial only infrequently.
2.2 Intragenic recombination in segmented ssRNA influenza virus and hantavirus
In influenza virus infections, genome segment reassortment is not the only mechanism of recombination. Intragenic homologous recombination can also occur between a pair of homologous viral genes . Homologous recombination occurs by template-switching (copy-choice) during viral genome replication .
In addition to influenza viruses, ssRNA hantaviruses are also capable of recombination by both segment reassortment and by homologous recombination .
In the evolution of repair processes in the RNA world, template-switching (copy-choice) recombination was likely an important advance since it allows two damaged homologous genes to generate an undamaged homolog. However, at present there is insufficient evidence available to determine whether copy-choice recombination emerged before or after the emergence of genome segment reassortment as a mechanism of genome repair .
3. Repair of RNA genomes by copy-choice recombination
3.1 Copy-choice recombination
Figure 2 indicates how an accurate undamaged progeny single-stranded genome can be generated from a damaged parental genome by strand-switching (copy-choice) recombination. As shown in this Figure 2, (1) during synthesis of a progeny strand by a replicative polymerase, a damage in the (green) template strand (strand being copied) blocks polymerase progression. (2) If another (orange) homologous template is available, the polymerase may switch templates, thereby bypassing the damage. (3) The newly synthesized strand may then release from the second template strand. (4) The newly synthesized strand can return and pair with the original template. (5) The polymerase may then complete the replication using the original template. (6) These steps can generate a new recombinant genome without damage [9, 34].
3.2 Poliovirus and coronavirus
To regenerate the next generation of (+)ssRNA strands, the (−)ssRNA strands are also copied and this may also be accompanied
When cells are infected by two or more viruses containing genome damage the viruses may undergo multiplicity reactivation. Polioviruses are able to undergo multiplicity reactivation . That is, when polioviruses were irradiated with UV light and then allowed to infect host cells at a multiplicity of two or greater, viable progeny are produced at UV doses that inactivate the virus in single infections. As noted above, multiplicity reactivation occurs in various different virus systems, and has been shown to be a form of recombinational repair [27, 28].
RNA recombination appears to be a major driving force in the evolution of (+)ssRNA coronaviruses. Recombination contributes to genetic variability within a coronavirus species, the capability of a coronavirus species to jump from one host to another and, infrequently, the emergence of a novel coronavirus . The mechanism of recombination in coronaviruses likely involves template-switching during genome replication . Also, the (+)ssRNA plant carmoviruses and tombusviruses frequently undergo recombination by RdRp template-switching (copy-choice) . A key step in the evolution of repair in the RNA world appears to have been the emergence of template-switching (copy-choice) recombination as a major mechanism for dealing with genome damage.
4. Reverse transcription of the RNA genome to DNA in HIV
Human immunodeficiency virus (HIV (Family
During infection of a host cell, genome replication is catalyzed by reverse transcriptase, an RNA-dependent DNA polymerase . During reverse transcription, recombination between the two genomes can occur . The reverse transcriptase can switch between the two parental RNA genomes by copy-choice recombination , and such events may occur throughout the genome. Thus the two infecting genomes from each virus can cooperate to form a complementary negative single-strand DNA copy that has recombined information from the two parental RNA genomes. Recombination is necessary for efficient HIV replication and the maintenance of genome integrity . During each replication cycle, from 5 to 14 recombination events may occur per genome . The recombination events are “clustered” so that one recombination event is correlated with another that is close by. This clustering is apparently caused by correlated template-switches, known as high negative interference, during minus-strand DNA synthesis . That is, once a switch is made from template
After the first single strand DNA copy is synthesized, another round of replication generates a duplex DNA molecule which can integrate into the host DNA genome to form a provirus .
4.1 HIV recombination can sometimes produce genetic variation
Recombination of the viral genomes can introduce genetic variation among progeny HIV that contributes to the evolution of resistance when humans are treated with anti-retroviral therapy . Viral genome recombination may also play a role in overcoming the immune defenses of the human host. The sequence of events necessary to produce genetic variation by recombination that is adaptively beneficial to HIV are considered next.
For an adaptive benefit of genetic variation to be realized, the two RNA genomes contained in an individual infecting virus particle would have to be derived from separate progenitor viruses of differing genetic constitution. In general, only viruses that have packaged two genetically different RNA genomes can produce a recombinant genome with a genotype distinctly different from that of its parents  . For this to occur multiple events are required . These events are: (1) A human host cell would need to be infected by two viruses of genetically different lineages, and the genomes of these two different viruses would have to produce progeny genomes. (2) Two different progeny RNA genomes produced from such an infection would have to be co-packaged into the same progeny virus particle. (3) When this progeny virus infects a new host cell, template-switching would have to occur during reverse transcription to generate a recombinant DNA copy. (4) The recombinant DNA would then need to integrate into the DNA genome of the infected cell. (5) The recombinant provirus would next have to be able to produce replication-competent virus progeny for the impact of the recombination to be observed.
How often cells in HIV patients are infected by more than one HIV (double-infection) is not known, and it is unknown how often mixed packaging occurs under natural conditions [44, 45]. As discussed above, from 5 to 14 strand-switching recombination events occur in each infection cycle. These events, in most cases, occur between genomes with the same genetic constitution. Thus it is apparent that although recombination can, under some circumstances, produce variation that is adaptive, the great majority of recombination events do not produce significant adaptive variation.
4.2 Recombination as a repair process
Infection by HIV results in chronic ongoing inflammation associated with reactive oxygen species production . Thus a strategy for dealing with oxidative damages to the HIV genome would be adaptively beneficial. Each HIV particle contains two homologous templates, rather than one. Temin  considered it likely that recombination is an adaptation for repair of damaged RNA genomes. Also, template-switching by the reverse transcriptase was suggested by Bonhoeffer et al.  to be a repair process for dealing with breaks in the ssRNA genome. Copy-choice recombination by the reverse transcriptase could produce a DNA copy of the genome that is free of damage even if both parental ssRNA copies in each virus are damaged. This benefit of recombination can be realized at each infection cycle even if, as is usually the case, the two genomes do not differ, or are closely similar genetically, and little if any new genetic variation will be produced [9, 45]. If recombination in HIV infections is primarily an adaptation for genome repair, the generation of recombinational variation would be an occasional natural consequence, but not the principle driving force, for the evolution of template-switching .
4.3 HIV as a model for the transition from ssRNA to dsDNA genomes
Early organisms may have evolved through a stage, like HIV, where their genome in the form of ssRNA was replicated to form a hybrid RNA: DNA duplex which upon further replication formed dsDNA. A laboratory evolved RNA polymerase ribozyme that synthesizes RNA has also been shown to act as a reverse transcriptase to synthesize DNA . A ribozyme like this may have evolved in nature and been instrumental in the transition from the RNA to the DNA world. It could have arisen as a secondary function of an RdRp.
While oxidative stress appears to be a principle damaging stress for the HIV genome, the damaging stresses on organisms that were undergoing the early evolutionary transition from RNA to DNA genomes would likely have been different. The genome damages in the transition from RNA to DNA genomes could have arisen, as described above, from hydrolytic reactions, UV light or environmental reactive chemicals, but undoubtedly there would have been some kinds of significant damages. Thus during the transition from the RNA world to the DNA world there was very probably a continuous need to cope with genome damage. The copy-choice mechanisms that had a repair function in the RNA world may have continued to operate as repair functions during the transition to the dsDNA world. The selective pressure of genome damage on genome repair as the genetic material transitioned from RNA to DNA is discussed further in Bernstein et al. (pgs. 342-345) .
5. Recombination in archaea acts in DNA repair
In the previous sections it was proposed that genome repair processes emerged in the RNA world and that, after going through several evolutionary stages, such repair processes were present in organisms with DNA genomes. The archaea are single-celled microorganisms whose genome is DNA. These organisms are regarded as descendants of a form of life that arose subsequent to organisms with RNA genomes but prior to eukaryotes .
The evolution of the eukaryotic cell appears to trace back to the establishment of a symbiotic relationship between a host anaerobic archaeal cell and an internalized bacterium capable of aerobic metabolism . The eukaryotic cell emerged at least 1.5 billion years ago . Eukaryotic genes of archaeal origin appear to have a more central role in basic cellular functions than genes of eubacterial origin . Thus the manner in which present day archaea deal with genome damage may throw light on how genome repair processes that arose in the RNA world became adapted for repair in both the archaeal and the eukaryote DNA world.
Recent findings show that cells of archaeal species, particularly
The repair capabilities of archaea suggest that ancestral organisms arising early in the DNA world underwent processes that allowed DNA damage in one cell to be repaired by transfer of DNA sequence information from a neighboring cell in order to facilitate recombinational repair.
Eukaryotes are capable of several different types of DNA repair process:
The DNA damage may be enzymatically directly reversed. There are three known direct reversal mechananisms (Yi C) : (1) Photolyase catalyzed direct reversal of UV light-induced photolesions; (2) O6 alkylguanine-DNA alkytransferase catalyzed direct reversal of a set of of O6 alkylated DNA damages; and (3) direct reversal of N-alkylated base adducts by AlkB family dioxygenases. Direct reversal mechanisms are specific for a small subset of DNA damages and thus have limited applicability.
Single-strand damages may be excised and the proper information restored by copying the other undamaged strand. This can occur by any one of several well-studied processes. These include mismatch repair (MMR), nucleotide excision repair (NER) and base excision repair (BER) . These processes appear to have arisen in the archaea , but are most well understood in the eukaryotes. This option was not available to organisms with ssRNA genomes because the double-stranded state exists only transiently during replication. In any case the enzymes that carry out such repair processes in organisms with DNA genomes are not known to be encoded in the ssRNA virus genomes. Thus this type of mechanism was not likely present during the early evolutionary stages in ssRNA genome containing organisms.
Double-strand damages in double-stranded DNA, such as double-strand breaks, can be repaired without the presence of an homologous template by such processes as non-homologous end joining (NHEJ) and microhomology mediated end joining (MMEJ). These processes depend on the duplex nature of DNA but not on strict homology. NHEJ can be accurate if the ends of the DNA in double-strand breaks do not need processing. However, if the ends need processing before rejoining then mutations are very likely to be introduced . MMEJ is inaccurate and is always associated with a DNA deletion . Thus these processes are inaccurate and generate mutations and are not applicable to ssRNA genomes.
Homologous recombinational repair is possible when two templates are present and adjacent. Such repair may occur for various types of DNA damage. For double-strand breaks in mitosis, homologous recombinational repair, either by the less common breakage and exchange mechanism or by the more frequently used SDSA (copy-choice) mechanism , are the only accurate forms of repair available. Template switching can occur during mitosis when two sister chromatids are present and adjacent after DNA synthesis and before cell division.
During meiosis homologous chromosomes originating from different parents align intimately with each other. This is followed by transfer of sequence information between homologs, homologous recombination. The main mechanism is SDSA (copy-choice recombination), a central characteristic of meiosis (see Section 6.1). Less frequent homologous recombination by breakage and exchange of chromosomes also occurs during meiosis.
Copy-choice recombination is also an important general mechanism for dealing with DNA damages that block the movement of the DNA polymerase during DNA replication (see Section 6.2).
6.1 Meiotic and mitotic recombination
The results of numerous studies in a wide range of eukaryotes indicate that during meiosis a variety of DNA damages are repaired by recombinational repair (reviewed in ). In somatic cells, mitotic recombination also facilitates DNA repair. Molecular models of recombination have been revised over the years as relevant evidence accumulated. Our current understanding of recombination reflects the work of several groups of investigators that have provided evidence that SDSA is a major mechanism of recombination [11, 63, 64, 65]. Furthermore, SDSA is a type of copy-choice mechanism since it involves switching from one template to another during strand synthesis and the return to the original template after a short distance (compare Figure 5 to Figure 2).
Figure 5 illustrates the series of steps that occur by the meiotic SDSA process in the repair of a double-strand break (DSB) in one chromosome using information from an adjacent undamaged homologous chromosome. As shown in the figure, the steps include strand invasion by a broken strand to form a D-loop, the further extension of the strand by DNA synthesis, and then the reassociation of the transferred strand with its original pairing partner. These strand-switching and DNA synthesis events associated with repair of a damage are similar to the copy-choice recombination described above for ssRNA viruses. Thus a central feature of eukaryotic recombination in meiosis and mitosis, strand-switching copy-choice recombinational repair, may have evolved from the simpler repair-related copy-choice events postulated above for ssRNA protocells based on the known processes in ssRNA viruses. Experimental evidence demonstrating that SDSA is a major recombination pathway in meiosis was presented by McMahill et al. .
The process of SDSA can accurately repair genome damage by copying the information lost in a damaged template strand from another intact homologous template strand without the need for physical breakage and exchange of DNA. Evidence bearing on the role of SDSA during meiotic recombination was reviewed by Bernstein et al. . An alternative mechanism for recombinational repair termed the Double-Strand Break Repair (DSBR) model also explains some types of recombination events, but in contrast to SDSA recombination, the DSBR model does require physical breakage and exchange of DNA strands . However, both the SDSA and DSBR models include a step in a which a DNA strand switches at a site of damage from one complementary partner strand to another and then continues synthesis with the new partner as template. Thus both models have elements of copy-choice recombination.
With respect to mitotic recombination in somatic cells, Andersen and Sekelsky  reviewed evidence that DSBR is a minor pathway for recombinational repair, and that the SDSA model appears to describe mitotic repair more accurately.
6.2 DNA replication
During DNA replication, a DNA damage in a template strand may be present and act as roadblock to the movement of the DNA polymerase as it extends synthesis of a new complementary strand. A blocked replication fork may be accurately bypassed by the mechanism illustrated in Figure 6 [12, 13]. When movement of the replicative polymerase is blocked by a damage, the polymerase can switch template strands (mediated by a helicase) [12, 13] to form a structure referred to as a “chickenfoot” intermediate. As synthesis of the new strand proceeds along the alternate template it synthesizes the DNA region that is complementary to the damaged site in its original partner strand. The newly forming strand may then unwind and then re-associate with its original partner to continue synthesis along its original track. Polymerase-mediated strand-switching to deal with a damaged template during DNA synthesis appears to be an important general mechanism in eukaryotic cells . This mechanism can be regarded as a type of copy-choice recombinational repair, and it too may have evolved from simpler copy-choice processes in ssRNA protocells.
Given the copy-choice genomic repair mechanism present in today’s ssRNA viruses, it appears that copy choice as a repair process may have emerged as early as 3.5 to 2.5 billion years ago when RNA was apparently the only genetic material. It is possible that the capability for strand-switching was a property of the earliest ribozyme polymerases.
In early protocells, the ssRNA genomes may have been segmented, as some ssRNA viruses are in the present day. Two protocells with damaged segmented genomes could have been able to generate undamaged progeny after fusion and then reassortment of segments. Present day ssRNA segmented genome viruses can repair damage in their genomes through both copy choice and segment reassortment.
The early stages of the evolution of genome repair proposed here are based on known capabilities of extant RNA viruses. Currently it is not known if these RNA viruses are the actual evolutionary descendants of early RNA life forms, or if they arose later. It has only been assumed here that the problem of dealing with damage to an RNA genome arises in the two cases, and that the solutions to this problem would be similar.
The earliest ssRNAs that formed folded structures that acted as ribozymes can be designated plus (+) strands. Such a ribozyme strand could have had polymerase activity and acted as an RdRp. The progeny ssRNAs that it synthesizes would be complementary to the corresponding parental (+) strands, and can be designated minus (−) strands. During the synthesis of (−) strands template-switching may have occurred.
When the ssRNA genome evolved to a dsDNA form, elements of the earlier copy-choice recombinational repair processes appear to have been retained. In addition, the informational redundancy inherent in double-stranded DNA allowed the emergence of novel excision repair pathways (MMR, BER and NER) that could use the information in one strand to repair damage in the other strand. Other mechanisms (e.g. NHEJ and MMEJ) also emerged to deal with double-strand damages when an homolgous genome was not available. As eukaryotes evolved from unicellularity to multicellularity, and within an organism the germline became segregated from the somatic cell line, copy-choice recombinational repair was retained in the germline as a central feature of meiosis. Recombinational repair was also retained during mitosis, and as a general process for overcoming damage roadblocks to DNA replication.