DNA is the carrier of genetic information, but is constantly assaulted by endogenous and exogenous genotoxic attacks in all living organisms. If left unrepaired damaged or structurally altered DNA can impede pathways of DNA metabolism and maintenance of genomic stability and lead to cell death or uncontrolled proliferation. Archaea comprise diverse microorganisms that can thrive in harsh environments like hydrothermal vents and acidic hot springs. They can live without sunlight or organic carbon as food, and instead survive on sulfur hydrogen, and other materials that most bacteria and eukaryotes can not metabolize. Considering the extreme environmental niches inhabited by archaeal species, DNA lesions could be massively induced by exposure to hazardous environmental factors, (e.g., ultraviolet, X- and -rays, elevated temperatures and endogenous mutagens, e.g., reactive oxygen/nitrogen species, alkylating agents and toxic metals), and very high rates of potentially mutagenic DNA lesions (deamination, depurination, oxidation by hydrolytic mechanisms, alkylations and subsequent strand breakage) are expected to arise. However, and interestingly, it was demonstrated that the hyperthermophilic crenarchaeota Sulfolobus Acidocaldarius exhibited a modest rate of spontaneous mutations nearly close to that of the mesophilic bacteria, Escherichia Coli (E. coli) (Grogan et al., 2001). Similarly, the euryarchaeota Pyrococcus abyssi can survive high doses of ionizing gamma irradiation (Jolivet et al., 2003b) and abasic sites formation in the hyperthermophilic chromosome was found to occur at a rate moderately higher than in E. coli (Palud et al., 2008). Thus, archaeal organisms seem to evolve efficient strategies for repairing DNA damage and thus avoiding mutations.
Like bacteria and eukaryotes, archaeal repair mechanisms seem to include nucleolytic processing of DNA. Consequently, this article sets out to review archaeal DNA nucleases based on current knowledge of sequence, structure and mechanism. We have focused on recent work on several DNA repair nucleases, with a detailed description of substrate preference and cleavage specificity of these archaeal enzymes. Crystal structures, when available, are discussed in the context of biochemical data to outline mechanistic features, such as enzymatic DNA cleavage, DNA binding, and sometimes, although not always, functions. This review stresses the molecular mechanisms which have been conserved throughout evolution with reference to eukaryotic DNA nucleases and, in some cases, to bacterial counterparts. On the other hand, DNA nucleases which appears unique to archaea are emphasized with the aim to describe novel aspects of repair mechanisms.
2. Type 2 Ribonuclease H, a structure-specific DNA repair nuclease
2.1. RNase HII/2: a ubiquitous enzyme
Ribonucleases H (RNases H) catalyse the cleavage of the RNA portion of RNA/DNA hybrid molecules that are ubiquitously present in cells (Stein and Hausen, 1969). RNases H are classified into two major families, type 1 and type 2, based on amino acid sequence identities and distinct biochemical properties. Genes encoding RNases H are found in viruses, archaea, bacteria and eukaryotes and, at least, one RNase H is present within a single cell. Furthermore, type 2 RNases H are more widely distributed than type 1 RNases H in prokaryotic and eukaryotic genomes (Ohtani et al., 1999b). Biological roles, including DNA replication, DNA repair, and transcription have been assigned for these RNases H, as recently reviewed (Cerritelli and Crouch, 2009; Tadokoro and Kanaya, 2009). Here, we report recent progress in the structural and functional characterization of type 2 ribonucleases H (RNase HII/2) presumed to be involved in an excision repair system for the removal of ribose residues with a particular accent on archaeal enzymes.
2.2. RNases HII/2 orthologs
2.2.1. Distribution and amino acid sequence identities
In the process of analysing the 95 sequenced archaeal genomes, type 2 RNases H (RNases HII) have been detected among the five archaeal phyla: Euryarchaeota, Crenarchaeota, Korarchaeota, Thaumarchaeota and Nanoarchaeota. In contrast to the type 1 enzymes, archaeal RNases HII appear universally distributed, and most organisms only contain RNase HII, with the exception of few archaea, such as Sulfolobus tokodaii, Haloferax volcanii, Halobacterium sp. NRC-1 and Pyrobaculum aerophilum which possess both types. Despite the multiplicity of rnh genes within a single cell, the ubiquitous occurrence of RNase HII suggests that type 2 may provide the major RNase H activity in archaea cells, as recently proposed for eukaryotes (Bubeck et al., 2011; Frank et al., 1998b).
Sequence comparison within archaeal RNases HII has revealed a high degree of amino acid sequence identity (Chai et al., 2001; Haruki et al., 1998; Le Laz et al. 2010; Muroya et al., 2001). For instance, Pyrococcus abyssi (PabRNase HII) shows amino acid sequence identities of 64% to Thermococcus kodakaraensis RNase HII (TkoRNase HII), 49% to Archaeoglobus fulgidus RNase HII (AfuRNase HII) and 40% to Methanocaldococcus jannaschii RNase HII (MjaRNase HII). Likewise, archaeal RNases HII have highly sequence similar orthologs in bacteria and eukaryotes. AfuRNase HII shows amino acid sequence identities of 31% to Thermotoga maritima RNase HII (TmaRNase HII) and 30% to the catalytical subunit of Mus musculus RNase H2A (MmuRNase H2A), the latter composed of three distinct subunits (Shaban et al., 2010).
2.2.2. Biochemical characterization
The apparent sequence conservation among RNases HII/2 orthologs would indicate that these enzymes have biochemical properties in common. Interestingly, archaeal RNases HII display activity at alkaline pH (Chai et al., 2001; Haruki et al., 1998; Le Laz et al., 2010), and this property seems to be a hallmark of type 2 RNases H (Chon et al., 2009; Rohman et al., 2008; Rychlik et al., 2010). As first reported by Haruki, et al., the archaeal TkoRNase HII activity was not salt-dependent, but was greatly attenuated by salt-concentrations higher than 100 mM (Haruki et al., 1998). More recent biochemical characterizations, however, seem to indicate that most of type 2 RNase H activities are salt-dependent (Chon et al., 2009; Ohtani et al., 2000; Rohman et al., 2008; Rychlik et al., 2010). All archaeal RNases HII studied to date have been shown to be strictly metal-dependent nucleases. PabRNase HII prefers Mg2+ to Mn2+ or Co2+ for activity, TkoRNase HII shows preference for Co2+ over Mg2+, Mn2+ or Ni2+, and AfuRNase HII mostly prefers Mg2+ and Mn2+ to other metals (Chai et al., 2001; Haruki et al., 1998; Le Laz et al., 2010). Thus, metal ion usage by archaeal RNases HII may be a consequence of the environmental conditions under which they thrive. It may also dictate the substrate requirement for hydrolysis and confer a specialized function on the enzyme in the maintenance of genome integrity. This concern is nicely exemplified by bacterial RNases HII, for which Mg2+- or Mn2+- dependent activities are imposed by the nature of the substrate. This is in contrast to the eukaryotic RNases HII which appear more active in the presence of Mg2+ (Chon et al., 2009; Frank et al., 1994; Rohman et al., 2008).
Cleavage specificities for substrates containing single or few ribonucleotides embedded in double-stranded DNA (dsDNA) of type 2 RNases H are now well documented. Such structural substrates can arise in vivo during Okazaki fragment processing from intrinsic RNA ligation activity (Rumbaugh et al., 1997) or erroneous nucleotide incorporation by DNA polymerases (Nick McElhinny et al., 2010a; Nick McElhinny et al., 2010b), and during exposure to external damaging agents (Von Sonntag and Schulte-Frohlinde, 1978). Initial studies revealed that the archaeal TkoRNase HII was active on four ribonucleotides embedded in dsDNA (DNA-RNA4-DNA/DNA). In the presence of Co2+ and 50 mM NaCl, the enzyme specifically cleaves at the phosphodiester bond between the third and fourth ribonucleotides, which is one ribonucleotide upstream of the RNA.DNA junction (Haruki et al., 1998). This cleavage specificity is consistent with what has been recently reported using the same substrate, but Mg2+ instead of Co2+ (Rohman et al., 2008). Similarly, TkoRNase HII exhibited a unique cleavage site on single ribonucleotides embedded in dsDNA (DNA-RNA1-DNA/DNA) and specifically cut at the 5’ side of the ribonucleotide. When kinetic parameters of TkoRNase HII were determined in the presence of both DNA-RNA1-DNA/DNA and DNA-RNA4-DNA/DNA, substrate binding and turnover number of proteins were found to be comparable. Thus, one or few ribonucleotides embedded in dsDNA must be uniformly recognised and hydrolysed with similar efficiency in this archaeal organism. AfuRNase HII also possesses such catalytic specificities. In the presence of Mg2+ and 50 mM KCl, AfuRNase HII is shown to be active on DNA-RNA1-DNA/DNA, unless the ribonucleotide is positioned<4 bases from the 5’ end or <2 bases from the 3’ end, and cleavage occurs at the phosphodiester bond 5’ of the junctional ribonucleotide (Bubeck et al., 2011). In addition, PCNA (Proliferating Cell Nuclear Antigen), described as a scaffold for DNA repair and replication enzymes (Maga and Hubscher, 2003; Meslet-Cladiere et al., 2007), enhances cleavage activity of AfuRNase HII on DNA-RNA1-DNA/DNA, with the exception of ribonucleotide located within the first ten 5’-bases of the strand containing it. Interestingly, this result is consistent with that observed previously for PabRNase HII (Meslet-Cladiere et al., 2007). AfuRNase HII also shows cleavage specificity on DNA-RNA4-DNA/DNA, cutting 5’ of the last ribonucleotide of the junction in the presence of Mg2+ (Chai et al., 2001). Similarly, the archaeal PabRNase HII acts as a specific nuclease on single embedded ribonucleotides, exhibiting cleavage activity in the presence of Mg2+ (Le Laz et al., 2010). Bacterial and eukaryotic type 2 RNase H enzymes share comparable substrate specificity for single or few ribonucleotides embedded in dsDNA. Substrate and cleavage specificities for bacterial RNases HII, e.g. Escherichia coli RNase HII (EcoRNase HII) and TmaRNase HII, on DNA-RNA1-DNA/DNA were found to be identical in the presence of Mg2+ and 50 mM NaCl (Chon et al., 2009; Ohtani et al., 2008; Rychlik et al., 2010). All bacterial enzymes specifically cleaved at the 5’ side of the ribonucleotide of the RNA.DNA junction. Similar cleavage specificities of few ribonucleotides embedded in dsDNA were also observed, leaving a mono-ribonucleotide at the 5’ terminus of the RNA.DNA junction (Ohtani et al., 2008; Rychlik et al., 2010). Thus, bacterial RNases HII share common features on junction substrates, in which Mg-dependent cleavage likely dominates over Mn-dependent hydrolysis. This statement can also be applicable to the eukaryotic RNases H2. Indeed, both mammalian and yeast enzymes displayed a unique cleavage site on DNA-RNA1-DNA/DNA, cutting at the 5’-deoxyribonucleotide–ribonucleotide bond at the RNA.DNA junction in the presence of Mg2+ (Bubeck et al., 2011; Chon et al., 2009; Jeong et al., 2004; Rohman et al., 2008; Shaban et al., 2010). In contrast to AfuRNase HII, cleavage efficiency of Human sapiens RNase H2 (HsaRNase H2) was not stimulated by its cognate PCNA, although they have been shown to co-localize and to interact in vivo (Bubeck et al., 2011; Chon et al., 2009). Under the same reaction conditions than those described for hydrolysis of single embedded ribonucleotides, cleavage specificity of Saccharomyces cerevisiae RNase H2 (SceRNase H2) on DNA-RNA4-DNA/DNA took place at the phosphodiester bond between the third and fourth ribonucleotides (Chai et al., 2001; Jeong et al., 2004; Rohman et al., 2008). Moreover, kinetic parameters indicated that substrate binding and turnover number of proteins were found equivalent for both DNA-RNA1-DNA/DNA and DNA-RNA4-DNA/DNA, as also observed for TkoRNase HII (Rohman et al., 2008).
Overall, it appears that Mg-dependent hydrolysis of single or few ribonucleotides embedded in dsDNA along with the unique substrate specificity are a hallmark of type 2 RNases H, implying that key structural elements necessary for activity must be conserved among eukaryotes and prokaryotes.
2.3. Structure and catalysis by RNases HII/2
2.3.1. Overall topology
Structural comparison of three type 2 RNases H from archaea (AfuRNase HII: PDB code 1I39), bacteria (TmaRNase HII: PDB code 3O3F) and eukaryotes (MmuRNase H2A: PDB code 3KIO) identifies a conserved catalytic core, termed RNase H fold, consisting of a five-stranded β sheet with three antiparallel and two parallel strands (54123, ↑↑↑↓↑) surrounded by α-helices (Fig. 1a)(Nowotny and Yang, 2009; Yang and Steitz, 1995). Concomitant with high sequence similarity, the three-dimensional structures of single polypeptide archaeal RNases HII from A. fulgidus, M. jannaschii and T. kodakaraensis share analogous topology and fold (Chapados et al., 2001; Lai et al., 2000; Muroya et al., 2001). They are composed of two distinct domains. The N-terminal domain comprises the central catalytic core flanked by seven α-helices (α1- α7). The C-terminal domain mainly consists of two parallel α-helices (α8- α9). Unlike TkoRNase HII, the helix α9 is incomplete in AfuRNase HII, MjaRNase HII and PabRNase HII, as recently reported (Le Laz et al., 2010). This secondary structure element is important for TkoRNase HII to bind substrate (Muroya et al., 2001). On the other hand, a flexible hinge region (residues 195SNLR198 in helix α9) in AfuRNase HII allows a wide range of motion when bound to its cognate PCNA (Bubeck et al., 2011)(Fig. 1a). Likewise, the structures of archaeal RNases HII are very similar to the bacterial ortholog, TmaRNase HII. For instance, the structures of AfuRNase HII and TmaRNase HII are superimposable with root-mean-square deviations (RMSD) of 2.6 Å over 128 Cα atoms (SuperPose V1.0 server (Maiti et al., 2004)). TmaRNase HII consists of the N-terminal domain with the five-stranded β sheet flanked by two helices on one side and three helices on the other side. The C-terminal domain contains two helices constituting a helix-loop-helix motif (Rychlik et al., 2010)(Fig. 1b). Additionally, TmaRNase HII possesses approximate 39-amino acids C-terminal and 12-amino acids N-terminal extensions that are not found in the archaeal RNase HII (Fig. 1b). Recently, the crystal structure of the heterotrimeric mouse RNase H2 (MmuRNase H2) has been solved, consisting of the RNase H2B-RNase H2C subcomplex that interfaces with the catalytic RNase H2A protein (Shaban et al., 2010). The RNase H2B-RNase H2C dimer is reported to provide a structural support for RNase H2A to become active and also acts as a platform for interactions with other proteins, such as PCNA (Bubeck et al., 2011; Chon et al., 2009). MmuRNase H2A shares significant structural similarity to the archaeal AfuRNase HII and the bacterial TmaRNase HII. For example, the structures of AfuRNase HII and MmuRNase H2A are superimposable with RMSD of 3.6 Å over 196 Cα atoms (SuperPose V1.0 server (Maiti et al., 2004)). MmuRNase H2A contains the conserved RNase H fold surrounded by α-helices (Shaban et al., 2010). Moreover, it comprises approximate 30-amino acids N-terminal and 50-amino acids C-terminal extensions that are absent in AfuRNase HII (Fig. 1b). Unlike TmaRNase HII that contains a helix-loop-helix motif at its extended C-terminal domain (Rychlik et al., 2010), this region appears disordered in the MmuRNase H2A structure (Fig. 1b) (Shaban et al., 2010). MmuRNase H2A N-terminal extension is structurally organised and forms a β-strand (called β1 or exposed β-strand) held rigidly by a disulfide bond (Fig. 1). This exposed β-strand is thought to act as an additional protein-protein interface (Shaban et al., 2010).
2.3.2. Active site, substrate binding and catalytic mechanism
Because the structures of type 2 RNases H contain a common RNase H fold, the catalytic center may be similar among archaea, bacteria and eukaryotes. Comparison of secondary structure among AfuRNase HII, MmuRNase H2A and TmaRNase HII for which crystal structures have been solved (Bubeck et al., 2011; Chapados et al., 2001; Shaban et al., 2010) points out that these three enzymes have remarkably similar active sites, consisting of four highly conserved carboxylates (DEDD motif) (Fig. 1b). The spatial clustering of these carboxylates is preserved among archaeal enzymes (Chapados et al., 2001; Lai et al., 2000; Muroya et al., 2001), and forms a small catalytic pocket composed of strands β1, β4 and helix α6. This active site geometry has been functionally analysed in AfuRNase HII and TkoRNase HII (Chapados et al., 2001; Muroya et al., 2001). Mutations of D6, E7, and D101 abolished the nucleolytic activity, while mutation of D129 showed 50% reduced activity in AfuRNase HII. Corresponding residues (D34, E35, D142, and D170) have been mutated in MmuRNase H2A, and the resulting heterotrimeric mutant RNase H2 exhibited no detectable activity (Shaban et al., 2010). Similarly, conservative mutations and subsequent functional analyses of three potential active site carboxylates in yeast RNase H2A led to a loss of substrate cleavage (Jeong et al., 2004). The four carboxylates are positionally conserved in MmuRNase H2A, and form a catalytic pocket composed of strands β2 and β5, and helices α4 and α5 (Shaban et al., 2010). The active site of TmaRNase HII is also composed of four conserved carboxylates that are structurally similar to AfuRNase HII, MmuRNase H2A. These residues (D18, E19, D107, and D124) are located in the catalytic cleft that is lined by strands β1 and β4, and a loop before the last helix of the RNase H fold (Rychlik et al., 2010). Thus, the similar active site geometry observed in eukaryotes and prokaryotes suggests a conserved two-metal ion catalytic mechanism in type 2 RNases H.
In the crystal structure of TmaRNase HII in complex with single embedded ribonucleotide, the substrate is bound to the protein, such that the noncleaved strand fits in a groove on the protein surface at the C-terminal domain, and the cleaved strand containing the single ribonucleotide interacts with the catalytic site (Rychlik et al., 2010). Several basic residues, e.g., K47, K122, and K138, are involved in substrate binding, and contact with the phosphate groups of the nucleic acid backbone in TmaRNase HII. These three lysines are strictly conserved in AfuRNase HII and MmuRNase H2A (Fig. 1b), which strongly supports a putative role for these residues in substrate binding. In fact, mutational studies indicated that K138 equivalent in AfuRNase HII is important for nucleic acid binding (Chapados et al., 2001). Furthermore, a substrate recognition motif (G21, R22, and G23) and Y163 have been identified as key structural elements responsible for the detection and cleavage of single embedded ribonucleotides in TmaRNase HII (Rychlik et al., 2010). Accordingly, mutational analyses of Y163 equivalent in AfuRNase HII suggested that this residue may intercalate into the duplex to stabilize a bent conformation required for substrate recognition and catalysis (Chapados et al., 2001). The highly conserved GRG motif and tyrosine residue (Fig. 1b) convincingly suggests that the specific recognition mechanism for single embedded ribonucleotides must be similar among type 2 RNases H. Besides, it has been recently described that the four conserved carboxylates of the DEDD motif were directly involved in coordinating the metal ions in TmaRNase HII, the distance between the two metal ions (A and B) imposed by the geometry of the active site carboxylates and substrate (Rychlik et al., 2010). The influence of nucleic acid on the metal-ion coordination ensures the catalytic specificity of TmaRNase HII. One of the most important features observed in the co-crystal structure of TmaRNase HII is that the tyrosine required for 2’-OH detection of single embedded ribonucleotides also participates in metal ion positioning, because it induces a slight deformation of the nucleic backbone at the RNA.DNA junction, and renders possible the coordination of metal ion A by the phosphate group. Moreover, a conserved DSK motif (D45, S46, and K47) located in the vicinity of the catalytic pocket has been proposed to participate in the active site formation of TmaRNase HII (Rychlik et al., 2010). Based on strictly conserved key structural elements responsible for substrate binding, metal ion coordination, and formation of the catalytic center in TmaRNase HII, the cleavage mechanism of single ribonucleotides embedded in dsDNA likely proceeds in a similar fashion in archaeal, bacterial and eukaryotic type 2 RNases H.
2.4. Physiological roles for RNases HII/2
Type 2 RNases H are represented in organisms across domains and exhibit a conserved core structure. Moreover, they have been identified as the sole enzymes able to recognise and cleave a single ribonucleotide embedded in dsDNA (Eder and Walder, 1991), thereby contrasting to type 1 RNases H that requires at least four ribonucleotides for cleavage (Ohtani et al., 1999a). As mentioned earlier, single ribonucleotides embedded in dsDNA can arise from external damaging agents (Von Sonntag and Schulte-Frohlinde, 1978), and can occur by intrinsic RNA ligation (Rumbaugh et al., 1997) or erroneous nucleotide incorporation during DNA replication (Nick McElhinny et al., 2010a; Nick McElhinny et al., 2010b). The presence of riboses in DNA has been shown to induce a helical alteration, promoting a B- to A-form transition in DNA (Horton and Finzel, 1996). This result is consistent with the local DNA backbone distortion recently observed in the structure of TmaRNase HII bound to single embedded ribonucleotide substrates (Rychlik et al., 2010). If left unrepaired, such structural alterations could be mutagenic for the cells, given that accurate DNA synthesis by replicative DNA polymerases depends on DNA helix geometry. Since type 2 RNases H specifically cleave single embedded ribonucleotides, they can be considered to be involved in DNA repair. However, as the enzymes cut the phosphodiester bond 5’ of the junctional ribonucleotide, other components are required to eliminate the remaining ribonucleotide. To clarify the physiological role of type 2 RNases H in bacteria, archaea and eukaryotes, mutant strains containing one or two RNases H-encoding genes have been constructed. In single-celled species, deletions of all RNase H genes were not lethal, but showed modest sensitivity to DNA-damaging agents, indicating their requirement in DNA repair (Arudchandran et al., 2000; Fukushima et al., 2007; Itaya et al., 1999; Meslet-Cladiere et al., 2007). Conversely, in a multicellular organism, both type 1 and type 2 RNases H were shown to be essential. Deletion of type 1 RNases H causes embryonic lethality in mouse and Drosophila Melanogaster (Cerritelli et al., 2003; Filippov et al., 2001), the former impairing mitochondrial DNA replication (Cerritelli et al., 2003). On the other hand, type 2 RNases H have been described as the major source of RNase H activity in eukaryotes (Eder and Walder, 1991; Frank et al., 1998a), and mutations in human RNase H2 can cause the neurological disorder, Aicardi-Goutières Syndrome (AGS). The pathogenesis of AGS is linked to the activation of innate immune system, likely because of accumulation of normally degraded RNA/DNA nucleic acids (Crow et al., 2006). Likewise, defect of the catalytic mutant MmuRNase H2 to hydrolyse single embedded ribonucleotides pointed toward a role for eukaryotic RNase H2 in DNA repair (Shaban et al., 2010). Furthermore, based on genetic and biochemical results, removal of single embedded ribonucleotides seems to involve at least type 2 RNases H, Fen1 (Flap Endonuclease 1) and PCNA (Bubeck et al., 2011; Meslet-Cladiere et al., 2007; Nick McElhinny et al., 2010a; Rumbaugh et al., 1997; Rydberg and Game, 2002), with PCNA:RNase HII/2 complex acting as a sensor of erroneous ribonucleotides. The association of such protein components likely suggests a role of type 2 RNases H in base excision repair (BER) to accomplish the removal of mutagenic ribonucleotides. In this pathway, type 2 RNase H would act initially by recognising and incising the damaged strand at the 5’ side of the ribonucleotide. However, further studies are now required to identify and reconstitute the sequential enzymatic steps involved in this repair process.
3. Endonucleases of the XPF/MUS81 family
DNA repair and replication restart pathways generate a variety of branched structures such as four-way DNA junctions (Holliday junctions, HJs), fork structures and 5'- or 3'-flaps, all of which are substrates for structure-specific endonucleases. Many nucleases that act upon 3'-flap structures belong to the XPF/MUS81 family of proteins, which are present throughout eukarya and archaea but are not found in bacteria. Defects in XPF/MUS81-family members are associated with human disease such as Xeroderma pigmentosum (XPF-ERCC1) (Sijbers et al., 1996) or Fanconi anemia (FANCM) (Meetei et al., 2005).
3.1. Eucaryal members of the XPF/MUS81 family of endonucleases
The human XPF-ERCC1 complex and its counterpart RAD1-RAD10 in Saccharomyces cerevisiae have been thoroughly characterized for their role in nucleotide excision repair (NER), cleaving damaged DNA 5’ to the lesion (Bardwell et al., 1994; Sijbers et al., 1996). Moreover RAD1-RAD10 acts in DNA double-strand break repair (Ma et al., 2003; Schiestl and Prakash, 1988), a role also conserved in mammalian cells (Ahmad et al., 2008; Al-Minawi et al., 2008). Notably, the XPF-ERCC complex has been directly observed in living cells using fluorescence resonance energy and spectral imaging techniques (Dinant et al., 2008).
MUS81 complexes are distinct to XPF, and initial work in Schizosaccharomyces pombe led to the proposal that Mus81-Eme1 is a Holliday junction resolvase (Boddy et al., 2001). However, in vitro work has shown that recombination intermediates such as D-loops and nicked HJs are cleaved by Mus81-Eme1, and are likely to represent its main substrates in vivo (Gaillard et al., 2003; Gaskell et al., 2007; Osman et al., 2003). By contrast the S. cerevisiae ortholog Mus81-Mms4, which displays similar substrate specificity in vitro (Ehmsen and Heyer, 2008; Gaskell et al., 2007; Whitby et al., 2003), is not the principal HJ resolvase (de los Santos et al., 2003) but instead resolves aberrant joint molecules in meiosis (Jessop and Lichten, 2008; Oh et al., 2008). The recent discovery in humans and S. cerevisiae of the Hen1/Yen1 Holliday junction resolvase, which is absent from S. pombe, suggests that this enzyme is a meiotic HJ resolvase, and that in its absence Mus81 can fulfill that role (Ip et al., 2008). Nevertheless, in both S. cerevisiae and S. pombe, the primary function of Mus81 appears to be the restart of collapsed-replication forks by homologous recombination (Doe et al., 2004; Froget et al., 2008; Kai et al., 2005; Matulova et al., 2009; Roseaulin et al., 2008), a role that is functionally redundant with the Sgs1-Top3 and the Rqh1-Top3 complexes, respectively (Doe et al., 2004; Kaliraman et al., 2001). The MUS81 complex is also found in humans (Chen et al., 2001), and appears to promote replication fork restart by homologous recombination (Ciccia et al., 2003; Franchitto et al., 2008; Hanada et al., 2007; Shimura et al., 2008).
Human FANCM was identified thanks to the archaeal ortholog Hef, an XPF/MUS81 family protein featuring a helicase:nuclease fusion (Meetei et al., 2005; Mosedale et al., 2005). The FANCM-FAAP24 complex is a XPF/MUS81 member found in humans (Ciccia et al., 2007) that possesses two separate functions: (i) to recruit the Fanconi anemia core complex to the repair of DNA interstrand crosslinks (Ciccia et al., 2007; Kim et al., 2008; Meetei et al., 2005; Mosedale et al., 2005), and (ii) to facilitate the response to replication stress by the ATR pathway, via its ATP-dependent translocase activity (Collis et al., 2008; Gari et al., 2008; Xue et al., 2008). Since FANCM promotes fork reversal in vitro, it has been proposed that FANCM ATP-dependent activity at stalled forks is needed to allow processing for replication restart (Gari et al., 2008; Xue et al., 2008). The recently-identified S. pombe FANCM ortholog Fml1 appears to promote homologous recombination at stalled forks, suggesting that the ATP-dependent helicase activity of FANCM at DNA replication forks is conserved in S. pombe (Sun et al., 2008).
3.2. Archaeal members of the XPF/MUS81 family of endonucleases
All archaea encode a protein of the XPF/MUS81/FANCM family of endonucleases except the Thermoplasmatales. Archaeal XPF exists in two forms: the long form consisting of an N-terminal helicase domain fused to a C-terminal nuclease domain, specific to the euryarchaea, and a short form lacking the helicase domain found in the crenarchaea and thaumarchaea.
Hef (helicase-associated endonuclease fork-structure DNA) is present only in euryarchaeota and was identified in Pyrococcus furiosus due to its activity on branched DNA structures (Komori et al., 2002). Similarly to for instance eukaryotic FANCM proteins, Hef has both an active helicase domain and an active nuclease domain. The C-terminal fragment of Pyrococcus furiosus Hef adopts a similar domain organization to those in the XPF/MUS81 proteins, corresponding to the nuclease domain containing the ERKX3D signature sequence involved in nuclease activity and the helix-hairpin-helix motifs of the HhH domain (Nishino et al., 2003). Mutational analyses showed that residues in the ERKX3D motif are indeed involved in the cleavage of Hef endonuclease. The Hef nuclease domain adopts a type II
restriction endonuclease fold, indicating that Hef nuclease belongs to this restriction endonuclease family (Nishino et al., 2003). Accordingly, the Hef nuclease activity is strictly dependent on Mg2+ or Mn2+ whereas Ca2+ cannot substitute.
The C-terminal fragment of Pyrococcus furiosus Hef and the entire Hef protein form dimers through a combination of two interfaces, one in the nuclease domain and one in the HhH domain, which function independently with each other. It appears that simultaneous dimer formation in both the nuclease and the HhH domains is crucial to substrate recognition specificity (Nishino et al., 2003). In the homodimer, both HhH domains are equally important for substrate recognition while at least one of the nuclease active site is required for cleavage of the fork-structured DNA. The active site of the catalytic domain is positioned near the cleavage site, two to three bases away from the junction, and has the potential to introduce unpairing near the junction center. The HhH region is bound to duplex regions and is not directly involved in the recognition of the fork structure but dramatically enhanced the catalytic site unpairing.
The substrate specificity for the cleavage activity of the Hef protein is contained in the C-terminal domain as both the C-terminal fragment and the entire Hef protein recognize and cleave nicked, flapped and fork-structured DNAs at the 5’ side of the nicked position. P. furiosus Hef thus displays XPF/Mus81-like specificity, suggesting that Hef is involved in NER (Nishino et al., 2005a; Nishino et al., 2005b).
The N-terminal fragment of P. furiosus Hef containing all the conserved helicase motifs consist of three structural subdomains. Domains 1 and 3 are each folded into the RecA-like architectures with the conserved helicase motifs. Domain 2 is a relatively mobile domain with a positively charged surface inserted into domain 3 (Nishino et al., 2005b). Hef domain 2 exhibits architectural similarity with the thumb domain of Taq DNA polymerase being involved in double-stranded DNA binding. Mutational analyses show that the domain 2 dictates the recognition of specific DNA structures, especially fork-structured DNA, while domains 1 and 3 are crucial for the structure-specific helicase activity. Interestingly, two other SF2 helicase members recognizing branched structures, RecQ and RecG, contain an insertion, after and before the helicase core respectively. It remains unclear how Hef domain 2 participates in branched structure recognition.
The N-terminal domain of Hef displays a DNA structure-specific helicase activity as the most prominent enhancement of the ATPase activity is observed with fork-structured DNAs. Interestingly in vitro experiments suggest that the N-terminal domain binds to the fork-structured DNA and process the DNA to increase cleavage of the substrate by the endonuclease domain. These observations have led to the proposal that Hef also acts at stalled replication forks, both the helicase and the nuclease activities being required for the rearrangement of forked-structure DNA (Komori et al., 2004).
The genetic study of Hef mutant in the euryarchaea Haloferax volcanii suggest that Hef is involved in the restart of arrested replication forks as an alternative pathway to homologous recombination-dependent pathway (Lestini et al., 2010). In this organism Hef is not involved in nucleotide excision repair but Haloferax volcanii possesses bacterial homologs of NER proteins which may have displaced the original archaeal NER proteins. Therefore the absence of an NER function of Hef cannot be generalized to all euryarchaea. In support to this, a recent genetic analysis of hef mutants in the hyperthermophilic archaeon, Thermococcus Kodakaraensis, has demonstrated that Hef is involved in the repair of a wide variety of DNA damages. In addition, the higher sensitivity of Δhef mutants to methyl sulfonate and mitomycin C, suggests a central role for Hef protein in the archaeal NER and/or ICL repair pathways (Fujikane et al., 2010).
3.2.2. XPF in Crenarcheota
By contrast, the XPF ortholog found in crenarchaeota contains only the C-terminal nuclease domain. The structure of XPF from the crenarchaea Aeropyrum pernix reveals two domains, a N-terminal nuclease domain and a (HhH)2 domain (Newman et al., 2005). As expected by analogy with Hef, dimers are formed by interaction of the two nuclease domains and by the interaction of the two (HhH)2 domains from each monomer. Comparaison of the structure of the protein in the presence and absence of dsDNA reveals that the (HhH)2 domain plays a major role in interacting with DNA. Upon binding to DNA a domain movement allow the coupling between the (HhH)2 domain and the nuclease domain to recognize and cleave the DNA thanks to the flexibility of the connecting linker sequence between the two domains. It seems that dimer interaction with DNA involves two binding sites that can only both interact with DNA if the substrate is bent by around 90°, suggesting that XPF may recognize ds/ssDNA junctions by their susceptibility to distorsion. The structural data suggest that only one monomer is catalytically active at a time in a dimer bound to DNA.
The studies of XPF from Sulfolobus solfataricus have shown that XPF strickly requires interaction with PCNA to show any nuclease activity and it has been suggested that PCNA directly stimulates the nuclease activity without changing the binding affinity of XPF for its substrate (Hutton et al., 2008). XPF preferentially cleaves 3’-flap but the presence of downstream duplex influences the choice of position cleavage, SsoXPF appearing to act as a processive nuclease in vitro by processing 3’-flap into gapped duplex products. In this respect, it is noteworthy that recent fluorescence quenching and FRET studies have indicated that PCNA and XPF cooperate to distort DNA substrates (Hutton et al., 2009). It has also been observed that SsoXPF can act on substrates containing a variety of types of DNA damage or modification, suggesting a role in the removal of these lesions in vivo (Roberts and White, 2005). Altogether these data suggest that crenarchaeal XPF is recruited by PCNA to act in NER and replication fork restart, but to date this scenario has not yet been addressed using in vivo data.
4. The Mre11-Rad50 complex in Archaea
The processing of DNA double strand breaks (DSBs) is a crucial mechanism for genomic integrity. DNA breaks can arise during replication as intermediates in programmed DNA rearrangements including meiosis and immune system development or can be caused by oxidative damages and exposure to ionizing radiations. Double strand break repair (DSBR) is an essential repair pathway in the three domains of life, and plays a major role in the rescue of stalled or collapsed replication forks. In bacteria, DSBs are processed via homologous recombination, whereas, in eukarya, they are repaired by homologous or non-homologous recombination (Kowalczykowski et al., 1994; Sonoda et al., 2006). In archaea, the picture is not clear. Homologous recombination (HR) is presumably the preferred process of DSBR as proteins involved in HR have been identified whereas Ku homologs have not (White, 2011); yet a recent genetic study on Haloferax volcanii demonstrate that DSBR by HR is restrained, likely because this species is highly polyploid (Delmas et al., 2009). Nevertheless, the study of such processes in hyperthermophilic archaea is of importance, given that they are exposed to DNA damaging temperatures and are among the most radioresistant organisms, repairing fragmented chromosomes efficiently (Di Ruggiero et al., 1997; Gerard et al., 2001; Jolivet et al., 2003a; Jolivet et al., 2003b). The core component in charge of the early steps of this critical event in eukaryotic cells is the Mre11-Rad50 (MR) complex, associated with a third component, Xrs2 in yeast and Nbs1 in higher eukaryotes. Homologs of the Mre11-Rad50 complex have been found in the three domains of life; however, Xrs2/Nbs1 additional component has not been found in bacteria and archaea, it appears thus that Mre11 and Rad50 comprise the core enzymatic members of this conserved multiprotein machine.
4.1. Catalytic activities and DSB ends processing
The archaeal homologs of Mre11 and Rad50 were initially identified in Pyrococcus furiosus (Hopfner et al., 2000a). PfuMre11 shares significant similarities with eukaryotic counterparts, particularly in the N-terminal region, which contains five phosphoesterase motifs that form the nuclease domain of the protein (Williams et al., 2008). The enzyme is endowed with ssDNA endonuclease and manganese-dependent 3’-5’exonuclease activities (Hopfner et al., 2000a). However, at temperature closer to physiological conditions (55°C), PfuMre11 displays an ATP dependent nuclease activity in magnesium. This activity consists of both a weak 3’-5’ exonuclease activity as well as endonucleolytic cleavage activity on the 5’ strand at a break (Hopkins and Paull, 2008). PfuRad50 is related to the Structural Maintenance of Chromosomes (SMC) family and displays the classical modular structure composed of N- and C- terminal head domains, each bearing the Walker A and B motifs, respectively, separated by a coiled-coil region with a zinc hook. Rad50 has ATPase and adenylate kinase activities (Bhaskara et al., 2007; Hopfner et al., 2000b). Consistent with the model based on the crystallographic structure of PfuRad50 catalytic domain, the MR complex of P. furiosus likely exhibits ATP-dependent DNA binding activity. Thus, Rad50 may regulate DNA binding and release after proper DNA end processing in conjunction with Mre11 (Hopfner et al., 2000b). All the archaeal genomes sequenced to date contain clear homologs of eukaryal Mre11 and Rad50 and the initial biochemical characterization of the P. furiosus homologs indicate that the archaeal MR complex is functionally similar to those from Bacteria and Eukarya. However, the 3’-5’ exonuclease activity described for the complex is opposite to the polarity required for the 5’-3’ resection of DSB ends necessary for the initiation of HR, suggesting the involvement of additional components to catalyse efficient DSB resection. As in the other thermophilic archaea, Mre11 and Rad50 are commonly found in an operon that frequently includes the herA and nurA genes and the four genes are co-transribed in the crenarchaeon Sulfolobus acidocaldarius (Constantinesco et al., 2002; Constantinesco et al., 2004). HerA is an ATP-dependent helicase, which is strikingly bidirectional and can thus unwind DNA from both 3’ and 5’ single-stranded overhangs. NurA defines a new nuclease family exhibiting both a single strand endonuclease activity and a 5'-3' exonuclease activity on single and double stranded DNA (Constantinesco et al., 2002; Constantinesco et al., 2004; Manzan et al., 2004). The cooperation of the four proteins for the 5’ strand resection at DSB has been demonstrated by Hopkins and Paull (Hopkins and Paull, 2008). This process depends on the enzymatic activities of HerA, NurA and Rad50, Mre11 activity being partially dispensable. The P. furiosus MR complex generates short 3’ single stranded overhangs through limited degradation of the 5’ strand at a DSB. This specific structure allows the entry of the complex HerA-NurA and the NurA nuclease together with HerA helicase activities generate the long 3’ single strand suitable for RadA-catalysed strand exchange. The role of Mre11 nuclease activity may be confined to the removal of short 5‘ terminal oligonucleotides, which could be essential for the clearance of covalently attached proteins at the 5’ strand. HerA and NurA have not been detected in eukaryotes, however, recent studies in budding yeast demonstrate that while MRX complex is involved in DSBs processing initiation, the functional homologs Sgs1 and Exo1 nucleases and Dna2 nuclease/helicase, are necessary for the extensive 5’ strand resection (Mimitou and Symington, 2008; Zhu et al., 2008). In bacteria, the DSB recognition and 5’strand resection to produce the 3’-OH overhang for RecA mediated strand exchange is performed by a single RecBCD complex (Kowalczykowski et al., 1994). SbcCD, the bacterial homolog of MR complex, has been shown to cleave hairpin DNA, which can block stalled replication fork, prior to homologous recombination rescue of the fork (Connelly et al., 1998) and to be implicated in the removal of covalently attached protein to promote repair (Connelly et al., 2003).
4.2. Structural insight into the Mre11-Rad50 complex
The archaeal MR complex is structurally very similar to their eukaryal counterparts, and has proven very useful for crystallographic and biophysical studies (Arthur et al., 2004; Hopfner et al., 2002a; Hopfner et al., 2001; Hopfner et al., 2000a; Hopfner et al., 2000b; Hopfner et al., 2002b; Williams et al., 2008). Indeed, with the exception of the recent description of the first eubacterial Mre11 nuclease, the bulk of structural data have been obtained from analysis of P. furiosus Mre11-Rad50 complex (Das et al., 2010).
The core Mre11-Rad50 complex forms a large globular complex at the root of an elongated coiled-coil structure. The complex exists as a heterotetrameric assembly (M2R2) and the globular head is composed of two Mre11 and two Rad50 ATPase domains, both of which bind DNA (Hopfner et al., 2001). This bipolar structure of the MR complex is consistent with both the enzymatic role in DNA end processing and structural function in DNA end joining. Indeed the M2R2 heterotetramer contains two DNA binding/processing active sites, which could be important in the alignment of DNA ends in NHEJ or of DNA ends and sister chromatids in HR (Hopfner et al., 2002a). X-ray crystallographic data from the P. furiosus Rad50 coiled-coil region reveals a hook structure that caps the distal end of the coiled-coils with a conserved Cys-X-X-Cys motifs that mediates Rad50-Rad50 dimerization through this motif by coordinated binding of a zinc atom (Hopfner et al., 2002b). The ability to interact through coiled coil ends in supra-molecular complexes is proposed to be necessary for the mechanistic role of MR complex (de Jager et al., 2004). The crystallographic structure of PfuMre11 reveals a two-domain architecture consisting of a protein-phosphatase-like domain and a small capping domain that interact at the active site (Fig.2).
The first domain contains the five phosphoesterase motifs which form the nuclease active site. This domain is composed of two parallel mixed β sheets that are flanked by seven α helices. The capping domain consists of a 5-stranded β sheet and two helices and partially caps the nuclease catalytic motifs of the N-terminal domain, suggesting that the capping domain might be involved in DNA-binding specificity (Hopfner et al., 2001). X-ray structure of PfuMre11 bound to DNA reveals that Mre11 dimerization is critical for efficient DNA binding and is mediated by N-terminal conserved domains (Williams et al., 2008). Six DNA recognition loops promote DNA binding and assemble into a contiguous DNA interaction surface. The interaction with DNA is mediated by contacts to the sugar-phosphate backbone, which is consistent with the lack of sequence specificity for Mre11 in DNA binding. The structure of the archaeal Mre11 was used to map eukaryotic Mre11 mutations linked to human disease and yeast DSBR defect and helped define a large surface area outside the nuclease motif, which may be an important protein-protein or protein-DNA interface (Hopfner et al., 2001).
4.3. Physiological roles of Mre11-Rad50 complex
Genetic studies in eukaryotes indicate that the MR complex is required for genomic stability and is involved in a large variety of different functions in response to DSBs (Stracker and Petrini, 2011). In vivo studies, in budding and fission yeast, have demonstrated a critical role for Mre11 and the other subunits of the MRX complex, for survival of DSBs caused by ionizing radiations and genotoxins (Chahwan et al., 2003; D'Amours and Jackson, 2002). In addition, hypomorphic mutations in the human mre11 and nsb1 genes lead to ataxia telangiectasia-like disorder and Nijmegen Breakage syndrome, respectively. The cellular features resulting from these mutations include hypersensitivity to ionizing radiation, radioresistant DNA synthesis, and abrogation of ATM-dependent events (Stewart et al., 1999; Williams et al., 2007). These phenotypes emphasize the importance of the eucaryotic Mre11 complex in a large variety of DNA metabolic pathways. The bacterial homologue of Mre11-Rad50 is SbcCD, and it has been shown that sbcCD mutants of Deinococcus radiodurans display reduced survival and present a delay in kinetics of DSB repair (Bentchikou et al., 2007). In Bacillus subtilis and D. radiodurans, the deletion of sbcC results in an increased sensitivity of the cells to ionizing radiation (Mascarenhas et al., 2006).
In archaea, the first evidence of the involvement of Mre11 in DNA repair process was demonstrated by Quaiser et al. (Quaiser et al., 2008), using an immunodetection approach to determine the roles of Mre11, Rad50, NurA and HerA proteins, in post-irradiation DNA repair in S. acidocaldarius. They observe that a complex of the three proteins HerA, Mre11 and Rad50 is formed constitutively in vivo. Rad50 is constitutively associated with DNA and upon chromosome fragmentation, Mre11 proteins is recruited to the DNA or to Rad50 DNA-bound proteins, suggesting that Mre11 is actively involved in DNA repair processes and/or acts as an inducible damage sensor. The analyses of mre11 rad50 deletion mutants of the halophilic Halobacterium sp. strain NRC1 also suggest that Mre11 and Rad50 may have independent functions outside the MR complex in archaea, since the absence of Rad50 has no effect on the repair of DSBs, whereas the loss of Mre11 results in a decrease rate of DSBR, due to the loss of either nuclease activity or the DNA damage-sensing activity of Mre11 (Kish and DiRuggiero, 2008). Surprinsingly, the mre11 rad50 deletion mutant of Haloferax volcanii displays an enhanced resistance to DNA damage that correlates with a higher level of homologous recombination in the mutant, suggesting that Mre11-Rad50 restrains the use of HR for repair (Delmas et al., 2009). The unrestrained use of HR in mre11 red50 mutant enhances cells survival but leads to a slower recovery presumably due to difficulties in the resolution of repair intermediates. Two non-exclusive hypotheses are proposed to account for that increased resistance observed in mre11 rad50 mutants: (i) Mre11-Rad50 binds to double-strand breaks and prevents HR, and/or (ii) Mre11-Rad50 stimulates an alternative pathway of double-strand breaks repair. To add some complexity, a recent genetic analysis in the hyperthermophilic archaeon Thermococcus kodakaraensis, has shown that the mre11, rad50, herA, nurA and radA genes are essential for T. kodakaraensis, which is in contradiction with the previous genetic analyses in archaea, yeast and bacteria. This result could reflect the importance of HR to repair DNA damage caused by the high temperature required for T. kodakaraensis for viability (Fujikane et al., 2010).
Contrasting with the wealth of structural and biochemical data gained from the study of archaeal MR complex, the paucity and the conflicting nature of the genetic analyses underscore the importance to develop more effective genetic tools, for hyperthermophilic archaea in particular, to improve our knowledge on the functions of the complex in response to DSBs. Biochemical and in vivo investigations of the functions of HerA, NurA and the single stranded DNA binding proteins, RadA and SSB/RPA, required for strand exchange, should also improve our knowledge of the HR/DSBR in archaea and potentially shed light on the eukaryal pathway.
5. The Pab2263-NucS protein
5.1. Identification of a novel nuclease
Many nucleases are highly regulated by the sliding clamp PCNA (Proliferating Cell Nuclear Antigen). For instance, PCNA increases the affinity of Fen-1 for its substrate (Hutton et al., 2008) as well as catalytic rate of SsoXPF (Hutton et al., 2009). Interaction of Fen-1 and other proteins with PCNA is mediated by the so-called PIP-motif (PCNA Interacting Motif), a relatively short peptide motif found in a large number of PCNA-interacting proteins (Vivona and Kelman, 2003; Warbrick, 1998). The sequence motif in Pyrococcus abyssi species has been experimentally defined and corresponds to the peptide motif QX2LX2[WLFT][LFT] (Meslet-Cladiere et al., 2007). Among others, previously uncharacterized protein encoded by the pab2263 gene was shown to carry this peptide motif at its carboxy-terminus.
Pab2263 belongs to the DUF91 family (Domain of Unknown Function 91) and, as many members of this family, contains the C-terminal domain that carries the characteristic residues forming the active site of the RecB family nucleases. This nuclease domain is found in many enzymes with potential functions in DNA replication and/or repair (Aravind et al., 2000). For instance, the DUF91 family members are found in euryarchaeota (59 homologues annotated in 2011), crenarchaeota (33 homologues), actinobacteria (259 homologues), and proteobacteria (41 homologues). Up to date, no eukaryotic member of the DUF91 family has been identified.
5.2. Structure of Pab2263-NucS
Crystallographic structure of Pab2263 has been solved (Ren et al., 2009; Ren et al., 2007) and is the first representative of the DUF91 family. Pab2263 is composed of two independent domains, separated by a long and flexible linker (~28Å). This multi-domain organisation is common for many nuclease domains which are often associated with helicase domains (Rouillon and White, 2010).
The C-terminal domain of this protein family displays the minimal endonuclease fold (Pingoud et al., 2005): an α/β structure composed of a five-stranded β-sheet and four flanking α helices. Active site is represented by a sequence motif conserved in the RecB-like nucleases (Aravind et al., 2000): E----[Gh]xxD----hxhh[ED]hK---QhxxY, where ‘h’ refers to hydrophobic residues (YFWLIVMA) and ‘---‘ indicates that the characteristic residues are not consecutive in the sequence. A conserved patch of four basic residues flanks one side of the cleft of the active site and might be involved in the binding of nucleic acids.
Strikingly, the N-terminal domain of Pab2263 displays a half-closed β-barrel, composed of height β-strands arranged in two antiparallel β-sheets. This fold was never previously described, but can remotely be seen as a structural homologue of the OB- or the Sm-folds, two folds that are involved in the binding of nucleic acid (Kambach et al., 1999; Theobald et al., 2003). In Pab2263, the potential binding site involves two patches of three consecutive basic residues, two conserved aromatic residues and a conserved arginine. High affinity ssDNA binding activity of the N-terminal domain was desmonstrated using site-directed mutagenesis and EMSA experiments (Ren et al., 2009).
The N-terminal domain displays a large hydrophobic patch exposed to the C-terminal domain, and is involved in the dimerisation of the protein. Dimer formation brings one extra residue of one monomer to the active site of the second monomer and the flexible linker cap the active site. As a result, the active site becomes a ‘closed’ channel, which indicates that the substrate for the enzyme must have a free end.
5.3. Activity of the Pab2263-NucS protein
Tests of various substrates on Pab2263 reveal its surprisingly broad range of substrate specificity. In agreement with the structural data indicating that the active site of the Pab2263 is located in the closed channel, this protein preferentially cleaves single stranded DNA, and was thus renamed ‘NucS’, for ‘NUClease specific for single-stranded DNA’. Under stoechiometric binding conditions, single-stranded regions of splayed arms, 3’ flaps and 5’ flaps are all cleaved by NucS, leaving only double-stranded products. Long single stranded DNA substrates are cleaved to regularly spaced products, which suggests that the protein could somehow measure its distance to the DNA end.
In high concentration of NucS, nuclease activity can invade to the double stranded DNA regions, suggesting that NucS proteins carry a weak helicase and/or unmelting activity of dsDNA. Important observation is that addition of PCNA directs the cleavage activity of P. abyssi NucS towards the ss/ds DNA junctions, thus increasing the cleavage specificity (Ren et al., 2009).
Pab2263-NucS is a founding member of a new family of structure-specific DNA endonuclease. The discovery of this novel nuclease family thus further indicates that archea contain many more nucleases than previously expected on the basis of search of homologous of ‘conventional’ eukaryotic or bacterial nucleases. For example, the Bax1 and GAN (GINS Associated Nuclease) nucleases from the DUF790 and Phosphoesterase RecJ-like families, respectively, have been identified in a similar manner (Li et al., 2011; Richards et al., 2008; Roth et al., 2009; Rouillon and White, 2010).
DNA repair pathways require the function of nucleases to ensure the removal of damages in DNA and thus integrity of the genetic information. The molecular mechanisms of the archaeal DNA nucleases reviewed in this work clearly underscore the conservation of the genetic information processing in archaea and eukarya. Whereas considerable amount of biochemical and structural data are available for archaeal DNA repair nucleases, their physiological roles remain less understood. More biochemical and genetic studies to investigate physiological functions of classical and recently discovered archaeal nucleases have clearly a strong potential to contribute to understanding complexity of eukaryotic DNA repair in the cellular context.
D.F., H.M, R.L., and C.R., were supported by grant ANR-07-BLAN-0371 from the National Research Agency. G.H., was supported by grant ANR-10-JC from the National Research Agency. H.M and R.L. also acknowledge support from the ANR Project RETYD(Y)DNA.