In Drosophila, compensation for the reduced dosage of genes located on the single male X chromosome involves doubling their expression in relation to their counterparts on female X chromosomes. Dosage compensation is an epigenetic process involving the specific acetylation of histone H4 at lysine 16 by the histone acetyltransferase Mof. Although Mof is expressed in both sexes, it only associates with the X chromosome in males. Its absence causes male-specific lethality. Mof is part of a chromosome-associated complex comprising male-specific lethal (MSL) proteins and at least one non-coding roX RNA. How Mof is integrated into the dosage compensation complex is unknown. Association of Mof with the male X chromosome is shown in this study to depend on its interaction with RNA. Mof specifically binds through its chromodomain to roX2 RNA in vivo. In vitro analyses of the Mof and Msl-3 chromodomains indicate that these chromodomains may function as RNA interaction modules. Their interaction with non-coding RNA may target regulators to specific chromosomal sites (Akhtar, 2000b).
The association of (MSL) proteins (Msl-1, Msl-2, Msl-3, Mof, Mle) and roX RNA with the male X chromosome has been visualized by immunofluorescence analysis of larval polytene chromosomes. Drosophila SL-2 cells can act as a model system for male features, because functional dosage compensation complex (DCC) has been purified from them, and they have also been used to study the post-transcriptional regulation of dosage compensation. The nuclear territory of the X chromosome in SL-2 cells can be visualized by immunostaining with antisera against Msl-1, Mof, the histone H4 isoform acetylated at Lys 16 (K16) and Mle. Msl-1 remains localized to the X chromosome after permeabilization of the cells and RNase treatment. In contrast, the bulk of Mof staining disappears from the X chromosome after RNase treatment. Mle, which interacts with larval polytene chromosomes in an RNase-sensitive manner, dissociates from the chromosome upon permeablization of the cells ; therefore, the RNase sensitivity of Mle's chromosomal association in this system could not be confirmed. Loss of Mof correlates with a reduction of the H4 acetyl-K16 histone isoform at the X chromosome, suggesting a high turnover of the modification under these conditions. These results suggest that the stable integration of Mof into chromosome-bound DCC involves an RNase-sensitive structure, and that the continued association of Mof with DCC is independent of Mle (Akhtar, 2000b).
Candidate RNAs that may contribute to the assembly and chromosomal association of the DCC are the roX1 and roX2 RNAs that are stably expressed in male but not female flies, and that colocalize with MSL proteins on the male X chromosome. The two RNAs have no effect on the integrity of DCC, because deletion of a single roX RNA has no phenotype, but mutation of both genes abolishes the interaction of MSL proteins with the chromosome. SL-2 cells express only roX2 RNA, which forms a complex with MSL proteins and is therefore an excellent candidate for an RNA involved on the targeting of Mof. To test whether Mof interacts with roX2 RNA in vivo, nuclear extracts were prepared from SL-2 cells and the soluble DCC was isolated by immunoprecipitation with antibodies specific for either Mof or Mle. The RNase sensitivity of interactions was tested by RNase treatment of extracts before immunoprecipitation. The immunoprecipitated complexes were characterized by western blot analysis. Under these conditions, Mof, Mle, Msl-3 and Msl-2 co-immunoprecipitate independently of RNase treatment, indicating that the DCC is held together by protein-protein interactions and/or that bridging RNA is protected from the nuclease attack. In the absence of RNase, roX2 RNA is readily detected in the immunopurified complex by reverse transcription of RNA followed by polymerase chain reaction with roX2 specific primers, whether the immunoprecipitation has been performed with antibodies specific for Mof or Mle (Akhtar, 2000b).
These experiments confirm earlier reports that roX2 RNA can be part of soluble DCC, but they raise the issues of how soluble DCC may differ from the chromosome-associated complex and whether Mof interacts with roX2 RNA directly or indirectly (for example, through Mle). To clarify the latter issue, use was made of the observation that the association of Mle with DCC is sensitive to elevated ionic strength. Under these stringent immunoprecipitation conditions, Mle is no longer associated with Mof, although Msl-2 and Msl-3 are still detectable. In the absence of Mle, a significant amount of roX2 RNA remains associated with Mof (Akhtar, 2000b).
Although these experiments rule out that Mle, to date the best candidate for an RNA interacting factor, bridges between Mof and roX2, the involvement of Msl-3 or other unknown protein(s) remain possible. To determine whether Mof interacts with RNA directly, the recombinant enzyme was studied in vitro. Electrophoretic mobility shift assay (EMSA) with fragments derived from roX1 reveals a nonspecific interaction of Mof with RNA. Competition experiments show that Mof interacts with RNA with high preference over DNA. Mof is unable to interact with DNA efficiently. A lack of specificity of the RNA interaction may be due to misfolding of the in vitro transcribed RNAs in the absence of chaperones or, most likely, to the use of arbitrary fragments of roX RNA, which lack optimum binding sites (Akhtar, 2000b).
The specific interaction of Mof with RNA should be distinguished from insignificant 'sticking', by determining whether Mof has a specific domain for RNA interaction. The RNA interaction domain was mapped by creating a series of recombinant Mof derivatives that are all active in chromatin binding and histone acetylation, and analysing their potential to interact with RNA. A Mof protein truncated at its amino terminus (N352) still interacts with RNA, and further deletion of the chromodomain (N518) abolishes this interaction. Since this result suggests that the chromodomain is involved in RNA binding, hydrophobic residues that are conserved in chromodomains from various origins were mutated. Mof derivatives with single amino-acid exchanges (W426G and Y416D) are unable to interact with RNA, whereas a point mutation in the acetyl CoA-binding site, G691E4, does not affect RNA binding. The W426G and Y416D mutations affect the structure of the chromodomain only locally because the mutant enzymes are still active histone acetyltransferases (HATs). Confirmatory results were obtained with an alternative binding assay involving roX1 RNA immobilized on streptavidin-sepharose beads (Akhtar, 2000b).
To establish the physiological relevance of the chromodomain for RNA interaction in vivo Mof derivatives were transiently expressed from a metallothioneine promoter in SL-2 cells. The addition of an N-terminal haemagglutinin (HA) epitope allowed the distinguishing of ectopic enzymes from endogenous Mof. Whole-cell extracts were prepared before or after induction of transgene expression with copper. Under stringent immunoprecipitation conditions with antibody directed against HA, roX2 RNA associates with HA-Mof. When expressed to the same level as intact Mof, both chromodomain mutants W426G and Y416D show an impaired interaction with roX2 RNA. Almost no roX2 RNA could be detected after immunoprecipitation of Y416D, whereas the effect of the W426G mutation was less severe, but still clear (Akhtar, 2000b).
To establish whether RNA interaction is a specific feature of the Mof chromodomain or whether it is a more general property of chromodomains, Msl-3, another dosage compensation protein containing two chromodomains was tested for interaction with RNA. Recombinant Msl-3 interacts efficiently with RNA, forming several complexes in the EMSA assay at higher concentrations. Competition experiments have established that Msl-3 interacts far better with RNA than with DNA. Even the carboxy-terminal chromodomain of Msl-3 (CD2) when fused to glutathione-S-transferase (GST) is able to interact with RNA but not DNA, whereas the GST moiety alone is inactive (Akhtar, 2000b).
These results suggest that the Mof chromodomain interacts with roX RNA in vivo, which may contribute to the integration of Mof into DCC at the male X chromosome. The association of Mof with DCC is a rather late step in the assembly of the complex and does not occur in the absence of Mle. The earlier incorporation of roX2 RNA also depends on Mle. The stable association of any one subunit with DCC may rely on multiple interactions with protein and/or RNA subunits, and an additional direct contact between Mof and Mle remains possible. Whether RNA interaction is a general property of chromodomains or restricted to a subfamily of the chromodomain superfamily remains to be seen. Chromodomains are important for the function of a number of chromatin regulators, but their modes of action have remained enigmatic. Although the related 'chromo shadow domain' of heterochromatin protein 1 (HP1) mediates interactions with several proteins and an interacting peptide has been identified, chromodomains have so far not been shown to contact proteins or peptides. Mutations of the clr protein analogous to the ones made in Mof abolish its silencing capacity (Ivanova, 1998). Small deletions of the polycomb protein containing these residues lead to its delocalization in SL-2 cells. Non-coding RNAs may be more commonly involved in organizing regulatory complexes than has been appreciated to date (Eddy, 1999). Identification of the RNA structure motif that determines the specific interaction with the chromodomain remains a challenge for the future. Interestingly, dosage compensation in mammals also involves a non-coding RNA, Xist, coating the inactive X. It is tempting to speculate that roX and Xist RNAs may target regulatory proteins to the X chromosomes of Drosophila and humans (Akhtar, 2000b).
Dosage compensation ensures equal expression of X-linked genes in males and females. In Drosophila, equalization is achieved by hypertranscription of the male X chromosome. This process requires an RNA/protein containing dosage compensation complex (DCC). RNA interference of individual DCC components has been used to define the order of complex assembly in Schneider cells. Interaction of MOF with MSL-3 leads to specific acetylation of MSL-3 at a single lysine residue adjacent to one of its chromodomains. Localization of MSL-3 to the X chromosome is RNA dependent and acetylation sensitive. The acetylation status of MSL-3 determines its interaction with roX2 RNA. Furthermore, RPD3 interacts with MSL-3 and MSL-3 can be deacetylated by the RPD3 complex. It is proposed that regulated acetylation of MSL-3 may provide a mechanistic explanation for spreading of the dosage compensation complex along the male X chromosome (Buscaino, 2003).
As in male flies, MSL-2 is central to the assembly of the dosage compensation complex in SL-2 cells, since its depletion by RNAi leads to disassembly of the complex. MOF protein requires prior assembly of MSL-1, MSL-2, and MSL-3, while the MSL-3 protein requires prior assembly of at least MSL-1 and MSL-2 proteins. Since in MOF dsRNA-treated cells approximately 10% of MOF protein can still be detected by Western blot analysis, it remains possible that small amount of MOF enzyme may be sufficient for MSL-3 localization to the X chromosome. In MOF mutant flies, MSL-3 protein localization on polytene chromosomes is restricted only to chromatin entry sites, as detected by immunostaining of the polytene chromosomes, suggesting that incomplete knockdown of MOF may be a plausible explanation for the apparent unaffected MSL-3 localization in these cells. However, it is also important to note that due to the limited size of SL-2 cells, it is difficult to resolve entry sites in SL-2 cells in comparison to the polytene chromosomes. Alternatively, MSL-3 localization to the X chromosome in MOF dsRNA-treated cells could also be a feature specific to SL-2 cells. Interestingly, depletion of MSL-2, MSL-3, or MOF led to dissociation of MLE from the X chromosome. It is therefore suggested that MLE localization is sensitive to the assembly of the rest of the complex in SL-2 cells (Buscaino, 2003).
MOF protein is associated with the X chromosome in an RNase-sensitive manner. This observation has been extended and it has been show that MSL-3 protein is also tethered to the X chromosome via RNA. MSL-3 interacts with rox2 in immunoprecipitation experiments and not with another nuclear RNA, suggesting that roX2 is a likely candidate for mediating this interaction in SL-2 cells. However, it remains plausible that an as yet unidentified RNA (or protein) may act as a bridge between MSL-3 and the X chromosome in vivo (Buscaino, 2003).
Surprisingly, it was found that association of MSL-3 with the X chromosome is not only sensitive to RNase treatment but also to TSA treatment. These results suggest that the acetylation status of MSL-3 protein is likely to be of a dynamic nature and that slight imbalance in cellular acetylation levels leads to dramatic consequences for the MSL-3 protein in vivo. This result is intriguing, since MOF, which is also capable of autoacetylation, did not show the same phenotype under these conditions. Whether autoacetylation of MOF leads to other effects remains possible (Buscaino, 2003).
In the case where the MSL-3 protein levels are reduced by MSL-3 dsRNA treatment, the localization of MOF protein to the X chromosome is severely compromised. This result at first sight appears contradictory to the observation that MOF localization to the X chromosome is unaffected upon TSA treatment for 30 min. However, it is important to note that incubation of SL-2 cells with TSA for periods longer than 4 hr also affects localization of the rest of the complex, including MOF, suggesting that the DCC as a whole is sensitive to overall acetylation levels within the cells. Furthermore, it remains possible that additional modifications of DCC factors contribute to the stability and dynamics of the complex as a whole. However, the findings strongly suggest that MSL-3 is particularly sensitive to acetylation changes and therefore follows a more rapid dissociation than the rest of the complex members in the conditions tested (Buscaino, 2003).
The results demonstrate that the MSL-3 protein is regulated by acetylation and that MOF acetylates a single lysine residue in MSL-3 in vitro. This finding underscores the stringent substrate specificity of MOF, consistent with the fact that MOF also acetylates only a single lysine in a nucleosomal substrate. A striking consequence of MSL-3 acetylation is the loss of its interaction with RNA and its failure to localize to the X chromosome. The sensitivity of acetylated MSL-3 protein seems to be specific for roX2 in vitro since neither nonspecific DNA nor nonspecific RNA binding was affected. The MSL-3 protein contains two chromodomains, and MSL-3 has been shown to bind RNA in vitro via its chromodomain; remarkably, the acetylation occurs next to one of its chromodomains (Buscaino, 2003).
Acetylation of the X chromosome seems to have a high turnover, and the dosage compensation complex members, particularly MSL-3, appear to be sensitive to changes in endogenous acetylation levels. It is therefore proposed that regulated acetylation of MSL-3 may cause a conformational change that leads to temporary loss of interaction with RNA from one of the chromodomains. A cycle of deacetylation may follow that will allow MSL-3 to contact RNA again on a nearby affinity site. By continuous cycle of acetylation and deacetylation, MSL-3, along with the rest of the complex, may be able to spread from a chromatin entry site. These findings also provide further insight into the essential nature of MOF histone acetyl transferase, which is required not only for acetylation of the X chromosome but also for regulation of other members of the complex. Association of RPD3 with the dosage compensation members provides strong supporting evidence for this hypothesis. Interestingly, RPD3 hypomorphic mutants show a reduced male to female ratio. Furthermore, the S. cerevisiae homolog of MSL-3, EAF3, part of the NuA4 complex that contains ESA1, copurifyies with yeast RPD3 using the tandem affinity purification (TAP) procedure. It therefore appears likely that this property of MSL-3/EAF3 is conserved and that MSL-3/EAF3 may act as a bridge between two different histone-modifying activities. The transient interaction of dosage compensation complex members with a histone deacetylase may be required for fine tuning the effects of hyperacetylation by MOF protein to achieve the proper level of dosage compensation (Buscaino, 2003).
In male Drosophila, histone H4 acetylated at Lys16 is enriched on the X chromosome, and most X-linked genes are transcribed at a higher rate than in females (thus achieving dosage compensation). Five proteins, collectively called the MSLs, are required for dosage compensation and male viability. Here it has been shown that one of these proteins, Msl1, interacts with three others, Msl2, MSL3 and Mof. The latter is a putative histone acetyl transferase. Overexpression of either the N- or C-terminal domain of Msl1 has dominant-negative effects, i.e. causes male-specific lethality. The lethality due to expression of the N-terminal domain is reduced if msl2 is co-overexpressed. Msl2 co-purifies over a FLAG affinity column with the tagged region of Msl1, and both MSL3 and Mof co-purify with the FLAG-tagged Msl1 C-terminal domain. Furthermore, the Msl1 C-terminal domain binds specifically to a GST-Mof fusion protein and co-immunoprecipitates with HA-tagged MSL3. The Msl1 C-terminal domain shows similarity to a region of mouse CBP, a transcription co-activator. It is concluded that a main role of Msl1 is to serve as the backbone for assembly of the MSL complex (Scott, 2000).
In general, the amino acid sequences of the MSLs suggest regions or domains within the proteins that could be important for function in vivo. Indeed, this has been confirmed by mapping loss-of-function mutations to the domain, such as the helicase domain of MLE, the putative acetylase domain of Mof and the RING finger region of Msl2. The amino acid sequence of Msl1 is the least informative, containing no recognizable domains, although regions rich in acidic amino acids and possible PEST sequences have been identified. To identify regions within Msl1 that are important for function in vivo, it was determined which regions have dominant-negative effects when overexpressed. Two regions of Msl1, one near the N-terminus and the other at the C-terminus, are likely to be important for assembly of the MSL complex in vivo, because overexpression of either region causes male-specific lethality. Genetic evidence, decreased male viability of msl2 heterozygotes and increased male viability by co-overexpression of Msl2, suggests that the region of Msl1 at the N-terminus interacts with Msl2. This has been confirmed by co-purification of Msl2 with FLAG-tagged versions of Msl1 over FLAG affinity columns. Similarly, the C-terminal region of Msl1 interacts with both Mof and MSL3. Furthermore, expression of the C-terminal domain results in significant loss of Mof from the male X chromosome (Scott, 2000).
The N-terminal FN region of Msl1 that binds to Msl2 was chosen originally for expression in flies because it was predicted that almost half of FN (amino acids 96-172) would form a two-stranded, alpha-helical, coiled-coil structure. Coiled-coil structures are comprised of a heptad repeat (abcdefg)n where hydrophobic residues occupy positions a and d on the same side of the alpha-helix. The coiled-coil motif of GCN4 mediates dimerization. If a similar structure mediates the formation of the Msl1-Msl2 heterodimer, then part of the region of Msl2 that interacts with Msl1 should form a coiled-coil structure. The Ring finger domain region of Msl2 interacts with Msl1. It is predicted that the region immediately preceding the RING finger could form a coiled-coil structure. It is particularly significant that several of the mutations that disrupt the interaction with Msl1 in yeast introduce amino acid changes that either significantly disrupt the alpha-helix (leucine to proline) or introduce a charged amino acid into the predicted hydrophobic face of the alpha-helix. The RING domain is found in a number of proteins, including the V(D)J recombination-activating protein RAG1. The crystal structure of the RAG1 dimerization domain, which includes the RING finger, reveals that dimerization is stabilized by interaction between alpha-helices that form a hydrophobic core. The RING finger is thought to form the structural scaffold upon which the dimer interface is formed. It is tempting to speculate, by analogy with RAG1, that the association of Msl1 and Msl2 involves the interaction of amphipathic alpha-helices that depend on the RING finger domain. This could best be addressed by determining the crystal structure of the Msl1-Msl2 complex (Scott, 2000).
In vitro translated Msl1 C-terminal domain co-immunoprecipitates with in vitro translated HA…MSL3 but not HA…Mof. Thus, C interacts directly with MSL3 but the interaction with Mof requires either another factor present in fly extracts or post-translational modification of Msl1 or Mof. While the possibility of a nucleic acid component of the FC-Mof complex cannot be ruled out, the possiblity (post-translational modification of Msl1 or Mof) is favored since a silver stain of FLAG affinity-purified FC-Mof complex separated by SDS-PAGE shows only two main bands corresponding to the sizes expected for FC and Mof. The C-terminal domain of Msl1 is rich in serine and threonine residues, and contains several potential phosphorylation sites and a predicted PEST sequence. PEST sequences have been suggested to contribute to the instability of the Msl1 protein. However, the role of these sequences in Msl1 has not been determined. Indeed, an alternative function for the PEST sequences is suggested by the observations that the PEST domains of PU.1 and IB are required for their respective interactions with Pip and c-Rel. In both cases, phosphorylation of a serine residue within the PEST sequence is required for the respective protein-protein interactions. The recent finding that a serine/threonine kinase is associated preferentially with the male X chromosome raises the possibility that Msl1 or another MSL is phosphorylated by this enzyme (Scott, 2000).
In the sequential model for assembly of the MSL complex, the first step involves the binding of the Msl1-Msl2 complex to several 'high affinity' sites on the male X chromosome. Since the localization of both Mof and MSL3 to the X chromosome requires mle+ function, this suggests that the association of Mof and MSL3 with the Msl1-Msl2 complex is Mle dependent. Mle could either bind directly to Mof and/or MSL3, or somehow stabilize the MSL complex together with roX-1 RNA. In support of the latter model, Mof and MSL3 bind directly to the C-terminal domain of Msl1. Furthermore, Mle does not co-purify with an FC-Mof-MSL3 complex over an affinity column. However, the affinity chromatography experiments were designed to maximize the likelihood of detecting protein-protein association and are not quantitative. It is possible that Mof and MSL3 may have a higher affinity for the C-terminal domain of Msl1 than full-length Msl1. Thus, one possible mechanism is that in vivo the C-terminal domain of Msl1 is not freely available to bind to Mof and/or MSL3, and that the binding of Mle to the Msl1-Msl2 complex causes a conformational change in Msl1, such that the C-terminal domain becomes more accessible (Scott, 2000).
Previous searches of the protein sequence database with the complete Msl1 sequence have failed to identify any significant similarities. However, when a search is carried out with just the C-terminal domain sequence, some similarity is found to a 254 amino acid region of mouse CBP. Although the similarity is not high, given that the similarity extends across almost the entire C-terminal domain of Msl1, and that both CBP and the Msl1 C-terminal domain bind to histone acetyl transferases (or putative histone acetyl transferases), it is thought that this homology may be significant. If this similarity reflects a conserved function, then it would be predicted that the Msl1-similar region of CBP, which has no known function, would associate with either an Mof-like histone acetyl transferase or an MSL3-like protein in mammalian cells (Scott, 2000).
It is not known how the MSL complex binds to the male X chromosome. None of the MSLs contain a recognizable DNA-binding motif. The F84 version of Msl1, lacking the first 84 amino acids, binds to Msl2, MSL3 and Mof but does not bind preferentially to the male X chromosome. This suggests that the male lethality that results from overexpression of F84 is due to this protein being able to bind to three MSLs, but not being able to bind to the X chromosome because the first 84 amino acids of Msl1 are required for recognition of the X chromosome. Alternatively, the lack of binding of F84 to the male X chromosome could be because the beginning of Msl1 is required for assembly of the MSL complex in vivo. However, if so, then it would be expected that F84 would have bound to the 'high affinity' sites since F84 does bind to Msl2. Assuming that Msl1 and Msl2 are the only components of the high affinity complex, it would then appear more likely that the first 84 amino acids of Msl1 are required for X chromosome binding rather than complex formation. However, there are several lines of evidence that suggest that the roX RNAs are part of the MSL complex, which raises the possibility that one or both of the roX RNAs could be part of the high affinity complex. Thus it will be of interest to determine if the MSL complex containing the F84 protein binds to roX RNA with a lower affinity than the complex containing full-length Msl1 (Scott, 2000).
Analysis of the Mof amino acid sequence reveals a putative acetyl-coenzyme A binding pocket close to the C terminus as well as zinc finger and chromodomain motifs in the central part of the protein. In the mof-1 allele, which gives rise to the male-specific lethality and leads to loss of acetylation of H4 at lysine 16 on male X chromosomes, glycine 691 is replaced by a glutamate in the presumed acetyl-coenzyme A binding pocket. In order to analyze the activity of Mof, the full-length protein was expressed in E. coli and purified to homogeneity. The mof-1 allele (hereafter referred to as Mof-G691E) was also produced as a control. To analyze the functional status of the recombinant enzymes, histone acetyltransferase assays were performed using recombinant histones and derivatives lacking the N- and C-terminal domains containing the physiological acetylation targets. Efficient incorporation of 3H-labeled acetyl groups into protein has identified full-length Mof as a robust histone acetyltransferase, whereas Mof-G691E is about ten times less active. Histones lacking their N-terminal 'tails' are only poor substrates. Autoacetylation of Mof may contribute to the residual incorporation of [3H]acetyl into protein in reactions containing truncated histones (Akhtar, 2000a).
Mof exhibits a preference for free histones, and shows a clear substrate preference for histone H4 but also acetylates H3 to some extent. No acetylation of histone H2A and H2B has been observed using the intact protein. However, an N-terminal deletion mutant of Mof that lacks the first 352 amino acids (Mof-N352) is able to acetylate histone H2A, suggesting that the N terminus of Mof may be involved in substrate selectivity. On Triton-Acid-Urea (TAU) gels, the acetylated histone H4 resolves into a mixture of isoforms with a preponderance of the monoacetylated isoform over di- and tri-acetylated species (Akhtar, 2000a).
In order to test Mof activity on a more physiological substrate, nucleosomal arrays were assembled on linear DNA coupled to paramagnetic beads using NAP-1 as a histone deposition vehicle, and this was used as a nucleosomal substrate for acetylation. The analysis of the acetylated histones on the TAU gel demonstrates that nucleosomal arrays serve as substrates for Mof and reveals an increased substrate specificity. When compared to the acetylation of free histones, predominantly monoacetylated histone H4 is obtained. To test directly whether Mof can acetylate lysine 16 (K16) of H4, the products of the acetylation reactions were subjected to Western blot analysis and probed with antibodies against specific histone isoforms. This analysis reveals a clear substrate preference of H4 lysine 16 over lysine 12, when either free histones or nucleosomal substrates are acetylated. The faint signal obtained using the H4K12 antibody is not due to the poor sensitivity of the antibody since the antibody easily detects acetylation of lysine 12 by HAT1. Taken together, these experiments demonstrate that recombinant Mof is able to monoacetylate histone H4 within a nucleosomal array, preferentially at lysine 16. and that the mof-1 mutation largely abolishes this activity. It is therefore very likely that Mof is directly responsible for the acetylation pattern observed on the male X chromosome (Akhtar, 2000a).
To determine whether the poor acetyltransferase activity of Mof-G691E is solely due to a defect in the catalytic mechanism or whether the interaction with the substrate is also affected, a series of substrate binding experiments were carried out. For this purpose, chromatin was assembled on linear DNA attached to paramagnetic beads using the Drosophila embryo extract system. The washed chromatin beads were then incubated with Mof or Mof-G691E; unbound enzyme was washed out, and bound protein was subjected to Western blot analysis. As a control, binding of the proteins to free DNA was assayed. Interaction of wild-type Mof with free DNA is barely detectable, while a small but significant amount of DNA-bound Mof-G691E is observed. In contrast, both wild-type and Mof-G691E are retained on the chromatin beads. Significantly, the interaction of Mof-G691E with chromatin is not impaired; rather, the mutant enzyme interacts consistently better with the chromatin substrate. The Mof-DeltaN352 deletion derivative is also able to interact with nucleosomes (Akhtar, 2000a).
In order to ensure that the Mof-nucleosome interaction is direct and not mediated by other factors endogenous to the chromatin assembly extract, an interaction of Mof with mononucleosomes, reconstituted by salt gradient dialysis from free histones, was sought. As before, an interaction of Mof with the nucleosome, but not with free DNA, was detected. Similar results were obtained when nucleosomal arrays were assembled from pure histones using NAP-1 as chaperone. These results demonstrate a direct interaction of Mof with the substrate and suggest that the phenotype of the mof-1 mutation is due to an impaired catalysis rather than a failure of interaction with the substrate (Akhtar, 2000a).
In vivo, acetylation of histone H4 at lysine 16 correlates with an increased transcriptional activity. The ability to acetylate reconstituted chromatin with recombinant Mof allowed for a direct test of whether acetylation of histone H4 is cause or consequence of transcriptional activity. A cell-free transcription system derived from Drosophila embryos has been shown to responded to the acetylation status of histones. However, in these experiments, histone H4 is tetra-acetylated and other histones are also modified. Nucleosome arrays were assembled on the immobilized template, and the extent of the array was monitored by micrococcal nuclease digestion. The immobilized array was washed in buffer and used as a substrate for acetylation by recombinant wild-type or mutant Mof. Following acetylation, the template was incubated with transcription extract and the resulting transcripts were monitored by primer extension. Assembly of nucleosomes leads to tight repression of an hsp26 template. Acetylation of lysine 16 of H4 with wild-type Mof leads to a remarkable derepression, whereas the acetylation-deficient enzyme is unable to facilitate transcription. If histones are acetylated prior to assembly and then used to reconstitute nucleosomes, the resulting chromatin is also active. Reconstitution of transcriptionally permissive nucleosomal arrays requires acetyl-CoA, in addition to a functional acetyltransferase. The bulk of the effect of Mof is apparently due to acetylation of histone H4 rather than modification of the transcription apparatus since transcription of naked templates is only marginaly affected by the acetylation reaction (Akhtar, 2000a).
The biochemical analysis suggests that Mof may function as a chromatin-specific activator in vivo. To directly test this hypothesis, Mof was targeted to a responsive promoter in yeast by fusing the enzyme to a heterologous Gal4 DNA-binding domain. A similar strategy has previously been used to target general transcription factors or transcription activators to promoters bearing Gal4 binding sites in vivo, making convenient use of the matchmaker system of plasmids and yeast strains (Clontech). In short, the cDNA encoding full-length Mof or Mof-G691E was fused in frame to the 3' end of the yeast Gal4 DNA-binding domain (amino acids 1-147) in an expression vector. These constructs were then transformed into a yeast strain in which the only his3 gene had been engineered to be under the control of the Gal1 promoter. Deletion of the endogenous Gal4 activator renders this promoter dependent on exogenous activators that contain a Gal4 DNA-binding domain. Under these circumstances, survival of yeast under conditions of histidine starvation can be directly correlated to the extent by which the his3 promoter is activated by factors interacting with the Gal4 binding site. A more quantitative assessment of his3 transcription can be obtained from experiments in which yeast are grown in varying amounts of aminotriazole, a histidine analog. Survival of yeast in medium lacking histidine but containing 25 mM aminotriazole and containing 60 mM aminotriazole is supported by the Gal4(1-147)-Mof fusion protein. In contrast, neither the Gal4(1-147)-G691E fusion nor the Gal4-DBD alone are able to activate the Gal1 promoter. These results indicate that Mof can serve as a transcription activator in yeast and that this function relies on an intact acetyl-CoA binding site. The data also suggest that targeted acetylation of histone H4 at lysine 16 suffices to reverse the inhibitory effects of native yeast chromatin (Akhtar, 2000a).
A cDNA fragment containing the putative Mof catalytic domain (aa 518 to 827) was expressed and it was determined that the recombinant peptide can acetylate Drosophila histones with a preference for histone H4. This pattern is similar to that for a related yeast protein, Esa1p. Active full-length Mof could not be expressed, Mof was isolated as a component of a partially purified MSL complex. Tissue culture cells were used for the initial characterization of the MSL complex. S2 cells are male, based on the following criteria: they do not express the Sxl (Sex-lethal) gene product, which is necessary for female differentiation, and they express Msl2, a limiting component of the dosage compensation machinery whose synthesis is prevented by Sxl. S2 cells can be stably transfected, allowing the use of commercially available antibodies recognizing epitope tags. Transient transfection of S2 cells with Msl2 tagged at its carboxy terminus with the HA epitope reveals that the localization of the HA epitope is coincident with the location of endogenous Mof. After selection with hygromycin, most cells exhibit HA staining on the male X chromosome, the location of which is revealed by antibodies to H4Ac16 (Smith, 2000).
Immunoprecipitation of nuclear extracts from Msl2-HA cells with the 12CA5 (anti-HA) antiserum results in the same proteins as those obtained from S2 cells with an Msl1 antiserum. In salivary gland nuclei, Mle is released from the male X chromosome with RNase treatment. Furthermore, the roX1 and roX2 RNAs are found along the X chromosome with a distribution that mimics that of the MSL complex. Therefore, attempts were made to obtain a partially purified complex containing Mle and a roX RNA and to see whether the presence of either of these components depended on the other. 'RNA-friendly' conditions were developed to increase the chances of purifying Mle and roX RNA-containing complex. The method involved a cell line expressing Flag-tagged MSL3 and sonication under low-salt conditions, immunoprecipitation with Flag antibodies followed by peptide elution, and a second immunoprecipitation with either an MSL antibody or with the corresponding preimmune serum. By using this two-step procedure, a faint band was detected by silver staining that corresponds to Mle protein. Clear enrichment of Mle was seen in the Msl1 immunoprecipitate relative to the preimmune serum. However, following a brief treatment with 0.4 M NaCl, the Mle levels were significantly reduced (Smith, 2000).
To determine if roX RNAs are expressed in S2 cells, Northern blot analysis was performed and it was observed that roX2, but not roX1, is expressed in these cells, consistent with the observation that roX1 is dispensable in flies. The size of the major roX2 transcript observed by Northern analysis was ~ 600 nucleotides. To test if roX2 RNA is present in the Mle-containing immunoprecipitates, RNA was extracted from the immunoprecipitation pellets and a RT-PCR was performed with roX2-specific primers in the linear range. The results show a clear enrichment of roX2 RNA in the immune over the preimmune serum precipitates (Smith, 2000).
The MSL complex specifically acetylates lysine 16 of histone H4. When MSL-containing immunoprecipitates were incubated with nucleosomal substrates, significant acetyltransferase activity toward histone H4 was detected. Msl1 immunoprecipitates from S2 nuclear extracts and 12CA5 immunoprecipitates from Msl2-HA nuclear extracts contain H4-specific acetyltransferase activity, while control immunoglobulin G or 12CA5 immunoprecipitates from S2 cells do not. To demonstrate that the acetyltransferase activity of the MSL complex is ascribable to Mof, complexes were purified containing either wild-type Mof or a protein produced by the mutant allele mof1. This allele is a point mutation resulting in a glycine-to-glutamic acid replacement at the most highly conserved residue of the acetyl-CoA binding domain (G691E). Wild-type Mof-HA or G691E Mof-HA were overexpressed in S2 cells and immunoprecipitated with anti-HA antibodies to obtain complexes with only transfected Mof fusion proteins. Immunoprecipitates from G691E cells have markedly reduced acetylation, consistent with the conclusion that Mof is the sole acetyltransferase in the MSL complex (Smith, 2000).
Given the specificity of the MSL complex toward H4, it was of intereset to determine which particular lysines were acetylated. When acetylated histones were separated by acid-urea gel electrophoresis, predominantly monoacetylated H4 was detected. A similar acid-urea gel was blotted to PVDF, and the mono-acetylated band was subjected to microsequencing. Counts were found at lysine 16, while other potential acetylation sites (at position 5, 8, or 12) were unlabeled. This result provides a causative link between the presence of histone H4 acetylated at lysine 16 and the MSL complex on the X chromosome in Drosophila males (Smith, 2000).
In this report, an examination was carried out of how mutations in the principal sex determination gene, Sex lethal (Sxl), impact the H4 acetylation and gene expression on both the X and autosomes. When Sxl expression is missing in females, the sequestration occurs concordantly with reductions in autosomal H4Lys16 acetylation and gene expression on the whole. When Sxl is ectopically expressed in SxlM mutant males, the sequestration is disrupted, leading to an increase in autosomal H4Lys16 acetylation and overall gene expression. In both cases relatively little effect is found on X chromosomal gene expression (Bhadra, 2000).
In SxlM males, in which Sxl is ectopically expressed, the slow accumulation of the Sxl protein during development eventually prevents significant Msl-2 expression and hence reduces the Msl complex association with the X chromosome. This results in X and autosomal gene expression quite similar to that found in the mle mutant, i.e., little response of the X-linked genes, but an overall increase in autosomal expression. Similarly, the association of the Msl proteins in ~50% of the cells of heteroallelic Sxl females causes sequestration of MOF to the two X chromosomes. This sequestration reduces the H4Ac16 on the autosomal loci, resulting in a lowered expression. There is a concomitant increase of acetylation on the X chromosome, but little overall response of the X-linked genes (Bhadra, 2000).
In the SxlM and Sxlf;mle genotypes a low level of Msl-1/Msl-2 shows chromosomal binding to some degree, although binding takes present in distinct patterns in the two cases. This low level of binding, however, is insufficient to sequester all the available Males absent on the first (Mof) present in the cell. Previous and present data suggest that in the absence of a functional Msl complex, Mof still associates with the chromosomes and is active in modifying H4. The reduced amount of Msl-1/Msl-2 appears saturated with Mof, allowing the remainder to be uniformly distributed across the genome, which modulates gene expression (Bhadra, 2000).
The general trends of X and autosomal gene expression in SxlM and Sxlf mutants match the published autoradiographic data when the latter is considered as absolute levels rather than relative X to autosomal ratios. Autoradiographic grain counts over the X chromosome were changed very little in SxlM larvae compared to normal, but the counts over the autosomes were increased. Conversely, in Sxlf females, the autosomal counts were lower than in normal females with little change over the X. Because the data reported here are anchored to rRNA levels, which in turn do not vary per unit of DNA, the 'per cell' expression trends can be determined on an absolute rather than a relative comparison basis; they indicate greater changes of the autosomes compared to the X chromosome (Bhadra, 2000 and references therein).
The loss of individual components of the Msl complex in the msl mutant males releases the Mof acetylase from the X and a uniform H4 acetylation distribution results. Accordingly, autosomal gene expression is generally increased, reflecting an inverse effect of the X on the autosomes, because the normally sequestered acetylase is now dispersed: this results in higher acetylation levels on the autosomes. An increase or decrease of acetylation level on the X is not reflected in major changes in gene expression, suggesting that some member of the Msl complex insulates genes on the X from responding to the much increased acetylation level (Bhadra, 2000 and references therein).
The collective data indicate that models that posit an association of the Msl complex with a gene for dosage compensation to occur are not supported. First of all, genetic destruction of the complex does not eliminate dosage compensation of most X-linked genes. However, one could perhaps argue that this action eliminates compensation of the regulatory genes on the X, which, because they are now dosage dependent, will compensate most of the 'housekeeping' genes that were assayed. This alternative is not favored for three reasons. (1) Ectopic expression of MSL2 in females as a transgene or in Sxlf mutants (present study) have the Msl complex present on their Xs but gene expression in general is not increased as predicted if the MSL complex alone conditions hyperactivation. (2) Dosage compensation also occurs in metafemales (3X chromosomes with diploid autosomes), where there is no complete complex and this compensation is related to that occurring in males. (3) Autosomal insertions of many X-derived genes still exhibit some degree of compensation despite the fact that these genes have no association with the Msl complex. Thus, there are several circumstances known in which compensation occurs without the Msl complex. The function of the Msl complex on the X chromosome appears to be to inhibit the response of most X-linked genes to high levels of histone acetylation (Bhadra, 2000 and references therein).
When these data are taken together along with previous studies on mle, a consistent model is supported, indicating that the effect of the Sex lethal gene is mediated through its control of the presence or absence of Msl-2. When Msl-2 protein is expressed, the sequestration of the MSL complex occurs with a resultant increase of H4Lys16 acetylation on the X at the expense of acetylation on the autosomes. In the absence of MSL-2, there is a uniform genomewide distribution of Mof and H4Lys16 acetylation. In general, gene expression on the autosomes responds positively to the level of acetylation, but the X is refractory to it in the presence of the Msl complex. In this way, the twofold inverse-dosage effect of the X is used to achieve a proper level of dosage compensation, but the effect on the autosomes is diminished. Thus, as the heteromorphic sex chromosomes have evolved, both the X and the autosomes have maintained nearly equal expression between the sexes (Bhadra, 2000).
Dosage compensation in Drosophila is mediated by a multiprotein, RNA-containing complex that associates with the X chromosome at multiple sites. Investigated were the roles that the enzymatic activities of two complex components, the histone acetyltransferase activity of Mof and the ATPase activity of Mle, may have in the targeting and association of the complex with the X chromosome. Mle and Mof activities are necessary for complexes to access the various X chromosome sites. The role that histone H4 acetylation plays in this process is supported by the observations that Mof overexpression leads to the ectopic association of the complex with autosomal sites (Gu, 2000).
The normal association of the MSL complex at hundreds of sites along the X chromosome appears to be a process with at least three major steps. The first is the formation of functional complexes at the two entry sites where the roX RNAs are transcribed. It should be noted that although the Msl1 and Msl2 proteins are able to access the X chromosome at the entry sites and to recruit Mle, further complex assembly can only occur in the presence of the roX RNAs. This contention is supported by the observation that, in the absence of the two roX genes, no complex is seen to form in embryonic stages where it is normally evident. A caveat is that removal of roX2 was accomplished by using a deletion of such size that other roX-like genes or other unidentified components of the complex or genes whose product is required for complex stability, closely linked to roX2, may have been deleted as well. In any event, since the roX RNAs are unstable unless they are associated with the complex, the process of assembly can proceed only at the entry sites containing the roX genes. Once complexes are formed, they access the X chromosome through all of the entry sites, presumably via the affinity of their Msl1/Msl2 components for these sites. Finally, the complexes spread from the entry sites to the many other sites along the X chromosome where they are normally found. This last step requires the histone acetyltransferase activity of Mof. It is suggested that the spreading process involves the acetylation of neighboring nucleosomes, thereby altering the conformation of adjacent chromatin and rendering it more accessible to the entry of additional MSL complexes. The latter may require the presence of acetylated histone H4 tails in order to stabilize their chromatin association. This conclusion is consistent with the observation that, in S2 cells overexpressing Mof, the resulting abnormal ectopic acetylation of histone H4 at Lys16 leads to the association of the MSL complex along autosomal chromatin. This may mirror the normal situation in vivo where complexes, initially attracted to the entry sites, acetylate histone H4 at Lys16 and thereby make adjacent chromatin regions accessible to more complexes. The affinity of the MSL complex for histone H4 tails implied in this model is reminiscent of a similar role played by histone tails in the spreading of complexes containing SIR2, 3 and 4 during the silencing of mating type loci and telomeric heterochromatin formation in yeast. Although critical to the spreading process, the role played by the ATPase function of Mle, either directly or in conjunction with roX RNA, is not sufficiently understood to be incorporated in the model (Gu, 2000).
It is thought that the process just described can provide the following explanations for the gaps in Msl binding that occur along the X chromosome, or at ectopic autosomal sites where the complex has been caused to form at the site of a roX transgene. It is possible that the spread of H4 acetylation and complex association may be stopped by some insulator or some as yet uncharacterized boundary elements. This would not necessarily require that the entry sites be entirely responsible for the pattern seen along the X chromosome. The interphase chromosome is believed to consist of a series of rosettes formed by loops of the chromatid fiber anchored to a central core by dispersed regions that have affinity for one another. In such an arrangement, a cluster of complexes that have been stopped by some boundary element could acetylate the nucleosomes on a neighboring loop, initiating a spreading process on the other side of a gap (Gu, 2000).
The above considerations raise a number of questions that remain to be resolved. Is the pattern of complex association on the X chromosome tissue specific? Is it dependent on a tissue-specific distribution of the entry sites (other than those containing the roX loci, which must remain invariant in all tissues)? Is the tissue-specific distribution established when the complex first forms in early embryogenesis and is the pattern perpetuated through the mitotic divisions that give rise to a particular tissue? To answer these questions will require a thorough melding of cytological and biochemical approaches (Gu, 2000).
The male-specific-lethal (MSL) proteins in Drosophila melanogaster serve to adjust gene expression levels in male flies containing a single X chromosome to equal those in females with a double dose of X-linked genes. Together with noncoding roX RNA, MSL proteins form the 'dosage compensation complex' (DCC), which interacts selectively with the X chromosome to restrict the transcription-activating histone H4 acetyltransferase MOF (Males-absent-on-the-first) to that chromosome. MSL3 is essential for the activation of MOF's nucleosomal histone acetyltransferase activity within an MSL1-MOF complex. By characterizing the MSL3 domain structure and its associated functions, it has been found that the nucleic acid binding determinants reside in the N terminus of MSL3, well separable from the C-terminal MRG signatures that form an integrated domain required for MSL1 interaction. Interaction with MSL1 mediates the activation of MOF in vitro and the targeting of MSL3 to the X-chromosomal territory in vivo. An N-terminal truncation that lacks the chromo-related domain and all nucleic acid binding activity is able to trigger de novo assembly of the DCC and establish an acetylated X-chromosome territory (Morales, 2005).
The MSL1 interaction surface maps to the C-terminal half of MSL3. This part of MSL3 is characterized by similarities to the MRG domain that subsumes MRG15, MSL3, and related proteins in multiple species into the so-called MRG family. The msl3 gene is related to the Drosophila mrg15 gene, suggesting an early gene duplication event. Accordingly, MRG sequences in MSL3 are highly conserved between D. melanogaster and Drosophila virilis. The MRG domain consists of three blocks of strong sequence similarity separated by short amino acid stretches of lesser conservation. Interestingly, these 'linker' regions harbor rather long insertions in MSL3 of flies and humans. The C terminus of MSL3 may thus be organized by folding of MRG signature sequences, which are disconnected in the primary sequence, into a compact unit from which the MSL3-specific structures 'loop out.' Consistent with this idea, it was found that every deletion in the C terminus of MSL3 compromises interaction with MSL1. Most of these deletions affect at least one of the blocks of MRG sequence similarity, most likely leading to global misfolding. However, one deletion that abolishes MSL1 binding (Delta328-433) selectively removed MSL3-specific sequences between two MRG blocks. There is considerable conservation of these sequences in the Drosophila species for which sequence information has recently become available, suggesting a conserved function, but whether this sequence contains a dedicated MSL1 interface remains to be explored. In any case, this analysis suggests that the MRG sequence similarity reflects a functional domain. The MRG-MSL1 contact is essential for targeting MSL3 to the X-chromosomal territory, confirming the functional importance of the interactions defined in vitro. It is suggested that MRG modules in other MRG family members may also constitute protein-protein interaction units (Morales, 2005).
In vitro analysis showed that MSL3 interacts better with single-stranded nucleic acids than with dsDNA. The significance of ssDNA interaction, if any, is unclear at the moment. In contrast, there is evidence that MSL3 interacts with roX RNA in vivo and in vitro, but the domain involved in RNA binding had not been defined. Biochemical analysis demonstrates that the nucleic acid binding structures reside in the N-terminal half of MSL3, which also contains the CRD. Previously, it has been suggested that RNA interaction of MSL3 is affected by its acetylation at lysine 116, close to the CRD. In the current studies, a fragment comprising the first 140 amino acids (and hence the CRD as well as K116) was not sufficient for nucleic acid binding, but sequences up to amino acid 259 contributed significantly. To what extent the CRD of MSL3 contributes to RNA binding needs to be established. The CRDs of MSL3 and MOF appear more related to each other than to canonical chromodomains. They lack the alpha-helix supporting the ß-sheet bundle and aromatic residues that may be involved in recognition of methylated histone N termini. The CRD of MOF also appears not to be sufficient for RNA binding. A further interesting similarity between MOF and MSL3 is that nucleic acid interactions are not the primary targeting determinant for either MOF (Morales, 2004) or MSL3. Although impairment of the CRDs leads to somewhat increased binding of the corresponding GFP fusion protein to autosomes, their concentration on the X-chromosomal territory is still obvious. However, the CRDs and noncoding RNA may have functions that are not assayed for in simple recruitment experiments. It is also possible that the CRDs of MOF and MSL3 provide partially redundant functions for DCC assembly. In contrast, mutations in MOF or MSL3 that abrogate their interaction with the C terminus of MSL1 prevent faithful recruitment to the X chromosome. Obviously, the recruitment assay employed may just reveal the strongest binary interaction that MSL3 or MOF are involved in. However, the fact that overexpression of an MSL3 lacking all nucleic acid binding capacity was able to complement an MSL3 deficiency and to trigger the accumulation of MOF and H4K16 acetylation on the X-chromosomal territory emphasizes the importance of the MSL protein interactions for the assembly of a functional DCC (Morales, 2005).
MSL complexes can be formed in vitro in the absence of RNA. A deficiency of roX RNA in vivo can be partially overcome by overexpression of the 'platform' proteins MSL1 and MSL2. It is possible that transient overexpression of MSL3 overcomes the RNA requirement and that under normal conditions of limiting MSL protein concentrations RNA is required for faithful DCC assembly (Morales, 2005).
The remarkable stimulation of MOF's HAT activity upon association of MSL3 with an MSL1-MOF complex was not due to enhanced binding of MSL3 to nucleic acids but rather required contact of MSL3 with the MSL1 scaffold. MOF and MSL3 are brought into proximity by interaction with adjacent structures in the C terminus of MSL1 (Morales, 2004). It is possible that the MSL1 scaffold stabilizes an otherwise transient and therefore nonproductive direct contact between MSL3 and MOF (Morales, 2004). The existence of such a contact has been inferred from the fact that MSL3 can be acetylated by MOF. However, when it comes to acetylation, MSL1 is a much better substrate for MOF than MSL3 (Morales, 2004). The new data reinforce a previous model of an acetylation 'checkpoint' built into DCC assembly. Accordingly, the regulatory potential of H4K16 acetylation would only be fully realized upon binding of MOF with MSL1 and the completion of the complex by association of MSL3 (Morales, 2004). Such a checkpoint would render full activation of MOF dependent on proper DCC assembly and hence 'maleness' and serve to restrict the critical epigenetic mark to the X chromosome (Morales, 2005).
The dosage compensation complex (DCC) in Drosophila melanogasteris responsible for up-regulating transcription from the single male X chromosome to equal the transcription from the two X chromosomes in females. Visualization of the DCC, a large ribonucleoprotein complex, on male larval polytene chromosomes reveals that the complex binds selectively to many interbands on the X chromosome. The targeting of the DCC is thought to be in part determined by DNA sequences that are enriched on the X. So far, lack of knowledge about DCC binding sites has prevented the identification of sequence determinants. Only three binding sites have been identified to date, but analysis of their DNA sequence did not allow the prediction of further binding sites. Chromatin immunoprecipitation was used to identify a number of new DCC binding fragments and characterized them in vivo by visualizing DCC binding to autosomal insertions of these fragments, and it has been demonstrated that these fragments possess a wide range of potential to recruit the DCC. By varying the in vivo concentration of the DCC, evidence is provided that this range of recruitment potential is due to differences in affinity of the complex to these sites. It was also established that DCC binding to ectopic high-affinity sites can allow nearby low-affinity sites to recruit the complex. Using the sequences of the newly identified and previously characterized binding fragments, a number of short sequence motifs have been uncovered, that in combination may contribute to DCC recruitment. These findings suggest that the DCC is recruited to the X via a number of binding sites of decreasing affinities, and that the presence of high- and moderate-affinity sites on the X may ensure that lower-affinity sites are occupied in a context-dependent manner. Bioinformatics analysis suggests that DCC binding sites may be composed of variable combinations of degenerate motifs (Dahlsveen, 2006).
Using a ChIP strategy, several new DCC binding fragments have been identified and it has been demonstrated that they possess a wide range of potential to recruit the DCC. Because the majority of the isolated candidate fragments co-map with endogenous DCC binding sites at the resolution afforded by staining of polytene chromosomes, it is believed that the ChIP selection procedure is appropriate. By tuning DCC levels in vivo, it was concluded that the difference in recruitment ability is due to different affinity of the DCC for these fragments. At limiting concentrations of complex, only the sites of highest affinity are occupied. Conversely, at non-physiologically high concentrations of DCC, even 'cryptic' binding sites on autosomes are recognized by the complex. This suggests, in accord with previous observations, that selective interaction of the DCC with the X chromosome is a function of tightly controlled levels of complex components that are adjusted to assure interaction with binding sites of varying affinity clustered on the X, but insufficient to occupy cryptic sequences on autosomes. These data are also in broad agreement with observations that numerous sites on the X chromosomes contain DCC binding determinants. These determinants are not all equal, but represent a diverse set of DCC targets that differ by a wide range of affinities for the complex, as expected from a sequence determinant that during evolution became gradually enriched on the X chromosome (Dahlsveen, 2006).
The use of the term 'chromatin entry sites' for the subset of DCC binding sites that are still occupied by partial complexes in the absence of MSL3, implies that these sites were somehow qualitatively and perhaps functionally distinct from the remaining sites that only attract the intact complex. Although it is possible that not all DCC binding sites are functionally equivalent, the characterization of several new examples of both types of DCC binding sites suggests support for the 'affinities model'. According to this model, 'chromatin entry sites' are not qualitatively different from other sites, but only represent those sites with the highest affinity for the complex. A prediction from this model that is further substantiated by the results is that non-functional complexes that lack MSL3 or the acetyltransferase activity of MOF have lower affinity for target sites. Only those determinants with highest affinity for the DCC are able to recruit partial complexes in the absence of MSL3. Sites with slightly lower affinity are still able to recruit the complex in the mof1 mutant. Because the interaction of the DCC with the X chromosome is thought to be largely mediated by MSL1 and MSL2, it remains to be explored whether MSL3 and the acetylase activity of MOF affect the active concentration of MSL1 and MSL2 or lead instead to the adoption of a high-affinity conformation of the complex. Conversely, it remains to be seen if over-expression of MSL1 and MSL2 in the msl-31 and mof1 mutants would allow partial complexes to bind additional sites. In this respect it is intriguing that the mutation of both roX RNAs, which is presumed to lead to incomplete and non-functional complexes, can be partially rescued by the over-expression of MSL1 and MSL2 (Dahlsveen, 2006).
During analysis of DCC recruitment to high-affinity sites inserted into autosomes of wild-type males, an additional band of DCC binding was observed close to the insertion site in three independent cases (one insert each of DBF9, DBF5, and DBF7). Such minimal and rare 'spreading' has previously been observed for ectopic insertions of the 18D high-affinity site and from roX transgenes in the wild-type male background. This study now reveals that these additional DCC binding sites are not a result of random spreading, but are most likely due to interaction of the DCC with one of the low-affinity sites on autosomes that happened to reside close to the insertion site. These sites are usually observed only when the DCC concentrations are globally increased by over-expression of MSL1 and MSL2. Accordingly, it is suggested that the autosomal insertion of a high-affinity DCC binding site leads to a local rise in complex concentration, which allows these low-affinity sites to be recognized by the DCC even in wild-type males. However, additional requirements must clearly be met to allow low-affinity sites to profit from local increases in complex concentration, since not all ectopic high-affinity sites support the phenomenon. Permissive conditions may include active transcription or the presence of specific epigenetic marks (Dahlsveen, 2006).
It is envisioned that the clustering of DCC binding determinants of high and intermediate affinity on the X chromosome (combined with the transcription of the roX RNAs) elevates the concentration of the DCC within the X chromosomal territory and ensures the occupancy of lower-affinity sites in a context-dependent manner. This may explain the observation that autosomally derived transgenes often acquire dosage compensation. The transgenes may contain cryptic DCC binding determinants and may thus acquire binding if placed in the context of the X chromosomal territory. Conversely, an X chromosomal fragment that harbors only low-affinity sites may not be recognized if translocated to an autosomal context, and the fragment DBF3 may be an example for such a scenario. The presence of a large number of low-affinity sites may also contribute significantly to restricting the binding of the DCC to the X chromosome (Dahlsveen, 2006).
The term 'spreading' has been used to describe the appearance of additional bands of DCC binding around autosomal insertions of roX cDNAs or fragments derived thereof. However, extensive, long-range spreading from roX transgenes, which leads to the appearance of many ectopic DCC bands at greater distances from the insertion sites, occurs only under unusual conditions and depends on the transcription of the roX RNA rather than the DCC binding sites on DNA. Long-range spreading of the complex also does not occur into autosomal chromatin translocated to the X chromosome. It is suggested that large translocations maintain their original chromosomal context (DCC enriched or not), and therefore no redistribution of DCC over the new chromosomal junction is observable at the resolution of the polytene chromosomes. Importantly, this study does not address the higher-resolution distribution of the DCC within a chromosomal band. It is possible that such a band contains many individual binding sites, also of varying affinity. At this resolution, the term “spreading” may characterize the local diffusion of the DCC from high- to low-affinity sites. This study does not exclude this type of spreading, or indeed any other kind of complex distribution within a chromosomal band. High-resolution ChIP analyses will be necessary to resolve the detailed nature of DCC distribution (Dahlsveen, 2006).
Previously, only three high-affinity binding sites for DCC were known. This study identified nine more fragments, and this encouraged investigation of common features within a larger pool. Interestingly, all new DBFs were found to map to gene-rich regions and either overlap with or lie close to essential genes. Three high-affinity fragments (DBF12, DBF9, and DBF6) reside entirely within genes. It is possible that specific recruitment sites, such as those inferred to reside within the DBFs, have been enriched in and around genes that require dosage compensation during evolution, and consequently, high-affinity sites may represent loci that are particularly dosage sensitive. Previous experiments indicated that the DCC tends to bind to the coding regions of genes, and it was suggested that this was linked to transcriptional activity. Although recent observations suggest that transcriptional activity alone is not sufficient to attract DCC binding, it is possible that transcription influences DCC recruitment to specific sites. For example, high-affinity sites, which show consistent and strong recruitment of the DCC at many chromosomal positions, may not be influenced by transcription. However, sites with lower affinity and variable recruitment ability may profit from transcriptional activity. Developmental differences in transcriptional activity may therefore also explain the lack of DCC recruitment in salivary glands to fragments isolated by ChIP from embryos (Dahlsveen, 2006).
This study has attempted to identify common sequence elements within previously characterized and new high-affinity DCC binding fragments and have uncovered a number of short sequence elements, whose clustering in combinations could contribute to DCC recruitment. Clearly, the importance of these elements remains to be tested experimentally. Previous analysis of the roX DCC binding sites identified a 110 bp sequence containing several blocks of conservation between roX1 and roX2. DCC binding was affected by mutation in several of the conserved blocks, indicating that DCC binding sites may be made up of combinations of shorter elements. Such combinations have be sought by defining pairs of elements found within a 200 bp window in the high-affinity DCC binding fragments. Those pairs that are significantly enriched on the X chromosome compared to other chromosomes are presented. Importantly, these X-enriched pairs often occur in multiple copies in the high-affinity fragments and at higher frequencies compared to the lower-affinity fragments DBF9-A, DBF1, DBF11, DBF13, and DBF3. Nonetheless, there is no obvious correlation between the location of individual pairs on the X and any specific features such as predicted genes. It is hypothesized that the elements that define these pairs (and other such elements that may have escaped attention) correspond to building blocks of DCC binding sites. Accordingly, a DCC binding site of given affinity for the complex would not be determined by a unique DNA sequence, but by clustering of variable combinations of short, degenerate sequence motifs. Individual low-affinity binding sites may not be unique to the X, but their clustering on the X may contribute to high-affinity binding. There are already indications that the DCC binds to several sites in close proximity. The two parts of DBF9, DBF9-A and DBF9-B, are both able to recruit the DCC, albeit with different affinity. The analysis of the 18D high-affinity fragment also suggested that multiple elements over 8.8 kb contribute to the binding of the complex (Dahlsveen, 2006).
The pairs have been ordered according to sequence similarity. Interestingly, a large family of elements contain GAGA-related motifs. Mutation of GAGA or CTCT motifs in the 110 bp roX1/roX2 consensus severely affects DCC recruitment to that sequence, indicating that GAGA motifs are involved in DCC binding. The fact these elements enriched in several independently identified high-affinity fragments demonstrates the appropriateness of the algorithms used to find them. Besides elements with a clear relationship to GAGA motifs, several other element families were identified defined by sequence similarity. In order to visualize the element families, the related words may be aligned such that sequence logos representing degenerate motifs can be derived using the WebLogo software (http://weblogo.cbr.nrc.ca). It is considered possible that some of these degenerate motifs may contribute to DCC binding sites. Evaluation of the contributions of these novel motifs to the targeting of the complex will require increased resolution analysis and systematic evaluation of candidate sequences in the in vivo recruitment assay (Dahlsveen, 2006).
This study suggests that high-affinity DCC binding sites are composed of variable combinations of clustered, degenerate sequence motifs. The degeneracy of the sequence motifs indicates that many individual elements may have low affinity. Therefore, the interaction of the DCC with each individual site should be in dynamic equilibrium. However, it was recently observed by photobleaching techniques that the DCC components most likely involved in chromatin binding, MSL2 and MSL1, interact with the X chromosomal territory in cultured cells in an unusually stable manner, which is not compatible with binding equilibria involving off-rates that commonly characterize protein-DNA interactions. Several hypotheses can be formulated, whose evaluation may lead to resolution of this apparent contradiction. (1) Formation of higher-order structures involving many DCC components engaged in numerous simultaneous DNA interactions may lead to a trapping of the DCC within the X chromosome territory. (2) An initial sequence-directed targeting event may be followed by a stabilization of the interaction through positive reinforcement involving additional principles, such as epigenetic marks or a topological linkage. (3) It is considered that the arrangement of the interphase genome in polytene chromosomes may differ in a relevant aspect from the more compact chromosomal territories of diploid cultured cells. Ultimately, the identification of the DNA-binding domains of DCC components and analysis of their mode of DNA interaction will be required to solve the targeting issue (Dahlsveen, 2006).
Dosage compensation in Drosophila is dependent on MSL proteins and involves hypertranscription of the male X chromosome, which ensures equal X-linked gene expression in both sexes. This paper reports the purification of enzymatically active MSL complexes from Drosophila embryos, Schneider cells, and human HeLa cells. A stable association of the histone H4 lysine 16-specific acetyltransferase MOF was found with the RNA/protein containing MSL complex as well as with an evolutionary conserved complex. The MSL complex interacts with several components of the nuclear pore, in particular Mtor/TPR and Nup153. Strikingly, knockdown of Mtor or Nup153 results in loss of the typical MSL X-chromosomal staining and dosage compensation in Drosophila male cells but not in female cells. These results reveal an unexpected physical and functional connection between nuclear pore components and chromatin regulation through MSL proteins, highlighting the role of nucleoporins in gene regulation in higher eukaryotes (Mendjan, 2006).
All Drosophila MSL proteins have mammalian orthologs. To address the evolutionary conservation, the human hMOF-containing complexes were purified from a stable HeLa cell line expressing hMOF tagged with one haemagglutinin (HA) and two FLAG epitopes (HA-2xFLAG-hMOF). The characterization of the interacting proteins revealed striking similarities in the complex composition between flies and humans (Mendjan, 2006).
Copurification of mammalian MSL orthologs showed that DCC is an evolutionary conserved protein complex. hMSL1, hMSL2, and hMSL3 were all present in the hMOF complex. Similar to Drosophila DCC, RNA helicase A (the ortholog of MLE) was not present in the complex, which is consistent with previous observations. Furthermore, two isoforms of hMSL3, hMSL3a and hMSL3c, were identified, copurifying with hMOF. The former represents the full-length protein, while the latter is an alternative splice isoform lacking the N-terminal chromobarrel domain (Mendjan, 2006).
In addition to the MSL proteins, most of the other proteins copurifying with TAP-MOF were also found in the hMOF complex. Z4 and Chriz/Chromator (Chr) lack clear mammalian orthologs, which could explain their absence. However, the Mtor ortholog TPR was identified in the HA-2xFLAG-hMOF purification. Human-specific proteins included the transcriptional coactivator HCF-1, O-linked N-acetylglucosaminetransferase OGT, and the forkhead and FHA domain containing transcription factor ILF-1/FOXK2. Interaction of hMSL3, hNSL1, hNSL2, hNSL3, and HCF-1 was further confirmed by Western blot analysis of eluted complex. Similar to the TAP-MOF and MSL-3FLAG complexes, the HA-2xFLAG-hMOF complex specifically acetylated histone H4 at lysine 16 on mononucleosomes (Mendjan, 2006).
Taken together, the data demonstrate that MOF interactions are evolutionary conserved and that the DCC is an evolutionary ancient complex that acetylates histone H4 at lysine 16 (Mendjan, 2006).
The purification of the MSL complex revealed quite an unusual complex composition. One would expect that a complex thought to modulate transcription and/or chromatin structure would contain a significant number of classical transcription factors, some of the numerous components associated with RNA polymerase II, or at least subunits of the ubiquitous chromatin remodeling and modifier complexes. However, none of these components was found. Instead, there seems to be a core MSL complex that interacts substoichiometrically with nucleoporins (Mtor, Nup153, Nup160, Nup98, and Nup154), interband binding proteins (Z4, Chromator/Chriz), and exosome components (Rrp6, Dis3) (Mendjan, 2006).
The results suggest that MOF is a subunit of two independent complexes in mammals and fruit flies. Several lines of evidence support this notion. This includes coimmunoprecipitation experiments and glycerol gradient centrifugation. Furthermore, hMOF was recently found in the MLL1 methyltransferase complex together with HCF-1, MCRS2, WDR5, NSL1, and PHF20, but this complex did not contain hMSL1. Finally, purification of the hMSL3 complex provides further evidence that hMSL3 does not associate with many of the MOF-interacting proteins. Therefore, it is suggested that the NSL complex contains at least MOF, NSL1, NSL2, NSL3, MCRS2, MBD-R2, and WDS, and in humans also HCF-1 and OGT (Mendjan, 2006).
The results presented here also suggest a molecular mechanism as to how the MOF complexes bifurcate. Both MSL-1 and NSL1 contain a PEHE domain in their C terminus. The NSL1 PEHE domain interacts directly with hMOF in vitro, and Drosophila MSL-1 has been shown to interact directly with MOF through the same domain. Furthermore, MSL-1 is required for full activity of MOF in vitro and for the assembly of the DCC on the male X chromosome. MSL-1 and NSL1 are the only two genes with a PEHE domain in the Drosophila genome, suggesting that it is an evolutionary conserved MOF-interacting domain. It is postulated that MSL1 and NSL1 serve as mutually exclusive bridging factors that assemble two different complexes around MOF, a histone H4 lysine 16-specific acetyltransferase (Mendjan, 2006).
In the current study, focus was placed on the mechanism of DCC function in Drosophila. All three purifications resulted in enzymatically active complexes with consistent copurification of MSL-1, MSL-2, MSL-3, MOF, roX1, and roX2 but not of MLE or JIL-1. The absence of MLE was expected, since its interaction with MSLs has reported to be salt and detergent sensitive. It is likely that JIL-1, like MLE, is sensitive to the purification conditions used in this study (Mendjan, 2006).
To examine the function of the new interacting proteins in dosage compensation, mutant flies were studied and RNAi was used in cell culture. In Z4 mutants or in MBD-R2-depleted SL-2 cells, MSL localization on the X chromosome was not affected. Consequently, these proteins are not required for MSL recruitment, or they have an alternative function with MOF that is independent of its role in dosage compensation (Mendjan, 2006).
However, an unexpected link was discovered between dosage compensation and the nuclear pore. Depletion of either Mtor or Nup153 but not of other nucleoporins or NXF1 delocalized MSL proteins from the X chromosome. The effects observed were not due to a general transport defect, since all the five MSL proteins and roX2 RNA remained nuclear in Mtor- and Nup153-depleted cells, and no accumulation was observe of bulk mRNA in these cells. Consistent with these observations, Mtor and Nup153 are required for proper dosage compensation of several classical MSL-dependent dosage-compensated genes in SL-2 cells. The expression of these genes was not affected in female Kc cells (Mendjan, 2006).
An important question raised from this study is whether the observed effects are due to a soluble fraction of Mtor and Nup153 in the nucleus or due to their function as components of the NPC. The latter is favored: (1) Nup153 staining is exclusively peripheral; (2) depletion of Nup153 delocalizes Mtor from the nuclear periphery and increases the soluble pool of Mtor in the nucleoplasm, but MSL proteins still remained delocalized in Nup153-depleted cells; (3) the fact that several nucleoporins, which exist together only at the nuclear pore, were copurified with the MSL complexes strongly favors the idea that there is an interaction between the DCC and the intact NPC. This interaction is substoichiometric but with clear functional importance for DCC assembly or maintenance on the X chromosome (Mendjan, 2006).
A wealth of information has been generated in budding yeast regarding nuclear organization and gene regulation. For instance, yeast telomeres associate with the nuclear periphery and form a transcriptionally silenced chromatin domain. However, a number of recent studies have shown that nuclear periphery is not just a domain of gene inactivation but also of activation. Consistent with these observations, yeast MLP1 and MLP2 (Mtor orthologs in yeast) associate with transcriptionally active genes and are involved in relocalization of active genes to the nuclear periphery. Furthermore, MLPs are involved in chromatin domain formation and pre-mRNA quality control (Mendjan, 2006 and references therein).
Interestingly, in Schneider cells, male embryos, salivary glands, and imaginal discs, the Drosophila male X chromosome appears localized at or near the nuclear periphery and in most cases even follows the nuclear rim curvature. The inactive X in mammals also localizes close to the nuclear periphery as the Barr body. Like the Drosophila male X chromosome, the inactive X has to be globally controlled (inactivated) and is characterized by a special histone modification (trimethylation of lysine 27 of histone H3). Another common feature between mammals and Drosophila is that noncoding RNAs play an essential role. A possible model that can account for these intriguing similarities is that the nuclear periphery is used to generate transcriptional domains that can be transcriptionally active or inactive in order to achieve coregulation of gene expression for a subset of genes. In the case of the Drosophila male X chromosome, hundreds of genes with different basal transcriptional properties need to be coactivated by a factor of two. This kind of a subtle transcriptional coregulation of a whole chromosome may be achieved by partial compartmentalization of the X chromosome mediated by the nucleoporin-MSL interaction, allowing the formation of hyperacetylated chromatin domains with unique transcriptional and/or posttranscriptionalproperties (Mendjan, 2006).
It is important to emphasize that Mtor and Nup153 may be required for general chromatin organization (not just individual chromosomes) through their interaction with chromatin-associated proteins. The DCC might mediate X-chromosomal tethering to the nuclear pore as a mechanism to coregulate a large set of genes by creating chromosomal loops or domains. This could happen by direct or indirect interactions of MSLs with Mtor/Nup153 located at or near high-affinity sites along the X chromosome, which are the binding sites of the DCC. Interactions with nuclear pore components may also be used to 'economize resources' and/or for efficient coupling of transcription to processing of the newly transcribed coregulated messages (Mendjan, 2006).
In summary, the purification of the MSL complex has revealed an unexpected link between dosage compensation and the NPC. In the context of data from other systems, this allows formulation of new hypotheses about the mechanism of dosage compensation that will be exciting to test in the future (Mendjan, 2006).
Home page: The Interactive Fly © 1997 Thomas B. Brody, Ph.D.
The Interactive Fly resides on the
Society for Developmental Biology's Web server.