The Interactive Fly

Zygotically transcribed genes

Chromatin organization and the Polycomb and Trithorax groups

What are Polycomb and trithorax group proteins?
Pathways that mediate gene activation and silencing through chromatin
Genome-wide prediction of Polycomb/Trithorax response elements
Genome-wide analysis of Polycomb targets in Drosophila melanogaster
Histone H3 variants specify modes of chromatin assembly
Snipper, an Eri1 homologue, affects histone mRNA abundance and is crucial for normal Drosophila melanogaster development
Acetylation and methylation: Covalent modifications of chromatin and DNA that establish and maintain the heterochromatin-induced silenced state
General transcriptional silencing by a Polycomb response element in Drosophila
Systematic protein location mapping reveals five principal chromatin types in Drosophila cells
Heterochromatin remodeling by CDK12 contributes to learning in Drosophila
Helitrons shaping the genomic architecture of Drosophila: enrichment of DINE-TR1 in alpha- and beta-heterochromatin, satellite DNA emergence, and piRNA expression
A negative loop within the nuclear pore complex controls global chromatin organization
Active chromatin and transcription play a key role in chromosome partitioning into topologically associating domains
High-resolution in situ hybridization analysis on the chromosomal interval 61C7-61C8 of Drosophila melanogaster reveals interbands as open chromatin domains
Stable Chromosome Condensation Revealed by Chromosome Conformation Capture
Super-resolution imaging reveals distinct chromatin folding for different epigenetic states
Correspondence of Drosophila Polycomb Group proteins with broad H3K27me3 silent domains
Propagation of Polycomb-repressed chromatin requires sequence-specific recruitment to DNA
Drosophila O-GlcNAcase deletion globally perturbs chromatin O-GlcNAcylation
Genome-wide activities of Polycomb complexes control pervasive transcription
Chromatin proteins and RNA are associated with DNA during all phases of mitosis
Regulatory functions and chromatin loading dynamics of linker histone H1 during endoreplication in Drosophila
Stable Polycomb-dependent transgenerational inheritance of chromatin states in Drosophila
Dual functionality of cis-regulatory elements as developmental enhancers and Polycomb response elements
A comparison of nucleosome organization in Drosophila cell lines
Convergence of topological domain boundaries, insulators, and polytene interbands revealed by high-resolution mapping of chromatin contacts in the early Drosophila melanogaster embryo

Histone, Polycomb and trithorax group, and dosage compensation genes

  • Trithorax group
  • Polycomb group
  • PRC1 complex of Polycomb group proteins
  • Esc-E(z) complex of Polycomb group proteins (PRC2 complex)
  • Brahma complex of trithorax group proteins
  • SAGA related complex
  • Paf1 complex (coordinates histone modifications and changes in nucleosome structure with transcription activation and Pol II elongation)
  • TIP60 complex (Histone acetyltransferase complex)
  • NURD chromatin remodelling complex
  • COMPASS (Complex Proteins Associated with Set1) family proteins (H3K4 methyltransferases)
  • Chromatin Accessibility Complex (CHRAC)
  • Histones
  • Boundary elements/Insulator proteins
  • Other chromatin associated proteins, RNAs and histone chaperones
  • Enhancers and suppressors of position effect variegation
  • Myb-MuvB (MMB) complex, also known as the Rb, E2F, and Myb-associated protein (dREAM) complex
  • LSD1-CoREST demethylase complex
  • LINT complex
  • Dosage compensation genes acting to modify chromatin of X chromosome

  • What are Polycomb and trithorax group proteins?

    Chromatin consists of proteins that serve as the structural organizer of DNA, binding DNA into higher order structures and ultimately forming the chromosome itself. Chromatin restricts the access of DNA to transcription factors. Both Polycomb and trithorax group proteins act to remodel chromatin altering the accessibility of DNA to factors required for gene transcription. Polycomb group genes are involved in chromatin based gene silencing, while trithorax group genes counteract the silencing effects of chromatin to maintain gene activity.

    Pathways that mediate gene activation and silencing through chromatin

    There is an evolving understanding of the enzymes that function to remodel chromatin. At least two systems related to yeast SWI/SNF proteins function to open up chromatin, permitting access to transcription factors. Information on SWI/SNF homologs can be found at the ISWI and Brahma sites. Information about the potential role of the origin recognition complex in chromatin remodeling can be found at the Origin recognition complex 2 (ORC2) site.

    Nucleosome assembly protein-1 (NAP1) and Chromatin assembly factor 1 subunit (CAF1) play the role of histone chaperones in establishing an ordered nucleosome structure on newly synthesized DNA. Drosophila CAF-1 appears to comprise four subunits of 180, 105, 75 and 55 kDa. The smallest subunit of Drosophila CAF-1, p55, is homologous to a mammalian RbAp-48 protein which is associated with the HD1 histone deacetylase. A model for the role of core histone chaperones in chromatin assembly is as follows: CAF-1 binds to newly synthesized H3 and (acetylated) H4 and mediates the deposition of the H3-H4 tetramer onto newly replicated DNA; histones H2A-H2B are subsequently incorporated with the assistance of other histone chaperones, such as nucleoplasmin or NAP1, to give the complete histone octamer. The initial histone acetylation may be required to neutralize the high positive charge of the histone, allowing it to be assembled into chromatin. Deacetylation of histones, carried out by histone deacetylase, could be a prerequisite to maturation of chromatin. In any case, it is now clear that chromatin assembly and maturation involve histone acetylation and that this process begins in the cytoplasm; histones are subsequently transferred to the nucleus and then deacetylated (Tyler, 1996).

    Regulatory elements called enhancers, or locus control regions, are capable of exerting their influence over long distances, and in an orientation-independent manner, to orchestrate the complex gene expression patterns required for embryonic development. How are the effects of enhancers confined to the genes they regulate? In recent years the concept of chromatin based domain boundaries or insulator elements has developed, based on the genetic properties of several eukaryotic genes. One example of an insulator element is the Drosophila gypsy insulator. For discussion of the gypsy insulator, and the role of two proteins, Suppressor of Hairy wing and MOD(MDG4) in its regulation, see the su(Hw) site.

    One other aspect of gene silencing has been established for mammalian and yeast systems. Whereas histone acetylation is known to be involved in gene activation in Drosophila dosage compensation (See Male-specific lethal 2), a role for deacetylation in gene silencing has not yet been established in Drosophila. Two examples of the role of histone deacetylation in gene silencing in mammals will be described briefly here. Histone deacetylation plays a role in mammalian Myc mediated silencing (see Drosophila Myc Evolutionary Homologs section for more information) and in mammalian nuclear receptor mediated silencing (see Ecdysone receptor Evolutionary Homologs section for more information).

    Myc family proteins function through heterodimerization with the stable, constitutively expressed bHLH-Zip protein, Max. Human Mad protein homodimerizes poorly but binds Max in vitro, forming a sequence-specific DNA binding complex with properties very similar to those of Myc-Max. Both Myc-Max and Mad-Max heterocomplexes are favored over Max homodimers. Mad does not associate with Myc or with representative bHLH, bZip, or bHLH-Zip proteins. On the other hand, Myc-Max and Mad-Max complexes carry out opposing functions in transcription and Max plays a central role in this network of transcription factors (Ayer, 1993).

    Members of the Mad family of bHLH-Zip proteins heterodimerize with Max to repress transcription in a sequence-specific manner. Transcriptional repression by Mad:Max heterodimers is mediated by ternary complex formation with either of the corepressors mSin3A or mSin3B. mSin3A is an in vivo component of large, heterogeneous multiprotein complexes and is tightly and specifically associated with at least seven polypeptides. Two of the mSin3A-associated proteins, p50 and p55, are highly related to the histone deacetylase HDAC1. The mSin3A immunocomplexes possess histone deacetylase activity that is sensitive to the specific deacetylase inhibitor trapoxin. mSin3A-targeted repression is reduced by trapoxin treatment, suggesting that histone deacetylation mediates transcriptional repression through Mad-Max-mSin3A multimeric complexes (Hassig, 1997).

    The same proteins that mediate transcriptional silencing by Mad-Max also mediate transcriptional silencing by nuclear hormone receptors that are attached to DNA but free of ligand. Whereas liganded nuclear receptors serve as transcriptional activators, unliganded nuclear receptors serve as repressors. How does the unliganded nuclear receptor transmit a repressive signal to the transcriptional apparatus and what is the nature of this signal? In fact, the target of the unliganded nuclear receptor is not RNA polymerase but chromatin, and repression is mediated by corepressors, proteins that associate with unliganded nuclear receptors and assemble a macromolecular complex that modifies chromatin so as to silence gene activity. The macromolecular complex acts to deacetylate histones. The transcriptional corepressors SMRT and N-CoR function as silencing mediators for retinoid and thyroid hormone receptors. SMRT and N-CoR directly interact with unliganded nuclear receptors, and these corepressors in turn directly interact with mSin3A, a corepressor for the Mad-Max heterodimer and a homolog of the yeast global-transcriptional repressor Sin3p. The recently characterized histone deacetylase 1 (HDAC1) interacts with Sin3A and SMRT to form a multisubunit, ternary repressor complex. Histone deacetylase in turn targets chromatin, converting it into a form that is inaccessible to the transcriptional apparatus. Consistent with this model, it is found that HDAC inhibitors synergize with retinoic acid to stimulate hormone-responsive genes and the differentiation of myeloid leukemia (HL-60) cells. Addition of a deacetylase inhibitor such as Trichostatin A relieves transcriptional repression, resulting in a promoter that is sensitive to the addition of activating hormone. This work establishes a convergence of repression pathways for bHLH-Zip proteins and nuclear receptors and suggests that this type of regulation may be more widely conserved than previously suspected (Nagy, 1997).

    Genome-wide prediction of Polycomb/Trithorax response elements

    Polycomb/Trithorax response elements (PRE/TREs) maintain transcriptional decisions to ensure correct cell identity during development and differentiation. There are thought to be over 100 PRE/TREs in the Drosophila genome, but only very few have been identified due to the lack of a defining consensus sequence. The definition of sequence criteria that distinguish PRE/TREs from non-PRE/TREs is reported in this study. Using this approach for genome-wide PRE/TRE prediction, 167 candidate PRE/TREs are reported, that map to genes involved in development and cell proliferation. Candidate PRE/TREs are shown to be bound and regulated by Polycomb proteins in vivo, thus demonstrating the validity of PRE/TRE prediction. Using the larger data set thus generated, three sequence motifs that are conserved in PRE/TRE sequences have been identified (Ringrose, 2003).

    The detection of PRE/TREs by prediction generates a large data set that can be used to search for further common sequence features. To this end, the 30 highest scoring PRE/TRE hits were scanned for motifs that occur significantly more often in PRE/TREs than in randomly generated sequence. Five significant motifs were found. Not surprisingly, but reassuringly, two known motifs, the GAF and PHO binding sites were found. The Zeste binding motif was not found by this analysis, although it occurs as frequently as GAGA factor in the 30 sequences analyzed. This is probably due to the shortness and degeneracy of the Zeste motif, and suggests that other such short motifs will also be missed by this approach (Ringrose, 2003).

    Nevertheless, three additional motifs were found. The first, called GTGT, is found several times in 14 of the sequences. The second motif, poly T, is found several times in almost all 30 PRE/TRE sequences analyzed. Some variants of this site match the binding consensus for the Hunchback protein, which has been shown to be an early regulator at some PRE/TREs. The third motif, TGC triplets, occurs several times in 13 of the PRE/TRE sequences. No binding factor for this sequence has yet been identified (Ringrose, 2003).

    To further examine these three motifs, motif occurrence was evaluated in all 167 predicted PRE/TREs and in the promoter peaks described above. In contrast to the known GAF, Z, and PHO motifs, the three motifs each occur in only a subset of predicted and known PRE/TREs, and do not occur significantly together. These motifs may thus each define a subclass of PRE/TREs. Consistent with this idea, some of the lowest scoring known PRE/TRE sequences indeed contain one or more of the three motifs (Ringrose, 2003).

    Although no correlation between particular sites and high scores was found, a negative correlation was found between numbers of GAF/Z and PHO sites (a correlation coefficient of -0.78, indicating that when many GAF/Z sites are present, there are few PHO sites, and vice versa). This suggests that each PRE/TRE may have a preferred ground state, in which it is either predisposed to silencing (many PHO sites) or to activation (many GAF/Z sites) (Ringrose, 2003).
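
    The negative correlation described above can be illustrated with a short calculation. The following is a minimal sketch of a Pearson correlation coefficient, assuming that this is the kind of coefficient reported (the text does not say which one was used). The function name and the site counts are invented for illustration and are not data from the study.

```python
from math import sqrt


def pearson_r(xs, ys):
    """Pearson correlation coefficient of two equal-length numeric sequences."""
    if len(xs) != len(ys) or len(xs) < 2:
        raise ValueError("need two sequences of equal length >= 2")
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    covariance = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        raise ValueError("correlation is undefined for a constant sequence")
    return covariance / sqrt(var_x * var_y)


# Hypothetical site counts per element (invented, NOT values from the study):
# elements rich in GAF/Z sites are given few PHO sites, and vice versa.
gaf_z_counts = [12, 9, 7, 4, 2, 1]
pho_counts = [1, 2, 4, 5, 8, 9]

r = pearson_r(gaf_z_counts, pho_counts)
print(f"r = {r:.2f}")  # negative: many GAF/Z sites go with few PHO sites
```

    A coefficient near -1 means the two counts trade off almost linearly; a value such as -0.78 indicates a strong but imperfect trade-off.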

    In summary, this analysis identifies three motifs that occur significantly in association with known PRE/TRE motifs. Further functional characterization of these motifs and the proteins that bind them may contribute to a more complete definition of the sequence requirement for PRE/TRE function, and of subclasses of PRE/TREs (Ringrose, 2003).

    This study offers four main contributions to the understanding of PRE/TRE function. First, a larger set of sequences has been defined that will facilitate a more complete definition of PRE/TRE sequence requirements. Three motifs have been identified that may contribute to this goal. The definition of the minimal requirement for PRE/TRE function will not be a trivial task. Analysis of motif composition and order in the 167 predicted PRE/TREs reveals that there is a great diversity of patterns, with no preferred linear order. It is possible that each different pattern of motifs reflects a subtly different function. However, the concept of a linear order of motifs may well be irrelevant, because these elements operate in the three-dimensional context of chromatin. The fact that such a diversity of PRE/TRE designs exists indicates that the vast majority of them would defy detection by conventional pattern-finding algorithms, and underlines the advantages of the approach described in this study (Ringrose, 2003).

    Although no linear constraints on motif order were found, the fact that only motif pairs, and not single motifs, are able to identify PRE/TREs strongly suggests that this close spacing of sites has functional significance. Multiple sites may work in concert, to promote cooperative binding of similar proteins (e.g., repeated PHO sites) or to provoke competition between dissimilar proteins (e.g., closely spaced GAGA factor and PHO sites). In addition, in chromatin, only a subset of sites will be exposed and optimally available for binding at any one time, while others will be occluded by nucleosomes. The trxG includes nucleosome remodeling machines, raising the intriguing possibility that remodeling of PRE/TREs in chromatin may contribute to epigenetic switching by exposing different sets of protein binding sites (Ringrose, 2003).
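
    The idea that closely spaced motif pairs, rather than single motifs, carry the signal can be sketched in code. The example below is only an illustration under stated assumptions: it counts pairs of exact motif matches whose start positions lie within a chosen spacing. It is not the scoring scheme of the study, and the motif names, motif strings, spacing and toy sequence are all hypothetical placeholders.

```python
from itertools import combinations


def find_sites(sequence, motifs):
    """Return sorted (position, motif_name) tuples for every exact motif match."""
    sites = []
    for name, pattern in motifs.items():
        start = sequence.find(pattern)
        while start != -1:
            sites.append((start, name))
            start = sequence.find(pattern, start + 1)
    return sorted(sites)


def count_motif_pairs(sequence, motifs, max_spacing):
    """Count pairs of sites whose start positions are at most max_spacing apart."""
    counts = {}
    for (pos_a, name_a), (pos_b, name_b) in combinations(find_sites(sequence, motifs), 2):
        if abs(pos_b - pos_a) <= max_spacing:
            key = tuple(sorted((name_a, name_b)))
            counts[key] = counts.get(key, 0) + 1
    return counts


# Placeholder motifs: arbitrary strings, NOT real binding-site consensus sequences.
PLACEHOLDER_MOTIFS = {"motif_1": "ACGTA", "motif_2": "TTGCA"}

# Toy sequence: motif_1 at position 0, motif_2 at positions 7 and 32.
toy_sequence = "ACGTA" + "CC" + "TTGCA" + "C" * 20 + "TTGCA"

close_pairs = count_motif_pairs(toy_sequence, PLACEHOLDER_MOTIFS, max_spacing=10)
print(close_pairs)  # {('motif_1', 'motif_2'): 1}
```

    In this toy case the distant second copy of motif_2 contributes nothing at a spacing of 10, which mirrors the point that isolated single sites are not informative on their own.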

    Second, a PRE/TRE peak is observed at the promoter of all the genes examined. This strongly suggests that promoter binding is a general principle of PRE/TRE function. It has been reported that PcG proteins can interact with general transcription factors. It has hitherto been unclear whether the observed PcG/trxG binding at promoters of the genes they regulate is mediated indirectly via such an interaction, or whether the PcG and trxG bind directly to PRE/TREs at the promoters. The high scores observed at promoters favor the latter interpretation (Ringrose, 2003).

    Third, it has been shown that in most cases, PRE/TREs do not occur in isolation, but are accompanied by one or more other peaks nearby. These grouped PRE/TREs may create multiple attachment sites for PcG and trxG proteins, which come together to build a fully operational complex at the promoter. Alternatively, grouped PRE/TREs may be individually regulated by tissue-specific enhancers as in the BX-C. Thus, each of the many PRE/TREs of the homothorax gene may interact with the promoter PRE/TRE in different tissues. This idea is consistent with the fact that Homothorax has specific roles in diverse developmental processes (Ringrose, 2003).

    Finally, the current list of about ten PcG/trxG target genes has been expanded to over 150 genes, identifying candidates for epigenetic regulation. The genes thus identified encompass every stage of development, suggesting that the PcG/trxG are global regulators of cellular memory. Experiments to further investigate and compare this regulation for individual genes are currently underway (Ringrose, 2003).

    Genome-wide analysis of Polycomb targets in Drosophila melanogaster

    Polycomb group (PcG) complexes are multiprotein assemblages that bind to chromatin and establish chromatin states leading to epigenetic silencing. PcG proteins regulate homeotic genes in flies and vertebrates, but little is known about other PcG targets and the role of the PcG in development, differentiation and disease. This study determined the distribution of the PcG proteins PC, E(Z) and PSC and of trimethylation of histone H3 Lys27 (me3K27) in the Drosophila genome using chromatin immunoprecipitation (ChIP) coupled with analysis of immunoprecipitated DNA with a high-density genomic tiling microarray. At more than 200 PcG target genes, binding sites for the three PcG proteins colocalize to presumptive Polycomb response elements (PREs). In contrast, H3 me3K27 forms broad domains including the entire transcription unit and regulatory regions. PcG targets are highly enriched in genes encoding transcription factors, but they also include genes coding for receptors, signaling proteins, morphogens and regulators representing all major developmental pathways (Schwartz, 2006).

    The components of PcG complexes are products of PcG genes, first discovered as crucial regulators of homeotic genes in Drosophila. Immunostaining of Drosophila polytene chromosomes, however, showed PcG proteins at about 100 cytological loci, implying a much larger number of target genes. Functional analysis has identified PREs as DNA sequences able to recruit PcG proteins and establish PcG silencing of neighboring genes. Two types of PcG complexes bind to PREs. PRC1-type complexes include a core quartet of proteins: PC, PSC, PH and dRing. PRC2-type complexes include E(Z), which methylates histone H3 Lys27. Mono- and dimethylated Lys27 is widely distributed in the genome, but PcG sites characteristically contain trimethylated Lys27 (me3K27). The activity of the E(Z) complex is essential for stable silencing, and it has been proposed that H3 me3K27 recruits the PRC1 complex through the specific affinity of the PC chromodomain for me3K27. But the relationships between PRC1 and PRC2 complexes, between their binding sites and histone methylation, and between binding, methylation and gene expression are not well understood and remain the subject of debate. The genomic distribution of three PcG proteins [PC, PSC and E(Z)] and of histone H3 me3K27 was examined using chromatin immunoprecipitation (ChIP). Since PcG target genes may be repressed in some tissues and active in others, a cultured cell line was used to minimize heterogeneity (Schwartz, 2006).

    Viewed at the scale of a chromosome arm, the distributions of PC, PSC, E(Z) and me3K27 coincide at a number of distinct binding peaks (which are referred to as 'PcG sites') that correspond to 70% of the bands reported in salivary gland polytene chromosomes stained with the corresponding antibodies. To minimize false positives, the analysis focussed on the PcG sites that showed simultaneous binding of two or more proteins, each above twofold enrichment. Of the 149 PcG sites detected (see the supplemental figure), 95 showed strong binding of all four proteins ('strong' PcG sites), whereas in 54 sites the binding was lower and below threshold for one of the proteins ('weak' PcG sites). At higher resolution, most PcG sites involve two or more genes, often sharing structural or functional similarities. Thus, PcG sites involve the following: engrailed (en) and invected (inv); the PcG genes ph-p and ph-d; the Dorsocross T-box gene cluster; the muscle NK homeobox gene cluster; the wingless cluster; and the two homeotic complexes ANT-C and BX-C (Schwartz, 2006).
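
    The site-calling rule described above (two or more signals, each above twofold enrichment, with 'strong' sites passing for all four) can be written as a small classifier. This is a simplified sketch under stated assumptions, not the authors' pipeline: it treats each site as a single fold-enrichment value per signal, and the function name and example values are hypothetical.

```python
SIGNALS = ("PC", "PSC", "E(Z)", "me3K27")


def classify_site(enrichment, threshold=2.0, min_signals=2):
    """Classify one candidate site from its fold-enrichment values.

    enrichment maps each signal name to its fold enrichment.
    Returns "strong" if all signals exceed the threshold, "weak" if at
    least min_signals (but not all) do, and None if the site is not called.
    """
    passing = [name for name in SIGNALS if enrichment.get(name, 0.0) > threshold]
    if len(passing) == len(SIGNALS):
        return "strong"
    if len(passing) >= min_signals:
        return "weak"
    return None


# Hypothetical enrichment values, invented for illustration only.
site_a = {"PC": 3.1, "PSC": 2.5, "E(Z)": 2.2, "me3K27": 4.0}
site_b = {"PC": 2.8, "PSC": 2.1, "E(Z)": 1.6, "me3K27": 3.0}
site_c = {"PC": 2.4, "PSC": 1.2, "E(Z)": 1.1, "me3K27": 1.5}

for label, site in (("a", site_a), ("b", site_b), ("c", site_c)):
    print(label, classify_site(site))
```

    Requiring at least two independent signals is what guards against false positives here: a single antibody giving a spurious peak, as in site_c, is not enough to call a site.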

    The Bithorax complex (BX-C) is a cluster of three homeotic genes (Ubx, abd-A and Abd-B) responsible for segmental identity in the abdomen and posterior thorax. The most prominent features are two sharp binding peaks for all three PcG proteins at the sites of the bx and bxd PREs that control Ubx. No peak was detected over the Ubx proximal promoter, although the entire gene shows a low but significant level of PC. A series of lower peaks emerged in the abd-A region and part of the Abd-B gene. Some of these correspond to the known PREs iab-2. In contrast, the distribution of H3 me3K27 oscillated rapidly above a high plateau that covers Ubx and abd-A but not Abd-B. RT-PCR was used to determine the mRNA levels corresponding to these three genes. Transcription of Ubx and abd-A in these cells was very low but distinctly above background. Abd-B was highly transcribed, at levels 300 times higher than Ubx. This pattern of activity was reflected by the distribution of both PcG proteins and me3K27. It is noted that in the Abd-B regulatory region, the previously characterized Fab-7 and Fab-8 PREs neither bound PcG proteins nor were methylated in these cells. The Abd-B gene has five distinct promoters. A sharp resurgence of both methylation and PcG protein binding in the region of the most upstream Abd-B promoter suggests that, in contrast to the other four promoters, this one might be repressed in the cultured cells. RT-PCR analysis using primers specific for mRNAs initiating from each promoter confirmed that the most upstream promoter is silent and that the other four are active. These results support the view that binding of PcG proteins to PREs is associated with transcriptional quiescence, whereas robust transcriptional activity is accompanied by lack of binding to the PREs and lack of Lys27 methylation over the transcription unit (Schwartz, 2006).

    Strong genomic sites bind all three PcG proteins. The PSC and E(Z) peaks generally rise sharply and are contained within less than 2 kb, whereas PC frequently forms a broader peak that may include shoulders or subsidiary peaks absent for E(Z) and PSC and subsides to background more gradually. These peak binding regions are thought of as corresponding to PREs, which they in fact do in the cases where these are known. Additional binding peaks may be found within or downstream of the transcription unit. In contrast, distribution of H3 me3K27 at each site is very broad, forming a domain of tens or even hundreds of kilobases encompassing the transcription unit and regulatory regions of one or more genes but, rather than a level plateau, it consists of a series of deep oscillations (Schwartz, 2006).

    The strong binding peaks or putative PREs are often associated with low values or troughs in the methylation profile and at secondary peaks the PC distribution frequently echoes methylation peaks. Overall, their relationship does not support the idea that methylation of Lys27 suffices to recruit binding of PC. It is proposed instead that PC bound to the strong binding peaks, the presumptive PREs, is recruited by proteins that bind specifically to those sequences. The weaker PC binding peaks and tails that mirror the methylation profile near PREs may represent a second mode of PC binding mediated by the interaction of the chromodomain with H3 me3K27 (Schwartz, 2006).

    It is supposed that methylation domains initiated by a PRE might spread bidirectionally until they encounter 'active' chromatin, characterized by histone acetylation or methylation of H3 Lys4, marks typical of transcriptionally active genes. Alternatively, specific features might shape the methylation domain either positively, by attracting the methyltransferase complex, or negatively, by blocking productive interactions with the PRE. As in the case of the Abd-B gene or of CG7922 and CG7956 genes, sudden drops in levels of me3K27 are generally associated with transcriptional activity. Are insulators involved in protecting CG7922 and CG7956 from silencing, or is the activity of these two genes simply epigenetically maintained from the time the cell line was originally established? Further work is required to answer this question (Schwartz, 2006).

    In many cases, the presumptive PRE lies between divergently transcribed genes such as dco and Sox100B. Which of the two is the PRE target? As PREs can act at distances of 20-30 kb, the proximity of PcG peaks to a promoter is not a reliable guide. It is proposed that the methylation domain is the clue to the target of PcG regulation. A PcG peak is not considered to regulate a promoter if the gene is not included in the methylation domain. When multiple genes are included in the methylation domain, it is likely that they are all affected by PcG regulation. However, this study distinguishes between genes that contain methylation as well as one or more PcG proteins and genes that contain only methylation (Schwartz, 2006).
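
    The target-assignment rule proposed above can be sketched as an interval test. The example assumes, as a simplification, that a gene is "included" in a methylation domain when its coordinates overlap the domain, and that a gene "contains" PcG proteins when a binding peak falls within its coordinates. Gene names and coordinates are hypothetical.

```python
def overlaps(interval_a, interval_b):
    """True if two inclusive (start, end) intervals share at least one position."""
    return interval_a[0] <= interval_b[1] and interval_b[0] <= interval_a[1]


def classify_genes(genes, methylation_domain, pcg_peaks):
    """Label each gene relative to one methylation domain and a list of peak positions.

    genes maps a gene name to its inclusive (start, end) coordinates.
    """
    labels = {}
    for name, (start, end) in genes.items():
        if not overlaps((start, end), methylation_domain):
            labels[name] = "outside domain"
        elif any(start <= peak <= end for peak in pcg_peaks):
            labels[name] = "PcG + methylation"
        else:
            labels[name] = "methylation only"
    return labels


# Hypothetical coordinates, invented for illustration only.
genes = {"geneA": (100, 500), "geneB": (600, 900), "geneC": (2000, 2500)}
labels = classify_genes(genes, methylation_domain=(50, 1000), pcg_peaks=[300])
print(labels)
```

    Under this rule geneC would not be counted as a target even if a PcG peak sat close to it, because proximity alone is not taken as evidence of regulation.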

    The 95 'strong' binding sites in the genome encompass a total of 392 genes. Of these 392 genes, 186 contain both PcG binding and methylation, and the remainder are found within broad methylation domains associated with PcG proteins binding but do not bind PcG proteins over their own promoter or transcription unit. They may represent genes not directly targeted but affected by the spread of methylation. An analysis of their ontology indicates that these two classes are in fact very different. Transcription regulators constitute 64.5% of the first set, compared to 4.3% for the full annotation set. Instead they constitute only 4.0% of those genes that contain only me3K27. These comparisons strongly suggest that (1) genes that regulate transcription are preferred PcG targets, and (2) genes that only include the tails of a methylation domain are probably not primary targets of PcG regulation. A similar preference is also seen among the 'weak' binding sites. These include a total of 74 genes containing both PcG proteins and methylation, 28.4% of which encode transcription regulators. Flanking genes containing only methylation include only 5.7% transcription regulators. Although transcription regulators are preferred PcG targets, secreted proteins, growth factors or their receptors, and signaling proteins are also targeted. PcG target genes include components of all the major differentiation and morphogenetic pathways in Drosophila (Schwartz, 2006).
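
    The gene counts and percentages quoted above can be checked with simple arithmetic. The sketch below uses only numbers stated in the text; expressing the comparison as a fold enrichment over the 4.3% background is an illustrative choice rather than a statistic reported by the study, and applying the same background to the 'weak' sites is an assumption.

```python
def fold_enrichment(percent_in_set, percent_in_background):
    """Ratio of a category's frequency in a gene set to its background frequency."""
    return percent_in_set / percent_in_background


strong_site_genes = 392
bound_and_methylated = 186
methylation_only = strong_site_genes - bound_and_methylated

background_pct = 4.3  # transcription regulators in the full annotation set

strong_targets = fold_enrichment(64.5, background_pct)
strong_flanking = fold_enrichment(4.0, background_pct)
weak_targets = fold_enrichment(28.4, background_pct)

print(methylation_only)           # 206 genes carry methylation only
print(round(strong_targets, 1))   # 15.0
print(round(strong_flanking, 1))  # 0.9
print(round(weak_targets, 1))     # 6.6
```

    Genes that carry only methylation sit at roughly background frequency for transcription regulators, which is the basis for treating them as bystanders rather than primary targets.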

    The major features of PcG binding shown by this work are that, although the proteins themselves are highly localized at presumptive PREs, the domain of histone methylation they produce is much broader. If the E(Z) methyltransferase is localized at the PRE, how is the extensive methylation domain produced? A looping mechanism is proposed in which interaction of PRE-bound complexes with flanking chromatin is mediated by the PC chromodomain. The observed broader distribution of PC might result from crosslinking of the chromodomain to methylated H3, reflecting this mechanism (Schwartz, 2006).

    Are PREs defined by characteristic sequence motifs? Although the analysis of the sequences underlying the binding peaks will be presented elsewhere, it is noted that Ringrose (2003) devised an algorithm based on GAGA factor, PHO and Zeste binding motifs to identify sequences likely to represent PREs. This algorithm correctly predicts a number of the strong PcG binding sites (27%) and a few of the weaker sites (7%), overall 20%; however, it does not predict the majority of the PcG sites. The reverse is also true: only 22% of the PREs predicted by Ringrose bind PcG proteins in these experiments. Together, these data suggest that additional criteria are necessary to predict most PREs reliably (Schwartz, 2006).
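
    The overall figure of 20% follows from pooling the two classes of sites. As a consistency check, the sketch below weights the stated percentages by the site counts given earlier in this section (95 strong and 54 weak sites); the function name is hypothetical.

```python
def pooled_percentage(groups):
    """Pool per-group percentages, weighting each by its group size.

    groups is a list of (group_size, percent_within_group) tuples.
    """
    total = sum(size for size, _ in groups)
    hits = sum(size * percent / 100 for size, percent in groups)
    return 100 * hits / total


# 27% of the 95 strong sites and 7% of the 54 weak sites were predicted.
overall = pooled_percentage([(95, 27), (54, 7)])
print(f"{overall:.1f}%")  # 19.8%, i.e. about 20% of all 149 sites
```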

    As expected, PcG proteins and me3K27 are associated with transcriptional quiescence, but the data suggest that this is not an absolute condition. Low but significant transcription levels are detected even for the repressed Ubx and abd-A genes. Two target sites, polyhomeotic and the Psc-Su(z)2 site, contain PcG genes, which must be active to ensure the functioning of the PcG mechanism. The polyhomeotic locus is one of two sites in the entire genome that bind PC but lack appreciable levels of E(Z) and of Lys27 methylation. Instead, the Psc-Su(z)2 region is well methylated and binds both PC and E(Z) at multiple peaks. It is concluded that PcG mechanisms do not invariably lead to transcriptional silencing and are compatible with moderate levels of transcription (Schwartz, 2006).

    Another point of interest is the number and kind of genes that are PcG targets. Considering the developmental difference between salivary gland cells and the embryo-derived tissue culture cells, the substantial number of shared PcG sites suggests that a majority of target sites are occupied in a large percent of cells. Target genes are in fact predominantly regulatory genes that control major differentiation and morphogenetic pathways. These pathways and their genes are highly conserved, and recent work shows that they are also regulated by PcG in mammals. It might be expected that in a given cell type most alternative genomic programs would be repressed save the subset required in that cell type. The emerging picture from these studies is that PcG regulation is a key mechanism in genomic programming (Schwartz, 2006).

    Histone H3 variants specify modes of chromatin assembly

    Histone variants have been known for 30 years, but their functions and the mechanism of their deposition are still largely unknown. Drosophila has three versions of histone H3. H3.3 marks active chromatin and may be essential for gene regulation, and Cid is the characteristic structural component of centromeric chromatin. The properties of these histones have been characterized by using a Drosophila cell-line system that allows precise analysis of both DNA replication and histone deposition. The deposition of H3 is restricted to replicating DNA. In striking contrast, H3.3 and Cid deposit throughout the cell cycle. Deposition of H3.3 occurs without any corresponding DNA replication. To confirm that the deposition of Cid is also replication-independent (RI), centromere replication was examined in cultured cells and neuroblasts. It was found that centromeres replicate out of phase with heterochromatin and display replication patterns that may limit H3 deposition. This confirms that both variants undergo RI deposition, but at different locations in the nucleus. How variant histones accomplish RI deposition is unknown, and raises basic questions about the stability of nucleosomes, the machinery that accomplishes nucleosome assembly, and the functional organization of the nucleus. The different in vivo properties of H3, H3.3, and Cid set the stage for identifying the mechanisms by which they are differentially targeted. It is suggested that local effects of 'open' chromatin and broader effects of nuclear organization help to guide the two different H3 variants to their target sites (Ahmad, 2002).

    Nucleosomes are the fundamental units of chromatin, consisting of 146 bp of DNA wrapped around an octamer of the four core histones. Histone deposition occurs primarily as DNA replicates to complete chromatin doubling. During S phase of the cell cycle, new histones are produced in abundance for immediate replication-coupled deposition. In most metazoans, this abundant S-phase synthesis results from the tight regulation of tens to hundreds of intronless histone genes that have special 3' untranslated regions instead of poly(A) tails. However, some histones are produced from orphan genes outside of S phase. In Drosophila, orphan genes encode two H3 variants: one encodes Cid, the centromeric histone, and two encode H3.3, the replacement variant. These variants have equivalents in many other eukaryotes. The H3.3 histone is nearly identical to H3, differing at only four amino acid positions. Cid differs profoundly from H3 in sequence, showing some significant identity only within the histone fold domain. Surprisingly, these three histones have different deposition properties. H3 and H3.3 are deposited as DNA replicates, but both H3.3 and Cid can be deposited at sites that are not undergoing DNA replication. Whereas only a minor fraction of the bulk genome is packaged into Cid- and H3.3-containing nucleosomes, each variant is targeted to different specialized sites, with Cid localizing to centromeres and H3.3 to transcriptionally active genes. Specific localization of centromeric H3-like histones (CenH3s) has been observed in various animals, fungi, and plants. Also, an H3.3-like histone targets the transcriptionally active macronucleus in ciliates. Thus, the targeting of H3 variants is likely a feature of every eukaryotic cell, where centromeres and transcribed regions are the major loci of activity in metaphase and interphase, respectively. Both kinds of loci use a distinct pathway for nucleosome assembly, and this study explores the properties of this process (Ahmad, 2002).

    Studies of histone deposition have generally been done using crude extracts, purified components or pools of cells from which bulk chromatin is extracted. These methods reveal the average properties of chromatin, and have shown that the bulk of chromatin doubles as DNA replicates. Extensive in vitro work has demonstrated that the assembly of nucleosomes is a stepwise process in which deposition of an (H3:H4)2 tetramer is followed by addition of two H2A:H2B dimers. The new histones are brought to the replication fork in a complex with chromatin assembly factor 1 (CAF1). CAF1 appears to be recruited to the replication fork by binding to the ring-shaped proliferating cell nuclear antigen (PCNA) that encircles the DNA template at each replication fork. Histones from the parent DNA are distributively segregated to the two sister chromatids behind the replication fork, and the gaps in their nucleosomal arrays are rapidly filled by stepwise assembly of new nucleosomes. These nucleosomes are then matured by addition of linker histones and covalent modification of histone tails to complete chromatin (Ahmad, 2002).

    Nucleosomes containing H3 variants comprise only a small proportion of bulk chromatin, and thus their properties have been generally undetectable. However, replacement H3 variants can become enriched in the chromatin of nonreplicating cells. This means that other ways of depositing histones must exist; but because such variant enrichment has been detectable only in unusual cell types (such as long-lived neurons or spermatocytes), studies of the phenomenon have been limited. The ability to tag histones and examine their deposition properties in single cells has provided new insight into chromatin assembly processes (Ahmad, 2002).

    A cytological assay system was developed for studying replication and chromatin assembly by using Drosophila Kc cells, a cell line that displays a regular cell division schedule and a consistent tetraploid karyotype. Organization of the Drosophila nucleus is visually simple, because the late-replicating heterochromatin typically coalesces into a compartment in the nucleus, termed the chromocenter. This provides both a temporal and spatial distinction between the early replicating, gene-rich euchromatin, and the late-replicating heterochromatin (Ahmad, 2002).

    DNA replication can be tracked either by pulse-labeling with nucleotide analogs or by using anti-PCNA antibody. Furthermore, by introducing histone-GFP fusion constructs and producing a pulse of the tagged protein, histone deposition can be tracked during the cell cycle. Using this system, it has been possible to quantitatively examine DNA replication and histone deposition in unsynchronized populations of cells (Ahmad, 2002).

    GFP-tagged H3 shows exclusively replication-coupled deposition, displaying co-localization with replication markers and showing no detectable deposition in cells in which replication has been blocked. The N-terminal tail of H3 is required, suggesting that the H3 tails of tetramer particles interact with accessory factors at some early step in nucleosome assembly in vivo (Ahmad, 2002).

    In contrast to the properties of GFP-tagged H3 in cells, tagged H3.3 deposits in a replication-independent manner at actively transcribing loci. Deposition can occur in any stage of the cell cycle, and it is not accompanied by unscheduled DNA synthesis. Incorporation of H4 also occurs at these target sites, as expected for deposition of (H3.3:H4)2 tetramers; but how replication-independent (RI) histone deposition occurs is virtually unknown. Tagged Cid can also deposit throughout the cell cycle, suggesting that its deposition is also replication-independent (Ahmad, 2002).

    However, this conclusion depends on knowing the timing of centromere replication. Centromeres replicate within a defined portion of S phase and Drosophila centromeres replicate as isolated domains within later-replicating heterochromatin (Ahmad, 2002).

    Historically, centromeres have been thought to replicate very late in the cell cycle. This is because they are embedded within pericentric heterochromatin, which replicates late. Analysis has usually relied on visualization at mitosis; but mitotic chromosomes have inherently low resolution because they are highly condensed. Indeed, a recent study showed that Drosophila centromeres cannot be resolved from heterochromatin in 44% of spread mitotic chromosomes. Despite this limitation, it has been concluded that Cid-containing chromatin replicates on the same late schedule as pericentric heterochromatin. However, this could be late replication in pericentric heterochromatin that was mis-scored as replication of centromeres (Ahmad, 2002).

    This uncertainty has been addressed by analyzing mitotic chromosome replication patterns, providing brief 15-min pulses to Kc cells and examining mitotic figures after a chase. This provides a 'snapshot' of replication at single points in the cell cycle. Examples of heterochromatin replication patterns are observed similar to those previously reported, where labeling overlaps Cid spots. However, unambiguous examples of chromosomes that were intensely labeled throughout the euchromatic arms, with foci directly coinciding with centromeres, are also observed. These centromeric foci are surrounded by heterochromatin that did not replicate during the labeling pulse (Ahmad, 2002).

    Experiments using interphase Kc cells revealed that ~90% of centromere replication occurs when euchromatin is replicating. The remaining 10% may be late replication in centromeric regions, but is more likely the result of nearby heterochromatic replication foci that cannot be resolved from sites with Cid. Such early replication of centromeres is not limited to tetraploid Kc cells -- similar replication patterns are observed in diploid larval neuroblasts -- although the much shorter cell cycle time and the more irregular chromocenter limit quantitative analysis. Therefore, this early timing of centromere replication appears to be general for Drosophila cells (Ahmad, 2002).

    A series of progressively more direct experiments have provided insight into the fine structure in the centromere region. A model for the centromeric constriction has suggested that loops of DNA coil through the constriction, with centromeric nucleosomes lying in the outward parts of these coils, and conventional nucleosomes in the interior portions. This would account for the polar structure of the entire centromere if centromeric nucleosomes nucleate kinetochore formation (and thus microtubule capture) and conventional nucleosomes recruit cohesins (and thus centromeric cohesion). The linear arrangement of nucleosomes along centromeric DNA would then be alternating blocks of centromeric and conventional nucleosomes within the centromeric domain. A study using stretched chromatin fibers has demonstrated that Cid and H3 are interspersed in Drosophila, although these are not included in the same nucleosome. Apparently, blocks packaged in one kind of nucleosome alternate with blocks packaged in the other (Ahmad, 2002).

    How could the duplication of such regular but discontinuous arrays of nucleosomes occur? The alternating pattern of nucleosomes on stretched chromatin fibers is reminiscent of replication patterns on fibers from normal chromatin. Replication origins within a chromatin domain often appear to be regularly spaced with an interval of 50-100 kb, and these origins fire synchronously. Perhaps the nucleosome blocks in the centromeric regions correspond to an underlying regular arrangement of replication origins throughout the entire centromeric domain. If Cid-containing blocks include the origins for these domains, and if replication initiates at a time when H3 is not available, ultimately only the RI deposition of Cid will package these blocks. The later replicating stretches would incorporate H3 as it becomes available. In this way, the fine pattern of replication would maintain the discontinuous Cid arrays over an extended region (Ahmad, 2002).
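
    The reasoning of this model can be made concrete with a small toy sketch. It is only an illustration under stated assumptions, not an analysis from the study: the block names, the `has_origin` flag, and the `package` function are hypothetical, and the sketch simply encodes the proposal that blocks containing origins initiate replication when H3 is unavailable (and so are packaged by RI deposition of Cid), whereas later-replicating stretches receive H3.

```python
from dataclasses import dataclass


@dataclass
class Block:
    name: str
    has_origin: bool  # assumption: replication origins lie within Cid-containing blocks


def package(blocks, h3_available_at_initiation=False):
    """Return (block name, histone) pairs for newly replicated DNA.

    Toy rule taken from the model in the text: a block that replicates from
    its own origin while H3 is unavailable can only be packaged by
    replication-independent deposition of Cid; every other block is
    packaged with H3 once it becomes available.
    """
    result = []
    for block in blocks:
        if block.has_origin and not h3_available_at_initiation:
            result.append((block.name, "Cid"))
        else:
            result.append((block.name, "H3"))
    return result


# Hypothetical centromeric fiber with alternating blocks.
fiber = [
    Block("block1", True),
    Block("block2", False),
    Block("block3", True),
    Block("block4", False),
    Block("block5", True),
]

for name, histone in package(fiber):
    print(name, histone)
```

    In this sketch the alternating Cid/H3 pattern is reproduced after replication only while H3 is kept away from the early-firing blocks. Calling `package(fiber, h3_available_at_initiation=True)` assigns H3 to every block, which corresponds to the loss of the discontinuous Cid arrays.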

    The model for maintaining the higher-order chromatin structure of the entire centromere has precise requirements for replication patterns in this region: a discontinuously spaced arrangement of origins must correspond to the blocks of Cid-containing chromatin (pattern 1). At least two other patterns of replication in this region can be imagined: all Cid- and H3-containing blocks might replicate simultaneously (pattern 2), or a single origin might replicate the entire domain (pattern 3) (Ahmad, 2002).

    The possible existence of discontinuous replication tracks corresponding to blocks of centromeric chromatin was investigated by pulse-labeling cells for only 15 min. To prepare stretched chromatin fibers, nuclei spread on a glass slide were disrupted in a high-salt buffer. As the buffer runs off the slide, it pulls chromatin fibers behind it. Stretched centromeres were identified and those fibers were examined in which nucleotide incorporation was unambiguous. In each of these cases it was clear that replication was occurring in discrete patches scattered throughout the centromeric domain. These replication tracks must arise from multiple origins, and thus the possibilities that the entire domain replicates from a single origin, or that the whole domain replicates simultaneously, can be ruled out (Ahmad, 2002).

    These patches correspond significantly with the segments between Cid-containing chromatin. Thus, from published experiments and the experiments described here it appears that replication occurs in two discrete phases: all CenH3-containing chromatin within a domain replicates, and at a different time all H3-containing chromatin replicates. Therefore, replication within this domain is discontinuous and initiates from multiple origins (Ahmad, 2002).

    Given that deposition of any H3 must occur in the form of (H3:H4)2 tetramers, there must be discrimination of H3-containing tetramers from tetramers containing variants. Thus analysis of RI assembly was used to initiate the mapping of discriminating sites within the histone variants. It was found that one type of discrimination is a cluster of three residues within the histone fold domain (HFD) of H3 that limits it to replication-coupled deposition. Furthermore, because both Cid and H3.3 undergo RI deposition but have mutually exclusive targets, there must be additional discrimination between these variants (Ahmad, 2002).

    Replication-coupled nucleosome assembly is aided by accessory factors that are recruited to the replication fork by binding to PCNA. However, the process of RI deposition must be different, because RI deposition of H3.3 does not require portions of the histone that are required for replication-coupled deposition. Furthermore, the lack of PCNA during gap phase deposition raises the question of what is recruiting histones to the sites. The phenomenon of CenH3 targeting has raised expectations that a specific, localized chromatin assembly factor or histone modification will be involved in the targeting of CenH3s. Indeed, a chromatin remodeler of the RSC family, PBAF, localizes to kinetochores during mitosis of mammalian cells. Furthermore, RSC mutations in budding yeast alter chromatin structure specifically around centromeres, and perhaps RSC activity is involved in assembly of centromeric nucleosomes. Mutations in CAF and Hir genes also give centromere defects, and it has been suggested that these factors are involved in loading the yeast CenH3 Cse4p. However, a role for any of these factors does little to explain the specific targeting of CenH3s, because these factors are all widely distributed in the nucleus (Ahmad, 2002).

    The best candidate for a uniquely centromere-localized chromatin assembly factor is the Mis6 protein in fission yeast. This protein is required for centromeric localization of the CenH3 SpCENP-A. However, the Mis6 homologs in budding yeast (Ctf3) and in mammals (CENP-I) localize to centromeres but are not required for targeting CenH3s. Thus, Mis6 proteins appear to be structural components of centromeres, not histone assembly factors (Ahmad, 2002).

    An alternative model is that some feature of centromeric chromatin facilitates the targeting of its specialized histones. An obvious candidate for this feature is that centromeric nucleosomes themselves bind to and thereby recruit new CenH3 tetramers for future deposition. Such an interaction is a possible molecular mechanism for direct templating of centromere duplication. Regardless of whether CenH3 targeting involves specialized co-factors, templating, or both, the question remains as to why it should use an RI pathway (Ahmad, 2002).

    The targeted deposition of H3.3 to active genes is likewise replication-independent, although transcription-coupled assembly may facilitate (H3.3:H4)2 deposition. Perhaps H3.3 targeting is mediated by a component of RNA polymerase complexes (Ahmad, 2002).

    Because RNA polymerases move processively along the DNA during transcription, a contiguous transcribed segment of DNA might incorporate the H3.3 variant. Alternatively, RI deposition of H3.3 may be facilitated by any of a number of ATP-dependent chromatin remodeling complexes to target specific sites near transcription units. Any candidate factor might be expected to preferentially use H3.3 instead of H3, but whether there is any such discriminating factor is unknown, because all in vitro studies of higher eukaryotic chromatin assembly have been performed with H3. It is anticipated that this will soon be addressed. However, the prospects for identifying a unique remodeler that is required for RI deposition are uncertain, because budding yeast mutants that eliminate any known chromatin assembly factors do not eliminate chromatin assembly. Thus the possibility has to be considered that RI deposition at active genes and at centromeres uses generic remodeling activities, and that components or structural aspects common to both centromeres and actively transcribed genes may result in RI histone deposition at both kinds of sites (Ahmad, 2002).

    The deposition of histones throughout the cell cycle by a replication-independent process implies that previously existing nucleosomes are unraveled, and their histones released. It is known that the process of transcription results in a local unfolding of the chromatin fiber and an 'open' chromatin configuration. Although transcription of nucleosomal templates with bacterial polymerases can occur in vitro without displacing histone octamers from DNA, in vivo assays demonstrated that a measurable amount of transcription-dependent histone displacement does occur in eukaryotic nuclei. In fact, even in vitro, RNA polymerase II is virtually unable to transcribe nucleosomal DNA under physiological conditions. Transcription requires that histone-DNA contacts be broken for polymerase to transit the nucleosomal DNA. Although transcription can occur without histone displacement if the histone octamer releases some contacts with DNA and maintains others, at some frequency all contacts might be released. The histone octamer would then simply fall off. Additionally, localized remodeling factors will disrupt nucleosome structure as they act. The in vitro and in vivo observations can be reconciled if histone displacement occurs occasionally as nucleosomes are disrupted. Constraints on nucleosomes in a compacted chromatin fiber (i.e., 'closed' chromatin) would limit histone displacement (Ahmad, 2002).

    Although internucleosome forces within inactive chromatin are uncharacterized, they have been inferred from numerous experiments, including the tendency of nucleosomes within heterochromatin to form extremely regular and fixed arrays. A likely constraint in heterochromatin arises from the multimeric associations that occur between heterochromatin-specific non-histone chromatin proteins. Attention has focused on the heterochromatin protein-1 (HP1). HP1 is recruited to heterochromatic DNA by binding, through its chromodomain, to the H3 tail when it is methylated at lysine-9 (H3-K9me). The chromo shadow domain of HP1 mediates associations between HP1 molecules, and multimers of HP1 bound to methylated histone tails provide one basis for constraining arrays of nucleosomes (Ahmad, 2002).

    Although the state of chromatin in heterochromatin and in actively transcribed regions is well known, less is known about the chromatin fiber packaged by centromeric nucleosomes. However, these regions appear to be open. Centromeric DNA is sensitive to micrococcal nuclease digestion both in budding yeast and in the central core region of fission yeast centromeres where SpCENP-A-containing nucleosomes reside, and plant meiotic centromeres appear decondensed. In addition, early replication is a feature of open chromatin, and centromeric chromatin replicates before surrounding heterochromatin (Ahmad, 2002).

    An open configuration may arise from at least three sources. (1) All CenH3s lack a canonical H3 tail. Because methyl-modification of lysine-9 appears to be the key epitope to maintain heterochromatin, the lack of this site in centromeric nucleosomes means that such regions cannot become heterochromatic. Indeed, the heterochromatin protein HP1 is not associated with chromatin packaged by CenH3s. (2) A recent study of Cid homologs in drosophilids has uncovered DNA minor-groove binding motifs in the Cid tail outside of the nucleosome core. Extension of the Cid tail along linker DNA between nucleosomes may inhibit compaction of the nucleosome strand, thus maintaining these regions in an open configuration. (3) Chromatin remodeling factors that destabilize nucleosomes are found both at active genes and centromeres, and their activity will promote histone replacement. It is suggested that an open chromatin configuration is the common basis for RI deposition at centromeres and at actively transcribed genes (Ahmad, 2002).

    If open chromatin were the sole basis for RI deposition, then we would expect that active genes and centromeres would incorporate both H3.3 and CenH3s. However, their deposition is mutually exclusive. This exclusivity is likely to rely on multiple mechanisms that act on all steps in nucleosome assembly. Factors that discriminate between H3.3 and Cid would be the best candidates for directing these variants to their targets. However, the organization of the nucleus provides a clue as to another way in which exclusive targeting may be accomplished. Centromeric DNA in Drosophila is flanked by repeated sequences that are packaged into heterochromatin, and this forms a compartment at interphase in which centromeres are embedded in heterochromatin. The active rDNA genes are the primary sites of H3.3 deposition and they are also found in a distinct nuclear compartment, the nucleolus, next to the chromocenter (Ahmad, 2002).

    This functional nuclear organization is very simple to see in Drosophila, where all heterochromatin typically associates into one large chromocenter, and the active rDNA arrays also often associate to present one large nucleolus. In fact, this general compartmentalization is almost invariant in eukaryotes, and has led to the idea that heterochromatin somehow protects centromeres and NORs. Although both Cid and H3.3 undergo RI deposition, their exclusive targeting could in part be accomplished by restricting one or both variants within the nucleus. For example, unincorporated (Cid:H4)2 tetramers might be sequestered within the heterochromatic chromocenter. Cid deposition would then appear targeted to the centromere, because this is the only site within the chromocenter with open chromatin (Ahmad, 2002).

    Whether (Cid:H4)2 tetramers are actually sequestered in this way is unknown. Indeed, whether sequestering substrates can have any effect on reactions within the nucleus has become a pressing issue. Many nuclear components remain mobile, but functional experiments argue that certain effects in the nucleus actually only occur when components are sequestered. It is likely that some reactions in the nucleus are relatively independent of localization because they associate efficiently with their partners and their reactions proceed quickly. Conversely, reactions that involve weak interactions or multiple steps may require raising the effective concentration of their substrates by nuclear sequestration (Ahmad, 2002).

    It has been suggested that the heterochromatic compartment is involved in histone traffic within the nucleus. The basis of this hypothesis was the realization that Cid-containing chromatin behaves unusually during S phase. Generally, the deposition of H3 quickly follows DNA replication. However, the replication of Cid-containing centromeric DNA occurs without H3 deposition, implying that the normal coupling between replication components and nucleosome assembly components must be broken. Because this coupling is thought to result from an interaction between chromatin assembly factor 1 histone complexes and PCNA, the simplest explanation for uncoupling the two processes would be to sequester replicative nucleosome assembly factors away from centromeres. It is imagined that unincorporated H3-containing tetramers might be sequestered in euchromatin in the first half of S phase, and would thus never (productively) see the replication forks at centromeres within the heterochromatic compartment. This uncoupling might be necessary to prevent dilution of centromeric nucleosomes by conventional nucleosomes that would assemble after replication-coupled deposition. Genetic experiments in budding yeast and Drosophila suggest that CenH3s and H3 do compete for assembly (Ahmad, 2002).

    One way that a competition between CenH3 and H3 histones can be probed is to change their relative concentration. A tagged Cid protein exclusively deposits at centromeres when it is ectopically expressed at low levels from a heat-shock-inducible promoter. However, it is apparent that expression from this construct remains low. Re-engineering the transcriptional start region of the construct to include a translational initiation consensus site now allows overproduction of Cid in cells (Ahmad, 2002).

    To analyze the behavior of excess quantities of Cid protein, an overexpression construct was introduced into Drosophila Kc cells. Cells receive varying amounts of transfected DNA, and thus express Cid over a wide range of levels. In cells that express low amounts of the ectopic protein, Cid localizes to centromeres, as expected. However, a new localization pattern for Cid is seen at high expression levels: the tagged protein localizes to centromeres and throughout euchromatin. The incorporation pattern of ectopic Cid is especially clear on mitotic chromosomes from these transfections, where the tagged protein is incorporated throughout the euchromatic arms as well as at centromeres. It is concluded from this result that excess Cid can be deposited at sites other than centromeres. Normal cells must have mechanisms to prevent euchromatic deposition, but over-expression is sufficient, by itself, to overcome this restriction (Ahmad, 2002).

    The mis-incorporation pattern of Cid shows an interesting specificity: Cid can deposit at centromeres and euchromatin but not in heterochromatin. Therefore, heterochromatin must either lack the feature that tolerates mis-incorporation, or must actively exclude Cid. It is argued that centromeres and euchromatin share the feature of open chromatin, which is proposed to be the first prerequisite for RI deposition of histone variants. Indeed, the mis-incorporation of Cid into euchromatin is replication-independent, because it occurs both when euchromatin is replicating in early S phase, and in late S phase when euchromatic replication is complete. It is suggested that Cid is contaminating open chromatin in the euchromatic compartment when it is overexpressed (Ahmad, 2002).

    What normally prevents the deposition of Cid into euchromatin? Endogenous Cid is present only at low levels, and mis-incorporation could be avoided if Cid were sequestered away from euchromatin in the nucleus. If unincorporated Cid were sequestered in the heterochromatic chromocenter, it would be unable to deposit in the closed chromatin of this compartment. Thus, sequestration might serve two purposes: deposition in euchromatin would be prevented and deposition at centromeres would be promoted. CenpA that is overexpressed in mammalian cells also mis-incorporates into euchromatin (Ahmad, 2002).

    Although it has not been examined whether CenpA mis-incorporation is replication-independent, this is expected to be the case, because this is how CenpA deposits at centromeres (Ahmad, 2002).

    The idea that histone variants may respect nuclear compartments was first raised by experiments expressing heterologous CenH3s in Drosophila Kc and human HeLa cells. These extremely diverged heterologous histones do not localize to centromeres in these cells, implying that there is some kind of specificity for depositing the correct CenH3 at centromeres (Ahmad, 2002).

    Surprisingly, heterologous histones are preferentially enriched in the heterochromatic blocks. It has been suggested that there is a default ability of cells to enrich diverged H3 variants in the heterochromatic compartment. Perhaps heterochromatic enrichment is a normal first step in the deposition of the endogenous CenH3s (Ahmad, 2002).

    Those experiments and overexpression results encourage the view that nuclear compartments may guide histone variants to the correct subset of their potential deposition sites. Compartment effects may also affect the RI deposition of H3.3 in an inverse way to Cid: i.e., sequestering to promote H3.3 deposition at active genes, and preventing its deposition at centromeres (Ahmad, 2002).

    Because H3.3 is largely identical to H3, the hypothetical element that is recognized in H3 and results in its exclusion from chromocenters during centromere replication may also be present in H3.3. Perhaps this discrimination against canonical H3 histones also serves to prevent the RI deposition of H3.3 at centromeres (Ahmad, 2002).

    RI assembly permits immediate chromatin repair. The unfolding of chromatin during transcription may be damaging, in that the forces RNA polymerases apply to their template DNA should at least occasionally displace histone octamers from DNA. Additionally, histone octamers may sometimes be displaced by chromatin remodeling factors associated with transcriptional activity. In either case, these regions must be repackaged into nucleosomes. Similarly, replacement of CenH3s may be required to maintain the nucleosomal configuration of centromeres after mitosis. Bundles of microtubules drag a chromosome to the pole during anaphase, and the forces they apply may be sufficient to occasionally pull off histone octamers. Chromatin would then be stripped of some CenH3 histone octamers. RI deposition allows repair of this damage. In fact, the RI deposition of CenpA in mammalian cells seems to occur around the time of mitosis. The deposition of Cid in Drosophila cells occurs throughout the cell cycle, but may only be required at two points: as centromeric DNA replicates to double its chromatin, and after mitosis to repair stripped chromatin (Ahmad, 2002).

    The process of RI assembly at active genes provides a novel level of control over histone modifications. Replacement of nucleosomes in one modification state by new histones could switch chromatin to an active state. Initiation of transcription would start this process, and successive transits of RNA polymerases would promote RI assembly. The replacement H3 histone in alfalfa is hyperacetylated, and RI assembly with acetylated histones could enrich such modifications in active chromatin. However, histone modification by methylation has appeared more problematic. A number of histone methyl-transferases (HMTs) have been characterized, but no histone demethylase is known. Methylated lysine-9 in the H3 tail (H3K9me) is a critical epitope for recruiting heterochromatic chromatin proteins, because this is the binding site for HP1. HP1 recruits additional heterochromatic proteins including the Su(var)3-9 HMT. Therefore, it is straightforward to imagine how these recruited proteins could perpetuate a heterochromatic state through replication-coupled nucleosome assembly and cell division (Ahmad, 2002).

    Because an irreversible methyl modification appears to specify the heterochromatic state, it has been unknown how a heterochromatic site could switch to an active state. One route for switching might be to prevent the methylation of nucleosomes assembled during replication. Successive cell cycles could then dilute methylated nucleosomes, allowing eventual activation (Ahmad, 2002).
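
    The arithmetic behind this dilution route can be sketched as follows. This is a minimal illustration, assuming that parental nucleosomes are split evenly between the two sister chromatids at each replication and that none of the newly assembled nucleosomes become methylated; the function name and parameters are hypothetical.

```python
def methylated_fraction(initial_fraction: float, cycles: int) -> float:
    """Fraction of nucleosomes still carrying H3K9me after `cycles` replications.

    Assumes each replication halves the methylated fraction, because old
    nucleosomes are shared between sister chromatids and the gaps are
    filled with new, unmethylated nucleosomes.
    """
    if cycles < 0:
        raise ValueError("cycles must be non-negative")
    return initial_fraction * 0.5 ** cycles


# Print the remaining methylated fraction for the first few cell cycles.
for n in range(6):
    print(n, methylated_fraction(1.0, n))
```

    Under these assumptions a fully methylated domain retains one-eighth of its marked nucleosomes after three cycles, so activation by dilution alone would take several cell divisions.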

    However, more rapid mechanisms for activating silenced chromatin must exist. Induction of silenced genes can occur within a single cell cycle; for example, X chromosomes become reactivated and lose H3K9me during diplotene in the Caenorhabditis ovary. Work using a reporter for heterochromatic gene silencing suggests that switching to an active state can occur in somatic cells without cell division. Thus, H3K9me can be removed without replication-coupled nucleosome assembly (Ahmad, 2002).

    RI deposition implies that the entire heterochromatic nucleosome may be unraveled and replaced. The process of transcriptional activation may force the disassembly of H3K9me-containing nucleosomes, followed by RI assembly of an unmarked nucleosome. Although the fate of the displaced methylated H3 is not known, it is known that RI deposition can occur at any time in the cell cycle, and thus should be able to rapidly derepress silencing. Conversely, an active gene could be silenced by methylating the tail of H3.3, which presents the same lysine-9 epitope. The stability of histone methylation gives it a distinct advantage over other histone modifications for heritable effects on chromatin. The possibility of RI deposition circumvents the irreversible nature of methylation, thus retaining the potential to switch the heritable chromatin state at a later time (Ahmad, 2002).

    It is concluded that H3 variants are used to package functionally specialized chromatin, where they play vital functional roles. Localizing these variants to centromeres and to transcriptionally active regions utilizes an RI process that is distinct from the nonspecific, replication-coupled method of packaging the bulk genome. It is argued that RI deposition is the consequence of the activities that impinge on these sites in the genome and creates an open chromatin structure. This flexibility in histone deposition may be necessary to maintain the nucleosomal structure of these regions. In higher eukaryotes, the RI deposition process allows specialized chromatin to be distinguished at the most basic level, where histone variants are incorporated into chromatin. The differences between the generic H3 -- which packages the bulk of the genome -- and the H3 variants may contribute to the physical properties of specialized regions and recruit particular non-histone chromatin proteins. Because histones remain associated with DNA through mitosis, these variants establish heritable distinctions in chromatin (Ahmad, 2002).

    Centromeres are a defining feature of eukaryotes, and all are likely to have a CenH3. However, the utilization of two conserved versions like H3 and H3.3 is not universal. For example, budding yeast has only one canonical H3 histone, which undergoes both replication-coupled and RI deposition. Surprisingly, this is H3.3: phylogenetic analysis reveals that ascomycetes have lost H3, whereas their sister clade basidiomycetes have both H3 and H3.3, as do animals. Therefore, an H3.3 gene performs all general functions in some organisms. The extraordinary conservation of H3.3, which is identical from mollusks to mammals, speaks to its fundamental role in the eukaryotic nucleus (Ahmad, 2002).

    Snipper, an Eri1 homologue, affects histone mRNA abundance and is crucial for normal Drosophila melanogaster development

    The conserved 3'-5' RNA exonuclease ERI1 is implicated in RNA interference inhibition, 5.8S rRNA maturation and histone mRNA maturation and turnover. The single ERI1 homologue in Drosophila melanogaster, Snipper (Snp), is a 3'-5' exonuclease, but its in vivo function remains elusive. This study reports a Snp requirement for normal Drosophila development, since its perturbation leads to larval arrest and tissue-specific downregulation results in abnormal tissue development. Additionally, Snp directly interacts with histone mRNA, and its depletion results in a drastic reduction in histone transcript levels. It is proposed that Snp protects the 3'-ends of histone mRNAs and that, in its absence, histone transcripts are readily degraded. This in turn may lead to cell cycle delay or arrest, causing growth arrest and developmental perturbations (Alexiadis, 2017).

    Acetylation and methylation: Covalent modifications of chromatin and DNA that establish and maintain the heterochromatin-induced silenced state

    A self-reinforcing network of interactions among the three best-characterized covalent modifications that mark heterochromatin (histone hypoacetylation, histone H3-Lys9 methylation, and cytosine methylation) suggests a mechanistic basis for spreading of heterochromatin over large domains and for stable epigenetic inheritance of the silent state. Early cytological studies have distinguished two types of chromatin: euchromatin and heterochromatin. Heterochromatin was originally defined as that portion of the genome that remains condensed and deeply staining (heteropycnotic) as the cell makes the transition from metaphase to interphase; such material is generally associated with the telomeres and pericentric regions of chromosomes. Subsequent work has identified a cluster of structural features that characterizes heterochromatin. While heterochromatic regions are rich in repetitive sequences and have a low gene density, they are not devoid of genes; it is estimated that there are ~40-50 genes within the pericentric heterochromatin of Drosophila. An altered packaging of heterochromatin, to a less-accessible form, has been demonstrated by probing with nucleases and other reagents such as prokaryotic DNA methyltransferases. The data suggest that while nucleosome arrays in euchromatin are irregular, punctuated by the nucleosome-free hypersensitive sites (HS sites) characteristic of active genes, the nucleosomes in heterochromatin have a regular spacing over large arrays, with a higher proportion of the DNA associated with the histone core rather than in the linker. Euchromatic regions silenced by nucleosome packaging are referred to as 'silent chromatin,' reserving the term 'heterochromatin' for the classically defined heterochromatin (Richards, 2002).

    It is an interesting paradox that while the histones are among the most conserved proteins known in evolution, they are also among the most variable in posttranslational modification. The pattern of modifications has been suggested to act as an information code (the histone code), dictating both nucleosomal interactions and the association of nonhistone chromosomal proteins that collectively influence packaging and gene regulation. Modifications include acetylation, methylation, phosphorylation, ubiquitination, and ADP-ribosylation. Given the number of sites of posttranslational modification for each of the four core histones, an imposing number of differently modified nucleosomes is possible. The modification states of the N-terminal tails of histones H3 and H4 appear to play a major role in heterochromatin formation (Richards, 2002).

    One modification of histones, hypoacetylation of lysine residues, is associated with both formation of heterochromatin and gene silencing. Early attempts to fractionate chromatin and characterize the components led to the suggestion that heterochromatic domains were associated with hypoacetylated histones, while euchromatic domains were associated with hyperacetylated histones. This distinction is observed not only between constitutive heterochromatin and euchromatin, but also in mapping studies comparing an active or inducible gene to flanking regions (Richards, 2002).

    Histone H3 methylated at lysine 9 (H3-mLys9), a second modification of histones, has been identified as characteristic of the heterochromatic state. Immunofluorescent staining of Drosophila polytene chromosomes shows that the bulk of the H3-mLys9 is present in the pericentric heterochromatin and in a banded pattern on the fourth chromosome, known sites of repetitive DNA (Jacobs, 2001). Similarly, chromatin immunoprecipitation (ChIP) experiments demonstrate that H3-mLys9 is a prominent component of the silent mating type locus in fission yeast (Schizosaccharomyces pombe), while essentially absent from flanking regions containing inducible genes. Methylation of histone H3-Lys9 has also been associated with the silencing of euchromatic genes (Richards, 2002).

    A third biochemical marker of heterochromatin is the most common form of DNA modification in eukaryotes, namely cytosine methylation. Although absent in some eukaryotes, this DNA modification is widely distributed in the eukaryotic kingdom. It is particularly prevalent in plants and mammals where it is an important epigenetic mark that contributes to the stability of pericentromeric heterochromatin and plays a central role in cementing and maintaining epigenetic expression states, not only in heterochromatin but in silenced euchromatic domains (Richards, 2002).

    Hypoacetylation, particularly of histones H3 and H4, associated with heterochromatic domains from a range of organisms, has been studied in greatest detail in Saccharomyces cerevisiae. Many of the cis- and trans-acting factors necessary to establish and maintain the silent state at the telomeres and HML/HMR loci have been identified. These studies have demonstrated the need for hypoacetylated histones. Silencing is mediated by the multiprotein, nucleosome binding SIR(1-4) complex, recruited by interaction with specific DNA binding proteins. Sir3 and Sir4 interact specifically with the N-terminal tails of histones H3 and H4 in the hypoacetylated state. While the N-terminal tails of the histones are not required individually for growth in yeast, they do play an essential role in silencing, amino acids 4-20 of H3 and 16-29 of H4 being required. Certain sir3 alleles can suppress the silencing defect of histone H4 tail mutations, and Sir3 and Sir4 can bind to the amino termini of histones H3 and H4 in vitro, suggesting direct interaction. Recent studies using antibodies against different histone acetylated isoforms indicate that histones in the telomeric and HML/HMR heterochromatin are hypoacetylated at all modification sites (Richards, 2002 and references therein).

    What is the mechanism for histone hypoacetylation specifically at the heterochromatic domains? This function is apparently provided, at least in part, by Sir2 (Drosophila homolog: Sir2), shown to have a NAD-dependent protein deacetylase activity. Sir2 can efficiently deacetylate histones in vitro, preferentially deacetylating histone H4 at Lys16, although direct action in vivo has not yet been reported. Enzymatic activity of Sir2 is required for silencing in the heterochromatic domains. The acetylation status of H4-Lys16 may be of particular importance. Lys16 is the preferred site of acetylation in monoacetylated H4 of euchromatin in yeast, and this is the only acetylatable H4 site whose mutation strongly affects Sir3 binding in heterochromatin. Deletion of sir3 results in increased histone acetylation in heterochromatic domains, as well as a loss in silencing. The results suggest an assembly model in which interaction of the Sir2-Sir4 complex with specific DNA binding proteins leads to local histone deacetylation, permitting binding of Sir3. It appears that binding of Sir3 to the hypoacetylated histone blocks reacetylation. Given the interactions between Sir2, Sir4, and Sir3, once initiated, such a complex could spread along the nucleosome array, generating and maintaining the altered modification state (Richards, 2002 and references therein).

    In addition to the above, studies in S. pombe, Drosophila, and other organisms suggest that the histone acetylation level is used as a heritable mark of the chromatin state. Mutations in HDACs or treatment with trichostatin A (TSA), an inhibitor of some HDACs, frequently result in a loss of function in heterochromatic domains and a relaxation of silencing. For example, treatment with TSA results in functionally deficient centromeres and chromosome loss in S. pombe, concomitant with a loss of silencing for test genes within the centromeric heterochromatin. The hyperacetylated state is heritable following removal of TSA; it is linked in cis to the treated centromere locus, and correlates with inheritance of functionally defective centromeres, demonstrating an epigenetic phenomenon based on the chromatin structure. In contrast, acetylation is used as an inherited mark of activity. Histone H4-aLys16 is prominently associated with the dosage-compensated, 2-fold active X chromosome in males of Drosophila. This specific modification is due to MOF, an essential acetyltransferase of the dosage compensation complex that coats the male X chromosome. The complex remains associated with its target DNA throughout the cell cycle, providing the means to replicate the modification state. Recruitment of HATs to chromosome regions showing histone acetylation patterns corresponding to their own catalytic specificity has been observed, e.g., the histone acetyltransferase P/CAF binds preferentially to acetylated H4 and H3 peptides via a bromo domain. These observations provide evidence for use of the histone acetylation state as an epigenetic mark (Richards, 2002 and references therein).

    How does hypoacetylation impact chromatin structure? In the case described above, the hypoacetylated histone tails interact specifically with the SIR complex. While Sir2 orthologs have been identified, few proteins with similarity to the other Sir proteins have been found in multicellular eukaryotes. Nonetheless, there may be an equivalent of the SIR complex that makes similar use of the histone hypoacetylation signal. However, a significant effect might be realized through the interaction of the histone H3/H4 tails with the DNA and/or other nucleosomes in the chromatin fiber. The regions of the histone H3 and H4 tails that contribute to DNA binding, as observed in the crystal structure, are necessary for silencing of basal transcription in vivo. However, these regions are distinct from those critical for repression at the HM loci and telomeres. It appears unlikely that simply weakening intranucleosomal histone-DNA interactions by histone acetylation could alleviate the inhibitory effect of heterochromatin structure on transcription. An alternative possibility was suggested by the original crystals of the nucleosome (using histones from Xenopus), where histone H4 amino acids 16-24 were observed to interact with the acidic region formed by histones H2A and H2B on the surface of the adjacent histone octamer. The eight H2A/H2B amino acids involved in forming this negatively charged patch are highly conserved. Acetylation of the H4 tail might disrupt this interaction, leading to a loss of compaction along the chromatin fiber. However, this disposition of the histone H4 tail is not seen in crystals of the nucleosome made using yeast histones, and additional studies are needed to resolve this interesting question (Richards, 2002 and references therein).

    A key role for a second histone modification in the specification of heterochromatin is shown by the recent demonstration that mammalian homologs of Drosophila Su(var)3-9, including human SUV39H1 and murine Suv39h1, encode enzymes that specifically methylate histone H3 on lysine 9 (Rea, 2000). Su(var)3-9 was originally identified as a suppressor of PEV in Drosophila, indicating that the wild-type gene product is involved in heterochromatin formation (Tschiersch, 1994). A homolog in S. pombe, Clr4, is also a specific histone H3-Lys9 methyltransferase, suggesting that this activity is widely distributed and well conserved. clr4 mutants exhibit reduced heterochromatin formation at centromeres, with elevated mitotic chromosome loss and reduced silencing within both pericentromeric heterochromatin and the silent mating type locus. Similarly, mammalian Su(var)3-9-like proteins have been implicated in both centromere activity and gene silencing. Disruption of the murine Suv39h1 and Suv39h2 paralogs causes genome instability, chromosome mis-segregation, and male meiotic defects (Richards, 2002 and references therein).

    Further, the Suv39h1/SUV39H1 proteins are found in association with M31, a mouse Heterochromatin protein 1 (HP1) homolog. HP1, perhaps the best-characterized protein found in heterochromatin, was identified in Drosophila melanogaster in a screen of monoclonal antibodies prepared against proteins tightly bound in the nucleus. Immunofluorescent staining of the polytene chromosomes shows HP1 concentrated in the pericentric heterochromatin, the telomeres, and a banded pattern across the small fourth chromosome, known sites of repetitive DNA with characteristics of heterochromatin. A few prominent HP1 sites are observed within the euchromatic arms (e.g., region 31). Homologs of HP1 are associated with pericentric heterochromatin in organisms from S. pombe to humans. The protein (206 amino acids in Drosophila) has a conserved N-terminal chromo domain (CD) followed by a variable hinge region and a conserved C-terminal chromo shadow domain (CSD). The chromo domain was first recognized by similarity with a domain in Polycomb, a protein associated with silencing of the homeotic genes during development; this domain has now been identified in many other chromosomal proteins. Both point mutations in the chromo domain and presumed null mutations (early truncation of the translation product) in the gene encoding HP1 [Su(var)2-5] result in a loss of silencing, while an additional dose will increase silencing of a variegating euchromatic gene, i.e., one placed in a heterochromatic environment. Interestingly, the converse is true for those few genes normally resident within the pericentric heterochromatin (e.g., light), which appear to be dependent on HP1 for normal activity. The conserved structure of HP1 suggests that it might serve as a bifunctional reagent, helping to organize and maintain heterochromatin structure. HP1 interacts with a number of other chromosomal proteins, including several involved in nuclear assembly, replication, and gene regulation. 
These interactions have generally been mapped to the chromo shadow domain. The chromo shadow domain can homodimerize, and the dimer has been suggested to be the interactive species (Richards, 2002 and references therein).

    The HP1 chromo domain specifically binds histone H3 N-terminal tails methylated on lysine 9, and a variety of data suggest that this interaction is essential for maintenance of heterochromatin. The interaction appears quite specific; neither the chromo domain of Polycomb nor the chromo shadow domain of HP1 shows this interaction. The H3 tail fits within a groove established by conserved chromo domain residues; Su(var) mutation V26M results in an alteration of the structure and loss of H3-mLys9 binding. Studies in mammalian cells suggest that localization of HP1 in heterochromatin is dependent on the presence of histone H3-mLys9. However, HP1 association with heterochromatin in Drosophila can be driven either by the N-terminal portion (with the chromo domain) or the C-terminal portion (with the shadow domain), emphasizing the bifunctional nature of the protein. The above results argue that an interaction between the specifically modified histone H3 and HP1 is essential for maintaining a stable heterochromatin structure (Richards, 2002 and references therein).

    Histone H3-Lys9 methylation is influenced by preexisting modifications of histone H3 and affects other histone modifications, implying a set of functional interactions. The relationship between hypoacetylation of H3/H4 and methylation of H3 has been clarified by studies of heterochromatin formation in S. pombe. clr1-clr4, clr6, swi6, and rik1 mutations all identify trans-acting factors necessary for silencing at the S. pombe mating type locus. Swi6 is a homolog of HP1, while clr1 and rik1 code for putative DNA binding proteins. The products of clr3 and clr6 are homologs of HDACs. Clr4 is the H3-Lys9 methyltransferase. These genes work together, acting on the entire silent mating type domain to maintain it in the repressed state. Clr3, an H3-specific deacetylase, and Rik1 are required for histone H3-Lys9 methylation by Clr4, and Swi6 localization is dependent on Clr4 and Rik1 (Richards, 2002 and references therein).

    These observations suggest a progression of events leading to establishment of a distinctive heterochromatic structure based on the histone modification pattern. Deacetylation of histone H3 by Clr6 and/or Clr3 creates conditions favoring methylation at H3 Lys9 by the Clr4/Rik1 complex; methylation leads to binding of Swi6, establishing a chromatin configuration that is refractory to transcription and stably maintained. Mapping studies using chromatin immunoprecipitation show H3-mLys9 and Swi6 found throughout, and limited to, the 20 kb silent mating type domain. This 20 kb region is flanked by inverted repeats IR-L and IR-R, which appear to serve as barriers to the spread of silencing; removal of these repeats results in the appearance of H3-mLys9 and Swi6 on neighboring sequences. Silencing is dependent on the dosage of Swi6, which remains bound to the mating type region throughout the cell cycle and may itself be a marker for heterochromatin formation (Richards, 2002 and references therein).

    The findings suggest a mechanism for maintaining heterochromatin structure following replication and for driving the spread of heterochromatin. During replication, the DNA must be 'unpackaged' and the daughter DNA molecules repackaged into nucleosomes. Parental histones are efficiently reutilized, distributed randomly to the two daughter DNA molecules; an equal amount of newly synthesized histone is required to complete assembly. Assuming that the histone H3-mLys9 in a heterochromatic domain is stable (no histone demethylases have been identified as yet), it will associate with HP1 through the chromo domain. The presence of HP1 will result in assembly of a modifying complex, presumably through the chromo shadow domain, that will deacetylate and specifically methylate the newly arrived histone, perpetuating the pattern of modification and HP1 binding to establish a heterochromatic structure. Recovery of a SUV39H1-HDAC1 complex from Drosophila embryo extracts that can methylate preacetylated histones supports such a model (Czermin, 2001). Formation of complexes that both recognize a particular pattern of histone modification and have the ability to achieve that pattern provides a mechanism for epigenetic inheritance of chromatin structure. The same machinery could account for spreading of heterochromatin, requiring that boundaries to such spread be established (Richards, 2002 and references therein).

    Genetic analyses in S. pombe and Drosophila indicate that while the H3-mLys9/HP1 system is critical for heterochromatin formation and silencing in pericentric heterochromatin, it is of less importance at the telomeres, suggesting that an additional mechanism is used in those domains. Association of HP1 and a dependence on Su(var)3-9 activity have also been identified as critical in silencing particular euchromatic genes, both in mammalian systems and in Drosophila (Hwang, 2001). Interestingly, it appears that the histone H3-mLys9 modification at Rb-associated genes is quite limited; one nucleosome at the promoter is so modified, while an immediately upstream nucleosome is not, suggesting a difference in the capacity of the modified structure to spread. Histone H3-mLys9 is also associated with the inactive X chromosome in human cells, but no HP1 homologs have been identified preferentially associated with this domain. Whether differences in the degree of histone methylation or other modifications of histone H3 are important in determining any partner of H3-mLys9 in this case remains to be seen (Richards, 2002 and references therein).

    A third silent chromatin mark, 5-methylcytosine (5mC), affects the DNA itself. Postreplicative methylation of cytosine is carried out by a diverse group of cytosine DNA methyltransferases (Dnmt's). Beyond this, little is known about the mechanisms that establish, maintain, and modify cytosine methylation patterns. At the whole genome level, it is clear that cytosine methylation patterns can be quite dynamic. The best example is the erasure and resetting of cytosine methylation in early mammalian development. However, large swings in cytosine methylation levels have not been detected during zebrafish development, and the evidence in plants is contradictory. Regardless of whether de novo methylation occurs every generation or in rare initiating events, certain DNA sequences must be targeted for cytosine methylation. At present, little is understood about the primary DNA sequence determinants for targeting, if any. Analysis of a Neurospora sequence prone to de novo methylation indicates the presence of redundant elements promoting methylation and suggests that TpA-rich sequences may be important. Unfortunately, similar detailed studies are not available in other organisms. Certain cytosine methyltransferases, such as mouse Dnmt3a and Dnmt3b, are specialized to carry out de novo methylation. However, these enzymes do not appear to have the intrinsic capacity for discrimination among primary nucleotide sequences, nor among higher-order structures. These considerations suggest that de novo cytosine methyltransferases might be taking cues from another epigenetic mark (Richards, 2002 and references therein).

    Communication between the histone code and cytosine methylation may provide at least a partial answer to the long-standing question of how cytosine methylation patterns are established. The most direct evidence for a connection with histone methylation comes from genetic screens for cytosine hypomethylation mutants in Neurospora. The genome of this filamentous fungus contains 5-methylcytosine (~1.5 % of total C) concentrated in repetitive DNA (e.g., rRNA genes) and remnants from RIP activity (repeat induced point mutation, a hypermutation surveillance system that detects sequence duplications). Two Neurospora mutations completely abolish cytosine methylation in vegetative cells. One of these, dim-2, disrupts a gene encoding a cytosine methyltransferase. The other, dim-5, maps to a gene encoding a histone H3 methyltransferase. The predicted DIM-5 gene product contains a SET domain flanked by cysteine-rich elements and has sequence similarity to the histone methyltransferases Clr4 and Su(var)3-9, although it lacks a chromo domain. Recombinant DIM-5 protein exhibits histone methyltransferase activity in vitro. Strikingly, transformation of Neurospora with modified histone H3 genes with a substituted amino acid at Lys9 (the probable site of methylation by DIM-5) reduces cytosine methylation and relieves 5mC mediated gene silencing. Given that the dim-5 mutation appears to abolish all cytosine methylation, the results suggest that all DNA methylation in Neurospora takes its cue from histone H3-mLys9. It will be important to determine whether the histone methylation-DNA methylation connection is also found in other organisms, and if so, whether all cytosine methylation lies downstream of histone methylation. The dim-5 mutation causes more phenotypic defects than the dim-2 cytosine methyltransferase mutation, suggesting that a histone methylation deficiency has effects beyond those that result from loss of cytosine methylation (Richards, 2002 and references therein).

    A connection between the histone code and the 5mC code is supported by other findings. The presence in flowering plants of cytosine methyltransferases that contain a chromo domain is particularly intriguing. Such 'chromo methyltransferases' (CMTs) might be recruited to a genomic region by nucleosomes containing histone H3-mLys9; thus, histone modification would provide a foundation for establishing DNA methylation patterns. However, the CMTs have not yet been demonstrated to bind methylated histone H3, nor is it clear that these methyltransferases possess de novo methyltransferase activity. Moreover, chromo methyltransferases have not been documented outside of plant species. Consequently, chromo methyltransferases are unlikely to be solely responsible for translating the histone methylation code into the 5mC epigenetic mark (Richards, 2002 and references therein).

    Indirect models for the flow of information from histone H3-mLys9 to 5mC also need to be considered. The H3-mLys9 mark creates a foundation for HP1 interaction and subsequent heterochromatin formation. Cytosine methylation may be targeted to heterochromatin due to any number of characteristics, including nonhistone chromosomal protein content, subnuclear localization, or DNA replication timing. Disruption of heterochromatin by loss of the H3-mLys9 mark may lead to loss of 5mC through a number of intermediary steps. A 'chromatin first/cytosine methylation second' model is consistent with the demonstration that loss or alteration of cytosine methylation can be caused by mutations in SWI2/SNF2-like proteins in Arabidopsis, mice, and humans (Richards, 2002 and references therein).

    Once 5mC patterns have been established, they must be maintained in order to serve as an inherited epigenetic code. The potential of cytosine methylation as a mitotic memory device was first described in the 'maintenance methylation' model. The essential feature of the model is clonal inheritance of the 5mC patterns through mitotic, and possibly meiotic, divisions based on the symmetrical nature of the sequences modified (e.g., CpG) and the specificity of 'maintenance' DNA methyltransferases for hemimethylated DNA. The basic tenets of the maintenance methylation model have been supported by a wealth of evidence. The bulk of cytosine methylation occurs very shortly after DNA replication, catalyzed by methyltransferases that have hemimethylated substrate preferences, recruited to the vicinity of the replication fork by interaction with PCNA. However, the classic maintenance methylation model is inadequate to explain the variability of 5mC patterns within individuals and omits some of the known components of the cytosine methylation system. Not all cytosine methylation occurs at short symmetrical sequences, so a simple maintenance methyltransferase, making reference solely to cytosine methylation on the template strand, cannot perpetuate methylation patterns. Maintenance of 5mC patterns at nonsymmetrical sites might involve reiterated de novo methylation and may represent an additional tier of DNA methylation superimposed on the pattern of 5mC at symmetrical sites. The machinery necessary to maintain 5mC at asymmetric sites has not been firmly established, but clues are emerging. Dnmt3a has been implicated in the synthesis of 5mC at asymmetric sites in mice. In Neurospora, a single cytosine methyltransferase, DIM-2, is responsible for all vegetative 5mC, including both symmetrical and asymmetrical sites. 
In plants, 5mC in asymmetrical sequences has been associated with chromo methyltransferases and RNA-dependent DNA methylation (Richards, 2002 and references therein).
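
    The logic of the maintenance methylation model, and its limitation at asymmetric sites, can be illustrated with a minimal sketch. The `Duplex` class, the function names, and the seven-base example sequence are hypothetical and chosen for illustration. The maintenance step is reduced to one rule: a hemimethylated CpG is restored to full methylation, and no de novo methylation occurs.

```python
from dataclasses import dataclass
from typing import FrozenSet, List, Tuple


@dataclass(frozen=True)
class Duplex:
    top: str                     # top-strand sequence, 5'->3'
    top_5mc: FrozenSet[int]      # indices of methylated C on the top strand
    bottom_5mc: FrozenSet[int]   # top-strand indices of G whose paired bottom-strand C is methylated


def cpg_sites(top: str) -> List[int]:
    """Indices of the C in every CpG dinucleotide on the top strand."""
    return [i for i in range(len(top) - 1) if top[i:i + 2] == "CG"]


def replicate(d: Duplex) -> Tuple[Duplex, Duplex]:
    """Semi-conservative replication: each daughter keeps one parental strand
    (with its methylation) and receives one new, unmethylated strand."""
    keeps_top = Duplex(d.top, d.top_5mc, frozenset())
    keeps_bottom = Duplex(d.top, frozenset(), d.bottom_5mc)
    return keeps_top, keeps_bottom


def maintain(d: Duplex) -> Duplex:
    """Maintenance step: any CpG methylated on at least one strand becomes
    methylated on both strands; unmethylated sites are left untouched."""
    top, bottom = set(d.top_5mc), set(d.bottom_5mc)
    for i in cpg_sites(d.top):
        if i in top or (i + 1) in bottom:
            top.add(i)         # C of the CpG on the top strand
            bottom.add(i + 1)  # C of the CpG on the bottom strand (pairs with the G)
    return Duplex(d.top, frozenset(top), frozenset(bottom))


# Index 1 is the C of a symmetric CpG; index 4 is a C in an asymmetric CpA context.
parent = Duplex("ACGTCAT", frozenset({1, 4}), frozenset({2}))
daughter_a, daughter_b = (maintain(x) for x in replicate(parent))
print(daughter_a == parent)        # True: the daughter keeping the top strand is fully restored
print(sorted(daughter_b.top_5mc))  # [1]: the asymmetric mark at index 4 is not copied
```

    In this sketch the symmetric CpG mark survives in both daughters, whereas the asymmetric mark persists only on the parental strand that carried it, so reference to the template strand alone cannot perpetuate it.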

    The classical methylation maintenance model accounts for loss of 5mC through a passive mechanism: DNA replication in the absence of maintenance methylation. Cytological data using immuno-detection of 5mC argue that passive demethylation causes the dramatic erasure of DNA methylation patterns in early mammalian development. However, observation of 5mC loss in the absence of DNA replication has suggested an active demethylation mechanism as well. A 5mC-DNA glycosylase might also contribute to the dramatic swings in cytosine methylation seen in mammalian development (Richards, 2002 and references therein).

    The execution of gene silencing from the 5mC mark involves modulation of another epigenetic mark: hypoacetylation of histones. Two independent pathways have been discovered in vertebrates connecting 5mC to histone deacetylation. The first uses methyl cytosine binding proteins, MeCP, or MBD (methyl binding domain) proteins as adaptors connecting 5mC to histone deacetylase complexes. Several MBD/MeCP protein-HDAC complexes have been identified in mammalian cells. These complexes act to reduce local histone acetylation levels using the 5mC marks on the DNA as a guide (Richards, 2002 and references therein).

    A second pathway, also uncovered in mammals, operates through a physical interaction between the maintenance cytosine methyltransferase DNMT1 and HDACs. The catalytic domain of DNMT1 is not necessary for this interaction, suggesting that this cytosine methyltransferase is actually a transcriptional corepressor independent of its ability to methylate DNA. This interaction could act to reinforce inheritance of silent chromatin by facilitating histone deacetylation at the replication forks, where DNMT1 acts to maintain the 5mC epigenetic mark on methylated DNA sequences (Richards, 2002 and references therein).

    Epigenetic information may also flow from the histone acetylation state back to cytosine methylation. The HDAC inhibitor TSA leads to cytosine hypomethylation at specific sequences in Neurospora, and a similar effect has been noted in mammalian cells. The loss of DNA methylation may be related to transcriptional activation, but other mechanisms have been proposed, including activation of cytosine demethylases. Inhibition of histone deacetylation does not lead to global loss of DNA methylation, however. For example, disruption of a histone deacetylase gene in plants did not lead to a generalized loss of 5mC despite a 10-fold elevation in histone H4 acetylation. Regardless of the significance of the retrograde signaling, the well-established flow of information from 5mC to histone deacetylation closes the loop of a self-reinforcing cycle for those organisms that utilize cytosine methylation (Richards, 2002).

    The cycle of epigenetic marks discussed here suggests that initiation of heterochromatin formation, or similar silencing of euchromatic domains, requires acquisition of at least one epigenetic mark. What is known about entry into the cycle? In S. cerevisiae, protein interactions with specific cis-acting DNA sequences, such as E and I at the HM loci, or telomeric repeats, provide the foundation to recruit the SIR silencing complexes. The E2F-Rb-SUV39H1-HP1 interaction in mammals also implicates specific DNA sequences (binding sites for E2F) as initiation sites for silencing. Silencing within the mating type locus of S. pombe appears to be controlled both by local elements (REII and mat3 silencer) operating similarly to E and I in S. cerevisiae and by packaging of the domain as a whole, dependent on a block of repetitive DNA. In other organisms, the repetitive nature of the locus, rather than the primary DNA sequence, may be a trigger. The mechanisms at work are not clear, but hints can be derived from the repeat sensing/silencing phenomena in filamentous fungi, MIP (methylation induced premeiotically) in Ascobolus and RIP in Neurospora. In these systems, repeats appear to be recognized by a DNA-DNA pairing mechanism. In Ascobolus, cytosine methylation can be transferred between alleles, accompanying meiotic pairing and recombination events. RNA signals may provide another entrée into the cycle of epigenetic silencing. Two noncoding RNA species, Xist and Tsix, are pivotal for initiation and choice in X chromosome inactivation in mice, where H3-Lys9 methylation is an early event. RNA may also have a role in initiating silent chromatin formation by directing the acquisition of cytosine methylation marks. Resolution of this question will be one of the major goals of future research (Richards, 2002 and references therein).

    Once a genomic region has been targeted for silencing by acquisition of one or more covalent epigenetic marks, a silent chromatin identity can be propagated. The general features of the system include (1) positive signaling between the different covalent epigenetic marks and (2) enzymatic complexes/pathways that recognize each mark and catalyze the formation of the same mark. For example, in yeast, the histone H3/H4 deacetylation mark is recognized by Sir3, leading to recruitment of the Sir2 histone deacetylase. The histone H3-mLys9 mark is recognized by HP1, which can apparently recruit the histone methyltransferase activity of Su(var)3-9 homologs. The third self-reinforcing loop is carried out by maintenance cytosine methyltransferases, which have a substrate preference for hemimethylated DNA. The modification pathways operating on each covalent mark also interact and reinforce each other (Richards, 2002 and references therein).

    In organisms lacking 5mC, a histone modification code appears to be sufficient to mark and perpetuate silent chromatin domains. The feedback loop between histone methylation and histone deacetylation, coupled with mechanisms to maintain these modifications, apparently provides stable silencing. In fact, S. cerevisiae appears to utilize neither DNA modification nor the HP1/histone H3-mLys9 complex, relying solely on deacetylation of histones H3/H4 as an epigenetic mark to maintain silencing. The transmission of chromatin states requires that at least one of the covalent marks be inherited through mitotic, and possibly meiotic, cell divisions. All three of these marks meet the criteria of persistence through mitosis (Richards, 2002).

    While self-reinforcing mechanisms may be advantageous to ensure maintenance of silencing for genomic sequences to be archived for the long term in a nonexpressed state (e.g., transposons, pericentromeric repeats), there may be a need to reconfigure silenced chromatin as a prerequisite to expression of specific genes (e.g., mating type switching). In this case, what general mechanisms can be used to break the heterochromatin reinforcing cycle? Removal of the histone H3-mLys9 mark may require turnover of the entire protein, since no histone demethylase has yet been identified. Histones, however, are generally very stable. In comparison, the 5mC mark is more easily erased by passive or active demethylation mechanisms. The most malleable mark is the deacetylation of histones, the levels of which are set by the competing activities of histone acetylases and histone deacetylases (Richards, 2002).

    General transcriptional silencing by a Polycomb response element in Drosophila

    Polycomb response elements (PREs) are cis-regulatory sequences required for Polycomb repression of Hox genes in Drosophila. PREs function as potent silencers in the context of Hox reporter genes and they have been shown to partially repress a linked miniwhite reporter gene. The silencing capacity of PREs has not been systematically tested and, therefore, it has remained unclear whether only specific enhancers and promoters can respond to Polycomb silencing. Using a reporter gene assay in imaginal discs, it has been shown that a PRE from the Drosophila Hox gene Ultrabithorax potently silences different heterologous enhancers and promoters that are normally not subject to Polycomb repression. Silencing of these reporter genes is abolished in PcG mutants and excision of the PRE from the reporter gene during development results in loss of silencing within one cell generation. Together, these results suggest that PREs function as general silencer elements through which PcG proteins mediate transcriptional repression (Sengupta, 2004).

    A 1.6 kb fragment encompassing the PRE from the Ubx upstream control region was tested for its capacity to prevent transcriptional activation by enhancers from genes that are normally not under PcG control. For this purpose, three different enhancers were tested in a lacZ reporter gene assay in imaginal discs: dppWE, the imaginal disc enhancer from the decapentaplegic (dpp) gene; vgQE the quadrant enhancer from the vestigial (vg) gene; and vgBE, the vg D/V boundary enhancer. If linked to a reporter gene, each of these enhancers directs a distinct pattern of expression in the wing imaginal disc and activation by each enhancer is regulated by transcription factors that are controlled by a different signaling pathway. Specifically, the dpp enhancer contains binding sites for the Ci protein and is activated in response to hedgehog signaling, the vg quadrant enhancer contains binding sites for the Mad transcriptional regulator and is activated in response to dpp signaling, and the vg boundary enhancer contains binding sites for the Su(H) transcription factor and is regulated by Notch signaling. The dppWE, vgQE and vgBE enhancers were individually inserted into a lacZ reporter gene construct that contained the PRE fragment and either a TATA box minimal promoter from the hsp70 gene (here referred to as TATA), or a 4.1 kb fragment of the proximal Ubx promoter (here referred to as UbxP), fused to lacZ. In each construct, the PRE fragment was flanked by FRT sites that permit excision of the PRE fragment by flp recombinase. Several independent transgenic lines for each of the six PRE transgenes were generated. From individual transgene insertions, derivative transgenic lines were then generated by flp-mediated excision of the PRE in the germline. Thus expression of individual transgene insertions could be compared in the presence and absence of the PRE by staining wing imaginal discs for ß-galactosidase (ß-gal) activity. 
In the absence of the PRE, each of the three enhancers tested directs ß-gal expression in a characteristic, previously described pattern. Each enhancer activated expression in the same pattern from either the TATA box minimal promoter or the Ubx promoter, with some minor, promoter-specific differences in expression levels. By contrast, in most of the parental transformant lines, i.e., those carrying the corresponding reporter gene with the PRE, ß-gal expression is completely suppressed. These observations suggest that the PRE fragment very potently silences each of the six reporter genes. It is noted, however, that at some transgene insertion sites, silencing by the PRE fragment appeared to be impeded by flanking chromosomal sequences; in these cases, ß-gal expression was activated even in the presence of the PRE (Sengupta, 2004).

    To test whether silencing of the reporter genes by the PRE depends on PcG gene function, the PRE-containing transgenes >PRE>dppWE-TATA-lacZ and >PRE>vgQE-Ubx-lacZ were introduced into larvae that carried mutations in the PcG gene Suppressor of zeste 12 [Su(z)12]. Su(z)12 encodes a core component of the Esc-E(z) histone methyltransferase. Silencing of both transgenes is lost in Su(z)12^2/Su(z)12^3 mutant larvae, and the transgenes express ß-gal at levels comparable with the transgene derivatives that lack the PRE fragment. Taken together, these observations suggest that the 1.6 kb PRE fragment from Ubx is a very potent general transcriptional silencer element that represses transcription in a PcG protein-dependent manner. Thus, it appears that this PRE acts indiscriminately to block transcriptional activation by a variety of different activator proteins (Sengupta, 2004).

    To test the long-term requirement for the PRE for silencing of these reporter genes, the PRE was excised during larval development and ß-gal expression was then monitored at different time points after excision. Forty-eight hours after induction of flp expression, all six reporter genes showed robust derepression of ß-gal, suggesting that, in each case, removal of the PRE results in the loss of PcG silencing. Among the different enhancer-promoter combinations used in this study, the dppW enhancer fused to the TATA box minimal promoter appears to direct the highest levels of lacZ expression; >PRE>dppW-TZ transformant lines consistently show the strongest ß-gal staining after excision of the PRE. Therefore >PRE>dppW-TZ transformants were analyzed at 4, 8, 12 and 24 hours after induction of flp expression to study the kinetics of this derepression. No ß-gal signal was detected at 4 hours or even at 8 hours after flp induction, but 12 hours after flp induction, all discs showed robust ß-gal expression. Thus, even in the case of the most potent enhancer-promoter combination used (i.e. dppW enhancer and TATA box minimal promoter), a delay of 12 hours between flp induction and ß-gal expression was observed. Since the average cell cycle length of imaginal disc cells in third instar larvae is 12 hours, this implies that most disc cells have undergone a full division cycle within this period. Derepression of the reporter gene in this experiment requires several steps: (1) excision of the PRE by the flp recombinase; (2) dissociation of the PRE and PcG proteins attached to it, possibly by disrupting PcG protein complexes formed between the PRE and factors bound at the promoter; and (3) transcriptional activation by factors binding to the enhancer in the construct. It is possible that one or several steps in this process require a specific process during the cell cycle (e.g., passage through S phase) (Sengupta, 2004).

    These experiments show that three reporter genes, each containing a different enhancer linked to a canonical TATA box promoter, are completely silenced by a PRE placed upstream of the enhancer. The data suggest that PcG proteins acting through this PRE indiscriminately prevent activation by a variety of different transcription factors. The PcG machinery thus does not seem to require any specific enhancer and/or promoter sequences for repression (Sengupta, 2004).

    Two points deserve to be discussed in more detail. The first concerns the stability of silencing imposed by a PRE. Previous studies have suggested that transcriptional activation in the early embryo could prevent the establishment of PcG silencing by PREs. More specifically, early transcriptional activation of Hox genes by blastoderm enhancers may play an important role in preventing the establishment of permanent PcG silencing in segment primordia in which Hox genes need to be expressed at later developmental stages. Importantly, none of the three enhancers used in this study is active in the early embryo. Moreover, these enhancers probably do not contain binding sites for specific transcriptional repressors, such as the gap repressors, which are required for establishment of PcG silencing at some PREs in the early embryo. It is therefore imagined that, in these constructs, PcG silencing complexes assemble by default on the 1.6 kb Ubx PRE in the early embryo and that PcG silencing is thus firmly established by the stage when the imaginal disc enhancers would become active. Silencing by the PRE during larval stages therefore appears to be dominant over activation and cannot be overcome by any of the enhancers used in this study. There is other evidence in support of the idea that PcG silencing during larval development is more stable than in embryos. In particular, a PRE reporter gene that contains a Gal4-inducible promoter is only transiently activated if a pulse of the transcriptional activator Gal4 is supplied during larval development; by contrast, a pulse of Gal4 during embryogenesis switches the PRE into an 'active mode' that supports transcriptional activation throughout development. Furthermore, recent studies in imaginal discs suggest that there is a distinction between transcriptional repression and the inheritance of the silenced state; the silenced state can be propagated for some period even if repression is lost.
Specifically, loss of Hox gene silencing after removal of PcG proteins in proliferating cells can be reversed if the depleted PcG protein is resupplied within a few cell generations. Taken together, it thus appears that PcG silencing during postembryonic development is a remarkably stable process. Finally, the results reported in this study also imply that, once PcG silencing is established, Hox genes can make use of virtually any type of transcriptional activator to maintain their expression; PcG silencing will ensure that activation by these factors only occurs in cells in which the Hox gene should be active. The analysis of Ubx control sequences supports this view; if individually linked to a reporter gene, most late-acting enhancers direct expression both within and outside of the normal Ubx expression domain (Sengupta, 2004).

    The second point to discuss concerns the repression mechanism used by PcG proteins. Biochemical purification of PRC1 has revealed that several TFIID components co-purify with the PcG proteins that constitute the core of PRC1. Moreover, formaldehyde crosslinking experiments in tissue culture cells showed that TFIID components are associated with promoters, even if these are repressed by PcG proteins. This suggests that PcG protein complexes anchored at the PRE interact with general transcription factors bound at the promoter. One possibility would be that PcG repressors directly target components of the general transcription machinery to prevent transcriptional activation by enhancer-binding factors. Three distinct activators act through the three enhancers used in this study and, according to these results, none of them is able to overcome the block imposed by the PcG machinery. But how do the known activities of PcG protein complexes [i.e., histone methylation by the Esc-E(z) complex and inhibition of chromatin remodeling by PRC1] fit into this scenario? Both these activities may be required for the repression process by altering the structure of chromatin around the transcription start site and thus prevent the formation of productive RNA Pol II complexes. Other scenarios are possible. For example, histone methylation may primarily serve to mark the chromatin for binding of PRC1 through Pc, and PRC1 components such as Psc then perform the actual repression process. Whatever the exact repression mechanism may be, the PRE-excision experiment shows that this repression is lost within one cell generation after removal of the PRE. This implies that changes in the chromatin generated by the action of PcG proteins cannot be propagated by the flanking chromatin (Sengupta, 2004).

    Systematic protein location mapping reveals five principal chromatin types in Drosophila cells

    Chromatin is important for the regulation of transcription and other functions, yet the diversity of chromatin composition and the distribution along chromosomes are still poorly characterized. By integrative analysis of genome-wide binding maps of 53 broadly selected chromatin components in Drosophila cells, this study shows that the genome is segmented into five principal chromatin types (see Chromatin types are characterized by distinctive protein combinations and histone modifications) that are defined by unique yet overlapping combinations of proteins and form domains that can extend over > 100 kb. A repressive chromatin type was identified that covers about half of the genome and lacks classic heterochromatin markers. Furthermore, transcriptionally active euchromatin consists of two types that differ in molecular organization and H3K36 methylation and regulate distinct classes of genes. Finally, evidence is provided that the different chromatin types help to target DNA-binding factors to specific genomic regions. These results provide a global view of chromatin diversity and domain organization in a metazoan cell (Filion, 2010).

    By systematic integration of 53 protein location maps this study found that the Drosophila genome is packaged into a mosaic of five principal chromatin types, each defined by a unique combination of proteins. Extensive evidence demonstrates that the five types differ in a wide range of characteristics besides protein composition, such as biochemical properties, transcriptional activity, histone modifications, replication timing, DNA binding factor (DBF) targeting, as well as sequence properties and functions of the embedded genes. This validates the classification by independent means and provides important insights into the functional properties of the five chromatin types (Filion, 2010).

    Identifying five chromatin states out of the binding profiles of 53 proteins comes out as a surprisingly low number (one can form approximately 10^16 subsets of 53 elements). It is emphasized that the five chromatin types should be regarded as the major types. Some may be further divided into sub-types, depending on how fine-grained one wishes the classification to be. For example, within each of the transcriptionally active chromatin types, promoters and 3' ends of genes exhibit (mostly quantitative) differences in their protein composition and thus could be regarded as distinct sub-types. However, these local differences are minor relative to the differences between the five principal types that are described in this study. It cannot be excluded that the accumulation of binding profiles of additional proteins would reveal other novel chromatin types. It is also anticipated that the pattern of chromatin types along the genome will vary between cell types. For example, many genes that are embedded in 'BLACK' chromatin (defined in Kc167 cells) are activated in some other cell types. Thus, the chromatin of these genes is likely to switch to an active type (Filion, 2010).
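
    The subset count quoted above can be checked with a short calculation. The sketch below assumes that each of the 53 proteins is simply scored as present or absent at a locus, which gives 2^53 possible combinations; the function name is illustrative and is not taken from the study.

```python
from math import log10


def count_subsets(n_proteins: int) -> int:
    """Number of distinct present/absent combinations of n_proteins proteins."""
    return 2 ** n_proteins


n_combinations = count_subsets(53)
print(n_combinations)                    # 9007199254740992
print(round(log10(n_combinations), 2))   # 15.95, i.e. roughly 10^16
```

    Against this number of possible combinations, five principal types is a very compact description of the data.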

    While the integration of data for 53 proteins provides substantial robustness to the classification of chromatin along the genome, a subset of only five marker proteins (histone H1, PC, HP1, MRG15 and BRM), which together occupy 97.6% of the genome, can recapitulate this classification with 85.5% agreement. Assuming that no unknown additional principal chromatin types exist in some cell types, DamID or ChIP of this small set of markers may thus provide an efficient means to examine the distribution of the five chromatin types in various cells and tissues, with acceptable accuracy (Filion, 2010).
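
    One simple way to express agreement between two classifications is the fraction of genomic positions that receive the same chromatin type in both. The sketch below illustrates only that idea; it is not the procedure used in the study, and the function name and the toy labels are invented for the example.

```python
def agreement(labels_full, labels_reduced):
    """Fraction of positions assigned the same chromatin type by both classifications."""
    if len(labels_full) != len(labels_reduced):
        raise ValueError("both classifications must cover the same positions")
    if not labels_full:
        raise ValueError("no positions to compare")
    matches = sum(a == b for a, b in zip(labels_full, labels_reduced))
    return matches / len(labels_full)


# Toy data (hypothetical): eight positions, one of which is classified differently.
full_labels = ["BLACK", "BLACK", "RED", "YELLOW", "BLUE", "GREEN", "BLACK", "RED"]
reduced_labels = ["BLACK", "BLACK", "RED", "YELLOW", "BLUE", "GREEN", "BLACK", "YELLOW"]
print(agreement(full_labels, reduced_labels))  # 0.875
```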

    Previous work on the expression of integrated reporter genes had suggested that most of the fly genome is transcriptionally repressed, contrasting with the low coverage of PcG and HP1-marked chromatin. BLACK chromatin, which consists of a previously unknown combination of proteins and covers about half of the genome, may account for these observations. Essentially all genes in BLACK chromatin exhibit extremely low expression levels, and transgenes inserted in BLACK chromatin are frequently silenced, indicating that BLACK chromatin constitutes a strongly repressive environment. Importantly, BLACK chromatin is depleted of PcG proteins, HP1, SU(VAR)3-9 and associated proteins, and is also the latest to replicate, underscoring that it is different from previously characterized types of heterochromatin (identified as BLUE and GREEN chromatin in this study) (Filion, 2010).

    The proteins that mark BLACK domains provide important clues to the molecular biology of this type of chromatin. Loss of Lamin (LAM), Effete (EFF) or histone H1 causes lethality during Drosophila development. Extensive in vitro and in vivo evidence has suggested a role for H1 in gene repression, most likely through stabilization of nucleosome positions. The enrichment of LAM points to a role of the nuclear lamina in gene regulation in BLACK chromatin, consistent with the long-standing notion that peripheral chromatin is silent. Depletion of LAM causes derepression of several LAM-associated genes (Shevelyov, 2009), while artificial targeting of genes to the nuclear lamina can reduce their expression, suggesting a direct repressive contribution of the nuclear lamina in BLACK chromatin. D1 is a little-studied protein with 11 AT-hook domains. Overexpression of D1 causes ectopic pairing of intercalary heterochromatin (Smith, 2010), suggesting a role in the regulation of higher-order chromatin structure. SUUR specifically regulates late replication on polytene chromosomes (Zhimulev, 2003), which is of interest because BLACK chromatin is particularly late-replicating. EFF is highly similar to the yeast and mammalian ubiquitin ligase Ubc4 that mediates ubiquitination of histone H3, raising the possibility that nucleosomes in BLACK chromatin may carry specific ubiquitin marks. These insights suggest that BLACK chromatin is important for chromosome architecture as well as gene repression and provide important leads for further study of this previously unknown yet prevalent type of chromatin (Filion, 2010).

    In RED and YELLOW chromatin most genes are active, and the overall expression levels are similar between these two chromatin types. However, RED and YELLOW chromatin differ in many respects. One of the conspicuous distinctions is the disparate levels of H3K36me3 at active transcription units. This histone mark is thought to be laid down in the course of transcription elongation and may block the activity of cryptic promoters inside the transcription unit. Why active genes in RED chromatin lack H3K36me3 remains to be elucidated (Filion, 2010).

    The remarkably high protein occupancy in RED chromatin suggests that RED domains are 'hubs' of regulatory activity. This may be related to the predominantly tissue-specific expression of genes in RED chromatin, which presumably requires many regulatory proteins. It is noted that the DamID assay integrates protein binding events over nearly 24 hours, so it is likely that not all proteins bind simultaneously; some proteins may bind only during a specific stage of the cell cycle. It is highly unlikely that the high protein occupancy in RED chromatin originates from an artifact of DamID, e.g. caused by a high accessibility of RED chromatin. First, all DamID data are corrected for accessibility using parallel Dam-only measurements. Second, several proteins, such as EFF, SU(VAR)3-9 and histone H1 exhibit lower occupancies in RED than in any other chromatin type. Third, ORC also shows a specific enrichment in RED chromatin, even though it was mapped by ChIP, by another laboratory and on another detection platform. Fourth, DamID of Gal4-DBD does not show any enrichment in RED chromatin (Filion, 2010).

    RED chromatin resembles DBF binding hotspots that were previously discovered in a smaller-scale study in Drosophila cells. Discrete genomic regions targeted by many DBFs have recently also been found in mouse ES cells; hence it is tempting to speculate that an equivalent of RED chromatin may also exist in mammalian cells. Housekeeping and dynamically regulated genes in budding yeast also exhibit a dichotomy in chromatin organization, which may be related to the distinction between YELLOW and RED chromatin. The observations that RED chromatin is generally the earliest to replicate and is strongly enriched in ORC binding suggest that this chromatin type may be involved not only in transcriptional regulation but also in the control of DNA replication (Filion, 2010).

    This analysis of DBF binding indicates that the five chromatin types together act as a guidance system to target DBFs to specific genomic regions. This system directs DBFs to certain genomic domains even though the DBF recognition motifs are more widely distributed. It is proposed that targeting specificity is at least in part achieved through interactions of DBFs with particular partner proteins that are present in some of the five chromatin types but not in others. The observation that yeast Gal4-DBD binds its motifs with nearly equal efficiency in all five chromatin types suggests that differences in compaction among the chromatin types represent overall a minor factor in the targeting of DBFs. Although additional studies will be needed to further investigate the molecular mechanisms of DBF guidance, the identification of five principal types of chromatin provides a firm basis for future dissection of the roles of chromatin organization in global gene regulation (Filion, 2010).

    Heterochromatin remodeling by CDK12 contributes to learning in Drosophila

    Dynamic regulation of chromatin structure is required to modulate the transcription of genes in eukaryotes. However, the factors that contribute to the plasticity of heterochromatin structure are elusive. This study reports that cyclin-dependent kinase 12 (CDK12), a transcription elongation-associated RNA polymerase II (RNAPII) kinase, antagonizes heterochromatin enrichment in Drosophila chromosomes. Notably, loss of CDK12 induces the ectopic accumulation of heterochromatin protein 1 (HP1) on euchromatic arms, with a prominent enrichment on the X chromosome. Furthermore, ChIP and sequencing analysis reveals that the heterochromatin enrichment on the X chromosome mainly occurs within long genes involved in neuronal functions. Consequently, heterochromatin enrichment reduces the transcription of neuronal genes in the adult brain and results in a defect in Drosophila courtship learning. Taken together, these results define a previously unidentified role of CDK12 in controlling the epigenetic transition between euchromatin and heterochromatin and suggest a chromatin regulatory mechanism in neuronal behaviors (Pan, 2015).

    Helitrons shaping the genomic architecture of Drosophila: enrichment of DINE-TR1 in alpha- and beta-heterochromatin, satellite DNA emergence, and piRNA expression

    Drosophila INterspersed Elements (DINEs) constitute an abundant but poorly understood group of Helitrons present in several Drosophila species. The general structure of DINEs includes two conserved blocks that may or may not contain a region with tandem repeats in between. These central tandem repeats (CTRs) are similar within species but highly divergent between species. This study identified a subset of DINEs, termed DINE-TR1, which contain homologous CTRs of approximately 150 bp. DINE-TR1 are found in the sequenced genomes of several Drosophila species. However, interspecific high sequence identity (~88%) is limited to the first approximately 30 bp of each tandem repeat. Sequence analysis suggests vertical transmission. CTRs found within DINE-TR1 have independently expanded into satellite DNA-like arrays at least twice within Drosophila. By analyzing the genomes of Drosophila virilis and Drosophila americana, it was shown that DINE-TR1 is highly abundant in pericentromeric heterochromatin boundaries, some telomeric regions and in the Y chromosome. It is also present in the centromeric region of one autosome from D. virilis and dispersed throughout several euchromatic sites in both species. DINE-TR1 was found to be abundant at piRNA clusters, and small DINE-TR1-derived RNA transcripts (~25 nt) are predominantly expressed in the testes and the ovaries, suggesting active targeting by the piRNA machinery. These features suggest potential piRNA-mediated regulatory roles for DINEs at local and genome-wide scales in Drosophila (Dias, 2015).

    A negative loop within the nuclear pore complex controls global chromatin organization

    The nuclear pore complex (NPC) tethers chromatin to create an environment for gene regulation, but little is known about how this activity is regulated to avoid excessive tethering of the genome. Tethering specific genomic loci to the NPC appears to contribute to transcriptional activation. Also, the NPC has been further implicated in creating a repressive environment or retaining genes at the periphery after repression, possibly contributing to epigenetic transcriptional memory. This paper proposes a negative regulatory loop within the NPC controlling the chromatin attachment state, in which Nup155 and Nup93 recruit Nup62 to suppress chromatin tethering by Nup155. Depletion of Nup62 severely disrupts chromatin distribution in the nuclei of female germlines and somatic cells, which can be reversed by codepleting Nup155. See a model for the chromatin attachment state controlled by an internal regulatory circuit in the NPC. Thus, this universal regulatory system within the NPC is crucial to control large-scale chromatin organization in the nucleus (Breuer, 2015).

    Active chromatin and transcription play a key role in chromosome partitioning into topologically associating domains

    Recent advances enabled by the Hi-C technique have unraveled many principles of chromosomal folding that were subsequently linked to disease and gene regulation. In particular, Hi-C revealed that chromosomes of animals are organized into Topologically Associating Domains (TADs), evolutionarily conserved compact chromatin domains that influence gene expression. Mechanisms that underlie partitioning of the genome into TADs remain poorly understood. To explore principles of TAD folding in Drosophila melanogaster, Hi-C and PolyA+ RNA-seq were performed in four cell lines of various origins (S2, Kc167, DmBG3-c2, and OSC). Contrary to previous studies, this study found that regions between TADs (i.e. the inter-TADs and TAD boundaries) in Drosophila are only weakly enriched with the insulator protein dCTCF, while another insulator protein, Su(Hw), is preferentially present within TADs. However, Drosophila inter-TADs harbor active chromatin and constitutively transcribed (housekeeping) genes. Accordingly, it was found that binding of the insulator proteins dCTCF and Su(Hw) predicts TAD boundaries much worse than active chromatin marks do. Interestingly, inter-TADs correspond to decompacted interbands of polytene chromosomes, whereas TADs mostly correspond to densely packed bands. Collectively, these results suggest that TADs are condensed chromatin domains depleted in active chromatin marks, separated by regions of active chromatin. A mechanism of TAD self-assembly is proposed based on the ability of nucleosomes from inactive chromatin to aggregate, an ability that acetylated nucleosomal arrays lack. Finally, this hypothesis was tested by polymer simulations, which showed that TAD partitioning may be explained by different modes of inter-nucleosomal interactions for active and inactive chromatin (Ulianov, 2015).

    Recently developed 3C-based methods coupled with high-throughput sequencing have enabled genome-wide investigation of chromatin organization. Studies performed in human, mouse, Drosophila, yeasts, Arabidopsis and several other species have unraveled general principles of genome folding. Chromosomes in mammals and Drosophila are organized hierarchically. At the megabase scale, mammalian chromosomes are partitioned into active and inactive compartments. At the sub-megabase scale, these compartments are subdivided into a set of self-interacting domains called Topologically Associating Domains (TADs); TADs themselves are often hierarchical and are split into smaller domains. Similar to mammals, Drosophila chromosomes are partitioned into TADs that are interspaced with short boundaries or longer inter-TAD regions (inter-TADs) (Ulianov, 2015).

    Partitioning of mammalian genomes into TADs appears to be largely cell-lineage independent and evolutionarily conserved. Disruption of certain TAD boundaries leads to developmental defects in humans and mice. TADs correlate with units of replication timing regulation in mammals and colocalize with epigenetic domains (either active or repressed) in Drosophila. The internal structure of TADs was reported to change in response to environmental stress, during cell differentiation, and during embryonic development. In addition, comparative Hi-C analysis has demonstrated that genomic rearrangements between related mammalian species occur predominantly at TAD boundaries. Consequently, TADs appear to evolve primarily as constant and unsplit units. Previous studies in Drosophila embryonic nuclei and embryo-derived Kc167 cells detected TADs of various sizes roughly corresponding to epigenetic domains. Additionally, long-range genomic contacts and clustering of pericentromeric regions were revealed, and TAD boundaries were found to be enriched with active chromatin marks and insulator proteins. Both active and inactive TADs were identified, and their spatial segregation was observed (Ulianov, 2015).

    Despite extensive studies, mechanisms underlying TAD formation remain obscure. Architectural proteins, including cohesin and CTCF, are often found at TAD boundaries; thus, they have been proposed to play a key role in the demarcation of TADs. However, several studies suggest that other mechanisms may be responsible for partitioning and formation of TADs. Firstly, depletion of various insulator proteins did not affect the profile of chromosome partitioning into TADs, but rather decreased intra-TAD interactions. Secondly, CTCF may mediate loops that occur between the start and the end of the so-called 'loop domains'. However, domains of similar sizes but without a loop were observed as well (so-called 'ordinary domains'). Thirdly, polymer simulations of a permanent chromatin loop yield a noticeable interaction between the loop bases on a simulated Hi-C map, but without the characteristic square shape of a TAD. Loops of this kind are thought to occur between insulator proteins such as Su(Hw) in the 'topological insulation' model. Finally, chromosomal domains similar to TADs in the bacterium Caulobacter crescentus are demarcated by actively transcribed genes, and are not affected by the knockout of SMC, a homolog of cohesin subunits (Ulianov, 2015).

    This study presents evidence that questions the role of insulators in the organization of TAD boundaries in Drosophila. The results suggest that TADs are self-organized and potentially highly dynamic structures formed by numerous transient interactions between nucleosomes of inactive chromatin, while inter-TADs and TAD boundaries contain highly acetylated nucleosomes that are less prone to interactions. Finally, a polymer model of TAD formation was developed based on the two types of nucleosomes, and it was found that a polymer composed of active and inactive chromatin blocks forms TADs on a simulated Hi-C map (Ulianov, 2015).

    This study and others (Hou, 2012; Sexton, 2012) revealed that boundaries and inter-TADs in Drosophila, as opposed to TADs, are strongly enriched with active chromatin and its individual marks, as well as with active transcription and with constitutively transcribed housekeeping genes. Consequently, active chromatin marks, in the simplest case only total transcription and H3K4me3 (a mark of active promoters), can predict a TAD/inter-TAD profile relatively well. The existence of long inter-TADs composed of active chromatin is per se an argument for the ability of this type of chromatin to separate TADs. Furthermore, the current observations demonstrate that the presence of active chromatin and transcribed regions within a TAD undermines its integrity, making the TAD less compact and generating weak boundaries inside it. Consequently, a bona fide TAD is inactive; TADs containing active chromatin become less dense, acquire weak internal boundaries and eventually split into smaller TADs that are composed of inactive chromatin. The observation that the majority of housekeeping genes are located within inter-TADs and TAD boundaries suggests that evolutionary conservation and cell-type independence of TAD/inter-TAD profiles may be explained by conservation of positions of housekeeping genes along the chromosomes (Ulianov, 2015).

    It is noted that chromosomal interaction domains similar to TADs have been observed in the bacterium Caulobacter crescentus, where they are demarcated by sites of active transcription. Although the basic level of chromosomal folding is different in bacteria and eukaryotes, the model proposed by Le (2013) and the current model stem from common principles. In Caulobacter, active transcription is thought to disrupt the fiber of supercoils (plectonemes) by creating a stretch of non-packaged DNA, free of plectonemes, which spatially separates chromosomal regions flanking it. In the current model, transcription disrupts chromatin organization by introducing a 'non-sticky' region of chromatin, which is less compact and more unfolded in space, and thus spatially separates two flanking regions. Computer modeling shows that stickiness of non-acetylated (inactive) nucleosomes and the absence of stickiness for acetylated (active) nucleosomes are sufficient for chromatin partitioning into TADs and inter-TADs. Self-association of nucleosomes may be explained by the interaction of positively charged histone tails (in particular, the tail of histone H4) of one nucleosome with the acidic patch of histones H2A/H2B at an adjacent nucleosome. Acetylation of histone tails, which is typical of active chromatin, may interfere with inter-nucleosomal associations. In addition to a high level of histone acetylation, other features of active chromatin, including lower nucleosome density in inter-TADs, manifested as decreased histone H3 occupancy, might contribute to the generation of TAD profiles (Ulianov, 2015).

    It should be mentioned that a significant difference between the current polymer simulations and models previously suggested by the Cavalli and Vaillant groups (Jost, 2014) is the use of saturating interactions between inactive nucleosomes. In the case of volume interactions, all nucleosomes of the same type adjacent in 3D space will attract each other; in the case of saturating interactions, each molecule may attract only one neighbor. Using volume interactions leads to the formation of a single dense blob, and does not produce TADs in a simulated Hi-C map. It is noted that the saturating nature of interactions between nucleosomes is based on the known properties of nucleosomal particles. Previous studies considered a variety of mechanisms that may lead to the formation of TADs. In particular, Barbieri (2012) studied segregation of two TADs using cubic lattice simulations of a short 152-monomer chain consisting of two TADs, assuming that inter-monomer interactions could only form between monomers belonging to the same TAD. The current model shows that TADs emerge without requiring such specific interactions; any two regions of sticky monomers separated by a non-sticky linker would form TADs. Another study proposed that transcription-induced supercoiling may be responsible for the formation of TADs (Benedetti, 2014). Although this model is consistent with the current observation that sites of active transcription demarcate TAD boundaries, there is limited evidence that supercoiling of chromatinized DNA exists in Drosophila and other organisms. In contrast, the current model is based on known biochemical properties of nucleosomes (Ulianov, 2015).
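
    The difference between volume and saturating interactions can be illustrated by simply counting the bonds that each rule allows in a small cluster of sticky monomers. The sketch below is only a toy illustration and not the polymer simulation used in the study: the function names, the distance cutoff, the coordinates, and the greedy closest-pair-first matching are all assumptions made for this example.

```python
import math
from itertools import combinations


def close_pairs(positions, cutoff):
    """Return all pairs of monomer indices whose distance is below `cutoff`."""
    return [
        (i, j)
        for (i, p), (j, q) in combinations(enumerate(positions), 2)
        if math.dist(p, q) < cutoff
    ]


def volume_bonds(positions, cutoff):
    """Volume interactions: every pair of nearby monomers attracts."""
    return close_pairs(positions, cutoff)


def saturating_bonds(positions, cutoff):
    """Saturating interactions: each monomer may bind at most one partner.

    Partners are assigned greedily, closest pairs first (an assumption of
    this sketch, not a rule taken from the study).
    """
    pairs = sorted(
        close_pairs(positions, cutoff),
        key=lambda ij: math.dist(positions[ij[0]], positions[ij[1]]),
    )
    used, bonds = set(), []
    for i, j in pairs:
        if i not in used and j not in used:
            bonds.append((i, j))
            used.update((i, j))
    return bonds


# Four hypothetical sticky monomers packed close together in 3D.
cluster = [(0, 0, 0), (1, 0, 0), (0, 1, 0), (0, 0, 1)]
print("volume bonds:    ", len(volume_bonds(cluster, 1.5)))
print("saturating bonds:", len(saturating_bonds(cluster, 1.5)))
```

    With volume interactions every nearby pair contributes, so the number of attractive contacts grows roughly with the square of the cluster size, which favors collapse into one blob. With saturating interactions the count is capped at half the number of monomers.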

    The fact that a minor fraction of TADs is built mostly from active chromatin apparently contradicts the current model, suggesting that additional ways of chromatin self-organization could exist. One possibility is the establishment of long-range contacts between enhancers and their cognate promoters, as well as loops between pairs of insulators. Such loops formed inside active unstructured chromatin linkers (i.e., inter-TADs) could probably be sufficient to compact them and thus to fold into TADs (Ulianov, 2015).

    TAD profiles of X chromosomes are almost identical in the male and female cell lines, which is in agreement with recently published observations (Ramírez, 2015). Thus, it seems that hyperacetylation of male X chromosomes due to dosage compensation does not generate new TAD boundaries. However, it should be noted that the MOF histone acetyltransferase of the MSL complex introduces only the H4K16ac mark. Although this modification is important to prevent inter-nucleosomal interactions, acetylation at other histone positions and H2B ubiquitylation contribute as well. Additionally, H4K16 acetylation generated by the dosage compensation system occurs preferentially at regions enriched with transcribed genes and hence within inter-TADs (Ulianov, 2015).

    The current analysis does not support the previously reported (Hou, 2012; Sexton, 2012) strong enrichments of insulator proteins Su(Hw) and dCTCF at TAD boundaries in Drosophila. To assess the possible reasons for this discrepancy, the dCTCF distribution was re-analyzed with respect to TAD positions in the current dataset using the raw ChIP-seq data. No strong difference was observed in the dCTCF coverage in TADs and inter-TADs. Interestingly, this study obtained the same result while analyzing dCTCF and Su(Hw) binding within TAD boundaries identified by Hou (2012). However, a strong enrichment of dCTCF at TAD boundaries was observed when the peak distribution was analyzed instead of read coverage. Additionally, the effect was much weaker when modENCODE peaks were used. Hence, the discrepancy may be caused by different peak-calling procedures in modENCODE and in Hou (2012). The biological significance of these observations remains to be determined. It is noted that disruption of the cohesin/CTCF complex in mammals, as well as depletion of the Vtd (also known as Rad21) cohesin subunit in Drosophila, did not lead to disappearance of TAD boundaries, but rather only slightly decreased interactions inside TADs (in mammals) and reduced TAD boundary strength in the Drosophila genome. These observations favor a role for the cohesin/CTCF complex, which is known to form loops, in chromatin compaction inside the TADs (Ulianov, 2015).

    Binding of insulator proteins might contribute to establishing TAD boundaries through introducing active chromatin marks. Indeed, when inserted into an ectopic position, a classical insulator triggers hyperacetylation of the local chromatin domain and recruits chromatin-remodeling complexes. However, absence of strong enrichment of dCTCF at TAD boundaries and preferential location of Su(Hw) inside TADs mean that at least dCTCF- and Su(Hw)-dependent insulators are not the major determinants of TAD boundaries and inter-TADs (Ulianov, 2015).

    TADs are predicted based on the analysis of averaged data from a cell population. Although they are usually represented as large chromatin globules, direct experimental evidence for the existence of such globules in individual cells is controversial. Using confocal and 3D-SIM microscopy, ~1-Mb globular domains have been observed within chromosomal territories. However, using STORM microscopy, chromatin in individual mammalian cells has been found to be organized into 'clutches' composed of several nucleosomes, and increased histone acetylation has been found to dramatically reduce the size of these clutches. It is thus possible that sub-megabase TADs revealed by Hi-C represent a set of nucleosome clutches separated by relatively short spacers of various sizes. These short clutches may occupy various positions within TADs in different cells and stochastically assemble to form short-lived aggregates. The stochastic nature of TADs is supported by computer simulations (Ulianov, 2015).

    High-resolution in situ hybridization analysis on the chromosomal interval 61C7-61C8 of Drosophila melanogaster reveals interbands as open chromatin domains

    Eukaryotic chromatin is organized in contiguous domains that differ in protein binding, histone modifications, transcriptional activity, and in their degree of compaction. Genome-wide comparisons suggest that, overall, the chromatin organization is similar in different cells within an organism. This study compared the structure and activity of the 61C7-61C8 interval in polytene and diploid cells of Drosophila. By in situ hybridization on polytene chromosomes combined with high-resolution microscopy, the boundaries were mapped of the 61C7-8 interband and of the 61C7 and C8 band regions, respectively. The results demonstrate that the 61C7-8 interband is significantly larger than estimated previously. This interband extends over 20 kbp and is in the range of the flanking band domains. It contains several active genes and therefore can be considered as an open chromatin domain. Comparing the 61C7-8 structure of Drosophila S2 cells and polytene salivary gland cells by ChIP for chromatin protein binding and histone modifications, a highly consistent domain structure was observed for the proximal 13 kbp of the domain in both cell types. However, the distal 7 kbp of the open domain differs in protein binding and histone modification between both tissues. The domain contains four protein-coding genes in the proximal part and two noncoding transcripts in the distal part. The differential transcriptional activity of one of the noncoding transcripts correlates with the observed differences in the chromatin structure between both tissues (Zielke, 2015).

    Stable Chromosome Condensation Revealed by Chromosome Conformation Capture

    Chemical cross-linking and DNA sequencing have revealed regions of intra-chromosomal interaction, referred to as topologically associating domains (TADs), interspersed with regions of little or no interaction, in interphase nuclei. TADs and the regions between them were found to correspond with the bands and interbands of polytene chromosomes of Drosophila. Further, the conservation of TADs between polytene and diploid cells of Drosophila was established. From direct measurements on light micrographs of polytene chromosomes, the states of chromatin folding in the diploid cell nucleus were deduced. Two states of folding, fully extended fibers containing regulatory regions and promoters, and fibers condensed up to 10-fold containing coding regions of active genes, constitute the euchromatin of the nuclear interior. Chromatin fibers condensed up to 30-fold, containing coding regions of inactive genes, represent the heterochromatin of the nuclear periphery. A convergence of molecular analysis with direct observation thus reveals the architecture of interphase chromosomes (Eagen, 2015).

    Super-resolution imaging reveals distinct chromatin folding for different epigenetic states

    At the intermediate scale of genomic spatial organization of kilobases to megabases, which encompasses the sizes of genes, gene clusters and regulatory domains, the three-dimensional (3D) organization of DNA is implicated in multiple gene regulatory mechanisms. At this scale, the genome is partitioned into domains of different epigenetic states that are essential for regulating gene expression. This study investigated the 3D organization of chromatin in different epigenetic states using super-resolution imaging. Genomic domains were classified in Drosophila cells into transcriptionally active, inactive or Polycomb-repressed states, and distinct chromatin organizations were observed for each state. All three types of chromatin domains exhibit power-law scaling between their physical sizes in 3D and their domain lengths, but each type has a distinct scaling exponent. Polycomb-repressed domains show the densest packing and most intriguing chromatin folding behaviour, in which chromatin packing density increases with domain length. Distinct from the self-similar organization displayed by transcriptionally active and inactive chromatin, the Polycomb-repressed domains are characterized by a high degree of chromatin intermixing within the domain. Moreover, compared to inactive domains, Polycomb-repressed domains spatially exclude neighbouring active chromatin to a much stronger degree. Computational modelling and knockdown experiments suggest that reversible chromatin interactions mediated by Polycomb-group proteins play an important role in these unique packaging properties of the repressed chromatin. Taken together, these super-resolution images reveal distinct chromatin packaging for different epigenetic states at the kilobase-to-megabase scale, a length scale that is directly relevant to genome regulation (Boettiger, 2016).
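
    A power-law relation between physical size and domain length, size = a * length^b, becomes a straight line in log-log space, so the scaling exponent b can be estimated by ordinary least squares on the logarithms. The following minimal sketch shows one way to do this. The function name and the synthetic numbers are assumptions for illustration only and are not values from the study.

```python
import math


def fit_power_law(lengths, sizes):
    """Fit size = a * length**b by least squares in log-log space.

    Returns the prefactor a and the scaling exponent b.
    """
    if len(lengths) != len(sizes) or len(lengths) < 2:
        raise ValueError("need at least two paired measurements")
    xs = [math.log(v) for v in lengths]
    ys = [math.log(v) for v in sizes]
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    if sxx == 0:
        raise ValueError("domain lengths must not all be identical")
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    b = sxy / sxx                       # slope of the log-log line
    a = math.exp(mean_y - b * mean_x)   # intercept, back-transformed
    return a, b


# Synthetic, made-up data generated from size = 50 * length**0.3.
domain_lengths_kb = [10, 30, 100, 300, 1000]
domain_sizes_nm = [50 * length ** 0.3 for length in domain_lengths_kb]

prefactor, exponent = fit_power_law(domain_lengths_kb, domain_sizes_nm)
print(f"prefactor = {prefactor:.2f}, exponent = {exponent:.3f}")
```

    Fitting each class of domain separately (active, inactive, Polycomb-repressed) would give one exponent per epigenetic state, which is the kind of comparison described above.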

    Correspondence of Drosophila Polycomb Group proteins with broad H3K27me3 silent domains

    The Polycomb group (PcG) proteins are key conserved regulators of development, initially discovered in Drosophila and now strongly implicated in human disease. Nevertheless, differing silencing properties between the Drosophila and mammalian PcG systems have been observed. While specific DNA targeting sites for PcG proteins called Polycomb response elements (PREs) have been identified only in Drosophila, involvement of non-coding RNAs for PcG targeting has been favored in mammals. Another difference lies in the distribution patterns of PcG proteins. In mouse and human cells, PcG proteins show broad distributions, significantly overlapping with H3K27me3 domains. In contrast, only sharp peaks on PRE regions are observed for most PcG proteins in Drosophila, raising the question of how large domains of H3K27me3, up to many tens of kilobases, are formed and maintained in Drosophila. This study provides evidence that PcG distributions on silent chromatin in Drosophila are considerably broader than previously detected. Using BioTAP-XL, a chromatin crosslinking and tandem affinity purification approach, a broad, rather than PRE-limited overlap of PcG proteins with H3K27me3 was found, suggesting a conserved spreading mechanism for PcG in flies and mammals (Jung, 2016).

    Propagation of Polycomb-repressed chromatin requires sequence-specific recruitment to DNA

    Epigenetic inheritance models posit that during Polycomb repression, Polycomb Repressive Complex 2 (PRC2) propagates histone H3K27 tri-methylation (H3K27me3) independently of DNA sequence. This study shows that insertion of Polycomb Response Element (PRE) DNA into the Drosophila genome creates extended domains of H3K27me3-modified nucleosomes in the flanking chromatin and causes repression of a linked reporter gene. After excision of PRE DNA, H3K27me3 nucleosomes become diluted with each round of DNA replication and reporter gene repression is lost, whereas in replication-stalled cells, H3K27me3 levels stay high and repression persists. Hence, H3K27me3-marked nucleosomes provide a memory of repression that is transmitted in a sequence-independent manner to daughter strand DNA during replication. In contrast, propagation of H3K27 tri-methylation to newly incorporated nucleosomes requires sequence-specific targeting of PRC2 to PRE DNA (Laprell, 2017).

    The ability of certain histone-modifying enzymes to bind to the modification they generated has led to models where such enzymes might propagate modified chromatin domains by a positive feedback loop, independently of the underlying DNA sequence. Two paradigms of chromatin states have been proposed to be maintained by such an epigenetic inheritance mechanism: constitutive heterochromatin with histone H3 lysine 9 di- and tri-methylation (H3K9me2/3) generated by Suv39/Clr4 enzymes, and Polycomb-repressed chromatin marked with H3K27me3 by PRC2. In both chromatin states, these histone modifications are essential for repressing gene transcription. To date, there is compelling evidence that H3K9me2/3- and H3K27me3-modified nucleosomes are transmitted to daughter strand DNA during replication. However, the steps required to propagate these modifications are much less understood. Fission yeast Clr4 has the capacity to propagate ectopically induced H3K9me2/3 domains over many cell divisions by an H3K9me2/3-based positive feedback loop but only in cells mutated for H3K9me2/3 demethylase activity. In the case of PRC2, allosteric activation of the enzyme induced by binding to H3K27me3 has been proposed to be the foundation for propagating H3K27me3 chromatin. In mammalian cells, transient DNA-tethering of PRC2 generates short ectopic H3K27me3 domains that were at least partially maintained for several cell divisions after release of DNA-tethered PRC2. However, in Drosophila, where PRC2 and other Polycomb group (PcG) protein complexes are targeted to PREs, repression imposed by insertion of PRE DNA next to a reporter gene was lost upon excision of PRE DNA. This study investigated how insertion and excision of PRE DNA at ectopic sites in Drosophila affects binding of PcG proteins and H3K27me3 at the molecular level (Laprell, 2017).

    Two previously described strains were analyzed that each carried a single copy of the >PRE>dppWE-TZ reporter gene, integrated at different chromosomal locations. >PRE>dppWE-TZ contains a 1.6 kilobase (kb) DNA fragment of the bxd PRE from the HOX gene Ultrabithorax (Ubx), flanked by FRT recombination sites (>PRE>) to permit excision of PRE DNA by Flp-mediated recombination. Adjacent to the >PRE> cassette, the construct contains a reporter gene comprising the wing imaginal disc enhancer from decapentaplegic (dpp) (E), linked to the hsp70 TATA box minimal promoter (T) and LacZ sequences encoding β-galactosidase (Z). In the presence of the >PRE> cassette, the transgene was silenced and no β-galactosidase activity could be detected in wing imaginal discs of >PRE>dppWE-TZ transgenic animals. In contrast, >dppWE-TZ transgenic animals, generated by excision of the >PRE> cassette in the germ line, showed strong β-galactosidase expression in the characteristic pattern driven by the dpp enhancer. The observation that silencing of the intact >PRE>dppWE-TZ reporter gene is lost in mutants lacking PRC2 function prompted determination of the H3K27 methylation profile and binding of PcG proteins across the transgene. In both lines, the transgene had inserted into a genomic location normally devoid of H3K27me3 and PcG protein binding. Chromatin immunoprecipitation (ChIP) assays were performed on batches of wing imaginal discs from >PRE>dppWE-TZ and the corresponding >dppWE-TZ transgenic animals, and the immunoprecipitates were analyzed by quantitative real-time PCR (qPCR). For qPCR, primer pairs were used that selectively amplified transgene sequences and sequences in the genomic regions flanking the transgene insert. As controls, primer pairs were used amplifying sequences at the endogenous bx PRE in Ubx that are known to be bound by PcG proteins (C2) or enriched for H3K27me3 (C1 and C3) and at two regions elsewhere in the genome (C4 and C5) without PcG protein binding or H3K27me3 (Laprell, 2017).

    The PRC1 subunits Polycomb (Pc) and Polyhomeotic (Ph) and the PRC2 subunit E(z) were specifically enriched at the transgene PRE in animals carrying >PRE>dppWE-TZ and, as expected, no binding was detected in >dppWE-TZ animals. In both >PRE>dppWE-TZ transgenic lines, H3K27me3 was present at high levels across a domain that extended about 4-5 kb to either side of the >PRE> cassette, spanning almost the entire construct. No enrichment of H3K27me3 was detectable at the >dppWE-TZ transgene. At >PRE>dppWE-TZ, PRC2 thus tri-methylates H3K27 across a chromatin interval that spans about 8-10 kb (Laprell, 2017).

    To estimate to what extent nucleosomes at the >PRE>dppWE-TZ transgene are tri-methylated at H3K27, the H3K27me2 profile was determined. H3K27me2 levels across the >PRE>dppWE-TZ transgene were much lower than at C4 and C5 and comparable to the levels at Ubx (regions C1-C3), which is repressed and predominantly tri-methylated at H3K27 in wing imaginal discs. Conversely, across >dppWE-TZ, H3K27me2 levels were much higher and comparable to those seen at C4 and C5. This suggests that the nucleosomes across the >PRE>dppWE-TZ transgene are predominantly tri-methylated at H3K27 (Laprell, 2017).

    Excision of the >PRE> cassette from >PRE>dppWE-TZ transgenic animals by heat-shock induced expression of Flp during larval development results in appearance of β-galactosidase expression in the dpp pattern 12 hours after the heat shock. Efficiency of PRE excision was measured and it was found that 8 hours after a single 1-hour heat shock, excision had occurred in about 95% of wing imaginal disc cells. The delayed increase of β-galactosidase expression over time suggests a gradual rather than abrupt loss of repression. ChIP analyses were performed on chromatin prepared from batches of entire wing imaginal discs dissected from >PRE>dppWE-TZ transgenic animals 12, 32 or 56 hours after Flp-induction. This allowed monitoring the consequences of PRE excision in cells that had undergone at least one (+12 hours), at least two (+32 hours), or more than four (+56 hours) cell divisions. 12 hours after Flp-induction, H3K27me3 levels were at least two-fold reduced across the entire transgene and further reduced by at least two-fold at the 32 hours time point. 56 hours after Flp-induction, H3K27me3 levels across the transgene were nearly as low as in >dppWE-TZ animals derived from >dppWE-TZ germ cells. The histone H3 profile was unaltered at all time points, suggesting that PRE excision does not cause global disruption of nucleosome occupancy across the transgene. The loss of H3K27me3 after PRE excision suggests that PRC2 is unable to propagate H3K27me3 across the >dppWE-TZ transgene in the absence of PRE DNA (Laprell, 2017).
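
    The successive two-fold reductions are what simple dilution would predict if parental H3K27me3 nucleosomes are shared between the two daughter strands at each replication and nothing replenishes the mark. A minimal sketch of that arithmetic follows. The even 50/50 split and the absence of any new methylation or active demethylation are simplifying assumptions of this example, and the function name is hypothetical.

```python
def h3k27me3_fraction_remaining(divisions: int) -> float:
    """Fraction of the starting H3K27me3 level left after `divisions` rounds
    of replication, assuming an even split of parental nucleosomes between
    daughter strands and no re-methylation (assumptions of this sketch)."""
    if divisions < 0:
        raise ValueError("divisions must be non-negative")
    return 0.5 ** divisions


# Minimum number of cell divisions reported for each time point after
# Flp induction (hours -> divisions).
min_divisions = {12: 1, 32: 2, 56: 4}

for hours, n in min_divisions.items():
    upper_bound = h3k27me3_fraction_remaining(n)
    print(f"+{hours} h: at most {upper_bound:.4f} of the starting level")
```

    Under these assumptions the level falls to at most one half after one division, one quarter after two, and about 6% after four, in line with the at least two-fold reductions at 12 and 32 hours and the near-background signal at 56 hours.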

    In parallel, Pc protein binding was monitored after PRE excision. Pc, unlike Ph or other PRC1 subunits, is not only bound at PREs but also associates with the chromatin flanking PREs likely reflecting its interaction with H3K27me3-modified nucleosomes. 12 hours after PRE excision, Pc binding at the transgene was already almost reduced to background levels (Laprell, 2017).

    The H3K27me3 profile at the >PRE>dppWE-UZ transgene that contains a 4.1 kb fragment of the Ubx promoter instead of the hsp70 minimal promoter was then analyzed. At >PRE>dppWE-UZ, the H3K27me3 domain spans about 12 kb and is thus about 4 kb longer than at >PRE>dppWE-TZ. Nevertheless, after PRE excision, H3K27me3 at dppWE-UZ was lost at a rate comparable to that seen at dppWE-TZ. Ubx promoter DNA thus does not enable H3K27me3 propagation. It is concluded that even at a domain that spans 12 kb and therefore comprises about 60 nucleosomes, PRC2 is unable to propagate H3K27me3 in the absence of PRE DNA (Laprell, 2017).
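
    The estimate of about 60 nucleosomes for a 12 kb domain corresponds to a nucleosome repeat length of roughly 200 bp (core particle plus linker). The short sketch below makes that conversion explicit. The 200 bp default is an assumption inferred from the numbers quoted above, not a measured value, and the function name is hypothetical.

```python
def nucleosomes_in_domain(domain_bp: int, repeat_length_bp: int = 200) -> int:
    """Approximate number of nucleosomes covering a domain, assuming a
    uniform repeat length (200 bp by default, an assumed round figure)."""
    if domain_bp < 0 or repeat_length_bp <= 0:
        raise ValueError("domain_bp must be >= 0 and repeat_length_bp > 0")
    return domain_bp // repeat_length_bp


# Approximate H3K27me3 domain sizes quoted for the two transgenes.
print(nucleosomes_in_domain(12_000))  # >PRE>dppWE-UZ domain, about 12 kb
print(nucleosomes_in_domain(8_000))   # lower estimate for >PRE>dppWE-TZ
```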

    The H3K27me3 profile and reporter gene repression were then analyzed after PRE excision in animals in which DNA replication had been blocked. Larvae were reared in liquid medium containing Aphidicolin, an inhibitor of DNA polymerases α and δ, which resulted in a complete block of DNA replication in imaginal discs. In larvae reared in Aphidicolin-containing medium, Flp-induced PRE excision from >PRE>dppWE-TZ was as efficient as under normal growth conditions, but 12 hours after excision, H3K27me3 levels at the transgene were undiminished compared to +PRE control larvae. In larvae reared in liquid medium without Aphidicolin, PRE excision resulted in the expected two-fold reduction of H3K27me3 levels after 12 hours. Together, this suggests that the loss of H3K27me3 nucleosomes after PRE excision in proliferating cells reflects their dilution as they become transmitted to DNA daughter strands during replication. Unlike under normal growth conditions, Aphidicolin-treated larvae lacked detectable β-galactosidase expression 12 hours after PRE excision. When these animals were permitted to recover in medium lacking Aphidicolin, they resumed DNA replication and began expressing β-galactosidase. If DNA replication is blocked and H3K27me3 levels stay high, repression is thus also sustained in the absence of PRE DNA, possibly by PRC1 (Laprell, 2017).

    Finally, PRE excision was induced from >PRE>dppWE-TZ in larvae that were hemizygous for UtxΔ, a null mutation in the single H3K27me3 demethylase in Drosophila. 12 hours after Flp-induction, H3K27me3 levels at the transgene were reduced about two-fold, as in wild-type animals. This suggests that demethylation of H3K27me3 by Utx does not contribute to the disappearance of H3K27me3 from transgene chromatin after PRE excision (Laprell, 2017).

    These results lead to the following conclusions. First, PRE cis-regulatory DNA provides the genetic basis not only for generating but also for propagating H3K27me3-modified chromatin. This argues against a simple epigenetic model where PRC2 binding to parental H3K27me3 nucleosomes after replication would suffice to propagate H3K27 tri-methylation in daughter strand chromatin. PRC2 needs to be recruited to PRE DNA first, before allosteric activation through interaction with H3K27me3 nucleosomes in flanking chromatin may then facilitate methylation of newly incorporated nucleosomes. Secondly, following PRE excision and replication, parental H3K27me3 nucleosomes remain associated with the same underlying DNA in daughter cells and thus provide epigenetic memory. However, while in replication-stalled cells high H3K27me3 levels allow repression to be sustained even in the absence of PRE DNA, their dilution in proliferating cells is accompanied by loss of repression after one cell division. H3K27me3 nucleosomes therefore only appear to provide short-term epigenetic memory of the repressed state. Hence, DNA targeting of PRC2 after replication to replenish H3K27me3 is critical to preserve repression (Laprell, 2017).

    Drosophila HOX and other large-size PcG target genes often contain multiple PREs and H3K27me3 domains that span dozens of kilobases. Deletion of single PREs from these genes typically results in only minor diminution of the H3K27me3 profile and misexpression is less severe than misexpression of the native genes in PcG mutants. Furthermore, when the same >PRE> cassette that was used in this study was excised from a Ubx-LacZ reporter gene with more extended Ubx upstream regulatory sequences, repression was lost with a longer delay, suggesting that additional elements with PRE properties in those Ubx sequences allowed repression to be sustained through more cell divisions. The evolution of PRE DNA sequences and of their frequency and arrangement within target genes may thus ultimately determine stability and heritability of H3K27me3 chromatin and PcG repression (Laprell, 2017).

    Drosophila O-GlcNAcase deletion globally perturbs chromatin O-GlcNAcylation

    O-GlcNAc Transferase (OGT/SXC) is essential for Polycomb repression, suggesting that the O-GlcNAcylation of proteins plays a key role in regulating development. OGT transfers O-GlcNAc onto serine and threonine residues in intrinsically disordered domains of key transcriptional regulators; O-GlcNAcase (OGA) removes the modification. To pinpoint genomic regions that are regulated by O-GlcNAc levels, ChIP-chip and microarray analyses were performed after OGT or OGA RNAi knockdown in S2 cells. After OGA RNAi, a genome-wide increase was observed in the intensity of most O-GlcNAc-occupied regions. In contrast, O-GlcNAc levels were strikingly insensitive to OGA RNAi at sites of Polycomb repression. Microarray analysis suggested that altered O-GlcNAc cycling perturbed the expression of genes associated with morphogenesis and cell cycle regulation. A viable null allele of oga (ogadel.1) was produced in Drosophila, allowing visualization of altered O-GlcNAc cycling on polytene chromosomes. Trithorax (Trx), Absent small or homeotic discs 1 (Ash1) and Compass member Set1 histone methyl-transferases were O-GlcNAc-modified in ogadel.1 mutants. The ogadel.1 mutants displayed altered expression of a distinct set of cell cycle-related genes. These results show that the loss of Oga in Drosophila globally impacts the epigenetic machinery, allowing O-GlcNAc accumulation on RNA Polymerase II and numerous chromatin factors including Trx, Ash1 and Set1 (Akin, 2016).

    Genome-wide activities of Polycomb complexes control pervasive transcription

    Polycomb group (PcG) complexes PRC1 and PRC2 are well known for silencing specific developmental genes. PRC2 is a methyltransferase targeting histone H3K27 and producing H3K27me3, essential for stable silencing. Less well known but quantitatively much more important is the genome-wide role of PRC2 that dimethylates approximately 70% of total H3K27. H3K27me2 occurs in inverse proportion to transcriptional activity in most non-PcG target genes and intergenic regions and is governed by opposing roaming activities of PRC2 and complexes containing the H3K27 demethylase UTX. Surprisingly, loss of H3K27me2 results in global transcriptional derepression proportionally greatest in silent or weakly transcribed intergenic and genic regions and accompanied by an increase of H3K27ac and H3K4me1. H3K27me2 therefore sets a threshold that prevents random, unscheduled transcription all over the genome and even limits the activity of highly transcribed genes. PRC1-type complexes also have global roles. Unexpectedly, a pervasive distribution of histone H2A ubiquitylated at lysine 118 (H2AK118ub) was found outside of canonical PcG target regions, dependent on the RING/Sce subunit of PRC1-type complexes. It was shown, however, that H2AK118ub does not mediate the global PRC2 activity or the global repression and is predominantly produced by a new complex involving L(3)73Ah, a homolog of mammalian PCGF3 (Lee, 2016).

    Chromatin proteins and RNA are associated with DNA during all phases of mitosis

    Mitosis brings about major changes to chromosome and nuclear structure. This study used recently developed proximity ligation assay-based techniques to investigate the association with DNA of chromatin-associated proteins and RNAs in Drosophila embryos during mitosis. All groups of tested proteins, histone-modifying and chromatin-remodeling proteins and methylated histones remained in close proximity to DNA during all phases of mitosis. RNA transcripts were found to be associated with DNA during all stages of mitosis. Reduction of H3K27me3 levels or elimination of RNAs had no effect on the association of the components of PcG and TrxG complexes to DNA. Using a combination of proximity ligation assay-based techniques and super-resolution microscopy, the number of protein-DNA and RNA-DNA foci was found to undergo significant reduction during mitosis, suggesting that mitosis may be accompanied by structural re-arrangement or compaction of specific chromatin domains (Black, 2016).

    Regulatory functions and chromatin loading dynamics of linker histone H1 during endoreplication in Drosophila

    Eukaryotic DNA replicates asynchronously, with discrete genomic loci replicating during different stages of S phase. Drosophila larval tissues undergo endoreplication without cell division, and the latest replicating regions occasionally fail to complete endoreplication, resulting in underreplicated domains of polytene chromosomes. This study shows that linker histone H1 is required for the underreplication (UR) phenomenon in Drosophila salivary glands. H1 directly interacts with the Suppressor of UR (SUUR) protein and is required for SUUR binding to chromatin in vivo. These observations implicate H1 as a critical factor in the formation of underreplicated regions and an upstream effector of SUUR. It was also demonstrated that the localization of H1 in chromatin changes profoundly during the endocycle. At the onset of endocycle S (endo-S) phase, H1 is heavily and specifically loaded into late replicating genomic regions and is then redistributed during the course of endoreplication. The data suggest that cell cycle-dependent chromosome occupancy of H1 is governed by several independent processes. In addition to the ubiquitous replication-related disassembly and reassembly of chromatin, H1 is deposited into chromatin through a novel pathway that is replication-independent, rapid, and locus-specific. This cell cycle-directed dynamic localization of H1 in chromatin may play an important role in the regulation of DNA replication timing (Andreyeva, 2017).

    This study demonstrated that virtually all major sites of UR throughout the Drosophila genome exhibit a substantial increase in salivary gland DNA copy number upon depletion of the linker histone H1, thus implicating H1 in the regulation of endoreplication. In control knockdown salivary glands, 46 underreplicated domains were identified. While these regions are in general agreement with previous efforts to map underreplicated domains by less sensitive microarray analyses, fewer underreplicated sites were identified than in a recent report that used high-throughput sequencing of salivary gland DNA (Yarosh, 2014). Notably, the underreplicated domains that the current analyses failed to detect represent sites with the weakest degree of UR. One possible source of variation is the distinct technical approach that was used compared with Yarosh (2014), as simultaneous sequencing of a nonpolytenized (embryonic) genome as a means to normalize the reads from underrepresented sequences in polytenized tissues (Yarosh, 2014) likely provides additional sensitivity. Another potential explanation could lie in the relative sequencing depth of the respective assays (approximately fourfold lower in the current study), considered crucial for the analyses of next-generation sequencing data. However, this explanation is less likely, as subsampling of the current reads to much lower depths yielded no appreciable difference in the number and location of identified underreplicated sites or the change in copy number upon H1 knockdown (Andreyeva, 2017).

    On average, a moderate knockdown of H1 led to an ~50% copy number gain at the center of underreplicated domains in intercalary heterochromatin (IH; large dense bands scattered in euchromatin comprising clusters of repressed genes). The copy number is not restored to the same degree as that in a SuUR genetic mutant. The difference is likely attributable to the incomplete depletion of H1. In fact, in an independent biological validation experiment that resulted in an ~95% depletion of H1, an almost complete restoration of copy number was observed. The observation of an almost complete reversal of UR in cells depleted of H1 (but still wild type for SuUR) strongly suggests an epistatic mechanism of action in which both H1 and SUUR act together in the same biochemical pathway (Andreyeva, 2017).

    This study found that H1 and SUUR are also involved in UR of pericentric heterochromatin (PH). For instance, both the mapped pericentric regions and transposable element (TE) sequences, which are highly abundant in pericentric regions, exhibit an increase of DNA copy number upon H1 knockdown. The SuURES mutation also results in a robust loss of UR at PH, as measured by changes in DNA copy number at TEs. The abrogation of H1 expression gives rise to a somewhat weaker effect on the UR of PH than that of IH, which is consistent with an almost complete elimination of SUUR protein from polytene chromosome arms in salivary glands depleted of H1 by RNAi but the persistence of residual SUUR at their PH. The role of H1 in maintaining the underreplicated state of PH may be relevant to its important regulatory functions in constitutive heterochromatin, where it recruits Su(var)3-9, facilitates H3K9 methylation, and maintains TEs in a transcriptionally repressed state. Recently, it was proposed that TE repression in ovarian somatic cells involves an H3K9 methylation-independent process through recruitment of H1 by Piwi-piRNA complexes, resulting in reduced chromatin accessibility. The current results also implicate UR of TE sequences in polytenized cells as yet another putative mechanism that contributes to regulation of their expression. Interestingly, it was shown previously that double mutants encompassing both the Su(var)3-9 and SuUR mutant alleles exhibit a synthetically increased predominance of novel band-interband structures at PH compared with the mutation of SuUR alone. While the evidence suggests a relationship between UR and transcriptionally repressive epigenetic states, such as H3K9 methylation, the nature of this relationship remains largely speculative (Andreyeva, 2017).

    This study demonstrated that SUUR protein physically interacts with H1 in both a complex mixture of whole-cell extracts that contain endogenous native H1 and recombinant purified H1 polypeptides. Furthermore, the structural domains of the two proteins that are required for the interaction were delimited. SUUR protein contains several sequence features that have been implicated in regulation of UR and binding to specific proteins. Although SUUR possesses a putative bromodomain, it contains no identifiable DNA-binding domain, so the mechanism that allows SUUR to exhibit a preference for specific genomic underreplicated loci is unknown. The positively charged central region is both necessary and sufficient to interact with heterochromatin protein 1a (HP1a), which suggests a possible involvement of HP1a in tethering SUUR to H3K9me2/3-rich PH. However, the specific localization of SUUR to underreplicated IH, which is not enriched for H3K9me2/3, remains enigmatic. This study now demonstrates that the central region of SUUR is also sufficient for binding directly to H1 in vitro. Considering that the central region of SUUR is essential for the faithful localization of the protein to chromatin in vivo, including underreplicated IH, it seems likely that H1 directly mediates the tethering of SUUR to chromatin in underreplicated regions (Andreyeva, 2017).

    The tripartite structure of H1 provides multiple binding interfaces for interacting proteins and thus allows H1 to mediate several biochemically separable functions in vivo. For instance, the globular domain and proximal 25% of the CTD are required for H1 loading into chromatin, while the proximal 75% of the CTD is needed for normal polytene morphology, H3K9 methylation, and physical interactions with Su(var)3-9. This study discovered a previously unknown function for the distal 25% of the H1 CTD, which is shown to be essential for binding to SUUR. Deletion of this region of H1 results in a near-complete loss of the interaction with SUUR. Thus, in addition to its critical functions in heterochromatin structure and activity, the CTD of H1 is likely also important in facilitating UR (Andreyeva, 2017).

    One of the most striking findings in this study is the observation that the genomic occupancy of H1 undergoes profound changes during the endoreplication cycle. It also remains largely mutually exclusive with that of DNA polymerase clamp loader PCNA, which is consistent with the observed depletion of H1 in nascent chromatin compared with mature chromatin (Andreyeva, 2017).

    H1 is heavily loaded into late replicating loci at the onset of replication (when these loci are silent for replication). Combined, the current observations indicate that the chromosome distribution of H1 during the endocycle is governed by at least three independent processes. Two of them [replication-dependent (RD) eviction of H1 and RD deposition of H1 after the passage of replication fork] are related to the well-recognized obligatory processes of chromatin disassembly and reassembly during replication. The third pathway, which directs early deposition of H1 into late replicating loci, has not been described previously. This process is (1) replication-independent (RI); (2) locus-specific, with a strong preference for late replicating sites; and (3) apparently more rapid than the RD deposition of H1, since very high levels of H1 occupancy are observed in all nuclei immediately after the initiation of endo-S. It is possible that the RI pathway of H1 loading into chromatin is mediated by a selective recruitment of H1 based on epigenetic core histone modification-dependent mechanisms. For instance, mammalian H1.2 was reported to recognize H3K27me3, and this modification is very abundant in IH (Sher et al. 2012) (Andreyeva, 2017).

    Also, the RI mechanism for deposition of H1 probably does not involve de novo nucleosome assembly, as H1 is known to exhibit a mutually exclusive distribution with RI core histone variants, and there is no known nuclear process during early S phase that requires core histone turnover. In the future, it will be interesting to further confirm that RI nucleosome assembly does not take place during early replication in salivary gland polytene chromosomes. Finally, the locus-specific RI deposition of H1 in early endo-S chromatin may be conserved in the normal S phase of diploid tissues, and it will require independent experimentation with sorted mitotically dividing cells to confirm this possibility (Andreyeva, 2017).

    This study also provides cytological evidence that the functions of H1 and SUUR are biochemically linked. Specifically, it was demonstrated that SUUR localizes to a subset of H1-positive bands and requires H1 for its precise distribution in polytene chromosomes, nuclear localization, and stability in salivary gland cells. These observations implicate H1 as an upstream effector of SUUR functions in vivo and an essential component of the biological pathway that maintains loci of reduced ploidy in polytenized cells. Importantly, this finding adds to a growing list of biochemical partners of H1 that mediate their chromatin-directed functions in an H1-dependent fashion (Andreyeva, 2017).

    Interestingly, even a moderate depletion of H1 (to ~30% of normal) results in a complete removal of SUUR from chromosome arms. Thus, H1-dependent localization of SUUR requires high concentrations of the linker histone in chromatin. This conclusion is also consistent with SUUR colocalization with polytene loci that are the most strongly stained for H1. In contrast, elimination of the H3K9me2 mark from polytene spreads requires very extensive depletion of H1, whereas the moderate depletion of H1 does not strongly affect H3K9 dimethylation in the chromocenter or polytene arms. Therefore, the robust effect of even moderate H1 depletion on SUUR localization in chromatin is unlikely to be mediated indirectly through disorganization of heterochromatin structure (Andreyeva, 2017).

    Unexpectedly, the cell cycle-dependent temporal pattern of H1 localization is not identical to that of SUUR. In contrast to H1, SUUR protein (1) is only weakly present in IH during early endo-S phase, (2) achieves the maximal occupancy at IH loci only in the late endo-S, and (3) colocalizes with PCNA at certain sites. The observations made in this study and in previous works can be summarized in the following model for H1-mediated regulation of SUUR association with chromatin. The initiation of the deposition of SUUR in chromosomes is strongly dependent on H1. More specifically, SUUR is preferentially localized to chromatin domains that are highly enriched for H1. For instance, the tremendously elevated concentration of H1 in IH of early endo-S cells promotes and nucleates the initiation of deposition of SUUR into these regions. However, the pattern of SUUR occupancy at these sites does not occur temporally in parallel with that of H1. Initially, the exceptionally high abundance of H1 in late replicating loci during early endo-S is not paralleled by a simultaneous comparable increase of SUUR occupancy. Rather, loading of SUUR into these sites lags significantly behind H1 occupancy. Thus, the rate of SUUR localization to H1-rich IH appears to be much slower than that of the RI deposition of H1 into these loci. After the initial recruitment, further loading of SUUR does not require H1, and SUUR continues (in a slower fashion) to accumulate at IH throughout the endo-S phase even when H1-enriched domains dissipate in the course of DNA endoreplication. The additional loading of SUUR in chromatin is likely facilitated by its self-association through dimerization of the N terminus and physical interactions with the replication fork, as proposed previously. In this fashion, SUUR achieves its maximal concentration in IH loci by the late endo-S (Andreyeva, 2017).

    This study has demonstrated that H1 has a pivotal function in the establishment of UR of specific IH loci in polytenized salivary gland cells. The findings that H1 interacts directly with SUUR in vitro and is required for SUUR localization to late replicating IH in polytene chromosomes in vivo strongly suggest that the H1-mediated recruitment of SUUR promotes UR by obstructing replication fork progression in its cognate underreplicated loci but does not affect replication origin firing. However, the remarkable temporal pattern of H1 distribution in endoreplicating polytene chromosomes suggests that it may also play a direct SUUR-independent role in regulation of endoreplication. This is especially plausible considering that the temporal distribution patterns of SUUR and H1 are dissimilar (Andreyeva, 2017).

    In contrast to the role of SUUR in slowing down the replication fork progression during late endo-S phase, H1 (acting in the absence of SUUR during early endo-S) may function to repress the initiation of endoreplication, as proposed in several studies. DNA-seq analyses also suggest this mechanism. Compared with the relatively smooth, flat profiles of DNA copy numbers in SuURES mutant salivary glands, the profiles in H1-depleted cells exhibit a jagged, uneven appearance, indicative of aberrant local initiation of replication. Unfortunately, the experimental system (cytological analyses of salivary glands) cannot be used to further confirm this idea. First, an extensive depletion of H1 results in the loss of polytene morphology; second, since the staging of endo-S progression is based on PCNA staining, a spurious activation of ectopic replication origins would result in an incorrect calling of the stage. To further complicate these analyses, polytenized cells are not amenable to other methods of cell cycle staging, such as fluorescence-activated cell sorting (FACS). In the future, it will be important to examine the role of H1 in regulation of DNA replication timing in sorted Drosophila diploid cells (Andreyeva, 2017).

    Stable Polycomb-dependent transgenerational inheritance of chromatin states in Drosophila

    Transgenerational epigenetic inheritance (TEI) describes the transmission of alternative functional states through multiple generations in the presence of the same genomic DNA sequence. Very little is known about the principles and the molecular mechanisms governing this type of inheritance. In this study, by transiently enhancing 3D chromatin interactions, stable and isogenic Drosophila epilines were established that carry alternative epialleles, as defined by differential levels of Polycomb-dependent trimethylation of histone H3 Lys27 (forming H3K27me3). After being established, epialleles can be dominantly transmitted to naive flies and can induce paramutation. Importantly, epilines can be reset to a naive state by disruption of chromatin interactions. Finally, it was found that environmental changes modulate the expressivity of the epialleles, and this paradigm was extended to naturally occurring phenotypes. This work sheds light on how nuclear organization and Polycomb group (PcG) proteins contribute to epigenetically inheritable phenotypic variability (Ciabrelli, 2017).

    Dual functionality of cis-regulatory elements as developmental enhancers and Polycomb response elements

    Developmental gene expression is tightly regulated through enhancer elements, which initiate dynamic spatio-temporal expression, and Polycomb response elements (PREs), which maintain stable gene silencing. These two cis-regulatory functions are thought to operate through distinct dedicated elements. By examining the occupancy of the Drosophila pleiohomeotic repressive complex (PhoRC) during embryogenesis, extensive co-occupancy was revealed at developmental enhancers. Using an established in vivo assay for PRE activity, it was demonstrated that a subset of characterized developmental enhancers can function as PREs, silencing transcription in a Polycomb-dependent manner. Conversely, some classic Drosophila PREs can function as developmental enhancers in vivo, activating spatio-temporal expression. This study therefore uncovers elements with dual function: activating transcription in some cells (enhancers) while stably maintaining transcriptional silencing in others (PREs). Given that enhancers initiate spatio-temporal gene expression, reuse of the same elements by the Polycomb group (PcG) system may help fine-tune gene expression and ensure the timely maintenance of cell identities (Erceg, 2017).

    While enhancers initiate spatio-temporal transcriptional activity, PREs maintain a previously determined transcriptional state of their target genes, thus leading to transcriptional memory. PREs are generally thought to be dedicated solely to gene silencing and not to contain enhancer-like features to activate gene expression. This study presents evidence to the contrary, that both functions can be encoded in the same cis-regulatory element, depending on the cellular context. This is not a rare event -- almost 25% of PhoRC occupancy is at developmental enhancers. Of the 16 elements that this study tested experimentally (either enhancers for PRE activity or PREs for enhancer activity), nine have dual function, being sufficient to activate transcription in a specific spatio-temporal pattern and mediate PcG-dependent silencing in vivo (Erceg, 2017).

    These dual elements have interesting implications for transcriptional regulation during embryonic development. First, at the level of PcG protein recruitment, this subset of enhancers is highly enriched in the Pho motif, which distinguishes them from other developmental enhancers. This suggests that the recruitment of Pho to PhoRC enhancers is direct via sequence-specific DNA binding, consistent with an instructive model of recruitment, although other factors are likely involved. PcG proteins and developmental TFs bind in close proximity to each other within the same element (a single DNase hypersensitive site), raising the possibility of direct interplay between the two. The results indicate that the activity of PhoRC-bound enhancers is dominated by tissue-specific TFs that activate transcription in some cells while being dominated by a functional PcG complex in other cells. Is this due to mutually exclusive occupancy of developmental TFs and PcG proteins in different tissues, or do they compete functionally at these elements? The dramatic derepression of enhancer activity in different cell types upon PcG protein removal suggests that other tissue-specific TFs must occupy these enhancers in the PcG silenced cell. This has interesting implications for enhancer activity, as it is well known that TFs bind to thousands of sites (tens of thousands in mammalian cells), but only a subset of associated target genes changes expression when the TF is removed. This has led to the general assumption that the majority of binding events is nonfunctional or neutral. These data suggest that at least a subset of this embryonic occupancy can be functional if not actively antagonized by the presence of PcGs (Erceg, 2017).

    Second, enhancer-mediated Polycomb recruitment has interesting implications for the mechanism of PcG-mediated silencing. The current models suggest that PcG proteins silence transcription mainly by silencing a gene's promoter, in keeping with PcG recruitment to CpG islands in vertebrates, or by coordinating a three-dimensional repressive topology, where the entire gene's locus is silenced. In either mode, a gene's promoter would not be permissive to enhancer activation. The data suggest that there may be a third mode of very local silencing at an individual enhancer, leaving the promoter and the rest of the gene's regulatory landscape open for activation by other enhancers, as was observed at the prat2 locus. This would allow for much more fine-tuning of silencing in individual tissues and stages. It also suggests that PcG proteins could play a more dynamic role, similar to a 'standard' transcriptional repressor at enhancers (Erceg, 2017).

    Third, this may have broader implications for cell fate decisions during rapid developmental transitions. When multipotent cells become specified into different lineages, a specific transcriptional program often needs to be activated in one cell while being repressed in other cells from the same progenitor population. Having active enhancers in the precursor cells remain accessible to directly recruit the PcG complexes would ensure that these enhancers become silenced in a timely manner. Conversely, having maternally deposited PcG proteins already bound to enhancers early in development may serve as placeholders to ensure that these dual elements remain open and available for TFs to activate at the appropriate developmental stage. Interestingly, in the majority of the tested cases, PcG proteins and developmental TFs use these dual elements to regulate the same target gene, the vast majority of which are key developmental regulators of cell identity (Erceg, 2017).

    The identification of PREs in other species has remained a key challenge, with only a handful of PREs identified in mammals and plants to date. In mammals, the PcG system is recruited to inactive CpG islands, with few specific sequence features. Although there are mammalian homologs of the Drosophila Pho and dSfmbt proteins, Yin Yang 1 (YY1) and SFMBT, respectively, the conservation of PhoRC as a complex and its involvement in mammalian PcG silencing remain unclear. It is proposed that such dual enhancers/PREs will also exist in mammals, although, given this apparent lack of conservation of YY1 function, their mechanism of PcG recruitment may have diverged (Erceg, 2017).

    A comparison of nucleosome organization in Drosophila cell lines

    Changes in the distribution of nucleosomes along the genome influence chromatin structure and impact gene expression by modulating the accessibility of DNA to transcriptional machinery. This study compared genome-wide nucleosome positioning and occupancy in five different Drosophila tissue-specific cell lines, and in reconstituted chromatin, and then tests were performed for correlations between nucleosome positioning, transcription factor binding motifs, and gene expression. Nucleosomes in all cell lines were positioned in accordance with previously known DNA-nucleosome interactions, with helically repeating A/T di-nucleotide pairs arranged within nucleosomal DNAs and AT-rich pentamers generally excluded from nucleosomal DNA. Nucleosome organization in all cell lines differed markedly from in vitro reconstituted chromatin, with highly expressed genes showing strong nucleosome organization around transcriptional start sites. Importantly, comparative analysis identified genomic regions that exhibited cell line-specific nucleosome enrichment or depletion. Further analysis of these regions identified 91 out of 16,384 possible heptamer sequences that showed differential nucleosomal occupation between cell lines, and 49 of the heptamers matched one or more known transcription factor binding sites. These results demonstrate that there is differential nucleosome positioning between these Drosophila cell lines and therefore identify a system that could be used to investigate the functional significance of differential nucleosomal positioning in cell type specification (Martin, 2017).
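
    The figure of 16,384 possible heptamers quoted above is simply the number of distinct 7-base DNA sequences (4 to the power 7). The following minimal Python sketch is not part of the study's own analysis; it only enumerates that search space and expresses the 91 reported differential heptamers as a fraction of it. All variable names are illustrative assumptions.

```python
from itertools import product

# Enumerate every possible 7-mer over the four-letter DNA alphabet.
heptamers = ["".join(p) for p in product("ACGT", repeat=7)]
total = len(heptamers)  # equals 4**7

# Count taken from the abstract: heptamers with differential
# nucleosomal occupation between cell lines.
differential = 91
fraction = differential / total

print(f"{total} possible heptamers; {fraction:.2%} show differential occupancy")
```

    Seen this way, the differentially occupied heptamers make up well under 1% of the possible sequence space.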

    Convergence of topological domain boundaries, insulators, and polytene interbands revealed by high-resolution mapping of chromatin contacts in the early Drosophila melanogaster embryo

    High-throughput assays of three-dimensional interactions of chromosomes have shed considerable light on the structure of animal chromatin. Despite this progress, the precise physical nature of observed structures and the forces that govern their establishment remain poorly understood. This study presents high resolution Hi-C data from early Drosophila embryos. Boundaries between topological domains of various sizes were shown to map to DNA elements that resemble classical insulator elements: short genomic regions sensitive to DNase digestion that are strongly bound by known insulator proteins and are frequently located between divergent promoters. Further, a striking correspondence was shown between these elements and the locations of mapped polytene interband regions. It is likely this relationship between insulators, topological boundaries, and polytene interbands extends across the genome, and a model is proposed in which decompaction of boundary-insulator-interband regions drives the organization of interphase chromosomes by creating stable physical separation between adjacent domains (Stadler, 2017).


    Ahmad, K. and Henikoff, S. (2002). Histone H3 variants specify modes of chromatin assembly. Proc. Natl. Acad. Sci. 99 Suppl 4: 16477-84. 1217744

    Akan, I., Love, D. C., Harwood, K., Bond, M. R. and Hanover, J. A. (2016). Drosophila O-GlcNAcase deletion globally perturbs chromatin O-GlcNAcylation. J Biol Chem [Epub ahead of print]. PubMed ID: 26957542

    Alexiadis, A., Delidakis, C. and Kalantidis, K. (2017). Snipper, an Eri1 homologue, affects histone mRNA abundance and is crucial for normal Drosophila melanogaster development. FEBS Lett [Epub ahead of print]. PubMed ID: 28626879

    Andreyeva, E. N., et al. (2017). Regulatory functions and chromatin loading dynamics of linker histone H1 during endoreplication in Drosophila. Genes Dev 31(6): 603-616. PubMed ID: 28404631

    Ayer, D. E., Kretzner, L. and Eisenman, R. N. (1993). Mad: a heterodimeric partner for Max that antagonizes Myc transcriptional activity. Cell 72: 211-222. PubMed ID: 8425218

    Barbieri, M., Chotalia, M., Fraser, J., Lavitas, L. M., Dostie, J., Pombo, A. and Nicodemi, M. (2012). Complexity of chromatin folding is captured by the strings and binders switch model. Proc Natl Acad Sci U S A 109: 16173-16178. PubMed ID: 22988072

    Benedetti, F., Dorier, J., Burnier, Y. and Stasiak, A. (2014). Models that include supercoiling of topological domains reproduce several known features of interphase chromosomes. Nucleic Acids Res 42: 2848-2855. PubMed ID: 24366878

    Black, K. L., Petruk, S., Fenstermaker, T. K., Hodgson, J. W., Caplan, J. L., Brock, H. W. and Mazo, A. (2016). Chromatin proteins and RNA are associated with DNA during all phases of mitosis. Cell Discov 2: 16038. PubMed ID: 27807477

    Boettiger, A. N., Bintu, B., Moffitt, J. R., Wang, S., Beliveau, B. J., Fudenberg, G., Imakaev, M., Mirny, L. A., Wu, C. T. and Zhuang, X. (2016). Super-resolution imaging reveals distinct chromatin folding for different epigenetic states. Nature [Epub ahead of print]. PubMed ID: 26760202

    Breuer, M. and Ohkura, H. (2015). A negative loop within the nuclear pore complex controls global chromatin organization. Genes Dev 29: 1789-1794. PubMed ID: 26341556

    Ciabrelli, F., Comoglio, F., Fellous, S., Bonev, B., Ninova, M., Szabo, Q., Xuereb, A., Klopp, C., Aravin, A., Paro, R., Bantignies, F. and Cavalli, G. (2017). Stable Polycomb-dependent transgenerational inheritance of chromatin states in Drosophila. Nat Genet 49(6): 876-886. PubMed ID: 28436983

    Czermin, B., Schotta, G., Hulsmann, B. B., Brehm, A., Becker, P. B., Reuter, G. and Imhof, A. (2001). Physical and functional association of SU(VAR)3-9 and HDAC1 in Drosophila. EMBO Rep. 2: 915-919. PubMed ID: 11571273

    Dias, G. B., Heringer, P., Svartman, M. and Kuhn, G. C. (2015). Helitrons shaping the genomic architecture of Drosophila: enrichment of DINE-TR1 in alpha- and beta-heterochromatin, satellite DNA emergence, and piRNA expression. Chromosome Res [Epub ahead of print]. PubMed ID: 26408292

    Eagen, K. P., Hartl, T. A. and Kornberg, R. D. (2015). Stable Chromosome Condensation Revealed by Chromosome Conformation Capture. Cell 163: 934-946. PubMed ID: 26544940

    Erceg, J., Pakozdi, T., Marco-Ferreres, R., Ghavi-Helm, Y., Girardot, C., Bracken, A. P. and Furlong, E. E. (2017). Dual functionality of cis-regulatory elements as developmental enhancers and Polycomb response elements. Genes Dev 31(6): 590-602. PubMed ID: 28381411

    Filion, G. J., van Bemmel, J. G., Braunschweig, U., Talhout, W., Kind, J., Ward, L. D., Brugman, W., de Castro, I. J., Kerkhoven, R. M., Bussemaker, H. J. and van Steensel, B. (2010). Systematic protein location mapping reveals five principal chromatin types in Drosophila cells. Cell 143: 212-224. PubMed ID: 20888037

    Hassig, C. A., et al. (1997). Histone deacetylase activity is required for full transcriptional repression by mSin3A. Cell 89 (3): 341-347. PubMed ID: 9150133

    Hou, L., Wang, L., Berg, A., Qian, M., Zhu, Y., Li, F. and Deng, M. (2012). Comparison and evaluation of network clustering algorithms applied to genetic interaction networks. Front Biosci (Elite Ed) 4: 2150-2161. PubMed ID: 22202027

    Hwang, K. K., Eissenberg, J. C. and Worman, H. J. (2001). Transcriptional repression of euchromatic genes by Drosophila heterochromatin protein 1 and histone modifiers. Proc. Natl. Acad. Sci. 98: 11423-11427. PubMed ID: 11562500

    Jacobs, S. A., Taverna, S. D., Zhang, Y., Briggs, S. D., Li, J., Eissenberg, J. C., Allis, C. D. and Khorasanizadeh, S. (2001). Specificity of the HP1 chromo domain for the methylated N-terminus of histone H3. EMBO J. 20: 5232-5241. PubMed ID: 11566886

    Jost, D., Carrivain, P., Cavalli, G. and Vaillant, C. (2014). Modeling epigenome folding: formation and dynamics of topologically associated chromatin domains. Nucleic Acids Res 42: 9553-9561. PubMed ID: 25092923

    Jung, Y. L., Kang, H., Park, P. J. and Kuroda, M. I. (2016). Correspondence of Drosophila Polycomb Group proteins with broad H3K27me3 silent domains. Fly (Austin) [Epub ahead of print]. PubMed ID: 26940990

    Laprell, F., Finkl, K. and Muller, J. (2017). Propagation of Polycomb-repressed chromatin requires sequence-specific recruitment to DNA. Science 356(6333): 85-88. PubMed ID: 28302792

    Le, T. B., Imakaev, M. V., Mirny, L. A. and Laub, M. T. (2013). High-resolution mapping of the spatial organization of a bacterial chromosome. Science 342: 731-734. PubMed ID: 24158908

    Lee, H. G., Kahn, T. G., Simcox, A., Schwartz, Y. B. and Pirrotta, V. (2015). Genome-wide activities of Polycomb complexes control pervasive transcription. Genome Res 25: 1170-1181. PubMed ID: 25986499

    Martin, R. L., Maiorano, J., Beitel, G. J., Marko, J. F., McVicker, G. and Fondufe-Mittendorf, Y. N. (2017). A comparison of nucleosome organization in Drosophila cell lines. PLoS One 12(6): e0178590. PubMed ID: 28570602

    Nagy, L., et al. (1997). Nuclear receptor repression mediated by a complex containing SMRT, mSin3A, and histone deacetylase. Cell 89 (3): 373-380

    Pan, L., Xie, W., Li, K. L., Yang, Z., Xu, J., Zhang, W., Liu, L. P., Ren, X., He, Z., Wu, J., Sun, J., Wei, H. M., Wang, D., Xie, W., Li, W., Ni, J. Q. and Sun, F. L. (2015). Heterochromatin remodeling by CDK12 contributes to learning in Drosophila. Proc Natl Acad Sci U S A [Epub ahead of print]. PubMed ID: 26508632

    Ramírez, F., Lingg, T., Toscano, S., Lam, K. C., Georgiev, P., Chung, H. R., Lajoie, B. R., de Wit, E., Zhan, Y., de Laat, W., Dekker, J., Manke, T. and Akhtar, A. (2015). High-affinity sites form an interaction network to facilitate spreading of the MSL complex across the X chromosome in Drosophila. Mol Cell 60: 146-162. PubMed ID: 26431028

    Rea, S., Eisenhaber, F., O'Carroll, D., Strahl, B. D., Sun, Z. W., Schmid, M., Opravil, S., Mechtler, K., Ponting, C. P., Allis, C. D. and Jenuwein, T. (2000). Regulation of chromatin structure by site-specific histone H3 methyltransferases. Nature 406: 593-599. PubMed ID: 10949293

    Richards, E. J. and Elgin, S. C. R. (2002). Epigenetic codes for heterochromatin formation and silencing: rounding up the usual suspects. Cell 108: 489-500. PubMed ID: 11909520

    Ringrose, L., Rehmsmeier, M., Dura, J. M. and Paro, R. (2003). Genome-wide prediction of Polycomb/Trithorax response elements in Drosophila melanogaster. Dev. Cell 5: 759-771. PubMed ID: 14602076

    Schwartz, Y. B., et al. (2006). Genome-wide analysis of Polycomb targets in Drosophila melanogaster. Nat. Genet. 38(6): 700-705. PubMed ID: 16732288

    Sengupta, A. K., Kuhrs, A. and Müller, J. (2004). General transcriptional silencing by a Polycomb response element in Drosophila. Development 131: 1959-1965. PubMed ID: 15056613

    Sexton, T., Yaffe, E., Kenigsberg, E., Bantignies, F., Leblanc, B., Hoichman, M., Parrinello, H., Tanay, A. and Cavalli, G. (2012). Three-dimensional folding and functional organization principles of the Drosophila genome. Cell 148: 458-472. PubMed ID: 22265598

    Shevelyov, Y. Y., Lavrov, S. A., Mikhaylova, L. M., Nurminsky, I. D., Kulathinal, R. J., Egorova, K. S., Rozovsky, Y. M. and Nurminsky, D. I. (2009). The B-type lamin is required for somatic repression of testis-specific gene clusters. Proc Natl Acad Sci U S A 106: 3282-3287. PubMed ID: 19218438

    Smith, M. B. and Weiler, K. S. (2010). Drosophila D1 overexpression induces ectopic pairing of polytene chromosomes and is deleterious to development. Chromosoma 119: 287-309. PubMed ID: 20127347

    Stadler, M. R., Haines, J. E. and Eisen, M. (2017). Convergence of topological domain boundaries, insulators, and polytene interbands revealed by high-resolution mapping of chromatin contacts in the early Drosophila melanogaster embryo. Elife 6. PubMed ID: 29148971

    Tschiersch, B., Hofmann, A., Krauss, V., Dorn, R., Korge, G. and Reuter, G. (1994). The protein encoded by the Drosophila position-effect variegation suppressor gene Su(var)3-9 combines domains of antagonistic regulators of homeotic gene complexes. EMBO J. 13: 3822-3831. PubMed ID: 7915232

    Tyler, J. K., et al. (1996). The p55 subunit of Drosophila chromatin assembly factor 1 is homologous to a histone deacetylase-associated protein. Mol. Cell. Biol. 16: 6149-6159. PubMed ID: 8887645

    Ulianov, S. V., Khrameeva, E. E., Gavrilov, A. A., Flyamer, I. M., Kos, P., Mikhaleva, E. A., Penin, A. A., Logacheva, M. D., Imakaev, M. V., Chertovich, A., Gelfand, M. S., Shevelyov, Y. Y. and Razin, S. V. (2015). Active chromatin and transcription play a key role in chromosome partitioning into topologically associating domains. Genome Res. PubMed ID: 26518482

    Yarosh, W. and Spradling, A. C. (2014). Incomplete replication generates somatic DNA alterations within Drosophila polytene salivary gland cells. Genes Dev 28(16): 1840-1855. PubMed ID: 25128500

    Zhimulev, I. F., Belyaeva, E. S., Makunin, I. V., Pirrotta, V., Volkova, E. I., Alekseyenko, A. A., Andreyeva, E. N., Makarevich, G. F., Boldyreva, L. V., Nanayev, R. A. and Demakova, O. V. (2003). Influence of the SuUR gene on intercalary heterochromatin in Drosophila melanogaster polytene chromosomes. Chromosoma 111: 377-398. PubMed ID: 12644953

    Zielke, T., Glotov, A. and Saumweber, H. (2015). High-resolution in situ hybridization analysis on the chromosomal interval 61C7-61C8 of Drosophila melanogaster reveals interbands as open chromatin domains. Chromosoma [Epub ahead of print]. PubMed ID: 26520107

    Home page: The Interactive Fly © 1995, 1996 Thomas B. Brody, Ph.D.

    The Interactive Fly resides on the Society for Developmental Biology's Web server.