The Interactive Fly
Zygotically transcribed genes
Chromatin consists of proteins that serve as the structural organizer of DNA, binding DNA into higher order structures and ultimately forming the chromosome itself. Chromatin restricts the access of DNA to transcription factors. Both Polycomb and trithorax group proteins act to remodel chromatin altering the accessibility of DNA to factors required for gene transcription. Polycomb group genes are involved in chromatin based gene silencing, while trithorax group genes counteract the silencing effects of chromatin to maintain gene activity.
There is an evolving understanding of the enzymes that function to remodel chromatin. At least two systems related to yeast SWI/SNF proteins function to open up chromatin, permitting access to transcription factors. Information on SWI/SNF homologs can be found at the ISWI and Brahma sites. Information about the potential role of the origin recognition complex in chromatin remodeling can be found at the Origin recognition complex 2 (ORC2) site.
Nucleosome assembly protein-1 (NAP1) and Chromatin assembly factor 1 subunit (CAF1) play the role of histone chaperones in establishing an ordered nucleosome structure on newly synthesized DNA. Drosophila CAF-1 appears to comprise four subunits of 180, 105, 75 and 55 kDa. The smallest subunit of Drosophila CAF-1, p55, is homologous to a mammalian RbAp-48 protein which is associated with the HD1 histone deacetylase. A model for the role of core histone chaperones in chromatin assembly is as follows: CAF-1 binds to newly synthesized H3 and (acetylated) H4 and mediates the formation of the H3-H4 tetramer into newly replicated DNA: histones H2A-H2B are subsequently incorporated with the assistance of other histone chaperones, such as nucleoplasmin or NAP1, to give the complete histone octamer. The initial histone acetylation may be required to neutralize its high positive charge, allowing it to be assembled into chromatin. Deacetylation of histones carried out by histone deacetylase, could be a prerequisite to maturation of chromatin. In any case, it is now clear that chromatin assembly and maturation involves histone acetylation and that this process begins in cytoplasm and histones are subsequently transferred to the nucleus and the deacetylated (Tyler, 1996).
Regulatory elements called enhancers, or locus control regions are capable of exerting their influence over long distances, and in an orientation-independent manner to orchestrate the complex gene expression patterns required for embryonic development. How are the effects of enhancers confined to the genes they regulate? In recent years the concept of chromatin based domain boundaries or insulator elements has developed, based on the genetic properties of several eukaryotic genes. One example of an insulator element is the Drosophila gypsy insulator. For discussion of the gypsy insulator, and the role of two proteins, Suppressor of Hairy wing and MOD(MDG4) in its regulation, see the su(Hw) site.
One other aspect of gene silencing has been established for mammalian and yeast systems. Whereas histone acetylation is known to be involved in gene activation in Drosophila dosage compensation (See Male-specific lethal 2), a role for deacetylation in gene silencing has not yet been established in Drosophila. Two examples of the role of histone deacetylation in gene silencing in mammals will be described briefly here. Histone deacetylation plays a role in mammalian Myc mediated silencing (see Drosophila Myc Evolutionary Homologs section for more information) and in mammalian nuclear receptor mediated silencing (see Ecdysone receptor Evolutionary Homologs section for more information).
Myc family proteins function through heterodimerization with the stable, constitutively expressed bHLH-Zip protein, Max. Human Mad protein homodimerizes poorly but binds Max in vitro, forming a sequence-specific DNA binding complex with properties very similar to those of Myc-Max. Both Myc-Max and Mad-Max heterocomplexes are favored over Max homodimers. Mad does not associate with Myc or with representative bHLH, bZip, or bHLH-Zip proteins. On the other hand, Myc-Max and Mad-Max complexes carry out opposing functions in transcription and Max plays a central role in this network of transcription factors (Ayer, 1993).
Members of the Mad family of bHLH-Zip proteins heterodimerize with Max to repress transcription in a sequence-specific manner. Transcriptional repression by Mad:Max heterodimers is mediated by ternary complex formation with either of the corepressors mSin3A or mSin3B. mSin3A is an in vivo component of large, heterogeneous multiprotein complexes and is tightly and specifically associated with at least seven polypeptides. Two of the mSin3A-associated proteins, p50 and p55, are highly related to the histone deacetylase HDAC1. The mSin3A immunocomplexes possess histone deacetylase activity that is sensitive to the specific deacetylase inhibitor trapoxin. mSin3A-targeted repression is reduced by trapoxin treatment, suggesting that histone deacetylation mediates transcriptional repression through Mad-Max-mSin3A multimeric complexes (Hassig, 1997).
The same proteins that mediate transcriptional silencing of Mad-Max also mediate transcriptional silencing of nuclear hormone receptors that are attached to DNA but free of ligand. Whereas liganded nuclear receptors serve as transcriptional activators, unliganded nuclear receptors serve as repressors. How does the unliganded nuclear receptor transmit a repressive signal to the transcriptional apparatus and what is the nature of this signal? In fact, the target of the unliganded nuclear receptor is not RNA polymerase but chromatin, and repression is mediated by corepressors, proteins that associate with unliganded nuclear receptors that assemble a macromolecular complex that modifies chromatin so as to silence gene activity. The macromolecular complex acts to deacetylate histone. The transcriptional corepressors SMRT and N-CoR function as silencing mediators for retinoid and thyroid hormone receptors. SMRT and N-CoR directly interact with unliganded nuclear receptors, and these corepressors in turn directly interact with mSin3A, a corepressor for the Mad-Max heterodimer and a homolog of the yeast global-transcriptional repressor Sin3p. The recently characterized histone deacetylase 1 (HDAC1) interacts with Sin3A and SMRT to form a multisubunit, ternary repressor complex. Histone deacetylase in turn targets chromatin, converting it into a form that is unaccessible to the transcriptional apparatus. Consistent with this model, it is found that HDAC inhibitors synergize with retinoic acid to stimulate hormone-responsive genes and the differentiation of myeloid leukemia (HL-60) cells. Addition of a deacetylase inhibitor such as Trichostatin A relieves transcriptional repression resulting in a promoter that is sensitive to the addition of activating hormone. This work establishes a convergence of repression pathways for bHLH-Zip proteins and nuclear receptors and suggests that this type of regulation may be more widely conserved than previously suspected (Nagy, 1997).
Polycomb/Trithorax response elements (PRE/TREs) maintain transcriptional decisions to ensure correct cell identity during development and differentiation. There are thought to be over 100 PRE/TREs in the Drosophila genome, but only very few have been identified due to the lack of a defining consensus sequence. The definition of sequence criteria that distinguish PRE/TREs from non-PRE/TREs is reported in this study. Using this approach for genome-wide PRE/TRE prediction, 167 candidate PRE/TREs are reported, that map to genes involved in development and cell proliferation. Candidate PRE/TREs are shown to be bound and regulated by Polycomb proteins in vivo, thus demonstrating the validity of PRE/TRE prediction. Using the larger data set thus generated, three sequence motifs that are conserved in PRE/TRE sequences have been identified (Ringrose, 2003).
The detection of PRE/TREs by prediction generates a large data set that can be used to search for further common sequence features. To this end, the 30 highest scoring PRE/TRE hits were scanned for motifs that occur significantly more often in PRE/TREs than in randomly generated sequence. Five significant motifs were found. Not surprisingly, but reassuringly, two known motifs, the GAF and PHO binding sites were found. The Zeste binding motif was not found by this analysis, although it occurs as frequently as GAGA factor in the 30 sequences analyzed. This is probably due to the shortness and degeneracy of the Zeste motif, and suggests that other such short motifs will also be missed by this approach (Ringrose, 2003).
Nevertheless, three additional motifs were found. The first, called GTGT, is found several times in 14 of the sequences. The second motif, poly T, is found several times in almost all 30 PRE/TRE sequences analyzed. Some variants of this site match the binding consensus for the Hunchback protein, which has been shown to be an early regulator at some PRE/TREs. The third motif, TGC triplets, occurs several times in 13 of the PRE/TRE sequences. No binding factor for this sequence has yet been identified (Ringrose, 2003).
To further examine these three motifs, motif occurrence was evaluated in all 167 predicted PRE/TREs and in the promoter peaks described above. In contrast to the known GAF, Z, and PHO motifs, the three motifs each occur in only a subset of predicted and known PRE/TREs, and do not occur significantly together. These motifs may thus each define a subclass of PRE/TREs. Consistent with this idea, some of the lowest scoring known PRE/TRE sequences indeed contain one or more of the three motifs (Ringrose, 2003).
Although no correlation between particular sites and high scores was found, a negative correlation was found between numbers of GAF/Z and PHO sites (a correlation coefficient of -0.78, indicating that when many GAF/Z sites are present, there are few PHO sites, and vice versa). This suggests that each PRE/TRE may have a preferred ground state, in which it is either predisposed to silencing (many PHO sites) or to activation (many GAF/Z sites) (Ringrose, 2003).
In summary, this analysis identifies three motifs that occur significantly in association with known PRE/TRE motifs. Further functional characterization of these motifs and the proteins that bind them may contribute to a more complete definition of the sequence requirement for PRE/TRE function, and of subclasses of PRE/TREs (Ringrose, 2003).
This study offers four main contributions to the understanding of PRE/TRE function. First, a larger set of sequences have been defined that will facilitate the more complete definition of PRE/TRE sequence requirements. Three motifs have been identified that may contribute to this goal. The definition of the minimal requirement for PRE/TRE function will not be a trivial task. Analysis of motif composition and order in the 167 predicted PRE/TREs reveals that there is a great diversity of patterns, with no preferred linear order. It is possible that each different pattern of motifs reflects a subtly different function. However, the concept of a linear order of motifs may well be irrelevant, because these elements operate in the three-dimensional context of chromatin. The fact that such a diversity of PRE/TRE designs exist indicates that the vast majority of them would defy detection by conventional pattern-finding algorithms, and underlines the advantages of the approach described in this study (Ringrose, 2003).
Although no linear constraints on motif order were found, the fact that only motif pairs, and not single motifs, are able to identify PRE/TREs strongly suggests that this close spacing of sites has functional significance. Multiple sites may work in concert, to promote cooperative binding of similar proteins (e.g., repeated PHO sites) or to provoke competition between dissimilar proteins (e.g., closely spaced GAGA factor and PHO sites). In addition, in chromatin, only a subset of sites will be exposed and optimally available for binding at any one time, while others will be occluded by nucleosomes. The trxG includes nucleosome remodeling machines, raising the intriguing possibility that remodeling of PRE/TREs in chromatin may contribute to epigenetic switching by exposing different sets of protein binding sites (Ringrose, 2003).
Second, a PRE/TRE peak is observed at the promoter of all the genes examined. This strongly suggests that promoter binding is a general principle of PRE/TRE function. It has been reported that PcG proteins can interact with general transcription factors. It has hitherto been unclear whether the observed PcG/trxG binding at promoters of the genes they regulate is mediated indirectly via such an interaction, or whether the PcG and trxG bind directly to PRE/TREs at the promoters. The high scores observed at promoters favor the latter interpretation (Ringrose, 2003).
Third, it has been shown that in most cases, PRE/TREs do not occur in isolation, but are accompanied by one or more other peaks nearby. These grouped PRE/TREs may create multiple attachment sites for PcG and trxG proteins, which come together to build a fully operational complex at the promoter. Alternatively, grouped PRE/TREs may be individually regulated by tissue-specific enhancers as in the BX-C. Thus, each of the many PRE/TREs of the homothorax gene may interact with the promoter PRE/TRE in different tissues. This idea is consistent with the fact that Homothorax has specific roles in diverse developmental processes (Ringrose, 2003).
Finally, the current list of about ten PcG/trxG target genes has been expanded to over 150 genes, identifying candidates for epigenetic regulation. The genes thus identified encompass every stage of development, suggesting that the PcG/trxG are global regulators of cellular memory. Experiments to further investigate and compare this regulation for individual genes are currently underway (Ringrose, 2003).
Polycomb group (PcG) complexes are multiprotein assemblages that bind to chromatin and establish chromatin states leading to epigenetic silencing PcG proteins regulate homeotic genes in flies and vertebrates, but little is known about other PcG targets and the role of the PcG in development, differentiation and disease. This study determined the distribution of the PcG proteins PC, E(Z) and PSC and of trimethylation of histone H3 Lys27 (me3K27) in the Drosophila genome
using chromatin immunoprecipitation (ChIP) coupled with analysis of immunoprecipitated DNA with a high-density genomic tiling microarray. At more than 200 PcG target genes, binding sites for the three PcG proteins colocalize to presumptive Polycomb response elements (PREs). In contrast, H3 me3K27 forms broad domains including the entire transcription unit and regulatory regions. PcG targets are highly enriched in genes encoding transcription factors, but they also include genes coding for receptors, signaling proteins, morphogens and regulators representing all major developmental pathways (Schwartz, 2006).
The components of PcG complexes are products of PcG genes, first discovered as crucial regulators of homeotic genes in Drosophila. Immunostaining of Drosophila polytene chromosomes, however, showed PcG proteins at about 100 cytological loci, implying a much larger number of target genes. Functional analysis has identified PREs as DNA sequences able to recruit PcG proteins and establish PcG silencing of neighboring genes. Two types of PcG complexes bind to PREs. PRC1-type complexes include a core quartet of proteins: PC, PSC, PH and dRing. PRC2-type complexes include E(Z), which methylates histone H3 Lys27. Mono- and dimethylated Lys27 is widely distributed in the genome, but PcG sites characteristically contain trimethylated Lys27 (me3K27). The activity of the E(Z) complex is essential for stable silencing, and it has been proposed that H3 me3K27 recruits the PRC1 complex through the specific affinity of the PC chromodomain for me3K27. But the relationships between PRC1 and PRC2 complexes, between their binding sites and histone methylation, and between binding, methylation and gene expression are not well understood and remain the subject of debate. The genomic distribution of three PcG proteins [PC, PSC and E(Z)] and of histone H3 me3K27 was examined using chromatin immunoprecipitation (ChIP). Since PcG target genes may be repressed in some tissues and active in others, a cultured cell line was used to minimize heterogeneity (Schwartz, 2006).
Viewed at the scale of a chromosome arm, the distributions of PC, PSC, E(Z) and me3K27 coincide at a number of distinct binding peaks (which are referred to as 'PcG sites') that correspond to 70% of the bands reported in salivary gland polytene chromosomes stained with the corresponding antibodies. To minimize false positives, the analysis focused on the PcG sites that showed simultaneous binding of two or more proteins, each above twofold enrichment. Of the 149 PcG sites detected (see the supplemental figure), 95 showed strong binding of all four proteins ('strong' PcG sites), whereas in 54 sites the binding was lower and below threshold for one of the proteins ('weak' PcG sites). At higher resolution, most PcG sites involve two or more genes, often sharing structural or functional similarities. Thus, PcG sites involve the following: engrailed (en) and invected (inv); the PcG genes ph-p and ph-d; the Dorsocross T-box gene cluster; the muscle NK homeobox gene cluster; the wingless cluster; and the two homeotic complexes ANT-C and BX-C (Schwartz, 2006).
The Bithorax complex (BX-C) is a cluster of three homeotic genes (Ubx, abd-A and Abd-B) responsible for segmental identity in the abdomen and posterior thorax. The most prominent features are two sharp binding peaks for all three PcG proteins at the sites of the bx and bxd PREs that control Ubx. No peak was detected over the Ubx proximal promoter, although the entire gene shows a low but significant level of PC. A series of lower peaks emerged in the abd-A region and part of the Abd-B gene. Some of these correspond to the known PREs iab-2. In contrast, the distribution of H3 me3K27 oscillated rapidly above a high plateau that covers Ubx and abd-A but not Abd-B. RT-PCR was used to determine the mRNA levels corresponding to these three genes. Transcription of Ubx and abd-A in these cells was very low but distinctly above background. Abd-B was highly transcribed, at levels 300 times higher than Ubx. This pattern of activity was reflected by the distribution of both PcG proteins and me3K27. It is noted that in the Abd-B regulatory region, the previously characterized Fab-7 and Fab-8 PREs neither bound PcG proteins nor were methylated in these cells. The Abd-B gene has five distinct promoters. A sharp resurgence of both methylation and PcG protein binding in the region of the most upstream Abd-B promoter suggests that, in contrast to the other four promoters, this one might be repressed in the cultured cells. RT-PCR analysis using primers specific for mRNAs initiating from each promoter confirmed that the most upstream promoter is silent and that the other four are active. These results support the view that binding of PcG proteins to PREs is associated with transcriptional quiescence, whereas robust transcriptional activity is accompanied by lack of binding to the PREs and lack of Lys27 methylation over the transcription unit (Schwartz, 2006).
Strong genomic sites bind all three PcG proteins. The PSC and E(Z) peaks generally rise sharply and are contained within less than 2 kb, whereas PC frequently forms a broader peak that may include shoulders or subsidiary peaks absent for E(Z) and PSC and subsides to background more gradually. These peak binding regions are thought of as corresponding to PREs, which they in fact do in the cases where these are known. Additional binding peaks may be found within or downstream of the transcription unit. In contrast, distribution of H3 me3K27 at each site is very broad, forming a domain of tens or even hundreds of kilobases encompassing the transcription unit and regulatory regions of one or more genes but, rather than a level plateau, it consists of a series of deep oscillations (Schwartz, 2006).
The strong binding peaks or putative PREs are often associated with low values or troughs in the methylation profile and at secondary peaks the PC distribution frequently echoes methylation peaks. Overall, their relationship does not support the idea that methylation of Lys27 suffices to recruit binding of PC. It is proposed instead that PC bound to the strong binding peaks, the presumptive PREs, is recruited by proteins that bind specifically to those sequences. The weaker PC binding peaks and tails that mirror the methylation profile near PREs may represent a second mode of PC binding mediated by the interaction of the chromodomain with H3 me3K27 (Schwartz, 2006).
It is supposed that methylation domains initiated by a PRE might spread bidirectionally until they encounter 'active' chromatin, characterized by histone acetylation or methylation of H3 Lys4, marks typical of transcriptionally active genes. Alternatively, specific features might shape the methylation domain either positively, by attracting the methyltransferase complex, or negatively, by blocking productive interactions with the PRE. As in the case of the Abd-B gene or of CG7922 and CG7956 genes, sudden drops in levels of me3K27 are generally associated with transcriptional activity. Are insulators involved in protecting CG7922 and CG7956 from silencing, or is the activity of these two genes simply epigenetically maintained from the time the cell line was originally established? Further work is required to answer this question (Schwartz, 2006).
In many cases, the presumptive PRE lies between divergently transcribed genes such as dco and Sox100B. Which of the two is the PRE target? As PREs can act at distances of 20-30 kb, the proximity of PcG peaks to a promoter is not a reliable guide. It is proposed that the methylation domain is the clue to the target of PcG regulation. A PcG peak is not considered to regulate a promoter if the gene is not included in the methylation domain. When multiple genes are included in the methylation domain, it is likely that they are all affected by PcG regulation. However, this study distinguishes between genes that contain methylation as well as one or more PcG proteins and genes that contain only methylation (Schwartz, 2006).
The 95 'strong' binding sites in the genome encompass a total of 392 genes. Of these 392 genes, 186 contain both PcG binding and methylation, and the remainder are found within broad methylation domains associated with PcG proteins binding but do not bind PcG proteins over their own promoter or transcription unit. They may represent genes not directly targeted but affected by the spread of methylation. An analysis of their ontology indicates that these two classes are in fact very different. Transcription regulators constitute 64.5% of the first set, compared to 4.3% for the full annotation set. Instead they constitute only 4.0% of those genes that contain only me3K27. These comparisons strongly suggest that (1) genes that regulate transcription are preferred PcG targets, and (2) genes that only include the tails of a methylation domain are probably not primary targets of PcG regulation. A similar preference is also seen among the 'weak' binding sites. These include a total of 74 genes containing both PcG proteins and methylation, 28.4% of which encode transcription regulators. Flanking genes containing only methylation include only 5.7% transcription regulators. Although transcription regulators are preferred PcG targets, secreted proteins, growth factors or their receptors, and signaling proteins are also targeted. PcG target genes include components of all the major differentiation and morphogenetic pathways in Drosophila (Schwartz, 2006).
The major features of PcG binding shown by this work are that, although the proteins themselves are highly localized at presumptive PREs, the domain of histone methylation they produce is much broader. If the E(Z) methyltransferase is localized at the PRE, how is the extensive methylation domain produced? A looping mechanism is proposed in which interaction of PRE-bound complexes with flanking chromatin is mediated by the PC chromodomain. The observed broader distribution of PC might result from crosslinking of the chromodomain to methylated H3, reflecting this mechanism (Schwartz, 2006).
Are PREs defined by characteristic sequence motifs? Although the analysis of the sequences underlying the binding peaks will be presented elsewhere, it is noted that Ringrose (2003) devised an algorithm based on GAGA factor, PHO and Zeste binding motifs to identify sequences likely to represent PREs. This algorithm correctly predicts a number of the strong PcG binding sites (27%) and a few of the weaker sites (7%), overall 20%; however, it does not predict the majority of the PcG sites. The reverse is also true: only 22% of the PREs predicted by Ringrose bind PcG proteins in these experiments. Together, these data suggest that additional criteria are necessary to predict most PREs reliably (Schwartz, 2006).
As expected, PcG proteins and me3K27 are associated with transcriptional quiescence, but the data suggest that this is not an absolute condition. Low but significant transcription levels are detected even for the repressed Ubx and abd-A genes. Two target sites, polyhomeotic and the Psc-Su(z)2 site, contain PcG genes, which must be active to ensure the functioning of the PcG mechanism. The polyhomeotic locus is one of two sites in the entire genome that bind PC but lack appreciable levels of E(Z) and of Lys27 methylation. Instead, the Psc-Su(z)2 region is well methylated and binds both PC and E(Z) at multiple peaks. It is concluded that PcG mechanisms do not invariably lead to transcriptional silencing and are compatible with moderate levels of transcription (Schwartz, 2006).
Another point of interest is the number and kind of genes that are PcG targets. Considering the developmental difference between salivary gland cells and the embryo-derived tissue culture cells, the substantial number of shared PcG sites suggests that a majority of target sites are occupied in a large percent of cells. Target genes are in fact predominantly regulatory genes that control major differentiation and morphogenetic pathways. These pathways and their genes are highly conserved, and recent work shows that they are also regulated by PcG in mammals. It might be expected that in a given cell type most alternative genomic programs would be repressed save the subset required in that cell type. The emerging picture from these studies is that PcG regulation is a key mechanism in genomic programming (Schwartz, 2006).
Histone H3 variants specify modes of chromatin assembly Histone variants have been known for 30 years, but their functions
and the mechanism of their deposition are still largely unknown.
Drosophila has three versions of histone H3. H3.3 marks active chromatin and may be essential for
gene regulation, and Cid is the characteristic structural component
of centromeric chromatin. The properties of
these histones have been characterized by using a Drosophila cell-line system that allows precise analysis of both DNA replication and histone deposition. The deposition of H3 is restricted to replicating DNA. In striking contrast, H3.3 and Cid deposit throughout the cell cycle. Deposition of H3.3 occurs without any corresponding DNA replication. To
confirm that the deposition of Cid is also replication-independent
(RI), centromere replication was examined in cultured cells and
neuroblasts. It was found that centromeres replicate out of phase
with heterochromatin and display replication patterns that may
limit H3 deposition. This confirms that both variants undergo RI
deposition, but at different locations in the nucleus. How variant
histones accomplish RI deposition is unknown, and raises basic
questions about the stability of nucleosomes, the machinery that
accomplishes nucleosome assembly, and the functional organization
of the nucleus. The different in vivo properties of H3, H3.3, and
Cid set the stage for identifying the mechanisms by which they are
differentially targeted. It is suggested that local effects of
'open' chromatin and broader effects of nuclear organization help to guide the two different H3 variants to their target sites (Ahmad, 2002).
Nucleosomes are the fundamental units of chromatin, consisting
of 146 bp of DNA wrapped around an octamer of
four core histones. Histone deposition occurs primarily as DNA
replicates to complete chromatin doubling. During S phase
of the cell cycle, new histones are produced in abundance for
immediate replication-coupled deposition. In most metazoans,
this abundant S-phase synthesis results from the tight regulation
of tens to hundreds of intronless histone genes that have special
3' untranscribed regions instead of poly(A) tails. However,
some histones are produced from orphan genes outside of S
phase. In Drosophila, orphan genes encode two H3 variants: one
encodes Cid, the centromeric histone, and two encode H3.3,
the replacement variant. These variants
have equivalents in many other eukaryotes. The H3.3
histone is nearly identical to H3, differing at only four amino acid
positions. Cid differs profoundly from H3 in sequence, showing
some significant identity only within the histone fold domain.
Surprisingly, these three histones have different deposition
properties. H3 and H3.3 are deposited as DNA replicates, but
both H3.3 and Cid can be deposited at sites that are not
undergoing DNA replication. Whereas only
a minor fraction of the bulk genome is packaged into Cid- and
H3.3-containing nucleosomes, each variant is targeted to different
specialized sites, with Cid localizing to centromeres and H3.3
to transcriptionally active genes. Specific localization of centromeric
H3-like histones (CenH3s) has been observed in various
animals, fungi, and plants. Also, an H3.3-like
histone targets the transcriptionally active macronucleus in
ciliates. Thus, the targeting of H3 variants is likely a feature
of every eukaryotic cell, where centromeres and transcribed
regions are the major loci of activity in metaphase and interphase,
respectively. Both kinds of loci use a distinct pathway for
nucleosome assembly, and this study explores the properties
of this process (Ahmad, 2002).
Studies of histone deposition have generally been done using
crude extracts, purified components or pools of cells from which
bulk chromatin is extracted. These methods reveal the
average properties of chromatin, and have shown that the bulk
of chromatin doubles as DNA replicates. Extensive in vitro work
has demonstrated that the assembly of nucleosomes is a stepwise
process in which deposition of an (H3:H4)2 tetramer is followed
by addition of two H2A:H2B dimers. The new histones are
brought to the replication fork in a complex with chromatin
assembly factor 1 (CAF1). CAF1 appears to be recruited to the
replication fork by binding to the ring-shaped proliferating cell
nuclear antigen (PCNA) that encircles the DNA template at
each replication fork. Histones from the parent DNA are
distributively segregated to the two sister chromatids behind the
replication fork, and the gaps in their nucleosomal arrays are
rapidly filled by step-wise assembly of new nucleosomes. These
nucleosomes are then matured by addition of linker histones and
covalent modification of histone tails to complete chromatin (Ahmad, 2002).
Nucleosomes containing H3 variants comprise only a small
proportion of bulk chromatin, and thus their properties have
been generally undetectable. However, replacement H3 variants
can become enriched in the chromatin of nonreplicating cells. This means that other ways of depositing histones must
exist; but because such variant enrichment has been detectable only
in unusual cell types (such as long-lived neurons or spermatocytes), studies of the phenomenon have been limited. The ability
to tag histones and examine their deposition properties in single
cells has allowed a gain in insight into chromatin assembly
processes (Ahmad, 2002).
A cytological assay system was developed for studying replication
and chromatin assembly by using Drosophila Kc cells, a cell
line that displays a regular cell division schedule and
a consistent tetraploid karyotype. Organization of the Drosophila
nucleus is visually simple, because the late-replicating heterochromatin
typically coalesces into a compartment in the
nucleus, termed the chromocenter. This provides both
a temporal and spatial distinction between the early replicating,
gene-rich euchromatin, and the late-replicating heterochromatin (Ahmad, 2002).
DNA replication can be tracked either by pulse-labeling with
nucleotide analogs or by using anti-PCNA antibody. Furthermore,
by introducing histone-GFP fusion constructs and producing
a pulse of the tagged protein, histone deposition can be tracked during the cell cycle. Using this system, it has been possible to quantitatively examine DNA replication and histone
deposition in unsynchronized populations of cells (Ahmad, 2002).
GFP-tagged H3 shows exclusively replication-coupled deposition,
displaying co-localization with replication markers and
showing no detectable deposition in cells in which replication has
been blocked. The N-terminal tail of H3 is required, suggesting
that the H3 tails of tetramer particles interact with
accessory factors at some early step in nucleosome assembly in
vivo (Ahmad, 2002).
In contrast to the properties of GFP-tagged H3 in cells, tagged
H3.3 deposits in a replication-independent manner at actively
transcribing loci. Deposition can occur in any stage of the cell
cycle, and it is not accompanied by
unscheduled DNA synthesis. Incorporation of H4 also occurs at
these target sites, as expected for deposition of (H3.3:H4)2
tetramers; but how replication-independent (RI) histone deposition
occurs is virtually unknown. Tagged Cid can also deposit throughout the cell cycle, suggesting that its deposition is also replication-independent (Ahmad, 2002).
However, this conclusion depends on knowing the timing of
centromere replication. Centromeres replicate
within a defined portion of S phase and Drosophila centromeres replicate as isolated domains within
later-replicating heterochromatin (Ahmad, 2002).
Historically, centromeres have been thought to replicate very
late in the cell cycle. This is because they are embedded within
pericentric heterochromatin, which replicates late. Analysis has
usually relied on visualization at mitosis; but mitotic chromosomes
have inherently low resolution because they are highly
condensed. Indeed, a recent study showed that Drosophila
centromeres cannot be resolved from heterochromatin in 44% of
spread mitotic chromosomes. Despite this limitation, it has been concluded Cid-containing chromatin replicates
on the same late schedule as pericentric heterochromatin. However,
this could be late replication in pericentric heterochromatin
that was mis-scored as replication of centromeres (Ahmad, 2002).
This uncertainty has been addressed by analyzing mitotic
chromosome replication patterns, providing brief 15-min pulses
to Kc cells and examining mitotic figures after a chase. This
provides a 'snapshot' of replication at single points in the cell
cycle. Examples of heterochromatin replication
patterns are observed similar to those previously reported, where labeling
overlaps Cid spots. However, unambiguous
examples of chromosomes that were intensely labeled
throughout the euchromatic arms, with foci directly coinciding
with centromeres, are also observed. These centromeric foci are surrounded by heterochromatin that did not replicate during the
labeling pulse (Ahmad, 2002).
Experiments using interphase Kc cells revealed
that ~90% of centromere replication occurs when euchromatin
is replicating. The remaining 10% may be late replication
in centromeric regions, but is more likely the result of nearby
heterochromatic replication foci that can not be resolved from
sites with Cid. Such early replication of centromeres is not
limited to tetraploid Kc cells --
similar replication patterns are observed in diploid larval neuroblasts -- although the much shorter cell cycle time and the more
irregular chromocenter limits quantitative analysis. Therefore,
this early timing of centromere replication appears to be general
for Drosophila cells (Ahmad, 2002).
A series of progressively more direct experiments have provided
insight into the fine structure in the centromere region. A model
for the centromeric constriction has suggested that loops of DNA
coil through the constriction, with centromeric nucleosomes
lying in the outward parts of these coils, and conventional
nucleosomes in the interior portions. This would
account for the polar structure of the entire centromere if
centromeric nucleosomes nucleate kinetochore formation (and
thus microtubule capture) and conventional nucleosomes recruit
cohesins (and thus centromeric cohesion). The linear arrangement
of nucleosomes along centromeric DNA would then be
alternating blocks of centromeric and conventional nucleosomes
within the centromeric domain. A study using
stretched chromatin fibers has demonstrated that Cid and H3 are
interspersed in Drosophila, although these are not included in the
same nucleosome. Apparently, blocks packaged in one kind
of nucleosome alternate with blocks packaged in the other (Ahmad, 2002).
How could the duplication of such regular but discontinuous arrays of
nucleosomes occur? The alternating pattern of nucleosomes on stretched chromatin fibers is reminiscent of replication patterns on fibers from
normal chromatin. Replication origins within a chromatin
domain often appear to be regularly spaced with an interval of
50-100 kb, and these origins fire synchronously. Perhaps the
nucleosome blocks in the centromeric regions correspond to an
underlying regular arrangement of replication origins throughout
the entire centromeric domain. If Cid-containing blocks
include the origins for these domains, and if replication initiates
at a time when H3 is not available, ultimately only the RI
deposition of Cid will package these blocks. The later replicating
stretches would incorporate H3 as it becomes available. In this
way, the fine pattern of replication would maintain the discontinuous
Cid arrays over an extended region (Ahmad, 2002).
The model for maintaining the higher-order chromatin structure
of the entire centromere has precise requirements for
replication patterns in this region: a discontinuously spaced
arrangement of origins must correspond to the blocks of Cid-containing
chromatin. At least two other
patterns of replication in this region can be imagined: (1) all
Cid- and H3-containing blocks might replicate simultaneously
(pattern 2); (2) a single origin might replicate the entire
domain (pattern 3) (Ahmad, 2002).
The possibility of the existence of discontinuous replication track corresponding to blocks of centromeric chromatin was investigated by pulse-labeling cells for only
15 min. To prepare stretched chromatin fibers,
nuclei spread on a glass slide were disrupted in a high-salt buffer. As the buffer
runs off the slide, it pulls chromatin fibers behind it. Stretched centromeres were identified and those fibers were examined in
which nucleotide incorporation was unambiguous. In each of
these cases it was clear that replication was occurring in discrete
patches scattered throughout the centromeric domain. These replication tracks must arise from multiple origins, and
thus the possibilities that the entire domain
replicates from a single origin, or that the whole domain
replicates simultaneously can be ruled out (Ahmad, 2002).
These patches correspond significantly with the segments
between Cid-containing chromatin. Thus, from published experiments
and the experiments described here it appears that
replication occurs in two discrete phases: all CenH3-containing
chromatin within a domain replicates, and at a different time all
H3-containing chromatin replicates. Therefore, replication
within this domain is discontinuous and initiates from multiple
origins (Ahmad, 2002).
Given that deposition of any H3 must occur in the form of
(H3:H4)2 tetramers, there must be discrimination of H3-
containing tetramers from tetramers containing variants. Thus
analysis of RI assembly was used to initiated the mapping of discriminating sites within the histone variants. It was found that one type of discrimination is a cluster of three residues within the histone
fold domain (HFD) of H3 that limits it to replication-coupled
deposition. Furthermore, because both Cid and H3.3 undergo
RI deposition but have mutually exclusive targets, there must be
additional discrimination between these variants (Ahmad, 2002).
Replication-coupled nucleosome assembly is aided by accessory
factors that are recruited to the replication fork by binding
to PCNA. However, the process of RI deposition must be
different, because RI deposition of H3.3 does not require
portions of the histone that are required for replication-coupled
deposition. Furthermore, the lack of PCNA during gap phase
deposition raises the question of what is recruiting histones to the
sites. The phenomenon of CenH3 targeting has raised expectations
that a specific, localized chromatin assembly factor or
histone modification will be involved in the targeting of CenH3s. Indeed, a chromatin remodeler of the RSC family,
PyBAF, localizes to kinetochores during mitosis of mammalian
cells. Furthermore, RSC mutations in budding yeast alter chromatin structure
specifically around centromeres, and perhaps RSC activity
is involved in assembly of centromeric nucleosomes. Mutations
in CAF and Hir genes also give centromere defects, and it has
been suggested that these factors are involved in loading the
yeast CenH3 Cse4p. However, a role for any of these factors
does little to explain the specific targeting of CenH3s, because
these factors are all widely distributed in the nucleus (Ahmad, 2002).
The best candidate for a uniquely centromere-localized chromatin
assembly factor is the Mis6 protein in fission yeast.
This protein is required for centromeric localization of the
CenH3 SpCENP-A, but Mis6 homologs in budding yeast (Ctf3) and in mammals (CENP-I) localize to centromeres
but are not required for targeting CenH3s. Thus, Mis6
proteins appear to be structural components of centromeres, not
histone assembly factors (Ahmad, 2002).
An alternative model is that some feature of centromeric
chromatin facilitates the targeting of its specialized histones. An
obvious candidate for this feature is that centromeric nucleosomes
themselves bind to and thereby recruit new CenH3
tetramers for future deposition. Such an interaction is a possible
molecular mechanism for direct templating of centromere duplication. Regardless of whether CenH3 targeting involves
specialized co-factors, templating, or both, the question remains
as to why it should use an RI pathway (Ahmad, 2002).
The targeted deposition of H3.3 to active genes is likewise
replication-independent, although transcription-coupled assembly
may facilitate (H3.3:H4)2 deposition. Perhaps H3.3 targeting
is mediated by a component of RNA polymerase complexes (Ahmad, 2002).
Because RNA polymerases move processively along the DNA
during transcription, a contiguous transcribed segment of DNA
might incorporate the H3.3 variant. Alternatively, RI deposition
of H3.3 may be facilitated by any of a number of ATP-dependent
chromatin remodeling complexes to target specific sites near
transcription units. Any candidate factor might be expected to
preferentially use H3.3 instead of H3, but whether there is any
such discriminating factor is unknown, because all in vitro studies
of higher eukaryotic chromatin assembly have been performed
with H3. It is anticipated that this will soon be addressed. However,
the prospects for identifying a unique remodeler that is
required for RI deposition are uncertain, because budding yeast
mutants that eliminate any known chromatin assembly factors do
not eliminate chromatin assembly. Thus
the possibility has to be considered that RI deposition at active genes and at centromeres uses generic remodeling activities, and that components or structural aspects common to both centromeres and actively
transcribed genes may result in RI histone deposition at both
kinds of sites (Ahmad, 2002).
The deposition of histones throughout the cell cycle by a
replication-independent process implies that previously existing
nucleosomes are unraveled, and their histones released. It is
known that the process of transcription results in a local unfolding
of the chromatin fiber and an 'open' chromatin configuration. Although transcription of nucleosomal templates with bacterial polymerases can occur in vitro without displacing histone octamers from DNA, in vivo assays demonstrated that a measurable amount of transcription-dependent
histone displacement does occur in eukaryotic nuclei. In fact, even in vitro, RNA polymerase II is virtually unable to transcribe nucleosomal
DNA under physiological conditions. Transcription requires
that histone-DNA contacts be broken for polymerase to
transit the nucleosomal DNA. Although transcription can occur
without histone displacement if the histone octamer releases
some contacts with DNA and maintains others, at some
frequency all contacts might be released. The histone octamer
would then simply fall off. Additionally, localized remodeling
factors will disrupt nucleosome structure as they act. The in vitro
and in vivo observations can be reconciled if histone displacement
occurs occasionally as nucleosomes are disrupted. Constraints on nucleosomes in a compacted chromatin fiber (i.e., 'closed' chromatin) would limit histone displacement (Ahmad, 2002).
Although internucleosome forces within inactive chromatin are
uncharacterized, they have been inferred from numerous experiments,
including the tendency of nucleosomes within hetero-chromatin
to form extremely regular and fixed arrays. A
likely constraint in heterochromatin arises from the multimeric
associations that occur between heterochromatin-specific non-histone
chromatin proteins. Attention has focused on the heterochromatin
protein-1 (HP1). HP1 is recruited to heterochromatic
DNA by binding, through its chromodomain, to the H3 tail
when it is methylated at lysine-9 (H3-K9me). The chromo
shadow domain of HP1 mediates associations between HP1
molecules, and multimers of HP1 bound to methylated histone
tails provides one basis for constraining arrays of nucleosomes (Ahmad, 2002).
Although the state of chromatin in heterochromatin and in
actively transcribed regions is well known, less is known about
the chromatin fiber packaged by centromeric nucleosomes. However, these regions appear to be open. Centromeric DNA is
sensitive to micrococcal nuclease digestion both in budding yeast and in the central core region of fission yeast centromeres
where SpCCENP-A-containing nucleosomes reside, and
plant meiotic centromeres appear decondensed. In addition,
early replication is a feature of open chromatin, and
centromeric chromatin replicates before surrounding heterochromatin (Ahmad, 2002).
An open configuration may arise from at least three
sources. (1) All CenH3s lack a canonical H3 tail. Because
methyl-modification of lysine-9 appears to be the key epitope to
maintain heterochromatin, the lack of this site in centromeric
nucleosomes means that such regions cannot become heterochromatic. Indeed, the heterochromatin protein HP1 is not associated with chromatin packaged by CenH3s. (2) A recent study of Cid homologs in drosophilids has uncovered DNA minor-groove binding motifs in the Cid tail outside of the nucleosome core. Extension of the Cid tail along linker DNA between nucleosomes may inhibit compaction of the
nucleosome strand, thus maintaining these regions in an open
configuration. (3) Chromatin remodeling factors that
destabilize nucleosomes are found both at active genes and centromeres, and their activity will promote histone replacement. It is suggested that an open chromatin configuration is the common basis for RI deposition at centromeres and at actively transcribed genes (Ahmad, 2002).
If open chromatin were the sole basis for RI deposition, then we
would expect that active genes and centromeres would incorporate
both H3.3 and CenH3s. However, their deposition is
mutually exclusive. This exclusivity is likely to rely on multiple
mechanisms that act on all steps in nucleosome assembly. Factors that discriminate between H3.3 and Cid would be the
best candidates for directing these variants to their targets. However, the organization of the nucleus provides a clue as to
another way in which exclusive targeting may be accomplished. Centromeric DNA in Drosophila is flanked by repeated sequences
that are packaged into heterochromatin, and this forms
a compartment at interphase in which centromeres are embedded
in heterochromatin. The active rDNA genes are the
primary sites of H3.3 deposition and they are also found in a
distinct nuclear compartment, the nucleolus, next to the chromocenter (Ahmad, 2002).
This functional nuclear organization is very simple to
see in Drosophila, where all heterochromatin typically associates
into one large chromocenter, and the active rDNA arrays also
often associate to present one large nucleolus. In fact, this
general compartmentalization is almost invariant in eukaryotes,
and has led to the idea that heterochromatin somehow protects
centromeres and NORs. Although both Cid and H3.3
undergo RI deposition, their exclusive targeting could in part be
accomplished by restricting one or both variants within the
nucleus. For example, unincorporated (Cid:H4)2 tetramers
might be sequestered within the heterochromatic chromocenter. Cid deposition would then appear targeted to the centromere,
because this is the only site within the chromocenter with open
chromatin (Ahmad, 2002).
Whether (Cid:H4)2 tetramers are actually sequestered in this
way is unknown. Indeed, whether sequestering substrates can
have any effect on reactions within the nucleus has become a
pressing issue. Many nuclear components remain mobile,
but functional experiments argue that certain effects in the
nucleus actually only occur when components are sequestered. It is likely that some reactions in the nucleus are relatively
independent of localization because they associate efficiently
with their partners and their reactions proceed quickly. Conversely,
reactions that involve weak interactions or multiple steps
may require raising the effective concentration of their substrates
by nuclear sequestration (Ahmad, 2002).
It has been suggested that the heterochromatic compartment
is involved in histone traffic within the nucleus. The
basis of this hypothesis was the realization that Cid-containing
chromatin behaves unusually during S phase. Generally, the
deposition of H3 quickly follows DNA replication. However, the
replication of Cid-containing centromeric DNA occurs without
H3 deposition, implying that the normal coupling between
replication components and nucleosome assembly components
must be broken. Because this coupling is thought to result from
an interaction between chromatin assembly factor 1 histone complexes and PCNA, the simplest explanation for
uncoupling the two processes would be to sequester replicative
nucleosome assembly factors away from centromeres. It is imagined
that unincorporated H3-containing tetramers might be
sequestered in euchromatin in the first half of S phase, and would
thus never (productively) see the replication forks at centromeres
within the heterochromatic compartment. This uncoupling
might be necessary to prevent dilution of centromeric
nucleosomes by conventional nucleosomes that would assemble
after replication-coupled deposition. Genetic experiments in
budding yeast and Drosophila suggest that CenH3s and H3 do
compete for assembly (Ahmad, 2002).
One way that a competition between CenH3 and H3 histones
can be probed is to change their relative concentration. A tagged Cid protein exclusively deposits at centromeres when it is ectopically expressed at low levels
from a heat-shock-inducible promoter. However, it is
apparent that expression from this construct remains low. Re-engineering the transcriptional start region of the construct
to include a translational initiation consensus site now allows
overproduction of Cid in cells (Ahmad, 2002).
To analyze the behavior of excess quantities of Cid protein, an overexpression construct was introduced into Drosophila Kc cells. Cells receive varying amounts of transfected DNA, and
thus express Cid over a wide range of levels. In cells that express
low amounts of the ectopic protein, Cid localizes to centromeres,
as expected. However, a new localization pattern for Cid is seen
at high expression levels: the tagged protein localizes to centromeres
and throughout euchromatin. The incorporation pattern
of ectopic Cid is especially clear on mitotic chromosomes from
these transfections, where the tagged protein is incorporated
throughout the euchromatic arms as well as at centromeres. It is concluded from this result that excess Cid can be
deposited at sites other than centromeres. Normal cells must
have mechanisms to prevent euchromatic deposition, but over-expression
is sufficient, by itself, to overcome this restriction (Ahmad, 2002).
The mis-incorporation pattern of Cid shows an interesting
specificity: Cid can deposit at centromeres and euchromatin but
not in heterochromatin. Therefore, heterochromatin
must either lack the feature that tolerates mis-incorporation, or must
actively exclude Cid. It is argued that centromeres and
euchromatin share the feature of open chromatin, which is
proposed to be the first prerequisite for RI deposition of histone
variants. Indeed, the mis-incorporation of Cid into euchromatin
is replication-independent, because it occurs both when euchromatin
is replicating in early S phase, and in late S phase
when euchromatic replication is complete. It is suggested
that Cid is contaminating open chromatin in the euchromatic
compartment when it is overexpressed (Ahmad, 2002).
What normally prevents the deposition of Cid into euchromatin?
Endogenous Cid is present only at low levels, and
mis-incorporation could be avoided if Cid were sequestered
away from euchromatin in the nucleus. If unincorporated Cid
were sequestered in the heterochromatic chromocenter, it would
be unable to deposit in the closed chromatin of this compartment. Thus, sequestration might serve two purposes: deposition
in euchromatin would be prevented and deposition at centromeres
would be promoted. Overexpression of CenpA in
mammalian cells also mis-incorporates into euchromatin (Ahmad, 2002).
Although it has not been examined whether CenpA mis-incorporation
is replication-independent, this is expected to be the
case, because this is how CenpA deposits at centromeres (Ahmad, 2002).
The idea that histone variants may respect nuclear compartments
was first raised by experiments expressing heterologous
CenH3s in Drosophila Kc and human HeLa cells. These extremely diverged heterologous histones do not localize to centromeres in these cells, implying that there is some kind of specificity for depositing the correct CenH3 at centromeres (Ahmad, 2002).
Surprisingly, heterologous histones are preferentially enriched
in the heterochromatic blocks. It has been suggested that there is a default ability of cells to enrich diverged H3 variants in the heterochromatic compartment. Perhaps heterochromatic enrichment is a
normal first step in the deposition of the endogenous CenH3s (Ahmad, 2002).
Those experiments and overexpression results encourage the
view that nuclear compartments may guide histone variants to
the correct subset of their potential deposition sites. Compartment
effects may also affect the RI deposition of H3.3 in an
inverse way to Cid: i.e., sequestering to promote H3.3 deposition
at active genes, and preventing its deposition at centromeres (Ahmad, 2002).
Because H3.3 is largely identical to H3, the hypothetical element
that is recognized in H3 and results in its exclusion from
chromocenters during centromere replication may also be
present in H3.3. Perhaps this discrimination against canonical
H3 histones also serves to prevent the RI deposition of H3.3 at
centromeres (Ahmad, 2002).
RI assembly permits immediate chromatin repair. The unfolding
of chromatin during transcription may be damaging, in that the
forces RNA polymerases apply to their template DNA should at
least occasionally displace histone octamers from DNA. Additionally, histone octamers may sometimes be displaced by
chromatin remodeling factors associated with transcriptional
activity. In either case, these regions must be repackaged into
nucleosomes. Similarly, replacement of CenH3s may be required
to maintain the nucleosomal configuration of centromeres after
mitosis. Bundles of microtubules drag a chromosome to the pole
during anaphase, and the forces they apply may be sufficient
to occasionally pull off histone octamers. Chromatin would then
be stripped of some CenH3 histone octamers. RI deposition
allows repair of this damage. In fact, the RI deposition of CenpA
in mammalian cells seems to occur around the time of mitosis). The deposition of Cid in Drosophila cells occurs throughout
the cell cycle, but may only be required at two points: as
centromeric DNA replicates to double its chromatin, and after
mitosis to repair stripped chromatin (Ahmad, 2002).
The process of RI assembly at active genes provides a novel
level of control over histone modifications. Replacement of
nucleosomes in one modification state by new histones could
switch chromatin to an active state. Initiation of transcription
would start this process, and successive transits of RNA polymerases
would promote RI assembly. The replacement H3
histone in alfalfa is hyperacetylated, and RI assembly with
acetylated histones could enrich such modifications in active
chromatin. However, histone modification by methylation has
appeared more problematic. A number of histone methyl-transferases
(HMTs) have been characterized, but no
histone demethylase is known. Methylated lysine-9 in the H3 tail
(H3K9me) is a critical epitope for recruiting heterochromatic
chromatin proteins, because this is the binding site for HP1. HP1
recruits additional heterochromatic proteins including the Su-var3-9 HMT. Therefore, it is straightforward to imagine how
these recruited proteins could perpetuate a heterochromatic
state through replication-coupled nucleosome assembly and cell
division (Ahmad, 2002).
Because an irreversible methyl modification appears to specify
the heterochromatic state, it has been unknown how a heterochromatic
site could switch to an active state. One route for
switching might be to prevent the methylation of nucleosomes
assembled during replication. Successive cell cycles could then
dilute methylated nucleosomes, allowing eventual activation (Ahmad, 2002).
However, more rapid mechanisms for activating silenced chromatin
must exist. Induction of silenced genes can occur within a
single cell cycle; for example, X chromosomes become reactivated
and lose H3K9me during diplotene in the Caenorhabditis
ovary. Work using a reporter for heterochromatic
gene silencing suggests that switching to an active
state can occur in somatic cells without cell division. Thus,
H3K9me can be removed without replication-coupled nucleosome
assembly (Ahmad, 2002).
RI deposition implies that the entire heterochromatic nucleosome
may be unraveled and replaced. The process of
transcriptional activation may force the disassembly of H3K9me-containing
nucleosomes, followed by RI assembly of an unmarked
nucleosome. Although the fate of the
displaced methylated H3 is not known, it is know that RI deposition can
occur at any time in the cell cycle, and thus should be able to
rapidly derepress silencing. Conversely, an active gene
could be silenced by methylating the tail of H3.3, which presents
the same lysine-9 epitope. The stability of histone methylation
gives it a distinct advantage over other histone modifications for
heritable effects on chromatin. The possibility of RI deposition
circumvents the irreversible nature of methylation, thus retaining
the potential to switch the heritable chromatin state at a later
time (Ahmad, 2002).
It is concluded that H3 variants are used to package functionally specialized chromatin,
where they play vital functional roles. Localizing these
variants to centromeres and to transcriptionally active regions
utilizes an RI process that is distinct from the nonspecific,
replication-coupled method of packaging the bulk genome. It is
argued that RI deposition is the consequence of the
activities that impinge on these sites in the genome and creates an
open chromatin structure. This flexibility in histone deposition
may be necessary to maintain the nucleosomal structure of these
regions. In higher eukaryotes, the RI deposition process allows
specialized chromatin to be distinguished at the most basic level,
where histone variants are incorporated into chromatin. The
differences between the generic H3 -- which packages the bulk of
the genome -- and the H3 variants may contribute to the physical
properties of specialized regions and recruit particular non-histone
chromatin proteins. Because histones remain associated
with DNA through mitosis, these variants establish heritable
distinctions in chromatin (Ahmad, 2002).
Centromeres are a defining feature of eukaryotes, and all are
likely to have a CenH3. However, the utilization of two conserved
versions like H3 and H3.3 is not universal. For example,
budding yeast has only one canonical H3 histone, which undergoes
both replication-coupled and RI deposition. Surprisingly,
this is H3.3: phylogenetic analysis reveals that ascomycetes
have lost H3, whereas their sister clade basidiomycetes have both
H3 and H3.3, as do animals. Therefore, an H3.3
gene performs all general functions in some organisms. The
extraordinary conservation of H3.3, which is identical from
mollusks to mammals, speaks to its fundamental role in the
eukaryotic nucleus (Ahmad, 2002).
The conserved 3'-5' RNA exonuclease ERI1 is implicated in RNA interference inhibition, 5.8S rRNA maturation and histone mRNA maturation and turnover. The single ERI1 homologue in Drosophila melanogaster Snipper (Snp) is a 3'-5' exonuclease, but its in vivo function remains elusive. This study reports a Snp requirement for normal Drosophila development, since its perturbation leads to larval arrest and tissue-specific downregulation results in abnormal tissue development. Additionally, Snp directly interacts with histone mRNA, and its depletion results in drastic reduction in histone transcript levels. It is proposed that Snp protects the 3'-ends of histone mRNAs and upon its absence, histone transcripts are readily degraded. This in turn may lead to cell cycle delay or arrest, causing growth arrest and developmental perturbations (Alexiadis, 2017).
Acetylation and methylation: Covalent modifications of chromatin and DNA that establish and maintain the heterochromatin-induced silenced state A self-reinforcing network of interactions among the three best-characterized covalent modifications that mark heterochromatin (histone hypoacetylation, histone H3-Lys9 methylation, and cytosine methylation) suggests a mechanistic basis for spreading of heterochromatin over large domains and for stable epigenetic inheritance of the silent state. Early cytological studies have distinguished two types of chromatin: euchromatin and heterochromatin. Heterochromatin was originally defined as that portion of the genome that remains condensed and deeply staining (heteropycnotic) as the cell makes the transition from metaphase to interphase; such material is generally associated with the telomeres and pericentric regions of chromosomes. Subsequent work has identified a cluster of structural features that characterizes heterochromatin. While heterochromatic regions are rich in repetitive sequences and have a low gene density, they are not devoid of genes; it is estimated that there are ~40-50 genes within the pericentric heterochromatin of Drosophila. An altered packaging of heterochromatin, to a less-accessible form, has been demonstrated by probing with nucleases and other reagents such as prokaryotic DNA methyltransferases. The data suggest that while nucleosome arrays in euchromatin are irregular, punctuated by the nucleosome-free hypersensitive sites (HS sites) characteristic of active genes, the nucleosomes in heterochromatin have a regular spacing over large arrays, with a higher proportion of the DNA associated with the histone core rather than in the linker. Euchromatic regions silenced by nucleosome packaging is referred to as 'silent chromatin,' reserving the term 'heterochromatin' for the classically defined heterochromatin (Richards, 2002).
It is an interesting paradox that while the histones are among the most conserved proteins known in evolution, they are also among the most variable in posttranslational modification. The pattern of modifications has been suggested to act as an information code (the histone code), dictating both nucleosomal interactions and the association of nonhistone chromosomal proteins that collectively influence packaging and gene regulation. Modifications include acetylation, methylation, phosphorylation, ubiquitination, and ADP-ribosylation. Given the number of sites of posttranslational modification for each of the four core histones, an imposing number of differently modified nucleosomes is possible. The modification states of the N-terminal tails of histones H3 and H4 appear to play a major role in heterochromatin formation (Richards, 2002).
One modification of histones, hypoacetylation of lysine residues, is associated with both formation of heterochromatin and gene silencing. Early attempts to fractionate chromatin and characterize the components led to the suggestion that heterochromatic domains were associated with hypoacetylated histones, while euchromatic domains were associated with hyperacetylated histones. This distinction is observed not only between constitutive heterochromatin and euchromatin, but also in mapping studies comparing an active or inducible gene to flanking regions (Richards, 2002).
Histone H3 methylated at lysine 9 (H3-mLys9), a second modification of histones, has been identified as characteristic of the heterochromatic state. Immunofluorescent staining of Drosophila polytene chromosomes shows that the bulk of the H3-mLys9 is present in the pericentric heterochromatin and in a banded pattern on the fourth chromosome, known sites of repetitive DNA (Jacobs, 2001). Similarly, chromatin immunoprecipitation (ChIP) experiments demonstrate that H3-mLys9 is a prominent component of the silent mating type locus in fission yeast (Schizosaccharomyces pombe), while essentially absent from flanking regions containing inducible genes. Methylation of histone H3-Lys9 has also been associated with the silencing of euchromatic genes (Richards, 2002).
A third biochemical marker of heterochromatin is the most common form of DNA modification in eukaryotes, namely cytosine methylation. Although absent in some eukaryotes, this DNA modification is widely distributed in the eukaryotic kingdom. It is particularly prevalent in plants and mammals where it is an important epigenetic mark that contributes to the stability of pericentromeric heterochromatin and plays a central role in cementing and maintaining epigenetic expression states, not only in heterochromatin but in silenced euchromatic domains (Richards, 2002).
Hypoacetylation, particularly of histones H3 and H4, associated with heterochromatic domains from a range of organisms, has been studied in greatest detail in Saccharomyces cerevisiae. Many of the cis- and trans-acting factors necessary to establish and maintain the silent state at the telomeres and HML/HMR loci have been identified. These studies have demonstrated the need for hypoacetylated histones. Silencing is mediated by the multiprotein, nucleosome binding SIR(1-4) complex, recruited by interaction with specific DNA binding proteins. Sir3 and Sir4 interact specifically with the N-terminal tails of histones H3 and H4 in the hypoacetylated state. While the N-terminal tails of the histones are not required individually for growth in yeast, they do play an essential role in silencing, amino acids 4-20 of H3 and 16-29 of H4 being required. Certain sir3 alleles can suppress the silencing defect of histone H4 tail mutations, and Sir3 and Sir4 can bind to the amino termini of histones H3 and H4 in vitro, suggesting direct interaction. Recent studies using antibodies against different histone acetylated isoforms indicate that histones in the telomeric and HML/HMR heterochromatin are hypoacetylated at all modification sites (Richards, 2002 and references therein).
What is the mechanism for histone hypoacetylation specifically at the heterochromatic domains? This function is apparently provided, at least in part, by Sir2 (Drosophila homolog: Sir2), shown to have a NAD-dependent protein deacetylase activity. Sir2 can efficiently deacetylate histones in vitro, preferentially deacetylating histone H4 at Lys16, although direct action in vivo has not yet been reported. Enzymatic activity of Sir2 is required for silencing in the heterochromatic domains. The acetylation status of H4-Lys16 may be of particular importance. Lys16 is the preferred site of acetylation in monoacetylated H4 of euchromatin in yeast, and this is the only acetylatable H4 site whose mutation strongly affects Sir3 binding in heterochromatin. Deletion of sir3 results in increased histone acetylation in heterochromatic domains, as well as a loss in silencing. The results suggest an assembly model in which interaction of the Sir2-Sir4 complex with specific DNA binding proteins leads to local histone deacetylation, permitting binding of Sir3. It appears that binding of Sir3 to the hypoacetylated histone blocks reacetylation. Given the interactions between Sir2, Sir4, and Sir3, once initiated, such a complex could spread along the nucleosome array, generating and maintaining the altered modification state (Richards, 2002 and references therein).
In addition to the above, studies in S. pombe, Drosophila, and other organisms suggest that the histone acetylation level is used as a heritable mark of the chromatin state. Mutations in HDACs or treatment with trichostatin A (TSA), an inhibitor of some HDACs, frequently results in a loss of function in heterochromatic domains and a relaxation of silencing. For example, treatment with TSA results in functionally deficient centromeres and chromosome loss in S. pombe, concomitant with a loss of silencing for test genes within the centromeric heterochromatin. The hyperacetylated state is heritable following removal of TSA; it is linked in cis to the treated centromere locus, and correlates with inheritance of functionally defective centromeres, demonstrating an epigenetic phenomenon based on the chromatin structure. In contrast, acetylation is used as an inherited mark of activity. Histone H4-aLys16 is prominently associated with the dosage-compensated, 2-fold active X chromosome in males of Drosophila. This specific modification is due to MOF, an essential acetyltransferase of the dosage compensation complex that coats the male X chromosome. The complex remains associated with its target DNA throughout the cell cycle, providing the means to replicate the modification state. Recruitment of HATs to chromosome regions showing histone acetylation patterns corresponding to their own catalytic specificity has been observed, e.g., the histone acetyltransferase P/CAF binds preferentially to acetylated H4 and H3 peptides via a bromo domain. These observations provide evidence for use of the histone acetylation state as an epigenetic mark (Richards, 2002 and references therein).
How does hypoacetylation impact chromatin structure? In the case described above, the hypoacetylated histone tails interact specifically with the SIR complex. While Sir2 orthologs have been identified, few proteins with similarity to the other Sir proteins have been found in multicellular eukaryotes. Nonetheless, there may be an equivalent of the SIR complex that makes similar use of the histone hypoacetylation signal. However, a significant effect might be realized through the interaction of the histone H3/H4 tails with the DNA and/or other nucleosomes in the chromatin fiber. The regions of the histone H3 and H4 tails that contribute to DNA binding, as observed in the crystal structure, are necessary for silencing of basal transcription in vivo. However, these regions are distinct from those critical for repression at the HM loci and telomeres. It appears unlikely that simply weakening intranucleosomal histone-DNA interactions by histone acetylation could alleviate the inhibitory effect of heterochromatin structure on transcription. An alternative possibility was suggested by the original crystals of the nucleosome (using histones from Xenopus), where histone H4 amino acids 16-24 were observed to interact with the acidic region formed by histones H2A and H2B on the surface of the adjacent histone octamer. The eight H2A/H2B amino acids involved in forming this negatively charged patch are highly conserved. Acetylation of the H4 tail might disrupt this interaction, leading to a loss of compaction along the chromatin fiber. However, this disposition of the histone H4 tail is not seen in crystals of the nucleosome made using yeast histones, and additional studies are needed to resolve this interesting question (Richards, 2002 and references therein).
A key role for a second histone modification in the specification of heterochromatin is shown by the recent demonstration that mammalian homologs of Drosophila Su(var)3-9, including human SUV39H1 and murine Suv39h1, encode enzymes that specifically methylate histone H3 on lysine 9 (Rea, 2000). Su(var)3-9 was originally identified as a suppressor of PEV in Drosophila, indicating that the wild-type gene product is involved in heterochromatin formation (Tschiersch, 1994). A homolog in S. pombe, Clr4, is also a specific histone H3-Lys9 methyltransferase, suggesting that this activity is widely distributed and well conserved. clr4 mutants exhibit reduced heterochromatin formation at centromeres, with elevated mitotic chromosome loss and reduced silencing within both pericentromeric heterochromatin and the silent mating type locus. Similarly, mammalian Su(var)3-9-like proteins have been implicated in both centromere activity and gene silencing. Disruption of the murine Suv39h1 and Suv39h2 paralogs causes genome instability, chromosome mis-segregation, and male meiotic defects (Richards, 2002 and references therein).
Further, the Suv39h1/SUV39H1 proteins are found in association with M31, a mouse Heterochromatin protein 1 (HP1) homolog. HP1, perhaps the best-characterized protein found in heterochromatin, was identified in Drosophila melanogaster in a screen of monoclonal antibodies prepared against proteins tightly bound in the nucleus. Immunofluorescent staining of the polytene chromosomes shows HP1 concentrated in the pericentric heterochromatin, the telomeres, and a banded pattern across the small fourth chromosome, known sites of repetitive DNA with characteristics of heterochromatin. A few prominent HP1 sites are observed within the euchromatic arms (e.g., region 31). Homologs of HP1 are associated with pericentric heterochromatin in organisms from S. pombe to humans. The protein (206 amino acids in Drosophila) has a conserved N-terminal chromo domain (CD) followed by a variable hinge region and a conserved C-terminal chromo shadow domain (CSD). The chromo domain was first recognized by similarity with a domain in Polycomb, a protein associated with silencing of the homeotic genes during development; this domain has now been identified in many other chromosomal proteins. Both point mutations in the chromo domain and presumed null mutations (early truncation of the translation product) in the gene encoding HP1 [Su(var)2-5] result in a loss of silencing, while an additional dose will increase silencing of a variegating euchromatic gene, i.e., one placed in a heterochromatic environment. Interestingly, the converse is true for those few genes normally resident within the pericentric heterochromatin (e.g., light), which appear to be dependent on HP1 for normal activity. The conserved structure of HP1 suggests that it might serve as a bifunctional reagent, helping to organize and maintain heterochromatin structure. HP1 interacts with a number of other chromosomal proteins, including several involved in nuclear assembly, replication, and gene regulation. These interactions have generally been mapped to the chromo shadow domain. The chromo shadow domain can homodimerize, and the dimer has been suggested to be the interactive species (Richards, 2002 and references therein).
The HP1 chromo domain specifically binds histone H3 N-terminal tails methylated on lysine 9, and a variety of data suggest that this interaction is essential for maintenance of heterochromatin. The interaction appears quite specific; neither the chromo domain of Polycomb nor the chromo shadow domain of HP1 shows this interaction. The H3 tail fits within a groove established by conserved chromo domain residues; Su(var) mutation V26M results in an alteration of the structure and loss of H3-mLys9 binding. Studies in mammalian cells suggest that localization of HP1 in heterochromatin is dependent on the presence of histone H3-mLys9. However, HP1 association with heterochromatin in Drosophila can be driven either by the N-terminal portion (with the chromo domain) or the C-terminal portion (with the shadow domain), emphasizing the bifunctional nature of the protein. The above results argue that an interaction between the specifically modified histone H3 and HP1 is essential for maintaining a stable heterochromatin structure (Richards, 2002 and references therein).
Histone H3-Lys9 methylation is influenced by preexisting modifications of histone H3 and affects other histone modifications, implying a set of functional interactions. The relationship between hypoacetylation of H3/H4 and methylation of H3 has been clarified by studies of heterochromatin formation in S. pombe. clr1-clr4, clr6, swi6, and rik1 mutations all identify trans-acting factors necessary for silencing at the S. pombe mating type locus. Swi6 is a homolog of HP1, while clr1 and rik1 code for putative DNA binding proteins. The products of clr3 and clr6 are homologs of HDACs. Clr4 is the H3-Lys9 methyltransferase. These genes work together, acting on the entire silent mating type domain to maintain it in the repressed state. Clr3, an H3-specific deacetylase, and Rik1 are required for histone H3-Lys9 methylation by Clr4, and Swi6 localization is dependent on Clr4 and Rik1 (Richards, 2002 and references therein)
These observations suggest a progression of events leading to establishment of a distinctive heterochromatic structure based on the histone modification pattern. Deacetylation of histone H3 by Clr6 and/or Clr3 creates conditions favoring methylation at H3 Lys9 by the Clr4/Rik1 complex; methylation leads to binding of Swi6, establishing a chromatin configuration that is refractory to transcription and stably maintained. Mapping studies using chromatin immunoprecipitation show H3-mLys9 and Swi6 found throughout, and limited to, the 20 kb silent mating type domain. This 20 kb region is flanked by inverted repeats IR-L and IR-R, which appear to serve as barriers to the spread of silencing; removal of these repeats results in the appearance of H3-mLys9 and Swi6 on neighboring sequences. Silencing is dependent on the dosage of Swi6, which remains bound to the mating type region throughout the cell cycle and may itself be a marker for heterochromatin formation (Richards, 2002 and references therein).
The findings suggest a mechanism for maintaining heterochromatin structure following replication and for driving the spread of heterochromatin. During replication, the DNA must be 'unpackaged' and the daughter DNA molecules repackaged into nucleosomes. Parental histones are efficiently reutilized, distributed randomly to the two daughter DNA molecules; an equal amount of newly synthesized histone is required to complete assembly. Assuming that the histone H3-mLys9 in a heterochromatic domain is stable (no histone demethylases have been identified as yet), it will associate with HP1 through the chromo domain. The presence of HP1 will result in assembly of a modifying complex, presumably through the chromo shadow domain, that will deacetylate and specifically methylate the newly arrived histone, perpetuating the pattern of modification and HP1 binding to establish a heterochromatic structure. Recovery of a SUV39H1-HDAC1 complex from Drosophila embryo extracts that can methylate preacetylated histones supports such a model (Czermin, 2001). Formation of complexes that both recognize a particular pattern of histone modification and have the ability to achieve that pattern provides a mechanism for epigenetic inheritance of chromatin structure. The same machinery could account for spreading of heterochromatin, requiring that boundaries to such spread be established (Richards, 2002 and references therein).
Genetic analyses in S. pombe and Drosophila indicate that while the H3-mLys9/HP1 system is critical for heterochromatin formation and silencing in pericentric heterochromatin, it is of less importance at the telomeres, suggesting that an additional mechanism is used in those domains. Association of HP1 and a dependence on Su(var)3-9 activity have also been identified as critical in silencing particular euchromatic genes, both in mammalian systems and in Drosophila (Hwang, 2001). Interestingly, it appears that the histone H3-mLys9 modification at Rb-associated genes is quite limited; one nucleosome at the promoter is so modified, while an immediately upstream nucleosome is not, suggesting a difference in the capacity of the modified structure to spread. Histone H3-mLys9 is also associated with the inactive X chromosome in human cells, but no HP1 homologs have been identified preferentially associated with this domain. Whether differences in the degree of histone methylation or other modifications of histone H3 are important in determining any partner of H3-mLys9 in this case remains to be seen (Richards, 2002 and references therein).
A third silent chromatin mark, 5-methylcytosine (5mC), affects the DNA itself. Postreplicative methylation of cytosine is carried out by a diverse group of cytosine DNA methyltransferases (Dnmt's). Beyond this, little is known about the mechanisms that establish, maintain, and modify cytosine methylation patterns. At the whole genome level, it is clear that cytosine methylation patterns can be quite dynamic. The best example is the erasure and resetting of cytosine methylation in early mammalian development. However, large swings in cytosine methylation levels have not been detected during zebrafish development, and the evidence in plants is contradictory. Regardless of whether de novo methylation occurs every generation or in rare initiating events, certain DNA sequences must be targeted for cytosine methylation. At present, little is understood about the primary DNA sequence determinants for targeting, if any. Analysis of a Neurospora sequence prone to de novo methylation indicates the presence of redundant elements promoting methylation and suggests that TpA-rich sequences may be important. Unfortunately, similar detailed studies are not available in other organisms. Certain cytosine methyltransferases, such as mouse Dnmt3a and Dnmt3b, are specialized to carry out de novo methylation. However, these enzymes do not appear to have the intrinsic capacity for discrimination among primary nucleotide sequences, nor among higher-order structures. These considerations suggest that de novo cytosine methyltransferases might be taking cues from another epigenetic mark (Richards, 2002 and references therein).
Communication between the histone code and cytosine methylation may provide at least a partial answer to the long-standing question of how cytosine methylation patterns are established. The most direct evidence for a connection with histone methylation comes from genetic screens for cytosine hypomethylation mutants in Neurospora. The genome of this filamentous fungus contains 5-methylcytosine (~1.5 % of total C) concentrated in repetitive DNA (e.g., rRNA genes) and remnants from RIP activity (repeat induced point mutation, a hypermutation surveillance system that detects sequence duplications). Two Neurospora mutations completely abolish cytosine methylation in vegetative cells. One of these, dim-2, disrupts a gene encoding a cytosine methyltransferase. The other, dim-5, maps to a gene encoding a histone H3 methyltransferase. The predicted DIM-5 gene product contains a SET domain flanked by cysteine-rich elements and has sequence similarity to the histone methyltransferases Clr4 and Su(var)3-9, although it lacks a chromo domain. Recombinant DIM-5 protein exhibits histone methyltransferase activity in vitro. Strikingly, transformation of Neurospora with modified histone H3 genes with a substituted amino acid at Lys9 (the probable site of methylation by DIM-5) reduces cytosine methylation and relieves 5mC mediated gene silencing. Given that the dim-5 mutation appears to abolish all cytosine methylation, the results suggest that all DNA methylation in Neurospora takes its cue from histone H3-mLys9. It will be important to determine whether the histone methylation-DNA methylation connection is also found in other organisms, and if so, whether all cytosine methylation lies downstream of histone methylation. The dim-5 mutation causes more phenotypic defects than the dim-2 cytosine methyltransferase mutation, suggesting that a histone methylation deficiency has effects beyond those that result from loss of cytosine methylation (Richards, 2002 and references therein).
A connection between the histone code and the 5mC code is supported by other findings. The presence in flowering plants of cytosine methyltransferases that contain a chromo domain is particularly intriguing. Such 'chromo methyltransferases' (CMTs) might be recruited to a genomic region by nucleosomes containing histone H3-mLys9; thus, histone modification would provide a foundation for establishing DNA methylation patterns. However, the CMTs have not yet been demonstrated to bind methylated histone H3, nor is it clear that these methyltransferases possess de novo methyltransferase activity. Moreover, chromo methyltransferases have not been documented outside of plant species. Consequently, chromo methyltransferases are unlikely to be solely responsible for translating the histone methylation code into the 5mC epigenetic mark (Richards, 2002 and references therein).
Indirect models for the flow of information from histone H3-mLys9 to 5mC also need to be considered. The H3-mLys9 mark creates a foundation for HP1 interaction and subsequent heterochromatin formation. Cytosine methylation may be targeted to heterochromatin due to any number of characteristics, including nonhistone chromosomal protein content, subnuclear localization, or DNA replication timing. Disruption of heterochromatin by loss of the H3-mLys9 mark may lead to loss of 5mC through a number of intermediary steps. A 'chromatin first/cytosine methylation second' model is consistent with the demonstration that loss or alteration of cytosine methylation can be caused by mutations in SWI2/SNF2-like proteins in Arabidopsis, mice, and humans (Richards, 2002 and references therein).
Once 5mC patterns have been established, they must be maintained in order to serve as an inherited epigenetic code. The potential of cytosine methylation as a mitotic memory device was first described in the 'maintenance methylation' model. The essential feature of the model is clonal inheritance of the 5mC patterns through mitotic, and possibly meiotic, divisions based on the symmetrical nature of the sequences modified (e.g., CpG) and the specificity of 'maintenance' DNA methyltransferases for hemimethylated DNA. The basic tenets of the maintenance methylation model have been supported by a wealth of evidence. The bulk of cytosine methylation occurs very shortly after DNA replication, catalyzed by methyltransferases that have hemimethylated substrate preferences, recruited to the vicinity of the replication fork by interaction with PCNA. However, the classic maintenance methylation model is inadequate to explain the variability of 5mC patterns within individuals and omits some of the known components of the cytosine methylation system. Not all cytosine methylation occurs at short symmetrical sequences, so a simple maintenance methyltransferase, making reference solely to cytosine methylation on the template strand, cannot perpetuate methylation patterns. Maintenance of 5mC patterns at nonsymmetrical sites might involve reiterated de novo methylation and may represent an additional tier of DNA methylation superimposed on the pattern of 5mC at symmetrical sites. The machinery necessary to maintain 5mC at asymmetric sites has not been firmly established, but clues are emerging. Dnmt3a has been implicated in the synthesis of 5mC at asymmetric sites in mice. In Neurospora, a single cytosine methyltransferase, DIM-2, is responsible for all vegetative 5mC, including both symmetrical and asymmetrical sites. In plants, 5mC in asymmetrical sequences has been associated with chromo methyltransferases and RNA-dependent DNA methylation (Richards, 2002 and references therein).
The classical methylation maintenance model accounts for loss of 5mC through a passive mechanism: DNA replication in the absence of maintenance methylation. Cytological data using immuno-detection of 5mC argue that passive demethylation causes the dramatic erasure of DNA methylation patterns in early mammalian development. However, observation of 5mC loss in the absence of DNA replication has suggested an active demethylation mechanism as well. A 5mC-DNA glycosylase might also contribute to the dramatic swings in cytosine methylation seen in mammalian development (Richards, 2002 and references therein).
The execution of gene silencing from the 5mC mark involves modulation of another epigenetic mark: hypoacetylation of histones. Two independent pathways have been discovered in vertebrates connecting 5mC to histone deacetylation. The first uses methyl cytosine binding proteins, MeCP, or MBD (methyl binding domain) proteins as adaptors connecting 5mC to histone deacetylase complexes. Several MBD/MeCP protein-HDAC complexes have been identified in mammalian cells. These complexes act to reduce local histone acetylation levels using the 5mC marks on the DNA as a guide (Richards, 2002 and references therein).
A second pathway, also uncovered in mammals, operates through a physical interaction between the maintenance cytosine methyltransferase DNMT1 and HDACs. The catalytic domain of DNMT1 is not necessary for this interaction, suggesting that this cytosine methyltransferase is actually a transcriptional corepressor independent of its ability to methylate DNA. This interaction could act to reinforce inheritance of silent chromatin by facilitating histone deacetylation at the replication forks, where DNMT1 acts to maintain the 5mC epigenetic mark on methylated DNA sequences (Richards, 2002 and references therein).
Epigenetic information may also flow from the histone acetylation state back to cytosine methylation. The HDAC inhibitor TSA leads to cytosine hypomethylation at specific sequences in Neurospora, and a similar effect has been noted in mammalian cells. The loss of DNA methylation may be related to transcriptional activation, but other mechanisms have been proposed, including activation of cytosine demethylases. Inhibition of histone deacetylation does not lead to global loss of DNA methylation, however. For example, disruption of a histone deacetylase gene in plants did not lead to a generalized loss of 5mC despite a 10-fold elevation in histone H4 acetylation. Regardless of the significance of the retrograde signaling, the well-established flow of information from 5mC to histone deacetylation closes the loop of a self-reinforcing cycle for those organisms that utilize cytosine methylation (Richards, 2002).
The cycle of epigenetic marks discussed here suggests that initiation of heterochromatin formation, or similar silencing of euchromatic domains, requires acquisition of at least one epigenetic mark. What is known about entry into the cycle? In S. cerevisiae, protein interactions with specific cis-acting DNA sequences, such as E and I at the HM loci, or telomeric repeats, provide the foundation to recruit the SIR silencing complexes. The EF2-Rb-SUV39H1-HP1 interaction in mammals also implicates specific DNA sequences (binding sites for EF2) as initiation sites for silencing. Silencing within the mating type locus of S. pombe appears to be controlled both by local elements (REII and mat3 silencer) operating similarly to E and I in S. cerevisiae and by packaging of the domain as a whole, dependent on a block of repetitive DNA. In other organisms, the repetitive nature of the locus, rather than the primary DNA sequence, may be a trigger. The mechanisms at work are not clear, but hints can be derived from the repeat sensing/silencing phenomena in filamentous fungi, MIP (methylation induced premeiotically) in Ascobolus and RIP in Neurospora. In these systems, repeats appear to be recognized by a DNA-DNA pairing mechanism. In Ascobolus, cytosine methylation can be transferred between alleles, accompanying meiotic pairing and recombination events. RNA signals may provide another entré into the cycle of epigenetic silencing. Two noncoding RNA species, Xist and Tsix, are pivotal for initiation and choice in X chromosome inactivation in mice, where H3-Lys9 methylation is an early event. RNA may also have a role in initiating silent chromatin formation by directing the acquisition of cytosine methylation marks. Resolution of this question will be one of the major goals of future research (Richards, 2002 and references therein).
Once a genomic region has been targeted for silencing by acquisition of one or more covalent epigenetic marks, a silent chromatin identity can be propagated. The general features of the system include (1) positive signaling between the different covalent epigenetic marks and (2) enzymatic complexes/pathways that recognize each mark and catalyze the formation of the same mark. For example, in yeast, the histone H3/H4 deacetylation mark is recognized by Sir3, leading to recruitment of the Sir2 histone deacetylase. The histone H3-mLys9 mark is recognized by HP1, which can apparently recruit the histone methyltransferase activity of Su(var)3-9 homologs. The third self-reinforcing loop is carried out by maintenance cytosine methyltransferases, which have a substrate preference for hemimethylated DNA. The modification pathways operating on each covalent mark also interact and reinforce each other (Richards, 2002 and references therein).
In organisms lacking 5mC, a histone modification code appears to be sufficient to mark and perpetuate silent chromatin domains. The feedback loop between histone methylation and histone deacetylation, coupled with mechanisms to maintain these modifications, apparently provides stable silencing. In fact, S. cerevisiae appears to utilize neither DNA modification nor the HP1/histone H3-mLys9 complex, relying solely on deacetylation of histones H3/H4 as an epigenetic mark to maintain silencing. The transmission of chromatin states requires that at least one of the covalent marks be inherited through mitotic, and possibly meiotic, cell divisions. All three of these marks meet the criteria of persistence through mitosis (Richards, 2002).
While self-reinforcing mechanisms may be advantageous to ensure maintenance of silencing forgenomic sequences to be archived for the long-term in a nonexpressed state (e.g., transposons, pericentromeric repeats), there may be a need to reconfigure silenced chromatin as a prerequisite to expression of specific genes (e.g., mating type switching). In this case, what general mechanisms can be used to break the heterochromatin reinforcing cycle? Removal of the histone H3-mLys9 mark may require turnover of the entire protein, since no histone demethylase has yet been identified. Histones, however, are generally very stable. In comparison, the 5mC mark is more easily erased by passive or active demethylation mechanisms. The most malleable mark is the deacetylation of histones, the levels of which are set by the competing activities of histone acetylases and histone deacetylases (Richards, 2002).
Heterochromatin-mediated repression is essential for controlling the expression of transposons and for coordinated cell type-specific gene regulation. The small ovary (sov) locus was identified in a screen for female-sterile mutations in Drosophila melanogaster, and mutants show dramatic ovarian morphogenesis defects. The null sov phenotype is lethal and maps to the uncharacterized gene CG14438, which encodes a nuclear zinc-finger protein that colocalizes with the essential Heterochromatin Protein 1 (HP1a). Sov functions to repress inappropriate gene expression in the ovary, silence transposons, and suppress position-effect variegation in the eye, suggesting a central role in heterochromatin stabilization (Benner, 2019).
Polycomb response elements (PREs) are cis-regulatory sequences required for Polycomb repression of Hox genes in Drosophila. PREs function as potent silencers in the context of Hox reporter genes and they have been shown to partially repress a linked miniwhite reporter gene. The silencing capacity of PREs has not been systematically tested and, therefore, it has remained unclear whether only specific enhancers and promoters can respond to Polycomb silencing. Using a reporter gene assay in imaginal discs, it has been shown that a PRE from the Drosophila Hox gene Ultrabithorax potently silences different heterologous enhancers and promoters that are normally not subject to Polycomb repression. Silencing of these reporter genes is abolished in PcG mutants and excision of the PRE from the reporter gene during development results in loss of silencing within one cell generation. Together, these results suggest that PREs function as general silencer elements through which PcG proteins mediate transcriptional repression (Sengupta, 2004).
A 1.6 kb fragment encompassing the PRE from the Ubx
upstream control region was tested for its capacity to prevent transcriptional activation by enhancers from genes that are normally not under PcG control. For this
purpose, three different enhancers were tested in a lacZ reporter
gene assay in imaginal discs: dppWE, the imaginal disc
enhancer from the decapentaplegic (dpp) gene; vgQE the quadrant enhancer from the
vestigial (vg) gene; and vgBE, the vg D/V boundary enhancer. If linked to a reporter gene, each of these enhancers directs a distinct pattern of expression in the wing imaginal disc and activation by each enhancer is regulated by transcription factors that are
controlled by a different signaling pathway. Specifically, the dpp
enhancer contains binding sites for the Ci protein and is activated in
response to hedgehog signaling, the vg quadrant enhancer contains binding sites
for the Mad transcriptional regulator and is activated in response to
dpp signaling, and the vg boundary enhancer contains binding sites for the Su(H) transcription factor and is regulated by Notch signaling.
The dppWE, vgQE and
vgBE enhancers were individually inserted into a lacZ reporter gene construct that contained the PRE fragment and either a TATA box
minimal promoter from the hsp70 gene (here referred to as
TATA), or a 4.1 kb fragment of the proximal Ubx promoter
(here referred to as UbxP), fused to lacZ. In each construct, the PRE fragment was flanked by FRT sites that permit excision of the PRE fragment by flp recombinase. Several
independent transgenic lines for each of the six PRE transgenes were generated. From individual transgene insertions, derivative transgenic lines were then generated by flp-mediated excision of the PRE in the germline. Thus
expression of individual transgene insertions could be compared in the presence and absence of the PRE by staining wing imaginal discs for ß-galactosidase (ß-gal) activity. In the absence of the PRE, each of the three enhancers tested directs ß-gal expression in a characteristic previously characterized pattern. Each enhancer activated expression in the same pattern from
either the TATA box minimal promoter or the Ubx promoter with some
minor, promoter-specific differences with respect to the expression levels. By contrast, in most of the
parental transformant lines, i.e., those carrying the corresponding reporter gene with the PRE, ß-gal expression is completely suppressed. These observations suggest that the PRE fragment very potently silences each of the six reporter genes. It is noted, however, that, at some transgene insertion sites, efficiency of silencing by the PRE fragment appeared to be impeded by flanking chromosomal sequences; in these cases, it was found that ß-gal expression is activated even in the presence of the PRE (Sengupta, 2004).
To test whether silencing of the reporter genes by the PRE depends on PcG gene function, the PRE-containing transgenes
>PRE>dppWE-TATA-lacZ and
>PRE>vgQE-Ubx-lacZ were introduced into larvae that carried mutations in the PcG gene Suppressor of zeste 12 [Su(z)12]. Su(z)12 encodes a core component of the Esc-E(z) histone methyltransferase. Silencing of both transgenes is lost in
Su(z)122/Su(z)123 mutant larvae, and the
transgenes express ß-gal expression at levels comparable with the
transgene derivatives that lack the PRE fragment. Taken together, these observations suggest that the 1.6 kb PRE fragment from Ubx is a very potent general transcriptional silencer element that represses transcription in a PcG protein-dependent manner. Thus, it appears that this PRE acts indiscriminately to block transcriptional activation by a variety of different activator proteins (Sengupta, 2004),
To test the long-term requirement for the PRE for silencing of these
reporter genes, the PRE was excised during larval development and ß-gal expression was then
monitored at different time points after excision.
Forty-eight hours after induction of flp expression, all six reporter genes
showed robust derepression of ß-gal, suggesting that, in each
case, removal of the PRE results in the loss of PcG silencing.
Among the different enhancer-promoter combinations used in this study, the
dppW enhancer fused to the TATA box minimal promoter
appears to direct the highest levels of lacZ expression;
>PRE>dppWTZ transformant lines consistently
show the strongest ß-gal staining after excision of the PRE. Therefore >PRE>dppW-TZ transformants were analyzed at 4, 8, 12 and 24 hours
after induction of flp expression to study the kinetics of this derepression.
No ß-gal signal was detected at 4 hours or even at 8 hours after flp
induction, but 12 hours after flp induction, all discs showed robust ß-gal expression. Thus, even
in the case of the most potent enhancer-promoter combination used (i.e.
dppW enhancer and TATA box minimal promoter), a
delay of 12 hours between flp induction and ß-gal expression was observed. Since the average cell cycle length of imaginal disc cells in third instar larvae is 12 hours, this implies that most disc cells have undergone a full division cycle within this period. Derepression of the reporter gene in
this experiment requires several steps: (1) excision of the PRE by the flp
recombinase; (2) dissociation of the PRE and PcG proteins attached to it
-- possibly by disrupting PcG protein complexes formed between the PRE
and factors bound at the promoter, and (3) transcriptional activation by factors binding to the enhancer in the construct. It is possible that one or several steps in this process require a specific process during the cell cycle (e.g., passage through S phase) (Sengupta, 2004),
These experiments here show that three reporter genes, each containing a different enhancer linked to a canonical TATA box promoter, are completely silenced by a PRE placed upstream of the enhancer. The data suggest that PcG proteins that act through this PRE prevent indiscriminately activation by a variety of different transcription factors. The PcG machinery thus does not seem to require any specific enhancer and/or promoter sequences for repression (Sengupta, 2004),
Two points deserve to be discussed in more detail. The first concerns the stability of silencing imposed by a PRE. Previous studies have suggested that transcriptional activation in the early embryo could prevent the establishment of PcG silencing by PREs. More specifically, early transcriptional activation of Hox
genes by blastoderm enhancers may play an important role in preventing the
establishment of permanent PcG silencing in segment primordia in which Hox
genes need to be expressed at later developmental stages.
Importantly, none of the three enhancers used in this study is active in the early embryo. Moreover, these enhancers probably do not contain binding sites for specific transcriptional repressors, such as the gap repressors, which are required for establishment of PcG silencing at some PREs in the early embryo. It is therefore imagined that, in these constructs, PcG silencing complexes assemble by default on the 1.6 kb Ubx PRE in the early embryo and that PcG
silencing is thus firmly established by the stage when the imaginal discs
enhancers would become active. Silencing by the PRE during larval stages
therefore appears to be dominant overactivation and cannot be overcome by any of the enhancers used in this study. There is other evidence in support of the idea that PcG silencing during larval development is more stable than in embryos. In particular, a PRE reporter gene that contains a Gal4-inducible promoter is only transiently activated if a pulse of the transcriptional activator Gal4 is supplied during larval development; by contrast, a pulse of Gal4 during embryogenesis switches the PRE into an 'active mode' that supports
transcriptional activation throughout development.
Furthermore, recent studies in imaginal discs suggest that there is a
distinction between transcriptional repression and the inheritance of the
silenced state; the silenced state can be propagated for some period even if repression is lost. Specifically, loss of Hox gene silencing after removal of PcG proteins in proliferating cells can be reversed if the depleted PcG protein is resupplied within a few cell generations. Taken
together, it thus appears that PcG silencing during postembryonic development is a remarkably stable process. Finally, the results reported in this study also imply that, once PcG silencing is established, Hox genes can `make use of virtually any type of transcriptional activator to maintain their expression; PcG silencing will ensure that activation by these factors only occurs in cells in which the Hox gene should be active. The analysis of Ubx control sequences supports this view; if individually linked to a reporter gene, most late-acting enhancers direct expression both within as well as outside of the normal Ubx expression domain (Sengupta, 2004),
The second point to discuss concerns the repression mechanism used by
PcG proteins. Biochemical purification of PRC1 has revealed that several TFIID
components co-purify with the PcG proteins that constitute the core of PRC1. Moreover, formaldehyde crosslinking experiments in tissue culture cells showed that TFIID components are associated with promoters, even if these are repressed by PcG proteins. This suggests that PcG protein complexes anchored at the
PRE interact with general transcription factors bound at the promoter. One
possibility would be that PcG repressors directly target components of the
general transcription machinery to prevent transcriptional activation by
enhancer-binding factors. Three distinct activators act
through the three enhancers used in this study and, according to these results, none of them is able to overcome the block imposed by the PcG machinery. But how do the known
activities of PcG protein complexes [i.e., histone methylation by the Esc-E(z) complex and inhibition of chromatin remodeling by PRC1] fit into this
scenario? Both these activities may be required for the repression process by altering the structure of chromatin around the transcription start site and thus prevent the formation of productive RNA Pol II complexes. Other scenarios are possible. For example, histone methylation may primarily serve to mark the chromatin for binding of PRC1 through Pc, and PRC1 components such as Psc then perform the actual repression process.
Whatever the exact repression mechanism may be, the PRE-excision experiment shows that this repression is lost within one cell generation after removal of the PRE. This implies that changes in the chromatin generated by the action of PcG proteins cannot be propagated by the flanking chromatin (Sengupta, 2004).
Chromatin is important for the regulation of transcription and other functions, yet the diversity of chromatin composition and the distribution along chromosomes are still poorly characterized. By integrative analysis of genome-wide binding maps of 53 broadly selected chromatin components in Drosophila cells, this study shows that the genome is segmented into five principal chromatin types (see Chromatin types are characterized by distinctive protein combinations and histone modifications) that are defined by unique yet overlapping combinations of proteins and form domains that can extend over > 100 kb. A repressive chromatin type was identified that covers about half of the genome and lacks classic heterochromatin markers. Furthermore, transcriptionally active euchromatin consists of two types that differ in molecular organization and H3K36 methylation and regulate distinct classes of genes. Finally, evidence is provided that the different chromatin types help to target DNA-binding factors to specific genomic regions. These results provide a global view of chromatin diversity and domain organization in a metazoan cell (Filion, 2010).
By systematic integration of 53 protein location maps this study found that the Drosophila genome is packaged into a mosaic of five principal chromatin types, each defined by a unique combination of proteins. Extensive evidence demonstrates that the five types differ in a wide range of characteristics besides protein composition, such as biochemical properties, transcriptional activity, histone modifications, replication timing, DNA binding factor (DBF) targeting, as well as sequence properties and functions of the embedded genes. This validates the classification by independent means and provides important insights into the functional properties of the five chromatin types (Filion, 2010).
Identifying five chromatin states out of the binding profiles of 53 proteins comes out as a surprisingly low number (one can form approximately 1016 subsets of 53 elements). It is emphasized that the five chromatin types should be regarded as the major types. Some may be further divided into sub-types, depending on how fine-grained one wishes the classification to be. For example, within each of the transcriptionally active chromatin types, promoters and 3' ends of genes exhibit (mostly quantitative) differences in their protein composition and thus could be regarded as distinct sub-types. However, these local differences are minor relative to the differences between the five principal types that are described in this study. It cannot be excluded that the accumulation of binding profiles of additional proteins would reveal other novel chromatin types. It is also anticipated that the pattern of chromatin types along the genome will vary between cell types. For example, many genes that are embedded in 'BLACK' chromatin (defined in Kc167 cells) are activated in some other cell types. Thus, the chromatin of these genes is likely to switch to an active type (Filion, 2010).
While the integration of data for 53 proteins provides substantial robustness to the classification of chromatin along the genome, a subset of only five marker proteins (histone H1, PC, HP1, MRG15 and BRM), which together occupy 97.6% of the genome, can recapitulate this classification with 85.5% agreement. Assuming that no unknown additional principal chromatin types exist in some cell types, DamID or ChIP of this small set of markers may thus provide an efficient means to examine the distribution of the five chromatin types in various cells and tissues, with acceptable accuracy (Filion, 2010).
Previous work on the expression of integrated reporter genes had suggested that most of the fly genome is transcriptionally repressed, contrasting with the low coverage of PcG and HP1-marked chromatin. BLACK chromatin, which consists of a previously unknown combination of proteins and covers about half of the genome, may account for these observations. Essentially all genes in BLACK chromatin exhibit extremely low expression levels, and transgenes inserted in BLACK chromatin are frequently silenced, indicating that BLACK chromatin constitutes a strongly repressive environment. Importantly, BLACK chromatin is depleted of PcG proteins, HP1, SU(VAR)3-9 and associated proteins, and is also the latest to replicate, underscoring that it is different from previously characterized types of heterochromatin (identified as BLUE and GREEN chromatin in this study) (Filion, 2010).
The proteins that mark BLACK domains provide important clues to the molecular biology of this type of chromatin. Loss of Lamin (LAM), Effete (EFF) or histone H1 causes lethality during Drosophila development. Extensive in vitro and in vivo evidence has suggested a role for H1 in gene repression, most likely through stabilization of nucleosome positions. The enrichment of LAM points to a role of the nuclear lamina in gene regulation in BLACK chromatin, consistent with the long-standing notion that peripheral chromatin is silent. Depletion of LAM causes derepression of several LAM-associated genes (Shevelyov, 2009), while artificial targeting of genes to the nuclear lamina can reduce their expression, suggesting a direct repressive contribution of the nuclear lamina in BLACK chromatin. D1 is a little-studied protein with 11 AT-hook domains. Overexpression of D1 causes ectopic pairing of intercalary heterochromatin (Smith, 2010), suggesting a role in the regulation of higher-order chromatin structure. SUUR specifically regulates late replication on polytene chromosomes (Zhimulev, 2003), which is of interest because BLACK chromatin is particularly late-replicating. EFF is highly similar to the yeast and mammalian ubiquitin ligase Ubc4 that mediates ubiquitination of histone H3, raising the possibility that nucleosomes in BLACK chromatin may carry specific ubiquitin marks. These insights suggest that BLACK chromatin is important for chromosome architecture as well as gene repression and provide important leads for further study of this previously unknown yet prevalent type of chromatin (Filion, 2010).
In RED and YELLOW chromatin most genes are active, and the overall expression levels are similar between these two chromatin types. However, RED and YELLOW chromatin differ in many respects. One of the conspicuous distinctions is the disparate levels of H3K36me3 at active transcription units. This histone mark is thought to be laid down in the course of transcription elongation and may block the activity of cryptic promoters inside the transcription unit. Why active genes in RED chromatin lack H3K36me3 remains to be elucidated (Filion, 2010).
The remarkably high protein occupancy in RED chromatin suggests that RED domains are 'hubs' of regulatory activity. This may be related to the predominantly tissue-specific expression of genes in RED chromatin, which presumably requires many regulatory proteins. It is noted that the DamID assay integrates protein binding events over nearly 24 hours, so it is likely that not all proteins bind simultaneously; some proteins may bind only during a specific stage of the cell cycle. It is highly unlikely that the high protein occupancy in RED chromatin originates from an artifact of DamID, e.g. caused by a high accessibility of RED chromatin. First, all DamID data are corrected for accessibility using parallel Dam-only measurements. Second, several proteins, such as EFF, SU(VAR)3-9 and histone H1 exhibit lower occupancies in RED than in any other chromatin type. Third, ORC also shows a specific enrichment in RED chromatin, even though it was mapped by ChIP, by another laboratory and on another detection platform. Fourth, DamID of Gal4-DBD does not show any enrichment in RED chromatin (Filion, 2010).
RED chromatin resembles DBF binding hotspots that were previously discovered in a smaller-scale study in Drosophila cells. Discrete genomic regions targeted by many DBFs have recently also been found in mouse ES cells , hence it is tempting to speculate that an equivalent of RED chromatin may also exist in mammalian cells. Housekeeping and dynamically regulated genes in budding yeast also exhibit a dichotomy in chromatin organization which may be related to the distinction between YELLOW and RED chromatin. The observations that RED chromatin is generally the earliest to replicate and strongly enriched in ORC binding, suggest that this chromatin type may be not only involved in transcriptional regulation but also in the control of DNA replication (Filion, 2010).
This analysis of DBF binding indicates that the five chromatin types together act as a guidance system to target DBFs to specific genomic regions. This system directs DBFs to certain genomic domains even though the DBF recognition motifs are more widely distributed. It is proposed that targeting specificity is at least in part achieved through interactions of DBFs with particular partner proteins that are present in some of the five chromatin types but not in others. The observation that yeast Gal4-DBD binds its motifs with nearly equal efficiency in all five chromatin types suggests that differences in compaction among the chromatin types represent overall a minor factor in the targeting of DBFs. Although additional studies will be needed to further investigate the molecular mechanisms of DBF guidance, the identification of five principal types of chromatin provides a firm basis for future dissection of the roles of chromatin organization in global gene regulation (Filion, 2010).
Chromosomes are the physical realization of genetic information and thus form the basis for its readout and propagation. This study presents a high-resolution chromosomal contact map derived from a modified genome-wide chromosome conformation capture approach applied to Drosophila embryonic nuclei. The data show that the entire genome is linearly partitioned into well-demarcated physical domains that overlap extensively with active and repressive epigenetic marks. Chromosomal contacts are hierarchically organized between domains. Global modeling of contact density and clustering of domains show that inactive domains are condensed and confined to their chromosomal territories, whereas active domains reach out of the territory to form remote intra- and interchromosomal contacts. Moreover, specific long-range intrachromosomal contacts between Polycomb-repressed domains were systematically identified. Together, these observations allow for quantitative prediction of the Drosophila chromosomal contact map, laying the foundation for detailed studies of chromosome structure and function in a genetically tractable system (Sexton, 2012).
A simplified Hi-C procedure was developed for minimally biased profiling of chromosomal contacts on a genomic scale. Using this technique, chromosomal architecture in Drosophila melanogaster embryonic nuclei was comprehensively and accurately characterized. The chromosomal contact map relaxes the classical trade-off between coverage and resolution in the study of chromosome structure. The data provide sufficient resolution to observe local contact profiles derived from 4C and consistently deliver such resolution for essentially any genomic locus. The effective resolution limitations of the map depend on the features being studied. Demarcation of physical domains can be achieved within a precision of one or a few DpnII fragments (i.e., of ~1 kb), as many fragments with high expected contact probability contribute to their identification. On the other hand, detection of long-range contacts with statistical confidence greatly depends on their absolute intensity compared to the background, which decays significantly with genomic separation. For example, based on the current sequencing depth, the decay in background contact probability with genomic distance, and the average DpnII restriction site density, it is estimated that a contact with 4-fold enrichment over the background could be confidently detected at a resolution of ~10 kb for genomic separations of 100 kb, a resolution of ~30 kb for genomic separations of 1 Mb, and ~125 kb for interchromosomal coassociations. Regardless of these considerations, and despite the fact that the experiment assayed a large and heterogeneous set of nuclei, the derived Hi-C map reveals a clear structure and allows for multiple chromosome folding principles to be explored systematically. The implications of the Drosophila map are therefore far reaching, and the analysis presented in this study can be viewed as a baseline on which further efforts directed to understand genomic and epigenomic patterns at particular cell states or genetic backgrounds can be developed (Sexton, 2012).
The Hi-C map is rich in local and global structure, describing contact frequencies that vary within five orders of magnitude. This studied has tried to explain the distributions of contact frequencies in the map using quantitative models based on the simplest principles and has tried to justify any progressive increases in model complexity by proven discrepancies between the data and a simpler version of the model. One of the most remarkable patterns observed in the map was the partitioning of chromosomes into physical domains, which showed up in the matrix as diagonal submatrices with high contact intensities. A quantitative probabilistic model was used to show that contacts inside these domains are governed by a distinct regime that cannot be attributed to denser contacts or more compact chromosomal structure alone. Further analysis showed that physical domains form the backbones of a hierarchical chromosome structure, as the contact intensities between genomic elements are mostly determined by the identities of the domains containing them, rather than the element's location within the domain. Previous lower-resolution exploration of human chromosome architecture identified a global power law decay of contact frequency with genomic separation, and this was used to propose a fractal globule model of chromosome folding. Although a roughly similar global decay curve was found for Drosophila chromatin, higher-resolution analysis of contact decays within the context of physical domains challenges this model and suggests that, in scales of 10-100 kb, the predominant factor affecting chromosome folding is the modular organization. This promotes hierarchical chromosomal organization as an attractive paradigm to facilitate functional epigenetic organization but leaves open questions about the scales at which it may be observed in different genomes that vary significantly in size and gene density (Sexton, 2012).
Remarkably, the physical domains inferred from the Hi-C contact map were compatible with numerous linear epigenetic profiles describing enrichment for histone modification or DNA-binding factors. Thus the physical domains, which are key fundamental units of chromosome folding, are reflected and possibly caused by their underlying epigenetic marks. Large silent chromosomal regions that are either enriched with repressive histone marks (H3K27me3 or HP1/H3K9me2) or void of any detectable epigenetic enrichment were shown to form modular chromosomal entities, which are interspersed with small domains associated with active chromosomal marks. By analyzing the epigenomic marks at the borders of physical domains, it was observed that a transition in transcriptional activity (as indicated by peaks of H3K4me3) is sometimes sufficient to disturb the compaction of flanking repressive chromatin domains. This may result in the formation of 'punctuated' repressed domains, with active genes forming 'passive' physical boundaries. However, in most cases, this study finds that insulator proteins, particularly CP190 and Chromator, sharply demarcate the borders of physical domains. This is especially apparent at borders marked by both CP190 and H3K4me3, as CP190 binds precisely at the physical domain boundary, with the H3K4me3 peak shifted ~500 bp toward the Active domain. Interestingly, a recent study suggested that binding of the 'accessory' insulator protein CP190 is required for a functional insulator. In agreement with this, it was found that CP190 correlates most strongly with physical boundary domains, whereas many regions bound by the DNA sequence-specific binding insulator proteins CTCF and Su(Hw) are not linked to physical domain boundaries. Chromator emerged from this analyses as another major factor organizing physical domains. Although little is known about the function of the mitotic spindle protein during interphase, Chromator has been shown to be necessary for the maintenance of polytene chromosome structure. The current findings appear to extend the structural function of Chromator to diploid embryonic nuclei. By providing an architectural context to epigenomic chromatin domains, the Hi-C map thus provides a reference epigenomic model, directing future efforts for analyses of the correlations between hundreds of measured linear epigenomic profiles (Sexton, 2012).
Chromosomes clearly fold in a complicated, heterogeneous regime. In order to make any reasoned claims about the significance of previously reported individual cases of long-range chromatin interactions, it is important to first understand the basic principles of what defines 'standard' folding of a chromosome fiber. This Hi-C dataset allows formulation and testing hypotheses on chromatin folding with progressively more complex quantitative models. First, this study has account for heterogeneity in contact density, facilitating identification of physical chromatin modules and their hierarchical pattern of folding. Next, it was possible able to group physical domains into two clusters (annotated postfactum as active or inactive) based on their intrachromosomal contacts and to generally describe interdomain contacts as those within or between clusters. This supported and extended previous findings on the relationship between transcriptional activity and position within chromosome territories. Although the combined model explains much of the chromosome folding behavior, specific long-range chromatin interactions were still apparent. One group of functional long-range interactions that has already been investigated and is clearly visible in the Hi-C map associates PcG-regulated genes that co-occupy Polycomb bodies (Sexton, 2012).
In summary, this Hi-C study has provided a fundamental chromatin interaction map framework, providing the basis for mathematical models to assess the link between chromosome structure and function. The characterization of hierarchically folded discrete physical modules, which may be epigenetically defined, forms a hitherto unappreciated base from which more complicated chromosome topologies can arise. It is posited that this and future Hi-C datasets, combined with specific perturbation experiments, will inform more sophisticated mathematical models of chromosome folding, forming a foundation for new important insights into what shapes nuclear structure and how this in turn affects genome function (Sexton, 2012).
The mechanisms responsible for the establishment of physical domains in metazoan chromosomes are poorly understood. This study finds that physical domains in Drosophila chromosomes are demarcated at regions of active transcription and high gene density that are enriched for transcription factors and specific combinations of insulator proteins. Physical domains contain different types of chromatin defined by the presence of specific proteins and epigenetic marks, with active chromatin preferentially located at the borders and silenced chromatin in the interior. Domain boundaries participate in long-range interactions that may contribute to the clustering of regions of active or silenced chromatin in the nucleus. Analysis of transgenes suggests that chromatin is more accessible and permissive to transcription at the borders than inside domains, independent of the presence of active or silencing histone modifications. These results suggest that the higher-order physical organization of chromatin may impose an additional level of regulation over classical epigenetic marks (Hou, 2012).
The use of Hi-C to map intra- and inter-chromosomal interactions in metazoan genomes has given important insights into the organization of the chromatin fiber in eukaryotic nuclei. One important conclusion from these studies is that eukaryotic chromosomes are organized into a series of chromatin domains, perhaps formed by a series of local interactions among various regulatory sequences and the genes they control. Long-range interactions between chromatin domains may result in additional levels of folding to create larger domains. These results complement and converge with evidence suggesting that specific sequences come together in the nucleus in the process of, or with the purpose of, carrying out various nuclear processes. For example, actively transcribed genes and their regulatory sequences have been shown to colocalize at transcription factories, whereas genes silenced by PcG proteins converge at repressive factories termed Pc bodies. It is unclear whether these associations are a consequence of self-organizing principles with no functional outcomes, i.e., they result from interactions among multiprotein complexes present at active or silenced genes, or they play a functional role in gene expression and are mediated by structural proteins specifically involved in mediating inter- and intrachromosomal interactions (Hou, 2012).
A critical roadblock in understanding the principles governing the folding of metazoan genomes is the identification of proteins or forces responsible for the formation of chromosome domains and the boundaries that separate these structures. Results from the analysis of mixed-cell populations in Drosophila embryos indicate a correlation between the formation of domain boundaries and the presence of insulator proteins and the transcription factor Chromator. Similar results in mouse and human cells find a high degree of correlation between the presence of CTCF and housekeeping genes and the formation of domain boundaries (Hou, 2012).
To further explore the mechanisms of physical domain partition in metazoans, a Hi-C analysis was carried using Drosophila Kc167 cells. Physical domains did not exactly correlate with functional domains defined by epigenetic marks. Furthermore, domain boundaries usually form at regions enriched for active histone modifications such as H3K4me3 but also form in regions enriched for silencing marks such as H3K27me3 and LAM. The common theme among domain boundaries, even those present in regions enriched for H3K27me3 and LAM, is a high density of actively transcribed genes. The likely causal role of transcription in the establishment of domains boundaries is underscored by the formation of multiple small physical domains in regions of the genome enriched for active genes. Regions of the genome enriched for silenced chromatin form large domains, with boundaries between these domains often forming when closely spaced and transcribed genes are present at the domain borders. The high correlation between gene density, transcription, and the formation of domain boundaries helps explain why these domains are conserved across different cell types of the same or different species (Hou, 2012).
In agreement with these observations, RNAPII, transcription factors, and insulator proteins are also found enriched at the borders of domains. Drosophila insulator proteins, with the exception of Su(Hw), are preferentially located adjacent to promoter regions of actively transcribed genes. It is then possible that insulators play an active role in the formation of domain boundaries and that the observed increase in actively transcribed genes in these regions is a consequence of their close association with insulator proteins. Alternatively, active transcription in regions of high gene density may be the driving force behind the formation of physical domains, and the enrichment of insulator proteins at the boundaries may be a result of their presence adjacent to these genes. Given the demonstrated role of insulators in mediating interactions between different sequences in the genome, it is possible that a combination of these two possibilities is actually responsible for domain formation. An interesting observation that may offer additional clues as to the role of insulators in the formation of physical domains is the specific enrichment of clusters of insulator proteins at the boundaries. Drosophila insulator proteins Su(Hw), BEAF, and CTCF bind specific DNA sequences and recruit CP190 and Mod(mdg4); these two proteins then interact with each other and/or themselves to bridge contacts between distant sites. The presence of multiple insulator DNA binding proteins would, presumably, make for a stronger insulator, able to mediate more frequent long-distance interactions. This hypothesis is supported by the observation that long-distance interactions involving domain boundaries are significantly higher than expected. These interactions can bring together highly transcribed regions, offering a mechanism to explain the formation of transcription factories (Hou, 2012).
An important question is whether this differential compaction of the chromatin between the inside and the borders of physical domains has an effect on gene expression. This issue was addressed by examining the insertion frequency and the expression levels of a large collection of P element transgenes. The frequency of transgene insertion is much higher at the borders of the domains than in the interior, independent of the type of chromatin, suggesting that the DNA inside physical domains is more compacted than at the borders. Furthermore, independent of the epigenetic marks present in the chromatin, transgenes inserted in the region surrounding the domain boundaries are less repressed than those inserted in the domain interior. Therefore, the physical compaction of DNA arising from the higher-order organization of the chromatin may add a different layer of regulatory information superimposed on that resulting from classical epigenetic marks (Hou, 2012).
Dynamic regulation of chromatin structure is required to modulate the transcription of genes in eukaryotes. However, the factors that contribute to the plasticity of heterochromatin structure are elusive. This study reports that cyclin-dependent kinase 12 (CDK12), a transcription elongation-associated RNA polymerase II (RNAPII) kinase, antagonizes heterochromatin enrichment in Drosophila chromosomes. Notably, loss of CDK12 induces the ectopic accumulation of heterochromatin protein 1 (HP1) on euchromatic arms, with a prominent enrichment on the X chromosome. Furthermore, ChIP and sequencing analysis reveals that the heterochromatin enrichment on the X chromosome mainly occurs within long genes involved in neuronal functions. Consequently, heterochromatin enrichment reduces the transcription of neuronal genes in the adult brain and results in a defect in Drosophila courtship learning. Taken together, these results define a previously unidentified role of CDK12 in controlling the epigenetic transition between euchromatin and heterochromatin and suggest a chromatin regulatory mechanism in neuronal behaviors (Pan, 2015).
ATP-dependent chromatin remodeling complexes are multi-protein machines highly conserved across eukaryotic genomes. They control sliding and displacing of the nucleosomes, modulating histone-DNA interactions and making nucleosomal DNA more accessible to specific binding proteins during replication, transcription, and DNA repair, which are processes involved in cell division. The SRCAP and p400/Tip60 chromatin remodeling complexes in humans and the related Drosophila Tip60 complex belong to the evolutionary conserved INO80 family, whose main function is promoting the exchange of canonical histone H2A with the histone variant H2A in different eukaryotic species. Some subunits of these complexes were additionally shown to relocate to the mitotic apparatus and proposed to play direct roles in cell division in human cells. However, whether this phenomenon reflects a more general function of remodeling complex components and its evolutionary conservation remains unexplored. This study has combined cell biology, reverse genetics, and biochemical approaches to study the subcellular distribution of a number of subunits belonging to the SRCAP and p400/Tip60 complexes and assess their involvement during cell division progression in HeLa cells. Interestingly, beyond their canonical chromatin localization, the subunits under investigation accumulate at different sites of the mitotic apparatus (centrosomes, spindle, and midbody), with their depletion yielding an array of aberrant outcomes of mitosis and cytokinesis, thus causing genomic instability. Importantly, this behavior was conserved by the Drosophila melanogaster orthologs tested, despite the evolutionary divergence between fly and humans has been estimated at approximately 780 million years ago. Overall, these results support the existence of evolutionarily conserved diverse roles of chromatin remodeling complexes, whereby subunits of the SRCAP and p400/Tip60 complexes relocate from the interphase chromatin to the mitotic apparatus, playing moonlighting functions required for proper execution of cell division (Messina, 2022).
Drosophila INterspersed Elements (DINEs) constitute an abundant but poorly understood group of Helitrons present in several Drosophila species. The general structure of DINEs includes two conserved blocks that may or not contain a region with tandem repeats in between. These central tandem repeats (CTRs) are similar within species but highly divergent between species. This study identified a subset of DINEs, termed DINE-TR1, which contain homologous CTRs of approximately 150 bp. DINE-TR1 are found in the sequenced genomes of several Drosophila species. However, interspecific high sequence identity (~88%) is limited to the first approximately 30 bp of each tandem repeat. Sequence analysis suggests vertical transmission. CTRs found within DINE-TR1 have independently expanded into satellite DNA-like arrays at least twice within Drosophila. By analyzing the genome of Drosophila virilis and Drosophila americana, it was shown that DINE-TR1 is highly abundant in pericentromeric heterochromatin boundaries, some telomeric regions and in the Y chromosome. It is also present in the centromeric region of one autosome from D. virilis and dispersed throughout several euchromatic sites in both species. DINE-TR1 was found to be abundant at piRNA clusters, and small DINE-TR1-derived RNA transcripts (~25 nt) are predominantly expressed in the testes and the ovaries, suggesting active targeting by the piRNA machinery. These features suggest potential piRNA-mediated regulatory roles for DINEs at local and genome-wide scales in Drosophila (Dias, 2015).
Nuclear pore complexes have emerged in recent years as chromatin-binding nuclear scaffolds, able to influence target gene expression. However, how nucleoporins (Nups) exert this control remains poorly understood. This study shows that ectopically tethering Drosophila Nups, especially Sec13, to chromatin is sufficient to induce chromatin decondensation. This decondensation is mediated through chromatin-remodeling complex PBAP, as PBAP is both robustly recruited by Sec13 and required for Sec13-induced decondensation. This phenomenon is not correlated with localization of the target locus to the nuclear periphery, but is correlated with robust recruitment of Nup Elys. Furthermore, this study identified a biochemical interaction between endogenous Sec13 and Elys with PBAP, and a role for endogenous Elys in global as well as gene-specific chromatin decompaction. Together, these findings reveal a functional role and mechanism for specific nuclear pore components in promoting an open chromatin state (Kuhn, 2019).
Recent advances enabled by the Hi-C technique have unraveled many principles of chromosomal folding that were subsequently linked to disease and gene regulation. In particular, Hi-C revealed that chromosomes of animals are organized into Topologically Associating Domains (TADs), evolutionary conserved compact chromatin domains that influence gene expression. Mechanisms that underlie partitioning of the genome into TADs remain poorly understood. To explore principles of TAD folding in Drosophila melanogaster, Hi-C and PolyA+ RNA-seq was performed in four cell lines of various origins (S2, Kc167, DmBG3-c2, and OSC). Contrary to previous studies, this study found that regions between TADs (i.e. the inter-TADs and TAD boundaries) in Drosophila are only weakly enriched with the insulator protein dCTCF, while another insulator protein Su(Hw) is preferentially present within TADs. However, Drosophila inter-TADs harbor active chromatin and constitutively transcribed (housekeeping) genes. Accordingly, it was found that binding of insulator proteins dCTCF and Su(Hw) predicts TAD boundaries much worse than active chromatin marks do. Interestingly, inter-TADs correspond to decompacted interbands of polytene chromosomes, whereas TADs mostly correspond to densely packed bands. Collectively, these results suggest that TADs are condensed chromatin domains depleted in active chromatin marks, separated by regions of active chromatin. The mechanism of TAD self-assembly is proposed based on the ability of nucleosomes from inactive chromatin to aggregate, and lack of this ability is found in acetylated nucleosomal arrays. Finally, this hypothesis was tested by polymer simulations, and it was found that TAD partitioning may be explained by different modes of inter-nucleosomal interactions for active and inactive chromatin (Ulianov, 2015).
Recently developed 3C-based methods coupled with high-throughput sequencing have enabled genome-wide investigation of chromatin organization. Studies performed in human, mouse, Drosophila, yeasts, Arabidopsis and several other species have unraveled general principles of genome folding. Chromosomes in mammals and Drosophila are organized hierarchically. At the megabase scale, mammalian chromosomes are partitioned into active and inactive compartments. At the sub-megabase scale, these compartments are subdivided into a set of self-interacting domains called Topologically Associating Domains (TADs); TADs themselves are often hierarchical and are split into smaller domains. Similar to mammals, Drosophila chromosomes are partitioned into TADs that are interspaced with short boundaries or longer inter-TAD regions (inter-TADs) (Ulianov, 2015).
Partitioning of mammalian genomes into TADs appears to be largely cell-lineage independent and evolutionary conserved. Disruption of certain TAD boundaries leads to developmental defects in humans and mice. TADs correlate with units of replication timing regulation in mammals and colocalize with epigenetic domains (either active or repressed) in Drosophila. The internal structure of TADs was reported to change in response to environmental stress, during cell differentiation, and embryonic development. In addition, comparative Hi-C analysis has demonstrated that genomic rearrangements between related mammalian species occur predominantly at TAD boundaries. Consequently, TADs appear to evolve primarily as constant and unsplit units. Previous studies in Drosophila embryonic nuclei and embryo-derived Kc167 cells detected TADs of various sizes roughly corresponding to epigenetic domains. Additionally, long-range genomic contacts and clustering of pericentromeric regions were revealed, and TAD boundaries were found to be enriched with active chromatin marks and insulator proteins. Both active and inactive TADs were identified, and their spatial segregation was observed (Ulianov, 2015).
Despite extensive studies, mechanisms underlying TAD formation remain obscure. Architectural proteins, including cohesin and CTCF, are often found at TAD boundaries; thus, they have been proposed to play a key role in the demarcation of TADs. However, several studies suggest that other mechanisms may be responsible for partitioning and formation of TADs. Firstly, depletion of various insulator proteins did not affect the profile of chromosome partitioning into TADs, but rather decreased intra-TAD interactions. Secondly, CTCF may mediate loops that occur between the start and the end of the so-called 'loop domains'. However, domains of similar sizes but without a loop were observed as well (so-called 'ordinary domains'. Thirdly, polymer simulations of a permanent chromatin loop yield a noticeable interaction between the loop bases on a simulated Hi-C map, but without a characteristic square shape of a TAD. Loops of this kind are thought to occur between insulator proteins such as Su(Hw) in the 'topological insulation' model. Finally, chromosomal domains similar to TADs in the bacterium Caulobacter crescentus are demarcated by actively transcribed genes, and are not affected by the knockout of SMC, a homolog of cohesin subunits (Ulianov, 2015).
This study presents evidences that question the role of insulators in the organization of TAD boundaries in Drosophila . The results suggest that TADs are self-organized and potentially highly dynamic structures formed by numerous transient interactions between nucleosomes of inactive chromatin, while inter-TADs and TAD boundaries contain highly acetylated nucleosomes that are less prone to interactions. Finally, a polymer model of TAD formation is developed based on the two types of nucleosomes, and it was found that a polymer composed of active and inactive chromatin blocks forms TADs on a simulated Hi-C map (Ulianov, 2015).
This study and others (Hou, 2012; Sexton, 2012) revealed that boundaries and inter-TADs in Drosophila, as opposed to TADs, are strongly enriched with active chromatin and its individual marks, as well as with active transcription and with constitutively transcribed housekeeping genes. Consequently, active chromatin marks, in the simplest case only total transcription and H3K4me3 (a mark of active promoters), can relatively well predict a TAD/inter-TAD profile. The existence of long inter-TADs composed of active chromatin is per se an argument for the ability of this type of chromatin to separate TADs. Furthermore, the current observations demonstrate that the presence of active chromatin and transcribed regions within TAD undermines the TAD integrity making TAD less compact and generating weak boundaries inside TAD. Consequently, a bona fide TAD is inactive; TADs containing active chromatin become less dense, acquire weak internal boundaries and eventually split into smaller TADs that are composed of inactive chromatin. The observation that the majority of housekeeping genes are located within inter- TADs and TAD boundaries suggests that evolutionary conservation and cell-type independence of TAD/inter-TAD profiles may be explained by conservation of positions of housekeeping genes along the chromosomes (Ulianov, 2015).
It is noted that chromosomal interaction domains similar to TADs have been observed in the bacterium Caulobacter crescentus, where they are demarcated by sites of active transcription. Although the basic level of chromosomal folding is different in bacteria and eukaryotes, the model proposed in (Le, 2013) and the model stem from common principles. In Caulobacter, active transcription is thought to disrupt the fiber of supercoils (plectonemes) by creating a stretch of non-packaged DNA, free of plectonemes, which spatially separates chromosomal regions flanking it. In the model, transcription disrupts chromatin organization by introducing a 'non-sticky' region of chromatin, which is less compact and more unfolded in space, and thus spatially separates two flanking regions. Computer modeling shows that stickiness of non-acetylated (inactive) nucleosomes and the absence of stickiness for acetylated (active) nucleosomes are sufficient for chromatin partitioning into TADs and inter-TADs. Self-association of nucleosomes may be explained by the interaction of positively charged histone tails (in particular, the tail of histone H4) of one nucleosome with the acidic patch of histones H2A/H2B at an adjacent nucleosome. Acetylation of histone tails, which is typical of active chromatin, may interfere with inter-nucleosomal associations. In addition to a high level of histone acetylation, other features of active chromatin including lower nucleosome density in inter-TADs, manifested as the decreased histone H3 occupancy, might contribute to the generation of TAD profiles (Ulianov, 2015).
It should be mentioned that a significant difference between the polymer simulations and models previously suggested by the Cavalli and Vaillant groups (Jost, 2014) is the use of saturating interactions between inactive nucleosomes. In the case of volume interactions, all nucleosomes of the same type adjacent in 3D space will attract each other; in the case of saturating interactions, each molecule may attract only one neighbor. Using volume interactions leads to the formation of a single dense blob, and does not produce TADs in a simulated Hi-C map.It is noted that the saturating nature of interactions between nucleosomes is based on the known properties of nucleosomal particles. Previous studies considered a variety of mechanisms that may lead to the formation of TADs. In particular, Barbieri (2012) studied segregation of two TADs using cubic lattice simulations of a short 152-monomer chain consisting of two TADs, assuming that inter- monomer interactions could only form between monomers belonging to the same TAD. In the current model, this study shows that TADs emerge without requiring such specific interactions; any two regions of sticky monomers separated by a non-sticky linker would form TADs. Another study proposed that transcription-induced supercoiling may be responsible for the formation of TADs (Benedetti, 2014). Although this model is consistent with the current observation that sites of active transcription demarcate TAD boundaries, there is limited evidence that supercoiling of chromatinized DNA exists in Drosophila and other organisms. On the contrary, the current model is based on known biochemical properties of nucleosomes (Ulianov, 2015).
The fact that a minor fraction of TADs is built mostly from active chromatin apparently contradicts the current model, suggesting that additional ways of chromatin self-organization could exist. One possibility is the establishment of long-range contacts between enhancers and their cognate promoters, as well as loops between pairs of insulators. Such loops formed inside active unstructured chromatin linkers (i.e., inter-TADs) could probably be sufficient to compact them and thus to fold into TADs (Ulianov, 2015).
TAD profiles of X chromosomes are almost identical in the male and female cell lines, that is in agreement with recently published observations (Ramírez, 2015). Thus, it seems that hyperacetylation of male X-chromosomes due to dosage compensation does not generate new TAD boundaries. However, it should be noted that MOF histone acetyltransferase of the MSL complex introduces only the H4K16ac mark. Although this modification is important to prevent inter-nucleosomal interactions, acetylation at other histone positions and H2B ubiquitylation contribute as well. Additionally, H4K16 acetylation generated by the dosage compensation system occurs preferentially at regions enriched with transcribed genes and hence within inter-TADs (Ulianov, 2015).
The current analysis does not support the previously reported (Hou, 2012; Sexton, 2012) strong enrichments of insulator proteins Su(Hw) and dCTCF at TAD boundaries in Drosophila. To assess the possible reasons of this divergence, the dCTCF distribution was re-analyzed with respect to TAD positions in the current dataset using the raw ChIP-seq data. No strong difference was observed in the dCTCF coverage in TADs and inter-TADs. Interestingly, this study obtained the same result while analyzing dCTCF and Su(Hw) binding within TAD boundaries identified by Hou (2012). However, a strong enrichment of dCTCF at TAD boundaries was observed when the peak distribution was analyzed instead of read coverage. Additionally, the effect was much weaker when modENCODE peaks were used. Hence, the discrepancy may be caused by a different peak calling procedure in modENCODE and in Hou. (2012). The biological significance of these observations remains to be determined. It is noted that disruption of the cohesin/CTCF complex in mammals, as well as depletion of the Vtd (also known as Rad21) cohesin subunit in Drosophila, did not lead to disappearance of TAD boundaries, but rather only slightly decreased interactions inside TADs (in mammals) and reduced TAD boundary strength in the Drosophila genome. These observations favor a role for the cohesin/CTCF complex, which is known to form loops, in chromatin compaction inside the TADs (Ulianov, 2015).
Binding of insulator proteins might contribute to establishing TAD boundaries through introducing active chromatin marks. Indeed, when inserted into an ectopic position, a classical insulator triggers hyperacetylation of the local chromatin domain and recruits chromatin-remodeling complexes. However, absence of strong enrichment of dCTCF at TAD boundaries and preferential location of Su(Hw) inside TADs mean that at least dCTCF- and Su(Hw)-dependent insulators are not the major determinants of TAD boundaries and inter-TADs (Ulianov, 2015).
TADs are predicted based on the analysis of averaged data from a cell population. Although they are usually represented as large chromatin globules, direct experimental evidence for the existence of such globules in individual cells is controversial. Using confocal and 3D-SIM microscopy, ~1-Mb globular domains have been observed within chromosomal territories. However, using STORM microscopy, chromatin in individual mammalian cells has been found to be organized into 'clutches' composed of several nucleosomes, and that increased histone acetylation dramatically reduces size of these clutches. It is thus possible that sub-megabase TADs revealed by Hi-C represent a set of nucleosome clutches separated by relatively short spacers of various sizes. These short clutches may occupy various positions within TADs in different cells and stochastically assemble to form short-living aggregates. The stochastic nature of TADs is supported by computer simulations (Ulianov, 2015).
Topologically associating domains, or TADs, are functional units that organize chromosomes into 3D structures of interacting chromatin. While the mechanisms of TAD formation have been well-studied, current knowledge on the patterns of TAD evolution across species is limited. Due to the integral role TADs play in gene regulation, their structure and organization is expected to be conserved during evolution. However, more recent research suggests that TAD structures diverge relatively rapidly. This study used Hi-C chromosome conformation capture to measure evolutionary conservation of whole TADs and TAD boundary elements between D. melanogaster and D. triauraria, two early-branching species from the melanogaster species group which diverged ∼15 million years ago. The majority of TADs were found to have been reorganized since the common ancestor of D. melanogaster and D. triauraria, via a combination of chromosomal rearrangements and gain/loss of TAD boundaries. TAD reorganization between these two species is associated with a localized effect on gene expression, near the site of disruption. By separating TADs into subtypes based on their chromatin state, we find that different subtypes are evolving under different evolutionary forces. TADs enriched for broadly expressed, transcriptionally active genes are evolving rapidly, potentially due to positive selection, whereas TADs enriched for developmentally-regulated genes remain conserved, presumably due to their importance in restricting gene-regulatory element interactions. These results provide novel insight into the evolutionary dynamics of TADs and help to reconcile contradictory reports related to the evolutionary conservation of TADs and whether changes in TAD structure affect gene expression (Torosin, 2020).
Topologically associating domains (TADs) were recently identified as fundamental units of three-dimensional eukaryotic genomic organization, although knowledge of the influence of TADs on genome evolution remains preliminary. To study the molecular evolution of TADs in Drosophila species, a new reference-grade genome assembly and accompanying high-resolution TAD map for D. pseudoobscura was constructed. Comparison of D. pseudoobscura and D. melanogaster, which are separated by ∼49 million years of divergence, showed that ∼30%-40% of their genomes retain conserved TADs. Comparative genomic analysis of 17 Drosophila species revealed that chromosomal rearrangement breakpoints are enriched at TAD boundaries but depleted within TADs. Additionally, genes within conserved TADs show lower expression divergence than those located in nonconserved TADs. Furthermore, it was found that a substantial proportion of long genes (>50 kbp) in D. melanogaster (42%) and D. pseudoobscura (26%) constitute their own TADs, implying transcript structure may be one of the deterministic factors for TAD formation. By using structural variants (SVs) identified from 14 D. melanogaster strains, its three closest sibling species from the D. simulans species complex, and two obscura clade species, evidence was uncovered of selection acting on SVs at TAD boundaries, but with the nature of selection differing between SV types. Deletions are depleted at TAD boundaries in both divergent and polymorphic SVs, suggesting purifying selection, whereas divergent tandem duplications are enriched at TAD boundaries relative to polymorphism, suggesting they are adaptive. These findings highlight how important TADs are in shaping the acquisition and retention of structural mutations that fundamentally alter genome organization (Liao, 2021).
Technological advances have lead to the creation of large epigenetic datasets, including information about DNA binding proteins and DNA spatial structure. Hi-C experiments have revealed that chromosomes are subdivided into sets of self-interacting domains called Topologically Associating Domains (TADs). TADs are involved in the regulation of gene expression activity, but the mechanisms of their formation are not yet fully understood. This study focused on machine learning methods to characterize DNA folding patterns in Drosophila based on chromatin marks across three cell lines. This study presents linear regression models with four types of regularization, gradient boosting, and recurrent neural networks (RNN) as tools to study chromatin folding characteristics associated with TADs given epigenetic chromatin immunoprecipitation data. The bidirectional long short-term memory recurrent neural network architecture produced the best prediction scores and identified biologically relevant features. Distribution of protein Chriz (Chromator) and histone modification H3K4me3 were selected as the most informative features for the prediction of TADs characteristics. This approach may be adapted to any similar biological dataset of chromatin features across various cell lines and species. The code for the implemented pipeline, Hi-ChiP-ML, is publicly available (Rozenwald, 2020).
Several distinct activities and functions have been described for chromatin insulators, which separate genes along chromosomes into functional units. This paper describes a novel mechanism of functional separation whereby an insulator prevents gene repression. When the homie insulator is deleted from the end of a Drosophila even skipped (eve) locus, a flanking P-element promoter is activated in a partial eve pattern, causing expression driven by enhancers in the 3' region to be repressed. The mechanism involves transcriptional read-through from the flanking promoter. This conclusion is based on the following. Read-through driven by a heterologous enhancer is sufficient to repress, even when homie is in place. Furthermore, when the flanking promoter is turned around, repression is minimal. Transcriptional read-through that does not produce anti-sense RNA can still repress expression, ruling out RNAi as the mechanism in this case. Thus, transcriptional interference, caused by enhancer capture and read-through when the insulator is removed, represses eve promoter-driven expression. We also show that enhancer-promoter specificity and processivity of transcription can have decisive effects on the consequences of insulator removal. First, a core heat shock 70 promoter that is not activated well by eve enhancers did not cause read-through sufficient to repress the eve promoter. Second, these transcripts are less processive than those initiated at the P-promoter, measured by how far they extend through the eve locus, and so are less disruptive. These results highlight the importance of considering transcriptional read-through when assessing the effects of insulators on gene expression (Fujioka, 2021).
Chromatin architecture plays an important role in gene regulation. Recent advances in super-resolution microscopy have made it possible to measure chromatin 3D structure and transcription in thousands of single cells. However, leveraging these complex data sets with a computationally unbiased method has been challenging. This study presents a deep learning-based approach to better understand to what degree chromatin structure relates to transcriptional state of individual cells. Furthermore, methods were explored to "unpack the black box" to determine in an unbiased manner which structural features of chromatin regulation are most important for gene expression state. This approach was applied to an Optical Reconstruction of Chromatin Architecture dataset of the Bithorax gene cluster in Drosophila; it was shown to outperforms previous contact-focused methods in predicting expression state from 3D structure. The structural information is distributed across the domain, overlapping and extending beyond domains identified by prior genetic analyses. Individual enhancer-promoter interactions are a minor contributor to predictions of activity (Rajpurkar, 2021).
The relationship between chromatin organization and gene regulation remains unclear. While disruption of chromatin domains and domain boundaries can lead to misexpression of developmental genes, acute depletion of regulators of genome organization has a relatively small effect on gene expression. It is therefore uncertain whether gene expression and chromatin state drive chromatin organization or whether changes in chromatin organization facilitate cell-type-specific activation of gene expression. Using the dorsoventral patterning of the Drosophila melanogaster embryo as a model system, this study provides evidence for the independence of chromatin organization and dorsoventral gene expression. Tissue-specific enhancers were defined and link them to expression patterns using single-cell RNA-seq. Surprisingly, despite tissue-specific chromatin states and gene expression, chromatin organization is largely maintained across tissues. These results indicate that tissue-specific chromatin conformation is not necessary for tissue-specific gene expression but rather acts as a scaffold facilitating gene expression when enhancers become active (Ing-Simmons, 2021).
Acquisition of cell fate is thought to rely on the specific interaction of remote cis-regulatory modules (CRMs), for example, enhancers and target promoters. However, the precise interplay between chromatin structure and gene expression is still unclear, particularly within multicellular developing organisms. This study employed Hi-M, a single-cell spatial genomics approach, to detect CRM-promoter looping interactions within topologically associating domains (TADs) during early Drosophila development. By comparing cis-regulatory loops in alternate cell types, it was shown that physical proximity does not necessarily instruct transcriptional states. Moreover, multi-way analyses reveal that multiple CRMs spatially coalesce to form hubs. Loops and CRM hubs are established early during development, before the emergence of TADs. Moreover, CRM hubs are formed, in part, via the action of the pioneer transcription factor Zelda and precede transcriptional activation. This approach provides insight into the role of CRM-promoter interactions in defining transcriptional states, as well as distinct cell types (Espinola 2021).
This study used a high-resolution, imaging-based, single-cell spatial genomics approach (Hi-M) to link chromosome topology and transcriptional regulation during early Drosophila development. This approach has notable advantages, such as the detection of multi-way interactions and transcriptional output with spatial resolution. Extensive interaction networks were revealed within developmental TADs primarily involving CRMs. Critically, these networks arise thanks to the spatial clustering of multiple enhancers (CRM hubs) and are mostly invariant during cell fate specification and gene activation. Networks of pairwise CRM contacts and CRM hubs arise during early development, before the onset of gene expression and before the emergence of TADs, and require the pioneering activity of the transcription factor Zld (Espinola 2021).
One of the important results of the present study is that physical proximity between multiple CRMs and promoters is observed with very similar frequencies in cells with distinct fates and appeared during early embryogenesis. These results are consistent with those obtained at later stages of Drosophila embryogenesis, showing that enhancers located at considerably larger distances (~100 kb) can also form binary loops that are present in cells from different tissues. Similarly, E-P interactions at the mouse Hoxd locus were detected in tissues where target genes were not expressed. The results are further supported by a companion paper that applied Hi-C and micro-C to study tissue-specific Drosophila chromosome organization at similar stages of development. From a developmental perspective, the formation of loops between promoters and distal regulatory elements in cells where genes need to be repressed can be seen as a ‘dangerous liaison'. Indeed, once a loop has been established, transcriptional activation could rapidly occur in cells where that specific promoter should be kept inactive (Espinola 2021).
This apparent dichotomy can, however, be rationalized in terms of the spatiotemporal patterning of the cis-regulatory logic of TFs during embryogenesis. For instance, in M (mesodermal) cells, most doc CRMs are bound by the spatially localized transcriptional repressor Sna, which acts as a silencer in the M. In this case, communication between promoters and distal CRMs may reinforce transcriptional repression. This interpretation is in agreement with the finding that many enhancers can act as silencers in alternate cell types during Drosophila development; however, other silencing mechanisms may also be at play. Thus, it is hypothesized that the optimal mechanism to ensure rapid and efficient activation or repression during development may involve two steps: the rapid priming of key CRMs via ubiquitously maternally deposited pioneer factors (for example, Zld), followed by regulation of transcriptional output by spatially and temporally localized transcriptional activators and repressors. In this model, three-dimensional (3D) chromatin architecture plays a double role because 3D contacts could serve to reinforce both activation and repression at a particular developmental stage while allowing for flexibility at later stages. For example, a repressive CRM loop in a tissue at an early developmental stage may switch to a CRM loop with activation capacities at later stages by changing TF occupancy. Future experiments testing whether CRM loops and hubs display more differences in active and repressed tissues at later stages of development will be important to test these hypotheses (Espinola 2021).
Previous studies suggested that invariant enhancer-promoter (E-P loops may be pre-established and stable (Allahyar, 2018; Dudelaar, Tsai, 2019). In agreement with these results, the data indicate that E-P loops can form early, well before the onset of gene expression. However, in all cases, low frequencies were measured of looping interactions between functional elements. These results are consistent with previous measurements of absolute contact frequencies within TADs and between E-Ps. Thus, these results indicate that different sets of multi-way E-E and E-P contacts are present in different cells, and that these contacts may be highly dynamic (Espinola 2021).
Recent studies reported the existence of enhancer hubs: spatially localized clusters containing multiple enhancers that may facilitate transcriptional activation by creating a local microenvironment whereby transcriptional resources are shared, akin to early models of 'transcription hubs'. Formation of enhancer hubs may require interactions between components of the transcriptional machinery, which could contribute to, or result from, the assembly of phase-separated condensates. In this model, enhancers need not directly touch their target promoters but merely come into close proximity (~200-300 nm). Overall, these findings and models are consistent with the observation that multiple endogenous CRMs within a TAD come together in space to form hubs in single, actively transcribing nuclei. The formation of similar hubs was also observed in inactive nuclei, suggesting that repressive elements may also form spatially localized clusters of transcriptional repressors to share resources and reinforce their silencing activities. CRM hubs are formed at early stages of development in pluripotent cells. Thus, a model is favored in which preferential CRM interaction networks are pre-formed at early stages and are subsequently specified (into activation or repression hubs) during nc14 or later (Espinola 2021).
In Drosophila, TADs emerge concomitant with the major wave of zygotic gene activation. Previous studies reported the existence of chromatin loops typically at considerably large genomic distances spanning two or more TADs or concerning Polycomb-binding sites. In the present study, it was observed that chromatin loops between CRMs within Drosophila TADs are widespread, mimicking the common CTCF-mediated chromatin loops present within mammalian TADs. In addition, this study found that multiple CRMs can cluster together to form cis-regulatory hubs located within TADs, suggesting a mechanism to sequester enhancers in space to reduce the activation of genes in neighboring TADs. Importantly, formation of CRM hubs precedes the emergence of TADs, consistent with the finding in mammalian cells that subsets of E-P contacts arise rapidly after mitosis before TADs are re-formed. Thus, the results suggest that CRM hubs and TADs probably form by different mechanisms. Overall, it is hypothesized that CRM hubs represent an additional functional level of genome organization, independent of TADs. This additional layer can also be regulated by priming of enhancers and promoters by paused polymerases or pioneer factors, as well as by chromatin marks. As interactions between Zld CRMs appear before TADs, it is unclear how specificity of CRM interactions may be regulated to favor intra-TAD contacts (Espinola 2021).
It is interesting that it is observed that interactions between Zld-bound CRMs, as well as interactions between CRMs and cognate promoters, are established very early in pluripotent nuclei, before cell fate commitments. These long-range interactions occur between related CRMs (within doc- and sna-TADs) as well as between unrelated but Zld-bound CRMs, suggesting that a common link could be their regulation by broad factors such as Zld. Critically, preferential contacts involving Zld-bound CRMs were considerably attenuated upon Zld depletion. If has been recently shown that Zld forms nuclear hubs in early Drosophila embryos, and that Zld hubs are re-established by the end of mitosis, before transcriptional activation. Taken together, these results suggest that Zld fosters the formation of CRM hubs by rendering chromatin accessible during early development, as a first step of cell specification to ensure maximum plasticity. Future work involving the detection of a larger number of CRMs will be needed to elucidate the factors and mechanisms involved in spatial clustering of developmental CRMs into nuclear microenvironments (Espinola 2021).
Genome organization is driven by forces affecting transcriptional state, but the relationship between transcription and genome architecture remains unclear. This study identified the Drosophila transcription factor Motif 1 Binding Protein (M1BP) in physical association with the gypsy chromatin insulator core complex, including the universal insulator protein CP190. M1BP is required for enhancer-blocking and barrier activities of the gypsy insulator as well as its proper nuclear localization. Genome-wide, M1BP specifically colocalizes with CP190 at Motif 1-containing promoters, which are enriched at topologically associating domain (TAD) borders. M1BP facilitates CP190 chromatin binding at many shared sites and vice versa. Both factors promote Motif 1-dependent gene expression and transcription near TAD borders genome-wide. Finally, loss of M1BP reduces chromatin accessibility and increases both inter- and intra-TAD local genome compaction. These results reveal physical and functional interaction between CP190 and M1BP to activate transcription at TAD borders and mediate chromatin insulator-dependent genome organization (Bag, 2021).
In eukaryotic cells, the three-dimensional organization of the genome plays a critical role in achieving proper spatial and temporal patterns of gene expression during development. Chromatin insulators are DNA-protein complexes involved in the establishment, maintenance, and regulation of nuclear organization to modulate gene expression. Insulators regulate interactions between cis-regulatory elements such as enhancers and promoters and demarcate silent and active chromatin regions to ensure their proper regulation. They can inhibit the interaction between an enhancer and a promoter when positioned between the two elements and can act as a barrier to stop repressive chromatin from spreading over active genes. Furthermore, chromatin insulators can promote intra- and inter-chromosomal looping to control topology of the genome. Certain insulator proteins are highly enriched at the self-interacting boundaries of topologically associating domains (TADs) throughout the genome. In mammals, only a single insulator protein, CCCTC-binding Factor (CTCF), has thus far been identified, and CTCF indeed is enriched at TAD borders and is required for TAD formation. In contrast, Drosophila melanogaster CTCF is not particularly enriched at TAD borders, and a recent study indicates that CTCF plays a limited role in TAD formation in flies. In fact, Drosophila harbors a variety of insulator protein complexes, all of which contain the protein Centrosomal protein 190 (CP190). CP190 is highly enriched at TAD borders, suggesting a possible role in TAD formation. Another notable feature of genome organization that has been explored in detail in Drosophila is the key role of transcription and the presence of constitutively active genes at TAD borders. General inhibition of transcription using chemical treatments or heat shock results in disruption of TADs and compartments, but the mechanistic details of how transcription contributes to genome organization are yet to be elucidated (Bag, 2021).
The Drosophila gypsy insulator, also known as the Suppressor of Hairy wing [Su(Hw)] insulator, was the first characterized CP190-containing insulator complex. The zinc-finger DNA-binding protein Su(Hw) provides binding specificity of the complex, and both CP190 and the Modifier of mdg4 [Mod(mdg4)] 67.2 kDa isoform [Mod(mdg4)67.2] contain an N-terminal Broad-Complex, Tramtrack, and Bric a brac (BTB) domain that can homodimerize or heterodimerize to facilitate insulator-insulator interactions and promote formation of long range insulator-mediated loops. Initially, the gypsy insulator complex was characterized as binding the 5'-untranslated region of the gypsy retroelement. However, the core complex also binds thousands of endogenous sites throughout the genome and can function similarly at least at a subset of those sites. Moreover, the three gypsy insulator core components do not colocalize absolutely at all binding sites throughout the genome, and each protein can interact with other insulator proteins. In diploid interphase nuclei, gypsy insulator proteins coalesce into large foci termed insulator bodies. These structures can be induced by stress, and insulator bodies have also been proposed to serve as storage depots for insulator proteins. Nevertheless, there is a high correlation between proper insulator function and insulator body localization. In summary, the gypsy insulator complex contributes to higher order nuclear organization on several levels (Bag, 2021).
CP190 also associates with a variety of additional DNA-binding proteins that likely impart specificity of the respective complex. The BED finger-containing proteins BEAF-32, Ibf1, and Ibf2 interact with CP190 and promote insulator function (Cuartero, 2014). Three additional zinc-finger proteins Pita, ZIPIC, and CTCF also interact with CP190 and contribute to insulator activity. Recently, the zinc-finger protein CLAMP was demonstrated to positively affect gypsy insulator activity and to colocalize particularly with CP190 at promoters throughout the genome. In addition, previous work showed that CP190 preferentially binds Motif 1-containing promoters, but the functional significance of this observation is currently unknown. The precise functions of CP190, its associated factors, as well as their relationship with transcription regulation have not yet been elucidated (Bag, 2021).
Motif 1 binding protein (M1BP) is a ubiquitously expressed transcriptional activator that is required for the expression of predominantly constitutive genes. A zinc-finger DNA-binding protein, M1BP, specifically binds to the core promoter element Motif 1 consensus sequence that is distinct from the canonical TATA box and mainly controls the expression of constitutively active genes that are transiently paused (Li, 2013). For example, M1BP interacts with the TATA-binding protein-related factor 2 (TRF2) to activate transcription of ribosomal protein genes in a Motif 1-dependent manner (Baumann, 2017a). Finally, recent studies found that Motif 1 and M1BP are highly enriched at TAD boundaries along with CP190 and BEAF-32 (Cubenas-Potts, 2017; Ramirez, 2018; Sexton, 2012; Hou, 2012). Depletion of M1BP led to increased inter-chromosomal Hi-C contacts; however, concomitant cell cycle disruption precluded interpretation of these results (Ramirez, 2018). The possible role of M1BP-dependent transcriptional regulation in genome organization has not yet been interrogated in detail (Bag, 2021).
This study identified M1BP as a physical interactor and positive regulator of the gypsy insulator complex. Depletion of M1BP decreases gypsy-dependent enhancer blocking and barrier activities and reduces the association of the core insulator complex with the gypsy insulator sequence. ChIP-seq analysis reveals extensive genome-wide overlap of M1BP particularly with promoter-bound CP190, and depletion of M1BP results in extensive loss of CP190 chromatin association genome-wide. Depletion of CP190 also disrupts M1BP binding at many of its binding sites. Nascent euRNA-seq (neuRNA-seq) analysis of M1BP- or CP190-depleted cells indicates that both factors co-regulate a similar set of genes genome-wide. In particular, loss of gene activation correlates with disrupted M1BP and CP190 binding, and these events are frequently observed at TAD borders. Depletion of M1BP disrupts gypsy insulator body localization within the nucleus and alters both inter- and intra-TAD local genome compaction. Finally, knockdown of M1BP decreases chromatin accessibility at its binding sites, including genes that it activates and regions in proximity of TAD borders. Taken together, these findings identify a mechanistic relationship between M1BP and CP190 to activate Motif 1-dependent transcription as well as to promote chromatin insulator activity and nuclear organization (Bag, 2021).
This study has shown that M1BP is required for proper gypsy insulator function and insulator body formation and that M1BP and CP190 together activate transcription at TAD borders. M1BP was shown to physically associates with CP190 as well as core gypsy components and promotes enhancer blocking and barrier activities. Genome-wide M1BP colocalizes mainly with CP190 at Motif 1-containing promoters, which are enriched at TAD borders. M1BP is required for CP190 binding at many sites throughout the genome and vice versa, and loss of either factor reduces gene expression at TAD borders. M1BP is required for proper nuclear localization of insulator bodies, and loss of M1BP increases local genome compaction across TAD borders as well as within large TADs. Finally, M1BP promotes local chromatin accessibility at its binding sites, including transcriptionally activated genes and regions near TAD borders. Taken together, these findings suggest that M1BP may play a role in 3D genome organization through a CP190- and transcription-dependent mechanism (Bag, 2021).
As M1BP is ubiquitously expressed throughout development, effects on insulator activity and complex localization after M1BP depletion were observed in all tissues and stages of development tested. M1BP associates physically with chromatin at the Su(Hw)-binding sites of the gypsy insulator in conjunction with core insulator proteins, and M1BP is required for the binding of all three factors. These findings suggest that M1BP directly affects insulator activity by aiding the recruitment of gypsy core components to gypsy insulator sites, all three of which are required for proper insulator activity. Interestingly, Motif 1 is not present at this sequence, and M1BP binding at this site is also dependent on the presence of CP190. One scenario is that binding of the two factors could be cooperative, and M1BP recruitment may additionally heo stabilize the multimerization and/or higher order organization of insulator complexes. Consistent with this hypothesis, depletion of M1BP results in increased numbers of smaller insulator bodies, similar to the effect of complete loss of the BTB-containing core insulator protein Mod(mdg4)67.2. Another possibility is that depletion of M1BP results in cellular stress that induces insulator body formation. Since CP190 is a universal insulator protein in Drosophila, mislocalization of CP190 may result in, or at least serve as an indicator of, disrupted genome organization when M1BP is depleted (Bag, 2021).
Although M1BP physically interacts with each of the core gypsy insulator components, it was observed that M1BP colocalizes mainly with just CP190 throughout the genome, particularly at Motif 1-containing promoters. Distinct binding of the transcriptional activator M1BP compared to Su(Hw) and Mod(mdg4)67.2 genome-wide is not entirely surprising considering sub-stoichiometric levels of co-immunoprecipitation that could also reflect interaction off of chromatin. Furthermore, it has been observed that Su(Hw) binding can correlate with transcriptional repression rather than insulator activity, which may depend on the presence, or absence, of particular interacting proteins. Importantly, recruitment of CP190 is dependent on M1BP and vice versa at many co-occupied sites genome-wide. Why M1BP is only partially dependent on CP190 for binding is unclear, but these results are consistent with the known ability of M1BP to bind DNA directly (Li, 2013), whereas CP190 is believed to require interaction with a specific DNA-binding protein in order to associate with chromatin. Aside from the insulator protein BEAF-32, which binds an AT-rich dual core sequence, a large extent of overlap between M1BP and other DNA-binding insulator proteins that have been shown to be involved in recruiting CP190 to DNA was not generally observed. Future studies may reveal a possible functional relationship between M1BP and BEAF-32 in insulator activity or regulation of gene expression (Bag, 2021).
The results suggest that M1BP promotes gypsy insulator function through interaction with CP190 in a manner distinct from CP190 interaction with the zinc-finger DNA-binding protein CLAMP. A large extent of genome-wide overlap between M1BP and CLAMP, a recently identified positive regulator of gypsy insulator activity, was not observed. The sequence binding specificity of CLAMP is similar to that of GAF45, while M1BP and GAF bind to and regulate distinct sets of promoters (Li, 2013). Although either CLAMP or M1BP depletion reduces gypsy enhancer blocking and barrier activities as well as alter insulator body localization, unlike M1BP, CLAMP depletion does not affect CP190 chromatin association throughout the genome. In fact, CP190 depletion had a substantial effect on CLAMP chromatin association, again suggesting that CP190 may affect the ability of certain DNA-binding proteins, including M1BP, to associate with chromatin through cooperative or higher order physical interactions (Bag, 2021).
The results suggest that CP190 may play a more direct role in transcriptional regulation than previously appreciated, in part through interaction with M1BP. Genome-wide profiling studies have shown that CP190 preferentially associates with promoters genome-wide. CP190 was found to be enriched particularly at active promoters and was shown to affect steady-state gene expression when depleted; however, direct and indirect effects as well as transcriptional and posttranscriptional effects could not be separated. In order to avoid the complication of interpreting steady-state gene expression profiles, neuRNA-seq was performed after either CP190 or M1BP depletion in order to measure newly synthesized transcripts. Intriguingly, nascent RNA expression profiles of CP190 or M1BP-depleted cells showed a remarkably high level of correlation. Since both M1BP and CP190 are particularly associated with promoters of genes that require either factor for adequate expression, it is likely that both factors mainly function in transcriptional activation rather than repression. M1BP was previously shown to activate transcriptionally paused genes, and the ATAC-seq analysis of M1BP-depleted cells supports this conclusion and further demonstrates that M1BP promotes chromatin accessibility surrounding the TSS. Interestingly, depletion of CP190 has no effect on promoter accessibility. Furthermore, this study found that CP190 is specifically required for Motif 1-dependent expression of two previously characterized ribosomal protein genes. Whether TRF2 recruitment to Motif 1-containing promoters is affected by CP190 as well as the precise mechanism by which CP190 contributes to M1BP-dependent transcription will be important topics of further study (Bag, 2021).
This study found that genes that require M1BP and CP190 for adequate expression are frequently located at TAD borders, as are both proteins. It has previously been proposed that constitutively active transcription, particularly at/near TAD borders (also referred to as 'compartmental domains'), may be a defining or at least key feature of overall genome organization in cells and throughout development. CP190 was previously shown to be associated particularly with Motif 1-containing promoters (Rach, 2011), and CP190 was observed to be specifically enriched at TAD borders. Recently, Motif 1 was also found to be an enriched sequence at TAD borders present in Kc cells (Ramirez, 2018) and is apparent when TAD borders first appear in embryonic development, findings consistent with these previous studies. A previous study of M1BP involvement in genome organization provided limited evidence using Hi-C to suggest that chromosome intermingling may be increased after M1BP depletion (Ramirez, 2018). However, the extent and duration of M1BP depletion in their study caused a major disruption of cell cycle and cellular growth, thus obscuring interpretation of those results. No changes were observed in CT intermingling in Oligopaint FISH experiments in M1BP-depleted G1 cells, nor did was any difference observed in distance between two distant regions on the same chromosome. Therefore, large-scale changes in genome organization were not observed after M1BP depletion (Bag, 2021).
Increased local genome compaction was observed after M1BP depletion. This finding led to a test of whether TAD borders may be specifically disrupted, perhaps leading to fusion of neighboring TADs. Because Oligopaint FISH probes are limited to a minimum of 30 kb, this study was restricted to intra-TAD analysis of larger TADs, which are typically lower in transcriptional activity (PcG, inactive, null). Increased compaction was found to occur both across TAD borders and within large TADs after M1BP depletion. These effects occur in the vicinity of altered local transcription and reduced chromatin accessibility particularly near TAD borders, suggesting that M1BP-dependent transcriptional changes might alter local chromatin structure that culminates in changes in genome compaction in surrounding regions not restricted to TAD borders. However, the extent of reduced chromatin accessibility observed in M1BP-depleted cells by ATAC-seq is modest relative to the genomic sequence space interrogated by FISH, suggesting that loss of accessibility likely does not directly explain increased local genome compaction. In contrast, CP190 depletion affected transcription to a similar degree, although not identically, yet did not result in changes in local compaction or extensive loss of chromatin accessibility genome-wide. These differences perhaps reflect the multifunctional nature of CP190 as a universal insulator protein contributing to opposing forces, or alternatively, effects on transcription in M1BP-depleted cells may be functionally unrelated to increased genome compaction. This work shows the requirement of M1BP for accurate CP190 binding throughout the genome as well as gypsy-dependent chromatin insulator activity and nuclear localization through interaction with CP190 and other core insulator proteins. Overall, the results provide evidence that M1BP and CP190 at TAD borders and perhaps their ability to activate transcription of constitutively expressed genes located at TAD borders, combined with the capacity of M1BP to promote chromatin accessibility, may play a role in genome organization (Bag, 2021).
M1BP has been shown to activate transcription of genes at which RNA Pol II is transiently paused in the promoter proximal region, and these promoter regions may themselves possess insulator activity. Intriguingly, a previous study showed that stalled Hox promoters, including Motif 1-containing Abd-B, possess intrinsic enhancer blocking insulator activity47. However, the second paused promoter identified in this study, Ubx, does not harbor Motif 1; thus, M1BP may not necessarily be involved in the enhancer blocking activity of all stalled promoters. Recently, it was shown that M1BP promoter binding can prime the recruitment of the Hox protein Abd-A to the promoter in order to release paused Pol II and activate transcription (Chopra, 2009). This study found that M1BP is similarly required for CP190 recruitment at a large number of sites throughout the genome and thus propose that M1BP and CP190 are together required to maintain active gene expression near TAD borders. Active transcription and increased accessibility at these sites may be needed for higher order chromatin organization such as TAD insulation, formation of active compartmental domains, and/or proper local genome structure. Future studies will elucidate the precise mechanisms by which M1BP, CP190, and transcription contribute to higher order chromatin organization (Bag, 2021).
Drosophila bithorax complex (BX-C) is one of the best model systems for studying the role of boundaries (insulators) in gene regulation. Expression of three homeotic genes, Ubx, abd-A, and Abd-B, is orchestrated by nine parasegment-specific regulatory domains. These domains are flanked by boundary elements, which function to block crosstalk between adjacent domains, ensuring that they can act autonomously. Paradoxically, seven of the BX-C regulatory domains are separated from their gene target by at least one boundary, and must "jump over" the intervening boundaries. To understand the jumping mechanism, the Mcp boundary was replaced with Fab-7 and Fab-8. Mcp is located between the iab-4 and iab-5 domains, and defines the border between the set of regulatory domains controlling abd-A and Abd-B. When Mcp is replaced by Fab-7 or Fab-8, they direct the iab-4 domain (which regulates abd-A) to inappropriately activate Abd-B in abdominal segment A4. For the Fab-8 replacement, ectopic induction was only observed when it was inserted in the same orientation as the endogenous Fab-8 boundary. A similar orientation dependence for bypass activity was observed when Fab-7 was replaced by Fab-8. Thus, boundaries perform two opposite functions in the context of BX-C-they block crosstalk between neighboring regulatory domains, but at the same time actively facilitate long distance communication between the regulatory domains and their respective target genes (Postaka, 2018).
Boundaries flanking the Abd-B regulatory domains must block crosstalk between adjacent regulatory domains but at the same time allow more distal domains to jump over one or more intervening boundaries and activate Abd-B expression. While several models have been advanced to account for these two paradoxical activities, replacement experiments argued that both must be intrinsic properties of the Abd-B boundaries. Thus Fab-7 and Fab-8 have blocking and bypass activities in Fab-7 replacement experiments, while heterologous boundaries including multimerized dCTCF sites and Mcp from BX-C do not. One idea is that Fab-7 and Fab-8 are simply 'permissive' for bypass. They allow bypass to occur, while boundaries like multimerized dCTCF or Mcp are not permissive in the context of Fab-7. Another is that they actively facilitate bypass by directing the distal Abd-B regulatory domains to the Abd-B promoter. Potentially consistent with an 'active' mechanism that involves boundary pairing interactions, the bypass activity of Fab-8 and to a lesser extent Fab-7 is orientation dependent (Postaka, 2018).
In the studies reported it this study have tested these two models further. For this purpose the Mcp boundary was used for in situ replacement experiments. Mcp defines the border between the regulatory domains that control expression of abd-A and Abd-B. In this location, it is required to block crosstalk between the flanking domains iab-4 and iab-5, but it does not need to mediate bypass. In this respect, it differs from the boundaries that are located within the set of regulatory domains that control either abd-A or Abd-B, as these boundaries must have both activities. If bypass were simply passive, insertion of a 'permissive' Fab-7 or Fab-8 boundary in either orientation in place of Mcp would be no different from insertion of a generic 'non-permissive' boundary such as multimerized dCTCF sites. Assuming that Fab-7 and Fab-8 can block crosstalk out of context, they should fully substitute for Mcp. In contrast, if bypass in the normal context involves an active mechanism in which more distal regulatory domains are brought to the Abd-B promoter, then Fab-7 and Fab-8 replacements might also be able to bring iab-4 to the Abd-B promoter in a configuration that activates transcription. If they do so, then this process would be expected to show the same orientation dependence as is observed for bypass of the Abd-B regulatory domains in Fab-7 replacements (Postaka, 2018).
Consistent with the idea that a boundary located at the border between the domains that regulate abd-A and Abd-B need not have bypass activity, it was found that multimerized binding sites for the dCTCF protein fully substitute for Mcp. Like the multimerized dCTCF sites, Fab-7 and Fab-8 are also able to block crosstalk between iab-4 and iab-5. In the case of Fab-7, its' blocking activity is incomplete and there are small clones of cells in which the mini-y reporter is activated in A4. In contrast, the blocking activity of Fab-8 is comparable to the multimerized dCTCF sites and the mini-y reporter is off throughout A4. One plausible reason for this difference is that Mcp and the boundaries flanking Mcp (Fab-4 and Fab-6) utilize dCTCF as does Fab-8, while this architectural protein does not bind to Fab-7 (Postaka, 2018).
Importantly, in spite of their normal (or near normal) ability to block crosstalk, both boundaries still perturb Abd-B regulation. In the case of Fab-8, the misregulation of Abd-B is orientation dependent just like the bypass activity of this boundary when it is used to replace Fab-7. When inserted in the reverse orientation, Fab-8 behaves like multimerized dCTCF sites and it fully rescues the Mcp deletion. In contrast, when inserted in the forward orientation, Fab-8 induces the expression of Abd-B in A4 (PS9), and the misspecification of this parasegment. Unlike classical Mcp deletions or the McpPRE replacement described in this study, expression of the Abd-B gene in PS9 is driven by iab-4, not iab-5. This conclusion is supported by two lines of evidence. First, the mini-y reporter inserted in iab-5 is off in PS9 cells indicating that iab-5 is silenced by PcG factors as it should be in this parasegment. Second, the ectopic expression of Abd-B is eliminated when the iab-4 regulatory domain is inactivated (Postaka, 2018).
These results, taken together with previous studies, support a model in which the chromatin loops formed by Fab-8 inserted at Mcp in the forward orientation brings the enhancers in the iab-4 regulatory domain in close proximity to the Abd-B promoter, leading to the activation of Abd-B in A4 (PS9). In contrast, when inserted in the opposite orientation, the topology of the chromatin loops formed by the ectopic Fab-8 boundary are not compatible with productive interactions between iab-4 and the Abd-B promoter. Moreover, it would appear that boundary bypass for the regulatory domains that control Abd-B expression is not a passive process in which the boundaries are simply permissive for interactions between the regulatory domains and the Abd-B promoter. Instead, it seems to be an active process in which the boundaries are responsible for bringing the regulatory domains into contact with the Abd-B gene. It also seems likely that bypass activity of Fab-8 (and also Fab-7) may have a predisposed preference, namely it is targeted for interactions with the Abd-B gene. This idea would fit with transgene bypass experiments, which showed that both Fab-7 and Fab-8 interacted with an insulator like element upstream of the Abd-B promoter, AB-I, while the Mcp boundary didn't (Postaka, 2018).
Similar conclusions can be drawn from the induction of Abd-B expression in A4 (PS9) when Fab-7 is inserted in place of Mcp. Like Fab-8, this boundary inappropriately targets the iab-4 regulatory domain to Abd-B. Unlike Fab-8, Abd-B is ectopically activated when Fab-7 is inserted in both the forward and reverse orientations. While the effects are milder in the reverse orientation, the lack of pronounced orientation dependence is consistent with experiments in which Fab-7 was inserted at its endogenous location in the reverse orientation. Unlike Fab-8 only very minor iab-6 bypass defects were observed. In addition to the activation of Abd-B in A4 (PS9) the Fab-7 Mcp replacements also alter the pattern of Abd-B regulation in more posterior segments. In the forward orientation, A4 and A5 are transformed towards an A6 identity, while A6 is also misspecified. Similar though somewhat less severe effects are observed in these segments when Fab-7 is inserted in the reverse orientation. At this point the mechanisms responsible for these novel phenotypic effects are uncertain. One possibility is that pairing interactions between the Fab-7 insert and the endogenous Fab-7 boundary disrupt the normal topological organization of the regulatory domains in a manner similar to that seen in boundary competition transgene assays. An alternative possibility is that Fab-7 targets iab-4 to the Abd-B promoter not only in A4 (PS9) but also in cells in A5 (PS10) and A6 (PS11). In this model, Abd-B would be regulated not only by the domain that normally specifies the identity of the parasegment (e.g., iab-5 in PS10), but also by interactions with iab-4. This dual regulation would increase the levels of Abd-B, giving the weak GOF phenotypes. Potentially consistent with this second model, inactivating iab-4 in the McpF8 replacement not only rescues the A4 (PS9) GOF phenotypes but also suppresses the loss of anterior trichomes in the A6 tergite (Postaka, 2018).
Insulators play important roles in genome structure and function in eukaryotes. Interactions between a DNA binding insulator protein and its interacting partner proteins define the properties of each insulator site. The different roles of insulator protein partners in the Drosophila genome and how they confer functional specificity remain poorly understood. The Suppressor of Hairy wing [Su(Hw)] insulator is targeted to the nuclear lamina, preferentially localizes at euchromatin/heterochromatin boundaries, and is associated with the gypsy retrotransposon. Insulator activity relies on the ability of the Su(Hw) protein to bind the DNA at specific sites and interact with Mod(mdg4)67.2 and CP190 partner proteins. HP1 and insulator partner protein 1 (HIPP1) is a partner of Su(Hw), but how HIPP1 contributes to the function of Su(Hw) insulator complexes is unclear. This study demonstrates that HIPP1 colocalizes with the Su(Hw) insulator complex in polytene chromatin and in stress-induced insulator bodies. The overexpression of either HIPP1 or Su(Hw) or mutation of the HIPP1 crotonase-like domain (CLD) causes defects in cell proliferation by limiting the progression of DNA replication. This study also showed that HIPP1 overexpression suppresses the Su(Hw) insulator enhancer-blocking function, while mutation of the HIPP1 CLD does not affect Su(Hw) enhancer blocking. These findings demonstrate a functional relationship between HIPP1 and the Su(Hw) insulator complex and suggest that the CLD, while not involved in enhancer blocking, influences cell cycle progression (Stow, 2022).
Mounting evidence implicates liquid-liquid phase separation (LLPS), the condensation of biomolecules into liquid-like droplets in the formation and dissolution of membraneless intracellular organelles (MLOs). Cells use MLOs or condensates for various biological processes, including emergency signaling and spatiotemporal control over steady-state biochemical reactions and heterochromatin formation. Insulator proteins are architectural elements involved in establishing independent domains of transcriptional activity within eukaryotic genomes. In Drosophila, insulator proteins form nuclear foci known as insulator bodies in response to osmotic stress. However, the mechanism through which insulator proteins assemble into bodies is yet to be investigated. This study identified signatures of LLPS by insulator bodies, including high disorder tendency in insulator proteins, scaffold-client-dependent assembly, extensive fusion behavior, sphericity, and sensitivity to 1,6-hexanediol. The cohesin subunit Rad21 is shown to be a component of insulator bodies, adding to the known insulator protein constituents and γH2Av. These data suggest a concerted role of cohesin and insulator proteins in insulator body formation and under physiological conditions. A mechanism is proposed whereby these architectural proteins modulate 3D genome organization through LLPS (Amankwaa, 2022).
The Drosophila Boundary Element-Associated Factor of 32 kDa (BEAF) binds in promoter regions of a few thousand mostly housekeeping genes. This study shows that BEAF physically interacts with the polybromo subunit (Pbro) of PBAP, a SWI/SNF-class chromatin remodeling complex. BEAF also shows genetic interactions with Pbro and other PBAP subunits. The effect of this interaction on gene expression and chromatin structure was examined using precision run-on sequencing and micrococcal nuclease sequencing after RNAi-mediated knockdown in cultured S2 cells. The results are consistent with the interaction playing a subtle role in gene activation. Fewer than 5% of BEAF-associated genes were significantly affected after BEAF knockdown. Most were downregulated, accompanied by fill-in of the promoter nucleosome-depleted region and a slight upstream shift of the +1 nucleosome. Pbro knockdown caused downregulation of several hundred genes and showed a correlation with BEAF knockdown but a better correlation with promoter-proximal GAGA factor binding. Micrococcal nuclease sequencing supports that BEAF binds near housekeeping gene promoters while Pbro is more important at regulated genes. Yet there is a similar general but slight reduction of promoter-proximal pausing by RNA polymerase II and increase in nucleosome-depleted region nucleosome occupancy after knockdown of either protein. The possibility is discussed of redundant factors keeping BEAF-associated promoters active and masking the role of interactions between BEAF and the Pbro subunit of PBAP in S2 cells. Facilitates Chromatin Transcription (FACT) and Nucleosome Remodeling Factor (NURF) were identified as candidate redundant factors (McKowen, 2022).
The spatial organization of chromatin at the scale of topologically associating domains (TADs) and below displays large cell-to-cell variations. Up until now, how this heterogeneity in chromatin conformation is shaped by chromatin condensation, TAD insulation, and transcription has remained mostly elusive. This study used Hi-M, a multiplexed DNA-FISH imaging technique providing developmental timing and transcriptional status, to show that the emergence of TADs at the ensemble level of the doc locus partially segregates the conformational space explored by single nuclei during the early development of Drosophila embryos. Surprisingly, a substantial fraction of nuclei display strong insulation even before TADs emerge. Moreover, active transcription within a TAD leads to minor changes to the local inter- and intra-TAD chromatin conformation in single nuclei and only weakly affects insulation to the neighboring TAD. Overall, these results indicate that multiple parameters contribute to shaping the chromatin architecture of single nuclei at the TAD scale (Gotz, 2022).
DNA within chromosomes in the nucleus is non-randomly organized into chromosome territories, compartments and topologically associated domains (TADs). Chromosomal rearrangements have the potential to alter chromatin organization and modify gene expression leading to selection against these structural variants. Drosophila pseudoobscura has a wealth of naturally occurring gene arrangements that were generated by overlapping inversion mutations caused by two chromosomal breaks that rejoin the central region in reverse order. Unlike humans, Drosophila inversion heterozygotes do not have negative effects associated with crossing over during meiosis because males use achiasmate mechanisms for proper segregation, and aberrant recombinant meiotic products generated in females are lost in polar bodies. As a result, Drosophila populations are found to harbour extensive inversion polymorphisms. It is not clear, however, whether chromatin architecture constrains which inversions breakpoints persist in populations. This study mapped the breakpoints of seven inversions in D. pseudoobscura to the TAD map to determine if persisting inversion breakpoints are more likely to occur at boundaries between TADs. The results show that breakpoints occur at TAD boundaries more than expected by chance. Some breakpoints may alter gene expression within TADs supporting the hypothesis that position effects contribute to inversion establishment (Wright, 2022).
Chromatin topology is intricately linked to gene expression, yet its functional requirement remains unclear. This study comprehensively assessed the interplay between genome topology and gene expression using highly rearranged chromosomes (balancers) spanning ~75% of the Drosophila genome. Using transheterozyte (balancer/wild-type) embryos, allele-specific changes were measured in topology and gene expression in cis, while minimizing trans effects. Through genome sequencing, eight large nested inversions, smaller inversions, duplications and thousands of deletions were identified. These extensive rearrangements caused many changes to chromatin topology, disrupting long-range loops, topologically associating domains (TADs) and promoter interactions, yet these are not predictive of changes in expression. Gene expression is generally not altered around inversion breakpoints, indicating that mis-appropriate enhancer-promoter activation is a rare event. Similarly, shuffling or fusing TADs, changing intra-TAD connections and disrupting long-range inter-TAD loops does not alter expression for the majority of genes. These results suggest that properties other than chromatin topology ensure productive enhancer-promoter interactions (Ghavi-Helm, 2019).
Position effect variegation (PEV) in Drosophila results from new juxtapositions of euchromatic and heterochromatic chromosomal regions, and manifests as striking bimodal patterns of gene expression. The semirandom patterns of PEV, reflecting clonal relationships between cells, have been interpreted as gene-expression states that are set in development and thereafter maintained without change through subsequent cell divisions. Many properties of PEV are not predicted from currently accepted biochemical and theoretical models. This work investigated the time at which expressivity of silencing is set and finds that it is determined before heterochromatin exists. A mathematical simulation and a corroborating experimental approach are employed to monitor switching (i.e., gains and losses of silencing) through development. In contrast to current views, this study finds that gene silencing is incompletely set early in embryogenesis, but nevertheless is repeatedly lost and gained in individual cells throughout development. The data support an alternative to locus-specific 'epigenetic' silencing at variegating gene promoters that more fully accounts for the final patterns of PEV (Bughio, 2019).
Deciphering the rules of genome folding in the cell nucleus is essential to understand its functions. Recent chromosome conformation capture (Hi-C) studies have revealed that the genome is partitioned into topologically associating domains (TADs), which demarcate functional epigenetic domains defined by combinations of specific chromatin marks. However, whether TADs are true physical units in each cell nucleus or whether they reflect statistical frequencies of measured interactions within cell populations is unclear. Using a combination of Hi-C, three-dimensional (3D) fluorescent in situ hybridization, super-resolution microscopy, and polymer modeling, this study provides an integrative view of chromatin folding in Drosophila. Repressed TADs form a succession of discrete nanocompartments, interspersed by less condensed active regions. Single-cell analysis revealed a consistent TAD-based physical compartmentalization of the chromatin fiber, with some degree of heterogeneity in intra-TAD conformations and in cis and trans inter-TAD contact events. These results indicate that TADs are fundamental 3D genome units that engage in dynamic higher-order inter-TAD connections. This domain-based architecture is likely to play a major role in regulatory transactions during DNA-dependent processes (Szabo, 2018).
Eukaryotic chromatin is organized in contiguous domains that differ in protein binding, histone modifications, transcriptional activity, and in their degree of compaction. Genome-wide comparisons suggest that, overall, the chromatin organization is similar in different cells within an organism. This study compared the structure and activity of the 61C7-61C8 interval in polytene and diploid cells of Drosophila. By in situ hybridization on polytene chromosomes combined with high-resolution microscopy, the boundaries were mapped of the 61C7-8 interband and of the 61C7 and C8 band regions, respectively. The results demonstrate that the 61C7-8 interband is significantly larger than estimated previously. This interband extends over 20 kbp and is in the range of the flanking band domains. It contains several active genes and therefore can be considered as an open chromatin domain. Comparing the 61C7-8 structure of Drosophila S2 cells and polytene salivary gland cells by ChIP for chromatin protein binding and histone modifications, a highly consistent domain structure was observed for the proximal 13 kbp of the domain in both cell types. However, the distal 7 kbp of the open domain differs in protein binding and histone modification between both tissues. The domain contains four protein-coding genes in the proximal part and two noncoding transcripts in the distal part. The differential transcriptional activity of one of the noncoding transcripts correlates with the observed differences in the chromatin structure between both tissues (Zielke, 2015).
Drosophila melanogaster polytene chromosomes display specific banding pattern; the underlying genetic organization of this pattern has remained elusive for many years. This paper analyzed 32 cytology-mapped polytene chromosome interbands. Molecular locations of these interbands was estimated, their molecular and genetic organization was described and it was demonstrated that polytene chromosome interbands contain the 5' ends of housekeeping genes. As a rule, interbands display preferential 'head-to-head' orientation of genes. They are enriched for 'broad' class promoters characteristic of housekeeping genes and associate with open chromatin proteins and Origin Recognition Complex (ORC) components. In two regions, 10A and 100B, coding sequences of genes whose 5'-ends reside in interbands map to constantly loosely compacted, early-replicating, so-called 'grey' bands. Comparison of expression patterns of genes mapping to late-replicating dense bands vs genes whose promoter regions map to interbands shows that the former are generally tissue-specific, whereas the latter are represented by ubiquitously active genes. Analysis of RNA-seq data (modENCODE-FlyBase) indicates that transcripts from interband-mapping genes are present in most tissues and cell lines studied, across most developmental stages and upon various treatment conditions. A special algorithm was developed to computationally process protein localization data generated by the modENCODE project; it was shown that Drosophila genome has about 5700 sites that demonstrate all the features shared by the interbands cytologically mapped to date (Zhimulev, 2014).
Drosophila polytene chromosomes have served as the best available model of eukaryotic interphase chromosome. They are prominent for their banding pattern formed by dark transverse stripes (called bands), which encompass large chunks of chromatin material. These bands alternate with fine, lighter-colored stripes that have less material and are more loosely packed. Such light-transparent structures between bands are known as interbands (Zhimulev, 2014).
Genetic organization of bands and interbands defined as the pattern that sets positioning of genes and genetic features relatively to the structural elements of a chromosome, is still largely elusive. This is due to the fact that despite the availability of the Drosophila genome, methods to even approximately map band/interband borders on a physical map are still lacking (Zhimulev, 2014).
Yet, many interesting hypotheses regarding the genetic organization of bands and interbands in polytene chromosomes have been put forth. Some of the points of these hypotheses were experimentally validated, so it is important to consider them below (Zhimulev, 2014).
Genes were proposed to reside in interbands, or bands (1-2 genes/band). Also, bands were proposed to contain many structural genes transcribed coordinately and as polycistronic messages. In some models, band and interband were considered to form a single genetic unit, where one part of the gene was embedded in a band, and the other part mapped to an interband. The Paul model (Paul, 1972) is of special interest. The author considered interband regions as essentially polymerase-binding sites, so transcription would progress into band regions from initiation sites which were likely situated near the band-interband junctions. (Zhimulev, 2014).
Very interesting conclusions were made regarding the general meaning of banding pattern: bands were regarded as hosting inactivated genes, interbands were represented by the genes in a steady state of activity; in other models, interbands were believed to contain constantly active housekeeping genes. Further details on various banding pattern models are available in (Zhimulev, 1999 and Zimulev, 2014 and references therein).
Several recent technological advances have dramatically moved forward understanding of polytene chromosome organization. First, efforts of the modENCODE project have produced genome-wide profiling data for many proteins that specifically localized to bands or interbands in interphase chromosomes (Zhimulev, 2014).
Secondly, using genome-wide DamID mapping of 53 chromosomal proteins and histone modifications Filion (2010) generated a map of Drosophila chromatin landscape and demonstrated that the genome can be segmented into five main chromatin types. Conditionally named 'BLUE' (Pc-dependent repression) and 'BLACK' (repression mechanism not defined) chromatin types associated with repressed chromatin, 'YELLOW' chromatin contained ubiquitously expressed genes, whereas 'RED' chromatin harbored active genes with more complex expression patterns. 'GREEN' chromatin type was defined by enrichment of heterochromatin-specific proteins HP1 and Su(var)3-9). More refined analysis of modENCODE data resulted in description of many more chromatin states. Significant proportion of genome sequence is known to map to a special class of polytene chromosome bands, called intercalary heterochromatin (IH). These chromosome regions associate with BLACK chromatin proteins (H1, SUUR, LAM, D1) and range from 100 to 700 kb in length. Here, DNA replicates late and compared to the genome average, these regions have lower gene density. Genomic localization of proteins that constitute repressed chromatin can thus be used as a marker to establish the molecular position of IH (Zhimulev, 2014).
Third, recent studies developed an approach to simultaneously map the interband material on polytene chromosomes and in the genome using transposon insertion tags. This allows exact localization of insertion sites both on cytological and physical maps as well as precise identification of sequences around the transposon integration sites (Zhimulev, 2014).
Using this approach, protein composition and other chromatin parameters were described in 12 DNA sequences corresponding to polytene chromosome interbands. They display general features of open chromatin: low nucleosome density, histone H1 dips, association with TSS-specific proteins such as RNA polymerase II, various transcription factors, nucleosome remodeling proteins - NURF, ISWI, WDS, interband-specific proteins (CHRIZ/CHROMATOR, CHRIZ hereafter), proteins of origin recognition complexes (ORC). Moreover, they show clustering of DNaseI hypersensitive sites (DHS) (Zhimulev, 2014).
Based on these data, subdivided all polytene chromosome bands were divided into two contrasting groups: loosely compacted early-replicating, so-called 'grey' bands and dense late-replicating compact bands ('black' IH bands). They differ in many aspects of their protein and genetic make-up, as well as in DNA compactization (Zhimulev, 2014).
Previously work has shown that polytene chromosomes and interphase chromosomes from dividing cells display identical organization. Namely, interbands from polytene chromosomes and the corresponding DNA sequences from cell line chromosomes share similar features in terms of localization of open chromatin-type proteins. Consequently, banding pattern appears as a fundamental organization principle of interphase chromosomes. In both types of chromosomes, homologous interbands and bands have identical physical borders and length; importantly, they also associate with identical sets of proteins. Hence, the notion of an interband defined as a decondensed region in the context of polytene chromosomes is also applicable to other types of interphase chromosomes. In other words, the term 'interband' should be viewed as an equivalent of a constantly decondensed region in the context of any interphase chromosome. Accordingly, hereafter this wider definition of an interband is used (Zhimulev, 2014).
In the present work, using various cytological approaches, a new set of precisely mapped interbands were characterized, and then the modENCODE data on localization of active chromatin proteins was processed using a custom-designed computation model. This analysis suggests that interphase chromosome interbands contain constantly active promoter regions of ubiquitously active genes. Coding sequences of these genes, at least in two regions studied, map to adjacent loosely compacted early-replicating 'grey' bands. In contrast, densely packed, late-replicating bands of polytene chromosomes appear to preferentially harbor tissue-specific genes (Zhimulev, 2014).
The present study aims at unraveling the genetic and functional organization of basic morphological features of interphase chromosomes. In the context of polytene chromosomes, these features display distinct degrees of chromatin packaging and comprise interbands, loosely compacted grey bands and dense IH bands. This study attempted to correlate positions of gene elements, gene expression and the epigenetic state of underlying chromatin for these structures. To do so, these morphological elements were first accurately located on the physical map of the genome. This allowed comparison of their positions with genetic and epigenetic maps, as well as with protein localization profiles, transcription profiles and other features of chromatin. So, functional domains could be related to the banding pattern of polytene chromosomes. (Zhimulev, 2014).
Dense black bands are the most prominent structures in polytene chromosomes. They are readily noticeable due to their highly compacted state, large size, lack of transcription, late replication in the S phase, and a tendency to form ectopic pairing with other bands and pericentric heterochromatin. In fact, black bands are in many regards very similar to pericentric heterochromatin, hence they were called IH. In polytene chromosomes, IH bands frequently fail to complete replication during the S phase endocycles, and are therefore underreplicated. It has recently become clear that underreplication results from the absence of internal replication origins within IH and is dependent on SUUR protein, which maps to IH bands and modulates replication by decreasing the rate of replication fork progression (Zhimulev, 2014).
Underreplication regions showing lowered DNA copy number in polytene chromosomes were molecularly mapped. This analysis established that IH bands encompass clusters of widely-spaced unique genes (i.e., genes with large intergenic regions, with 6-40 genes per IH band), and that they are generally quite large (100-600 kb). Combined with the data on localization of chromatin proteins, IH borders were precisely mapped for 60 IH regions, which enabled a more refined analysis of these structures (Zhimulev, 2014).
The current data and those of Filion (2010) indicate that IH bands are composed of tissue-specific genes showing low expression levels. One of the prominent features of IH regions is their evolutionary conservation, i.e. they tend to display conserved gene content and order throughout evolution, as has been demonstrated by microsynteny analysis in nine Drosophila species (Zhimulev, 2014).
As compared to IH bands, it is far less trivial to provide accurate mapping for interbands and grey bands, because these regions are fully replicated and are much smaller. Yet, using a combination of EM, P-element tagging and FISH, it was possible to unambiguously map the positions of 32 interbands. Using the data on the features of interband chromatin, a mathematical model was developed that defines four basic chromatin states in the drosophila genome. This model allows identification of interband regions chromosome-wide. Accordingly, the limits of the DNA sequences corresponding to interbands were defined as borders of cyan fragments (Zhimulev, 2014).
With these data in hands, the molecular and epigenetic organization of interbands was analyzed. Interbands clearly displayed features of transcriptionally active regions: H3K4me3 histone modification, lower nucleosome density and histone H1 dips, presence of DHS, localization of RNA polymerase II and components of nucleosome remodeling complexes such as NURF, ISWI, WDS. One characteristic feature of interbands is that they are specifically bound by the chromodomain-containing CHRIZ protein. CHRIZ associates with another interband-specific protein Z4, which directly binds DNA via its seven zinc fingers (Zhimulev, 2014).
According to different estimates, there are 3500-5000 bands and interbands in Drosophila melanogaster polytene chromosomes. Earlier, the existence of about 3500 interbands was predicted. This study used an advanced model that takes into account more factors and hence is more accurate. 5674 cyan fragments were discovered each spanning 2.7 kb on average. Notably, both previous and current estimates of interband numbers are very close to those obtained by cytology (Zhimulev, 2014).
The major finding of this analysis of functional organization of interbands is that they typically encompass 5'-regions of multiply active genes (constitutively and actively transcribed). In a number of instances, it was observed that short genes can be entirely engulfed by interbands, however in most cases the body of the gene is found in the adjacent loosely compacted grey band. Thus, the interband+grey band duo appears as a single functional unit for many multiply active genes, so this unit is heterogeneous in terms of compaction; likewise it shows non-uniform localization of protein markers. Whereas interbands are specifically decorated with CHRIZ, grey bands lack CHRIZ and instead they are enriched with RNApolII. CHRIZ can be speculated to provide the permanently open chromatin state to interbands, where it serves as a pioneer-factor recruiting other transcription components. It is also possible that the observed wide-spread transcription activity of interband regions results from the static physical properties of interband DNA, such as sequence-dependent DNA flexibility, which may create nucleosome-free regions at promoters. Such regions may serve as 'entry points' to recruit proteins promoting further binding of transcription factors, chromatin remodelers, etc (Zhimulev, 2014).
The findings, therefore, resonate well with several early ideas regarding the interplay of structural and functional organization of banding pattern in polytene chromosomes. These include interband localization of multiply active genes, and the hypothesis of a single functional unit composed of band+interband (Zhimulev, 2014).
Recently, there has been an avalanche of publications describing various types of domain organization in the genomes of eukaryotes. So, the current data can be conveniently compared with other genome-wide chromatin annotation projects. Domain organization is summarized of a 400 kb fragment of the X chromosome encompassing various types of bands and accurately mapped interbands. Two large domains, 189 and 170 kb long, correspond to polytene chromosome bands 10A1-2 and 10B1-2, and display features of intercalary heterochromatin (magenta-green chromatin states). In between these late-replicating domains, there is a region composed of alternating interbands and grey bands (cyan and blue fragments). When applied to this region, 5-state chromatin classification model by Filion (2010) produces very similar domains, - the important difference however is that regions of YELLOW chromatin (active gene transcription according to Filion do not discriminate between small grey bands and interbands, nor between regulatory vs gene body parts (Zhimulev, 2014).
Kharchenko (2011) performed genome-wide profiling of 18 histone modifications and constructed 9-state chromatin models. In two contrasting cell lines (S2 and BG3), transcriptionally silent chromatin corresponds to the IH bands 10A1-2 and 10B1-2, whereas state 1 chromatin (active promoters and TSS) maps to the active genes and perfectly matches the cyan state of interbands, as defined by the analysis (Zhimulev, 2014).
Using the modified Hi-C approach based on the ligation of chromatin fragments that are found in close proximity in cross-linked chromatin, high-resolution chromosomal contact maps were generated. As it follows from this analysis, the entire genome is partitioned into a series of physical domains containing active and repressive epigenetic marks. These domains are delimited by boundaries demonstrating insulator binding, high DNaseI sensitivity and a set of specific proteins: CHRIZ and the active histone mark H3K4me3. The regions of interbands match well with the boundary sites that delimit the contacting domains identified via Hi-C. Localization of cyan chromatin and physical domains were compared throughout the genome. Of 1100 boundary sites referenced in Sexton (2012), 760 (69%) map to interbands (cyan) and loosely compacted grey bands (blue chromatin) (Zhimulev, 2014).
Positions of interbands and loosely compacted grey bands display co-localization with clusters of multiply active genes. According to the current analysis, there are 12 genes nested between the 10A1-2 and 10B1-2, with their 5'-ends mapping to cyan chromatin. Of these, 5 genes were classified as housekeeping genes (Zhimulev, 2014).
Using the data from the genes were classified as 'housekeeping' or 'differentially regulated'. Under this classification, the region 10A1-2 to 10B1-2 harbors 11 housekeeping genes, of which 9 genes correspond to the definition of a multiply active gene (Zhimulev, 2014).
NSL complexes are reportedly regulators of multiply active genes and bind promoters demonstrating broad transcriptional pattern and nucleosome-free regions. Positions of NSL-binding peaks match nicely the interband positions found in the study. All these comparisons further confirm the main conclusion of this work about interbands as sites of continuously active genes (Zhimulev, 2014).
This is the first study to investigate the molecular-genetic organization of polytene chromosome interbands located on both molecular and cytological maps of Drosophila genome. The majority of the studied interbands contained one gene with a single transcription initiation site; the remaining interbands contained one gene with several alternative promoters, two or more unidirectional genes, and "head-to-head" arranged genes. In addition, intricately arranged interbands containing three or more genes in both unidirectional and bidirectional orientation were found. Insulator proteins, ORC, P-insertions, DNase I hypersensitive sites, and other open chromatin structures were situated in the promoter region of the genes located in the interbands. This area is critical for the formation of the interband, an open chromatin region in which gene transcription and replication are combined (Zykova, 2019).
Despite of the long-term studies available on genetic organization of polytene chromosome bands and interbands, little is known regarding long gene location on chromosomes. To analyze it, bioinformatic approaches were used, and genome-wide distribution of introns in gene bodies and in different chromatin states was characterized, and using fluorescent in situ hybridization they were juxtaposed with the chromosome structures. Short introns up to 2 kb in length are located in the bodies of housekeeping genes (grey bands or lazurite chromatin). In the group of 70 longest genes in the Drosophila genome, 95% of total gene length accrues to introns. The mapping of the 15 long genes showed that they could occupy extended sections of polytene chromosomes containing band and interband series, with promoters located in the interband fragments (aquamarine chromatin). Introns (malachite and ruby chromatin) in polytene chromosomes form independent bands, which can contain either both introns and exons or intron material only. Thus, a novel type of the gene arrangement in polytene chromosomes was discovered; peculiarities of such genetic organization are discussed (Khoroshko, 2020).
Chemical cross-linking and DNA sequencing have revealed regions of intra-chromosomal interaction, referred to as topologically associating domains (TADs), interspersed with regions of little or no interaction, in interphase nuclei. TADs and the regions between them were found to correspond with the bands and interbands of polytene chromosomes of Drosophila. Further, the conservation of TADs between polytene and diploid cells of Drosophila was established. From direct measurements on light micrographs of polytene chromosomes, the states of chromatin folding in the diploid cell nucleus was deduced. Two states of folding, fully extended fibers containing regulatory regions and promoters, and fibers condensed up to 10-fold containing coding regions of active genes, constitute the euchromatin of the nuclear interior. Chromatin fibers condensed up to 30-fold, containing coding regions of inactive genes, represent the heterochromatin of the nuclear periphery. A convergence of molecular analysis with direct observation thus reveals the architecture of interphase chromosomes (Eagen, 2015).
At the intermediate scale of genomic spatial organization of kilobases to megabases, which encompasses the sizes of genes, gene clusters and regulatory domains, the three-dimensional (3D) organization of DNA is implicated in multiple gene regulatory mechanisms. At this scale, the genome is partitioned into domains of different epigenetic states that are essential for regulating gene expression. This study investigated the 3D organization of chromatin in different epigenetic states using super-resolution imaging. Genomic domains were classified in Drosophila cells into transcriptionally active, inactive or Polycomb-repressed states, and distinct chromatin organizations were observed for each state. All three types of chromatin domains exhibit power-law scaling between their physical sizes in 3D and their domain lengths, but each type has a distinct scaling exponent. Polycomb-repressed domains show the densest packing and most intriguing chromatin folding behaviour, in which chromatin packing density increases with domain length. Distinct from the self-similar organization displayed by transcriptionally active and inactive chromatin, the Polycomb-repressed domains are characterized by a high degree of chromatin intermixing within the domain. Moreover, compared to inactive domains, Polycomb-repressed domains spatially exclude neighbouring active chromatin to a much stronger degree. Computational modelling and knockdown experiments suggest that reversible chromatin interactions mediated by Polycomb-group proteins play an important role in these unique packaging properties of the repressed chromatin. Taken together, these super-resolution images reveal distinct chromatin packaging for different epigenetic states at the kilobase-to-megabase scale, a length scale that is directly relevant to genome regulation (Boettiger, 2016).
Suppressor of Hairy wing [Su(Hw)] is an insulator protein that participates in regulating chromatin architecture and gene repression in Drosophila. Previous studies have shown that Su(Hw) is also required for pre-replication complex (pre-RC) recruitment on Su(Hw)-bound sites (SBSs) in Drosophila S2 cells and pupa. This study describes the effect of Su(Hw) on developmentally regulated amplification of 66D and 7F Drosophila amplicons in follicle cells (DAFCs), widely used as models in replication studies. Su(Hw) binding co-localizes with all known DAFCs in Drosophila ovaries, whereas disruption of Su(Hw) binding to 66D and 7F DAFCs causes a two-fold decrease in the amplification of these loci. The complete loss of Su(Hw) binding to chromatin impairs pre-RC recruitment to all amplification regulatory regions of 66D and 7F loci at early oogenesis (prior to DAFCs amplification). These changes coincide with a considerable Su(Hw)-dependent condensation of chromatin at 66D and 7F loci. Although this study observed the Brm, ISWI, Mi-2, and CHD1 chromatin remodelers at SBSs genome wide, their remodeler activity does not appear to be responsible for chromatin decondensation at the 66D and 7F amplification regulatory regions. This study has discovered that, in addition to the CBP/Nejire and Chameau histone acetyltransferases, the Gcn5 acetyltransferase binds to 66D and 7F DAFCs at SBSs and this binding is dependent on Su(Hw). It is proposed that the main function of Su(Hw) in developmental amplification of 66D and 7F DAFCs is to establish a chromatin structure that is permissive to pre-RC recruitment (Vorobyeva, 2021).
To elucidate the principles governing insulator architectural functions, this study used two insulators, Homie and Nhomie, that flank the Drosophila even-skipped locus. It was shown that homologous insulator interactions in trans, between Homie on one homolog and Homie on the other, or between Nhomie on one homolog and Nhomie on the other, mediate transvection. Critically, these homologous insulator:insulator interactions are orientation-dependent. Consistent with a role in the alignment and pairing of homologs, self-pairing in trans is head-to-head. Head-to-head self-interactions in cis have been reported for other fly insulators, suggesting that this is a general principle of self-pairing. Homie and Nhomie not only pair with themselves, but with each other. Heterologous Homie-Nhomie interactions occur in cis, and they serve to delimit a looped chromosomal domain that contains the even skipped transcription unit and its associated enhancers. The topology of this loop is defined by the heterologous pairing properties of Homie and Nhomie. Instead of being head-to-head, which would generate a circular loop, Homie-Nhomie pairing is head-to-tail. Head-to-tail pairing in cis generates a stem-loop, a configuration much like that observed in classical lampbrush chromosomes. These pairing principles provide a mechanistic underpinning for the observed topologies within and between chromosomes (Fujioka, 2016).
The highly regular and reproducible physical organization of chromosomes in multicellular eukaryotes was recognized a century ago in cytological studies on the lampbrush chromosomes that are found in oocytes arrested at the diplotene phase of meiosis I. At this stage, homologous chromosomes are paired. The two homologs display a similar and reproducible architecture. It consists of a series of loops emanating from the main axis, that are arranged in pairs, one from each homolog. In between the loops are regions of more compacted chromatin. A similar physical organization is evident in insect polytene chromosomes. As with lampbrush chromosomes, the paired homologs are aligned in precise register. However, instead of one copy of each homolog, there are hundreds. While loops are not readily visible, each polytene segment has a unique pattern of bands and interbands that depends upon the underlying DNA sequence and chromosome structure (Fujioka, 2016).
Subsequent studies have shown that the key features of chromosome architecture evident in lampbrush and polytene chromosomes are also found in diploid somatic cells. One of these is the subdivision of the chromatin fiber into a series of loop domains. There are now many lines of evidence indicating that looping is a characteristic architectural feature. Biochemical evidence comes from chromosome conformation capture (3C) experiments, which show that distant sites come into contact with each other in a consistent pattern of topologically associating domains (TADs). While the first studies in mammals suggested that TADs have an average length of 1 Mb, subsequent experiments showed that the average is only about 180 kb. In flies, TADs are smaller, between 10-100 kb (Sexton, 2012; Hou, 2012). Neighboring TADs are separated from each other by boundaries that constrain both physical and regulatory interactions. In mammals and also in flies, these boundaries typically correspond to sequences bound by insulator proteins like CTCF (Fujioka, 2016).
That TAD boundaries correspond to insulators is consistent with the known properties of these elements. Insulators subdivide the chromosome into functionally autonomous regulatory domains. When interposed between enhancers or silencers and target promoters, insulators block regulatory interactions. They also have an architectural function in that they can bring distant chromosomal sequences together, and in the proper configuration can promote rather than restrict regulatory interactions. Moreover, insulators are known to mediate contacts between distant sequences (loop formation), and these physical contacts depend upon specific interactions between proteins bound to the insulators (Fujioka, 2016).
The notion that insulators are responsible for subdividing eukaryotic chromosomes into a series of looped domains raises questions about the rules governing loop formation in cis. One of these is the basis for partner choice. Is choice based simply on proximity, or is there an intrinsic partner preference? A second concerns the topology of the loop formed by interacting partners in cis. Do the partners interact to form a stem-loop-like structure, or does the interaction generate a circular loop ('circle-loop')? The answer to this question will depend upon whether there is an orientation dependence to the interactions between two heterologous insulators. In flies, homologs are typically paired in somatic cells, not just in cells that are polyploid. This means that the loop domains in each homolog must be aligned in precise register along their entire length. A plausible hypothesis is that both alignment and homolog pairing are mediated by insulator interactions in trans. If this is case, there are similar questions about the rules that govern trans interactions. Is there a partner preference in the interactions that mediate homolog pairing? Is there an orientation dependence, and if so, what is the topological outcome of the looped domains generated by insulator interactions in paired chromosomes in cis and in trans? (Fujioka, 2016).
This study has used insulators from the even skipped (eve) locus to address the questions posed above about the architecture of eukaryotic chromosomes. The eve domain spans 16 kb and is bordered upstream by the Nhomie (Neighbor of Homie) insulator and downstream by Homie (Homing insulator at eve). eve encodes a homeodomain transcription factor that is required initially for segmentation, and subsequently in the development of the CNS, muscles, and anal plate. It has a complex set of enhancers that activate expression at different stages and tissues, and a Polycomb response element (PRE) that silences the gene in cells where it isn't needed. In early embryos, the stripe enhancers upstream (3+7, 2, late stripes) and downstream (4+6, 1, and 5) of the eve gene activate transcription in a pair-rule pattern. Later in development, around the time that germband retraction commences, mesodermal (Me) and neuronal (CNS) enhancers turn on eve expression in a subset of cells in each of these tissues. These late enhancers continue to function once germband retraction is complete, while another enhancer (APR) induces transcription in the presumptive anal plate. Located just upstream of eve is CG12134, while the TER94 gene is downstream. Unlike eve, both of these genes are ubiquitously expressed throughout much of embryogenesis (Fujioka, 2016).
The importance of insulators in organizing eukaryotic chromosomes has been recognized since their discovery in the 1980's. However, the principles underlying their architectural and genetic functions have not been fully elucidated. With this goal in mind, this study asked how these elements shape two critical architectural features of chromosomes. The first is homolog pairing. Homologs pair in flies from the blastoderm stage onward, and the consequent trans-interactions are important for proper gene regulation. The phenomenon of homolog pairing is not unique to Drosophila. Homologs are paired in lampbrush chromosomes of invertebrate and vertebrate oocytes. The second is the looped domain organization. Although there is now compelling evidence that insulators subdivide chromosomes into topologically independent looped domains (and that these domains play a central role in gene regulation), the topology of the loops is unknown. Moreover, while the loops must emanate from the main axis of the chromosome, the relationships between the loops, the insulators that delimit them, and the main chromosomal axis are not understood. As homolog pairing is more straightforward and the likely mechanism better documented, it is considered first (Fujioka, 2016).
Homolog pairing requires mechanisms for aligning homologs in precise register, and maintaining their stable association. While many schemes are imaginable, the simplest utilizes elements distributed along each homolog that have self-interaction specificity. Such a mechanism would be consistent with the persistence of local pairing and transvection in chromosomal rearrangements. It would also fit with studies on the pairing process. Self-association of pairing elements would locally align sequences in register, and ultimately link homologs together along their entire length. In this mechanism, self-association must be specific and also directional, namely head-to-head. This avoids the introduction of unresolvable loops and maximizes pairing for transvection (Fujioka, 2016).
In Drosophila, the homing of P-element transgenes, in which normally random insertion becomes targeted, suggested the ability of genomic elements to self-interact. Such a homing activity was found in the engrailed locus for a region that includes two PREs, and later studies showed that some insulators and a promoter region also possess homing activity. The self-interaction implied by homing suggests that these elements may facilitate homolog pairing. However, in contrast to PREs and promoters, insulators have consistently been found to engage in specific self-interactions. Thus, among the known elements in the fly genome, insulators are the best candidates to align homologs in register and maintain pairing. Moreover, genome-wide chromatin immunoprecipitation experiments (ChIPs) show that insulators are distributed at appropriate intervals along each chromosome (Fujioka, 2016).
A role in homolog pairing was first suggested by the discovery that the su(Hw) and Mcp insulators each can mediate regulatory interactions between transgenes inserted at distant sites. The Fab-7 insulator can also mediate long-range regulatory effects. Further evidence that self-association is characteristic of fly insulators came from insulator bypass experiments. These experiments showed that bypass is observed when an insulator is paired with itself, while heterologous combinations are less effective or don't give bypass. Moreover, self-pairing is, with few exceptions, head-to-head (Fujioka, 2016).
That insulators mediate homolog pairing through specific self-interactions is further supported by the current studies. Using a classical transvection assay, this study found that Homie-Homie and Nhomie-Nhomie combinations stimulate trans-regulatory interactions between enhancers on one homolog and a reporter on the other. Moreover, the parameters that favor transvection dovetail with those expected for a pairing mechanism based on insulator self-interactions in trans. First, the two insulators must be in the same orientation. When they are in opposite orientations, transvection is not enhanced (or enhancement is much weaker). Second, the enhancers and reporter must be located on the same side (centromere proximal or distal) of the insulators. In addition to transvection, Homie and Nhomie also engage in highly specific and directional distant regulatory interactions (Fujioka, 2016).
While there is compelling evidence that insulator self-interactions are responsible for homolog pairing, many issues remained unresolved. Perhaps the most important is the nature of the code used for self-recognition and orientation. The best hint comes from bypass experiments using multimerized binding sites for Su(Hw), dCTCF, or Zw5. Homologous multimer combinations give bypass, while heterologous combinations do not. However, bypass is observed for composite multimers when they are inserted in opposite orientations (e.g., Su(Hw) dCTCF >-< dCTCF Su(Hw)), but not the same orientation (e.g., Su(Hw) dCTCF >> Su(Hw) dCTCF). These findings argue that the identity and order of proteins bound to the insulator determine its self-association properties (Fujioka, 2016).
The first direct evidence that insulators generate loops came from 3C experiments on the mouse β-globin and the fly 87A7 heat shock loci. These studies suggested that physical interactions between adjacent insulators in cis could subdivide chromosomes into looped domains. Subsequent work has confirmed this conclusion (Rao, 2014). However, while these experiments demonstrate that cis insulator interactions generate loops, they provided no information about the topology of these loops, or how they are arranged (Fujioka, 2016).
Cis interactions could, a priori, be either head-to-head like self-association in trans, or head-to-tail. The consequences are quite different. Head-to-head interactions generate a circle-loop, while head-to-tail interactions generate a stem-loop. If heterologous insulators interact with only one specific partner, the circle-loop or the stem-loop will be linked to neighboring circles or stem-loops by loops without anchors. These unanchored loops would correspond to the main axis of the chromosome, and the circle-loops or stem-loops would then protrude from the main axis in a random orientation and at distances determined by the length and compaction of the unanchored loops (Fujioka, 2016).
On the other hand, if insulators in a chromosomal segment are able to interact with both of their neighbors, then the main axis of the chromosome in this region would be defined by the insulators. Quite different structures are predicted for head-to-head and head-to-tail interactions. Head-to-head would give a series of variably sized circle-loops linked together at their base by an array of interacting insulators. The base would correspond to the main axis of the chromosome, and each circle-loop would extend from one side of the main axis to the other. If the direction of coiling were always the same, this would give a structure resembling a helix anchored to a rod. If the direction of coiling were random, the structure would be more complicated and variable, since neighboring circle-loops could extend out from the main axis in either the same or the opposite direction (not illustrated). The loop-axis relationship would be more regular for head-to-tail insulator pairing in cis. Adjacent stem-loops would extend out from the main axis in opposite directions much like the lampbrush chromosomes formed when haploid sperm heads are injected into amphibian oocytes. This stem-loop organization would also fit with the radial loop model proposed by Laemmli and others for the first level of folding of metaphase chromosomes (Fujioka, 2016).
Since the current experiments show that Homie-Nhomie association is head-to-tail, the topology of the eve locus in vivo is a stem-loop, not a circle-loop. This finding raises a number of questions. Perhaps the most important is whether head-to-tail interactions are the rule rather than the exception. While the orientation dependence of homologous interactions has been extensively investigated, there have been no systematic studies on interactions between neighboring insulators. However, there are reasons to think that cis interactions are more likely head-to-tail than head-to-head. One is homolog pairing. As mentioned above, the circle-loops formed by head-to-head interactions can coil in either direction, either left-handed or right-handed. If coiling were random, then about half of the circle-loops on each homolog would be coiled in opposite directions. In this case, head-to-head pairing of homologous insulators in each homolog would generate a structure in which the circle-loops would point in opposite directions. This topology would not be compatible with transvection. Coiling of the circle-loops in the same direction on both homologs would permit interdigitation of one circle-loop inside the other; however, the chromatin fiber from the inside circle-loop would need to cross in on one side and out on the other. If the main axis of the chromosome in the paired region is defined by a series of interacting insulators in cis, then generating a topology permissive for transvection (not illustrated) would require coiling of successive homologous circle-loops on each homolog in the same direction, one inside the other (Fujioka, 2016).
These topological issues aren't encountered when heterologous insulator interactions in cis are head-to-tail. Head-to-head pairing of homologous insulators in trans would bring regulatory elements and genes in the two homologous stem-loops into close proximity. Alignment of the two homologs is straightforward whether or not the main axis of the chromosome is defined by a series of interacting insulators. Alternating loops extending upwards and downwards from the main axis of the chromosome would be directly aligned when homologous insulators pair head-to-head in trans (Fujioka, 2016).
While the requirements for aligning and pairing homologs would appear to favor stem-loops between heterologous insulators in cis in flies, homolog pairing does not occur in vertebrates except in specialized cell types. This could mean that circle-loops formed by cis interactions between heterologous insulators are permissible in vertebrate chromosomes. However, even in organisms in which homolog pairing doesn't occur in somatic cells, it seems possible that cis-pairing interactions more commonly generate stem-loops than circle-loops (see Chromosome architecture: pairing head-to-head and head-to-tail in cis). First, following DNA replication and before mitosis (during the S and G2 phases of the cell cycle), sister chromatids are aligned. Maintaining this alignment may facilitate epigenetic mechanisms that template chromatin structures from one cellular generation to the next, such as the copying of histone modifications onto both daughter chromosomes. The simpler topology of stem-loops could facilitate this sister chromatid pairing, as well as their separation during mitosis. Second, recent studies on the relationship between loop domains and CTCF insulators showed that in more than 90% of the cases, the CTCF binding sites on opposite ends of a loop are in opposite orientation. Thus, assuming that the orientation of pairing is such that the CTCF sites are aligned in parallel to form the loop, pairing between CTCF insulators at the ends of the loop would generate stem-loops rather than circle-loops. If insulators form the main axis of the chromosome, there is an additional explanation for such a bias. Head-to-head pairing in cis could generate a series of circular loops that extend out from the same side of the main axis. This configuration would be favorable for crosstalk between regulatory elements and genes in adjacent loops. By contrast, head-to-tail pairing, where adjacent stem-loops extend out in opposite directions, would disfavor crosstalk, helping to explain how insulators block enhancer-promoter communication between adjacent loops (Fujioka, 2016).
Genome organization involves cis and trans chromosomal interactions, both implicated in gene regulation, development, and disease. This study focused on trans interactions in Drosophila, where homologous chromosomes are paired in somatic cells from embryogenesis through adulthood. First long-standing questions were addressed regarding the structure of embryonic homolog pairing and, to this end, a haplotype-resolved Hi-C approach was developed to minimize homolog misassignment and thus robustly distinguish trans-homolog from cis contacts. This computational approach, which is called Ohm, reveals pairing to be surprisingly structured genome-wide, with trans-homolog domains, compartments, and interaction peaks, many coinciding with analogous cis features. a significant genome-wide correlation was found between pairing, transcription during zygotic genome activation, and binding of the pioneer factor Zelda. The findings reveal a complex, highly structured organization underlying homolog pairing, first discovered a century ago in Drosophila. Finally, the versatility of this haplotype-resolved approach was demonstrated by applying it to mammalian embryos (Erceg, 2019)
Trans-homolog interactions have been studied extensively in Drosophila, where homologs are paired in somatic cells and transvection is prevalent. Nevertheless, the detailed structure of pairing and its functional impact have not been thoroughly investigated. Accordingly, this study generated a diploid cell line from divergent parents and applied haplotype-resolved Hi-C, showing that homologs pair with varying precision genome-wide, in addition to establishing trans-homolog domains and compartments. This study revealed at least two forms of pairing: tight pairing, spanning contiguous small domains, and loose pairing, consisting of single larger domains. Strikingly, active genomic regions (A-type compartments, active chromatin, expressed genes) correlated with tight pairing, suggesting that pairing has a functional implication genome-wide. Finally, using RNAi and haplotype-resolved Hi-C, it was shown that disruption of pairing-promoting factors results in global changes in pairing, including the disruption of some interaction peaks (AlHaj Abed, 2019).
Metazoan chromosomes are folded into discrete sub-nuclear domains, referred to as chromosome territories (CTs). The molecular mechanisms that underlie the formation and maintenance of CTs during the cell cycle remain largely unknown. This paper reports the development of high-resolution chromosome paints to investigate CT organization in Drosophila cycling cells. Large-scale chromosome folding patterns and levels of chromosome intermixing are shown to be remarkably stable across various cell types. The data also suggest that the nucleus scales to accommodate fluctuations in chromosome size throughout the cell cycle, which limits the degree of intermixing between neighboring CTs. Finally, this study shows that the cohesin and condensin complexes are required for different scales of chromosome folding, with condensin II being especially important for the size, shape, and level of intermixing between CTs in interphase. These findings suggest that large-scale chromosome folding driven by condensin II influences the extent to which chromosomes interact, which may have direct consequences for cell-type specific genome stability (Rosin, 2018).
Metazoan genomes are arranged into a nested hierarchy of structural features, ranging from small chromatin loops to larger insulated neighborhoods or topologically associated domains (TADs). TADs are believed to direct and insulate gene regulatory networks, which can engage in long-range interactions with each other, ultimately packaging chromosomes into sub-nuclear compartments termed chromosome territories (CTs) (Rosin, 2018).
CTs are a widespread feature of nuclear organization across a variety of cell types and species, as revealed by both fluorescence in situ hybridization (FISH) and chromosome-conformation-capture (3C)-based studies. Recently, several studies have implicated the ring-shaped SMC (structural maintenance of chromosomes) complexes-cohesin and condensin-in the regulation of large-scale chromatin folding and CT formation. However, the contribution of each complex to local topology, large-scale chromatin folding, and chromosome individualization at single-cell resolution has been hindered by technical limitations. The consequence of CT loss during interphase also remains unclear. This is due, in part, to both the paucity of factors known to directly influence this level of organization and the difficulty in visualizing their effects at single cell resolution. However, CT intermixing has been theorized to influence the location and frequency of translocations and the position of a gene within and between CTs seems to influence its access to the machinery responsible for specific nuclear functions, such as transcription, splicing, and DNA repair (Rosin, 2018).
This study leveraged the flexible, scalable Oligopaint FISH technology to generate high-resolution chromosome paints to the entire Drosophila genome. Combined with a custom 3D segmentation pipeline, a comprehensive picture of chromosome size, shape, and position at single-cell resolution. The results show that various cell types in Drosophila harbor spatially partitioned CTs. Interestingly, widespread somatic homolog pairing in Drosophila results in homologs sharing a single CT, suggesting that homologous and heterologous chromosomes are distinguished at the cellular level in this species. Further, this study characterized the differential roles of cohesin and condensin complexes in local chromatin compaction, large-scale chromatin folding, and CT formation. Cohesin and condensin II were shown to drive different scales of chromatin folding during interphase, with condensin II being especially important for large-scale interactions and the spatial partitioning of chromosomes. These findings indicate that condensin II-driven large-scale chromatin conformations during interphase influence the extent to which chromosomes interact, which has the potential to affect gene regulation and genome stability (Rosin, 2018).
In this study, we demonstrate that Drosophila cells harbor spatially distinct CTs and found remarkably consistent levels of intermixing in a variety of cell types and throughout the cell cycle. While the vast majority of cells showed contact between all three major chromosomes, it was possible to measure that, on average, only 40% of the Drosophila genome is intermixed (not accounting for homologous chromosomes). This is strikingly similar to the estimate of 40-46% CT intermixing in human lymphocytes, possibly indicating a widespread and conserved restraint on inter-chromosomal interactions. However, it is noted that a small population of cells do exhibit >90% overlap between neighboring CTs. The fate of these cells will be important to explore in the future (Rosin, 2018).
Further, the condensin II complex was identified as an essential factor for CT formation in cycling cells. These results are consistent with those reported on condensin in yeast, tetrahymena, and post-mitotic polytene cells of Drosophila. These data are also in line with previous work showing that condensin II serves as an 'anti-pairing' factor that disrupts pairing interactions and separates homologous loci. Additionally, it was shown that condensin II overexpression can further compact chromosomes and reduce the level of CT intermixing. Together, these data highlight the highly conserved role of the condensin II complex in controlling the level of inter-chromosomal associations in eukaryotic cells (Rosin, 2018).
If condensin II has the capacity to spatially separate homologous and heterologous chromosomes, how does somatic pairing persist in Drosophila cells that have CTs? One possibility is that pairing interactions are established prior to CT formation and thus, homologous chromosomes would be folded in concert. This would be consistent with some persistence of homolog pairing through mitosis and suggests a model in which chromosomes are folded into CTs through post-mitotic condensin II activity. In addition, pairing interactions may require additional condensin activity to separate homologous versus heterologous interactions. Indeed, these studies showed that condensin II overexpression increases whole-chromosome unpairing in Kc167 and BG3 cells. It is speculated that interphase condensin II levels and thus inter-chromosomal associations are tightly regulated, and could be modified in a cell-type-specific manner. For instance, in contrast to virtually all other cell types in Drosophila, homologous chromosomes in germline stem cells remain unpaired throughout development. This separation between homologs could potentially reflect increased levels of condensin II activity and may indicate that inter-chromosomal associations are reduced to protect the stem-cell population from potentially deleterious rearrangements. Indeed, previous work has shown that different extents of chromosome intermixing correlate with translocation frequencies-both those occurring naturally in the human population and those induced experimentally in human and mouse lymphocytes. Therefore, an alteration in condensin II activity and subsequent CT intermixing levels has the potential to influence the location and frequency with which translocations occur. Intriguingly, mice carrying a hypomorphic allele of cap-H2 were recently shown to frequently develop T-cell lymphomas with highly rearranged chromosomes in the transformed cells. It will be important to determine whether this increased genome instability is associated with increased CT contact prior to the rearrangement event (Rosin, 2018).
When accounting for the popular model of loop extrusion and the stabilizing function of SMC complexes, condensin II activity could potentially fold whole chromosomes into a configuration that limit their interactions with the rest of the genome. While the nature of these interactions remains unknown, they are clearly distinct from cohesin-driven interactions given that cohesin depletion does not significantly change intermixing levels in Drosophila or yeast. Consistent with this hypothesis, a recent study demonstrated that depletion of the cohesin complex in mammals eliminates chromatin looping and TAD formation but does not disrupt long-range interactions between similar chromatin states, highlighting the notion that local insulation and higher-order folding must rely on distinct molecular determinants. Combined with the current findings that large-scale configurations are stable throughout the cell cycle and require condensin II activity, it is proposed that condensin II drives long-range interactions that are established early in interphase. In this model, condensin II may act as an 'organizational bookmark' by prioritizing intra-chromosomal folding immediately following mitotic exit. As condensin II is enriched at highly active regions of the genome marked by H3K4me3, its activity could potentially allow gene regulatory networks and chromatin compartments to favor intra- versus inter-chromosomal interactions. Further studies identifying the interactions driven by condensin II in relation to cohesin will be critical for understanding how these molecular machines cooperatively guide the genome through the cell cycle and development (Rosin, 2018).
Finally, this report describes an efficient and scalable method of high-resolution chromosome painting using Oligopaint FISH technology. Combined with a custom 3D-segmentation pipeline, quantitative measurements of chromosome size, shape, position, and overlap can be analyzed in a systematic and potentially high-throughput fashion. Moreover, the ability to conduct sequential rounds of hybridization with Oligopaints permits 3D analysis of many, if not all, CTs simultaneously. It is anticipated that this technology will lead to an enhanced ability to visualize and karyotype chromosomes in a number of systems, providing a novel battery of assays to better characterize how chromatin is packaged and spatially partitioned in the nucleus (Rosin, 2018).
Interaction domains in Drosophila chromosomes form by segregation of active and inactive chromatin in the absence of CTCF loops, but the role of transcription versus other architectural proteins in chromatin organization is unclear. This study finds that positioning of RNAPII via transcription elongation is essential in the formation of gene loops, which in turn interact to form compartmental domains. Inhibition of transcription elongation or depletion of cohesin decreases gene looping and formation of active compartmental domains. In contrast, depletion of condensin II, which also localizes to active chromatin, causes increased gene looping, formation of compartmental domains, and stronger intra-chromosomal compartmental interactions. Condensin II has a similar role in maintaining inter-chromosomal interactions responsible for pairing between homologous chromosomes, whereas inhibition of transcription elongation or cohesin depletion has little effect on homolog pairing. The results suggest distinct roles for cohesin and condensin II in the establishment of 3D nuclear organization in Drosophila (Rowley, 2019).
Inter- and intra-chromosomal interactions among DNA-bound proteins establish patterns of chromatin organization detectable by Hi-C. The original low-resolution genome-wide Hi-C maps described the segregation of active and inactive chromatin into A and B compartments. Later, higher-resolution maps identified domains characterized by preferential intra- versus inter-domain contacts. Interaction domains have been described in different organisms and are commonly referred to as topologically associating domains (TADs). In addition to these features, intense point-to-point loops have been detected by high-resolution Hi-C in mammals. The anchors of these loops are enriched in CTCF and cohesin, and predominantly contain CTCF motifs in convergent orientation (Rowley, 2019).
CTCF loops are an important component of chromatin organization in vertebrates, yet plants and invertebrates either lack a homolog or CTCF does not appear to form stable loops. Instead, chromosomal domains in these organisms, including Drosophila, correspond to the transcriptional state of specific sequences in the genome. Borders between these domains form at discontinuities between active and inactive regions containing proteins and histone modifications characteristic of their transcriptional state. This pattern of 3D organization is similar to that observed in mammals after depletion of CTCF or Rad21 and has been studied in detail in Drosophila, where analyses of high-resolution Hi-C data show that chromatin is predominately organized by the fine-scale segregation of active and inactive chromatin into A and B compartmental domains (Rowley, 2017). Indeed, transcriptional state alone can be used to computationally simulate the experimental Hi-C interaction pattern at 1-kb resolution with great accuracy (Rowley, 2017). In further support for a role of transcription or factors associated with the transcriptional state of genes in chromatin organization, inhibition of transcription initiation and subsequent degradation of RNA polymerase II (RNAPII) using triptolide disrupts Drosophila compartmental domains and their interactions. Interestingly, the extent of disruption of 3D organization correlates with the levels of RNAPII after triptolide treatment. Drosophila Hi-C maps also show a few hundred punctate signals corresponding to specific point-to-point interactions, but these loops are not associated with CTCF. Instead, the loop anchors are enriched for developmental enhancers, Pc, and Rad21. It is unclear whether these Pc loops are formed by cohesin-mediated loop extrusion as it has been proposed for CTCF loops in mammals (Rowley, 2019).
In addition to inter- and intra-chromosomal interactions, Drosophila chromosomes participate in extensive pairing with their homologs. Pairing between homologs is responsible for the transvection phenomenon, which involves interactions between enhancers and promoters of genes located in two homologous chromosomes. Analysis of the extent of this pairing typically makes use of fluorescence in situ hybridization (FISH) probes hundreds of kilobases long, making it difficult to determine whether pairing occurs at discrete loci or in large regions. Several proteins have been shown to affect homolog pairing including condensin II, the levels of which are regulated by the SCFSlimb ubiquitin ligase. Depletion of Slimb increases levels of condensin II and decreases homolog pairing, while depletion of condensin II increases homolog pairing, suggesting that condensin II antagonizes chromosome pairing. While the role of condensin II in this aspect of nuclear organization is well known, its relationship to other aspects of chromosome organization is largely unexplored (Rowley, 2019).
This study examined the contribution of condensin II, cohesin, and the distribution of RNAPII to the establishment of various features of Drosophila 3D chromatin organization. Furthermore, analysis of homologous pairing interactions using Hi-C data suggests that pairing occurs at discrete loci with an average length of 6.4 kb enriched for architectural proteins. The results highlight the importance and distinct roles of RNAPII or other components of the transcription complex, cohesin, and condensin II in the establishment of nuclear organization (Rowley, 2019).
These results support a model of chromatin organization where RNAPII and cohesin promote interactions within genes to create small gene domains. Interactions between adjacent gene domains result in the formation of active compartmental domains, and interactions among these domains give rise to the characteristic plaid pattern of Hi-C heatmaps often referred to as the A compartment. The frequency of interactions within and between genes and A compartmental domains correlates with the amount of RNAPII and cohesin, which co-localize extensively in the genome. Because of this, the allocation of a specific sequence to the A compartment should not be done in absolute terms. Rather, sequences in the A compartment have different positive eigenvector values that correlate with the amount of RNAPII and cohesin. Contiguous sequences lacking RNAPII and cohesin have a negative eigenvector value and form B compartmental domains. Interactions among B compartmental domains in Drosophila are more infrequent compared to those among A compartmental domains, that is, the plaid pattern of Hi-C heatmaps in Drosophila arises in large part due to interactions between A compartments. However, sequences within B compartmental domains interact as frequently as those located in A domains. These interactions may arise as a consequence of proteins present in silenced genes. Alternatively, or in addition, interactions within B compartmental domains may result from interactions between adjacent A domains, which enclose B domains within loops similar to those formed by CTCF/cohesin in vertebrates. This is supported by results showing that inhibition of transcription initiation with triptolide or using the heat shock response, which result in the loss of A compartmental domains, also result in decreased interaction frequencies within B domains (Rowley, 2019).
These findings suggest that, whereas interaction frequency of sequences in active genes correlates with transcription elongation, it is likely that the presence of RNAPII, or other components of the transcription/elongation complexes, is a better candidate to explain the correlation between transcription and 3D organization. Inhibition of transcription results in dramatic changes to chromatin domains in Drosophila, yet transcription inhibition was reported to have little effect in mammalian embryonic nuclei. It is speculated that transcription inhibition studies in mammalian cells could be affected by the prevalence of CTCF loop domains. These loops may tether chromatin together such that inhibition of transcription for short periods of time is insufficient to disrupt chromatin organization. Meanwhile, in organisms that lack CTCF loops, such as Drosophila and prokaryotes, the larger effect of transcription inhibition may be due to the lack of point-to-point chromatin tethering by CTCF loops. It would be interesting to analyze whether absence of transcription or depletion of RNAPII with inhibitors such as triptolide have a stronger effect in cells depleted of CTCF (Rowley, 2019).
Previous results have shown a role for condensin II in chromatin structure during interphase. Condensin II colocalizes extensively with Drosophila architectural proteins, but in spite of the similar distribution, some observations suggest a distinct role for Cap-H2 in chromatin biology with respect to other architectural proteins. For example, all architectural proteins, including Rad21, are re-distributed during the heat shock response and they accumulate at enhancer sequences. However, the amount of enhancer-bound Cap-H2 and the number of occupied enhancers decreases after temperature stress. These observations may be explained by the opposing roles that condensin II and cohesin play in mediating intra-chromosomal interactions. Condensin II is present in active chromatin but it antagonizes the formation of gene domains and A compartmental domains, and condensin II depletion results in an increase to long-range A-A compartmental interactions. These results are in line with recent observations indicating that chromosome volume, as detected by Oligopaint, increases in Cap-H2 knockdown Drosophila cells (Rosin, 2018). The mechanisms by which these two SMC motors play opposing role in chromatin interactions is unclear. Presumably, their function in chromatin 3D organization is related to their ability to extrude loops, as was proposed for cohesin in mammals. Condensin has also been shown to extrude loops in vitro (Ganji, 2018), and it would be interesting to understand whether its role, opposite to that of cohesin, is based on different potential extrusion mechanisms between these two complexes. Thus, condensin II could antagonize cohesin interactions by directly inhibiting these same interactions or by promoting different interactions (Rowley, 2019).
Drosophila chromosomes participate in extensive homologous chromosome pairing, but the details of the mechanisms underlying this phenomenon are not well understood. Analysis of Hi-C data support a button model of pairing, where the buttons are short pairing sites likely corresponding to binding sites for specific proteins, rather than large domains. These pairing sites are enriched in architectural proteins, including Rad21 and Cap-H2. Although depletion of Rad21 only has no effect on pairing, it is possible that some architectural proteins may promote pairing while others act as anti-pairers, as is the case for Cap-H2. The general antagonistic role of condensin II in the establishment of interactions between homologs as well as short- and long-range intra-chromosomal contacts suggests common mechanisms responsible for these apparently different phenomena (Rowley, 2019).
High-throughput assays of three-dimensional interactions of chromosomes have shed considerable light on the structure of animal chromatin. Despite this progress, the precise physical nature of observed structures and the forces that govern their establishment remain poorly understood. This study presents high resolution Hi-C data from early Drosophila embryos. Boundaries between topological domains of various sizes were shown to map to DNA elements that resemble classical insulator elements: short genomic regions sensitive to DNase digestion that are strongly bound by known insulator proteins and are frequently located between divergent promoters. Further, a striking correspondence was shown between these elements and the locations of mapped polytene interband regions. It is likely this relationship between insulators, topological boundaries, and polytene interbands extends across the genome, and a model is proposed in which decompaction of boundary-insulator-interband regions drives the organization of interphase chromosomes by creating stable physical separation between adjacent domains (Stadler, 2017).
Several Hi-C studies in flies have identified enrichments of insulator proteins at topologically associated domains (TAD) boundaries. These studies varied in their resolution (due to use of 4- vs. 6-cutter enzymes and sequencing depth), methods (solution vs. in situ Hi-C), and, critically, in the methods used to identify TAD boundaries. As a result, each study relied on distinct sets of boundaries for analyses of the molecular features of these structures. This study explored several methods to identify topological domains and associated boundaries and found that no single approach was sufficient to exhaustively identify all of these features in the genome. Rather, by using a combination of visual inspection of Hi-C maps at a large number of loci, unbiased hand-calling, and computational searches, a very close, two-way association was consistently observed between sites of combinatorial insulator protein binding (insulators) and the boundaries between topological domains. This result supports prior studies which found enriched insulator protein binding at topological boundaries, and extends this finding by localizing boundaries to discrete insulator elements. Hi-C data are exceptionally complex and reveal many layers of genomic organization, and it is suspected that many questions in this field will only be resolved by the combined work of multiple groups using distinct analysis strategies and techniques (Stadler, 2017).
The most intriguing finding of this study is the association of TAD boundaries with polytene interbands. The implication that these elements are decompacted, extended chromatin regions provides an attractive model in which simple physical separation explains multiple activities associated with insulators, including the ability to block enhancer-promoter interactions, prevent the spread of silenced chromatin, and organize chromatin structure (Stadler, 2017).
A number of prior observations are consistent with the identity of insulators/boundaries as interbands. First, estimates suggest that there are ~5000 interbands constituting 5% of genomic DNA, with an average length of 2 kb, numbers that are in line with the current estimates of boundary element length and number. Second, interbands are associated with insulator proteins, with CP190 appearing to be a constitutive feature of all or nearly all interbands, which is precisely what was observed for boundary elements. Third, interbands and boundary elements are highly sensitive to DNase digestion. Fourth, interbands have been shown to contain the promoters and 5' ends of genes, and this study shows a strong enrichment for promoters oriented to transcribe away from boundaries, which would place upstream regulatory elements within or near the interband. Finally, deletion of both isoforms of BEAF-32, the second-most highly enriched insulator protein at boundary elements, results in polytene X chromosomes that exhibit loss of banding and are wider and shorter than wild type, consistent with a loss of decompacted BEAF-32-bound regions. It is possible that interbands in polytene chromosomes result from multiple underlying molecular phenomena, but it is likely that decompacted insulator elements constitute a significant fraction of these structures (Stadler, 2017).
While frequent looping of insulators has not been seen in Hi-C data from fly tissue, the current model of chromatin compaction at insulators is not mutually exclusive with a role for looping in the function of some insulators. Indeed, a limited set of cases was seern in which interactions between boundaries seem to organize special genome structures with, at least in the case of the Scr locus, clear functional implications. It is likely that additional boundary-associated distal interactions will be found in other tissues and stages of fly development. However, it is emphasized that these interactions are rare and do not appear to be general features of the function of boundary elements (Stadler, 2017).
The data presented in this study offer a picture of the structure of the interphase chromatin of Drosophila that attempts to unify years of studies of polytene chromosomes with modern genomic methods. In this picture, interphase chromatin consists of alternating stretches of compacted, folded chromatin domains separated by regions of decompacted, stretched regions. The compacted regions vary in size from a few to hundreds of kilobases and correspond to both polytene band regions and TADs in Hi-C data. Decompacted regions that separate these domains are short DNA elements that are defined by the strong binding of insulator proteins and correspond to polytene interbands and TAD boundaries (insulators). An intuitive view of this structure in a non-polytene context might resemble the well-worn 'beads on a string', in which insulator/interband regions are the string and bands/TADs form beads of various sizes. Future work, including experimental manipulation of the sequences underlying these structures, will focus on validating and refining this model, exploring how it fits into hierarchical levels of genome organization, and understanding its implications for genome function (Stadler, 2017).
The Polycomb group (PcG) proteins are key conserved regulators of development, initially discovered in Drosophila and now strongly implicated in human disease. Nevertheless, differing silencing properties between the Drosophila and mammalian PcG systems have been observed. While specific DNA targeting sites for PcG proteins called Polycomb response elements (PREs) have been identified only in Drosophila, involvement of non-coding RNAs for PcG targeting has been favored in mammals. Another difference lies in the distribution patterns of PcG proteins. In mouse and human cells, PcG proteins show broad distributions, significantly overlapping with H3K27me3 domains. In contrast, only sharp peaks on PRE regions are observed for most PcG proteins in Drosophila, raising the question of how large domains of H3K27me3, up to many tens of kilobases, are formed and maintained in Drosophila. This study provides evidence that PcG distributions on silent chromatin in Drosophila are considerably broader than previously detected. Using BioTAP-XL, a chromatin crosslinking and tandem affinity purification approach, a broad, rather than PRE-limited overlap of PcG proteins with H3K27me3 was found, suggesting a conserved spreading mechanism for PcG in flies and mammals (Jung, 2016).
Epigenetic inheritance models posit that during Polycomb repression, Polycomb Repressive Complex 2 (PRC2) propagates histone H3K27 tri-methylation (H3K27me3) independently of DNA sequence. This study shows that insertion of Polycomb Response Element (PRE) DNA into the Drosophila genome creates extended domains of H3K27me3-modified nucleosomes in the flanking chromatin and causes repression of a linked reporter gene. After excision of PRE DNA, H3K27me3 nucleosomes become diluted with each round of DNA replication and reporter gene repression is lost, whereas in replication-stalled cells, H3K27me3 levels stay high and repression persists. Hence, H3K27me3-marked nucleosomes provide a memory of repression that is transmitted in a sequence-independent manner to daughter strand DNA during replication. In contrast, propagation of H3K27 tri-methylation to newly incorporated nucleosomes requires sequence-specific targeting of PRC2 to PRE DNA (Laprell, 2017).
The ability of certain histone-modifying enzymes to bind to the modification they generated has led to models where such enzymes might propagate modified chromatin domains by a positive feedback loop, independently of the underlying DNA sequence. Two paradigms of chromatin states have been proposed to be maintained by such an epigenetic inheritance mechanism: constitutive heterochromatin with histone H3 lysine 9 di- and tri-methylation (H3K9me2/3) generated by Suv39/Clr4 enzymes, and Polycomb-repressed chromatin marked with H3K27me3 by PRC2. In both chromatin states, these histone modifications are essential for repressing gene transcription. To date, there is compelling evidence that H3K9me2/3- and H3K27me3-modified nucleosomes are transmitted to daughter strand DNA during replication. However, the steps required to propagate these modifications are much less understood. Fission yeast Clr4 has the capacity to propagate ectopically induced H3K9me2/3 domains over many cell divisions by an H3K9me2/3-based positive feedback loop but only in cells mutated for H3K9me2/3 demethylase activity. In the case of PRC2, allosteric activation of the enzyme induced by binding to H3K27me3 has been proposed to be the foundation for propagating H3K27me3 chromatin. In mammalian cells, transient DNA-tethering of PRC2 generates short ectopic H3K27me3 domains that were at least partially maintained for several cell divisions after release of DNA-tethered PRC2. However, in Drosophila, where PRC2 and other Polycomb group (PcG) protein complexes are targeted to PREs, repression imposed by insertion of PRE DNA next to a reporter gene was lost upon excision of PRE DNA. This study investigated how insertion and excision of PRE DNA at ectopic sites in Drosophila affects binding of PcG proteins and H3K27me3 at the molecular level (Laprell, 2017).
Two previously described strains were analyzed that each carried a single copy of the >PRE>dppWE-TZ reporter gene, integrated at different chromosomal locations. >PRE>dppWE-TZ contains a 1.6 kilobase (kb) DNA fragment of the bxd PRE from the HOX gene Ultrabithorax (Ubx), flanked by FRT recombination sites (>PRE>) to permit excision of PRE DNA by Flp-mediated recombination. Adjacent to the >PRE> cassette, the construct contains a reporter gene comprising the wing imaginal disc enhancer from decapentaplegic (dpp) (E), linked to the hsp70 TATA box minimal promoter (T) and LacZ sequences encoding β-galactosidase (Z) . In the presence of the >PRE> cassette, the transgene was silenced and no β-galactosidase activity could be detected in wing imaginal discs of >PRE>dppWE-TZ transgenic animals. In contrast, >dppWE-TZ transgenic animals, generated by excision of the >PRE> cassette in the germ line, showed strong β-galactosidase expression in the characteristic pattern driven by the dpp enhancer. The observation that silencing of the intact >PRE>dppWE-TZ reporter gene is lost in mutants lacking PRC2 function, prompted determination of the H3K27 methylation profile and binding of PcG proteins across the transgene. In both lines, the transgene had inserted into a genomic location normally devoid of H3K27me3 and PcG protein binding. Chromatin immunoprecipitation (ChIP) assays were performed on batches of wing imaginal discs from >PRE>dppWE-TZ and the corresponding >dppWE-TZ transgenic animals, and the immunoprecipitates were analyzed by quantitative real-time PCR (qPCR). For qPCR, primer pairs were used that selectively amplified transgene sequences and sequences in the genomic regions flanking the transgene insert. As controls, primer pairs were used amplifying sequences at the endogenous bx PRE in Ubx that are known to be bound by PcG proteins (C2) or enriched for H3K27me3 (C1 and C3) and at two regions elsewhere in the genome (C4 and C5) without PcG protein binding or H3K27me3 (Laprell, 2017).
The PRC1 subunits Polycomb (Pc), Polyhomeotic (Ph) and the PRC2 subunit E (z) were specifically enriched at the transgene PRE in animals carrying >PRE>dppWE-TZ and, as expected, no binding was detected in >dppWE-TZ animals. In both >PRE>dppWE-TZ transgenic lines, H3K27me3 was present at high levels across a domain that extended about 4-5 kb to either side of the >PRE> cassette, spanning almost the entire construct. No enrichment of H3K27me3 was detectable at the >dppWE-TZ transgene. At >PRE>dppWE-TZ, PRC2 thus tri-methylates H3K27 across a chromatin interval that spans about 8-10 kb (Laprell, 2017).
To estimate to what extent nucleosomes at the >PRE>dppWE-TZ transgene are tri-methylated at H3K27, the H3K27me2 profile was determined. H3K27me2 levels across the >PRE>dppWE-TZ transgene were much lower than at C4 and C5 and comparable to the levels at Ubx (regions C1-C3) that is repressed and predominantly tri-methylated at H3K27 in wing imaginal discs. Conversely, across >dppWE-TZ, H3K27me2 levels were much higher and comparable to those seen at C4 and C5. This suggest that the nucleosomes across the >PRE>dppWE-TZ transgene are predominantly tri-methylated at H3K27 (Laprell, 2017).
Excision of the >PRE> cassette from >PRE>dppWE-TZ transgenic animals by heat-shock induced expression of Flp during larval development results in appearance of β-galactosidase expression in the dpp pattern 12 hours after the heat shock. Efficiency of PRE excision was measured and it was found that 8 hours after a single 1-hour heat shock, excision had occurred in about 95% of wing imaginal disc cells. The delayed increase of β-galactosidase expression over time suggests a gradual rather than abrupt loss of repression. ChIP analyses were performed on chromatin prepared from batches of entire wing imaginal discs dissected from >PRE>dppWE-TZ transgenic animals 12, 32 or 56 hours after Flp-induction. This allowed monitoring the consequences of PRE excision in cells that had undergone at least one (+12 hours), at least two (+32 hours), or more than four (+56 hours) cell divisions. 12 hours after Flp-induction, H3K27me3 levels were at least two-fold reduced across the entire transgene and further reduced by at least two-fold at the 32 hour time point. 56 hours after Flp-induction, H3K27me3 levels across the transgene were nearly as low as in >dppWE-TZ animals derived from >dppWE-TZ germ cells. The histone H3 profile was unaltered at all time points, suggesting that PRE excision does not cause global disruption of nucleosome occupancy across the transgene. The loss of H3K27me3 after PRE excision suggests that PRC2 is unable to propagate H3K27me3 across the >dppWE-TZ transgene in the absence of PRE DNA (Laprell, 2017).
In parallel, Pc protein binding was monitored after PRE excision. Pc, unlike Ph or other PRC1 subunits, is not only bound at PREs but also associates with the chromatin flanking PREs likely reflecting its interaction with H3K27me3-modified nucleosomes. 12 hours after PRE excision, Pc binding at the transgene was already almost reduced to background levels (Laprell, 2017).
The H3K27me3 profile at the >PRE>dppWE-UZ transgene that contains a 4.1 kb fragment of the Ubx promoter instead of the hsp70 minimal promoter was then analyzed. At >PRE>dppWE-UZ, the H3K27me3 domain spans about 12 kb and is thus about 4 kb longer than at >PRE>dppWE-TZ. Nevertheless, after PRE excision, H3K27me3 at dppWE-UZ was lost at a rate comparable to that seen at dppWE-TZ. Ubx promoter DNA thus does not enable H3K27me3 propagation. It is concluded that even at a domain that spans 12 kb and therefore comprises about 60 nucleosomes, PRC2 is unable to propagate H3K27me3 in the absence of PRE DNA (Laprell, 2017).
The H3K27me3 profile and reporter gene repression was then analyzed after PRE excision in animals in which DNA replication had been blocked. Larvae were reared in liquid medium containing Aphidicolin, an inhibitor of DNA polymerases A and D, which resulted in a complete block of DNA replication in imaginal discs. In larvae reared in Aphidicolin-containing medium, Flp-induced PRE excision from >PRE>dppWE-TZ was as efficient as under normal growth conditions but 12 hours after excision, H3K27me3 levels at the transgene were undiminished compared to +PRE control larvae. In larvae reared in liquid medium without Aphidicolin, PRE excision resulted in the expected two-fold reduction of H3K27me3 levels after 12 hours. Together, this suggests that the loss of H3K27me3 nucleosomes after PRE excision in proliferating cells reflects their dilution as they become transmitted to DNA daughter strands during replication. Unlike under normal growth conditions, Aphidicolin-treated larvae lacked detectable β-galactosidase expression 12 hours after PRE excision. When these animals were permitted to recover in medium lacking Aphidocolin, they resumed DNA replication and began expressing β-galactosidase. If DNA replication is blocked and H3K27me3 levels stay high, repression is thus also sustained in the absence of PRE DNA, possibly by PRC1 (Laprell, 2017).
Finally, PRE excision was induced from >PRE>dppWE-TZ in larvae that were hemizygous for UtxΔ, a null mutation in the single H3K27me3 demethylase in Drosophila. 12 hours after Flp-induction, H3K27me3 levels at the transgene were reduced about two-fold, like in wild-type animals. This suggest that demethylation of H3K27me3 by Utx does not contribute to the disappearance of H3K27me3 from transgene chromatin after PRE excision (Laprell, 2017).
These results lead to the following conclusions. First, PRE cis-regulatory DNA provides the genetic basis not only for generating but also for propagating H3K27me3-modified chromatin. This argues against a simple epigenetic model where PRC2 binding to parental H3K27me3 nucleosomes after replication would suffice to propagate H3K27 tri-methylation in daughter strand chromatin. PRC2 needs to be recruited to PRE DNA first, before allosteric activation through interaction with H3K27me3 nucleosomes in flanking chromatin may then facilitate methylation of newly incorporated nucleosomes. Secondly, following PRE excision and replication, parental H3K27me3 nucleosomes remain associated with the same underlying DNA in daughter cells and thus provide epigenetic memory. However, while in replication-stalled cells high H3K27me3 levels permit to sustain repression also in the absence of PRE DNA, their dilution in proliferating cells is accompanied with loss of repression after one cell division. H3K27me3 nucleosomes therefore only appear to provide short-term epigenetic memory of the repressed state. Hence, DNA targeting of PRC2 after replication to replenish H3K27me3 is critical to preserve repression (Laprell, 2017).
Drosophila HOX and other large-size PcG target genes often contain multiple PREs and H3K27me3 domains that span dozens of kilobases. Deletion of single PREs from these genes typically results in only minor diminution of the H3K27me3 profile and misexpression is less severe than misexpression of the native genes in PcG mutants. Furthermore, when the same >PRE> cassette that was used in this study was excised from a Ubx-LacZ reporter gene with more extended Ubx upstream regulatory sequences, repression was lost with a longer delay\, suggesting that additional elements with PRE properties in those Ubx sequences permitted to sustain repression through more cell divisions. The evolution of PRE DNA sequences and of their frequency and arrangement within target genes may thus ultimately determine stability and heritability of H3K27me3 chromatin and PcG repression (Laprell, 2017).
O-GlcNAc Transferase (OGT/SXC) is essential for Polycomb repression suggesting that the O-GlcNAcylation of proteins plays a key role in regulating development. OGT transfers O-GlcNAc onto serine and threonine residues in intrinsically disordered domains of key transcriptional regulators; O-GlcNAcase (OGA) removes the modification. To pinpoint genomic regions that are regulated by O-GlcNAc levels, ChIP-chip and microarray analysis was performed after OGT or OGA RNAi knockdown in S2 cells. After OGA RNAi, a genome-wide increase was observed in the intensity of most O-GlcNAc-occupied regions. In contrast, O-GlcNAc levels were strikingly insensitive to OGA RNAi at sites of polycomb repression. Microarray analysis suggested that altered O-GlcNAc cycling perturbed the expression of genes associated with morphogenesis and cell cycle regulation. A viable null allele of oga (ogadel.1) was produced in Drosophila allowing visualization of altered O-GlcNAc cycling on polytene chromosomes. Trithorax (Trx), Absent small or homeotic discs 1 (Ash1) and Compass member Set1 histone methyl-transferases were O-GlcNAc-modified in ogadel.1 mutants. The ogadel.1 mutants displayed altered expression of a distinct set of cell cycle related genes. These results show that the loss of Oga in Drosophila globally impacts the epigenetic machinery allowing O-GlcNAc accumulation on RNA Polymerase II and numerous chromatin factors including Trx, Ash1 and Set1 (Akin, 2016).
Polycomb group (PcG) complexes PRC1 and PRC2 are well known for silencing specific developmental genes. PRC2 is a methyltransferase targeting histone H3K27 and producing H3K27me3, essential for stable silencing. Less well known but quantitatively much more important is the genome-wide role of PRC2 that dimethylates approximately 70% of total H3K27. H3K27me2 occurs in inverse proportion to transcriptional activity in most non-PcG target genes and intergenic regions and is governed by opposing roaming activities of PRC2 and complexes containing the H3K27 demethylase UTX. Surprisingly, loss of H3K27me2 results in global transcriptional derepression proportionally greatest in silent or weakly transcribed intergenic and genic regions and accompanied by an increase of H3K27ac and H3K4me1. H3K27me2 therefore sets a threshold that prevents random, unscheduled transcription all over the genome and even limits the activity of highly transcribed genes. PRC1-type complexes also have global roles. Unexpectedly, a pervasive distribution of histone H2A ubiquitylated at lysine 118 (H2AK118ub) was found outside of canonical PcG target regions, dependent on the RING/Sce subunit of PRC1-type complexes. It was shown, however, that H2AK118ub does not mediate the global PRC2 activity or the global repression and is predominantly produced by a new complex involving L(3)73Ah, a homolog of mammalian PCGF3 (Lee, 2016).
Mitosis brings about major changes to chromosome and nuclear structure. This study used recently developed proximity ligation assay-based techniques to investigate the association with DNA of chromatin-associated proteins and RNAs in Drosophila embryos during mitosis. All groups of tested proteins, histone-modifying and chromatin-remodeling proteins and methylated histones remained in close proximity to DNA during all phases of mitosis. RNA transcripts were found to be associated with DNA during all stages of mitosis. Reduction of H3K27me3 levels or elimination of RNAs had no effect on the association of the components of PcG and TrxG complexes to DNA. Using a combination of proximity ligation assay-based techniques and super-resolution microscopy, he number of protein-DNA and RNA-DNA foci was found to undergo significant reduction during mitosis, suggesting that mitosis may be accompanied by structural re-arrangement or compaction of specific chromatin domains (Black, 2016).
Eukaryotic DNA replicates asynchronously, with discrete genomic loci replicating during different
stages of S phase. Drosophila larval tissues undergo endoreplication without cell division, and the
latest replicating regions occasionally fail to complete endoreplication, resulting in
underreplicated domains of polytene chromosomes. This study shows that linker histone H1 is required for the underreplication (UR) phenomenon
in Drosophila salivary glands. H1 directly interacts with the
Suppressor of UR (SUUR) protein and is
required for SUUR binding to chromatin in vivo. These observations implicate H1 as a critical factor
in the formation of underreplicated regions and an upstream effector of SUUR. It was also
demonstrated that the localization of H1 in chromatin changes profoundly during the endocycle. At
the onset of endocycle S (endo-S) phase, H1 is heavily and specifically loaded into late replicating
genomic regions and is then redistributed during the course of endoreplication. The data suggest
that cell cycle-dependent chromosome occupancy of H1 is governed by several independent processes.
In addition to the ubiquitous replication-related disassembly and reassembly of chromatin, H1 is
deposited into chromatin through a novel pathway that is replication-independent, rapid, and
locus-specific. This cell cycle-directed dynamic localization of H1 in chromatin may play an
important role in the regulation of DNA replication timing (Andreyeva, 2017).
This study demonstrated that virtually all major sites of UR throughout the Drosophila genome exhibit a
substantial increase in salivary gland DNA copy number upon depletion of the linker histone H1, thus implicating H1 in the regulation of endoreplication. In control knockdown salivary glands, 46 underreplicated domains were identified. While these regions are in general agreement with previous efforts
to map underreplicated domains by less sensitive microarray analyses, fewer underreplicated sites were identified than a recent report that used high-throughput
sequencing of salivary gland DNA (Yarosh, 2014). Notably, the underreplicated domains
that the current analyses failed to detect represent sites with the weakest degree of UR. One possible
source of variation is the distinct technical approach that was used compared with Yarosh (2014), as simultaneous sequencing of a nonpolytenized (embryonic) genome as a means to
normalize the reads from underrepresented sequences in polytenized tissues (Yarosh, 2014) likely provides additional sensitivity. Another potential explanation could lie in the
relative sequencing depth of the respective assays (approximately fourfold lower in the current study),
considered crucial for the analyses of next-generation sequencing data. However, this explanation is less likely, as subsampling of the current reads to much lower
depths yielded no appreciable difference in the number and location
of identified underreplicated sites or the change in copy number upon H1 knockdown (Andreyeva, 2017).
On average, a moderate knockdown of H1 led to an ~50% copy number gain at the center of
underreplicated domains in intercalary heterochromatin (IH; large dense bands scattered in euchromatin comprising clusters of repressed genes. The copy number is not restored to the same degree as that
in a SuUR genetic mutant. The difference is
likely attributable to the incomplete depletion of H1. In fact, in an independent biological
validation experiment that resulted in an ~95% depletion of H1, an almost complete restoration of
copy number was observed. The observation of an almost complete reversal of UR
in cells depleted of H1 (but still wild type for SuUR) strongly suggests an epistatic mechanism of
action in which both H1 and SUUR act together in the same biochemical pathway (Andreyeva, 2017).
This study found that H1 and SUUR are also involved in UR of PH. For instance, both the mapped pericentric
regions and TE sequences, which are
highly abundant in pericentric regions, exhibit an
increase of DNA copy number upon H1 knockdown. The SuURES mutation also results in a
robust loss of UR at PH, as measured by changes in DNA copy number at TEs. The abrogation of H1
expression gives rise to a somewhat weaker effect on the UR of PH than that of IH, which is consistent with an almost complete elimination of SUUR protein from polytene
chromosome arms in salivary glands depleted of H1 by RNAi but the persistence of residual SUUR at
their PH. The role of H1 in maintaining the underreplicated state of PH may be relevant to
its important regulatory functions in constitutive heterochromatin, where it recruits Su(var)3-9,
facilitates H3K9 methylation, and maintains TEs in a transcriptionally repressed state. Recently, it was proposed that TE repression in ovarian somatic cells involves an H3K9
methylation-independent process through recruitment of H1 by Piwi-piRNA complexes, resulting in
reduced chromatin accessibility. The current results also implicate UR of TE sequences
in polytenized cells as yet another putative mechanism that contributes to regulation of their
expression. Interestingly, it was shown previously that double mutants encompassing both the
Su(var)3-9 and SuUR mutant alleles exhibit a synthetically increased predominance of novel
band-interband structures at PH compared with the mutation of SuUR alone.
While the evidence suggests a relationship between UR and transcriptionally repressive epigenetic
states, such as H3K9 methylation, the nature of this relationship remains largely speculative (Andreyeva, 2017).
This study demonstrated that SUUR
protein physically interacts with H1 in both a complex mixture of whole-cell extracts that contain
endogenous native H1 and recombinant purified H1 polypeptides. Furthermore, the particular structural domains of the two proteins were delimited that are required for the interaction. SUUR protein contains several sequence features that have
been implicated in regulation of UR and binding to specific proteins. Although SUUR possesses a
putative bromodomain, it contains no identifiable DNA-binding domain, so the
mechanism that allows SUUR to exhibit a preference for specific genomic underreplicated loci is
unknown. The positively charged central region is both necessary and sufficient to interact with
heterochromatin protein 1a (HP1a), which suggests a possible involvement of
HP1a in tethering SUUR to H3K9me2/3-rich PH. However, the specific localization of SUUR to
underreplicated IH, which is not enriched for H3K9me2/3, remains enigmatic. This study
now demonstrates that the central region of SUUR is also sufficient for binding directly to H1 in
vitro. Considering that the central region of SUUR is essential for the faithful localization of the
protein to chromatin in vivo, including underreplicated IH, it seems likely that H1 directly mediates the tethering of SUUR to chromatin in underreplicated regions (Andreyeva, 2017).
The tripartite structure of H1 provides multiple binding interfaces for interacting proteins and
thus allows H1 to mediate several biochemically separable functions in vivo. For
instance, the globular domain and proximal 25% of the CTD are required for H1 loading into
chromatin, while the proximal 75% of the CTD is needed for normal polytene morphology, H3K9
methylation, and physical interactions with Su(var)3-9. This study discovered a previously
unknown function for the distal 25% of the H1 CTD, which is shown to be essential for binding to
SUUR. Deletion of this region of H1 results in a near-complete loss of the interaction with SUUR. Thus, in addition to its critical functions in heterochromatin structure and activity,
the CTD of H1 is likely also important in facilitating UR (Andreyeva, 2017).
One of the most
striking findings in this study is the observation that the genomic occupancy of H1 undergoes
profound changes during the endoreplication cycle. It also remains largely mutually exclusive with
that of DNA polymerase clamp loader PCNA, which is consistent with the observed depletion
of H1 in nascent chromatin compared with mature chromatin (Andreyeva, 2017).
H1 is heavily loaded into late replicating loci at the onset of replication (when these loci are
silent for replication). Combined, the current observations indicate that the chromosome distribution of H1
during the endocycle is governed by at least three independent processes. Two
of them [replication-dependent (RD) eviction of H1 and RD deposition of H1 after the passage of
replication fork] are related to the well-recognized obligatory processes of chromatin disassembly
and reassembly during replication. The third
pathway, which directs early deposition of H1 into late replicating loci, has not been described
previously. This process is (1) replication-independent (RI); (2) locus-specific, with a strong
preference for late replicating sites; and (3) apparently more rapid than the RD deposition of H1,
since very high levels of H1 occupancy are observed in all nuclei immediately after the initiation
of endo-S. It is possible that the RI pathway of H1 loading into chromatin is mediated by a
selective recruitment of H1 based on epigenetic core histone modification-dependent mechanisms. For
instance, mammalian H1.2 was reported to recognize H3K27me3, and this modification
is very abundant in IH (Sher et al. 2012) (Andreyeva, 2017).
Also, the RI mechanism for deposition of H1 probably does not involve de novo nucleosome assembly,
as H1 is known to exhibit a mutually exclusive distribution with RI core histone variants, and there is no known nuclear process during early S phase that requires
core histone turnover. In the future, it will be interesting to further confirm that RI nucleosome
assembly does not take place during early replication in salivary gland polytene chromosomes.
Finally, the locus-specific RI deposition of H1 in early endo-S chromatin may be conserved in the
normal S phase of diploid tissues, and it will require independent experimentation with sorted
mitotically dividing cells to confirm this possibility (Andreyeva, 2017).
This study also provides cytological evidence that
the functions of H1 and SUUR are biochemically linked. Specifically, it was demonstrated that SUUR
localizes to a subset of H1-positive bands and requires H1 for its precise distribution
in polytene chromosomes, nuclear localization, and stability in salivary gland
cells. These observations implicate H1 as an upstream effector of SUUR functions in vivo
and an essential component of the biological pathway that maintains loci of reduced ploidy in
polytenized cells. Importantly, this finding adds to a growing list of biochemical partners of H1
that mediate their chromatin-directed functions in an H1-dependent fashion (Andreyeva, 2017).
Interestingly, even a moderate depletion of H1 (to ~30% of normal) results in a complete removal of
SUUR from chromosome arms. Thus, H1-dependent localization of SUUR
requires high concentrations of the linker histone in chromatin. This conclusion is also consistent
with SUUR colocalization with polytene loci that are the most strongly stained for H1.
In contrast, elimination of the H3K9me2 mark from polytene spreads requires very extensive depletion
of H1, whereas the moderate depletion of H1 does not strongly affect H3K9
dimethylation in the chromocenter or polytene arms. Therefore, the robust
effect of even moderate H1 depletion on SUUR localization in chromatin is unlikely to be mediated
indirectly through disorganization of heterochromatin structure (Andreyeva, 2017).
Unexpectedly, the cell cycle-dependent temporal pattern of H1 localization is not identical
to that of SUUR. In contrast to H1, SUUR protein (1) is only weakly present in IH during early
endo-S phase, (2) achieves the maximal occupancy at IH loci only in the late endo-S, and (3)
colocalizes with PCNA at certain sites. The observations made in this
study and in previous works can be summarized in the following model for H1-mediated regulation of
SUUR association with chromatin. The initiation of the deposition of SUUR in
chromosomes is strongly dependent on H1. More specifically, SUUR is preferentially localized to
chromatin domains that are highly enriched for H1. For instance, the tremendously elevated
concentration of H1 in IH of early endo-S cells promotes and nucleates the initiation of deposition
of SUUR into these regions. However, the pattern of SUUR occupancy at these sites does not occur
temporally in parallel with that of H1. Initially, the exceptionally high abundance of H1 in late
replicating loci during early endo-S is not paralleled by a simultaneous comparable increase of SUUR
occupancy. Rather, loading of SUUR into these sites lags
significantly behind H1 occupancy. Thus, the rate of SUUR localization to H1-rich IH appears to be
much slower than that of the RI deposition of H1 into these loci. After the initial recruitment,
further loading of SUUR does not require H1, and SUUR continues (in a slower fashion) to accumulate
at IH throughout the endo-S phase even when H1-enriched domains dissipate in the course of DNA
endoreplication. The additional loading of SUUR in chromatin is likely facilitated by its
self-association through dimerization of the N terminus and physical
interactions with the replication fork, as proposed previously. In this
fashion, SUUR achieves its maximal concentration in IH loci by the late endo-S (Andreyeva, 2017).
This study has demonstrated that H1 has a pivotal
function in the establishment of UR of specific IH loci in polytenized salivary gland cells. The
findings that H1 interacts directly with SUUR in vitro and is required for SUUR localization to late
replicating IH in polytene chromosomes in vivo strongly suggest that the H1-mediated recruitment of
SUUR promotes UR by obstructing replication fork progression in its cognate underreplicated loci but
does not affect replication origin firing. However, the remarkable temporal
pattern of H1 distribution in endoreplicating polytene chromosomes suggests that it may also play a
direct SUUR-independent role in regulation of endoreplication. This is especially plausible
considering that the temporal distribution patterns of SUUR and H1 are dissimilar (Andreyeva, 2017).
In contrast to the role of SUUR in slowing down the replication fork progression during late endo-S
phase, H1 (acting in the absence of SUUR during early endo-S) may function to repress the initiation
of endoreplication, as proposed in several studies. DNA-seq analyses also suggest this mechanism. Compared with the relatively smooth, flat profiles of
DNA copy numbers in SuURES mutant salivary glands, the profiles in
H1-depleted cells exhibit a jagged, uneven appearance, indicative of aberrant local
initiation of replication. Unfortunately, the experimental system (cytological analyzes of salivary
glands) cannot be used to further confirm this idea. First, an extensive depletion of H1 results in
the loss of polytene morphology; second, since the staging of endo-S progression is
based on PCNA staining, a spurious activation of ectopic replication origins would result in an
incorrect calling of the stage. To further complicate these analyses, polytenized cells are not
amenable to other methods of cell cycle staging, such as fluorescence-activated cell sorting (FACS).
In the future, it will be important to examine the role of H1 in regulation of DNA replication
timing in sorted Drosophila diploid cells (Andreyeva, 2017).
Post-translational modifications (PTMs) of core histones are important epigenetic determinants that correlate with functional chromatin states. Thus study addresses the function of PTMs in Drosophila that encodes a single somatic linker histone, dH1. It has been reported that dH1 is dimethylated at K27 (dH1K27me2). This study shows that dH1K27me2 is a major PTM of Drosophila heterochromatin. At mitosis, dH1K27me2 accumulates at pericentromeric heterochromatin, while, in interphase, it is also detected at intercalary heterochromatin. ChIPseq experiments show that >98% of dH1K27me2 enriched regions map to heterochromatic repetitive DNA elements, including transposable elements, simple DNA repeats and satellite DNAs. Moreover, expression of a mutated dH1K27A form, which impairs dH1K27me2, alters heterochromatin organization, upregulates expression of heterochromatic transposable elements and results in the accumulation of RNA:DNA hybrids (R-loops) in heterochromatin, without affecting H3K9 methylation and HP1a binding. The pattern of dH1K27me2 is H3K9 methylation independent, as it is equally detected in flies carrying a H3K9R mutation, and is not affected by depletion of Su(var)3-9, HP1a or Su(var)4-20. Altogether these results suggest that dH1K27me2 contributes to heterochromatin organization independently of H3K9 methylation (Bernues, 2022).
Transgenerational epigenetic inheritance (TEI) describes the transmission of alternative functional states through multiple generations in the presence of the same genomic DNA sequence. Very little is known about the principles and the molecular mechanisms governing this type of inheritance. In this study, by transiently enhancing 3D chromatin interactions, stable and isogenic Drosophila epilines were established that carry alternative epialleles, as defined by differential levels of Polycomb-dependent trimethylation of histone H3 Lys27 (forming H3K27me3). After being established, epialleles can be dominantly transmitted to naive flies and can induce paramutation. Importantly, epilines can be reset to a naive state by disruption of chromatin interactions. Finally, it was found that environmental changes modulate the expressivity of the epialleles, and this paradigm was extended to naturally occurring phenotypes. This work sheds light on how nuclear organization and Polycomb group (PcG) proteins contribute to epigenetically inheritable phenotypic variability (Ciabrelli, 2017).
Developmental gene expression is tightly regulated through enhancer elements, which initiate dynamic
spatio-temporal expression, and Polycomb response elements
(PREs), which maintain stable gene silencing. These two cis-regulatory functions are thought to
operate through distinct dedicated elements. By examining the occupancy of the Drosophila pleiohomeotic repressive complex (PhoRC) during embryogenesis,
extensive co-occupancy was revealed at developmental enhancers. Using an established in vivo assay
for PRE activity, it was demonstrated that a subset of characterized developmental enhancers can
function as PREs, silencing transcription in a Polycomb-dependent manner. Conversely, some classic
Drosophila PREs can function as developmental enhancers in vivo, activating spatio-temporal
expression. This study therefore uncovers elements with dual function: activating transcription in
some cells (enhancers) while stably maintaining transcriptional silencing in others (PREs). Given
that enhancers initiate spatio-temporal gene expression, reuse of the same elements by the Polycomb
group (PcG) system may help fine-tune gene expression and ensure the timely maintenance of cell
identities (Erceg, 2017).
While enhancers initiate spatio-temporal transcriptional activity, PREs maintain a previously determined transcriptional state of their target genes, thus leading to transcriptional memory. PREs are generally thought to be dedicated solely to gene silencing and not to contain enhancer-like features to activate gene expression. This study presents evidence to the contrary, that both functions can be encoded in the same cis-regulatory element, depending on the cellular context. This is not a rare event -- almost 25% of PhoRC occupancy is at developmental enhancers. Of the 16 elements that this study tested experimentally (either enhancers for PRE activity or PREs for enhancer activity), nine have dual function, being sufficient to activate transcription in a specific spatio-temporal pattern and mediate PcG-dependent silencing in vivo (Erceg, 2017).
These dual elements have interesting implications for transcriptional regulation during embryonic development. First, at the level of PcG protein recruitment, this subset of enhancers is highly enriched in the Pho motif, which distinguishes them from other developmental enhancers. This suggests that the recruitment of Pho to PhoRC enhancers is direct via sequence-specific DNA binding, consistent with an instructive model of recruitment, although other factors are likely involved. PcG proteins and developmental TFs bind in close proximity to each other within the same element (a single DNase hypersensitive site), raising the possibility of direct interplay between the two. The results
indicate that the activity of PhoRC-bound enhancers is dominated by tissue-specific TFs that activate transcription in some cells while being dominated by a functional PcG complex in other cells. Is this due to mutually exclusive occupancy of developmental TFs and PcG proteins in different tissues, or do they compete functionally at these elements? The dramatic derepression of enhancer activity in different cell types upon PcG protein removal suggests that other tissue-specific TFs must occupy these enhancers in the PcG silenced cell. This has interesting implications for enhancer activity, as it is well known that TFs bind to thousands of sites (tens of thousands in mammalian cells), but only a subset of associated target genes changes expression when the TF is removed. This has led to the general assumption that the majority of binding events is nonfunctional or neutral. These data suggest that at least a subset of this embryonic occupancy can be functional if not actively antagonized by the presence of PcGs (Erceg, 2017).
Second, enhancer-mediated polycomb recruitment has interesting implications for the mechanism of PcG-mediated silencing. The current models suggest that PcG proteins silence transcription mainly by silencing a gene's promoter, in keeping with PcG recruitment to CpG islands in vertebrates, or by coordinating a three-dimensional repressive topology, where the entire gene's locus is silenced. In either mode, a gene's promoter would not be permissive to enhancer activation. The data suggest that there may be a third mode of very local silencing at an individual enhancer, leaving the promoter and the rest of the gene's regulatory landscape open for activation by other enhancers, as was observed at the prat2 locus. This would allow for much more fine-tuning of silencing in individual tissues and stages. It also suggests that PcG proteins could play a more dynamic role, similar to a 'standard' transcriptional repressor at enhancers (Erceg, 2017).
Third, this may have broader implications for cell fate decisions during rapid developmental transitions. When multipotent cells become specified into different lineages, a specific transcriptional program often needs to be activated in one cell while being repressed in other cells from the same progenitor population. Having active enhancers in the precursor cells remain accessible to directly recruit the PcG complexes would ensure that these enhancers become silenced in a timely manner. Conversely, having maternally deposited PcG proteins already bound to enhancers early in development may serve as placeholders to ensure that these dual elements remain open and available for TFs to activate at the appropriate development stage. Interestingly, in the majority of the tested cases, PcG proteins and developmental TFs use these dual elements to regulate the same target gene, the vast majority of which is key developmental regulators of cell identity (Erceg, 2017).
The identification of PREs in other species has remained a key challenge, with only a handful of PREs identified in mammals and plants to date. In mammals, the PcG system is recruited to inactive CpG islands, with few specific sequence features. Although there are mammalian homologs of the Drosophila Pho and dSfmbt proteins, Yin Yang 1 (YY1) and SFMBT, respectively, the conservation of PhoRC as a complex and its involvement in mammalian PcG silencing remain unclear. It is proposed that such dual enhancers/PREs will also exist in mammals, although, given this apparent lack of conservation of YY1 function, their mechanism of PcG recruitment may have diverged (Erceg, 2017).
The Polycomb group (PcG) and Trithorax group (TrxG) proteins are key epigenetic regulators controlling the silenced and active states of genes in multicellular organisms, respectively. While precise mechanisms of the PcG/TrxG protein recruitment remain unknown, the important role is suggested to belong to sequence-specific DNA-binding factors. At the same time, it was demonstrated that the PRE DNA-binding proteins are not exclusively localized to PREs but can bind other DNA regulatory elements, including enhancers, promoters, and boundaries. To gain an insight into the PRE DNA-binding protein regulatory network, in this study, differences in abundance of the Combgap, Zeste, Psq, and Adf1 PRE DNA-binding proteins were sought. While there were no conspicuous differences in co-localization of these proteins with other functional transcription factors, it was shown that Combgap and Zeste are more tightly associated with the Polycomb repressive complex 1 (PRC1), while Psq interacts strongly with the TrxG proteins, including the BAP SWI/SNF complex. The Adf1 interactome contained Mediator subunits as the top interactors. In addition, Combgap efficiently interacted with AGO2, NELF, and TFIID. Combgap, Psq, and Adf1 have architectural proteins in their networks. This study further investigated the existence of direct interactions between different PRE DNA-binding proteins and demonstrated that Combgap-Adf1, Psq-Dsp1, and Pho-Spps can interact in the yeast two-hybrid assay (Chetverina, 2022).
Polycomb-mediated repression of gene expression is essential for development, with a pivotal role played by trimethylation of histone H3 lysine 27 (H3K27me3), which is deposited by Polycomb Repressive Complex 2 (PRC2). The mechanism by which PRC2 is recruited to target genes has remained largely elusive, particularly in vertebrates. This study demonstrates that MTF2, one of the three vertebrate homologs of Drosophila melanogaster Polycomblike, is a DNA-binding, methylation-sensitive PRC2 recruiter in mouse embryonic stem cells. MTF2 directly binds to DNA and is essential for recruitment of PRC2 both in vitro and in vivo. Genome-wide recruitment of the PRC2 catalytic subunit EZH2 is abrogated in Mtf2 knockout cells, resulting in greatly reduced H3K27me3 deposition. MTF2 selectively binds regions with a high density of unmethylated CpGs in a context of reduced helix twist, which distinguishes target from non-target CpG islands. These results demonstrate instructive recruitment of PRC2 to genomic targets by MTF2 (Perino, 2018).
A ten-eleven translocation (TET) ortholog exists as a DNA N(6)-methyladenine (6mA) demethylase (DMAD) in Drosophila. However, the molecular roles of 6mA and DMAD remain unexplored. Through genome-wide 6mA and transcriptome profiling in Drosophila brains and neuronal cells, this study found that 6mA may epigenetically regulate a group of genes involved in neurodevelopment and neuronal functions. Mechanistically, DMAD interacts with the Trithorax-related complex protein Wds to maintain active transcription by dynamically demethylating intragenic 6mA. Accumulation of 6mA by depleting DMAD coordinates with Polycomb proteins and contributes to transcriptional repression of these genes. These findings suggest that active 6mA demethylation by DMAD plays essential roles in fly CNS by orchestrating through added epigenetic mechanisms (Yao, B. 2018).
During central nervous system (CNS) development, spatiotemporal gene expression programs mediate specific lineage decisions to generate neuronal and glial cell types from neural stem cells (NSCs). However, little is known about the epigenetic landscape underlying these highly complex developmental events. This study performed ChIP-seq on distinct subtypes of Drosophila FACS- purified neural stem cells (NSCs) and their differentiated progeny to dissect the epigenetic changes accompanying the major lineage decisions in vivo By analyzing active and repressive histone modifications, this study shows that stem cell identity genes are silenced during differentiation by loss of their activating marks and not via repressive histone modifications. This analysis also uncovers a new set of genes specifically required for altering lineage patterns in type II neuroblasts, one of the two main Drosophila NSC identities. Finally, it was demonstrated that this subtype specification in NBs, unlike NSC differentiation, requires Polycomb-group (PcG)-mediated repression (Abdusselamoglu, 2019).
During development of the central nervous system (CNS), neural stem cells (NSCs) divide asymmetrically to generate daughter cells with self-renewing capacity but also complex neurogenic and gliogenic lineages. Regulation of this process requires tight and highly dynamic control of multiple cell fate decisions. For cells to commit to their ultimate cell identity, spatiotemporal gene expression programs are required. It is assumed that activation of lineage-specific genes and silencing of stem cell genes is accompanied by changes in chromatin states. How histone modifications change over time during neurogenesis in vivo, however, is not very well described (Abdusselamoglu, 2019).
The Drosophila larval CNS has become a key model for the fundamental mechanisms underlying brain development and chromatin states. The larval CNS is populated by distinct types of NSCs, or neuroblasts (NBs), which vary in abundance, neuronal output and division mode. Together, these NBs give rise to the majority of the neurons of the adult brain. The majority of the central brain NBs are of type I (NBIs). Each NBI gives rise to another NBI and a ganglion mother cell (GMC), which divides once more to generate two differentiated neurons or glia. Type II NBs (NBIIs) are a rare subpopulation with only eight NBII per brain lobe. Unlike NBIs, NBIIs divide into one NBII and one transit-amplifying cell called an intermediate neural progenitor (INP). NBIIs generate many more neurons, because INPs continue to divide asymmetrically five or six times, each time giving rise to a GMC that divides into two neurons or glia cells. Other than lineage structure and size, cell markers can also be used to differentiate NB subtypes. Whereas NBIs express both Asense (Ase) and Deadpan (Dpn), NBIIs only express Dpn. During neurogenesis, both NB subtypes divide asymmetrically to give rise to their respective progeny. Brain tumors form if the asymmetric segregation of cell fate determinants during NB cell division is disrupted. Among these determinants are the TRIM-NHL protein Brain tumor (Brat) and the Notch inhibitor Numb. Although Brat depletion results in the generation of ectopic NBII-like tumor NBs (tNBs) at the expense of differentiated brain cells, simultaneous loss of Brat and Numb causes the NBI-like tNBs to overproliferate (Abdusselamoglu, 2019).
In many cell types, transitions in chromatin states are regulated by the evolutionarily conserved Polycomb group (PcG) and Trithorax group (TrxG) proteins. PcG and TrxG have emerged as antagonistic regulators that silence or activate gene expression, respectively. These multimeric protein complexes regulate the transcriptional state of genes by post-translationally modifying amino acid residues of histone tails. PcG proteins exert a repressive activity via two main complexes, the Polycomb repressive complexes 1 and 2 (PRC1 and PRC2). Although PRC1 and PRC2 can exist in various compositions and associate with context-specific accessory proteins, both have been shown to contain a specific core set of proteins including subunits with catalytic activity. Within PRC2, Enhancer of zeste [E(z) in Drosophila, EZH1/2 in mammals] catalyzes the trimethylation of lysine 27 on histone 3 (H3K27me3). H3K27me3 is recognized by PRC1, which in turn includes the histone H2A ubiquityltransferase Sce [RING1A (RING1) and RING1B (RNF2) in mammals]. Histone modifications associated with active transcription are deposited by TrxG proteins, which counteract repressive marks by histone acetylation or methylation, in particular by trimethylation of lysine 4 on histone H3 at active promoters (Abdusselamoglu, 2019).
Although well-known for their role in long-term transcriptional memory, PcG and TrxG complexes are highly dynamic during development and thus facilitate cellular plasticity. In the last decade, it has been shown that PcG and TrxG complexes are crucial for ensuring correct neurogenesis in mammals as well as in Drosophila. Despite the strength of genetic in vivo experiments, however, global analysis of the histone modifications underlying their function, and therefore target genes, has mainly been performed in vitro. This constitutes a real knowledge gap, as recent studies have demonstrated that the chromatin states may vary significantly between in vivo tissues and their related in vitro cell lines, mainly owing to culture conditions. Given also that epigenetic changes are highly context- and developmental time-dependent, providing in vivo datasets to investigate chromatin states of different cell types in complex tissues will increase understanding of how the epigenetic landscape dynamically defines cellular states (Abdusselamoglu, 2019).
In recent years, in vivo studies made use of Drosophila to shed light on the dynamics of chromatin state changes during embryonic neural differentiation and during larval stages. Profiling the binding of chromatin remodelers has highlighted the plasticity of chromatin states during differentiation. Although binding of chromatin factors is associated with active or repressive chromatin, binding does not necessarily reflect downstream histone modifications. For example, the histone marks can change drastically between parasegments of the Drosophila embryo, whereas the occupancy of PcG proteins remains unchanged. Thus, investigating the dynamics of chromatin states based on chromatin marks is crucial for understanding the functional specialization of cells during development. Moreover, how PcG/TrxG complexes target genes on the chromatin level between different subtypes of progenitor cells during neuronal differentiation or tumorigenic transformation has remained elusive (Abdusselamoglu, 2019).
This study has used the Drosophila larval CNS to track in vivo changes of histone modifications not only upon differentiation, but also between different populations of NSCs and their tumorigenic counterparts. A fluorescence-activated cell sorting (FACS)-based method was developed to sort different cell types and perform ChIP-seq for the active histone mark, H3K4me3, and the repressive mark, H3K27me3. The FACS-based approach provides an in vivo dataset that reveals dynamic histone modifications during neuronal differentiation. In particular, it was observed that self-renewal and cell-division genes are repressed independently of H3K27me3 levels. In contrast, it was further shown that H3K27me3-mediated repression is crucial for silencing lineage-specific stem cell factors, including known factors as well as a new set of genes that are specific to NBIIs. Finally, genetic evidence is presented for the requirement of these new NBII-specific factors for self-renewal and demonstrate the role of PcG complexes in defining different subtypes of neural stem cells (Abdusselamoglu, 2019).
This study provides a resource of histone modification datasets for different types of NSCs and their differentiated progeny. In combination with chromatin accessibility and binding maps of chromatin remodelers of Drosophila brain cells, it is hoped that this dataset will serve as a useful community resource. During differentiation, stem cell identity genes are silenced in a PcG-independent manner, which supports previous findings showing that these genes are silenced through HP1-enriched chromatin. In addition, PcG-mediated silencing is unlikely to instruct the stepwise inactivation of stem cell genes during differentiation as loss of H3K27me3 did not induce ectopic NBs (Abdusselamoglu, 2019).
This study has taken advantage of in vivo genetic labeling to investigate chromatin dynamics of different NB subtypes. As the type II NBs are very lowly abundant, tumor NBs of type I and type II origins were used as a proxy in order to obtain enough material to be able to compare these two cell types. Each change was further validated by comparing tumor with healthy type I NBs and excluded artifacts present because of the tumorigenic state of the cells. The data show that both TrxG and PcG are required to establish NBII identity. A set of NBII-specific genes was identified, including previously identified btd and Sp1. Dll and eya, which are specifically required for NBII maintenance, were identified. It has been previously described that btd acts as an activator of Dll in the development of the ventral imaginal discs. This suggests that in NBII-identity specification the Trithorax-target btd could act together with Dll and eya. Such a mechanism would explain why the loss of btd causes a distinct phenotype compared with the loss of Dll and eya. Interestingly, an NBI to NBII conversion is observed only in 18% of NBIs ectopically overexpressing btd, indicating that either co-factors are missing or that the chromatin of btd targets is inaccessible. The data of NB subtype-specific genes being characterized by H3K27me3 repressive chromatin favor the latter. Therefore, as opposed to TrxG-activated stem cell and mitosis genes, the repression of NBII-specific genes is ensured by PcG-mediated H3K27me3 histone modifications, suggesting that Polycomb plays a role in defining the diversity of NSC lineages. Moreover, the data indicate that PcG repression is required not only for the silencing of HOX genes but also for the self-renewal capacity of NBs. Unlike TrxG, the loss of catalytic subunits of PcG complexes did not convert NBIIs to NBIs or vice versa. This suggests that NB subtype specification cannot be explained solely by an absence of repression but requires a further activation mechanism. Strikingly, loss of PcG complexes caused a significant decrease in the number of NBs. Interestingly, across all the cell types, developmental genes such as caudal, eve, peb, scr and slp1, as well as genes involved in embryonic NB temporal patterning [hb, kr, pdm (nub), cas and grh], are heavily marked with H3K27me3. It is therefore possible that PcG-mediated repression is required to silence these developmentally crucial genes in addition to the Hox genes. Thus, the observed reduction in NB stemness might be caused by the de-repression of these genes (Abdusselamoglu, 2019).
Besides an overall decrease in NB maintenance, an increased sensitivity of NBII lineages to a reduction in PRC2 activity was observed. Interestingly, opa and ham, two previously described temporal switch genes in NBII lineages, are also enriched in H3K27me3 marks in NBs. Opa and ham are expressed in the immediate NBII progeny, the INPs, and ectopic expression of these genes limits self-renewal of NBIIs and causes NBIIs to disappear. Even though these two genes are heavily marked with H3K27me3 across all NB samples, NBIIs might be specifically sensitive to PRC2 depletion because they could be more receptive to premature de-repression of genes, the expression of which is normally restricted NBII progeny only (Abdusselamoglu, 2019).
In the future, investigating the downstream targets of PcG in different NB subtypes could reveal the underlying mechanisms of subtype specification. In conclusion, these data provide a useful resource to investigate how chromatin state dynamics orchestrate the diversity and correct progression of NSC lineages (Abdusselamoglu, 2019).
Polycomb Group (PcG) proteins form memory of transient transcriptional repression that is necessary for development. In Drosophila, DNA elements termed Polycomb Response Elements (PREs) recruit PcG proteins. How PcG activities are targeted to PREs to maintain repressed states only in appropriate developmental contexts has been difficult to elucidate. PcG complexes modify chromatin, but also interact with both RNA and DNA, and RNA is implicated in PcG targeting and function. This study shows that R-loops, three-stranded nucleic acid structures formed when an RNA hybridizes to a complementary DNA strand, thereby displacing the second DNA strand, form at many PREs in Drosophila embryos, and correlate with repressive states. In vitro, both PRC1 and PRC2 can recognize R-loops and open DNA bubbles. Unexpectedly, this study found that PRC2 [E(z), Esc and Su(z)12] drives formation of RNA-DNA hybrids, the key component of R-loops, from RNA and dsDNA. These results identify R-loop formation as a feature of Drosophila PREs that can be recognized by PcG complexes, and RNA-DNA strand exchange as a PRC2 activity that could contribute to R-loop formation (Alecki, 2020).
During Drosophila embryogenesis, transiently expressed transcription factors activate homeotic (Hox) genes in certain regions of the embryo and repress them in others to dictate the future body plan. Polycomb Group (PcG) proteins form a memory of these early cues by maintaining patterns of Hox gene repression for the rest of development. This paradigm for transcriptional memory is believed to be used by the PcG at many genes in Drosophila, and to underlie the conserved and essential functions of PcG proteins in cell differentiation and development from plants to mammals. Polycomb response elements (PREs) are DNA elements that can recruit PcG proteins, but they also recapitulate the memory function of the PcG-when combined with early acting, region-specific enhancers in transgenes, they maintain transgene repression in a PcG-dependent manner only in regions where the early enhancer was not active. PREs contain a high density of binding sites for transcription factors that can recruit PcG proteins through physical interactions. However, the widespread expression, binding pattern, and properties of factors that bind PREs cannot explain how PREs can exist in alternate, transcription-history dependent states to maintain restricted patterns of gene expression, or how they can switch between states. Furthermore, DNA sequences with PRE-like properties have been difficult to identify in other species despite the conservation of PcG complexes, their biochemical activities, and their critical roles in development (Alecki, 2020).
RNAs may provide context specificity to PcG protein recruitment and function. Some PREs, and some PcG-binding sites in mammalian and plant cells, are transcribed into ncRNA, while others reside in gene bodies, and thus are transcribed when the gene is expressed (Herzog, 2014). Both the direction and level of transcription have been correlated with the functional state of PREs. The PcG complex Polycomb Repressive Complex 2 (PRC2) has a well-described high affinity for RNA. RNA is suggested to recruit PRC2 to specific chromatin sites13, but RNA binding can also compete for chromatin binding and inhibit PRC2 activity. One way for RNA to interact with the genome is by the formation of R-loops, three-stranded nucleic acid structures formed when an RNA hybridizes to a complementary DNA strand, thereby displacing the second DNA strand. R-loops have been linked to regulation of transcription and chromatin previously, through a variety of mechanisms. This includes links to PcG regulation in mammalian cells. The formation of R-loops over genes with low to moderate expression is associated with increased PcG binding and H3K27 trimethylation (H3K27me3) in human cells and R-loops have recently been implicated in promoting PRC1 and PRC2 recruitment in mammalian cells, although other evidence suggests they antagonize recruitment of PRC2. It is hypothesized that R-loop formation could biochemically link RNA to PcG-mediated silencing through PREs and tested this idea in the Drosophila system (Alecki, 2020).
This study identified R-loop forming sequencing in Drosophila embryos and S2 cells and observe that ~25% of PREs form R-loops. Interestingly, PREs that form R-loops are more likely to be bound by PcG proteins compared with PREs that do not form R-loops, suggesting that R-loops may be involved in PcG targeting. In vitro, PRC1 and PRC2 recognize R-loops and open DNA-bubbles. Further, when provided dsDNA and RNA, PRC2 induces the formation of RNA-DNA hybrids, the key components of R-loops. These data suggest a mechanism for RNA to contribute to targeting of PcG proteins via R-loop formation induced by the RNA-DNA strand exchange activity of PRC2 (Alecki, 2020).
The demonstration that PRC2 induces the formation of RNA-DNA hybrids in vitro, that PRC2 and PRC1 recognize R-loops in vitro, and that R-loops are present at PREs in vivo suggest a mechanistic model for how RNAs could induce or maintain the OFF state of PREs. If PREs (or the gene they control and in many cases are embedded in) are highly transcribed, the RNA could compete for PRC2 binding to chromatin, as has been demonstrated in vitro and in vivo. However, a lower level of transcription through a PRE (or transcription in an orientation that is favourable for R-loop formation) could allow R-loops to form, possibly via the RNA-DNA hybrid forming activity of PRC2. R-loop formation will repress additional RNA production by preventing RNA polymerase passage allowing recruitment of additional PRC2 (by PRE-binding transcription factors or interactions with other PcG proteins) and its retention on chromatin. PRC2 could then modify histones to maintain a repressive chromatin state. The R-loop, in conjunction with H3K27me3 and PRE-binding transcription factors, would also promote binding of PRC2 and PRC1. R-loops may also interfere with binding or function of proteins that promote the active state of PREs, although this remains to be tested. The data indicate that both coding and ncRNAs form R-loops. The regulation of these RNAs and therefore of R-loops could provide transcriptional memory and developmental context specificity to PcG recruitment by transcription factors that constitutively recognize PREs. A conceptually similar model for how high levels of RNA production at PREs could promote the ON state and low levels the OFF state has been proposed; R-loop formation provides one mechanism by which it can occur. Although this model is highly speculative at this time, it integrates many observations, and provides testable hypotheses (Alecki, 2020).
Observations in Drosophila are also consistent with a possible connection between R-loops and PcG function. The helicase Rm62 interacts genetically with both PcG and TrxG genes, and colocalizes with the PRE-binding protein Dsp1 on polytene chromosomes. Rm62 is the Drosophila homologue of the DDX5 helicase, which can unwind RNA-DNA hybrids in vitro and is implicated in R-loop resolution in vivo. A recent genome-wide RNAi screen for TrxG interacting genes (which should antagonize PcG function) identified the gene for RNaseH140. RNA has been suggested to be important in switching PREs between OFF and ON states, although this has been contested by experiments aiming to test whether transcription through a PRE can switch it to the active state. Resolution of R-loops by cellular RNases or RNA-DNA helicases could contribute to switching PRE states, which will be intriguing to test. It is also likely that even in the simple model suggested in the paper (see Model for the role of R-loop formation driven by PRC2 in PcG gene silencing), the levels of RNA corresponding to 'low' and 'high', and the strength of the effect will depend both on the genomic context and the sequences of the RNAs that are produced (Alecki, 2020).
R-loop formation is observed at ~30% of PREs; these may represent a specific class of PREs. Most R-loops are believed to form co-transcriptionally, so that R-loops would be predicted to depend on PRE transcription. Indeed, >70% of R-loops formed at PREs overlap an annotated coding or non-coding RNA, and PREs with R-loops are more likely to have RNA Pol II signal in ChIP-seq experiments. However, ~67% of PREs where R-loops were not observed also overlap an annotated transcript. Further, a fraction of PREs with R-loops (and a fraction of total R-loops) either do not overlap any annotated transcripts, or overlap a transcript in the opposite orientation as the R-loop. While some of these discrepancies likely reflect incomplete annotation of rare transcripts, they raise the intriguing possibility that the RNA used to form the R-loops could be supplied in trans. Careful analysis of the RNA component of R-loops at PREs will be needed to resolve this. Although speculative at this time, the ability of PRC2 to induce RNA-DNA hybrids could contribute to non-co-transcriptional R-loop formation (Alecki, 2020).
This study finds that PRC2 can induce RNA-DNA strand exchange from RNA and linear dsDNA in vitro. A small number of other proteins have been shown to have similar activity, using various types of substrates. These include the repair proteins Rad52/RecA and PALB2, the human capping enzyme (CE)50, the viral protein ICP8 and the telomere-binding protein TRF2. Like the activity of PRC2, none of these reactions require ATP hydrolysis (although R-loop formation by RecA is stimulated by ATPγS), and most use linear DNA substrates or an unpaired or ssDNA region. The exceptions are TRF2 and ICP8. ICP8 can mediate R-loop formation from an RNA and a supercoiled plasmid. TRF2 stimulates invasion of RNA oligos into a supercoiled plasmid encoding a telomeric DNA array, but the mechanism is believed to be induction of positive supercoiling by TRF2 that facilitates DNA unwinding and RNA invasion. RNA-DNA strand exchange has been investigated most closely for Rad52, and its homologue RecA. Rad52 has been shown both to carry out 'inverse strand exchange' where Rad52 first binds the dsDNA, allowing RNA strand exchange, and to use an RNA-bridging mechanism, in which Rad52 first binds the RNA, and can bridge two dsDNA fragments by forming RNA-DNA hybrids with segments of each of them. Both of these mechanisms are candidates to mediate RNA-mediated repair of DSBs. PRC2 requires a DNA end for RNA-DNA strand exchange in vitro; for this activity to occur in vivo, either a DNA break would be required, or PRC2 would need to be able to use DNA opened by (an)other factors, or by transcription. These requirements may limit PRC2 strand exchange activity at PREs. In order to fully understand the impact of this activity in vivo and to what extent PRC2 contributes to R-loop formation at PREs, additional experiments will be necessary. Interestingly, Topoisomerase II interacts with a subunit of PRC1, colocalizes with PcG proteins in the BX-C, and is implicated in PRE-mediated silencing; transient Topo II induced breaks have been implicated in regulation of transcription and chromatin compaction, and could also be used by PRC2. It is also possible that the activity of PRC2 contributes to RNA-DNA strand exchange at DNA breaks where RNA-DNA hybrids have been shown to form and where PRC2 is recruited (Alecki, 2020).
The connection between RNA and PRC2 has been recognized for some time, in species from plants to humans, but mechanisms beyond RNA binding by PRC2 have not previously been described. This discovery of PRC2-mediated RNA-DNA strand exchange, suggests one mechanism to connect RNA to PcG targeting and function (Alecki, 2020).
R-loops are involved in transcriptional regulation, DNA and histone post-translational modifications, genome replication and genome stability. To what extent R-loop abundance and genome-wide localization is actively regulated during metazoan embryogenesis is unknown. Drosophila embryogenesis provides a powerful system to address these questions due to its well-characterized developmental program, the sudden onset of zygotic transcription and available genome-wide data sets. This study measured the overall abundance and genome localization of R-loops in early and late-stage embryos relative to Drosophila cultured cells. It was demonstrated that absolute R-loop levels change during embryogenesis and that RNaseH1 catalytic activity is critical for embryonic development. R-loop mapping by strand-specific DRIP-seq reveals that R-loop localization is plastic across development, both in the genes which form R-loops and where they localize relative to gene bodies. Importantly, these changes are not driven by changes in the transcriptional program. Negative GC skew and absolute changes in AT skew are associated with R-loop formation in Drosophila. Furthermore, this study demonstrated that while some chromatin binding proteins and histone modifications such as H3K27me3 are associated with R-loops throughout development, other chromatin factors associated with R-loops in a developmental specific manner. These findings highlight the importance and developmental plasticity of R-loops during Drosophila embryogenesis (Munden, 2022).
Diffuse midline gliomas and posterior fossa type A ependymomas contain the recurrent histone H3 lysine 27 (H3 K27M; see Drosophila H3) mutation and express the H3 K27M-mimic EZHIP (CXorf67), respectively. H3 K27M and EZHIP are competitive inhibitors of Polycomb Repressive Complex 2 (PRC2) lysine methyltransferase activity. In vivo, these proteins reduce overall H3 lysine 27 trimethylation (H3K27me3) levels; however, residual peaks of H3K27me3 remain at CpG islands (CGIs) through an unknown mechanism. This study reports that EZHIP and H3 K27M preferentially interact with PRC2 that is allosterically activated by H3K27me3 at CGIs and impede its spreading. Moreover, H3 K27M oncohistones reduce H3K27me3 in trans, independent of their incorporation into the chromatin. Although EZHIP is not found outside placental mammals, expression of human EZHIP reduces H3K27me3 in Drosophila melanogaster through a conserved mechanism. These results provide mechanistic insights for the retention of residual H3K27me3 in tumors driven by H3 K27M and EZHIP (Jain, 2020).
Polycomb repressive complexes 1 and 2 have been historically described as transcriptional repressors, but recent reports suggest that PRC1 might also support activation, although the underlying mechanisms remain elusive. This study shows that stage-specific PRC1 binding at a subset of active promoters and enhancers during Drosophila development coincides with the formation of three-dimensional (3D) loops, an increase in expression during development and repression in PRC1 mutants. Dissection of the dachshund locus indicates that PRC1-anchored loops are versatile architectural platforms that persist when surrounding genes are transcriptionally active and fine-tune their expression. The analysis of mammalian RING1B binding profiles and 3D contacts during neural differentiation in mice suggests that this role is conserved in mammals (Loubiere, 2020).
Polycomb group proteins (PcG) assemble into two main epigenetic complexes called Polycomb repressive complexes 1 and 2 (PRC1 and PRC2), which are highly conserved across metazoans and collaborate at multiple levels to maintain their target genes in a repressed state. PRC1 also binds a subset of active promoters and enhancers devoid of the PRC2-mediated H3K27me3 repressive mark in both Drosophila and mammals. Loss-of-function experiments suggest that PRC1 might contribute to the transcriptional activation of a subset of genes. However, since PRC1 binds to a large number of sites, disentangling direct from indirect regulatory effects has been proven difficult, and the molecular mechanisms that might support transcriptional activation by PRC1 are obscure. This study tested whether PRC1 might mediate gene activation by forming enhancer-promoter loops, in addition to the repressive chromatin loops that were previously described (Loubiere, 2020).
PRC1 plays important roles during normal physiology and in cancer, but how it might play a dual silencing and an activating role is a matter of great interest. The data provided in this study suggest that, rather than behaving as transcriptional repressors, PRC1-mediated loops establish versatile architectural platforms that can induce repression and activation. In the absence of transcription factors, PRC1 might cooperate with PRC2 to establish repressive loops and to form silent Polycomb domains, whereas the binding of developmental transcription factors might exploit PRC1-dependent enhancer-promoter contacts to coordinate the timely induction of cognate genes during development. The net effect of looping appears to be gene specific since expression of the CG5888 gene is more sensitive to disruption of a PRC1-dependent loop than the neighboring dac gene. At dac, one of the PRC1 binding sites is within a few hundred base pairs (bp) from the TSS, whereas the PRC1-binding site closest to the two alternative CG5888 alternative promoters is located, respectively, around 16 and 25 kb away. One possibility is therefore that the relative location of regulatory elements might modulate the effects of PRC1-dependent loops. In addition to the role in 3D looping, recent work has shown that PRC1 might assist transcription by modulating occupancy and phosphorylation of RNA polymerase II, as well as association of the pausing-elongation factor Spt5 to enhancers and promoters. In future studies, it will be important to analyze the relative role and the interplay of these mechanisms in individual cells at the onset of silencing, as well as during transcriptional activation to understand the chain of molecular events that triggers this dual function for Polycomb-bound regions (Loubiere, 2020).
The Polycomb group (PcG) complex PRC1 localizes in the nucleus in condensed structures called Polycomb bodies. The PRC1 subunit Polyhomeotic (Ph) contains an oligomerizing sterile alpha motif (SAM) that is implicated in both PcG body formation and chromatin organization in Drosophila and mammalian cells. A truncated version of Ph containing the SAM (mini-Ph) forms phase-separated condensates with DNA or chromatin in vitro, suggesting that PcG bodies may form through SAM-driven phase separation. In cells, Ph forms multiple small condensates, while mini-Ph typically forms a single large nuclear condensate. It is therefore hypothesized that sequences outside of mini-Ph, which are predicted to be intrinsically disordered, are required for proper condensate formation. In this study identified three distinct low-complexity regions in Ph based on sequence composition. The role of each of these sequences was systematically tested in Ph condensates using live imaging of transfected Drosophila S2 cells. Each sequence uniquely affected Ph SAM-dependent condensate size, number, and morphology, but the most dramatic effects occurred when the central, glutamine-rich intrinsically disordered region (IDR) was removed, which resulted in large Ph condensates. Like mini-Ph condensates, condensates lacking the glutamine-rich IDR excluded chromatin. Chromatin fractionation experiments indicated that the removal of the glutamine-rich IDR reduced chromatin binding and that the removal of either of the other IDRs increased chromatin binding. These data suggest that all three IDRs, and functional interactions among them, regulate Ph condensate size and number. The results can be explained by a model in which tight chromatin binding by Ph IDRs antagonizes Ph SAM-driven phase separation. These observations highlight the complexity of regulation of biological condensates housed in single proteins (Kapur, 2022).
Under stress conditions, the coactivator Multiprotein bridging factor 1 (Mbf1) translocates from the cytoplasm into the nucleus to induce stress-response genes. However, its role in the cytoplasm, where it is mainly located, has remained elusive. This study shows that Drosophila Mbf1 associates with E(z) mRNA and protects it from degradation by the exoribonuclease Pacman (Pcm), thereby ensuring Polycomb silencing. In genetic studies, loss of mbf1 function enhanced a Polycomb phenotype in Polycomb group mutants, and was accompanied by a significant reduction in E(z) mRNA expression. Furthermore, a pcm mutation suppressed the Polycomb phenotype and restored the expression level of E(z) mRNA, while pcm overexpression exhibited the Polycomb phenotype in the mbf1 mutant but not in the wild-type background. In vitro, Mbf1 protected E(z) RNA from Pcm activity. These results suggest that Mbf1 buffers fluctuations in Pcm activity to maintain an E(z) mRNA expression level sufficient for Polycomb silencing (Nishioka, 2018).
Polycomb silencing is essential for the developmental regulation of gene expression. The silencing needs to be robust to tightly repress the expression of developmental genes in undifferentiated cells, such as stem cells, but should also be flexible for rapid release upon differentiation. However, this paradoxical aspect of Polycomb silencing is not well understood (Nishioka, 2018).
Mbf1 was originally identified as an evolutionarily conserved coactivator that connects a transcriptional activator with the TATA element-binding protein (Li, 1994; Takemaru, 1997; Takemaru, 1998). Usually, Mbf1 is present in the cytoplasm; however, under stress conditions, Mbf1 translocates into the nucleus to induce stress-response genes. Previous studies have revealed roles for the coactivator in axon guidance, oxidative stress response, defense against microbial infection, and resistance to drugs such as tamoxifen. However, the cytoplasmic role of Mbf1 has remained elusive, except for mRNA or ribosomal binding (Nishioka, 2018).
Pacman (Pcm/Xrn1) is an evolutionarily conserved 5'-3' exoribonuclease that degrades decapped mRNA. Genetic studies have demonstrated that Drosophila pcm is involved in epithelial closure, male fertility, apoptosis and growth control. Null mutants of pcm are lethal during early pupal stages, suggesting the enzyme plays an essential role in development (Nishioka, 2018 and references therein).
Using a genetic approach in Drosophila, this study shows that cytoplasmic Mbf1 ensures Polycomb silencing by protecting E(z) mRNA from degradation by Pcm. The results thus demonstrate an unexpected component of the regulatory mechanism underlying Polycomb silencing. This mechanism might also allow flexibility in Polycomb silencing, as Mbf1 protein expression declines upon differentiation (Nishioka, 2018).
To address the cytoplasmic role of Mbf1, novel genes were sought that interact with mbf1. Surprisingly, the mbf1 mutation enhanced a classical Polycomb phenotype of Psc and Pc mutants, namely the appearance of an ectopic sex comb tooth or teeth on the male mid-leg. Although mbf12/+ or mbf12/mbf12 flies never exhibited the Polycomb phenotype, penetrance of the phenotype in Psc1/+ increased significantly in Psc1/+; mbf12/+, and further increased in Psc1/+; mbf12/mbf12. The penetrance was restored to the Psc1/+ level by expressing wild-type Mbf1 protein from a transgene. Similar effects of the mbf12 allele were observed with the Pc6 mutation (Nishioka, 2018).
To gain insight into the mechanism underlying the genetic interaction between Psc and mbf1, the expression of the representative Polycomb group genes Pc, E(z) and pho was analyzed. Results of reverse transcription-quantitative PCR (RT-qPCR) analyses demonstrated a prominent reduction in the expression level of E(z) mRNA in Psc1/+; mbf12/+ larvae, whereas Pc and pho mRNA levels remained unchanged. Immunostaining of wing discs demonstrated that E(z) protein expression was severely compromised in Psc1/+; mbf12/+ compared with that in wild type, mbf12/+ or Psc1/+. By contrast, the expression of Pc and Pho proteins was not significantly affected. Western blot analyses confirmed the marked decrease in the E(z) protein level in both wing and leg discs from Psc1/+; mbf12/+. Consistently, Psc1/+; E(z)731/+ exhibited the extra sex comb phenotype, which was comparable to Psc1/+; mbf12/+ (Nishioka, 2018).
It is unlikely that Mbf1 affects E(z) transcription because no significant difference was detected in the E(z) mRNA level between wild-type and mbf12/mbf12 larvae. Consistently, it was not possible to detect any significant difference in the expression of E(z) in the wing disc upon knockdown or overexpression of Mbf1 using a posterior compartment-specific Gal4 driver. When cytoplasmic and nuclear RNA fractions from wing discs were analyzed by RT-qPCR, the nuclear E(z) mRNA level was similar between wild type and Psc1/+; mbf12/+. However, the cytoplasmic E(z) mRNA level in Psc1/+; mbf12/+ decreased to ~20% of the wild-type level. Collectively, these results suggest that mbf1 regulates the E(z) mRNA level post-transcriptionally in the cytoplasm (Nishioka, 2018).
Considering that Mbf1 binds to mRNA, it was hypothesized that cytoplasmic Mbf1 might bind to E(z) mRNA to protect it from degradation, and thereby regulates the E(z) mRNA level. Results of RNA-immunoprecipitation (RIP) experiments revealed a preferential binding of Mbf1 to E(z) mRNA. A ~10-fold enrichment of E(z) mRNA was found in the anti-Mbf1 antibody pull-down fraction from cytoplasmic extracts of embryos. The pull-down was clearly selective, as enrichment of abundant mRNAs, such as RpL32 and RpL30, was not observed. By contrast, E(z) mRNA was barely detectable in the anti-Mbf1 antibody pull-down fraction from embryonic extracts of the mbf1 mutant, used as a negative control. This is not due to absence of E(z) mRNA in the mbf1 mutant (Nishioka, 2018).
Following the observed preferential binding of Mbf1 to E(z) mRNA, this study focused on the Polycomb phenotype and reduced E(z) mRNA expression level, which were not caused by the mbf1 mutation alone. Enhancement of the Polycomb phenotype and the reduction of E(z) mRNA were only detected in the double mbf1 and Polycomb group gene mutant. To explain the synergistic effect of mbf1 and Polycomb group mutations, it was posited that a component of the mRNA degradation pathway was only activated in the Polycomb group mutant background. Therefore, attempts were made to identify the component of the pathway that was activated in the Psc or Pc mutants. Among the mRNAs tested, only pcm mRNA, which encodes the 5'-exoribonuclease, was upregulated in Psc1/+ and Pc6/+ larvae. Neither the decapping enzyme (Dcp2), components of the exosome [Dis3, Prp6 (CG6841) and Prp40 (CG3542)], nor components in the 3'-deadenylation-mediated pathway (twin and Nab2) appeared to be activated. Western blot analyses revealed a 2-fold increase in the Pcm protein level in wing discs from Psc1/+ or Pc6/+ larvae compared with that from wild type. These results led to an investigation of the effects of the pcm mutation on Polycomb silencing and E(z) mRNA expression (Nishioka, 2018).
Strikingly, the pcmΔ1 mutation resulted in significant suppression of the Polycomb phenotype in Psc1/+ and Psc1/+; mbf12/+. This suppression was rescued by expressing the wild-type Pcm protein from a transgene. Similar results were obtained using the Pc6 mutant. Consistent with this result, the pcmΔ1 mutation restored the E(z) mRNA levels in Psc1/+ and Psc1/+; mbf12/+ to near wild-type levels (Nishioka, 2018).
In addition to the extra sex comb phenotype, Psc1/+; mbf12/+ exhibited misexpression of Ubx in wing discs. The signals appeared as spots consisting of clusters of Ubx-positive cells. The pcmΔ1 mutation decreased the number of spots per wing disc. The misexpression occurred predominantly around the dorsoventral border in the posterior compartment. Consistently, adult wing defects were observed along the posterior wing margin, which was also suppressed by pcmΔ1 (Nishioka, 2018).
Importantly, the extra sex comb phenotype was detected under mild overexpression of pcm in mbf12/hs-pcm double heterozygotes at 25°C, even in the wild-type Polycomb group background. hs-pcm/+ exhibited an ~2.5-fold overexpression of Pcm at 25°C. Nevertheless, hs-pcm heterozygotes in the wild-type mbf1 background did not show any Polycomb phenotype. These results suggest that Mbf1 stabilizes Polycomb silencing against fluctuations in the Pcm protein level in vivo. Enhancement of the Polycomb phenotype was also observed in Psc1/+; hs-pcm/+ compared with that in Psc1/+ (Nishioka, 2018).
Biochemical analyses using purified recombinant Mbf1 and Pcm proteins revealed that Mbf1 protects E(z) RNA from degradation by Pcm. RNA protection assays were performed in which in vitro-synthesized E(z) RNA was treated with the RNA pyrophosphatase RppH to convert the 5'-triphosphoryl end into the 5'-monophosphoryl form, which is a Pcm substrate. The RNA was digested with Pcm in the presence or absence of Mbf1. Mbf1 inhibited the digestion of E(z) RNA. In the absence of RppH, RNA degradation was barely detectable, suggesting that the digestion was due to 5'-exoribonuclease activity. Gel filtration of a mixture of Pcm and Mbf1 resulted in the elution of each protein in a clearly separated peak. Furthermore, Mbf1 did not co-immunoprecipitate with Pcm and vice versa. These results suggest that Mbf1 does not inhibit Pcm activity through protein-protein interactions. Collectively, it is concluded that Mbf1 protects E(z) mRNA from degradation by Pcm both in vivo and in vitro (Nishioka, 2018).
It is proposed that cytoplasmic Mbf1 ensures Polycomb silencing by protecting E(z) mRNA from the activity of Pcm. In the mbf1 mutant, E(z) mRNA is free from Mbf1 protein, but pcm expression is downregulated by Polycomb group genes. In the Polycomb group mutant, Pcm expression is upregulated, but E(z) mRNA is partly protected by Mbf1. In the mbf1 Polycomb group double mutant, E(z) mRNA is free from Mbf1 protein and is subject to Pcm attack. Whereas Mbf1 is highly expressed in undifferentiated cells, such as those of embryos, larval testis, ovary, imaginal discs and neuroblasts, its expression is reduced in differentiated tissues, similar to the situation in the mbf1 mutant. This would facilitate the rapid release of developmental genes from Polycomb silencing upon differentiation. Interestingly, expression of mammalian Mbf1 [also termed endothelial differentiation-related factor 1 (Edf1)] and Ezh2 declines immediately after the onset of differentiation (Nishioka, 2018).
A recent study demonstrated that Pcm prevents apoptosis in imaginal discs and downregulates specific transcripts such as hid and reaper. However, suppression of apoptosis did not rescue the lethality of a pcm null mutation at the early pupal stage. Therefore, there might be other targets of Pcm that are essential for early pupal development. The present study indicates that E(z) mRNA could be one such target (Nishioka, 2018).
The mRNA-binding activity of Mbf1 was selective, but might not be strictly specific to E(z) mRNA. Although Polycomb silencing is central to the developmental regulation of gene expression, there could be other mRNAs that bind to Mbf1 in a similar manner, thereby modulating another biological function. Therefore, RIP-seq analysis was conducted to identify Mbf1-bound mRNAs. To ensure robustness of the RIP-seq data, the results were compared independently with two publicly available datasets and identified 804 commonly enriched mRNAs. Among these, the enrichment of four representative mRNAs (GstD5, Ide, Tep2 and Pebp1) was confirmed by RIP RT-qPCR analyses. Interestingly, the expression levels of these four mRNAs decreased in Psc1/+; mbf12/+ and increased in pcmΔ1/Y compared with those in wild type, suggesting that the model can be applied to a wider range of mRNAs than just E(z). However, dependency on the Mbf1/Pcm antagonism appears to differ among the mRNAs (Nishioka, 2018).
Gene ontology and pathway analyses of the 804 genes revealed some interesting properties of the Mbf1-associated mRNAs. The gene ontology terms 'glutathione metabolic process', 'oxidation-reduction process' and 'neurogenesis' which includes E(z), are consistent with the fact that previous studies found defects in oxidative stress defense and axon guidance in the mbf1 mutant. Also of interest are the groups 'positive regulation of innate immune response' and 'defense response to Gram-negative bacterium', as Arabidopsis MBF1 is involved in host defense against microbial infection. Moreover, pathway analysis of the enriched genes implicated Mbf1 in 'drug metabolism', as previously suggested for tamoxifen resistance. This raises an intriguing possibility that Mbf1 contributes to various types of stress defense, metabolic processes and neurogenesis as both a nuclear coactivator and as a cytoplasmic mRNA-stabilizing protein. Although mbf1 null mutants are viable under laboratory conditions, evolutionary conservation of mbf1 suggests that it has essential role(s) under real-world stress conditions (Nishioka, 2018).
Polycomb silencing represses gene expression and provides a molecular memory of chromatin state that is essential for animal development. This study shows that Drosophila female germline stem cells (GSCs) provide a powerful system for studying Polycomb silencing. GSCs have a non-canonical distribution of PRC2 activity and lack silenced chromatin like embryonic progenitors. As GSC daughters differentiate into nurse cells and oocytes, nurse cells, like embryonic somatic cells, silence genes in traditional Polycomb domains and in generally inactive chromatin. Developmentally controlled expression of two Polycomb repressive complex 2 (PRC2)-interacting proteins, Pcl and Scm, initiate silencing during differentiation. In GSCs, abundant Pcl inhibits PRC2-dependent silencing globally, while in nurse cells Pcl declines and newly induced Scm concentrates PRC2 activity on traditional Polycomb domains. These results suggest that PRC2-dependent silencing is developmentally regulated by accessory proteins that either increase the concentration of PRC2 at target sites or inhibit the rate that PRC2 samples chromatin (DeLuca, 2020).
The work described here shows that the Drosophila female germline has multiple advantages for studying the developmental regulation of chromatin silencing both before and during differentiation. Female GSCs continuously divide to produce new undifferentiated progenitors, which expand and differentiate into nurse cells or oocytes, generating large amounts of a much simpler tissue than a developing embryo. Additionally, an inducible reporter assay compatible with the female germline was developed that sensitively responds to developmental changes in local chromatin repression in individual cells. In contrast to RNAseq, which measures steady state RNA levels, or ChIPseq, which correlates chromatin epitopes with their perceived function on gene expression, the reporters directly test how local chromatin influences the inducibility of surrounding genes, and are easily combined with tissue specific knockdowns to identify trans-acting factors contributing to reporter inducibility. Finally, a genetic engineering approach allows any construct (not just hsGFP) to be efficiently integrated into many pre-existing 'donor' sites, including those used previously with other reporters, or sites heavily silenced by repressive chromatin in differentiated cells. Although the number of different donor sites in certain types of chromatin is currently limited, new sites continue to be generated using CRISPR/Cas9 targeting and the method can ultimately be applied virtually anywhere in the genome (DeLuca, 2020).
Analysis of Polycomb repression with reporters, ChIP, and PcG-gene knockdowns provided numerous insights into how chromatin affects gene expression and female germline development in Drosophila. GSCs, the precursors of oocytes and nurse cells, contain a non-canonical, binary distribution of moderate H3K27me3 enrichment on all transcriptionally inactive loci and very low enrichment on active chromatin. A similar non-canonical H3K27me3 distribution was observed in early fly embryos, suggesting that noncanonical chromatin represents a 'ground state' for progenitors that will propagate future generations of undifferentiated germ cells or somatic cells that differentiate into specialized tissues. Such non-canonical chromatin was first identified in mouse oocytes and preimplantation embryos. If non-canonical H3K27me3 chromatin is a characteristic of undifferentiated, totipotent cells, what function might it confer to account for its conservation (DeLuca, 2020)?
Experiments confirmed previous workshowing that chromatin modified by PRC2 is essential for the Drosophila female germ cell cycle. Germline cysts lacking PRC2 are unable to stably generate oocytes, and E(z) GermLine-specific RNAi Knock Down (GLKD) nurse cells mis-express multiple genes and degenerate at about stage 5. In contrast, GSCs lacking PRC2 properly populate their niche, divide, and produce daughters that interact with female follicle cells and begin nurse cell differentiation. Removing PRC2 activity from GSCs did not generally increase the steady state abundance of genes or the inducibility of reporters in H3K27me3-enriched inactive or PcG domains. These results suggest that PRC2 and non-canonical chromatin lack vital functions in undifferentiated germline progenitors but are critical for repressing genes upon differentiation. However, a requirement for PRC2 or non-canonical chromatin under stress conditions or prolonged aging cannot be dismissed. For example, PRC2 could promote the long-term maintenance of female GSCs, similarly to how it maintains male germline progenitors in flies and mice (DeLuca, 2020).
Germline cysts and nurse cells are found in diverse animal species across the entire phylogenetic spectrum, but their function has been well studied mostly in insects such as Drosophila where they persist throughout most of oogenesis. While nurse cells have traditionally been considered germ cells rather than late-differentiating somatic cells, this study shows that that Drosophila nurse cells initiate Polycomb silencing and enrich PRC2 activity on a nearly identical collection of PcG domains as somatic cells. In more distant species, such as mice, nurse cells initially develop in a similar manner within germline cysts and contribute their cytoplasm to oocytes, but undergo programmed cell death before the vast majority of oocyte growth. Consequently, it remains an open question whether somatic differentiation plays a role in nurse cell function in mammals and many other groups (DeLuca, 2020).
The size and composition of oocyte cytoplasm are uniquely tailored to promote optimal fecundity and meet the demands of early development. In some species, including flies, nurse cells synthesize large amounts of specialized ooplasm to rapidly produce multitudes of large, pre-patterned embryos. In others, including mammals, oocytes more slowly synthesize the majority of ooplasm. Interestingly, both ooplasm synthesis strategies apparently require Polycomb silencing. However, the nurse cell-based strategy in flies primarily requires PRC2 but not PRC1 to silence hundreds of somatic genes, while the oocyte-based strategy in mice requires PRC1 but not PRC2 (DeLuca, 2020).
Different strategies of ooplasm synthesis may have evolved to be compatible with noncanonical germ cell chromatin. Staining experiments show that Drosophila oocytes maintain a widely distributed, non-canonical H3K27me3 distribution similar to pre-meiotic precursors or mouse oocytes, suggesting that non-canonical chromatin is conserved and maintained throughout the germ cell cycle. Similar to mouse oocytes, Drosophila spermatocytes also contain non-canonical chromatin and autonomously synthesize large amounts of cytoplasm by deploying PRC1 but not PRC2. Thus, three different types of germ cells are filled with large amounts of differentiated cytoplasm that requires Polycomb silencing for its synthesis, but nevertheless maintain a non-canonical, silencing-deficient PRC2 activity (DeLuca, 2020).
The conservation of undifferentiated, non-canonical chromatin despite a strong selection for Polycomb silencing during ooplasm synthesis argues that non-canonical chromatin must have a presently unappreciated fundamental purpose in germ cells. Noncanonical chromatin could regulate multigenerational processes like mutation, recombination, or transposition, that are not easily assayed in sterile individuals. Tests of these ideas will require a better understanding of how non-canonical chromatin is regulated and methods to disrupt non-canonical chromatin without disrupting other functions required for germline viability. Additionally, non-canonical chromatin could simply result from the silencing- incompetent PRC2 that was observed in progenitors (DeLuca, 2020).
Pcl was uncovered as both an inhibitor of PRC2 silencing and promoter of non-canonical chromatin in GSCs. PclGLKD dramatically altered the footprint of PRC2 activity in GSCs. PclGLKD favored H3K27me3 enrichment on PREs versus inactive domains, and increased the total amount of H3K27me1 by 13-fold and H3K27me2 by 1.4-fold and decreased the total amount of H3K27me3 by 1.8 fold. By binding DNA through its winged-helix domain, Pcl triples PRC2's residence time on chromatin and promotes higher states of H3K27 methylation in vitro. In GSCs, Pcl could simply change the result of each PRC2-chromatin binding event from H3K27me1 to me3. However, it is hard to imagine how an equivalent number of nucleosomes bearing a higher H3K27 methylation state could explain how Pcl inhibits silencing. Instead, it is proposed that Pcl inhibits silencing by reducing the number of PRC2-chomatin binding events per unit time by increasing the residence time of PRC2 on chromatin with each binding event. In this model, PclGLKD would not only convert many H3K27me3 nucleosomes into H3K27me1 nucleosomes, it would also convert many unmethylated nucleosomes into H3K27me1 nucleosomes. PclGLKD would more subtly affect H3K27me2 abundance because it simultaneously increases the number of PRC2-chromatin binding events while reducing the probability of each binding event leading to H3K27me2 versus me1 (DeLuca, 2020).
By reducing the number of PRC2 binding events, Pcl could increase the abundance of unmethylated H3K27 residues available for acetylation - a transcription promoting modification. In both flies and mammals, PRC2 transiently associates with chromatin to mono- and dimethylate H3K27 outside of traditional PcG domains, blocking H3K27 acetylation and antagonizing transcription. This study similarly found strong and widespread PRC2-dependent silencing in H3K27me1/2 enriched chromatin in nurse cells. Because inactive domain silencing was not affected by depletion of Pcl, Jarid2, or H3K27me3, it is proposed that core-PRC2, but not Pcl-PRC2 or H3K27me3, primarily silences inactive chromatin (DeLuca, 2020).
In GSCs, abundant Pcl could saturate PRC2, effectively depleting faster-sampling core-PRC2 complexes in favor of slower sampling Pcl-PRC2. In somatic embryonic cells, Pcl is present in a small fraction of PRC2 complexes. Compared to other fly tissues, Pcl mRNA is most abundant in the ovary, and within the ovary, Pcl protein is much more abundant in GSCs and nurse cell precursors than differentiated nurse cells and somatic cells. Within each differentiated germline cyst, Pcl mRNA is depleted from nurse cells and enriched in oocytes, suggesting that Pcl protein levels may be regulated by an mRNA transport mechanism induced in region 2 that also triggers the differentiation of oocytes from nurse cells (DeLuca, 2020).
Pcl, and a second PRC2-interacting protein, Scm, regulate the transition from noncanonical to canonical chromatin and initiate Polycomb repression. During nurse cell differentiation, it is proposed that Pcl depletion frees core-PRC2 to rapidly sample and silence inactive domains, while Scm (which is absent from the GSC) induction recruits high levels of PRC1 and PRC2 activity around PREs. ScmGLKD nurse cell chromatin retained a noncanonical H3K27me3 pattern characteristic of GSCs, as if differentiation at PcG domains had not occurred. In mice, Scm homologue, Scml2, similarly associates with PcG domains to recruit PRC1/2 and silence PcG targets during male germline development. However, unlike its fly orthologue in female GSCs, Scml2 is expressed in male germline precursors. This difference could explain why mammalian PGCs partially enrich PRC2 activity on CGIs while fly female GSCs do not enrich PRC2 on specific sites. While PcG domain-associated Scm is sufficient to enrich PRC2 activity above background levels found throughout inactive chromatin, a second PRC2 interacting protein, Pcl, is additionally required to promote full PRC2 and H3K27me3 enrichment on PcG domains. Because Scm oligomerizes and interacts with PRC2 in vitro, it could form an array of PRC2 binding sites anchored to PREs through Sfmbt. It is proposed that two cooperative interactions, PRC2 with PRE-tethered Scm, and Pcl with DNA, preferentially concentrate H3K27me3-generating Pcl-PRC2 versus H3K27me1/2-generating core-PRC2 on PcG domains (Figure 6D). H3K27me3 could then be further enriched by H3K27me3-induced allosteric PRC2 activation through the Esc subunit (DeLuca, 2020).
By promoting PRC1 and PRC2 concentration on PcG domains, Scm enhances silencing on PcG-localized reporters. While Scm-depleted nurse cells completed oogenesis, a subset of PcG domain-localized PcG including chinmo and the posterior Hox gene, Abd-b, escaped repression and were potentially loaded into embryos. Eggs derived from ScmGLKD nurse cells failed to hatch, and mis-expressed Abd-b in anterior segments following germ band elongation. This defect more closely resembled maternal plus zygotic than maternal-only Scm mutants, suggesting that ScmGLKD may deplete both maternal and zygotic Scm (DeLuca, 2020).
However, the additional possibility that mis-regulation of maternal Polycomb targets like chinmo contribute to the subtle embryonic defects observed in maternal-only Scm mutant clones cannot be excluded (DeLuca, 2020).
Further study of the Polycomb-mediated repression described in this study will help define the gene regulation program of Drosophila nurse cells and its contribution to oocyte growth. Additional characterization and perturbation of non-canonical chromatin throughout the germ cell cycle will yield further insights into its function in development. Finally, incorporating studies of other chromatin modifications, including H3K9me3-based repression, during germ cell development will contribute to a fuller understanding of how chromatin contributes to an immortal cell lineage (DeLuca, 2020).
Epigenetic silencing by Polycomb group (PcG) complexes can promote epithelial-mesenchymal transition (EMT) and stemness and is associated with malignancy of solid cancers. This study reports a role for Drosophila PcG repression in a partial EMT event that occurs during wing disc eversion, an early event during metamorphosis. A screen for genes required for eversion uncovered the PcG genes Sex combs extra (Sce) and Sex combs midleg (Scm). Depletion of Sce or Scm resulted in internalised wings and thoracic clefts, and loss of Sce inhibited the EMT of the peripodial epithelium and basement membrane breakdown, ex vivo. Targeted DamID (TaDa) using Dam-Pol II showed that Sce knockdown caused a genomic transcriptional response consistent with a shift towards a more stable epithelial fate. Surprisingly only 17 genes were significantly upregulated in Sce-depleted cells, including Abd-B, abd-A, caudal, and nubbin. Each of these loci were enriched for Dam-Pc binding. Of the four genes, only Abd-B was robustly upregulated in cells lacking Sce expression. RNAi knockdown of all four genes could partly suppress the Sce RNAi eversion phenotype, though Abd-B had the strongest effect. The results suggest that in the absence of continued PcG repression peripodial cells express genes such as Abd-B, which promote epithelial state and thereby disrupt eversion. These results emphasise the important role that PcG suppression can play in maintaining cell states required for morphogenetic events throughout development and suggest that PcG repression of Hox genes may affect epithelial traits that could contribute to metastasis (Jefferies, 2020).
Dynamic changes in the three-dimensional (3D) organization of chromatin are associated with central biological processes, such as transcription, replication and development. Therefore, the comprehensive identification and quantification of these changes is fundamental to understanding of evolutionary and regulatory mechanisms. This study presents Comparison of Hi-C Experiments using Structural Similarity (CHESS), an algorithm for the comparison of chromatin contact maps and automatic differential feature extraction. The robustness of CHESS to experimental variability is presented, and its biological applications were showcased on (1) interspecies comparisons of syntenic regions in human and mouse models; (2) intraspecies identification of conformational changes in Zelda-depleted Drosophila embryos; (3) patient-specific aberrant chromatin conformation in a diffuse large B-cell lymphoma sample; and (4) the systematic identification of chromatin contact differences in high-resolution Capture-C data. In summary, CHESS is a computationally efficient method for the comparison and classification of changes in chromatin contact data (Galan, 2020).
Polycomb group proteins (PcGs) drive target gene repression and form large chromatin domains. In Drosophila, DNA elements known as Polycomb group response elements (PREs) recruit PcGs to the DNA. This study shows that, within the invected-engrailed (inv-en) Polycomb domain, strong, constitutive PREs are dispensable for Polycomb domain structure and function. It is suggested that the endogenous chromosomal location imparts stability to this Polycomb domain. To test this possibility, a 79-kb en transgene was inserted into other chromosomal locations. This transgene is functional and forms a Polycomb domain. The spreading of the H3K27me3 repressive mark, characteristic of PcG domains, varies depending on the chromatin context of the transgene. Unlike at the endogenous locus, deletion of the strong, constitutive PREs from the transgene leads to both loss- and gain-of function phenotypes, demonstrating the important role of these regulatory elements. These data show that chromatin context plays an important role in Polycomb domain structure and function (De, 2019).
Polycomb group proteins (PcGs) are critical for organismal development and stem cell maintenance. PcGs were first found in Drosophila as repressors of homeotic genes, and PcG repression is one of the earliest epigenetic regulatory mechanisms to be identified. In Drosophila, nearly all PcG proteins are subunits of one of four principal protein complexes: Polycomb repressive complexes 1 and 2 (PRC1 and PRC2), Pho repressive complex (PhoRC), and Polycomb repressive deubiquitinase (PR-DUB). PcG protein complexes bind to DNA elements known as Polycomb group response elements (PREs), deposit the repressive chromatin modification mark H3K27me3, and drive chromatin compaction leading to gene repression. Genes repressed by PcG are covered with H3K27me3 and are thought to form their own topologically associating domains (TADs). In Drosophila, TADs vary from a few kilobases to several hundred kilobases in size. Regulatory DNAs present within TADs preferentially interact with genes located within the same domain, with limited contacts outside of the TAD boundaries. In Drosophila, genome-wide data suggest that mini-domains formed by actively transcribed regions form the boundaries of some TADs, while other TAD boundaries are demarcated by insulator elements. In mammals, the insulator protein CTCF colocalizes with a subset of TAD boundaries (De, 2019).
The current understanding in chromatin biology hypothesizes that the folding of chromatin into domains assists in the packaging of long stretches of DNA inside the eukaryotic nucleus. Further, it is the organization of these domains that facilitates spatial and temporal regulation of genes within them. Thus, understanding how the domains are formed is of prime importance. How large PcG domains/TADs are formed is a central question. To date, researchers have extensively studied how PcG proteins are recruited to specific DNA sequences and which proteins are present in PcG complexes. PREs are required for the recruitment of PcG proteins in Drosophila and are thought to initiate the formation of a Polycomb domain. The 113-kb PcG domain that encompasses the invected (inv) and engrailed (en) genes of Drosophila. Unexpectedly, deletion of strong, constitutive PREs from the endogenous inv-en domain had little effect on inv-en PcG domain organization. Weak PREs present in the inv-en domain were sufficient to establish and maintain the overall domain organization, and some weak PREs overlap with enhancers present within the inv-en domain. Similarly, deletion of the bxd PRE had a mild effect on Ubx expression, whereas deletion of the iab7 PRE resulted in misexpression of Abd-B in very specific parasegments. A recent report showed that deletion of two PREs from the dac locus causes prominent structural and functional changes in the locus (De, 2019).
The primary aim in this work was to study the effect of "chromatin context" on a PcG domain; the following main conclusions can be drawn from this study. First, the en PcG domain at ectopic genomic sites rescues the null mutants of inv-en. This shows that the information to form the chromatin domain is primarily present within the domain itself. Second, spreading of the chromatin state at different sites of the genome is very much dependent on the local "context" itself. Third, a repressive chromatin domain can interact with null chromatin (has no chromatin mark) but stays segregated from the active chromatin. Fourth, a fragment of DNA containing the strong, constitutive PREs is required at an ectopic location to ensure PcG silencing in all tissues. Strikingly, the endogenous locus is resilient to loss of the same DNA fragment, indicating the importance of context on PcG chromatin domain formation and function. Fifth, ubiquitously expressed flanking genes may act as boundaries for enhancers within PcG domains. These data provide experimental evidence for the hypothesis that chromosomal neighborhood plays an important role in regulating gene expression (De, 2019).
Transgenic assays to test 'functional PREs' have highlighted the effect of chromatin context on reporter gene expression. For example, the strong PREs upstream of en, present in the 1.5-kb fragment examined in this study, only act to repress reporter transgene expression in about 50% of chromosomal insertion sites. Previous work has shown that these PREs are dispensable from the endogenous inv-en locus. The normal phenotype of en80Δ1.5 flies also supports these data. However, this study shows that the [email protected] transgene has a higher level of H3K27me3 accumulation than HAen79Δ[email protected]. This reduced level of H3K27me3 is sufficient to maintain correct en expression in embryos and imaginal discs. Notably, the absence of strong PREs in HAen79Δ[email protected] caused misexpression of en during adult abdomen development. These data emphasize the importance of context itself on the stability and resiliency of the PcG domain toward modification (mutation or deletion) of regulatory sequences present within the domain. The data also show that deletion of the 1.5-kb DNA fragment that includes the strong en PREs rendered the HAen79 transgene unable to rescue inv en mutants. Strikingly, HA-en expression from HAen79Δ1.5 was very low in the embryonic nervous system, a result not seen with HAen79 or in en80Δ1.5 flies. These data suggest that the embryonic nervous system enhancers are not able to interact well with the HA-en promoter in the absence of the 1.5-kb DNA fragment. This same deletion, when present at the endogenous locus, does not cause a loss of en expression in the nervous system. This suggests that the structure of the endogenous locus is resilient to the loss of this DNA. This study suggests that the chromosomal locationof the endogenous locus aids in the proper folding of en to facilitate enhancer-promoter communication in the embryonic nervous system. Another explanation for this observation is that the 1.5-kb PRE could be acting as a Trithorax response element (TRE) and bind to Trithorax group proteins for proper en expression. In any case, the essential role of 1.5-kb PRE/TRE at the ectopic locus is alleviated by the 'local context' effect at the endogenous locus (De, 2019).
It has been proposed that establishment and inheritance of H3K27me3 are dependent on two factors: (i) sequence-specific recruitment of the chromatin modifiers and (ii) the ability of H3K27me3 to act as a template for PRC2 to bind and modify other nucleosomes present in the vicinity. The current experiments show that the spreading of H3K27me3 to the flanking chromatin surrounding the insertion site is not very efficient, although no active chromatin mark was present in the vicinity (particularly at attP3). However, 4C-seq analysis showed that the ectopic en PcG domain interacted with flanking chromatin significantly until it encountered the active domain. These observations indicate that mere interactions between the PcG domain and flanking chromatin were not able to spread the repressive mark efficiently. This observation highlights the importance of PcG recruitment via PREs for PcG domain formation (De, 2019).
One of the surprising observations in this study was the deposition of the repressive mark over the exons of Msp300 in larvae that contain HAen79@attP40. This accumulation is also observed in Kc167 cells but not in larval samples that did not have HAen79 inserted at attP40. It is assumed that some regions in the genome are more susceptible to the PcG regulation and that the weak binding peaks of PcGs near the attP40 site might be acting as PREs to facilitate spreading of the mark. It is posited that, when a PcG domain is inserted into attP40, it renders the adjacent chromatin more likely to form an H3K27me3 domain. It is noted that the Msp300 gene is covered by both H3K27me3 and H3K36me3. This is likely due to the mixed cell population in larval brains and discs. What about the spreading of the H3K27me3 mark over the exons? This is unusual; however, pre-mRNA and cotranscriptional activity have been linked to the local chromatin structure. In a genomic study of many histone modifications in human and Caenorhabditis elegans DNA, the H3K27me3 mark was enriched over exons. In addition, physical interaction between mammalian splicing factors (U2snRNP and Sf3b1) and PcG proteins (Zfp144 and Rnf2) was reported to be required for proper repression of Hox genes. It is speculated that cotranscriptional recruitment
of PcG proteins over the Msp300 gene in the transgenic line might have established the exon-specific deposition of the H3K27me3 mark. It is proposed that the exon-specific H3K27me3 accumulation can act as an intermediate step to repress transcriptionally active target genes and establish PcG domains during development (De, 2019).
Two interesting questions for chromatin biologists are the following: How does a meter-long genome fit into a nucleus and how does this folding influence genome function? High-resolution Hi-C experiments have given structural insights into interphase chromosomes in eukaryotic nuclei. According to the Hi-C data, chromatin is organized into TADs or 'contact domains', and the TADs form compartments A (enriched with active domains) and B (enriched with inactive domains). While the Drosophila genome has 'compartments,' the existence of TADs in Drosophila is disputed, and despite the evidence that TADs and compartments are important for chromatin organization and function, basic information about how these structures are formed and maintained has been
incomplete. The current data shows that it is the intrinsic property of chromatin to segregate based on histone modifications and gene activity. This property of chromatin is not locus specific; in different chromatin contexts, repressed chromatin tends to segregate from active domains. Biochemical and molecular evidence on the antagonistic behavior between the H3K27me3 and H3K36me3 modifications also support this claim and provide evidence toward H3K36me3 as a chromatin component that restricts the PcG-mediated spread of H3K27me3. How are large PcG domains formed and maintained? The establishment of the PcG domain and the spreading of H3K27me3 start from the strong PREs present in the PcG domain during cell cycle 14. Repressive loops between PREs within PcG domains are also formed during cell cycle 14. These loops are proposed to play a synergistic role in establishing PcG domains. Surprisingly, deletion of some strong PREs in situ resulted in weak phenotypes, suggesting redundancy of PcG recruitment. This study provides evidence that apart from the minor PREs present in the large PcG domains, chromatin context itself is a critical factor that determines robustness and function of PcG domains (De, 2019).
Polycomb-group (PcG) proteins are epigenetic regulators that maintain the transcriptional repression of target genes following their initial repression by transcription factors. PcG target genes are repressed in some cells, but active in others. Therefore, a mechanism must exist by which PcG proteins distinguish between the repressed and active states and only assemble repressive chromatin environments at target genes that are repressed. This study presents experimental evidence that the repressed state of a Drosophila PcG target gene, giant (gt), is not identified by the presence of a repressor. Rather, de novo establishment of PcG-mediated silencing at gt is the default state that is prevented by the presence of an activator or coactivator, which may inhibit the catalytic activity of Polycomb-repressive complex 2 (PRC2) (Ghotbi, 2021).
Tissue homeostasis requires long-term lineage fidelity of somatic stem cells. Whether and how age-related changes in somatic stem cells impact the faithful execution of lineage decisions remains largely unknown. This study addressed this question using genome-wide chromatin accessibility and transcriptome analysis as well as single-cell RNA-seq to explore stem-cell-intrinsic changes in the aging Drosophila intestine. These studies indicate that in stem cells of old flies, promoters of Polycomb (Pc) target genes become differentially accessible, resulting in the increased expression of enteroendocrine (EE) cell specification genes. Consistently, age-related changes were found in the composition of the EE progenitor cell population in aging intestines, as well as a significant increase in the proportion of EE-specified intestinal stem cells (ISCs) and progenitors in aging flies. This study further confirmed that Pc-mediated chromatin regulation is a critical determinant of EE cell specification in the Drosophila intestine. Pc is required to maintain expression of stem cell genes while ensuring repression of differentiation and specification genes. These results identify Pc group proteins as central regulators of lineage identity in the intestinal epithelium and highlight the impact of age-related decline in chromatin regulation on tissue homeostasis (Tauc, 2021).
This study identifies a loss of lineage fidelity in ISCs of aging flies that results in an imbalance of EE vs EC differentiation and contributes to epithelial dysplasia. This loss of lineage fidelity is caused by age-related deregulation of Pc function in ISCs, resulting in de-repression and preferential expression of EE genes. It is proposed that the chronic activation of stress signaling in ISCs, triggered by local and systemic inflammatory stimuli in the aging intestine, promotes the deregulation of Pc-controlled gene activity. This is supported by the fact that genetically elevating JNK activity in ISCs disrupts lineage fidelity and causes an increase in the proportion of EEs in the gut epithelium. The increase in EE numbers contributes to epithelial dysplasia in the aging gut, as EEs can promote ISC proliferation. The stress-induced changes in lineage fidelity in ISCs thus likely set up a vicious cycle that causes progressive dysplasia and results in disruption of epithelial structure and function in the aging intestine (Tauc, 2021).
The scRNA-seq analysis of the Drosophila gut is consistent with a recent scRNA-seq study from young flies (Hung, 2020), but also captures the changes in intestinal cell states across aging. Notably, the well-characterized age-associated increase in mitotically active ISCs was observed, but a unique 'stressed ISC' cell population was identified that increases with age. The transcriptional signature distinguishing this cell population encompasses over 25% of significantly upregulated genes observed in a bulk RNA-seq study from purified old ISCs, supporting the robustness and complementarity of the two methods. This transcriptional signature was enriched in genes involved in glutathione metabolic processes, chaperone-mediated protein folding, response to heat, and regulation of cytoskeleton organization, consistent with a stressed or damaged cell state. The appearance of this 'stressed' stem cell population is further consistent with the previously described increase in inflammatory and oxidative stress in the aging intestinal epithelium, and may reflect the age-associated accumulation of oxidative, proteostatic and genomic damage in these cells. Overall, the majority of old ISCs reside in an activated cell state (~50%), whereas the 'stressed ISC' population makes up only a small percentage (<10%). How this 'stressed' population affects tissue homeostasis requires further studies. Intriguingly, the p53 and DNA repair pathways are upregulated in this cell population, while cell cycle genes are repressed, indicating that these cells may represent a correlate to mammalian senescent cells (Tauc, 2021).
The EE progenitor population within one cluster not only increases in size with age, but also upregulates pro-neural genes that are markers for neural stem cells (NSCs). It was recently shown that in a neuroendocrine tumor model, ISCs undergo an identity switch that results in the acquisition of NSC-like features (Li, 2020). This NSC gene signature was also upregulated in old ISCs analyzed by bulk RNA-seq, suggesting that a general upregulation of these genes may contribute to age-related ISC phenotypes (Tauc, 2021).
ATAC-seq data from purified ISC populations revealed only moderate changes in chromatin organization in ISCs of increasingly older animals, suggesting that ISC gene regulation is tightly controlled throughout life. At the same time, the significant increase in H3K27 dimethylation levels in aging ISCs, the fact that H3K27me2 levels are higher in EEs than in ISCs in young guts, and the observation that E(z)-mediated methylation of H3K27 is required for EE specification, all support a role for increases in H3K27me2 in skewing ISC identity towards the EE fate. Despite the high genomic abundance of H3K27me2, which accounts for up to 70% of total histone H3, the functional role of H3K27me2 remains largely uncharacterized. The broad genomic distribution of H3K27me2 was shown to suppress aberrant gene activation by controlling enhancer fidelity in mammals, and access to transcription factors and RNA Pol II to DNA in flies. If and how genomic abundance and/or distribution of H3K27me2 affects cell identity or other cellular functions has not been well explored. One study found that perturbing the ratio of H3K27me2/H3K27me3 in mouse embryonic stem cells (ESCs) affected the acquisition and repression of specific fates of these cells, indicating the importance of appropriate regulation of these marks in different cell types. The differential abundance of H3K27me2 that was observed in young ISCs and EEs further supports the importance of dynamic H3K27me2 regulation in the ISC lineage and of appropriate control of this mark to maintain lineage commitment. The loss of EE cell differentiation upon Pc or E(z) depletion in ISCs further supports a critical role for PRC in regulating H3K27 methylation status and thereby lineage fidelity. Of note, the expression of Pc, E(z) and other PcG genes, as well as the expression of trx, Trl and z, was not significantly altered in aging ISCs, suggesting that aging most likely affects their post-transcriptional regulation and/or function (Tauc, 2021).
Age-related changes in H3K27 methylation have been reported in mammalian SC populations: in aging HSCs and muscle SCs, the H3K27me3 signal exhibits broader coverage and increased intensity at transcriptional start sites and intergenic regions, indicating that there is an evolutionarily conserved effect of aging on PRC function in tissue stem cells. It would be intriguing to explore whether these alterations in H3K27me3 may underlie the age-related dysfunction in lineage potential observed in HSCs of old mice (Tauc, 2021).
Since loss of Pc induces the expression of EC genes, represses EE gene expression, and results in less accessible chromatin associated with ISC identity genes (esg, spdo) as well as pro-neural genes (dpn), it is proposed that Pc activity regulates multiple aspects of ISC specification. Despite the upregulation of EC genes after Pc depletion, ISCs did not spontaneously differentiate, ISC numbers remained normal and ISCs could still mount a proliferative response to infection. Thus, ISC function remained largely intact suggesting the primary function of Pc in ISCs is to regulate lineage commitment (Tauc, 2021).
The contrasting function of trx in EE differentiation is consistent with the known antagonism between Trx and Pc complexes and exemplifies a tightly regulated interplay of these systems in lineage commitment. In addition to upregulating EE genes, loss of trx also induced cell cycle genes and ISC proliferation, suggesting additional roles in controlling ISC function. This finding is in line with a recently published study showing TrxG factors Kismet and Trr limit ISC proliferation in the fly midgut. The fact that this group did not report changes in the EE lineage most likely reflects functional differences across TrxG complexes, the composition of which varies greatly (Tauc, 2021).
A similar function in stem cell specification has been described for PRC1 and PRC2 in the mouse, where both complexes are important to preserve ISC and progenitor cell identity in the gut, while regulating specification into specific daughter cell lineages. Loss of PRC1 function in the intestinal epithelium resulted in impairment of ISC self-renewal via de-repression non-intestinal lineage genes as well as negative regulators of the Wnt signaling pathway. Interestingly, the effects of PRC1 loss were independent of H3K27me3, revealing instead the role of H2AK119 mono-ubiquitination. PRC2 was shown to be important in ISCs only during damage-induced regeneration. In contrast, another study found significant degeneration of the SC compartment under homeostatic conditions as well. Reconciling these findings, a recent study revealed that in the absence of PRC2, mammalian cells shed H3K27me3 exclusively by replicational dilution of modified nucleosomes, and that the effects of PRC2 deletion are thus only observed in lineage progeny rather than in stem cells themselves. Both previous studies are in agreement, however, that PRC2 controls cell fate decisions, as loss of PRC2 leads to an accumulation of secretory cells, evidently due to de-repression of the secretory lineage master regulator, Atoh1, resulting in ISC differentiation. It remains unknown how aging affects PRC function and H3K27 methylation in the mammalian intestine (Tauc, 2021).
Only a few studies have rigorously investigated age-associated changes in the mammalian intestine on a histological and cell-type-specific level. One study reported changes in crypt architecture, decreased mitotic potential of ISCs and an increase in the secretory cell lineage, most likely due to increased Atoh1 expression. Given the role of PRC2 in regulating the secretory lineage in both the mammalian intestine and the fly, it is tempting to speculate a conserved age-related increase in the secretory lineage that stems from deregulation of PRC. The results support a role for increased stress signaling in driving this lineage imbalance, as overactive JNK in ISCs promotes EE differentiation. While JNK signaling has been reported to suppress Pc complex function, the data indicate that in the ISC lineage, this interaction is more complex, as both JNK activation promotes EE specification, while Pc knockdown inhibits EE specification. Further studies are needed to explore the molecular mechanisms mediating JNK/Pc interactions in the ISC lineage (Tauc, 2021).
Chronic elevation of inflammatory signaling is a well-characterized hallmark of the aging fly intestine and a hallmark of many intestinal disorders including inflammatory bowel disease (IBD), infection and colorectal cancer. Alterations in EE cell numbers and secretory activity have been reported to play a role in many diseases. In IBD, for example, EE cells were shown to contribute to pathogenesis by producing pro-inflammatory cytokines. In another study, increased numbers of EE cells were reported in human patients with chronic ulcerative colitis, potentially promoting IBD associated neoplasias. Notably, this study showed that lowering EE numbers by long-term depletion of Pc in ISCs inhibited age-induced intestinal dysplasia, supporting a pathological role for EEs in aging (Tauc, 2021).
The role of epigenetic alterations, and specifically the role of PRC, in inflammatory diseases and cancer is still under investigation. It was recently reported that suppressing EZH2 activity ameliorates experimental intestinal inflammation and delays the onset of colitis-associated cancer. However, these effects may be a consequence of EZH2 suppression in myeloid cells rather than in intestinal stem cells. Disruption in PRC2 function may also underlie human cancers, where PRC2 is often hyperactive or overexpressed. Activating EZH2 mutations, which increase total H3K27me3 levels, increase tumor survival and growth in pre-clinical models and are found in up to 24% of diffuse large B cell lymphomas. Additional work will be needed to establish whether age-related changes in PRC activity contribute to the increased onset of gastrointestinal cancers during aging (Tauc, 2021).
Taken together, these findings provide evidence for altered ISC cell states in old flies that affect intestinal homeostasis and contribute to tissue dysplasia. The results exemplify the importance of maintaining appropriate lineage decisions, as overproduction of EE cells is detrimental to the epithelium, but can be rescued by re-balancing the system towards normal EE numbers. Age-associated deregulation of lineage fidelity of ISCs due to elevated stress and misregulation of Pc are proposed as key drivers of functional decline of the intestinal epithelium. Pc group proteins may thus represent valuable therapeutic targets for age-related morbidities (Tauc, 2021).
Maternally-deposited morphogens specify the fates of embryonic cells via hierarchically regulating the expression of zygotic genes that encode various classes of developmental regulators. Once the cell fates are determined, Polycomb-group proteins frequently maintain the repressed state of the genes. This study investigates how Polycomb-group proteins repress the expression of tailless, which encodes a developmental regulator in Drosophila embryo. Previous studies have shown that maternal Tramtrack69 facilitates maternal GAGA-binding factor and Heat shock factor binding to the torso response element (tor-RE) to initiate tailless repression in the stage-4 embryo. Chromatin-immunoprecipitation and genetic-interaction studies exhibit that maternally-deposited Polycomb repressive complex 1 (PRC1) recruited by the tor-RE-associated Tramtrack69 represses tailless expression in the stage-4 embryo. A noncanonical Polycomb-group response element (PRE) is mapped to the tailless proximal region. High levels of Bric-a-brac, Tramtrack, and Broad (BTB)-domain proteins are fundamental for maintaining tailless repression in the stage-8 to -10 embryos. Trmtrack69 sporadically distributes in the linear BTB-domain oligomer, which recruits and retains a high level of PRC1 near the GCCAT cluster for repressing tll expression in the stage-14 embryos. Disrupting the retention of PRC1 decreases the levels of PRC1 and Pleiohomeotic protein substantially on the PRE and causes tailless derepression in the stage-14 embryo. Furthermore, the retained PRC1 potentially serves as a second foundation for assembling the well-characterized polymer of the Sterile alpha motif domain in Polyhomeotic= protein, which compacts chromatin to maintain the repressed state of tailless in the embryos after stage 14 (Liaw, 2022).
Polycomb group (PcG) and trithorax group (trxG) proteins contribute to the specialization of cell types by maintaining differential gene expression patterns. This study aimed at discovering novel factors that elicit an anti-silencing effect to facilitate trxG-mediated gene activation. This study has developed a cell-based reporter system and performed a genome-wide RNAi screen to discover novel factors involved in trxG-mediated gene regulation in Drosophila. More than 200 genes were discovered affecting the reporter in a manner similar to trxG genes. From the list of top candidates, (Enoki mushroom), a known histone acetyltransferase, was characterized as an important regulator of trxG in Drosophila. Mutants of enok strongly suppressed extra sex comb phenotype of Pc mutants and enhanced homeotic transformations associated with trx mutations. Enok colocalizes with both TRX and PC at chromatin. Moreover, depletion of Enok specifically resulted in an increased enrichment of PC and consequently silencing of trxG targets. This downregulation of trxG targets was also accompanied by a decreased occupancy of RNA-Pol-II in the gene body, correlating with an increased stalling at the transcription start sites of these genes. It is proposed that Enok facilitates trxG-mediated maintenance of gene activation by specifically counteracting PcG-mediated repression. This ex vivo approach led to identification of new trxG candidate genes that warrant further investigation. Presence of chromatin modifiers as well as known members of trxG and their interactors in the genome-wide RNAi screen validated the reverse genetics approach. Genetic and molecular characterization of Enok revealed a hitherto unknown interplay between Enok and PcG/trxG system. It is concluded that histone acetylation by Enok positively impacts the maintenance of trxG-regulated gene activation by inhibiting PRC1-mediated transcriptional repression (Umer, 2019).
This study has developed an ex vivo approach that led to the discovery of several new genes regulating trxG-mediated gene activation. Using a well-characterized bxd-PRE-reporter, comprised of Ubx promoter and enhancers, a cell-based assay was developed, and a genome-wide RNAi screen in Drosophila was performed. Based on the Z scores of trx and ash1 knockdown, a stringent cut-off was defined and more than 200 genes affecting the reporter in a manner similar to trxG genes were identifed. Identification of known members of trxG and their interactors as well as chromatin modifiers in the genome-wide RNAi screen validated the reverse genetics approach and efficacy of the reporter system to discover new regulators of trxG. Moreover, presence of chromatin modifiers like members of TIP60 complex and proteins associated with RNA polymerase II, known to interact with trxG, further substantiates that regulators of gene activation were predominantly identified. Although only a subset of known trxG members were identified in the screen, failure to identify all can be attributed to the highly context-dependent working of PcG/trxG system. Since two specific enhancers of Ubx drive the expression of the reporter, it might be regulated by only a subset of trxG members, which could further explain the failure to identify all members of trxG. Interestingly, some of the top scoring candidates in the screen were also recently found to be a part of the interaction network of GAGA factor, a known trxG member (Umer, 2019).
TrxG-like behavior of Enok was characterized, and its genetic and molecular link with trxG was established. Although Drosophila Enok has previously been shown to interact with PC and Ash1 , its physiological relevance with PcG/trxG or epigenetic cellular memory remains elusive. The current results demonstrate that enok behaves like a trxG gene, by antagonizing PcG, and is essential for maintaining active gene expression in Drosophila. Appearance of extra sex combs in Pc heterozygous males is a consequence of ectopic activation of homeotic genes which relies upon the trxG. However, depletion of trxG proteins counteracts the reduced dose of PC, restoring normal regulation of homeotic genes and suppressing the extra sex comb phenotype. Strong suppression of extra sex comb phenotype by two different mutants of enok illustrates that it acts as a trxG gene, consequently counteracting repression maintained by PcG. This finding is further supported by the fact that both mutant alleles of enok strongly enhance trx mutant phenotype, which also corroborates with drastic reduction in transcript levels of trxG target genes in embryos lacking functional enok. A significant overlap between Enok and TRX at chromatin further validates the genetic analysis. Since depletion of enok led to increased PC binding and enhanced H2AK118ub1 at trxG targets, it is suggested that Enok may specifically inhibit PRC1 and facilitate anti-silencing activity of trxG. In contrast, no change in enrichment of E(z) and its associated mark, H3K27me3, was observed at TSS of trxG targets in cells with reduced enok, indicating recruitment of PRC1 in a potentially H3K27me3-independent manner. Such PRC2-independent recruitment of PRC1 has also been reported previously (Umer, 2019).
In light of these results, it is proposed that Enok counteracts PRC1-mediated block of transcription, evident in the form of stalled Pol-II at the TSS of pnr and pnt in cells with depleted enok. Molecular interaction of Enok with PRC1 on developmental genes in flies and humans (Kang, 2017) further supports the notion that Enok facilitates trxG by inhibiting PRC1. In mice, MOZ (homolog of Enok) is known to play an antagonistic role to PcG member BMI1 in regulating Hox genes (Sheikh, 2015). In agreement with the finding that PC chromodomain binding to H3K27me3 requires an unmodified H3K23, the data suggest that Enok-mediated H3K23ac inhibits binding of PC to its target genes. It is proposed that in the presence of Enok at active loci, acetylated H3K23 inhibits binding of PRC1 leading to increased transcriptional activity of Pol-II. In contrast, loss of Enok leads to decreased H3K23ac, thus allowing PRC1 binding and consequent stalling of Pol-II at TSS. Since Enok was also found to associate with silent loci (bxd, Dfd, iab-7) and interact with PRC1, it is suggested that Enok is kept in an inactive state on these loci by PC in a manner similar to the inhibitory interaction between PC and CBP. Further molecular and biochemical characterization of this intricate relationship between PcG and Enok will help discover how trxG maintains dynamic gene expression patterns during development (Umer, 2019).
In summary, this study has developed a cell-based assay for an ex vivo genome-wide RNAi screen to identify potential trxG regulators in Drosophila. The RNAi screen led to the discovery of more than 200 genes that perturbed the luciferase-based reporter in a manner similar to known trxG members. This study has also provided evidence that Enok, a top trxG candidate in the screen, contributes to anti-silencing action of trxG by counteracting PcG proteins. It is proposed that H3K23 acetylation by Enok counteracts PcG-mediated suppression by inhibiting PRC1 recruitment, contributing to gene activation. Genetic and molecular evidence obtained suggests that Enok interacts with trxG and as a result with their major developmental regulatory targets, thus providing a possible molecular link through which it could influence epigenetic cell memory (Umer, 2019).
Regulatory decisions in Drosophila require Polycomb group (PcG) proteins to maintain the silent state and Trithorax group (TrxG) proteins to oppose silencing. Since PcG and TrxG are ubiquitous and lack apparent sequence specificity, a long-standing model is that targeting occurs via protein interactions; for instance, between repressors and PcG proteins. Instead, this study found that Pc-repressive complex 1 (PRC1) purifies with coactivators Fs(1)h [female sterile (1) homeotic] and Enok/Br140 during embryogenesis. Fs(1)h is a TrxG member and the ortholog of BRD4, a bromodomain protein that binds to acetylated histones and is a key transcriptional coactivator in mammals. Enok and Br140, another bromodomain protein, are orthologous to subunits of a mammalian MOZ/MORF acetyltransferase complex. This study confirmed PRC1-Br140 and PRC1-Fs(1)h interactions and identified their genomic binding sites. PRC1-Br140 bind developmental genes in fly embryos, with analogous co-occupancy of PRC1 and a Br140 ortholog, BRD1, at bivalent loci in human embryonic stem (ES) cells. It is proposed that identification of PRC1-Br140 'bivalent complexes' in fly embryos supports and extends the bivalency model posited in mammalian cells, in which the coexistence of H3K4me3 and H3K27me3 at developmental promoters represents a poised transcriptional state. It is further speculated that local competition between acetylation and deacetylation may play a critical role in the resolution of bivalent protein complexes during developments (Kang, 2017).
Inappropriate activation and/or repression of gene expression underlies many human diseases, yet the mechanisms that execute transitions in developmental gene expression remain poorly defined. How are genes chosen to be initially active or repressed, and how are transitions in gene activity managed with fidelity? Transcription factors clearly regulate these changes, but how can this regulation occur with such specificity when their consensus binding sites and genomic occupancy appear so promiscuous? Together, proteomic and ChIP-seq analyses suggest a model in which PRC1 and MOZ/MORF function to create a poised regulatory state during development (see Model for the role of bivalent complexes in developmental transitions of transcriptional state). As cells differentiate, bivalent protein complexes may eventually be diminished locally, as most loci resolve into either an active or silent state. It is speculated that the choice of activation may occur via increased acetylation, influenced by nearby transcription factors, and subsequent enrichment of Fs(1)h and TrxG proteins such as Ash1, which was specifically recovered in a Br140 pull-down. A transition toward silencing may involve deacetylation and a decrease in TrxG (Kang, 2017).
The retention of some bivalency after initial transcriptional choices are made in embryogenesis is likely to allow critical reversibility for subsequent gene expression programming. However, if transcriptional state is not dictated strictly by the occupancy of bivalent components, how are these states manifested? It is speculated that local post-translational modifications (PTMs) may be critical for the specification of transcriptional state and for reversibility. For example, the Enok subunit of dMOZ/MORF is known to acetylate H3K23, while this mark is incompatible with Pc chromodomain binding to H3K27me3 on the same histone tail. Interestingly, enrichment of H3K23ac from modENCODE data sets on the set of potentially bivalent genes was not observed, but further analysis will be required to investigate the significance of this finding. Competition between the cognate enzymatic activities within bivalent complexes and their interactors may be central to their ability to act as reversible switches of transcriptional state. Future studies to address this hypothesis will require improved approaches to comprehensive PTM detection as well as in vitro reconstitution of key interactions and biochemical activities of bivalent complexes containing the appropriately modified subunits (Kang, 2017).
The results are consistent with recent studies in which PRC1 is found on active genes in many systems, and PRC1 targeting is largely independent of PRC2 (Kahn, 2016). Most exciting is the likely conservation in zebrafish (Laue, 2008) and mice (Sheikh, 2015), based on the opposing genetic activities of PRC1 and MOZ/MORF complexes in regulation of the Hox genes. The reliance on a universal transducer of transcription factor activity in developmental decisions would be an elegant solution to the problem of widespread binding of sequence-specific regulators, as, in the model (see Model for the role of bivalent complexes in developmental transitions of transcriptional state), only local interactions with preset bivalency will result in functional consequences (Kang, 2017).
Key fundamental questions remain. In particular, how are PRC1 and MOZ/MORF targeted in the first place? PREs are cis-acting regulatory elements that can recruit PRC1 and PRC2 to target genes in Drosophila. PREs lack universal consensus sequences but contain combinations of motifs for many DNA-binding proteins. Therefore, diverse protein–protein interactions with the PcG could be critical for initial binding, as postulated from classical genetics. A speculative alternative is that the 5′ TSSs of developmentally regulated genes may remain epigenetically marked throughout the life cycle of the organism to specify the initial association of bivalent complexes. Both BRD4 and BRPF1 have been identified as 'bookmarking proteins' that may retain vital information throughout the cell cycle, based on their ability to remain at their chromosomal binding sites through mitosis (Dey, 2003; Laue, 2008). Furthermore, Fs(1)h and Enok are essential for oogenesis, and genic acetylation is detected very early in embryogenesis (Li, 2014). Finally, the importance of maternal E(z) suggests that H3K27me3 could be at least part of such an inherited mark for developmental genes (Kang, 2017).
In summary, the results provide evidence for bivalent protein complexes that may correspond to a bivalent transcriptional state in Drosophila embryos and mammalian stem cells. Beyond identification of these intriguing protein interactions in flies, it is speculated that their identity reveals a likely role for acetylation in the resolution of bivalency. It is envisioned that the choice toward activation may be triggered and maintained by cell-specific transcription factors that drive the acetylated state, favoring MOZ/MORF and BRD4 bromodomain-dependent association with chromatin. Cell type decisions may be governed by a constant assessment of the amount of acetylation at each TSS, consistent with the enrichment of deacetylases on even very active genes. Deacetylation would favor loss of bromodomain–acetyl interactions and ultimately the loss of coactivators, leading toward the establishment of a stably silenced state. The ability to regulate genes while only partially resolving bivalent complexes is likely to be critical for reversibility in response to changes in cell type-specific transcription factor expression and binding. It is proposed that regulatory elements possess the intrinsic ability to switch fate dependent on this local balance, with de novo targeting rarely required (Kang, 2017).
Neural progenitors produce diverse cells in a stereotyped birth order, but can specify each cell type for only a limited duration. In the Drosophila embryo, neuroblasts (neural progenitors) specify multiple, distinct neurons by sequentially expressing a series of temporal identity transcription factors with each division. Hunchback (Hb), the first of the series, specifies early-born neuronal identity. Neuroblast competence to generate early-born neurons is terminated when the hb gene relocates to the neuroblast nuclear lamina, rendering it refractory to activation in descendent neurons. Mechanisms and trans-acting factors underlying this process are poorly understood. This study identified Corto, an enhancer of Trithorax/Polycomb (ETP) protein, as a new regulator of neuroblast competence. The GAL4/UAS system was used to drive persistent misexpression of Hb in neuroblast 7-1 (NB7-1), a model lineage for which the early competence window has been well characterized, to examine the role of Corto in neuroblast competence. immuno-DNA Fluorescence in situ hybridization (DNA FISH) was used in whole embryos to track the position of the hb gene locus specifically in neuroblasts across developmental time, comparing corto mutants to control embryos. Finally, immunostaining was used in whole embryos to examine Corto's role in repression of Hb and a known target gene, Abdominal B (Abd-B). In corto mutants, the hb gene relocation to the neuroblast nuclear lamina was found to be delayed and the early competence window is extended. The delay in gene relocation occurs after hb transcription is already terminated in the neuroblast and is not due to prolonged transcriptional activity. Further, it was found that Corto genetically interacts with Posterior Sex Combs (Psc), a core subunit of polycomb group complex 1 (PRC1), to terminate early competence. Loss of Corto does not result in derepression of Hb or its Hox target, Abd-B, specifically in neuroblasts. These results show that in neuroblasts, Corto genetically interacts with PRC1 to regulate timing of nuclear architecture reorganization and support the model that distinct mechanisms of silencing are implemented in a step-wise fashion during development to regulate cell fate gene expression in neuronal progeny (Hafer, 2022).
To maintain cellular identities during development, gene expression profiles must be faithfully propagated through cell generations. The reestablishment of gene expression patterns upon mitotic exit is mediated, in part, by transcription factors (TF) mitotic bookmarking. However, the mechanisms and functions of TF mitotic bookmarking during early embryogenesis remain poorly understood. This study took advantage of the naturally synchronized mitoses of Drosophila early embryos, providing evidence that GAGA pioneer factor (GAF) acts as a stable mitotic bookmarker during zygotic genome activation. During mitosis, GAF remains associated to a large fraction of its interphase targets, including at cis-regulatory sequences of key developmental genes with both active and repressive chromatin signatures. GAF mitotic targets are globally accessible during mitosis and are bookmarked via histone acetylation (H4K8ac). By monitoring the kinetics of transcriptional activation in living embryos, this study reports that GAF binding establishes competence for rapid activation upon mitotic exit (Bellec, 2022).
This study set out to determine how gene regulation by a transcription factor might be propagated through mitosis in a developing embryo. By using a combination of quantitative live imaging and genomics, evidence is provided that the pioneer-like factor GAF acts as a stable mitotic bookmarker during zygotic genome activation in Drosophila embryos (Bellec, 2022).
The results indicate that during mitosis, GAF binds to an important fraction of its interphase targets, largely representing cis-regulatory sequences of key developmental genes. It was noticed that GAF mitotically retained targets contain a larger number of GAGA repeats than GAF interphase-only targets and that this number of GAGA repeats correlates with the broadness of accessibility. Multiple experiments, with model genes in vitro (e.g., hsp70, hsp26) or from genome-wide approaches clearly demonstrated that GAF contributes to the generation of nucleosome-free regions. The general view is that this capacity is permitted through the interaction of GAF with nucleosome remodeling factors as PBAP (SWI/SNIF), NURF (ISWI), or FACT. Although not yet confirmed with live imaging, immunostaining data suggest that NURF is removed during metaphase but re-engages chromatin by anaphase. If the other partners of GAF implicated in chromatin remodeling are evicted during early mitosis, chromatin accessibility at GAF mitotic targets could be established prior to mitosis onset and then maintained through mitosis owing to the remarkable stability of GAF binding. However, GAF interactions with other chromatin remodelers (e.g., PBAP) during mitosis and a scenario whereby mitotic accessibility at GAF targets would be dynamically established during mitosis thanks to the coordinated action of GAF and its partners cannot be excluded (Bellec, 2022).
It is proposed that the function of GAF as a mitotic bookmarker is possible because GAF has the intrinsic property to remain bound to chromatin for long periods (residence time in the order of minutes). This long engagement of GAF to DNA is in sharp contrast with the binding kinetics of many other TF, such as Zelda or Bicoid in Drosophila embryos or pluripotency TF in mouse ES cells. Another particularity of GAF binding, contrasting with other TF, resides in the multimerization of its DNA-binding sites as GAGAG repeats in a subset of its targets (76% of mitotically retained peaks display four or more repetitions of GAGAG motifs). Given the known oligomerization of GAF70 and as GAF is able to regulate transcription in a cooperative manner, it is tempting to speculate that GAF cooperative binding on long stretches of GAGAG motifs may contribute to a long residence time (Bellec, 2022).
Collectively, it is proposed that the combination of long residence time and the organization of GAF-binding sites in the genome may allow the stable bookmarking of a subset of GAF targets during mitosis (Bellec, 2022).
In this study, it was also discovered that a combination of GAF and histone modification could be at play to maintain the chromatin state during mitosis. Indeed, mitotic bookmarking may also be supported by the propagation of histone tail modifications from mother to daughter cells. Work from mammalian cultured cells revealed widespread mitotic bookmarking by epigenetic modifications, such as H3K27ac and H4K16ac. Moreover, H4K16ac transmission from maternal germline to embryos has recently been established. In the case of GAF, it is proposed that the combinatorial action of GAF and epigenetic marks, possibly selected via GAF interacting partners, will contribute to the propagation of various epigenetic programs. It would be therefore interesting to employ the established mitotic ChIP method to survey the extent to which cis-regulatory regions exhibit different mitotic histone mark modifications during embryogenesis (Bellec, 2022).
A key aspect of mitotic bookmarking is to relate mitotic binding to the rapid transcriptional activation after mitosis. This study has shown that GAF plays a role in the timing of reactivation after mitosis. However, it is noted that GAF binding during mitosis is not the only means to accelerate gene activation. Indeed, it has been shown shown that mechanisms such as enhancer priming by Zelda, paused polymerase or redundant enhancers contribute to fast gene activation. Moreover, a transcriptional memory bias can occur for a transgene not regulated by GAF. By modeling the transcriptional activation of the gene scylla, it was revealed that GAF accelerates the epigenetic steps prior to activation, selectively in the descendants of active nuclei. A model is proposed where GAF binding helps in the decision-making of the postmitotic epigenetic path. In this model, mitotic bookmarking by GAF would favor an epigenetic path with fast transitions after mitosis. In the context of embryogenesis, bookmarking would lead to the fast transmission of select epigenetic states and may contribute to gene expression precision (Bellec, 2022).
Interestingly, GAF vertebrate homolog (vGAF/Th-POK) has recently been implicated in the maintenance of chromatin domains during zebrafish development. It is therefore suspected that GAF action as a stable bookmarking factor controlling transcriptional memory during Drosophila ZGA might be conserved in vertebrates (Bellec, 2022).
Environmental temperature can affect chromatin-based gene regulation, in particular in ectotherms such as insects. Genes regulated by the Polycomb group (PcG) vary in their transcriptional output in response to changes in temperature. Expression of PcG-regulated genes typically increases with decreasing temperatures. This study examined variations in temperature-sensitive expression of PcG target genes in natural populations from different climates of Drosophila melanogaster, and differences thereof across different fly stages and tissues. Temperature-induced expression plasticity was found to be stage- and sex-specific with differences in the specificity between the examined PcG target genes. Some tissues and stages, however, showed a higher number of PcG target genes with temperature-sensitive expression than others. Overall, higher levels of temperature-induced expression plasticity was found in African tropical flies from the ancestral species range than in flies from temperate Europe. Differences were also observed between temperate flies, however, with more reduction of expression plasticity in warm-temperate than in cold-temperate populations. Although in general, temperature-sensitive expression appeared to be detrimental in temperate climates, there were also cases in which plasticity was increased in temperate flies, as well as no changes in expression plasticity between flies from different climates (Voigt, 2021).
The mechanisms coordinating energy consumption during stress response in multicellular organisms are not well understood. This study shows that loss of the epigenetic regulator G9a in Drosophila causes a shift in the transcriptional and metabolic responses to oxidative stress (OS) that leads to decreased survival time upon feeding the xenobiotic paraquat. During OS exposure, G9a mutants show overactivation of stress response genes, rapid depletion of glycogen, and inability to access lipid energy stores. The OS survival deficiency of G9a mutants can be rescued by a high-sugar diet. Control flies also show improved OS survival when fed a high-sugar diet, suggesting that energy availability is generally a limiting factor for OS tolerance. Directly limiting access to glycogen stores by knocking down glycogen phosphorylase recapitulates the OS-induced survival defects of G9a mutants. It is proposed that G9a mutants are sensitive to stress because they experience a net reduction in available energy due to (1) rapid glycogen use, (2) an inability to access lipid energy stores, and (3) an overinduced transcriptional response to stress that further exacerbates energy demands. This suggests that G9a acts as a critical regulatory hub between the transcriptional and metabolic responses to OS. These findings, together with recent studies that established a role for G9a in hypoxia resistance in cancer cell lines, suggest that G9a is of wide importance in controlling the cellular and organismal response to multiple types of stress (Riahi, 2019).
3' end cleavage of metazoan replication-dependent histone pre-mRNAs requires the multi-subunit holo-U7 snRNP and the stem-loop binding protein (SLBP). The exact composition of the U7 snRNP and details of SLBP function in processing remain unclear. To identify components of the U7 snRNP in an unbiased manner, a novel approach was developed for purifying processing complexes from Drosophila and mouse nuclear extracts. In this method, catalytically active processing complexes are assembled in vitro on a cleavage-resistant histone pre-mRNA containing biotin and a photo-sensitive linker, and eluted from streptavidin beads by UV irradiation for direct analysis by mass spectrometry. In the purified processing complexes, Drosophila and mouse U7 snRNP have a remarkably similar composition, always being associated with CPSF73, CPSF100, symplekin and CstF64. Many other proteins previously implicated in the U7-dependent processing are not present. Drosophila U7 snRNP bound to histone pre-mRNA in the absence of SLBP contains the same subset of polyadenylation factors but is catalytically inactive and addition of recombinant SLBP is sufficient to trigger cleavage. This result suggests that Drosophila SLBP promotes a structural rearrangement of the processing complex, resulting in juxtaposition of the CPSF73 endonuclease with the cleavage site in the pre-mRNA substrate (Skrajna, 2018).
In metazoans, 3' end processing of replication-dependent histone pre-mRNAs occurs through a single endonucleolytic cleavage, generating mature histone mRNAs that lack a poly(A) tail. This specialized 3' end processing reaction depends on the U7 snRNP, the core of which consists of a ∼60-nt U7 snRNA and a unique heptameric Sm ring. In the ring, the spliceosomal subunits SmD1 and SmD2 are replaced by the related Lsm10 and Lsm11 proteins, whereas the remaining subunits (SmB, SmD3, SmE, SmF and SmG) are shared with the spliceosomal snRNPs (Skrajna, 2018).
Lsm11 contains an extended N-terminal region that interacts with the N-terminal region of the 220 kDa protein FLASH. Together, they recruit a specific subset of the proteins that participate in 3' end processing of canonical pre-mRNAs by cleavage and polyadenylation, resulting in formation of the holo-U7 snRNP (Skrajna, 2014). This subset of polyadenylation factors is referred to as the histone pre-mRNA cleavage complex (HCC) and in mammalian nuclear extracts includes symplekin, all subunits of CPSF (CPSF160, WDR33, CPSF100, CPSF73, Fip1 and CPSF30) and CstF64 as the only CstF subunit. The remaining components of the cleavage and polyadenylation machinery, including CstF50 and CstF77, the two CF Im subunits of 68 and 25 kDa, and the two subunits of CF IIm (Clp1 and Pcf11) were consistently absent in the HCC (Yang, 2013). A similar subset of polyadenylation factors is associated with the Drosophila holo-U7 snRNP (Sabath, 2013; Skrajna, 2018 and references therein).
The substrate specificity in the processing reaction is provided by the U7 snRNA, which through its 5' terminal region base pairs with the histone downstream element (HDE), a sequence in histone pre-mRNA located downstream of the cleavage site. This interaction is assisted by the stem-loop binding protein (SLBP), which binds the highly conserved stem-loop structure located upstream of the cleavage site (Wang, 1996; Martin, 1997; Tan, 2013) and stabilizes the complex of U7 snRNP with histone pre-mRNA (Dominski, 1999), likely by contacting FLASH and Lsm11 (Skrajna, 2017). In mammalian nuclear extracts, histone pre-mRNAs that form a strong duplex with the U7 snRNA are cleaved efficiently in the absence of SLBP. In contrast, Drosophila nuclear extracts lacking SLBP are inactive in cleaving histone pre-mRNAs, suggesting that Drosophila SLBP plays an essential role in processing in addition to stabilizing binding of the U7 snRNP to histone pre-mRNA (Skrajna, 2017, Sabath, 2013, Dominski, 2003; Lanzotti, 2002; Dominski, 2005; Skrajna, 2018 and references therein).
Within the HCC, CPSF73 is the endonuclease, acting in a close partnership with its catalytically inactive homolog, CPSF100, and the heat-labile scaffolding protein symplekin (Kolev, 2005). RNAi-mediated depletion of these three HCC subunits in Drosophila cultured cells results in generation of polyadenylated histone mRNAs, an indication of their essential role in the U7-dependent processing. Depletion of the remaining components of the HCC had no effect on the 3' end of histone mRNAs and their function in the U7 snRNP, if any, is less clear. Previous in vivo studies implicated multiple other proteins, in addition to SLBP and components of the U7 snRNP, in generation of correctly processed histone pre-mRNAs. These proteins include ZFP100, CDC73/parafibromin, NELF E, Ars2, CDK9, CF Im68 and RNA-binding protein FUS/TLS (Fused in Sarcoma/Translocated in Sarcoma). ZFP100, CF Im68 and FUS were shown to interact with Lsm11, whereas Ars2 was shown to interact with FLASH, raising the possibility that they may be essential components of the cleavage machinery (Skrajna, 2018).
To determine which factors are required for the cleavage reaction, a novel method for purification of in vitro assembled Drosophila and mouse processing complexes was developed. In this method, histone pre-mRNAs containing biotin and a photo-cleavable linker in either cis or trans are incubated with a nuclear extract and the assembled processing complexes are immobilized on streptavidin beads, washed and released into solution by irradiation with long wave UV. This approach yielded remarkably pure processing complexes that were suitable for direct and unbiased analysis by mass spectrometry, providing a complete view of the holo-U7 snRNP and other proteins that associate with histone pre-mRNA for 3' end processing (Skrajna, 2018).
In this method, processing complexes were assembled in a nuclear extract on a synthetic histone pre-mRNA containing biotin and a photo-cleavable linker at the 5' end. The major cleavage site and the two neighboring nucleotides on each side were modified with a 2'O-methyl group, hence preventing endonucleolytic cleavage of the pre-mRNA and increasing the efficiency of capturing intact processing complexes. Following immobilization on streptavidin beads, the pre-mRNA and the bound proteins were washed and released to solution by irradiation with long wave UV. This UV-elution step, by eliminating all background proteins non-specifically bound to streptavidin beads, resulted in isolation of remarkably pure processing complexes that were suitable for direct analysis by mass spectrometry. This is the first successful use of the photo-cleavable linker and the UV-elution step for purification of an in vitro assembled RNA/protein complex. Parallel experiments with pre-mRNA substrates lacking 2'O-methyl nucleotides at the cleavage site demonstrated that the immobilized processing complexes retain catalytic activity. Thus, the mass spectrometry analysis of the UV-eluted material is likely to provide a global and unbiased view of all essential proteins that associate with histone pre-mRNA for 3' end processing (Skrajna, 2018).
Since chemical synthesis of RNAs containing covalently attached biotin and the photo-cleavable linker (cis configuration) is both expensive and limited to sequences not exceeding 60-70 nt, longer histone pre-mRNAs generated by T7 transcription were tested. Biotin and the photo-cleavable linker can be attached to the 3' end of these pre-mRNAs in trans via a short complementary oligonucleotide. This modification makes the UV-elution method more cost effective and potentially applicable for purification of RNA-protein complexes that require longer RNA binding targets, including spliceosomes and complexes involved in cleavage and polyadenylation (Skrajna, 2018).
In the UV-eluted mouse and Drosophila processing complexes, mass spectrometry identified SLBP and all known subunits of the U7-specific Sm ring, including Lsm10 and Lsm11. Readily detectable in mouse and Drosophila processing complexes were also FLASH and subunits of the HCC. The HCC is remarkably similar in composition between the two species, with symplekin, CPSF100, CPSF73 and CstF64 being most abundant and present in close to stoichiometric amounts, as determined by both silver staining and emPAI value analysis. The remaining CPSF subunits (CPSF160, WDR33, Fip1 and CPSF30) are present in lower amounts, suggesting that they are substoichiometric, being stably associated only with a fraction of the U7 snRNP (Skrajna, 2018).
In both mouse and Drosophila experiments, SLBP and the components of the U7 snRNP were the only proteins that consistently failed to bind histone pre-mRNAs in the presence of processing competitors: SL RNA and αU7 oligonucleotide. Other proteins were detected both in the samples containing processing complexes and in the matching negative controls, where formation of processing complexes was blocked. Among them, the most prevalent were non-specific RNA binding proteins, including hnRNP Q in mouse nuclear extracts, and IGF2BP1 in Drosophila nuclear extracts. All these proteins likely bind to sites in histone pre-mRNAs unoccupied by SLBP and U7 snRNP, and play no essential role in processing (Skrajna, 2018).
CstF50 and CstF77 were not detected in the UV-eluted mouse processing complexes and were present only in some Drosophila complexes, always with low scores, consistent with a previous conclusion that of the three CstF subunits only CstF64 stably associates with the U7 snRNP (Yang, 2013). No peptides were detected for CF Im (68 and 25 kDa) and CF IIm (Clp1 and Pcf11) in any of the mouse experiments, suggesting that these factors are also uniquely involved in cleavage and polyadenylation. Mass spectrometry identified the orthologues of the 68 and 25 kDa subunits in some Drosophila experiments, but they were clearly contaminants, persisting in the presence of the SL RNA and αU7 oligonucleotide. CF Im68 was previously reported to interact with Lsm11 and to co-purify with U7 snRNP. Based on this analysis, this subunit is unlikely to interact with Lsm11 in the processing complex (Skrajna, 2018).
Catalytically active mouse processing complexes also lacked ZFP100 (ZN473), a zinc finger protein that co-localizes with Lsm11 and stimulates expression of a reporter gene containing U7-dependent processing signals. ZFP100 was initially identified by the yeast two-hybrid system as a protein interacting with SLBP bound to the SL RNA and suggested to function as a bridging factor in the SLBP-mediated recruitment of the U7 snRNP to histone pre-mRNA. However, the absence of ZFP100 in the UV-eluted mouse processing complexes containing both SLBP and U7 snRNP strongly argues against this function. ZFP100 may instead participate in a different aspect of histone gene expression in vivo, perhaps acting as a coupling factor that integrates transcription of histone genes with 3' end processing of the nascent histone pre-mRNAs (Skrajna, 2018).
A similar role in vivo may be played by the multi-functional protein FUS and other proteins previously linked to 3' end processing of histone pre-mRNAs in mammalian cells, including Ars2, CDC73/parafibromin, NELF E and CDK9. These factors were never specifically detected in the UV-eluted mouse processing complexes, suggesting that they have no direct role in processing in vitro. Their downregulation by RNAi results in production of a small amount of polyadenylated histone mRNAs, which may be due to a defect in coupling of histone gene transcription with processing and/or cell-cycle progression (Skrajna, 2018).
Although this study identified several polyadenylation subunits in a stable association with the U7 snRNP, the experiments do not directly address which of them are essential for processing of histone pre-mRNAs. In Drosophila cultured cells, RNAi-mediated depletion of each of only three U7-associated polyadenylation subunits, symplekin, CPSF100 and CPSF73, consistently resulted in accumulation of histone mRNAs terminated with a poly(A) tail, an indication of a defect in the U7-dependent processing mechanism. Depletion of the remaining HCC subunits had no effect, suggesting that their association with the U7 snRNP is not essential for 3' end processing of histone pre-mRNAs. Symplekin, CPSF100 and CPSF73 are present in Drosophila cells as a stable sub-complex (Sullivan, 2009) and likely act together as an autonomous cleavage module recruited for processing to either histone or canonical pre-mRNAs by specialized RNA recognition sub-complexes. For canonical pre-mRNAs, this role is played by the remaining CPSF subunits, CPSF160, WDR33, Fip1 and CPSF30, recently shown to co-operate in recognizing the AAUAAA signal during the polyadenylation step. In 3' end processing of histone pre-mRNAs, the recruitment of the cleavage sub-complex is mediated by the U7 snRNA, which recognizes the substrate by the base pairing interaction, further arguing that CPSF160, WDR33, Fip1 and CPSF30 are likely non-essential bystanders in the U7 snRNP (Skrajna, 2018).
A less clear role in 3' end processing of histone pre-mRNAs is played by CstF64, which in spite of being relatively abundant in Drosophila U7 snRNP can be depleted from Drosophila cells without causing a detectable misprocessing of histone pre-mRNAs. A defect in the U7-dependent processing was however observed in human cells partially depleted of CstF64, suggesting that in mammalian cells this subunit may play a more critical role, perhaps helping to stabilize the three essential subunits of the HCC on the FLASH/Lsm11 complex. Clearly, determining which subunits are essential for cleavage will require reconstitution of a catalytically active processing complex from recombinant components (Skrajna, 2018).
This study brings a new perspective on the essential role of Drosophila SLBP in processing. It was recently demonstrated that Drosophila SLBP, like its mammalian counterpart, enhances the recruitment of U7 snRNP to histone pre-mRNA (Skrajna, 2017). A small amount of U7 snRNP binds to histone pre-mRNA in the absence of Drosophila SLBP but the bound U7 snRNP in spite of containing all major HCC subunits is catalytically inactive. This study now shows that processing complexes assembled in the absence of SLBP can be activated for cleavage by simply adding recombinant WT SLBP, providing evidence that SLBP is the only missing factor in the assembled complexes. A mutant Drosophila SLBP that is deficient in recruiting U7 snRNP to histone pre-mRNA is also unable to activate the assembled complex for cleavage. Based on these results, it is proposed that the interaction of Drosophila SLBP with the U7 snRNP promotes an essential structural rearrangement of the entire processing complexes that juxtaposes the catalytic site of CPSF73 with the pre-mRNA (see A hypothetical model explaining essential role of Drosophila in processing). It is possible that higher metazoans developed an additional positioning mechanism for the CPSF73 endonuclease, resulting in efficient cleavage in the absence of SLBP (Skrajna, 2018).
The early embryos of many animals including flies, fish, and frogs have unusually rapid cell cycles and delayed onset of transcription. These divisions are dependent on maternally supplied RNAs and proteins including histones. Previous work suggests that the pool size of maternally provided histones can alter the timing of zygotic genome activation (ZGA) in frogs and fish. This study examine the effects of under and overexpression of maternal histones in Drosophila embryogenesis. Decreasing histone concentration advances zygotic transcription, cell cycle elongation, Chk1 activation, and gastrulation. Conversely, increasing histone concentration delays transcription and results in an additional nuclear cycle before gastrulation. Numerous zygotic transcripts are sensitive to histone concentration, and the promoters of histone sensitive genes are associated with specific chromatin features linked to increased histone turnover. These include enrichment of the pioneer transcription factor Zelda and lack of SIN3A and associated histone deacetylases. These findings uncover a critical regulatory role for histone concentrations in ZGA of Drosophila (Wilky, 2019).
To understand the effects of histone concentration on the MBT maternally supplied histones were reduced by downregulating the gene encoding a crucial histone regulator, Stem-Loop Binding Protein (Slbp) via maternally driven RNAi. Under these conditions, histone H2B was reduced by ~50% and H3 by ~60% at the MBT. Approximately 50% of embryos laid by Slbp RNAi mothers (henceforth Slbp embryos) that form a successful blastoderm do not undergo the final division and attempt gastrulation in NC13. Another ~30% exhibit an intermediate phenotype of partial arrest, with only part of the embryo entering NC14. A minority of Slbp embryos begin gastrulation with all nuclei in NC14. NC12 duration was predictive of NC13 arrest, with NC12 being an average of ~5min longer in Slbp embryos that went on to arrest compared with those that did not arrest (Wilky, 2019).
Cellularization was first detected in wild-type (WT) embryos ~20 min into NC14. Partially arrested Slbp embryos also began cellularization ~20 min into NC14, with nuclei that arrested in NC13 waiting until the remainder of the embryo had entered NC14 to cellularize. Fully arrested embryos began cellularization ~20min into NC13, initiating cellularization one cycle early and ~20min earlier in overall developmental time than WT. Despite their reduced cell number, these embryos form mitotic domains and gastrulate without obvious defects, however they die before hatching (Wilky, 2019).
To examine the effects of increased histone concentration on developmental timing cell cycle progression was monitored in embryos from abnormal oocyte (abo) mutant mothers (henceforth abo embryos). abo is a histone locus-specific transcription factor, the knockdown of which increases the production of replication-coupled histones, particularly H2A and H2B (Berloco, 2001). abo increased H2B by ~90%, whereas total (combined replication-coupled and replication-independent) H3 was not affected in NC14 embryos. Approximately 60% of abo embryos displayed fertilization defects or catastrophic early nuclear divisions. Of abo embryos that formed a functioning blastoderm, ~6% underwent a complete extra nuclear division before gastrulating in NC15, whereas ~4% underwent a partial extra nuclear division. Embryos from abo mothers that completed total extra divisions had faster NC14s in which they did not cellularize and spent 40-60 min in NC15 before gastrulating. This suggests an alteration of the normal transcription-dependent developmental program. In some cases, the cell cycle program and transcriptional program may be decoupled, evidenced by the fact that some abo embryos attempted to gastrulate while still in the process of division. abo embryos that underwent extra divisions exhibited a range of gastrulation defects including expanded mitotic domains and ectopic furrow formation (Wilky, 2019).
Since alterations in histone levels can both decrease and increase the number of divisions before cell cycle slowing, it was reasoned that histone levels might affect activation of checkpoint kinase 1 (Chk1, also known as grp), which is required for cell cycle slowing at the MBT. To test this, a fluorescent biosensor of Chk1 activity was crosses into the Slbp background. Even in Slbp embryos that did not undergo early gastrulation, Chk1 activity was higher than in WT, consistent with the lengthened cell cycle. This result indicates that the observed cell cycle phenotypes in the histone-manipulated embryos are likely mediated through changes in Chk1 activity (Wilky, 2019).
As cellularization and gastrulation require zygotic transcription, it was suspected that embryos with altered development likely have altered gene expression. Single-embryo RNA-seq was performed on staged Slbp embryos that remained in NC13 for more than 30 min. These were compared with either NC-matched (NC13) or time-matched (NC14) WT embryos. To control for maternal effects of Slbp RNAi, pre-blastoderm stage WT and Slbp embryos were compared. The Slbp embryos underwent ZGA one NC earlier than WT. ~5000 genes were identified that were differentially expressed between Slbp and WT NC13, with ~60% being upregulated. The upregulated genes have largely previously been identified as new zygotic transcripts, including cell cycle regulators such as fruhstart (frs, also known as Z600) and signaling molecules such as four-jointed (fj), whereas the downregulated genes are enriched for maternally degraded transcripts. This is thought to represent a coherent change in ZGA timing instead of global transcription dysregulation, as 98% of the genes that are overexpressed in Slbp are expressed before the end of NC14 in the control or previously published datasets. Indeed, the transcriptomes of histone-depleted embryos that stopped in NC13 are more similar to WT NC14 than WT NC13, which suggests a role for cell cycle elongation in ZGA. Nonetheless, ~1500 genes are differentially expressed between Slbp NC13 and WT NC14 without accounting for differences in ploidy. Of these, the majority of the ~1000 overexpressed genes are again associated with zygotic transcription, and downregulated genes associated with maternal products. Thus, ZGA is even further accelerated in the histone knockdown than can be explained by purely time alone (Wilky, 2019).
As ZGA is accelerated by histone depletion, it was asked whether ZGA would be delayed in the histone overexpression mutant. RNA-seq was performed on pools of abo and WT embryos collected 15-30 min into NC14. >1000 genes were identified that were differentially expressed between abo and WT, with approximately equal numbers of genes up- and down-regulated. As expected, the downregulated genes in abo were enriched for previously identified zygotically expressed transcripts, and upregulated transcripts were enriched for maternally deposited genes. Thus, histone overexpression delays the onset of ZGA (Wilky, 2019).
Zygotic genes, the transcription of which is upregulated by histone depletion and downregulated by histone overexpression, contain many important developmental and cell cycle regulators including: frs, hairy (h), fushi tarazu (ftz) and odd-skipped (odd). Conversely, the maternally degraded transcripts that are destabilized by histone depletion and stabilized by histone overexpression include several cell cycle regulators such as Cyclin B (CycB), string (stg, also known as Cdc25string) and Myt1. Therefore, histone concentration can modulate the expression and stability of specific cell cycle regulators, which may contribute to the onset of MBT (Wilky, 2019).
Since histone concentration has previously been implicated in sensing the nuclear-cytoplasmic (N/C) ratio (Amodeo, 2015), this study compared the genes that are changed in both the histone under- and overexpression embryos with those that had previously been found to be dependent on either the N/C ratio or developmental time (Lu, 2009). Both previously identified N/C ratio-dependent and time-dependent genes (Lu, 2009) followed the same general trends as the total zygotic gene sets, indicating that histone availability cannot explain these previous classifications (Wilky, 2019).
Next, attempts were made to disentangle the effects of cell cycle length from transcription in the histone overexpression mutant. Single-embryo time-course RNA-seq was performed on abo and WT embryos collected 3 min before mitosis of NC10-NC13 and 3 min into NC14. In addition, unfertilized embryos (henceforth NC0) of both genotypes were collected to control for differences in maternal contribution. Even with a stringent selection process that accounted for cell cycle time and embryo health, a small set of robustly upregulated (179) and downregulated (260) genes was detected across NC10-NC14. Of the newly transcribed genes, 111 genes were detected with delayed transcription, including frs and only 37 that are upregulated. These results were confirmed using qPCR. When compared with previous datasets, zygotic genes tend to be underexpressed, as was the case for the pooled abo dataset; however, the majority of these enrichments are not statistically significant. Nonetheless the majority of these underexpressed genes are expressed during NC14 in WT. This geneset, in combination with the time-matched Slbp comparison, enables further examination of the chromatin features that underlie histone sensitivity for transcription independent of cell cycle changes (Wilky, 2019).
To identify chromatin features associated with histone sensitivity, the presence was compared of 143 modENCODE chromatin signals near the transcriptional start site (TSS±500 bp) of genes whose expression was altered by changes in histone concentration independent of cell cycle time. A clear pattern was found of unique chromatin features for the histone-sensitive genes, compared with all newly transcribed genes, that was highly similar between the histone over- and underexpression experiments. The pioneer transcription factor Zld, known to be important for nucleosome eviction during ZGA, was enriched in the promoters of histone-sensitive genes. Insulator proteins such as BEAF-32 and CP190 were depleted in histone-sensitive genes. Promoters of histone-sensitive genes also show a strong reduction for SIN3A, a transcriptional repressor associated with cell cycle regulation. SIN3A is known to recruit HDACs to TSSs, and almost all HDACs also show significant de-enrichment at the TSSs of histone-sensitive genes. Taken together, these marks make up a unique chromatin signature that may sensitize a locus to changes in histone concentration, as is likely for pioneer factors such as Zld. Other aspects of this signature may indicate that these genes are subsequently subject to later developmental regulation, as indicated by H3K4me3 and H3K27me3 (Wilky, 2019).
This study has demonstrated that histone concentration regulates the timing of the MBT in Drosophila, resulting in both early gastrulation and extra pre-MBT divisions from histone reduction and increase, respectively. Histone concentration also regulates ZGA. Thousands of genes are prematurely transcribed in histone-depleted embryos and hundreds of genes are delayed in histone-overexpressing embryos. The majority of these genes appear to be downstream of changes in cell cycle duration, suggesting a model in which histones directly regulate cell cycle progression. In other cell types, histone loss halts the cell cycle via accumulation of DNA damage and stalled replication forks. In the early embryo, changes in histone availability may similarly create replication stress to directly or indirectly activate Chk1 as this study has shown. In turn, Chk1 would inhibit Stg and/or Twine to slow the cell cycle. This mechanism is supported by previous observations that loss of zygotic histones causes the downregulation of Stg in the first post-MBT cell cycle. In this case, the observed transcriptional changes would be independent or downstream of the altered cell cycle (Wilky, 2019).
Alternatively, direct changes in transcription downstream of histone availability may feed into the cell cycle. In bulk, histone-sensitive transcripts might underlie the replication stress that has been previously proposed to slow the cell cycle at the MBT. Consistent with this, the cell cycle lengthening and partial arrest phenotypes observed in mutant RNA Pol II embryos occur at a similar frequency to those observed as the result of histone depletion. Another possibility is that specific histone-sensitive transcripts are responsible for cell cycle elongation. One promising candidate for a histone-sensitive cell cycle regulator is the N/C ratio-sensitive CDK inhibitor frs, as zygotic transcription of frs plays a crucial role in stopping the cell cycle at the MBT. In contrast, tribbles, an N/C ratio-dependent inhibitor of Twine that has also been implicated in cell cycle slowing, does not show a consistent response between histone perturbations. In this previously proposed model, maternal histone stores may compete with pioneer transcription factors to set the timing of transcription initiation. Indeed, the central Drosophila pioneer transcription factor Zld is enriched at the promoters of histone-sensitive genes. Moreover, this study has identified a broader set of chromatin features that may sensitize individual loci to changes in histone concentrations. These include less obvious candidates for global early transcriptional regulators, such as SIN3A, HDACs and class I insulator proteins that may protect transcripts from changes in histone concentrations. This work highlights the importance of histone concentration in regulating the timing of MBT and provides evidence that promoters of histone-sensitive genes possess a unique chromatin signature. However, future studies will be required to isolate the specific downstream effectors that respond to changes in histone concentrations in the early embryo (Wilky, 2019).
Regulation of transcription is the main mechanism responsible for precise control of gene expression. Whereas the majority of transcriptional regulation is mediated by DNA-binding transcription factors that bind to regulatory gene regions, an elegant alternative strategy employs small RNA guides, Piwi-interacting RNAs (piRNAs) to identify targets of transcriptional repression. This study shows that in Drosophila the small ubiquitin-like protein SUMO and the SUMO E3 ligase Su(var)2-10 are required for piRNA-guided deposition of repressive chromatin marks and transcriptional silencing of piRNA targets. Su(var)2-10 links the piRNA-guided target recognition complex to the silencing effector by binding the piRNA/Piwi complex and inducing SUMO-dependent recruitment of the SetDB1 (Eggless)/Wde histone methyltransferase effector. It is proposed that in Drosophila, the nuclear piRNA pathway has co-opted a conserved mechanism of SUMO-dependent recruitment of the SetDB1/Wde chromatin modifier to confer repression of genomic parasites (Ninova, 2020a).
The majority of transcriptional control is achieved by transcription factors that bind short sequence motifs on DNA. In many eukaryotic organisms, transcriptional repression can also be guided by small RNAs, which (in complex with Argonaute proteins) recognize their genomic targets using complementary interactions with nascent RNA. Small RNA-based regulation provides flexibility in target selection without the need for new transcription factors and as such is well suited for genome surveillance systems to identify and repress the activity of harmful genetic elements such as transposons (Ninova, 2020a).
Transcriptional repression guided by small RNAs correlates with the deposition of repressive chromatin marks, particularly histone 3 lysine 9 methylation (H3K9me) in S. pombe, plants, and animals. In addition, plants and mammals also employ CpG DNA methylation for target silencing. Small RNA/Ago-induced transcriptional gene silencing is best understood in S. pombe, where the RNA-induced transcriptional silencing complex (RITS) was studied biochemically and genetically. In contrast to yeast, the molecular mechanism of RITS in Metazoans remains poorly understood. Small RNA-induced transcriptional repression mechanisms might have independently evolved several times during evolution and thus might mechanistically differ from that of S. pombe (Ninova, 2020a).
In Metazoans, small RNA-guided transcriptional repression is mediated by Piwi proteins, a distinct clade of the Argonaute family, and their associated Piwi-interacting RNAs (piRNAs). Both in Drosophila and in mouse, the two best-studied Metazoan systems, nuclear Piwis are responsible for transcriptional silencing of transposons. Based on the current model, targets are recognized through binding of the Piwi/piRNA complex to nascent transcripts of target genes. In both Drosophila and mouse, piRNA-dependent silencing of transposons correlates with accumulation of repressive chromatin marks (H3K9me3 and, in mouse, CpG methylation of DNA) on target sequences. These marks can recruit repressor proteins, such as HP1, providing a mechanism for transcriptional silencing. However, how recognition of nascent RNA by the Piwi/piRNA complex leads to deposition of repressive marks at the target locus is not well understood. Several proteins, Asterix (Arx)/Gtsf1, Panoramix (Panx)/Silencio, and Nxf2, were shown to associate with Piwi and are required for transcriptional silencing. Accumulation of H3K9me3 on Piwi/Panx targets requires the activity of the histone methyltransferase SetDB1 (also known as Egg). However, a mechanistic link between the Piwi/Arx/Panx/Nxf2 complex, which recognizes targets, and the effector chromatin modifier has not been established (Ninova, 2020a and references therein).
This study identified Su(var)2-10/dPIAS to provide the link between the Piwi/piRNA and the SetDB1 complex in piRNA-induced transcriptional silencing. In Drosophila, Su(var)2-10 mutation causes suppression of position effect variegation, a phenotype indicative of its involvement in chromatin repression. Su(var)2-10 associates with chromatin and regulates chromosome structure. It also emerged in screens as a putative interactor of the central heterochromatin component HP1, a repressor of enhancer function, and a small ubiquitin-like modifier (SUMO) pathway component. However, its molecular functions in chromatin silencing were not investigated. Su(var)2-10 belongs to the conserved PIAS/Siz protein family, of which the yeast, plant, and mammalian homologs act as E3 ligases for SUMOylation of several substrates. This paper reports the role of Su(var)2-10 in germ cells of the ovary, where chromatin maintenance and transposon repression are essential to grant genomic stability across generations. Germ cell depletion of Su(var)2-10 phenocopies loss of Piwi; both lead to strong transcriptional activation of transposons and loss of repressive chromatin marks over transposon sequences. Su(var)2-10 genetically and physically interacts with Piwi and its auxiliary factors, Arx and Panx. It was demonstrated that the repressive function of Su(var)2-10 is dependent on its SUMO E3 ligase activity and the SUMO pathway. These data point to a model in which Su(var)2-10 acts downstream of the piRNA/Piwi complex to induce local SUMOylation, which in turn leads to the recruitment of the SetDB1/Wde complex. SUMO modification was shown to play a role in the formation of silencing chromatin in various systems from yeast to mammals, including the recruitment of the silencing effector SETDB1 and its co-factor MCAF1 by repressive transcription factors. Together, these findings indicate that the piRNA pathway utilizes a conserved mechanism of silencing complex recruitment through SUMOylation to confer transcriptional repression (Ninova, 2020a).
In both insect and mammals, piRNA-guided transcriptional silencing is associated with the deposition of repressive chromatin marks on genomic targets. In Drosophila, the conserved histone methyltransferase SetDB1 (Egg) is responsible for deposition of the silencing H3K9me3 mark at Piwi targets. However, the molecular mechanism leading to the recruitment of SetDB1 by the Piwi/piRNA complex remained unknown. Thus study showed that in Drosophila SUMO and the SUMO E3 ligase Su(var)2-10 act together downstream of the piRNA-guided complex to recruit the histone methyltransferase complex SetDB1/Wde and cause transcriptional silencing. The results suggest a model for the molecular mechanism of piRNA-guided transcriptional silencing in which Su(var)2-10 provides the connection between the target recognition complex composed of piRNA/Piwi/Panx/Arx and the chromatin effector complex composed of SetDB1 and Wde (Ninova, 2020a).
This study has identified a new role for the SUMO pathway in piRNA-guided transcriptional silencing. The SUMO pathway plays important roles in heterochromatin formation and maintenance, and genome stability in different organisms from yeast to humans. Among different functions, SUMO is required for recruitment and activity of the histone methyltransferase complex composed of SetDB1 and MCAF1 (Wde in Drosophila), which confers transposon silencing in mammals. Remarkably, SUMO-dependent recruitment of SetDB1 to TEs in mammalian somatic cells does not require piRNAs but is instead mediated by the large vertebrate-specific family of Krüppel-associated box domain-zinc finger proteins (KRAB-ZFPs) that bind specific DNA motifs. Although distinct members of the KRAB-ZFP family recognize different sequence motifs in target transposons, repression of all targets by various KRAB-ZFPs requires the universal co-repressor KAP1/TIF1b (KRAB-associated protein 1). KAP1 is a SUMO E3 ligase, and its auto-SUMOylation leads to SetDB1 recruitment. The current results suggest that Drosophila Su(var)2-10 can be SUMOylated, and SetDB1 and Wde have functional SIMs, suggesting that Su(var)2-10 auto-SUMOylation might induce SetDB1/Wde recruitment. These results suggest that two distinct transposon repression pathways, by DNA-binding proteins and by piRNAs, both rely on SUMO-dependent recruitment of the conserved silencing effector to the target (Ninova, 2020a).
The results in Drosophila and studies in mammals suggest that in both clades self-SUMOylation of SUMO E3 ligases might be involved in recruitment of SetDB1 to chromatin. However, these results do not exclude the possibility that the recruitment of SetDB1 is facilitated by SUMOylation of additional chromatin proteins by Su(var)2-10. Studies in yeast led to the 'SUMO spray' hypothesis that postulates that SUMOylation of multiple different proteins localized in physical proximity promotes the assembly of multi-unit effector complexes. Local concentration of multiple SUMO moieties leads to efficient recruitment of SUMO-interacting proteins. According to this hypothesis, multiple SUMO-SIM interactions within a protein complex act synergistically, and thus SUMOylation of any single protein is neither necessary nor sufficient to trigger downstream processes. Assembly of such 'SUMO spray' on chromatin might be governed by the same principles of multiple weak interactions as was recently recognized for the formation of various phase-separated liquid-droplet compartments in the cell. The presence of Su(var)2-10 on a chromatin locus might lead to SUMOylation of multiple chromatin-associated proteins that are collectively required for the recruitment of effector chromatin modifiers. The SUMOylation consensus (ΨKxE/D) is very simple and therefore quite common in the fly proteome. Consistent with this, several hundred SUMOylated proteins were identified in proteomic studies in Drosophila. Thus, it is possible that collective SUMOylation of multiple chromatin-associated proteins contributes to recruitment and stabilization of the SetDB1 complex on chromatin (Ninova, 2020a).
The cascade of events leading to repression initiated by target recognition by piRNA/Piwi, followed by interaction with Su(var)2-10 and subsequent SUMO-dependent recruitment of SetDB1/Wde, suggests that the three complexes tightly cooperate. But do these three complexes (Piwi, Su(var)2-10, and SetDB1) always work together, or does each complex have additional functions independent of the other two? Genome-wide analysis suggests that the vast majority of Piwi targets are repressed through SUMO/Su(var)2-10 and, likely, SetDB1/Wde, suggesting that Piwi always requires these other complexes for its function in transcriptional silencing. On the other hand, multiple instances were found of host genes that are repressed by Su(var)2-10 and SetDB1 but do not require piRNAs. Su(var)2-10 and SetDB1 are also expressed outside of the gonads and were implicated in chromatin silencing in somatic tissues that lack an active piRNA pathway. It is speculated that Su(var)2-10 might bind to specific targets directly through its SAP domain or might get recruited by specific DNA-binding proteins, similar to the way SetDB1 is recruited to ERVs by KRAB-ZFP in mammals, though specific factors are yet to be uncovered (Ninova, 2020a).
Though both Drosophila and mouse have nuclear Piwi proteins involved in transcriptional silencing of transposons, these proteins, PIWI and MIWI2, are not one-to-one orthologs. Unlike Drosophila, other insects including the silkworm Bombyx mori, the flour beetle Tribolium castaneum, and the honeybee Apis mellifera encode only two Piwi proteins, and at least in B. mori, these proteins do not localize to the nucleus. These observations suggest that the nuclear Piwi pathway in Drosophila has evolved independently in this lineage. In light of this evolutionary interpretation, the interaction of the Piwi complex and the E3 SUMO ligase Su(var)2-10 indicates that in Drosophila the nuclear piRNA pathway co-opted an ancient mechanism of SUMO-dependent recruitment of the histone-modifying complex for transcriptional silencing of transposons. The molecular mechanism of piRNA-induced transcriptional repression in other clades such as mammals might have evolved independently of the corresponding pathway in flies. It will be interesting to investigate if mammals also use SUMO-dependent recruitment of silencing complexes for transcriptional repression of piRNA targets (Ninova, 2020a).
Chromatin is critical for genome compaction and gene expression. On a coarse scale, the genome is divided into euchromatin, which harbors the majority of genes and is enriched in active chromatin marks, and heterochromatin, which is gene-poor but repeat-rich. The conserved molecular hallmark of heterochromatin is the H3K9me3 modification, which is associated with gene silencing. This study found that in Drosophila, deposition of most of the H3K9me3 mark depends on SUMO and the SUMO ligase Su(var)2-10, which recruits the histone methyltransferase complex SetDB1 (Eggless)/Wde. In addition to repressing repeats, H3K9me3 influences expression of both hetero- and euchromatic host genes. High H3K9me3 levels in heterochromatin are required to suppress spurious transcription and ensure proper gene expression. In euchromatin, a set of conserved genes is repressed by Su(var)2-10/SetDB1-induced H3K9 trimethylation, ensuring tissue-specific gene expression. Several components of heterochromatin are themselves repressed by this pathway, providing a negative feedback mechanism to ensure chromatin homeostasis (Ninova, 2020b).
This study shows that in addition to the effects on TE silencing (Ninova, 2020a), Su(var)2-10 and H3K9me3 influence the expression of protein-coding genes. Su(var)2-10-dependent H3K9me3 deposition on TEs affects the expression of genes located in heterochromatin and of euchromatic genes adjacent to TE insertions. Su(var)2-10 is also involved in TE-independent H3K9me3 deposition on host genes, which is essential for the suppression of ectopic expression of tissue-specific genes, thereby conferring correct cell type identity (Ninova, 2020b).
Approximately half of the human genome comprises TE sequences, and the TE fraction is as high as 90% in several plant species. One new TE insertion per generation is estimated to propagate to the offspring. Somatic TE insertions, although difficult to detect, are likely even more prevalent. Thus, TE activity is a major source of genetic variation that can occur on a very short timescale. The effects of TEs on the host transcriptome have been the subject of many studies ever since Barbara McClintock identified 'control' elements that regulate gene expression before genome compositions were known. TEs can disrupt gene expression by inserting into coding regions or into or close to cis-regulatory sequences. TE insertions are not always disruptive: insertions into non-coding regions can bring new regulatory elements that change gene expression patterns, resulting in increased fitness. Instances of positive selection for TE insertions are well documented in Drosophila. TE-derived promoters also drive the expression of numerous mouse and human genes, suggesting that TE insertions can be co-opted into gene regulatory pathways (Ninova, 2020b).
In addition to changes in the DNA sequence, TE insertions may introduce local epigenetic effects. Active TEs are transcriptionally silenced by H3K9 trimethylation and/or DNA methylation. The H3K9me3 mark can spread several kilobases outside the TE region, affecting adjacent cis-regulatory elements of host genes, and thereby interfering with their normal expression. TE insertions with high levels of H3K9me3 are strongly selected against, supporting a model that TEs can alter the expression of host genes through epigenetic changes (Ninova, 2020b).
The finding that Su(var)2-10 is responsible for the deposition of H3K9me3 on TE bodies and flanking sequences allows separation if the effect of direct damage to cis-regulatory elements from the effect on chromatin. Evidence was found that TE insertions can lead to H3K9me3-dependent changes in gene expression, as shown for the jheh3 and frl loci. Notably, the BARI insertion at the jheh3 locus was shown to be positively selected in the D. melanogaster population, indicating that Su(var)2-10-dependent epigenetic silencing caused by a TE insertion can be used for beneficial rewiring of host gene regulatory networks (Ninova, 2020b).
The current results suggest that TEs can rewire gene regulatory networks on a short timescale, at least in part via their effects on chromatin. Euchromatic H3K9me3 peaks due to TE insertions are widespread in Drosophila, indicating that TE insertions may be a common cause of gene regulatory variation. New TE insertions during development generate genomic diversity between different cell types in human and mouse with implications for tumorigenesis and brain development. Future studies are required to elicit the epigenetic effects of somatic TE insertions on gene regulatory networks (Ninova, 2020b).
Heterochromatin domains include nearly 30% of the fly genome. Although relatively gene-poor, heterochromatin hosts several hundred protein-coding genes. Studies of chromosomal rearrangements suggested that heterochromatic localization is required for the proper expression of heterochromatic genes. However, the molecular mechanism of the positive effect of the heterochromatin environment on expression is not fully understood (Ninova, 2020b).
Consistent with previous studies, this study observed many active genes in H3K9me3-rich heterochromatic regions and found that for many active heterochromatic genes, Su(var)2-10-induced H3K9 methylation is not only permissive but also required for proper expression (Ninova, 2020b).
How can the same chromatin mark lead to the repression of genes in euchromatin and activation in heterochromatin? H3K9me3 is present over the gene bodies and regions flanking heterochromatic genes, but is depleted at promoters, which instead carry typical active marks such as H3K4me3 and Pol II occupancy. Thus, H3K9me3 over gene bodies appears to be compatible with transcription. H3K9me3 loss upon Su(var)2-10 GLKD correlated with increased levels of intronic RNAs and the appearance of H3K4me2/3 and Pol II signals in introns, indicating the upregulation of spurious transcripts originating from within host-gene introns. One possible source of such transcripts is the activation of TE promoters that are highly abundant within introns and flanking sequences of heterochromatic genes. It is proposed that transcription from TE promoters located in introns and flanking sequences interferes with proper gene expression through transcriptional interference (Ninova, 2020b).
H3K9me3 loss also disrupted the normal isoform regulation of heterochromatic genes, as was observed both truncated and extended mRNA isoforms with coding potential distinct from the canonical gene mRNA upon the depletion of Su(var)2-10. The activation of cryptic promoters may disrupt proper gene expression through multiple mechanisms, such as reduction in canonical mRNA output or dominant negative effects of the extended or truncated protein isoforms. Not all heterochromatic genes that lose H3K9me3 upon Su(var)2-10 germline knockdown (GLKD) show signs of interfering transcripts or cryptic promoters, indicating that H3K9me3 may have other functions in heterochromatic gene activation. For example, the compaction of heterochromatin by HP1 may bring distant enhancers of heterochromatic genes into physical proximity of promoters to activate expression. The results, combined with previous studies, indicate that genes positioned in heterochromatin require high H3K9me3 levels for proper expression and isoform selection (Ninova, 2020b).
Discrete Su(var)2-10-dependent H3K9me3 peaks are present in a number of euchromatic genes. Some of these peaks have no TEs in their vicinity, and their H3K9me3-based repression is conserved between D. melanogaster and D. virilis, two species that separated >45 million years ago and have no common TE insertions. The expression of many of these TE-independently repressed genes is restricted to specific tissues such as testis, the digestive system, or the CNS, and the loss of H3K9me3 leads to ectopic expression in the female germline. The finding is in line with a recent report that SetDB1 depletion in the female germline was associated with the loss of H3K9me3 and the mis-expression of male-specific genes. H3K9me3, SetDB1, and the SUMO pathway were also implicated in lineage-specific gene expression and cell fate commitment in mammals. These data suggest that a TE-independent H3K9me3 deposition via the SUMO-SetDB1 pathway plays an evolutionarily conserved role in restricting gene expression to proper cell lineages (Ninova, 2020b).
SUMO- and Su(var)2-10-dependent H3K9me3 repression also regulates several factors involved in heterochromatin formation and maintenance, such as SUMO (smt3), Wde, Sov, and CG30403. Wde is the homolog of the mammalian MCAF1/ATF7IP, which is required for the nuclear localization and stability of SetDB1 and promotes its methyltransferase activity. Drosophila Wde also associates with SetDB1, and their germline depletion results in a similar phenotype, supporting the role of Wde as a conserved SetDB1 co-factor. The current data in Drosophila and studies in mammals suggest that SUMO is involved in SetDB1/Wde recruitment to its targets. HP1 is an H3K9me3 reader that is responsible for the structural properties of heterochromatin and also serves as a hub for many other heterochromatin proteins. Both Sov and CG30403 interact with HP1, and Sov is critical for heterochromatin maintenance (Ninova, 2020b).
The genes encoding Wde, SUMO, Sov, and CG30403 reside in euchromatin and are repressed by local H3K9me3. Unlike tissue-restricted genes, which are often completely repressed by Su(var)2-10 in the female germline, these factors are not fully silenced, although they are upregulated upon Su(var)2-10 depletion. The results indicate that these four genes are part of a negative feedback mechanism that controls heterochromatin formation. Negative feedback in biological circuits maintains protein levels within a certain range, providing homeostatic regulation. It is proposed that SUMO-dependent repression of heterochromatin proteins provides such homeostatic regulation to maintain the proper ratio and boundaries of hetero- and euchromatin. According to this model, specific genes, such as wde, act as sensors of the overall H3K9me3 level. Insufficient levels of H3K9 methylation lead to elevated sensor gene expression due to decreased H3K9me3 at their promoters, which in turn enhances H3K9me3 deposition and heterochromatin formation throughout the genome. Concomitant repression of sensor genes ensures that H3K9me3 is restricted to proper genomic domains and does not spread to euchromatic regions that should remain active. Inspection of ENCODE data showed that the mammalian homolog of wde, ATF7IP, is decorated by H3K9me3 in some human cell lines, suggesting that this mode of regulation may be deeply conserved (Ninova, 2020b).
A reminiscent negative feedback loop was identified in yeast. The single H3K9 methyltransferase clr4 is suppressed by H3K9me3 to restrict ectopic spreading of silencing chromatin. In mammals, genes encoding proteins from the KRAB-ZFP family of transcriptional repressors reside in H3K9me3- and HP1-enriched loci. Thus, autoregulation of heterochromatin effectors is a conserved mode of chromatin regulation, although the genes involved in the feedback mechanism differ between different organisms. In the future, it will be important to dissect the network architecture of heterochromatin regulation. As heterochromatin formation and maintenance was reported to be disrupted in cancer and during aging, this mechanism may be a promising target of therapeutic interventions (Ninova, 2020b).
H3K9me3 writer enzymes are targeted to genomic loci by different mechanisms. In the case of TE repression in germ cells, piRNAs bound to nuclear Piwi proteins serve as sequence-specific guides that bind complementary nascent transcripts and recruit Su(var)2-10, which induces H3K9me3 deposition by SetDB1. Su(var)2-10 identifies non-TE targets in a piRNA-independent fashion, in agreement with a broader function of Su(var)2-10 in development. The observation that H3K9me3 peaks at homologous euchromatic genes are also present in the distantly related D. virilis points to a conserved mechanism of H3K9me3 deposition in host-gene regulation (Ninova, 2020b).
The molecular mechanism of piRNA-independent recruitment of Su(var)2-10 remains to be explored. Su(var)2-10 has a putative DNA binding SAP domain that may be sufficient for its binding to DNA. However, motif enrichment analysis failed to identify a common sequence motif among TE-independent Su(var)2-10 targets (MEME-ChIP), suggesting that different partners may recruit Su(var)2-10 to distinct targets. In mammals, a large family of transcription factors, the KRAB-ZFPs, are responsible for SetDB1 recruitment and H3K9me3 deposition on many different targets, primarily endogenous retroviruses. Individual members of the KRAB-ZFP family influence distinct targets due to differences in DNA-binding specificities of their zinc-finger DNA-binding domains. Notably, SetDB1 recruitment through KRAB-ZFPs occurs through a SUMO-dependent mechanism. The KRAB-ZFP family is vertebrate specific, and there are no known proteins in D. melanogaster that can recruit H3K9me3 activity. A preliminary search for direct Su(var)2-10 interactors using a yeast two-hybrid screen identified several proteins with putative DNA-binding domains. Thus, it is proposed that analogous to the KRAB-ZFP pathway in mammals, Su(var)2-10 may link DNA-binding proteins to the SetDB1 silencing machinery. Future studies are necessary to identify the proteins that guide Su(var)2-10 to target loci and to elucidate TE-independent recruitment mechanisms of the silencing machinery (Ninova, 2020b).
Linker histones H1 are principal chromatin components, whose contribution to the epigenetic regulation of chromatin structure and function is not fully understood. In metazoa, specific linker histones are expressed in the germline, with female-specific H1s being normally retained in the early-embryo. Embryonic H1s are present while the zygotic genome is transcriptionally silent and they are replaced by somatic variants upon activation, suggesting a contribution to transcriptional silencing. This study directly address this question by ectopically expressing dBigH1 in Drosophila S2 cells, which lack dBigH1. dBigH1 was shown to bind across chromatin, replaces somatic histone H1 and reduces nucleosome repeat length (NRL). Concomitantly, dBigH1 expression down-regulates gene expression by impairing RNApol II binding and histone acetylation. These effects depend on the acidic N-terminal ED-domain of dBigH1 since a truncated form lacking this domain binds across chromatin and replaces dH1 like full-length dBigH1, but it does not affect NRL either transcription. In vitro reconstitution experiments using Drosophila preblastodermic embryo extracts corroborate these results. Altogether these results suggest that the negatively charged N-terminal tail of dBigH1 alters the functional state of active chromatin compromising transcription (Climent-Canto, 2020).
Linker histones H1 constitute an evolutionarily conserved family of chromosomal proteins that play an important structural role in regulating chromatin compaction and higher order chromatin organization. In metazoan species, histones H1 usually exist as multiple variants, some of which are specifically expressed in the germline. For instance, four of the eleven mice/human H1 isoforms are germline specific, of which three are expressed in males (H1T, HILS1 and H1T2) and one in females (H1oo). Female-specific variants usually accumulate in the oocyte and are retained during early embryogenesis. In comparison to most metazoa, H1 complexity in Drosophila is much reduced since it contains a single somatic dH1 variant, which is ubiquitously expressed throughout development, and a single germline specific variant dBigH1, which is expressed in both the female and male germlines, and it is retained in the early embryo. Embryonic H1s persist as long as the zygotic genome remains transcriptionally silent, being replaced by somatic variants when transcription begins during zygotic genome activation (ZGA). In Drosophila, dBigH1 is present during early embryogenesis until ZGA onset at cellularization. At this stage, dBigH1 is replaced by somatic dH1 in somatic cells, whereas it is retained in the primordial germ cells (PGC), which remain transcriptionally silent (Climent-Canto, 2020).
These observations suggest that dBigH1, and embryonic H1s in general, are general transcriptional regulators that contribute to silencing. Linker histones H1 have been usually associated with transcription repression, but somatic H1s are readily detected across expressed genes. In this regard, it has been reported that somatic H1s can even enhance the synergism between transcription factors. In contrast, in the presence of dBigH1, chromatin appears to be transcriptionally silent, suggesting that dBigH1 enhances transcriptional silencing. This study analyzed the mechanisms of action of dBigH1. For this purpose, ectopic expression experiments were performed in Drosophila S2 cells, which lack dBigH1. These experiments confirm the contribution of dBigH1 to transcriptional silencing, identifying the acidic N-terminal ED-domain as responsible for its negative effect on transcription (Climent-Canto, 2020).
The mechanism of dBigH1 action in transcription regulation was addressed using ectopic dBigH1 expression experiments in S2 cells. Though weakly, the genomic distribution of ectopically expressed dBigH1 positively correlates with those observed in embryos and testes, where dBigH1 is naturally expressed, suggesting that the mechanisms governing dBigH1 deposition might be partially conserved in S2 cells. The results suggest that binding of dBigH1 negatively affects transcription. Upon dBigH1 expression, more than two-thirds of the differentially expressed genes were down-regulated. This effect was probably underestimated since, though only in one replicate, was a global decrease in gene expression observed that, considering the methodology used for normalization, could hamper identification of differentially down-regulated genes. This down-regulation occurred at the transcriptional level, as down-regulated genes showed reduced RNApol II content. Conversely, RNApol II content of up-regulated genes was not increased upon dBigH1 expression, suggesting that the observed up-regulation was not transcriptional. Moreover, in vitro experiments showed that dBigH1 inhibited transcription of a chromatin template. Consistent with the negative effect on transcription, dBigH1 expression specifically decreased H3K36me3 levels at CDS of down-regulated genes (Climent-Canto, 2020).
The results show that dBigH1 replaces dH1. In these experiments, dBigH1 binding to chromatin was likely taking place in the absence of DNA replication, as dBigH1 induction was sustained for 24h and, during this time, cell density did not increase noticeably. Whether dBigH1 deposition involves active dH1 replacement remains to be determined. In vitro, incubation of purified nuclei with Drosophila embryo extract results in binding of dBigH1 to chromatin without dH1 displacement, suggesting that the replacement observed in S2 cells responds to an active process. Along the same lines, it was observed that dBigH1 was preferentially deposited at regions enriched in dH1. Replacement of somatic H1s by embryonic H1s has been reported in nuclear transfer experiments and NAP-1 has been shown to be involved in both B4/H1M deposition and somatic H1s removal in Xenopus. Further work is required to determine the mechanisms regulating dBigH1 deposition (Climent-Canto, 2020).
The acidic ED-domain of dBigH1 is required to inhibit transcription since expression of the truncated dBigH1ΔED form, which also replaced somatic dH1, did not down-regulate gene expression either affected RNApol II loading or H3Kac levels. The presence of the negatively charged acidic ED-domain in dBigH1 is peculiar as histones are highly positively charged. It is possible that, due to the negative charge of the ED-domain, the structural organization of chromatin is compromised in the presence of dBigH1. Actually, the overall NRL changed upon dBigH1 expression, but not when dBigH1ΔED was expressed. Interestingly, although the ED-domain of dBigH1 is not conserved outside of the Drosophila genus, embryonic H1s are generally more acidic than somatic ones. In this regard, it was shown that both the Xenopus B4/H1M and the mammalian H1oo embryonic linker histones alter chromatin organization and dynamics. An altered chromatin organization would perturb access to chromatin and/or functioning of chromatin remodelers/modifiers and transcription factors that, ultimately, would affect RNApol II loading and transcription. In fact, regardless of the actual transcriptional outcome, dBigH1 expression globally affected H3Kac. In contrast, it was reported earlier that incubation of purified nuclei with Drosophila embryo extract, which also results in dBigH1 binding, increased H3Kac levels. However, it is important to note that the increase in H3Kac levels observed in this case was independent of dBigH1 binding (Climent-Canto, 2020).
It might be argued that the down-regulation observed upon dBigH1 expression is a consequence of increased global linker histones content. However, similar or even higher levels of expression of the truncated dBigH1ΔED and dBigH1ΔNTD forms did not affect transcription. Moreover, binding of dBigH1 is compensated by removal of dH1, thus total linker histones content is not greatly increased (Climent-Canto, 2020).
dBigH1 binding affected expression of a relatively small subset of genes. This may reflect the fact that in the experimental setup used in this study, dBigH1 accounted for only 20-25% of total linker histones. Thus, from this point of view, affected genes appear to correspond to a subset of genes more sensitive to dBigH1 levels. In this regard, it was observed that down-regulated genes had strong RNApol II pausing, which tended to decrease upon dBigH1 expression. In addition, though dBigH1 binding reduced H3Kac globally, only the down-regulated genes were transcriptionally affected. Interestingly, impairing RNApol II pausing generally down-regulates gene expression, while reduced H3Kac levels preferentially affects expression of highly paused genes. These observations suggest that the higher sensitivity to dBigH1 expression of down-regulated genes is likely due to the way their transcription is regulated (Climent-Canto, 2020).
In summary, this study has presented evidence supporting that the acidic N-terminal tail of the embryonic dBigH1 linker histone of Drosophila compromises transcription by altering the functional epigenetic state of active chromatin. Other embryonic H1s might share similar properties since they are generally more acidic than their somatic counterparts (Climent-Canto, 2020).
Many membraneless organelles form through liquid-liquid phase separation, but how their size is controlled and whether size is linked to function remain poorly understood. The histone locus body (HLB) is an evolutionarily conserved nuclear body that regulates the transcription and processing of histone mRNAs. This study shows that Drosophila HLBs form through phase separation. During embryogenesis, the size of HLBs is controlled in a precise and dynamic manner that is dependent on the cell cycle and zygotic histone gene activation. Control of HLB growth is achieved by a mechanism integrating nascent mRNAs at the histone locus, which facilitates phase separation, and the nuclear concentration of the scaffold protein Multi-sex combs (Mxc), which is controlled by the activity of cyclin-dependent kinases. Reduced Cdk2 activity results in smaller HLBs and the appearance of nascent, misprocessed histone mRNAs. Thus, these experiments identify a mechanism linking nuclear body growth and size with gene expression (Hur, 2020).
Before zygotic genome activation (ZGA), the quiescent genome undergoes reprogramming to transition into the transcriptionally active state. However, the mechanisms underlying euchromatin establishment during early embryogenesis remain poorly understood. This study shows that histone H4 lysine 16 acetylation (H4K16ac) is maintained from oocytes to fertilized embryos in Drosophila and mammals. H4K16ac forms large domains that control nucleosome accessibility of promoters prior to ZGA in flies. Maternal depletion of MOF acetyltransferase leading to H4K16ac loss causes aberrant RNA Pol II recruitment, compromises the 3D organization of the active genomic compartments during ZGA, and causes downregulation of post-zygotically expressed genes. Germline depletion of histone deacetylases revealed that other acetyl marks cannot compensate for H4K16ac loss in the oocyte. Moreover, zygotic re-expression of MOF was neither able to restore embryonic viability nor onset of X chromosome dosage compensation. Thus, maternal H4K16ac provides an instructive function to the offspring, priming future gene activation (Samata, 2020).
The fusion of the maternal and paternal gametes triggers a remarkable transition from two fully differentiated cells to a totipotent zygote that gives rise to all tissues during embryogenesis. In flies, the development of the embryo during the first 13 synchronized nuclear divisions relies on maternally provided proteins and transcripts. These maternal elements are replaced by newly synthesized ones during the major wave of zygotic genome activation (ZGA) at the nuclear cycle (nc) 14 at embryonic stage (st) 5 when the zygotic genome has reformed to accommodate the transcriptional active status. Increase in nucleosome accessibility as well as gradual enrichment of RNA Polymerase II (RNA Pol II) are observed from nc11. The repressive mark H3K27me3 is inherited from the maternal germline restricting the activation of developmental genes, but most of the other acetyl and methyl marks only become prominent genome-wide at ZGA. The few transcripts that are activated before ZGA (during the minor zygotic wave) are under the control of the pioneer transcription factor Zelda, which mediates local chromatin accessibility. However, the mechanisms that guide the reprogramming of the entire genome are not fully understood (Samata, 2020).
Acetylation of histone tails is known to promote transcriptional activation. Among the histone modifications positively correlated with transcription activation, H4K16ac is unique because it prevents chromatin compaction in vitro. However, the developmental dynamics and the biological significance of this modification in the embryonic genome prior to ZGA remain unclear. H4K16ac is deposited by the histone acetyltransferase (HAT) males absent on the first (MOF). The MOF-containing male-specific lethal (MSL) complex is responsible for chromosome-wide upregulation of the male X chromosome to equalize its expression to the female X as well as to autosomal genes. The complex consists of five proteins (MSL1, MSL2, MSL3, MOF, and MLE) together with two long non-coding RNAs, RNA on the X 1 and 2 (roX1, roX2), and is capable of specifically recognizing the single male X chromosome. Interestingly, all MSL proteins, apart from MSL2, are maternally deposited as transcripts and proteins, which remain stable through the early embryonic stages. However, it has not been determined whether they form a functional complex (Samata, 2020).
By analyzing precisely staged Drosophila embryos before and after ZGA and by performing genetic and genomic experiments, this study shows that H4K16ac is intergenerationally transmitted from the female germline and has a fundamental role in controlling chromatin accessibility in the absence of ongoing transcription during early embryogenesis. Furthermore, it poises promoters for future gene activation (Samata, 2020).
MOF represents the major enzyme catalyzing H4K16ac. It is proposed that maternally provided MOF plays a dual role in 'depositing' and 'maintaining' H4K16ac. First, maternal MOF establishes H4K16ac in the maturing oocyte. Following fertilization, MOF exploits its unique ability to remain associated with mitotic chromosomes in order to actively propagate H4K16ac, hence acting as the maintenance factor for H4K16ac during the first and subsequent embryonic divisions. Continuous presence of MOF is essential because histone acetylation marks typically exhibit fast turnover. Thus, the proposed intergenerational H4K16ac transmission model is distinct from the mechanisms of inheritance of methylation marks, many of which rely on different catalytic modes for de novo deposition and propagation (Samata, 2020).
The combination of maternal deposition and early embryonic maintenance of the H4K16ac information is critical for marking genes prior to ZGA for future activation. Absence of this information leads to misregulation of H4K16ac targets and subsequently to increased embryonic lethality. Other acetyl histone marks were not able to restore proper gene expression in the absence of MOF, demonstrating a specific requirement for H4K16ac in oocytes. Furthermore, expression of the 'maintaining' (zygotic) MOF upon ZGA cannot compensate for loss of the 'depositing' (maternal) MOF function. It will be interesting to characterize the specificities of maternal and zygotic MOF and further explore whether genome structure, developmental timing, or other determinants affect the H4K16ac deposition pattern (Samata, 2020).
Maternally deposited H4K16ac primes a subset of genes for subsequent transcriptional activation upon the onset of ZGA and later in development. The transcription factor Zelda is responsible for activation of the first zygotic transcripts. However, neither Zelda-mediated transcription nor the chromatin marks around these Zelda-dependent regions explain the global emergence of chromatin accessibility in early embryos. The current data indicate that the establishment of H4K16ac-mediated nucleosome accessibility on numerous Zelda-independent promoters before ZGA creates a permissive chromatin state that enhances RNA Pol II recruitment and facilitates gene expression activation (Samata, 2020).
Maternal depletion of MOF also led to profound chromatin architecture changes. Even though the structure of TAD boundaries remained unaffected, substantial defects were observed in compartmentalization during ZGA. Analysis of Hi-C datasets from Drosophila st5 embryos whose transcription was abrogated by drug treatment (Hug, 2017) revealed similar phenotypes to those observed in embryos after maternal H4K16ac loss. However, the compartmentalization defects in maternal mof RNAi offspring were apparent only in early and not late embryos, despite persistent transcription misregulation at both stages. Thus, an aberrant transcriptional program is not the primary driving force for the genome compartmentalization defects observed upon H4K16ac loss early on. It is concluded that although maternal H4K16ac contributes toward establishing global genome organization early on, other factors can compensate for this loss at later embryonic stages (Samata, 2020).
The 'future' dosage compensated genes on the X chromosome are among the numerous targets that show H4K16ac signal prior to the onset of their transcription. By characterizing the developmental dynamics of H4K16ac, this study describes the sequence of events that leads to establishment of dosage compensation on the male X chromosome. H4K16ac decorates all chromosomes prior to ZGA but becomes strongly enriched on the male X chromosome during later stages. It is proposed that initiation of dosage compensation at both X-linked genes and high-affinity sites (HASs) relies on the instructive H4K16ac signal from the mother. Without maternal H4K16ac, MSL complex targeting is compromised in males and the mature dosage-compensated phase cannot be reached. Thus, the X chromosome serves as a readout of H4K16ac memory on a chromosomal scale (Samata, 2020).
H4K16ac is deposited in oocytes by a maternal MSL sub-complex composed of MSL1, MOF, and MSL3. This first step prepares the chromatin landscape for establishment of nucleosome accessibility and dosage compensation initiation in a sex-independent manner. Assembly of the canonical MSL complex requires the expression of the male-specific protein MSL2, whose expression starts at stage 5. MSL2 targets the X chromosome at ZGA and together with MOF mediates transcriptional activation of genes close to HASs in a male-specific manner. Interestingly though, no MSL complex 'spreading' in the vicinity of HASs is observed at this stage. Because X-chromosome territory formation coincides with the expression of the roX2 long non-coding RNA, it is possible that efficient MSL complex spreading is mediated by the contribution of roX2/MSL interactions. Thus, the MSL complex targeting and spreading on the X chromosome represent two temporally discrete steps with distinct requirements. Moreover, the three-dimensional organization of HASs may function as an additional stabilizing factor for X-territory maturation. Indeed, interactions that were observed between HASs were more abundant in stage 15 compared to stage 5 embryos, possibly because of the stronger chromatin compartmentalization in late embryos . It is therefore possible that maturation of the active compartment is required for the stronger clustering of HASs (Samata, 2020).
Although the dosage compensation defects represent a clear readout of the importance of maternal H4K16ac, the influence of this early mark is not restricted to male progeny. Furthermore, this study found the H4K16ac marking of the oocyte to be evolutionarily conserved in three Drosophila species (Drosophila melanogaster, D. virilis, and D. busckii) as well as in mammals. Given that mammals have a different dosage compensation mechanism, retention of H4K16ac in the early mammalian zygote likely indicates the importance of this histone modification in embryogenesis (Samata, 2020).
Maternal inheritance of H3K27me3 mediates gene silencing in both Drosophila and mammals. DNA methylation, H3K4me3, and H3K36me3 mediate zygotic genome activation in other organisms. However, these modifications are absent from the young Drosophila zygotes. A variety of mechanisms have thus evolved to propagate instructions to the next generation via histone modifications in the germline. Future work will elucidate the function of H4K16ac early presence in mice and human (Samata, 2020).
The histone locus body (HLB) assembles at replication-dependent (RD) histone loci and concentrates factors required for RD histone Drosophila Histone Locus Body assembly and function involves multiple interactionsmRNA biosynthesis. The D. melanogaster genome has a single locus comprised of ~100 copies of a tandemly arrayed 5 kB repeat unit containing one copy of each of the 5 RD histone genes. To determine sequence elements required for D. melanogaster HLB formation and histone gene expression, transgenic gene arrays were used containing 12 copies of the histone repeat unit that functionally complement loss of the ~200 endogenous RD histone genes. A 12x histone gene array in which all H3-H4 promoters were replaced with H2a-H2b promoters [12x(PR)] does not form an HLB or express high levels of RD histone mRNA in the presence of the endogenous histone genes. In contrast, this same transgenic array is active in HLB assembly and RD histone gene expression in the absence of the endogenous RD histone genes and rescues the lethality caused by homozygous deletion of the RD histone locus. The HLB formed in the absence of endogenous RD histone genes on the mutant 12x array contains all known factors present in the wild type HLB including CLAMP, which normally binds to GAGA repeats in the H3-H4 promoter. These data suggest that multiple protein-protein and/or protein-DNA interactions contribute to HLB formation, and that the large number of endogenous RD histone gene copies sequester available factor(s) from attenuated transgenic arrays, thereby preventing HLB formation and gene expression (Koreski, 2020).
An important organizing principle in cells is the use of membraneless compartments to spatially and temporally regulate diverse biological processes. Numerous membraneless compartments have been identified in both the nucleus (e.g., nucleoli, Cajal bodies, histone locus bodies) and the cytoplasm (e.g., P-bodies, stress granules, germ granules) and are collectively referred to as biomolecular condensates. There is increasing evidence suggesting that biomolecular condensates are formed through liquid-liquid phase separation or condensation (Alberti, 2019). This occurs when proteins and/or nucleic acids in the nucleoplasm or cytoplasm coalesce or demix into a condensed phase that often resembles liquid droplets. Large nuclear condensates that are visible under light microscopy are most often referred to as nuclear bodies (NBs) and represent an important organizing feature of the nucleus (Koreski, 2020).
The histone locus body (HLB) is a conserved NB that assembles at replication-dependent (RD) histone genes and concentrates factors required for RD histone mRNA biogenesis. RD histone mRNAs are the only eukaryotic mRNAs that are not polyadenylated. The unique stem loop at the 3'-end of RD histone mRNAs results from a processing reaction requiring a specialized suite of factors, some of which are constitutively localized in the HLB (Duronio, 2017). The HLB provides a powerful system to study how NBs form and function because it contains a well-characterized set of factors involved in producing a unique class of cell-cycle-regulated mRNAs.P demonstrated that concentrating factors (e.g., FLASH [FLICE-associated huge protein] and U7 snRNP) in the Drosophila melanogaster HLB is critical for efficient histone pre-mRNA processing. However, a full understanding of how the HLB participates in histone mRNA biosynthesis requires knowledge of HLB assembly at the molecular level (Koreski, 2020).
Prior studies of NBs have provided several important assembly concepts that are applicable to the HLB. Many NB components have an intrinsic ability to self-associate, an observation leading to two different models of NB assembly: (1) interactions among NB components occur stochastically, wherein individual factors can be recruited to the body in any order; or (2) components assemble in an ordered or hierarchical pathway, wherein the recruitment of components is predicated on prior recruitment of others. The HLB appears to employ a hybrid version of these two possibilities. For example, genetic loss of function experiments suggest a partially ordered assembly pathway of the Drosophila HLB with some components being required for subsequent recruitment of others. The scaffolding protein Mxc (multi sex combs), the Drosophila ortholog of human NPAT (nuclear protein, ataxia-telangiectasia locus), and FLASH likely form the core of the HLB and are required for subsequent recruitment of other factors (White, 2011). Tethering experiments in mammalian cells indicate that ectopic HLB formation also may be induced by several different HLB components, supporting a stochastic model of assembly (Koreski, 2020).
The initiation event in self-organizing NB assembly is the key step in the process and is not well understood. A prevalent model postulates a 'seeding' event that initiates the nucleation of critical components that form a platform for further recruitment of other components. In some instances, RNA is thought to help seed NB assembly, and NBs such as the nucleolus and ectopic paraspeckles can form at sites of specific transcription. Blocking transcription prevents complete HLB assembly in both zebrafish and flies. However, the HLB is present at RD histone genes even in G1 when the genes are not active, raising the possibility that histone genes themselves participate in seeding HLB assembly (Koreski, 2020).
In Drosophila, the RD histone genes are present at a single locus with ~100 copies of a tandemly arrayed 5-kb repeat unit, each of which contains one copy of the divergently transcribed H2a-H2b and H3-H4 gene pairs as well as the gene for linker histone H1. Using transgenes containing a wild-type or mutant derivative of a single histone repeat, previous work demonstrated that the bidirectional H3-H4 promoter stimulated HLB assembly and transcription of the single histone repeat in salivary glands (Salzler, 2013). It was subsequently shown that the conserved GAGA repeat elements present in the H3-H4 promoter region are targeted by the zinc-finger transcription factor CLAMP (chromatin-linked adaptor for MSL proteins), and that this interaction promotes HLB assembly (Rieder, 2017). Thus, the H3-H4 promoter region might act to seed HLB assembly (Koreski, 2020).
This work leveraged transgenic histone gene arrays to test whether the H3-H4 promoter region is necessary for in vivo function of the RD histone locus. Replacement of H3-H4 promoters with H2a-H2b promoters was shown to result in an attenuated transgenic histone gene array that does not function in the presence of the intact endogenous RD histone locus, but surprisingly provides full in vivo function, including normal HLB assembly and histone gene expression, when the endogenous RD histone locus is absent. These results suggest that multiple elements in the histone genes and core HLB proteins are involved in HLB assembly (Koreski, 2020).
This study used a histone gene replacement platform to analyze the cis acting elements within the Drosophila histone repeat unit that are necessary for HLB formation and histone gene expression. Previously it was shown using a single, transgenic histone gene repeat unit that the promoter region of the divergently transcribed H3-H4 gene pair is capable of stimulating HLB formation (Salzler, 2013). Subsequently this functionality was further mapped using a 12x gene array to conserved GAGA repeats in this region that are targeted by the CLAMP protein (Rieder, 2017). This study presents evidence that a 12xPR histone gene array devoid of the H3-H4 promoter and lacking any CLAMP binding elements cannot assemble an HLB in the presence of the ∼100 RD histone gene copies at the endogenous locus (HisC). However, the 12xPR array surprisingly can rescue homozygous deletion of HisC and fully support the entire Drosophila life cycle. In the HisC deletion background, the 12xPR array assembles an HLB and expresses the same amount of properly processed histone mRNAs as the endogenous genes or as a 12xDWT wild-type array. Below the implications of these observations on HLB assembly and organization are discussed (Koreski, 2020).
Biomolecular condensates form via a seeding event that promotes a high concentration of factors at a discrete location, leading to recruitment of additional factors that ultimately result in a structure that can be observed by light microscopy. A number of putative seeding events for biomolecular condensates have been described, but in many cases the precise mechanism of seeding is not known. Nucleic acids, particularly RNA, have been proposed to seed different NBs. Both the nucleolus and the HLB are associated with specific genomic loci, and it is likely that the DNA (or chromatin) and/or nascent RNA at the locus participates in the seeding event. The activation of zygotic transcription of rRNA leads to the precise spatiotemporal formation of the nucleolus in Drosophila embryos (Falahati, 2016). In the absence of rDNA, Drosophila nucleolar components still form high concentration assemblies, but these are smaller, more numerous, and do not form at the same time in the early embryo as the wild-type nucleolus (Koreski, 2020).
Drosophila HLB components also stochastically assemble smaller and more unstable foci in embryos lacking the RD histone locus, suggesting that HLBs and the nucleolus form similar seeding events. Indeed, the dynamics of HLB assembly in single early Drosophila embryos display properties consistent with liquid-liquid phase transition seeded by HisC (Hur, 2020). Blocking transcription in the early embryo prevents normal HLB growth (Hur, 2020), and a defective H3-H4 promoter (with mutated TATA boxes) does not support HLB formation in the context of a single copy histone gene repeat in salivary glands (Salzler, 2013). These data suggest that active transcription is essential for forming a complete HLB (Koreski, 2020).
It is important to note that HLBs assemble and persist in nonproliferating Drosophila tissues that do not express histone mRNAs and are also present in G0/G1 mammalian cells. Histone gene expression is activated as a result of phosphorylation of Mxc/NPAT by Cyclin E/Cdk2, resulting in changes in the HLB that promote histone gene transcription and pre-mRNA processing. It is proposed that in early embryonic development the histone locus DNA and/or chromatin seeds HLB assembly in Drosophila, with the H3-H4 promoter region being particularly important. It is further proposed that subsequent transcriptional activation of histone genes then drives HLB growth and maturation (Koreski, 2020).
Formation of an HLB on a transgenic RD histone gene array requires that this array compete effectively with the endogenous HisC locus for recruitment of HLB components. This is the situation with 12xHWT and 12XDWT arrays, which form HLBs in the presence of HisC. These results also indicate that there are no other elements within HisC that are necessary for HLB formation. Because the 12xPR array does not form an HLB in the presence of HisC, but it does so in the absence of HisC, it is hypothesized that the endogenous RD histone gene array sequesters critical HLB components, likely including Mxc and CLAMP, thereby preventing HLB assembly at the transgenic locus. By removing the H3-H4 promoter from the transgene, an element was removed that provided additional interactions with HLB components, notably CLAMP, weakening the overall ability of the locus to stably nucleate an HLB (Koreski, 2020).
Interactions among multivalent proteins, or multivalent protein-nucleic acid interactions, are driving forces in the assembly of biomolecular condensates. Mxc is likely the critical factor that together with histone genes seeds Drosophila HLB formation and activates histone gene expression. Mxc is a large (~1800 aa) protein that oligomerizes in vivo and likely provides a scaffold for multivalent interactions. A C-terminal truncation mutant of Mxc that fails to recruit histone pre-mRNA processing factors still forms an HLB and activates histone gene expression at sufficient levels to complete development, underscoring the multivalent nature of Mxc (Koreski, 2020).
Surprisingly, the HLB that assembles on the 12xPR array in the absence of HisC contains CLAMP, even though this study removed all of the known CLAMP binding sites from the histone repeat. Although CLAMP may bind another sequence in the 12xPR array, no other favorable GAGA repeats are present, and it was not possible to detect CLAMP bound to any other location in the histone array by ChIP-qPCR and ChIP-seq experiments. More likely, CLAMP interacts with other HLB components, possibly Mxc or the Mxc-FLASH complex, providing multivalent contacts between CLAMP and other HLB components. Deleting the GAGA sequences from the H3-H4 promoter did not affect transcription of the H3 or H4 genes in the absence of HisC, suggesting that CLAMP's major function is to promote HLB assembly and not to act as a canonical DNA binding transcription factor. Supporting this interpretation is the observation that another, more abundant transcription factor that binds to GAGA repeats, GAF, is not found at the HLB unless CLAMP is absent (Rieder, 2017), consistent with CLAMP's critical interactions with both the GAGA repeats and the HLB factors in seeding the HLB (Koreski, 2020).
Because the 12xPR array is capable of assembling a completely functional HLB in the absence of HisC, the H3-H4 promoter is not absolutely essential for HLB formation. One possibility is that there are multiple pathways for assembling functional HLBs. Previous work suggests that not all seeding events are equivalent in their ability to assemble biomolecular condensates. In artificial systems, changes in scaffold stoichiometry, which can stem from changes in valence, alter the recruitment of components. Further, mathematical modeling has revealed that scaffolds can nucleate distinct complexes when at different concentrations, and that this can qualitatively alter the transcriptional output. Additionally, P-bodies can form in multiple ways through different protein-protein or protein-nucleic acid interactions, with different interactions predominating under different conditions. Therefore, different nucleators of the HLB (i.e., the H3-H4 promoter or other sequences in the locus) may result in similar but not identical outcomes. Collectively these results suggest that HLB formation results from the contribution of many molecular interactions, and the loss of any single one may be overcome by other multivalent interactions within the body (Koreski, 2020).
His4r is the only known variant of histone H4 in Drosophila. It is encoded by the His4r single-copy gene that is located outside of the histone gene cluster and expressed in a different pattern than H4, although the encoded polypeptides are identical. A null mutant (His4r(Δ42)) was generated which is homozygous viable and fertile without any apparent morphological defects. Heterozygous His4r(Δ42) is a mild suppressor of position-effect variegation, suggesting that His4r has a role in the formation or maintenance of condensed chromatin. Under standard conditions loss of His4r has a modest effect on gene expression. Upon heat-stress the induction of the Heat shock protein (HSP) genes Hsp27 and Hsp68 is stronger in His4r(Δ42) mutants with concordantly increased survival rate. Analysis of chromatin accessibility after heat shock at a Hsp27 regulatory region showed less condensed chromatin in the absence of His4r while there was no difference at the gene body. Interestingly, preconditioning before heat shock led to increased chromatin accessibility, HSP gene transcription and survival rate in control flies while it did not cause notable changes in His4r(Δ42). Thus, these results suggest that His4r might play a role in fine tuning chromatin structure at inducible gene promoters upon environmental stress conditions (Farago, 2021).
Precise patterns of gene expression are driven by interactions between transcription factors, regulatory DNA sequence, and chromatin. How DNA mutations affecting any one of these regulatory 'layers' is buffered or propagated to gene expression remains unclear. To address this, allele-specific changes were quantified in chromatin accessibility, histone modifications, and gene expression in F1 embryos generated from eight Drosophila crosses at three embryonic stages, yielding a comprehensive dataset of 240 samples spanning multiple regulatory layers. Genetic variation (allelic imbalance) impacts gene expression more frequently than chromatin features, with metabolic and environmental response genes being most often affected. Allelic imbalance in cis-regulatory elements (enhancers) is common and highly heritable, yet its functional impact doesn't generally propagate to gene expression. When it does, genetic variation impacts RNA levels through H3K4me3 or independently through chromatin accessibility and H3K27ac. Changes in RNA are more predictive of variation in H3K4me3 than vice versa, suggesting a role for H3K4me3 downstream of transcription. The impact of a substantial proportion of genetic variation is consistent across embryonic stages, with 50% of allelic imbalanced features at one stage being also imbalanced at subsequent developmental stages. Crucially, buffering, as well as the magnitude and evolutionary impact of genetic variants, are influenced by regulatory complexity (i.e., number of enhancers regulating a gene), with transcription factors being most robust to cis-acting, but most influenced by trans-acting variation (Floc'hlay, 2020).
Histone-modifying systems play fundamental roles in gene regulation and the development of multicellular organisms. Histone modifications that are enriched at gene regulatory elements have been heavily studied, but the function of modifications found more broadly throughout the genome remains poorly understood. This is exemplified by histone H2A monoubiquitylation (H2AK119ub1), which is enriched at Polycomb-repressed gene promoters but also covers the genome at lower levels. Using inducible genetic perturbations and quantitative genomics, this study found that the BAP1 deubiquitylase plays an essential role in constraining H2AK119ub1 throughout the genome. Removal of BAP1 leads to pervasive genome-wide accumulation of H2AK119ub1, which causes widespread reductions in gene expression. Elevated H2AK119ub1 preferentially counteracts Ser5 phosphorylation on the C-terminal domain of RNA polymerase II at gene regulatory elements and causes reductions in transcription and transcription-associated histone modifications. Furthermore, failure to constrain pervasive H2AK119ub1 compromises Polycomb complex occupancy at a subset of Polycomb target genes, which leads to their derepression, providing a potential molecular rationale for why the BAP1 ortholog in Drosophila has been characterized as a Polycomb group gene. Together, these observations reveal that the transcriptional potential of the genome can be modulated by regulating the levels of a pervasive histone modification (Fursova, 2021).
Examining the role of chromatin modifications and gene expression in neurons is critical for understanding how the potential for behaviors are established and maintained. This study investigate this question by examining Drosophila melanogaster fru P1 neurons that underlie reproductive behaviors in both sexes. A method was developed to purify cell-type-specific chromatin (Chromatag), using a tagged histone H2B variant that is expressed using the versatile Gal4/UAS gene expression system. This study used Chromatag to evaluate five chromatin modifications, at three life stages in both sexes. Substantial changes were found in chromatin modification profiles across development and fewer differences between males and females. Additionally, chromatin modifications were found that persist in different sets of genes from pupal to adult stages, which may point to genes important for cell fate determination in fru P1 neurons. Cell-type-specific RNA-seq data sets were generated, using translating ribosome affinity purification (TRAP). Actively translated genes were identified in fru P1 neurons, revealing novel stage- and sex-differences in gene expression. Chromatin modification enrichment patterns were also found that are associated with gene expression. Next, the chromatin modification data was used to identify cell-type-specific super-enhancer-containing genes. Genes with super-enhancers in fru P1 neurons differ across development and between the sexes. This study validated that a set of genes are expressed in fru P1 neurons, which were chosen based on having a super-enhancer and TRAP-enriched expression in fru P1 neurons (Palmateer, 2021).
Sequence determinants of human gene regulatory elements DNA can determine where and when genes are expressed, but the full set of sequence determinants that control gene expression is unknown. This study measured the transcriptional activity of DNA sequences that represent an ~100 times larger sequence space than the human genome using massively parallel reporter assays (MPRAs). Machine learning models revealed that transcription factors (TFs) generally act in an additive manner with weak grammar and that most enhancers increase expression from a promoter by a mechanism that does not appear to involve specific TF-TF interactions. The enhancers themselves can be classified into three types: classical, closed chromatin and chromatin dependent. This study also showed that few TFs are strongly active in a cell, with most activities being similar between cell types. Individual TFs can have multiple gene regulatory activities, including chromatin opening and enhancing, promoting and determining transcription start site (TSS) activity, consistent with the view that the TF binding motif is the key atomic unit of gene expression (Sahu, 2022).
The temporal and spatial pattern of gene expression is encoded in the DNA sequence; this information is read and interpreted by TFs, which recognize and bind specific short DNA sequence motifs. Major efforts have been undertaken to determine the DNA-binding specificities of TFs in vitro and map their binding positions in vivo. TFs regulate gene expression by binding to distal enhancer elements and to promoters located close to the TSS. Both enhancers and promoters are characterized by RNA transcription, the presence of open chromatin and histone H3 lysine 27 acetylation (H3K27ac). In addition, promoters and enhancers are preferentially marked by histone H3 lysine 4 trimethylation and monomethylation, respectively. Although these features can be mapped genome-wide in a high-throughput manner, they are correlative in nature and do not establish that an element can act as an enhancer, increasing expression from a promoter irrespective of position and orientation. To more directly measure enhancer activity, MPRAs have been developed to study the activity of yeast, Drosophila and human gene regulatory elements on a genome-wide scale. However, unbiased discovery of sequence determinants of human gene expression using only genomic sequences is made difficult by the fact that the genome is repetitive and has evolved to perform multiple functions. Furthermore, the human genome is too short to even encode all combinations, orientations and spacings of approximately 1,639 human TFs in multiple independent sequence contexts. Thus, despite the vast amount of information generated by genome-scale experiments, most sequence determinants that drive the activity of human enhancers and promoters, and the interactions between them, remain unknown (Sahu, 2022).
This study found that the cellular gene regulatory system is relatively complex, consisting of several distinct kinds of elements. Motif grammar is relatively strong at the level of heterodimers but weaker at the level of spacing and orientation of specific TF motif combinations. In transcriptionally active sequences, precise TF arrangements such as those found in the interferon enhanceosome are rare, with most elements consisting of TFs acting together in a largely additive manner. The current results are consistent with a recent report showing that independent actions of TFs can explain over 92% of the transcriptional activity measured from random yeast promoters. The presence of a weak motif syntax is also consistent with known existence of spacing and orientation preferences of TFs in vitro and at human genomic enhancers and synthetic yeast promoters (Sahu, 2022).
The results also show that different cell types have very similar TF activities and that the topology of the gene regulatory network is hierarchical, with few TFs displaying very strong transactivation activity. This is consistent with previous work showing that relatively few TFs show strong DNA-binding activity in a cell and that many of the strong binders are common to various cell types. These findings contrast with the known tissue specificity of many putative enhancer elements in vivo. Interestingly, the level of conservation of many endogenous promoters and enhancers appears to be higher than the elements selected in the current assay. The simplest explanation for these two facts is that enhancers in vivo evolve to be specific and that due to the similarity between cells, specificity is more difficult to achieve than activity. Specificity will naturally require specific TF combinations and also fine-tuning using motif number, spacing, orientation and affinity. Specificity is also required to silence strongly active elements in cell types in which their target protein is not needed due to the substantial fitness cost of protein expression. Further analysis using main cell types representing all three germ layers is needed to determine whether and to what extent differentiated human cell types have retained the regulatory mechanisms that existed in their common unicellular ancestor. Moreover, the contribution of specific TFs to the transcriptional activity in the cell could be further dissected, for example, by testing mutated genomic fragments in MPRAs (Sahu, 2022).
The original functional definition described enhancers as genetic elements that can activate a promoter from a distance, irrespective of their orientation relative to the TSS. This study found that in addition to these elements, two other types of enhancing gene regulatory elements exist: chromatin-dependent enhancers and closed-chromatin enhancers. Chromatin-dependent enhancers are characterized by forkhead motifs, binding of Mediator and p300 protein and a strong signal for H3K27 acetylation. Unlike classical enhancers, chromatin-dependent enhancers do not transactivate a heterologous promoter strongly, most likely due to lack of binding of TFs with strong transactivator domains. Their presence is, however, strongly predictive of tissue-specific gene expression, suggesting that they act to increase gene expression via chromatin modification or structural changes in higher-order chromatin. Several chromatin-dependent enhancers also combine with a single classical enhancer to form superenhancers, indicating that these elements may be required for driving high levels of gene expression from distal promoters. The third element type, closed-chromatin enhancers, are located in regions that show little or no signal for DNase I hypersensitivity or ATAC-seq. They are not silenced by CpG methylation. These elements appear to consist of only a single TF (e.g., p53) or a set of closely bound TFs that fit between or associate directly with well-ordered nucleosomes. The prevalence of both the closed-chromatin enhancers and chromatin-dependent enhancers suggests that they may contribute substantially to regional control of gene expression (Sahu, 2022).
By using machine learning approaches, this study showed that transcriptional activity in human cells can be predicted from sequence features. Interestingly, it was found that the promoters enriched from completely random synthetic sequences in a single experimental step are even more predictive of transcriptional activity than the genomic sequences themselves. By analysis of de novo promoters enriched from random sequences, this study discovered a G-rich element that interacts with the TSS, potentially positioning RNA polymerase II to the TSS independently of the TATA box. Overall, TF activities could be classified into three groups: TSS position-determining activity (e.g., TATA box and YY), short-range promoting activity (e.g., NRF1) and enhancing activity (many TFs). This study did not detect a separate class of distal enhancing activity, suggesting that activities that would allow an enhancer to selectively act at a very long range are likely to be associated with chromatin-dependent enhancers and not classical enhancers. The three classes of activities detected are not mutually exclusive, suggesting that TFs act at multiple levels and/or scales to regulate transcription. For example, YY1 acts as both an enhancing TF and a TSS-determining one, and FOXA motifs are present at both chromatin-dependent and classical enhancer elements. The results thus indicate that TF motifs are the atomic units of gene expression and should be the ultimate basis of analysis and prediction of genomic elements controlling gene regulatory activity (Sahu, 2022).
The random promoter-enhancer design used in this study allowed unbiased discovery of features that facilitate interactions between classical enhancers and promoters at a relatively short range. No specific pair of motifs controlling such interactions was found. This, together with the fact that no specific TF that only acts from an enhancer was found, is consistent with a generic and indirect mechanism of action, where the activities of individual TFs bound to an enhancer are aggregated and their total activity then activates the promoter. Molecularly, these results are consistent with mediation of the effect by the least specific type of biochemical interaction, steric hindrance. The simplest mechanism for enhancer action would involve opening of higher-order and local chromatin in such a way that the steric hindrance that prevents large macromolecular complexes such as Mediator or RNA polymerase II from loading to DNA is decreased. However, in the highly evolved genomic context, more specific interactions can exist between chromatin-dependent enhancers and particular promoters, as reported in a few cases, such as multichromosome structures that control the expression of the repertoire of olfactory receptor genes or the complex regulatory landscape of HOX genes (Sahu, 2022).
In summary, this study shows that direct experimentation to interrogate transcriptional activities of sequences that represent on aggregate >100 times larger sequence space than that of the human genome can be used to determine mechanisms of action of, and interaction between, gene regulatory elements. The experiments revealed unexpected simplicity of gene regulatory logic. The discovery of the relative simplicity of the interactions, together with the ability to measure transcriptional activities of all TFs in a cell, represents a major step toward achieving the ultimate aim of regulatory genomics: predicting gene expression from a sequence (Sahu, 2022).
Histone variants are different from their canonical counterparts in structure and are encoded by solitary genes with unique regulation to fulfill tissue or differentiation specific functions. A single H4 variant gene (His4r or H4r) that is located outside of the histone cluster and gives rise to a polyA tailed messenger RNA via replication-independent expression is preserved in Drosophila strains despite that its protein product is identical with canonical H4. In order to reveal information on the possible role of this alternative H4 endogenous H4r was epitope tagged, and its spatial and temporal expression was studied, and its genome-wide localization to chromatin was revealed at the nucleosomal level. RNA and immunohistochemistry analysis of H4r expressed under its cognate regulation indicate expression of the gene throughout zygotic and larval development and presence of the protein product is evident already in the pronuclei of fertilized eggs. In the developing nervous system a slight disequibrium in H4r distribution is observable, cholinergic neurons are the most abundant among H4r-expressing cells. ChIP-seq experiments revealed H4r association with regulatory regions of genes involved in cellular stress response. The data presented in this study indicate that H4r has a variant histone function (Abraham, 2022).
Protein kinases that phosphorylate histones are ideally-placed to influence the behavior of chromosomes during cell division. Indeed, a number of conserved histone phosphorylation events occur prominently during mitosis and meiosis in most eukaryotes, including on histone H3 at threonine-3 (H3T3ph). At least two kinases, Haspin and VRK1 (NHK-1/ballchen in Drosophila), have been proposed to carry out this modification. Phosphorylation of H3 by Haspin has defined roles in mitosis, but the significance of VRK1 activity towards histones in dividing cells has been unclear. In this study, using in vitro kinase assays, KiPIK screening, RNA interference, and CRISPR/Cas9 approaches, a direct role for VRK1, or its paralogue VRK2, in the phosphorylation of threonine-3 or serine-10 of Histone H3 in mitosis could not be substantiated, although loss of VRK1 did slow cell proliferation. It is concluded that the role of VRKs, and their more recently identified association with neuromuscular disease and importance in cancers of the nervous system, are unlikely to involve mitotic histone kinase activity. In contrast, Haspin is required to generate H3T3ph during mitosis (Cartwright, 2022).
Changes in the distribution of nucleosomes along the genome influence chromatin structure and impact gene expression by modulating the accessibility of DNA to transcriptional machinery. This study compared genome-wide nucleosome positioning and occupancy in five different Drosophila tissue-specific cell lines, and in reconstituted chromatin, and then tests were performed for correlations between nucleosome positioning, transcription factor binding motifs, and gene expression. Nucleosomes in all cell lines were positioned in accordance with previously known DNA-nucleosome interactions, with helically repeating A/T di-nucleotide pairs arranged within nucleosomal DNAs and AT-rich pentamers generally excluded from nucleosomal DNA. Nucleosome organization in all cell lines differed markedly from in vitro reconstituted chromatin, with highly expressed genes showing strong nucleosome organization around transcriptional start sites. Importantly, comparative analysis identified genomic regions that exhibited cell line-specific nucleosome enrichment or depletion. Further analysis of these regions identified 91 out of 16,384 possible heptamer sequences that showed differential nucleosomal occupation between cell lines, and 49 of the heptamers matched one or more known transcription factor binding sites. These results demonstrate that there is differential nucleosome positioning between these Drosophila cell lines and therefore identify a system that could be used to investigate the functional significance of differential nucleosomal positioning in cell type specification (Martin, 2017).
Micrococcal nuclease (MNase) is widely used to map nucleosomes. However, its aggressive endo-/exo-nuclease activities make MNase-seq unreliable for determining nucleosome occupancies, because cleavages within linker regions produce oligo- and mono-nucleosomes, whereas cleavages within nucleosomes destroy them. This study introduces a theoretical framework for predicting nucleosome occupancies and an experimental protocol with appropriate spike-in normalization that confirms our theory and provides accurate occupancy levels over an MNase digestion time course. As with human cells, no overall differences were observed in nucleosome occupancies between Drosophila euchromatin and heterochromatin, which implies that heterochromatic compaction does not reduce MNase accessibility of linker DNA (Chereji, 2019).
The nature of chromatin as regular succession of nucleosomes has gained iconic status. However, since most nucleosomes in metazoans are poorly positioned it is unknown to which extent bulk genomic nucleosome repeat length reflects the regularity and spacing of nucleosome arrays at individual loci. This study describes a new approach to map nucleosome array regularity and spacing through sequencing oligonucleosome-derived DNA by Illumina sequencing and emergent nanopore technology. In Drosophila cells, this revealed modulation of array regularity and nucleosome repeat length depending on functional chromatin states independently of nucleosome positioning and even in unmappable regions. It was also found that nucleosome arrays downstream of silent promoters are considerably more regular than those downstream of highly expressed ones, despite more extensive nucleosome phasing of the latter. This approach is generally applicable and provides an important parameter of chromatin organization that so far had been missing (Baldi, 2018).
As the basal bricks, the dynamics and arrangement of nucleosomes orchestrate the higher architecture of chromatin in a fundamental way, thereby affecting almost all nuclear biology processes. Thanks to its rather simple protocol, ATAC-seq has been rapidly adopted as a major tool for chromatin-accessible profiling at both bulk and single-cell level. However, to picture the arrangement of nucleosomes per se remains a challenge with ATAC-seq. This work introduces a novel ATAC-seq analysis toolkit, named deNOPA, to predict nucleosome positions. Assessments showed that deNOPA not only outperformed state-of-the-art tools, but it is the only tool able to predict nucleosome position precisely with ultrasparse ATAC-seq data. The remarkable performance of deNOPA was fueled by the reads from short fragments, which compose nearly half of sequenced reads and are normally discarded from nucleosome position detection. However, this study found that the short fragment reads enrich information on nucleosome positions and that the linker regions were predicted by reads from both short and long fragments using Gaussian smoothing. deNOPA was applied to a single-cell ATAC-seq dataset and the intrapopulation heterogeneity of the human erythroleukemic cell line (K562) was deciphered. Last, using deNOPA, this study showed that the dynamics of nucleosome organization may not directly couple with chromatin accessibility in the cis-regulatory regions when human cells respond to heat shock stimulation. deNOPA provides a powerful tool with which to analyze the dynamics of chromatin at nucleosome position level in the single-cell ATAC-seq age (Xu, 2021).
This study developed a sensitive and precise nucleosome prediction algorithm based on ATAC-seq data called deNOPA. This assessment showed that deNOPA could accurately predict nucleosome positions, even with ultrasparse data that approximated the zone of single-cell ATAC-seq. With hundreds of scATAC-seq data pooled, deNOPA successfully predicted one or two magnitudes more nucleosome positions than existing algorithms. The capacity to predict nucleosome positions with ultrasparse data makes it possible for ordinary laboratories not equipped with sophisticated instruments or reagents to study the dynamics of nucleosomes at the single-cell level. More importantly, with the fast-developing scATAC-seq technologies, the capacity to predict nucleosome positions with ultrasparse data will also make it possible to study cell-to-cell variations of nucleosome arrangements in the foreseeable future (Xu, 2021).
The ultrasensitivity of deNOPA mainly stemmed from the additional reads rescued from ATAC-seq short fragments normally discarded in nucleosome-related studies. The cutting sites of short and long fragments were found to be highly correlated, suggesting the rich information carried by short fragments for nucleosome detection. In addition, for the bulk of data utilized by deNOPA, little accuracy is lost to achieve ultrasensitive nucleosome position prediction. It is well known that template searching-based strategies, e.g., nucleoATAC, have high accuracy in pattern recognition, while model-based methods are generally more sensitive and computationally efficient. Although the accuracy of deNOPA was found to be less than that of nucleoATAC in many cases, it is, in fact, comparable to, and much better than, all other competitors that were assessed. Thus, deNOPA can successfully balance sensitivity and accuracy (Xu, 2021).
It is worth to noting that the current version of deNOPA worked well only with the index-pool-index-based scATAC-seq data, as is shown in this work. However, the distribution of microfluidics-based, e.g., the chromium from 10x Genomics, scATAC data was found to be substantially different from that of index-pool-index-based data in the genome. As clarified in demo data by 10x Genomics, more than 70% of fragments locate at the DHS, which is nucleosome-depleted; consequently, few nucleosome positions could be predicted by deNOPA (Xu, 2021).
Finally, the usefulness of deNOPA was demonstrated by applying it on the heat shock response regulation problem of human cells with ordinary ATAC-seq sequencing depth. It has been reported that both chromatin accessibility and nucleosome positions may respond to temperature stress. However, most of these studies were performed in poikilotherm, in which the temperature in the experiments was relatively lower compared to the mammalian system. One possible reason for the shortage of such research in mammals might involve the temperature sensitivity of micrococcal nuclease. This made deNOPA a good choice to study chromatin and nucleosome dynamics in response to temperature changes in mammals. The present work is the first such study in the mammalian system on the dynamics of chromatin accessibility and nucleosome positioning in response to heat shock in human cells. The pattern observed in human cells was similar to that previously reported in fly, but different from that of yeast. A more detailed assay may need to be performed in the future to reveal species-specific regulation of chromatin and nucleosome in response to temperature changes, which trigger more complicated physiological consequences in homothermal animals compared to flies. In summary, deNOPA could be a powerful tool to address broad questions on dynamics of chromatin with ultralow resolution ATAC-seq data (Xu, 2021).
Developmental genes in metazoan genomes are surrounded by dense clusters of conserved noncoding elements (CNEs). CNEs exhibit unexplained extreme levels of sequence conservation, with many acting as developmental long-range enhancers. Clusters of CNEs define the span of regulatory inputs for many important developmental regulators and have been described previously as genomic regulatory blocks (GRBs). Their function and distribution around important regulatory genes raises the question of how they relate to 3D conformation of these loci. This study shows that clusters of CNEs strongly coincide with topological organisation, predicting the boundaries of hundreds of topologically associating domains (TADs) in human and Drosophila. The set of TADs that are associated with high levels of noncoding conservation exhibit distinct properties compared to TADs devoid of extreme noncoding conservation. The close correspondence between extreme noncoding conservation and TADs suggests that these TADs are ancient, revealing a regulatory architecture conserved over hundreds of millions of years. Metazoan genomes contain many clusters of conserved noncoding elements. This study provides evidence that these clusters coincide with distinct topologically associating domains in humans and Drosophila, revealing a conserved regulatory genomic architecture (Harmston, 2017).
This work shows that the span of clusters of CNEs, known as GRBs, is predictive of the span of a subset of topologically associated domains (TADs) in both humans and Drosophila. These sets of TADs, referred to as GRB-TADs, are some of the largest, strongest and most gene sparse in humans and Drosophila and show distinct patterns of retrotransposon density and CTCF binding. Regions containing homologous developmental genes are associated with the same type of conservation and structure in humans and Drosophila. Not only are regulatory elements in these regions under intense selective pressure, but the association between the boundaries of GRBs and TADs also suggests that the basic 3D structure of these loci has existed over hundreds of millions of years of evolution, at least back to the ancestor of chordates and arthropods. At even more distant timescales, the phenomenon of microsynteny has existed since early Metazoa and is apparent across bilateria1. The syntenic relationship between RUNX2 and SUPT3H is conserved between humans and sponges, and the GRB containing the Iroquois and Sowah genes is conserved across a wide range of bilaterians, apart from tetrapods. This correspondence suggests that the regulatory domains around developmental gene paralogues created by whole-genome duplications (WGDs) are ancestral and not the result of convergent evolution, and may have existed since the origin of Metazoa (Harmston, 2017).
The striking concordance between the distribution of CNEs and topological organisation has far-reaching implications for understanding the nature and evolution of long-range regulation at developmental loci. Combining the concepts of GRBs and TADs leads to a model of regulatory domains with stronger predictive and explanatory power than either concept alone. The GRB model provides a framework for long-range regulation, in which the majority of regulatory elements within a GRB are dedicated to the control of its target gene, with other genes not responding to long-range regulation despite being located close to regulatory element. Target and bystander genes appear to exhibit distinct features that may help to explain this specificity. Topological organisation data provide more precise boundary estimates for these regions, including the ability to separate adjacent GRBs, and contributes information about the stability of the organisation of these domains and regulatory interactions within them across cells and tissue types. Therefore, a TAD enriched for extreme noncoding conservation is not representative of the regulatory domain of all of its constitutive genes, but primarily corresponds to the regulatory domain of the target gene under long-range regulation (Harmston, 2017).
The lower density of active genes and increased likelihood of Polycomb repression at GRBs may explain some of their topological features. Since GRBs represent the regulatory domains of developmental genes, in any one tissue or developmental stage most GRBs will be inactive and marked by Polycomb/H3K27me358. The degree of intermixing of chromatin between genomic domains depends strongly on their epigenetic state, with Polycomb-repressed chromatin showing little or no spatial overlap with active or inactive chromatin. It has been suggested that regions of active chromatin within TADs interfere with the packaging of chromatin, disrupting TAD formation and leading to less compact TADs, or fragmentation into smaller TADs60. The enrichment of active euchromatin at the boundaries of these regions may reflect the formation of barriers (Harmston, 2017).
The conservation and divergence of CTCF-binding sites is thought to play important roles in the evolution of regulatory domains25, 45. Constitutive CTCF sites are enriched at GRB and GRB-TAD boundaries, suggesting these regions are consistently insulated from neighbouring domains. The enrichment of CTCF at GRB and GRB-TAD boundaries, combined with the strength of interactions associated with GRBs, further suggests that these regions do not strongly interact with elements in adjacent domains. This is in addition to the association of GRBs with Polycomb-repressed chromatin and depletion of active euchromatin, which also promotes insulation from neighbouring regions, as described above. These features are reflective of the multiple mechanisms involved in insulating target genes from ectopic regulation by elements in neighbouring domains. It is highly probable that this type of restricted topological structure arose early in Metazoan evolution as a result of strong selective pressure to prevent this (Harmston, 2017).
GRBs and their constituent CNEs are defined by sequence conservation alone, which is stable across all cell types, with TADs having been found to be largely invariant across cell lines and between species. Even though the current methods for estimating GRBs have limitations with respect to their precision and coverage, this study has shown that the distribution of CNEs can nevertheless serve as an excellent proxy for the extent of a functionally distinct subset of TADs in both humans and Drosophila. Therefore, clusters of CNEs identified between evolutionarily distant species could be used to infer regulatory domains and predict their topological organisation in species lacking Hi-C data. TADs are also organised into inactive and active chromatin compartments, and the assignment of TADs to compartments can vary across cell types. GRB-TADs are preferentially located within compartment B across all of the lineages investigated, although they are more likely to switch compartments than those TADs lacking extreme noncoding conservation. This is consistent with previous work showing that GRBs represent regions that encompass regulatory elements of specific developmental regulators, which are repressed during the majority of developmental stages and cell types and only expressed in a limited subset (Harmston, 2017).
A remarkable property of TADs and GRBs is that they expand and shrink along with the entire genome, suggesting that the 3D organisation of regulatory loci is robust towards gain and loss of DNA between its constituent CNEs, even though insertion of repetitive elements is disfavoured. Previously, it was observed that GRBs are on average much more compact in Drosophila than in mammals, and that they also scale with genome size in fish genomes, and as previously observed TADs are smaller in Drosophila, concordant with the current hypothesis. Duplicated loci containing developmental regulators are more likely to be retained after WGD. Therefore, since tetrapod lineages have undergone two WGDs, a larger number of developmental regulators and associated GRBs are expected compared to arthropods, which is confirmed by the current analysis (Harmston, 2017).
The expansion and shrinkage of TADs and the turnover of regulatory elements within them is likely to be an important mechanism in metazoan evolution and responsible for differences in organismal complexity. The disruption of TAD boundaries perturbs spatial organisation, enhancer-promoter interactions and the expression of target genes. Deletions within TADs can lead to changes in enhancer-promoter interactions, in some cases causing disease, which suggests some level of selective pressure against such re-arrangements. The depletion of retrotransposons within GRB-TADs suggests that their insertion within this type of regulatory domain is under intense negative selection. This pressure may be due to the ability for retrotransposons to create new cis-regulatory elements, potentially perturbing the organisation of interactions within a regulatory domain, leading to ectopic expression and resulting in a negative effect on fitness in the majority of cases. In future, analysis of the evolutionary dynamics of CNEs and genomic rearrangements within TADs in multiple species will help to provide insights into their evolutionary dynamics (Harmston, 2017).
Because of the unknown reason for the extreme conservation of CNEs, their function as enhancers, and that their distribution closely follows the span of the TADs around genes known to be involved in long-range developmental regulation, it is tempting to speculate that CNEs are somehow directly involved in the chromatin folding of TADs, precisely arranging promoters and enhancers in 3D space. Indeed, just like TADs, the spatial proximity of promoters and developmental enhancers seems to be stable across different cell types, regardless of the activity of either. Currently, there is no evidence from sequence analysis that CNEs are involved in sequence-mediated interactions, and their role in chromatin folding remains an open question (Harmston, 2017).
It is concluded that this subset of TADs, which are associated with high levels of noncoding conservation, are functionally distinct, evolutionarily ancient 3D structures which represent the regulatory domains of key genes involved in embryonic development and morphogenesis. However, the spatial correspondence of GRBs and TADs does not offer immediate suggestions for the origin of extreme noncoding conservation. Just like other potential sources of selective pressure acting on these elements, current models of genome folding do not include a mechanism that could account for this level of selective pressure on elements within TADs. The main findings of this paper may help with formulating new hypotheses by focusing on their potential roles within TADs (Harmston, 2017).
A key question in developmental biology is how cellular differentiation is controlled during development. While transitions between trithorax-group (TrxG) and polycomb-group (PcG) chromatin states are vital for the differentiation of ES cells to multipotent stem cells, little is known regarding the role of chromatin states during development of the brain. This study shows that large-scale chromatin remodelling occurs during Drosophila neural development. The majority of genes activated during neuronal differentiation are silent in neural stem cells (NSCs) and occupy black chromatin and a TrxG-repressive state. In neurons, almost all key NSC genes are switched off via HP1-mediated repression. PcG-mediated repression does not play a significant role in regulating these genes, but instead regulates lineage-specific transcription factors that control spatial and temporal patterning in the brain. Combined, these data suggest that forms of chromatin other than canonical PcG/TrxG transitions take over key roles during neural development (Marshall, 2017).
Using Targeted DamID, this study has compiled cell type-specific DNA-binding maps of key chromatin proteins in NSCs and neurons, and used these to investigate the chromatin transitions that occur during neural development in the Drosophila brain. The data reveal unexpected roles for chromatin states in this context (Marshall, 2017).
The Black chromatin state—a silent state lacking PcG or HP1a constituents—was identified in cultured embryonic Kc167 Drosophila cells but its role during development was previously unknown. The current data clearly demonstrate the importance of this form of chromatin and the novel TrxG-repressive state during development in vivo, with both states covering the majority of genes that are silent in NSCs but active in neurons. It is not yet known whether Black chromatin is actively repressive and requires chromatin remodelling to become active, or merely passively silent, requiring only the binding of appropriate transcription factors to activate transcription. Enrichment of the linker histone H1 in this state would suggest that chromatin is less accessible. Black chromatin in Drosophila and other organisms has been reported also to contain varying levels of the H3K27me2 mark, which may contribute to repression, and which suggested a link to PcG-associated chromatin. Interestingly, however, very few transitions were observed between Black and PcG chromatin: between NSCs and immature neurons, <0.5 Mb of DNA transitioned between Black and PcG states in either direction and the PcG-associated states separated strongly from Black chromatin in these models. If Black chromatin does contain H3K27me2 in the brain, there would appear to be little conversion between H3K27me2 and H3K27me3 during neural development (Marshall, 2017).
In addition to many genes that transitioned from Black chromatin in NSCs to an active TrxG state in neurons, a subset of genes was observed that transitioned between Black and non-TrxG active chromatin. These genes were strongly enriched for those encoding metabolic function (in accordance with previous reports of housekeeping and metabolism genes being associated with the non-TrxG active state) and were activated without the presence of Brm. These transitions may be similar to those recently described in nematodes and Drosophila imaginal discs for the activation of developmentally regulated genes without TrxG-associated chromatin marks (Marshall, 2017).
The TrxG-repressive state describe in this study is intriguing. This state is enriched for Brm binding in transcriptionally silent chromatin, and covers a large number of neuronal genes in NSCs. PcG-independent repression via the REST complex in ES cells has been reported to be associated with neuronal genes. These genes were found to be present in a poised state that included the TrxG-associated H3K4me3 mark, a situation very similar to the TrxG-repressive state that was observed in this study (Marshall, 2017).
In neural development, little role was observed for PcG repression in controlling broad cell fate, but instead a role was identifed in specifying different neuronal lineages. Specific PcG repression/TrxG activation has previously been described for lineage-specific transcription factors within the mushroom body of the brain. A mixed PcG chromatin state has also been previously associated with spatially compartmentalised transcription factors in ChIPseq of whole wing discs, although that study was not cell type-specific. This last study speculated that such a mixed state might be indicative of bivalent chromatin in Drosophila. However, both the former study and the current found concomitant occupancy of RNA Pol II on gene bodies in the PcG-mixed state, an observation at odds with silent bivalent chromatin. Combined with a significantly enriched association with lineage-specific transcription factors from existing RNA-seq data and immunofluorescence studies, the current data suggest that this chromatin state is a genuine mixture between PcG repression in some lineages and TrxG activation in other lineages. No evidence was seen in these data sets of true bivalent chromatin as reported in ES cells (strong binding of both PcG and TrxG components with no transcription) (Marshall, 2017).
In conclusion, these data present a picture of neural development that differs from that observed during the differentiation of ES cells in culture. Although lineage-specific transcription factors that generate neuronal diversity are regulated through PcG repression, the majority of genes activated in neurons follow different chromatin transitions. These genes are present in the silent Black chromatin state and a novel TrxG-repressive chromatin state in NSCs and transition to a TrxG permissive chromatin state in immature neurons. Furthermore, this study demonstrated that almost all NSC identity and cell cycle genes are repressed in HP1-associated chromatin in neurons, and that this repression occurs concomitantly with a wide-scale accumulation of HP1-associated binding across the genome. Although canonical PcG/TrxG transitions are vital during early development, these data suggest that other forms of chromatin take over important regulatory roles during neural development (Marshall, 2017).
Genome-wide studies has identified two enhancer classes in Drosophila that interact with different core promoters: housekeeping enhancers (hkCP) and developmental enhancers (dCP). It is hypothesized that the two enhancer classes are occupied by distinct architectural proteins, affecting their enhancer-promoter contacts. It was determined that both enhancer classes are enriched for RNA Polymerase II, CBP, and architectural proteins but there are also distinctions. hkCP enhancers contain H3K4me3 and exclusively bind Cap-H2, Chromator, DREF and Z4, whereas dCP enhancers contain H3K4me1 and are more enriched for Rad21 and Fs(1)h-L. Additionally, the interactions of each enhancer class were mapped utilizing a Hi-C dataset with <1 kb resolution. Results suggest that hkCP enhancers are more likely to form multi-TSS interaction networks and be associated with topologically associating domain (TAD) borders, while dCP enhancers are more often bound to one or two TSSs and are enriched at chromatin loop anchors. The data support a model suggesting that the unique architectural protein occupancy within enhancers is one contributor to enhancer-promoter interaction specificity (Cubenas-Potts, 2017).
This study characterize the protein occupancy, chromatin interactions and architecture profiles for the two enhancer classes found in Drosophila. Each enhancer class has distinct H3K4 methylation states, is bound by both common and distinct architectural proteins, and is involved in distinct types of chromatin interactions. First, it was established that hkCP enhancers exclusively bind CAP-H2, Chromator, DREF and Z4, while dCP enhancers do not and are preferentially enriched for but not exclusively bound by Fs(1)h-L and Rad21. In addition, hkCP enhancers are more likely than dCP enhancers to associate with multiple TSSs, which promotes a higher transcriptional output. Finally, hkCP enhancers preferentially associate with topologically associating domain (TAD) borders, whereas dCP enhancers are enriched at chromatin loop anchors present inside TADs. Interestingly, enhancers activated by both core promoters exhibit more hkCP enhancer like characteristics, indicating that the both CP enhancers may represent an intermediate among the distinctive hkCP and dCP enhancers. Altogether, these results provide strong correlative evidence, supporting a model suggesting that architectural proteins are critical regulators of enhancer-promoter interaction specificity and that the interactions between enhancers and promoters significantly contribute to the generation of 3D chromatin architecture (Cubenas-Potts, 2017).
The importance of architectural proteins in regulating enhancer-promoter interactions in Drosophila is supported by the observation that the vast majority of architectural protein sites present in the genome correspond to enhancers and promoters. Historically, architectural proteins were identified as insulators, which were functionally demonstrated to block enhancer-promoter interactions. The insulator function of architectural proteins correlates with their enrichment at TAD borders. However, several lines of evidence, including ChIA-PET analysis of CTCF- and cohesin-mediated interactions in mammals, suggest that these architectural proteins help mediate long range contacts among regulatory sequences. In Drosophila this study observed that nearly all of the Group 1 and Group 2 architectural protein sites are associated with enhancers or promoters defined by STARR-seq, TSSs or CBP peaks, suggesting that architectural proteins help mediate enhancer-promoter interactions. Notably, Group 3 architectural proteins include the classic insulator proteins CTCF, CP190, Mod(mdg4) and SuHw, and at least 25% of their peaks cannot be explained by enhancers or promoters. It is interesting to speculate that the non-enhancer-promoter sites may be involved in more classical insulator functions or contributing to the chromatin architecture of inactive regions of the genome (Cubenas-Potts, 2017).
The conclusion that architectural proteins are critical regulators of the specificity between enhancers and promoters is supported by two main lines of evidence. First, the current results demonstrate a strong correlation between each enhancer class and distinct architectural protein subcomplexes. Functional evidence supporting this conclusion comes from mutational analyses of the DRE motif in the distinct enhancer classes, which likely recruits DREF and the other hkCP enhancer associated architectural proteins. Zabidi (2015) demonstrated that the tandem DRE motif alone was sufficient to enhance expression of the housekeeping core promoter and that mutation of DRE motifs within an hkCP enhancer reduced its promoter interactions in a luciferase assay. Furthermore, addition of a DRE motif to a dCP enhancer changed its promoter specificity. Because DREF and potentially BEAF-32 bind to the DRE motif, these results strongly support a model suggesting that the differential occupancy of Cap-H2, Chromator, DREF and Z4 in the two enhancer classes is a critical regulator of their specific interactions with the core promoter types. However, the data cannot discount the notion that unique transcription factor binding at proximal TSSs also contribute to the specificity of enhancer-promoter interactions. Although hkCP enhancer identity is most highly correlated with CAP-H2, Chromator, DREF and Z4 localization, these four architectural proteins are not found in isolation within hkCP enhancers. BEAF-32 and CP190 are also strongly enriched in hkCP enhancers, which are also associated with high occupancy APBSs and TAD borders. Thus, the full architectural protein complement at hkCP enhancers is far more complex than the four hkCP-specific architectural proteins. In addition, architectural proteins that are truly unique to dCP enhancers were not detected. Because dCP enhancers exhibit higher cell type specificity, it cannot be discounted that there are additional dCP enhancers present in the Drosophila genome that were not identified by STARR-seq and thus, excluded from this analysis. From these studies, it is unclear if the enrichment of Fs(1)h-L and Rad21, particularly because Fs(1)h-L and Rad21 are present in hkCP enhancers at lower levels, or the absence of BEAF-32, CAP-H2, Chromator, CP190, DREF and Z4 truly distinguishes the architectural protein complexes found at dCP enhancers. In the future, careful biochemical analyses will be required to gain a comprehensive understanding of the complete organization of architectural protein subcomplexes associated with each enhancer class (Cubenas-Potts, 2017).
hkCP enhancers are associated with multi-TSS chromatin interactions and TAD borders. The promoter-clustering by hkCP enhancers results in a dose-dependent increase in transcriptional output for the interacting genes. Thus, one likely molecular mechanism by which hkCP enhancers promote robust transcriptional activation is by increasing the local concentration of RNA Polymerase II and general transcription factors (GTFs) by bringing multiple TSSs into close proximity. It is interesting that the hkCP enhancers, which form promoter clusters, are associated with TAD borders. It is speculated that the hkCP enhancer interactions involve inter-TAD contacts within the A-type compartment, indicative of the formation of transcription factories (70). From this analysis, it is unclear if the hkCP enhancers alone are sufficient for the formation of the 3D interactions or the neighboring TSSs and their associated transcription factors are also contributing to these contacts. It is hypothesized that the genes recruited to the factories contain the housekeeping promoter motifs (DRE, Ohler 1, Ohler 6 and TCT) and that the hkCP enhancer residents Cap-H2, Chromator, DREF and Z4, are critical to the formation of these 3D contacts (Cubenas-Potts, 2017).
dCP enhancers are more likely to be present within TADs and are enriched on the subTAD-like chromatin loop anchors. dCP enhancers do not form promoter clusters, but are more likely to interact with individual TSSs. One possible explanation for this observation is that the genes interacting with dCP enhancers require the binding of sequence-specific transcription factors, and increasing the concentration of GTFs and RNA polymerase II is not an effective mechanism to promote transcriptional output. The chromatin loop association is consistent with dCP enhancers forming a strong contact with a single TSS. However, it is acknowledged that dCP enhancers are likely one of multiple molecular mechanisms contributing to chromatin loop formation. Surprisingly, the chromatin loops that were observed in Drosophila are distinct from the chromatin loops described in humans. A recent study reported approximately 10,000 chromatin loops in the genome of GM12878 lymphoblastoid cells, but this study detected only 458 chromatin loops in Drosophila utilizing a similar method. The reason why there are so few chromatin loops in Drosophila compared to humans is unclear. It is possible that chromatin loops represent a more precise level of architecture within TADs between specific enhancers and promoters in mammals, but because TADs are significantly smaller in flies (median size 32.5 kb compared to 880 kb in mice, the chromatin loops are not as prominent or easily detected in the Drosophila genome. Notably, it appears that the chromatin loops are generated by different architectural proteins in the two species. The chromatin loops in humans are anchored by convergent CTCF motifs, while the results presented in this study demonstrate that the chromatin loop anchors in Drosophila are depleted of CTCF. Because the chromatin loops in Drosophila show a strong enrichment for Fs(1)h-L, a Brd4 homolog, and the architectural proteins Rad21, Nup98, TFIIIC and Mod(mdg4), it is possible that a combination of transcription and architectural proteins is required for chromatin loop formation in flies, which may be different from mammals . Altogether, it is clear that dCP enhancers are involved in individual contacts with TSSs and are likely one mechanism by which chromatin loops form in Drosophila (Cubenas-Potts, 2017).
Surprisingly, only ~20% and ~12.5% of all hkCP enhancer and ~7.5% and ~8.5% of dCP enhancer interactions involve a TSS or enhancer on the opposite anchor, respectively. The biological significance of the enhancer to non-TSS association is unclear. One possible explanation is that current methods for identifying statistically significant interactions are not sufficiently robust and that many of the enhancer to non-TSS interactions are not representative of biologically significant contacts. However, it cannot be discounted that the non-TSS interactions mediated by enhancers are real and the biological significance of these contacts remains to be determined. Throughout this analysis, the patterns of TSS interactions were compared with each enhancer class instead of drawing conclusions about the absolute number of TSSs bound per enhancer, minimizing the impact of any non-specific interactions within the data. Additional molecular studies for the various type of enhancer interactions (enhancer to promoter, enhancer to non-TSS, etc.) will be required to evaluate the various biological contributions of each (Cubenas-Potts, 2017).
This study found that the functional differences between enhancers that activate housekeeping versus developmental genes are reflected in their chromatin and architectural protein composition, and in the type of interactions they mediate. hkCP enhancers are marked by H3K4me3, associate with TAD borders, and mediate large TSS-clustered interactions to promote robust transcription. This class of enhancers contain the architectural proteins CAP-H2, Chromator, DREF and Z4. In contrast, dCP enhancers are marked by H3K4me1, associate with chromatin loop anchors and are more commonly associated with single TSS-contacts. dCP enhancers are depleted of the hkCP-specific architectural proteins and show an enrichment for Fs(1)h-L and Rad21. The results support a model suggesting that the unique occupancy of architectural proteins in the distinct enhancer classes are key contributors to the types of interactions that enhancers can mediate genome-wide, ultimately affecting enhancer-promoter specificity and 3D chromatin organization. In the future, further characterization of the broadly defined housekeeping and developmental enhancers into smaller subclasses may yield additional levels of regulation and formation of unique architectural protein and transcription factor protein complexes as key mediators of long range chromatin contacts (Cubenas-Potts, 2017).
Enhancers harbor binding motifs that recruit transcription factors (TFs) for gene activation. While cooperative binding of TFs at enhancers is known to be critical for transcriptional activation of a handful of developmental enhancers, the extent of TF cooperativity genome-wide is unknown. This study coupled high-resolution nuclease footprinting with single-molecule methylation profiling to characterize TF cooperativity at active enhancers in the Drosophila genome. Enrichment of short micrococcal nuclease (MNase)-protected DNA segments indicates that the majority of enhancers harbor two or more TF-binding sites, and this study uncovered protected fragments that correspond to co-bound sites in thousands of enhancers. From the analysis of co-binding, this study found that cooperativity dominates TF binding in vivo at the majority of active enhancers. Cooperativity is highest between sites spaced 50 bp apart, indicating that cooperativity occurs without apparent protein-protein interactions. These findings suggest nucleosomes promote cooperativity because co-binding may effectively clear nucleosomes and promote enhancer function (Rao, 2021).
This study exploited MNase-resistant protections of chromatin to detect bound proteins at high resolution and infer the regulatory architecture of enhancers. Enhancers have been thought to have poorly positioned nucleosomes, perhaps corresponding to weak initiation of transcription within elements, but this study found that alignment of active enhancers by the factor-protected regions within them resolves chromatin features, revealing that enhancers—-like active promoters—-are structured and have defined nucleosome-depleted regions (NDRs). Notably, while factor-protected regions within enhancers often encompass recognizable consensus motifs for known TFs, many features and even elements lack any statistically significant motif. As the Drosophila TF repertoire has been extensively characterized, this highlights that the rules dictating factor binding in vivo remain incomplete. Other aspects of chromatin beyond the sequences directly contacted by TF DNA-binding domains must promote the recognition and effective binding of regulatory sites (Rao, 2021).
Information that guides factor binding may come from DNA conformation around binding sites. Additionally, cooperativity between multiple TFs in a regulatory element can increase affinity and specificity for weaker consensus motifs. TFs juxtaposed on a regulatory element might also enhance the affinity of each factor to DNA in vitro, but given the fast transient binding of factors in vivo, it has not been clear how widespread factor cooperativity is. While multiple TFs do bind independently at some active enhancers, 64% of active enhancers in the fly genome display substantial degrees of factor cooperativity. These cooperative interactions are not due to dimeric factors, since the cases that were identified occur between factors that bind regulatory elements that are >30 bp apart. In some cases, cooperativity occurs between factors as far apart as 140 bp. Such long-distance synergies might be due to interacting factors that bridge distant sites or due to effects of nucleosome positioning (Rao, 2021).
TF cooperativity correlated with nucleosome occupancy and histone turnover at active enhancers, implying antagonism between TFs and histones for DNA. In the context of chromatin, binding of multiple spaced TFs competes with nucleosome formation. In dynamic nucleosomes where DNA is being unwrapped and rewrapped across the surface of a histone octamer, binding of TFs at exposed DNA can block rewrapping of octamers. The efficiency of blocking the restoration of a nucleosome depends on the relative positioning of factor binding sites, where multiple binding sites on one side of a nucleosome are better competitors. The observation that factor cooperativity occurs predominantly between sites spaced 50 bp apart in active enhancers fits with this idea. In this line of thinking, an important aspect of factor binding site grammars may be loosely constrained arrangements of sites that primarily act to destabilize nucleosomes. As many TFs recruit chromatin remodeling enzymes to their binding sites, catalyzed displacement of nucleosomes may also contribute to cooperative occupancy of regulatory elements (Rao, 2021).
A striking but unexplained observation in single-molecule profiling of eukaryotic chromatin is that factor binding sites are not bound by a cognate factor or occluded in a nucleosome up to ~25% of the time. These observations agree well with single-molecule tracking experiments that show only a small fraction of TFs to be bound stably to chromatin and most TFs to have a short residence time in the order of seconds at stably bound sites. Future experiments to directly measure the turnover time of TFs at their binding sites by high-resolution live imaging, by techniques like SNAP-tagging and Anchor-away, can uncover the underlying basis for such high levels of unbound states. How do regulatory elements function when no TF is bound? While TFs may often be absent from a regulatory element, part of the answer may lie in that restoration of nucleosomes is slow compared to the binding and release of factors in an active regulatory element. With slow nucleosomal restoration, transient binding of factors maintains a regulatory element in an exposed configuration where factors can cycle on and off. The persistence of histone modifications on flanking nucleosomes may similarly provide a short-term memory of regulatory events when factors are not bound. In these ways, nucleosome dynamics may provide a mechanism to temper stochastic effects of transient factor binding in vivo and thus provide stable regulatory output to direct gene expression (Rao, 2021).
In this study, TF binding was determined using two orthogonal methods: MNase-seq and dSMF. Mapping of TF binding by both these methods depends on the occupancy of a site, how tightly it is bound by a TF, and how effectively the TF protects underlying DNA—factors that are common to most genomic methods that map TF binding. Pairs of TFBSs separated by distances starting at 30 bp were studied because the methods used cannot resolve individual binding events at <30 bp. Future studies could determine if cooperativity between TFs is even stronger at distances shorter than 30 bp, with and without protein-protein interactions (Rao, 2021).
Chromatin condenses several folds to form mitotic chromosomes during cell division and decondenses post-mitotically to re-occupy their nuclear territory and re-gain their specific transcriptional profile in a precisely lineage specific manner. This necessitates that the features of nuclear architecture and DNA topology persist through mitosis. This study compared the proteome of nuclease and high salt resistant fraction of interphase nucleus known as nuclear matrix (NuMat) and an equivalent biochemical fraction in the mitotic chromosome known as mitotic chromosome scaffold (MiCS). This study elucidates that as much as 67% of the NuMat proteins are retained in the MiCS indicating that the features of nuclear architecture in interphase nucleus are retained on the mitotic chromosomes. Proteins of the NuMat/MiCS have large dynamic range of MS signal and were detected in sub-femtomolar amounts. Chromatin/RNA binding proteins with hydrolase and helicase activity are highly enriched in NuMat as well as MiCS. While several transcription factors involved in functioning of interphase nucleus are present exclusively in NuMat, protein components responsible for assembly of membrane-less nuclear bodies are uniquely retained in MiCS. This study clearly indicates that the features of nuclear architecture, in the structural context of NuMat, are retained in MiCS and possibly play an important role in maintenance of cell lineage specific transcriptional status during cell division and thereby, serve as components of cellular memory (Sureka, 2018).
Eukaryotic genome is compartmentalized into structural and functional domains. One of the concepts of higher order organization of chromatin posits that the DNA is organized in constrained loops that behave as independent functional domains. Nuclear Matrix (NuMat), a ribo-proteinaceous nucleoskeleton, provides the structural basis for this organization. DNA sequences located at base of the loops are known as the Matrix Attachment Regions (MARs). NuMat relates to multiple nuclear processes and is partly cell type specific in composition. It is a biochemically defined structure and several protocols have been used to isolate the NuMat where some of the steps have been critically evaluated. These sequences play an important role in genomic organization it is imperative to know their dynamics during development and differentiation. This study looked into the dynamics of MARs when the preparation process is varied and during embryonic development of D. melanogaster. A subset of MARs termed as "Core-MARs" present abundantly in pericentromeric heterochromatin, are constant unalterable anchor points as they associate with NuMat through embryonic development and are independent of the isolation procedure. Euchromatic MARs are dynamic and reflect the transcriptomic profile of the cell. New MARs are generated by nuclear stabilization, and during development, mostly at paused RNA polymerase II promoters. Paused Pol II MARs depend on RNA transcripts for NuMat association. These data reveals the role of MARs in functionally dynamic nucleus and contributes to the current understanding of nuclear architecture in genomic context (Sureka, 2022).
Lipid droplets (LDs) are highly dynamic organelles that store neutral lipids. Through a gene overexpression screen in the Drosophila larval fat body, this study has identified that MRT, an Myb/switching-defective protein 3 (Swi3), Adaptor 2 (Ada2), Nuclear receptor co-repressor (N-CoR), Transcription factor (TF)IIIB (SANT)-like DNA-binding domain-containing protein, regulates LD size and lipid storage. MRT directly interacts with, and is functionally dependent on, the PZG and NURF chromatin-remodeling complex components. MRT binds to the promoter of plin1, the gene encoding the LD-resident protein perilipin, and inhibits the transcription of plin1. In vitro LD coalescence assays suggest that mrt overexpression or loss of plin1 function facilitates LD coalescence. These findings suggest that MRT functions together with chromatin-remodeling factors to regulate LD size, likely through the transcriptional repression of plin1 (Yao, Y., 2018).
Lipid droplets (LDs) are widely distributed and highly evolutionarily conserved organelles that comprise a phospholipid monolayer, a neutral lipid core, and numerous LD-associated proteins. The size of LDs varies greatly in different kinds of cells and even in the same cell type under different physiological conditions. Adipocyte hypertrophy, characterized by increased LD size, has been proved to be the major mechanism that induces the expansion of adipose tissue in obese individuals (Yao, Y., 2018).
The size of LDs is regulated by several distinct processes such as targeted delivery of neutral lipids from the endoplasmic reticulum (ER) to LDs via LD-ER bridges, local triglyceride (TAG) synthesis by LDs, atypical LD fusion, and LD coalescence. Numerous key factors in these processes, including proteins and phospholipids, have been identified. Perilipin1 (PLIN1), the first protein that was identified on LDs, is important for fat mobilization and regulates lipolysis in both mammals and flies. PLIN1 is post-translationally phosphorylated by protein kinase A (PKA) after β-adrenergic stimulation, which is essential for its function in regulating lipolysis. In mice, PLIN1 also mediates atypical fusion of LDs by activating FSP27. Transcriptionally, the Plin1 gene is activated by peroxisome proliferator-activated receptor-γ (PPARγ) and suppressed by liver X receptor-α (LXR-α). Whole-genome loss-of-function screens and functional studies in Caenorhabditis elegans and Drosophila have also identified many genes involved in lipid storage and LD size regulation. These findings have made significant contributions to understanding the mechanism of LD size regulation. However, the mechanisms that regulate and functionally integrate the activity of these genes under different physiological and pathological conditions need to be further explored (Yao, Y., 2018).
Chromatin remodeling is an essential process for transcriptional regulation. The chromatin structure is rearranged to allow transcriptional regulatory proteins to access DNA that is wrapped around histones. Chromatin remodeling and nucleosome occupancy changes are associated with adipocyte differentiation and lipid homeostasis. In addition, histone modification changes are important for adipocyte differentiation, lipogenesis, and hepatic steatosis. However, the specific roles of chromatin remodeling in LD size regulation have not yet been fully explored (Yao, Y., 2018).
Through a genetic gain-of-function screen in Drosophila, this study identified an LD size regulator, MRT, which promotes LD coalescence. The regulatory effect of MRT on LD size requires the MRT-interacting proteins PZG and components of the nucleosome remodeling factor (NURF) complex. Together, these findings reveal that an MRT-PZG-NURF axis regulates the size of LDs (Yao, Y., 2018).
The LD coalescence process regulated by MRT and PLIN1 displays some interesting features. First, LDs were isolated from Drosophila larval fat bodies, and the LDs were kept in PBS buffer without adding ATP. Therefore, the LD coalescence process does not rely on an external energy supply. Second, unlike in the gradual LD fusion process induced by proteins, lipid transfer between LDs was not noticed during their contact time in most cases, and coalescence happened instantaneously. Third, the paired LDs sometimes separated, indicating that the LD-LD contact is not as stable as the protein-mediated LD-LD contact. Lastly, coalescence does not obviously depend on LD size or the size ratio of the LD pair. All of these features are quite consistent with the traits of LD coalescence triggered by membrane instability (Yao, Y., 2018).
The elevated LD coalescence rate may result from increased levels of phosphatidic acid (PA) or phosphatidylethanolamine (PE) and reduced levels of phosphatidylcholine (PC). mrt overexpression reduces the PC content in the fat body, which is consistent with the decreased mRNA levels of ck and Cct1 that are important for PC synthesis. However, overexpression of Cct1 does not suppress the large LD phenotype caused by mrt overexpression. This suggests that either the reduced level of Cct1 is not the main reason for the formation of large LDs or that other factors are also required. Along the same lines, it remains to be addressed whether there is a link between Drosophila PLIN1 and the levels of PA or PC (Yao, Y., 2018).
The transcriptional regulation of lipid metabolism, including adipogenesis, lipogenesis, and fatty acid oxidation, is being intensively studied. Compared to other aspects of lipid metabolism, the mechanisms for transcriptional regulation of LD size have been reported in only a few cases. PPARs and LXR-α transcriptionally regulate the expression of perilipin genes. In C. elegans, DAF-12 (homolog of vitamin D receptor [VDR]/LXR) promotes the thermosensitive formation of supersized LDs. In Drosophila, loss of function of TATA-binding protein [TBP]-related factor 2 (TRF2) significantly increases LD size by affecting the transcription of genes related to peroxisomal fatty acid β-oxidation. Interestingly, PZG has also been identified in the TRF2-containing complex. It appears to be contradictory that two complexes, TRF2-PZG and MRT-PZG, which share the same component, have opposite effects on LD size. It is possible that PZG negatively regulates TRF2 in LD size regulation. Alternatively, because PZG has TRF2-dependent and TRF2-independent functions, it is also possible that PZG is not required for TRF2-dependent LD size regulation or that the TRF2-PZG and MRT-PZG complexes have totally different targets in LD size regulation (Yao, Y., 2018).
As the determinants of the accessibility of transcription factors to their target promoters, chromatin-remodeling factors also regulate lipid metabolism. Both ATP-dependent chromatin-remodeling factors and histone modification factors contribute to lipid homeostasis in mammals. As subunits of the SWI/SNF complexes, BAF60a and BAF60c control hepatic lipid metabolism by activating the transcription of fatty acid oxidation genes or lipogenic genes in response to different nutrient conditions and hormone signals. Other studies have shown that histone deacetylases, including HDAC1 and HDAC3, are implicated in regulating adipocyte differentiation, lipogenesis, and hepatic steatosis (Yao, Y., 2018).
This study showed that MRT and its partner proteins the PZG and NURF complex components regulate LD size. MRT, with the help of the PZG and NURF complex, likely represses the transcription of its downstream targets to regulate LD size. plin1 is probably one of many MRT target genes, and further genomic and transcriptomic approaches may provide a global view of MRT-mediated transcription regulation. The relatively mild loss-of-function phenotype of mrt indicates that MRT, together with chromatin-remodeling complexes, likely modulates or balances the accessibility of promoters of LD size regulators, such as plin1. Moreover, the NURF complex belongs to the ISWI chromatin-remodeling family, and besides ISWI, there are three other ATP-dependent chromatin-remodeling families: SWI/SNF, CHD, and INO80. It remains to be determined whether other chromatin-remodeling complexes participate in LD size regulation and how they are functionally related to the MRT-PZG-NURF axis (Yao, Y., 2018).
The current findings suggest that a balance between 'open' and 'closed' chromatin states is important for the transcriptional regulation of key LD size regulators. The MRT-PZG-NURF axis is involved in the regulation of key factors such as PLIN1 through the 'closed' state. Identifying the regulators of the 'open' state will ultimately reveal the full picture of transcriptional regulation of LD size regulation at the chromatin level (Yao, Y., 2018).
Mutations in several genes encoding components of the SWI/SNF chromatin remodeling complex cause neurodevelopmental disorders (NDDs). This study reports on five individuals with mutations in SMARCD1; the individuals present with developmental delay, intellectual disability, hypotonia, feeding difficulties, and small hands and feet. Trio exome sequencing proved the mutations to be de novo in four of the five individuals. Although the individuals presented in this study have dysmorphisms and some clinical overlap with these syndromes, they lack their typical facial dysmorphisms. To gain insight into the function of SMARCD1 in neurons, this study investigated the Drosophila ortholog Bap60 in postmitotic memory-forming neurons of the adult Drosophila mushroom body (MB). Targeted knockdown of Bap60 in the MB of adult flies causes defects in long-term memory. Mushroom-body-specific transcriptome analysis revealed that Bap60 is required for context-dependent expression of genes involved in neuron function and development in juvenile flies when synaptic connections are actively being formed in response to experience. Taken together, this study identified an NDD caused by SMARCD1 mutations and establish a role for the SMARCD1 ortholog Bap60 in the regulation of neurodevelopmental genes during a critical time window of juvenile adult brain development when neuronal circuits that are required for learning and memory are formed (Nixon, 2019).
The Drosophila ovary is recognized as a powerful model to study stem cell self-renewal and differentiation. Decapentaplegic (Dpp) is secreted from the germline stem cell (GSC) niche to activate Bone Morphogenic Protein (BMP) signaling in GSCs for their self-renewal and is restricted in the differentiation niche for daughter cell differentiation. This study reports that Switch/sucrose non-fermentable (SWI/SNF) component Osa depletion in escort cells (ECs) results in a blockage of GSC progeny differentiation. Further molecular and genetic analyses suggest that the defective germline differentiation is partially attributed to the elevated dpp transcription in ECs. Moreover, ectopic Engrailed (En) expression in osa-depleted ECs partially contributes to upregulated dpp transcription. Furthermore, it was shown that Osa regulates germline differentiation in a Brahma (Brm)-associated protein (BAP)-complex-dependent manner. Additionally, the loss of EC long cellular processes upon osa depletion may also partly contribute to the germline differentiation defect. Taken together, these data suggest that the epigenetic factor Osa plays an important role in controlling EC characteristics and germline lineage differentiation (Hu, 2021).
In this work, a genome-wide study of the zinc finger protein CG9890 was conducted and showed that it is localized mostly on the promoters of active genes. The CG9890 binding sites are low-nucleosome-density regions and are colocalized with the chromatin modifying and remodeling complexes SAGA and dSWI/SNF, as well as with the ORC replication complex. The CG9890 protein was shown to be involved in the regulation of the expression of some genes on the promoters of which it is located, with the ecdysone cascade genes accounting for a significant percentage of these genes. Thus, the CG9890 protein is a new member of the transcriptional network which is localized on active promoters, interacts with the main transcription and replication complexes, and is involved in the regulation of both basal and inducible transcription (Fursova, 2020).
A major challenge in biology is to understand how complex gene expression patterns are encoded in the genome. While transcriptional enhancers have been studied extensively, few transcriptional silencers have been identified, and they remain poorly understood. This study used a novel strategy to screen hundreds of sequences for tissue-specific silencer activity in whole Drosophila embryos. Almost all of the transcriptional silencers that were identified were also active enhancers in other cellular contexts. These elements are bound by more transcription factors than non-silencers. A subset of these silencers forms long-range contacts with promoters. Deletion of a silencer caused derepression of its target gene. These results challenge the common practice of treating enhancers and silencers as separate classes of regulatory elements and suggest the possibility that thousands or more bifunctional CRMs remain to be discovered in Drosophila and in humans (Gisselbrecht, 2019).
This study has adapted an enhancer-fluorescence-activated cell sorting-sequencing (FACS-seq) technology for highly parallel screening of elements for enhancer activity in Drosophila embryos (Gisselbrecht, 2013) into silencer-FACS-seq (sFS) technology, which enriches for elements that tissue specifically silence reporter gene expression. Briefly, this study generated a reporter vector, pSFSdist, which drives GFP expression under the control of an element from a library of candidate silencers, positioned at least 100 bp upstream of a strong, ubiquitous enhancer (ChIPCRM2078; Gisselbrecht, 2013). This vector contains a target sequence for a site-specific recombinase, permitting assaying all of the tested elements in the same genomically integrated context. Flies carrying single insertions from the reporter library are crossed to a strain in which the expression of the exogenous marker protein rat CD2 is driven in a tissue or cell type of interest, and the resulting informative embryos are dissociated to produce a single-cell suspension. By sorting for CD2+ cells in which GFP expression is reduced from the level driven by the strong ubiquitous enhancer in the absence of silencing activity, cells were enriched for that contained silencers active in the cell type of interest, which were then recover and identify by high-throughput sequencing. Insertion of an element with known mesodermal silencing activity into this vector consistently yielded a larger fraction of CD2+GFPreduced cells) than were observed when a negative control sequence (derived from Escherichia coli genomic DNA) was used (Gisselbrecht, 2019).
A library of 591 genomic elements, chosen to represent a variety of chromatin states or enhancer activity patterns, was designed to test for silencer activity in the embryonic mesoderm at stages 11-12 (5.5-7.5 h after embryo deposition). Since general features of silencers are unknown, this library was designed to test 3 main hypotheses about what kinds of sequences act as silencers in this developmental context (Gisselbrecht, 2019).
First, it was noted that 2 bifunctional CRMs had been identified previously in Drosophila that function as enhancers in one context and as silencers in other contexts. As this phenomenon is known to occur in multiple eukaryotic systems from a small number of examples and could be important in understanding the architecture of regulatory DNA, it was desirable to assess the generality of this phenomenon. Therefore, CRMs were selected from the REDfly and CAD2 databases that exhibited no or highly restricted mesodermal expression at embryonic stage 11. Elements associated with genes that show widespread mesodermal expression at this stage were filtered out. Second, a potential mechanistic signature of transcriptional silencers is the binding of well-characterized transcriptional corepressors, by analogy to the prediction of enhancers by binding of the coactivator CBP. Therefore genomic elements identified by ChIP as binding sites for the co-repressors Groucho or CtBP were selected. As Groucho has canonically been associated with long-range repression and CtBP has been associated with short-range repression, it was predicted that Groucho binding sites would be a richer source of silencer activity in this assay, in which candidate silencers were placed >100 bp upstream of the enhancer driving reporter gene activity (Gisselbrecht, 2019).
Third, genomic regions were selected that were associated with the markers of both enhancers and repressed chromatin structure in whole-mesoderm or whole-embryo experiments. It was reasoned that silencers are active regulatory elements, distinct from the silenced chromatin that results from their activity, yet must recruit factors that exert repressive functions, and therefore may show association with both classes of chromatin marks. Moreover, these 'bivalent' chromatin states may represent sequences of the above-mentioned type, that act as enhancers in one cell type and as silencers in another. In this class, two sets of sequences were included: (1) DNase I hypersensitive sites (DHSs) that colocalize with ChIP signal for the well-studied repressive chromatin mark histone H3 trimethyllysine 27 (H3K27me3) in sorted mesoderm, and (2) coincident mesodermal peaks for H3K27me3, the canonical enhancer mark histone H3 monomethyllysine 4 (H3K4me1), and histone H3 acetyllysine 27 (H3K27ac), which has been associated with active enhancers and promoters. All of the sequences identified from genome-wide ChIP methods were filtered for the absence of widespread mesodermal expression of associated nearby genes (Gisselbrecht, 2019).
Finally, 15 sequences were included for which enhancer chromosomal contact sequencing (4C-seq) data for sorted mesodermal cells were available. Three positive control sequences were included that were previously shown to have mesodermal silencing activity and two types of putative negative controls were included: broadly active mesodermal enhancers and length-matched regions of the E. coli genomic sequence (Gisselbrecht, 2019).
The library of genomic elements was screened for silencer activity in the embryonic mesoderm in 2 rounds. Testing of this library yielded a readily detectable population of mesodermal cells in which GFP expression was reduced; the elements enriched in these cells are referred to as sFS+ elements. Of the 591 sequence elements chosen for inclusion in this library, 501 were genomically integrated into transgenic flies after injection of the pooled library (Gisselbrecht, 2019).
Overlap with transcription start sites (TSSs) were identified as a highly enriched feature of sFS+ elements, which likely reveals the presence of promoter competition. Competition among promoters for association with active enhancers is one mechanism that has been proposed to account for the specificity with which enhancers target genes for activation and has been shown to restrict enhancer-driven activation of gene expression in reporter assays. Overall, the initial set of 41 'hits' that overlapped promoter regions was significantly enriched for mapped instances of the TATA box. While these are technical positives in the silencer screen, since the goal of this study was to analyze CRMs that silence gene expression by other means, any sequences that overlapped promoter regions were omitted from subsequent analyses. Moreover, many of the library elements tested overlapped other library elements; these were mergend for downstream analysis. After filtering to remove elements that overlapped promoters and collapsing overlapping genomic regions, 29 of a total of 352 genomic regions tested for mesodermal silencer activity were positive in the sFS screen (Gisselbrecht, 2019).
To validate the results from the sFS screen, pure transgenic lines were generated from a subset of library elements, and then their silencer activity was assayed by FACS analysis of embryos resulting from these individual reporter strains. Next, whether the silencers detected by the sFS screen could also silence the activity of enhancers other than the strong, ubiquitous enhancer used in the sFS screen was investigated. Thus, silencing by several sFS+ elements was assessed visually by placing these elements upstream of the following mesoderm-specific enhancers and imaging the resulting GFP expression. ChIPCRM2613 is an intronic enhancer of the pan-mesodermal gene heartless, and it drives reporter gene expression throughout the presumptive mesoderm from the beginning of gastrulation (Gisselbrecht, 2013). The Mef2 I-ED5 enhancer drives expression in the fusion-competent myoblasts beginning in late stage 11. In every case examined (15 of 15), at least one of these additional enhancers showed reduced activity in the mesoderm in the context of the tested element. These results not only verify the silencing activity of these sFS+ elements but they also demonstrate that silencers are not specific for a particular enhancer (Gisselbrecht, 2019).
The resulting set of mesodermal silencers was analyzed to determine which genomic features that were explicitly sampled in the design of the element library were predictive of silencer activity. Despite the inclusion in the sFS library of ChIP peaks for transcriptional co-repressors and for a repressive chromatin mark, the only screened element types that were significantly enriched among the active mesodermal silencers were positive controls and non-mesodermal enhancers. In fact, 22 of 29 regions containing mesodermal silencers had been previously reported to have enhancer activity. Testing of the remaining 7 silencers for enhancer activity revealed that 6 of 7 also function as enhancers in the embryo: 5 of 6 were entirely non-mesodermal, while 1 showed restricted mesodermal expression. In total, 28 of 29 of the elements were found to act as mesodermal silencers also exhibited enhancer activity in a different cellular context. Overall, >10% (26 of 200) of the previously known enhancers tested in the assay exhibited mesodermal silencer activity. This constitutes more bifunctional transcriptional regulatory elements than were previously known across all biological systems (Gisselbrecht, 2019).
A different class of bifunctional CRMs was reported recently (Erceg, 2017), in which developmental enhancers have an additional function as Polycomb response elements (PREs). PREs provide genomic binding sites for sequence-specific DNA binding proteins that recruit protein subunits of Polycomb repressive complexes and could, in principle, play a role in silencing target genes. Therefore, this study tested the hypothesis that the silencer activity of enhancers discovered in the current assay resulted from PRE activity. Only 4 of 29 regions displaying mesodermal silencer activity overlapped PREs as defined by Erceg (2017) on the basis of ChIP for the PRE-binding factors Pho and dSfmbt, versus 24 of 323 mesodermal non-silencers, indicating that PREs are not a major source of mesodermal silencers (Gisselbrecht, 2019).
The results suggest a view of enhancers as CRMs with distinct spatiotemporal patterns of both activation and repression. To further assess the generality of this phenomenon, the effects of a subset of the newly discovered mesodermal silencers was visualized on enhancers that are active broadly in the mesoderm at different developmental stages. This enabled simultaneous evaluation of a variety of spatiotemporal domains of silencer and enhancer activity (Gisselbrecht, 2019).
Several elements exhibited apparently uniform silencing activity across the mesoderm and at different stages. The most commonly observed temporal pattern was a lack of silencing activity at gastrulation and strong silencing during the later stages at which sFS was performed. An element, hkb_0.6kbRIRV, was also observed that silenced much more strongly earlier than later in embryonic development; this element simultaneously acted as an enhancer in its previously characterized pattern in the midgut primordia. One element, the oc otd186 enhancer, which was observed to drive expression in the head, exhibited spatially patterned silencing within the mesoderm during gastrulation but not later in development; silencing was moderate in the anterior portion of the germband, but much stronger in the posterior portion, as seen in the context of 2 different early pan-mesodermal enhancers. Two different later-acting mesodermal enhancers showed moderate, uniform silencing across the anteroposterior extent of the germband. Finally, a tested element exhibited enhancer-specific silencing activity. The lz crystal cell enhancer was a moderately weak silencer when tested on different late-acting mesodermal enhancers and a strong silencer at gastrulation in the context of ChIPCRM2613, yet it completely failed to silence activity driven during gastrulation by ChIPCRM7759. These results highlight that silencers exhibit a similarly diverse range of spatiotemporal regulatory patterns as those of enhancers (Gisselbrecht, 2019).
To investigate whether the silencer activity observed in reporter assays reflects the activity of the putative silencer in its native chromosomal context, the average mesodermal transcription was profiled in the genomic neighborhood of silencers or other functional elements. Using published RNA-seq data from sorted mesodermal cells, reads were aggregated within 500-bp windows over a 25-kb region centered on each element, representing the typical size of chromatin state domains observed in a high-resolution Drosophila Hi-C experiment. Then the results were averaged over all of the elements in a class to create a meta-profile of transcript levels surrounding each class of cis-regulatory elements. As expected, transcription near a previously published set of mesodermal enhancers (Gisselbrecht, 2013) is elevated. In contrast, transcription near silencers is below the baseline level of transcription observed near a negative control set of genomic regions. Both effects decay to background levels in the meta-profiles over a scale of ~5 kb, suggesting that silencers act within approximately the same distance range as transcriptional enhancers. The bifunctional PREs reported by Erceg (2017) are also associated with strongly elevated transcription, but this effect appears to spread more broadly on the chromatin domain scale, suggesting that bifunctional PREs may act by a mechanism that is distinct from that of the silencers identified in the sFS screen (Gisselbrecht, 2019).
Next, to further demonstrate the functional importance of the silencers that were identified in the sFS screen, CRISPR-Cas9 genome editing was used to generate a Drosophila strain containing a deletion of the hkb_0.6RIRV element. This element was originally reported as an enhancer driving the expression of the gap gene huckebein (hkb) at the termini of the blastoderm embryo, and this study identified it as a mesodermal silencer in the screen. Mesodermal cells from embryos homozygous for this deletion and from wild-type control embryos were screened and hkb RNA was found to be significantly upregulated in the homozygous mutant mesoderm, which supports a role for this element in silencing its endogenous target gene during normal embryonic development (Gisselbrecht, 2019).
Various types of epigenomic features, including chromatin accessibility, post-translational modifications of histones, and occupancy by TFs and chromatin-modifying enzymes, have been associated with different categories of functional elements in the genome, such as active promoters and enhancers. However, relatively little is known about the chromatin features of active silencers. Therefore the epigenomic environment at the set of 29 mesodermal silencers was explored by assessing the enrichment or depletion of signal from various published epigenomic datasets, as compared to elements that did not display silencer activity in the sFS screen (Gisselbrecht, 2019).
It was hypothesized that since bifunctional elements are more functionally complex than CRMs that act only as enhancers, they may exhibit a more complex suite of TF interactions across various tissues. This study observed that validated silencers are strongly enriched for overlap with highly occupied target (HOT) regions as exceeding a TF complexity score threshold of ~10 overlapping bound factors. Since silencing activity is likely mediated through the effects of bound sequence-specific transcriptional repressors, the set of 29 mesodermal silencers was searched for enriched combinations of evolutionarily conserved DNA binding site motif occurrences for TFs annotated as repressors. The only motif combination that was significantly enriched among silencers was a 3-way combination of the motifs for the TFs Snail, Dorsal, and Tramtrack-PF, which were found together in 12 of 29 mesodermal silencers (Gisselbrecht, 2019).
Snail is a well-known repressor of non-mesodermal genes in the developing mesoderm. Dorsal and Tramtrack have also been shown to play roles in mesodermal gene repression. Analysis of ChIP data for Snail revealed significant enrichment for Snail occupancy at silencers. To validate that this enrichment reflects Snail activity at silencers, predicted Snail binding sites were mutated in 4 silencer elements with high levels of Snail ChIP signal, and the silencer activity of the mutant was compared to wild-type sequences within whole embryos in the FACS-based reporter assay. All 4 elements showed significantly reduced silencer activity. Mutating sites for an unrelated TF, as a negative control, caused no significant reduction in silencer activity. While Snail has been well characterized as a short-range repressor acting within 150 bp, all of the Snail binding sites that were found to be required for full silencer activity are >400 bp away from the silenced enhancer in the reporter construct, indicating that Snail can act as a repressor at distances longer than those described for short-range repression (Gisselbrecht, 2019).
Finally, the evidence was examined for the direct action of Snail binding to silencers on the expression of the silencers' endogenous target genes. Since the majority of elements exhibiting silencer activity in this study were originally reported as enhancers, it was possible to identify published target genes of these bifunctional elements and examine the effect of loss of snail function on their expression. Target genes of 12 elements bound by Snail in ChIP-seq data showed significant derepression in sna mutant embryos. In contrast, no target of any of the 14 Snail-unbound elements was significantly derepressed. Therefore, it is concluded that the known role of Snail in mesodermal repression explains the activity of a large minority (41%) of the observed silencers, while the transcriptional regulators mediating silencing activity through the majority of the silencers remain to be determined (Gisselbrecht, 2019).
In an attempt to identify a 'silencer signature' that is analogous to the previously described chromatin signatures of enhancers and promoters, published ChIP data was assembled from whole embryos or from sorted mesoderm, where available, for several chromatin marks previously associated with active or repressed chromatin states, and hierarchical clustering was performed of all 352 tested genomic regions according to these histone modification chromatin profiles. As expected, clusters of elements with greater signal for the repressive marks H3K27me3 and H3K9me3 are enriched for silencers, while other clusters are depleted of silencers. It was also observed, however, that many non-silencers belonged to these clusters, and that some silencers belonged to other clusters that were instead enriched for non-silencers, suggesting that these commonly profiled chromatin marks do not constitute a general chromatin signature of silencers. Similarly, neither the Groucho nor the CtBP co-repressors were significantly enriched at silencers (Gisselbrecht, 2019).
For a subset of silencers (18 of 29), the individual FACS validation data provided a measure of the strength of silencer activity, in terms of the percentage of cells in the GFPreduced population. Using these quantitative estimates of silencer activity, it was found that H3K27me3 and H3K9ac, a mark that previously had been associated with bivalent promoters and active enhancers, are significantly correlated with silencer strength across these 18 elements, possibly reflecting the fact that mesodermal silencers are active enhancers in other cellular contexts. No single mark or combination of marks that was tested from among the publicly available histone modification profiles accurately discriminates active silencers as a whole from other types of cis-elements (Gisselbrecht, 2019).
It has been well established that enhancers can act directly on their target promoters by looping to create direct, 3-dimensional (3D) physical contacts between genomic elements widely separated in sequence space. Such contacts have also been shown to play a role in repression by heterochromatin and at PREs. Silencers could, in principle, act directly to recruit repressive activities to regulated promoters, or alternatively by sequestering enhancers that would otherwise interact with promoters, or by other mechanisms that do not involve focal contacts to regulated elements, such as nucleating a repressive chromatin state that spreads along the chromosome (Gisselbrecht, 2019).
Therefore data was examined from assays of genomic contacts based on proximity ligation to attempt to distinguish among these hypotheses. Mesodermal silencer activity was observed in a CRM previously characterized by circular chromosome conformation capture (4C). This element makes mesodermally enriched contacts with 2 regions that overlap the promoters of genes that are not expressed in the mesoderm, suggesting the possibility that silencing may be mediated by direct silencer-promoter looping (Gisselbrecht, 2019).
To test the generality of this potential mechanism, Hi-C data was generated from sorted mesodermal and non-mesodermal cells at embryonic stages 11-12, the same developmental stages that was assayed by sFS. Mesodermally enriched contacts were observed at 1-kb resolution in each of 2 paired replicates using the chromoR package and compared the results to the mesodermally enriched 4C contacts previously reported. The frequency of random contacts observed in Hi-C assays is greater with closer genomic distance. Therefore, to control for such nonspecific interactions in this analysis, negative control sets of 'contact regions' were generated by reflecting each observed contact around the viewpoint region. Each Hi-C replicate showed significantly greater overlap with 4C contacts than with negative control regions, indicating that the Hi-C contacts agree with published genomic contacts (Gisselbrecht, 2019).
The features of these mesodermally enriched silencer contacts were examined, as these are potential targets of silencing activity. A list was created of these potential targets by filtering for contacts that were observed in both of the Hi-C replicates and that overlap sFS-tested elements, and then the (epi)genomic features of these regions were examined. Since the TF Snail (Sna) has previously been associated with short-range repression and 'antilooping', the features of regions contacted by Sna-bound mesodermal silencers, Sna-unbound mesodermal silencers, and elements that did not act as silencers in mesoderm were compared. Regions that made mesoderm-specific contacts to Sna-unbound mesodermal silencers are significantly enriched for overlapping TSSs, as compared to those contacting either Sna-bound silencers or non-silencers, indicating that the Sna-unbound mesodermal silencers contact promoters. This suggests that Snail-unbound silencers are targeted primarily by long-range repressors, whereas Snail-bound silencers show almost no promoter contacts, which is consistent with previous reports of short-range repression and antilooping associated with this TF (Gisselbrecht, 2019).
Next, whether the expression levels of silencer-contacted genes were consistent with silencer activity was inspected. For genes whose promoters made contact with any of the 352 tested library elements, RNA-seq data from sorted mesodermal cells was compared with the elements' histone marks that showed significant correlation with silencer activity. The level of H3K27me3 found at a library element was significantly anticorrelated with the mesodermal expression of genes contacted by that element, supporting the model of contact-based repression by silencers (Gisselbrecht, 2019).
Attempts were made to test the alternate model that silencers directly contact enhancers that would otherwise be active. Thus, the contacted regions were examined for overlap with CRMs that have been reported to act as enhancers, according to the REDfly database. Each of the 3 sets of library elements (Sna-bound silencers, Sna-unbound silencers, and non-silencers) were separately tested, and no significant enrichment or depletion of CRM contact was observed in any of the sets. It was further reasoned that direct action by silencers on enhancers would result in enrichment of the enhancer mark H3K4me1 in regions that contact silencers. In this scenario, this enrichment should be apparent in histone mark data from whole embryos and across a broad range of time points, reflecting their enhancer activity in non-mesodermal tissues and/or other developmental stages. Instead a significant depletion of H3K4me1 was observed at Sna-unbound silencer contacts versus non-silencer contacts, which does not support the model of silencers interacting directly with enhancers. The results support models in which distinct classes of transcriptional silencers act by antilooping or by acting directly on the promoters of repressed genes. While the existence of silencers that may sequester enhancers from contacting promoters cannot be ruled out, the results do not support this alternate mechanism (Gisselbrecht, 2019).
This study has developed a highly parallel reporter assay carried out in whole, developing animals to identify a set of transcriptional silencers on the basis of their tissue-specific function. Analysis of RNA-seq data indicated that genes located near these silencers in their endogenous context are expressed at lower levels. Deletion of 1 of these elements at its native genomic locus by CRISPR-Cas9 genome editing demonstrated the importance of that element for the proper expression level of its target gene. This study also integrated a wide variety of data types from previously published datasets, including ChIP of histone modifications and specific factors, with newly generated tissue-specific 3D chromosomal physical interaction data to assess enriched features of the set of tissue-specific silencers and has explored potential mechanisms (Gisselbrecht, 2019).
Many enhancers were found in fact to be bifunctional elements, capable of up- and downregulating gene expression in different cellular contexts. While this phenomenon has been observed previously in studies of individual regulatory elements, the extent of CRM bifunctionality had not been appreciated. It is important to note that many CRMs that failed to show silencing activity in the screen are known enhancers that are not active in the tissue tested. Silencers are therefore an identifiable set of active elements, distinct from 'quenched' or inactive enhancers that neither activate nor repress gene expression (Gisselbrecht, 2019).
While prior studies have found histone modifications associated with enhancer activity, this study suggests that despite the extensive genome-scale ChIP profiling studies by numerous investigators and consortia, the available chromatin profiling data are not sufficient to identify silencers. This is possibly explained by the existence of various silencer classes. Alternatively, there are dozens of chromatin marks that have not been characterized extensively that may mark silencers. Expanded efforts in profiling larger sets of tissue-specific chromatin marks may reveal a signature of active silencers. Similarly, it was surprising that co-repressor occupancy was not a good predictor of silencers. One potential explanation is that many of these elements may be silencers in other cell types or at other developmental stages than were assayed in this study, since co-repressor ChIP data were generated in whole embryos across a broad range of ages. Another possibility is that different subclasses of CRMs with silencer activity may be endowed with subclass-specific chromatin and/or TF signatures. The list of 29 silencers discovered by the sFS assay provides a training set that can be used for the further study of regulatory features that govern silencing (Gisselbrecht, 2019).
Enriched Snail binding was observed at a subset of mesodermal silencers. Snail is a well-characterized short-range repressor protein acting in the mesoderm, and it has been proposed to act by preventing regulated elements from looping to promoters. The current results are consistent with this general model; however, the effects of Snail repression spread over hundreds of base pairs and into neighboring regulatory elements in the reporter assay, in contrast to previously reported limits of short-range repression. Thus, the results indicate that Snail can act by different modes of repression, which had not been observed previously (Gisselbrecht, 2019).
This study provides evidence supporting a model of silencer activity in which a subset of silencers makes direct 3D contacts with the promoters of regulated genes. These physical interactions are important to consider when interpreting genome-wide maps of chromosome conformation. Not all promoter-interacting regions will act as enhancers, and it will be necessary to develop approaches that integrate a wide range of genomic data types to identify and functionally characterize cis-regulatory elements, including distinguishing those acting as enhancers versus silencers (Gisselbrecht, 2019).
It has recently been shown that many developmental enhancers also act as PREs. Despite some common features, including evidence for looping to target promoters, this set of bifunctional enhancer elements is nearly distinct from the elements this study has characterized that act as both enhancers and silencers, and appears to act by distinct mechanisms. It was previously reported that a Drosophila insulator element has a second role in mediating long-range enhancer-promoter interactions. It is suggested that a taxonomy of regulatory elements as enhancers, silencers, insulators, and so forth is likely an oversimplification, and that it is more useful to think generally of CRMs, which can activate and repress, recruit chromatin modifiers and remodelers, and/or structure the 3D genome in a context-sensitive fashion (Gisselbrecht, 2019).
It has been estimated that there may be >50,000 enhancers in the D. melanogaster genome. This study has detected mesodermal silencer activity in >10% of tested non-mesodermal enhancers. If these elements are representative of the broader enhancer population, then this result suggests that there may be thousands of such bifunctional elements across a range of tissues in Drosophila; since many of the elements were tested could be silencers in a cell type that was not examined or at a later developmental stage, these numbers are likely even higher. The sFS approach could be adapted in future studies to screen for bifunctional elements in mammals (Gisselbrecht, 2019).
These results suggest that most, if not all, silencers are also enhancers in a different cell type. CRM bifunctionality complicates the understanding of how gene regulation is specified in the genome and how it is read out in different cell types. The observation that the vast majority of complex trait- and disease-associated variants identified from genome-wide association studies (GWASs) map to noncoding sequences, most of which occur within DNase I hypersensitive sites, emphasizes the importance of understanding these elements. The characterization of bifunctional elements will help in elucidating how precise gene expression patterns are encoded in the genome and aid in the interpretation of cis-regulatory variation (Gisselbrecht, 2019).
The nuclear pore complex (NPC) tethers chromatin to create an environment for gene regulation, but little is known about how this activity is regulated to avoid excessive tethering of the genome.
Tethering specific genomic loci to the NPC appears to contribute to transcriptional activation. Also, the NPC has been further implicated in creating a repressive environment or retaining genes at the periphery after repression, possibly contributing to epigenetic transcriptional memory.
This paper proposes a negative regulatory loop within the NPC controlling the chromatin attachment state, in which Nup155 and Nup93 recruit Nup62 to suppress chromatin tethering by Nup155. Depletion of Nup62 severely disrupts chromatin distribution in the nuclei of female germlines and somatic cells, which can be reversed by codepleting Nup155. See a model for the chromatin attachment state controlled by an internal regulatory circuit in the NPC. Thus, this universal regulatory system within the NPC is crucial to control large-scale chromatin organization in the nucleus (Breuer, 2015).
Cytological study of the chromatin attachment state to the nuclear envelope is experimentally challenging, as chromatin usually occupies the entire nucleus. However, meiotic chromatin becomes fully detached from the nuclear envelope and compacted into a spherical structure, the karyosome, after recombination in Drosophila oocytes. Chromatin detachment and karyosome formation are crucial to make a single spindle and allow subsequent chromosome segregation and are conserved features also seen in mammalian oocytes. By taking advantage of this unique nuclear organization in oocytes, this study sought factors required for chromatin detachment from the nuclear envelope by individually knocking down various nuclear proteins in the female germline (the oocyte and nurse cells) by RNAi (Breuer, 2015).
Strikingly, the depletion of either of two nuclear pore proteins, Nup62 or Nup93, led to disruption of the compact karyosome morphology, while the depletion of several other pore proteins did not. The chromatin shifted near the nuclear periphery, resulting in strong (Nup62) or partial (Nup93) overlap with a nuclear pore marker in the oocytes in comparison with the control RNAi, which was confirmed by superresolution microscopy. Nup93 is a linker scaffold protein known to be required for the recruitment of Nup62, one of the central channel proteins containing FG repeats (Sachdev, 2012). It was confirmed that the defect is not an off-target effect by rescue experiments using RNAi-resistant transgenes. In addition, similar karyosome defects were observed in female sterile Nup62 mutants. In nurse cells (polytenized germline cells that support oocyte growth), chromatin also distributed irregularly and more toward the nuclear periphery after RNAi of these genes. This demonstrates a general role for both genes in global chromatin organization rather than being restricted to oocytes (Breuer, 2015).
To identify the cause of the karyosome defect upon Nup62 or Nup93 RNAi, the structural integrity and transport function of NPC was tested. RNAi of Nup62 or Nup93 did not disrupt the overall structural integrity of the NPC, as judged by the localization of FG-containing subunits and the core scaffold subunit Nup107. The active import function of the NPC showed small differences as assessed by fluorescence recovery after photobleaching (FRAP) of GFP fused with a nuclear localization signal (NLS). There was a significant increase in the nuclear size of early oocytes, which may be caused by a reduced ability of the nuclear pore to act as a diffusion barrier (Breuer, 2015).
Next, a relationship was examined with the meiotic recombination checkpoint, which is known to disrupt karyosome formation in the presence of unrepaired double-strand breaks (DSBs) in oocytes. Inactivation of the checkpoint did not suppress the karyosome defects of Nup62 or Nup93 RNAi, demonstrating that the defect is independent of the meiotic recombination checkpoint in oocytes (Breuer, 2015).
Considering the above results, it was hypothesized that chromatin is excessively anchored to the NPC in RNAi of Nup62 or Nup93. If this was the case, it was predicted that chromatin specifically interacting with the NPC must be preferentially accumulated at the nuclear periphery rather than random chromatin. In order to test this, previously identified genomic loci bound to another nuclear pore component, Nup98, were used in Drosophila S2 culture cells. Nup98 has two distinct populations -- one at nuclear pores and the other in the nucleoplasm -- that bind distinct genomic loci in S2 cells. Nurse cells were subjected to fluorescence in situ hybridization (FISH) using individual probes corresponding to genomic loci known to be associated with Nup98 within the NPC or located in the nucleoplasm in S2 cells and were costained with a DNA dye. A proportion was measured of the total DNA signals in the nuclear periphery region, defined by a distance from the nuclear lamina of <10% of the nuclear radius, which occupies ~20% of the nuclear area. In control RNAi, ~16%-17% of the total DNA (propidium iodide or DAPI signal) was located in the nuclear periphery region. For all genomic loci (three NPC-bound and four nucleoplasmic), 17%-25% of the signal foci were found in the nuclear periphery. This indicates that there is no preference for periphery locations of the total DNA or of these specific genomic regions in wild-type nurse cells. When Nup93 was knocked down, there was a small increase (from 16%-17% to 20%-24%) in the total DNA that occupies the nuclear periphery (Nup93 RNAi was used, since it gives a milder phenotype than Nup62 RNAi). Strikingly, a strong, consistent redistribution of all NPC-bound genomic loci to the periphery (from 17%-25% to >40%) was observed, whereas the nucleoplasmic loci showed smaller variable changes. The increases for NPC-bound loci were significantly higher than the increases for both total DNA and the nucleoplasmic loci, supporting the hypothesis that depletion of Nup62 or Nup93 results in an excessive attachment of specific chromatin regions to the NPC (Breuer, 2015).
The results suggest that Nup62 and Nup93 suppress the interaction between chromatin and another NPC subunit. If this is the case, codepletion of this hypothetical NPC subunit that mediates chromatin attachment to the nuclear pore should restore detachment of chromatin in Nup62- or Nup93-depleted oocytes. Several NPC subunits have previously been shown to have chromatin-binding activity, including Nup155, Nup50, and ELYS/Mel-28. Flies expressing two shRNAs were generated: one for Nup62 and the other for each of the aforementioned chromatin-binding NPC subunits, the non-chromatin-bound Nup160, or a control. Codepletion of Nup155 specifically restored normal karyosome morphology and detachment from the nuclear periphery in Nup62-depleted oocytes. Furthermore, in nurse cells, simultaneous RNAi of Nup155 also restored normal chromatin distribution caused by Nup62 RNAi. Crucially, codepletion of Nup155 did not rescue the larger nuclear size in Nup62-depleted oocytes. This demonstrates that Nup62's function on chromatin organization is independent of its function on nuclear size maintenance, which may reflect its function as a diffusion barrier (Breuer, 2015).
It was also found that single depletion of Nup155 led to a large reduction of Nup62 (one of the FG-containing subunits) from the nuclear envelope and its accumulation in the cytoplasm. However, it did not significantly reduce the total amount of the FG-containing subunits at the nuclear envelope in both Nup155 and Nup62/Nup155 double RNAi. This demonstrates that Nup155 is required for Nup62 recruitment, and the apparent rescue of the Nup62 depletion defect by Nup155 codepletion is not due to a loss of integrity or a reduced number of nuclear pores. Taken together, the results suggest a negative regulatory loop in which Nup155 recruits Nup62 to the nuclear pores, and, in turn, Nup62 suppresses chromatin anchoring by Nup155 (Breuer, 2015).
A potential negative regulatory circuit was uncovered within the NPC that controls the chromatin attachment state to the nuclear pores in the oocytes and nurse cells. Therefore, attempts were made to test whether a common regulatory system also controls chromatin organization in somatic cells. Using the Drosophila S2 cell line, Nup62 or Nup155 were depleted individually and simultaneously by RNAi. Control RNAi cells showed a relatively even distribution of chromatin within the nucleus except for a dense region that corresponds to heterochromatin. In contrast, Nup62 RNAi resulted in an uneven distribution of chromatin within the nucleus. To quantify this, the area that chromatin occupies relative to the nuclear area was measured. The cells depleted of Nup62 showed a significant decrease in chromatin occupancy compared with a control RNAi. Strikingly, double depletion of Nup62 and Nup155 showed a chromatin occupancy similar to the control. This rescue was reversed by RNAi-resistant full-length Nup155 but not by resistant Nup155 lacking the chromatin-binding region. No significant change in chromatin occupancy was observed upon Nup155 depletion alone. This demonstrated the presence of a common negative loop within the NPC that controls the global chromatin distribution between female germline cells and somatic cells (Breuer, 2015).
Recent reports described the role of the NPC to tether chromatin and thus create an environment for gene regulation. While recruitment mechanisms for specific genes have been described, very little is known about whether or how this tethering is regulated. This study makes two major conceptual advances in understanding of global chromatin organization, especially the critical role and regulation of the NPC-mediated tethering. First, it highlights a far greater role of the NPC in large-scale chromatin organization than previously anticipated. Second, it points to a universal regulatory circuit inside the NPC that controls the attachment state of chromatin to the nuclear pore. This consists of a negative regulatory loop in which chromatin-binding Nup155 recruits the central channel protein Nup62, which in turn suppresses chromatin binding. As nuclear pore components associate with the genome to positively or negatively influence gene expression, this regulatory loop might be part of a wider network for the NPC to control gene expression, depending on the cellular and developmental context. Although a genuine and direct regulatory role of this loop has yet to be demonstrated, its intrinsic capacity supplies the NPC with a key mechanism to globally or locally organize the metazoan genome. On the other hand, any change or imbalance in this regulatory network might have dramatic effects for the nuclear architecture and, concomitantly, the expression profile of the cell. This may have a significant medical implication, as nuclear pore components not only are known to deteriorate with age but are also affected in several tissue-specific human diseases (Breuer, 2015).
Satellite DNAs (satDNAs) are a ubiquitous feature of eukaryotic genomes and are usually the major components of constitutive heterochromatin. The 1.688 satDNA, also known as the 359 bp satellite, is one of the most abundant repetitive sequences in Drosophila melanogaster and has been linked to several different biological functions. This study investigated the presence and evolution of the 1.688 satDNA in 16 Drosophila genomes. The 1.688 satDNA family was found to be much more ancient than previously appreciated, being shared amongst part of the melanogaster group that diverged from a common ancestor ∼27 Mya. The 1.688 satDNA family has two major subfamilies spread throughout Drosophila phylogeny (∼360 bp and ∼190 bp). Phylogenetic analysis of ∼10,000 repeats extracted from 14 of the species revealed that the 1.688 satDNA family is present within heterochromatin and euchromatin. A high number of euchromatic repeats are gene proximal, suggesting the potential for local gene regulation. Notably, heterochromatic copies display concerted evolution and a species-specific pattern, whereas euchromatic repeats display a more typical evolutionary pattern, suggesting that chromatin domains may influence the evolution of these sequences. Overall, these data indicate the 1.688 satDNA as the most perduring satDNA family described in Drosophila phylogeny to date. This study provides a strong foundation for future work on the functional roles of 1.688 satDNA across many Drosophila species (de Lima, 2020).
Contrary to dogma, evolutionarily young and dynamic genes can encode essential functions. Rvolutionarily dynamic ZAD-ZNF genes, which encode the most abundant class of insect transcription factors, are more likely to encode essential functions in Drosophila melanogaster than ancient, conserved ZAD-ZNF genes. This study focused on the Nicknack ZAD-ZNF gene, which is evolutionarily young, poorly retained in Drosophila species, and evolves under strong positive selection. Yet it was found that it is necessary for larval development in D. melanogaster. Nicknack encodes a heterochromatin-localizing protein like its paralog Oddjob, also an evolutionarily dynamic yet essential ZAD-ZNF gene. The divergent D. simulans Nicknack protein can still localize to D. melanogaster heterochromatin and rescue viability of female but not male Nicknack-null D. melanogaster. These findings suggest that innovation for rapidly changing heterochromatin functions might generally explain the essentiality of many evolutionarily dynamic ZAD-ZNF genes in insects (Kasinathan, 2020).
Polycomb and Trithorax group proteins maintain stable epigenetic memory of gene expression states for some genes, but many targets show highly dynamic regulation. This study combined experiment and theory to examine the mechanistic basis of these different modes of regulation. A mathematical model is presented comprising a Polycomb/Trithorax response element (PRE/TRE) coupled to a promoter and including Drosophila developmental timing. The model accurately recapitulates published studies of PRE/TRE mediated epigenetic memory of both silencing and activation. With minimal parameter changes, the same model can also recapitulate experimental data for a different PRE/TRE that allows dynamic regulation of its target gene. The model predicts that both cell cycle length and PRE/TRE identity are critical for determining whether the system gives stable memory or dynamic regulation. This work provides a simple unifying framework for a rich repertoire of PRE/TRE functions, and thus provides insights into genome-wide Polycomb/Trithorax regulation (Reinig, 2020).
Polycomb group (PcG) and trithorax group (trxG) proteins are evolutionary conserved factors that contribute to cell fate determination and maintenance of cellular identities during development of multicellular organisms. The PcG maintains heritable patterns of gene silencing while trxG acts as anti-silencing factors by conserving activation of cell type specific genes. Genetic and molecular analysis has revealed extensive details about how different PcG and trxG complexes antagonize each other to maintain cell fates, however, the cellular signaling components that contribute to the preservation of gene expression by PcG/trxG remain elusive. This study reports an ex vivo kinome-wide RNAi screen in Drosophila aimed at identifying cell signaling genes that facilitate trxG in counteracting PcG mediated repression. From the list of trxG candidates, Ballchen (BALL), a histone kinase known to phosphorylate histone H2A at threonine 119 (H2AT119p), was characterized as a trxG regulator. The ball mutant exhibits strong genetic interactions with Polycomb (Pc) and trithorax (trx) mutants and loss of BALL affects expression of trxG target genes. BALL co-localizes with Trithorax on chromatin and depletion of BALL results in increased H2AK118 ubiquitination, a histone mark central to PcG mediated gene silencing. Moreover, BALL was found to substantially associate with known TRX binding sites across the genome. Genome wide distribution of BALL also overlaps with H3K4me3 and H3K27ac at actively transcribed genes. It is proposed that BALL mediated signaling positively contributes to the maintenance of gene activation by trxG in counteracting the repressive effect of PcG (Khan, 2021).
Haspin, a highly conserved kinase in eukaryotes, has been shown to be responsible for phosphorylation of histone H3 at threonine 3 (H3T3ph) during mitosis, in mammals and yeast. This study reports that Haspin is the kinase that phosphorylates H3T3 in Drosophila melanogaster and it is involved in sister chromatid cohesion during mitosis. The data reveal that Haspin also phosphorylates H3T3 in interphase. H3T3ph localizes in broad silenced domains at heterochromatin and lamin-enriched euchromatic regions. Loss of haspin compromises insulator activity in enhancer-blocking assays and triggers a decrease in nuclear size that is accompanied by changes in nuclear envelope morphology. Haspin is a suppressor of position-effect variegation involved in heterochromatin organization. These results also demonstrate that haspin is necessary for pairing-sensitive silencing and it is required for robust Polycomb-dependent homeotic gene silencing. Haspin associates with the cohesin complex in interphase, mediates Pds5 binding to chromatin and cooperates with Pds5-cohesin to modify Polycomb-dependent homeotic transformations. Therefore, this study uncovers an unanticipated role for Haspin kinase in genome organization of interphase cells and demonstrates that haspin is required for homeotic gene regulation (Fresan, 2020).
Drosophila Haspin modulates key aspects of chromatin organization during interphase: insulator activity, heterochromatin-induced position-effect variegation, nuclear morphology and PRE-dependent pairing-sensitive silencing. Some of these aspects could be influenced by mitotic events and, therefore, be regulated by Haspin functionality in chromosome organization during mitosis. However, the results also show that Haspin phosphorylates histone H3 in interphase and associates with the cohesin complex mediating Pds5 binding to chromatin in interphase cells, strongly suggesting that this kinase modulates chromatin organization not only during mitosis but also in interphase (Fresan, 2020).
Thr3 in histone H3 is located immediately adjacent to Lys4, which has been shown to be tri-methylated at active promoter sites. TFIID binding to H3K4me3, which is involved in transcription machinery recruitment, is severely reduced in mitosis as a result of H3T3 phosphorylation. On the other hand, in vitro studies have shown that tri-methylation of H3K4 reduces substrate recognition by Haspin suggesting antagonism between H3K4 methylation and H3T3 phosphorylation. In agreement with these studies, the current results show that H3T3ph localizes at heterochromatin and lamin-enriched regions, while it does not colocalize with active RNA polymerase II. Even though, future investigations will be needed to decipher whether or not H3T3ph never colocalizes with H3K4me and active transcription, altogether these studies suggest antagonism between phosphorylated H3T3 and active transcription. Moreover, Haspin is preferentially found at the nuclear lamina, like CP190 insulator protein and polycomb proteins. While proteins that are associated with transcriptionally active chromatin are easily solubilized in subcellular fractionation assays, Haspin, insulator and polycomb proteins are tightly bound to the nuclear matrix where they colocalize with laminas suggesting that nuclear organization of these proteins might contribute to their functionality (Fresan, 2020).
This study reports that haspin is a strong suppressor of position-effect variegation playing a role in heterochromatin organization. HP1 proteins, which are key components of heterochromatin, have been reported to promote haspin localization at mitotic centromeres to protect centromeric cohesion in mammals (Yi, 2018). Whether HP1 proteins promote Haspin localization at centromeric heterochromatin in interphase and whether phosphorylation of H3T3 is involved in heterochromatin organization remain to be determined (Fresan, 2020).
This study has also shown that haspin is required for robust Polycomb-dependent homeotic gene silencing based on the following observations in the absence of haspin: (1) derepression of Abd-B transcription; (2) reduction of Pc binding at several PRE-containing regulatory elements and (3) enhanced homeotic transformations in Pc mutant sensitized backgrounds. Moreover, depletion of haspin has a strong impact in pairing-sensitive silencing which involves long-range chromatin organization. The current results also show that Haspin cooperates with Pds5-cohesin to enhance Polycomb-dependent homeotic transformations. Thus, Haspin might regulate homeotic gene silencing by directly affecting the binding of PcG proteins to chromatin or by affecting Pds5-cohesin dynamics modulating chromatin organization of Polycomb domains. Recent results point to an important role for cohesin complexes in the establishment and/or maintenance of Polycomb-repressed domains in mammalian cells but also to restrict their aggregation. This study has shown that Haspin mediates Pds5-binding to chromatin in interphase and modulates cohesin association with chromatin along the cell cycle. Pds5 proteins have both positive and negative effects on cohesin association with chromatin, they cooperate with Wapl in releasing cohesin from DNA but they are also required to maintain sister-chromatid cohesion in G2/M. Pds5 interacts with Wapl, Dalmatian/Sororin, Eco/Eso acetyltransferase and Haspin through the same conserved protein-protein module [Goto, 2017; Nishiyama, 2010]. Wapl-Pds5 interaction has been shown to be counteracted by Eco and Sororin in S phase antagonizing Wapl's ability to dissociate cohesin from DNA. On the other hand, Haspin has been shown to phosphorylate Wapl (Liang, 2018) and to antagonize Wapl-Pds5 interaction to protect proper centromeric cohesion in mitosis (Goto, 2017; Liang, 2018; Zhou, 2017). Although relationship between Haspin and the cohesin complex needs to be further characterized, this work points to an important role of Haspin in the complex regulation of Pds5-cohesin dynamics along the entire cell cycle (Fresan, 2020).
It has been suggested that inhibition of Haspin could have potent anti-tumoral effects with fewer adverse effects compared with other anti-cancer agents. The current results show mitotic defects in Drosophila haspin mutants in agreement with previous reported studies in yeast and mammals (Dai, 2006; Goto, 2017). However, haspin mutants have been reported not lethal in budding yeast and in fission yeast and the current data show that they are also viable in Drosophila, even though life span and fertility are affected. Besides, these findings demonstrate that haspin is controlling genome organization of interphase cells raising concerns with respect to the use of Haspin inhibitors as potent mitosis-specific anticancer drugs (Fresan, 2020).
Mammalian SWI/SNF (mSWI/SNF) ATP-dependent chromatin remodeling complexes are multi-subunit molecular machines that play vital roles in regulating genomic architecture and are frequently disrupted in human cancer and developmental disorders. To date, the modular organization and pathways of assembly of these chromatin regulators remain unknown, presenting a major barrier to structural and functional determination. This study elucidates the architecture and assembly pathway across three classes of mSWI/SNF complexes-canonical BRG1/BRM-associated factor (BAF), polybromo-associated BAF (PBAF), and newly defined ncBAF complexes-and define the requirement of each subunit for complex formation and stability. Using affinity purification of endogenous complexes from mammalian and Drosophila cells coupled with cross-linking mass spectrometry (CX-MS) and mutagenesis, three distinct and evolutionarily conserved modules, their organization, and the temporal incorporation of these modules into each complete mSWI/SNF complex class were uncovered. Finally, human disease-associated mutations were mapped within subunits and modules, defining specific topological regions that are affected upon subunit perturbation (Mashtalir, 2018).
ATP-dependent chromatin remodeling complexes are multimeric molecular assemblies that regulate chromatin architecture. These complexes are grouped into four major families, including switching (SWI)/sucrose fermentation (sucrose non-fermenting [SNF]), INO80, ISWI (imitation SWI), and CHD/M-2 (chromodomain helicase DNA-binding) groups), all of which contain Snf2-like ATPase subunits but differ substantially via the incorporation of distinct subunits and in their targeting and activity on chromatin (Mashtalir, 2018).
SWI/SNF complexes were originally discovered and characterized in yeast, later in Drosophila, and most recently in mammals. Mammalian SWI/SNF (mSWI/SNF) complexes are ∼1- to 1.5-MDa entities combinatorially assembled from the products of 29 genes, including multiple paralogs, generating extensive diversity in composition. All complexes contain an ATPase subunit, either SMARCA4 (BRG1) or SMARCA2 (BRM), that catalyzes the hydrolysis of ATP. The roles of most other accessory subunits in complex assembly and stability as well as targeting and function remain unknown (Mashtalir, 2018).
Over the past several years, mSWI/SNF complexes have emerged as a major focus of attention because of the striking mutational frequencies in the genes encoding their subunits across a range of human diseases, from cancer to neurologic disorders. Indeed, recent exome sequencing studies have revealed that over 20% of human cancers bear mutations in the genes encoding mSWI/SNF subunits. In addition, heterozygous point mutations in mSWI/SNF genes have been implicated as causative events in intellectual disability and autism spectrum disorders (Mashtalir, 2018).
A major barrier to understanding of the functions, tissue-specific roles, and effect of mutations on mSWI/SNF complex mechanisms lies in the lack of information regarding subunit organization, assembly, and 3D structure. Several factors pose major challenges to such studies. Individually expressed subunits are often unstable or incorrectly folded without their appropriate binding partners, and minimal complexes pieced together via in vitro co-expression may not represent endogenous, physiologically relevant complexes in cells. Large quantities of purified endogenous complexes with minimal heterogeneity are required for downstream analyses, and selection of appropriate purification strategies cannot be informed without understanding modular architecture and assembly order. For these reasons and others, to date only low-resolution maps have been achieved using cryoelectron microscopy (cryo-EM) approaches (Mashtalir, 2018).
To establish a comprehensive structural framework for mSWI/SNF complexes, this study used a multifaceted series of approaches involving complex and subcomplex purification, mass spectrometry (MS), crosslinking mass spectrometry (CX-MS), systematic genetic manipulation of subunits and subunit paralog families, evolutionary analyses, and human disease genetics. These studies reveal that mSWI/SNF complexes exist in three non-redundant final form assemblies: BRG1/BRM-associated factor complexes (BAFs), polybromo-associated BAF complexes (PBAFs), and non-canonical BAFs (ncBAFs), for which this study established the assembly requirements and modular organization. This study defines the full spectrum of endogenous combinatorial possibilities and the effect of individual subunit deletions and mutations, including recurrent, previously uncharacterized missense and nonsense mutations, on complex architecture. These studies provide important insights into mSWI/SNF complex organization, structure, and function and the biochemical consequences of a wide range of human disease-associated mutations (Mashtalir, 2018).
This study presents a comprehensive architectural framework for the mSWI/SNF chromatin remodeler complex family, including the assembly pathways and inter- and intra-module linkages across three distinct complexes. Integrating multiple complex purifications with size fractionation, mutagenesis, and cross-linking mass spectrometry, this study defined intra-complex modular architecture, stoichiometry, and evolutionary relationships and explored the effects of disease-associated mutations on complex architecture and assembly.
One particularly unexpected result is that the initial core for all three mSWI/SNF family complexes is a heterotrimer consisting of two SMARCC subunits (as a dimer) and one SMARCD subunit. Although previous in vitro subunit co-purifications had suggested a 'minimal BAF complex' consisting of SMARCA4, SMARCC1, SMARCC2, and SMARCB1, this study found that neither complex assembly pathways nor cross-linking mass spectrometry profiles of full BAF or PBAF complexes implicated this tetramer as a physiologic core in mammalian cells. Indeed, these results may begin to explain the challenges that have been faced in obtaining high-resolution structural information on this complex and in using such minimal complexes for small molecule screening efforts. Importantly, the mSWI/SNF initial core is required for global complex stability and the interaction of the majority of subunits in all three mSWI/SNF complexes. Notably, the newly identified ncBAF complex assembles exclusively around a SMARCC1 and SMARCD1 initial core and lacks SMARCE1 and SMARCB1 subunits, indicating fundamental differences and/or compensation in biochemical activity (Mashtalir, 2018).
Interestingly, network modularity analyses of cross-linking mass spectrometry data place SMARCB1 in the ATPase module, whereas biochemical purification of SMARCB1 demonstrates its presence in the BAF core module. This suggests that SMARCB1 may be involved in functionally linking the core and ATPase modules, potentially modulating ATPase or remodeling activity. Indeed, SNF5 regulates the chromatin remodeling activity of the yeast complex. Although the SNF5 and SMARCB1 subunits are largely dispensable for complex integrity in both yeast and human settings, respectively, we observed that these orthologs exhibit different module associations in distantly related eukaryotes, suggesting that SMARCB1 may play a dynamic role in regulating SWI/SNF complex activity (Mashtalir, 2018).
ARID subunits (ARID1A, ARID1B, and ARID2) are among the most frequently mutated subunits in human disease. Importantly, this study found that ARID subunits are the major determinants of assembly pathway branching toward BAF or PBAF complexes. ARID subunits bind the BAF core module through the CBR regions on the C terminus and N terminus of ARID1 and ARID2, respectively, likely leading to the formation of a large interaction interface and forging a structurally essential bridge between the core and ATPase modules. SMARCD subunits in particular play a major role in ARID subunit binding as their loss substantially affects ARID and subsequent ATPase module assembly. The critical role for ARID subunits is further illustrated by their interaction with the ATPase module subunits SMARCA and ACTL6A. Finally, the absence of any ARID subunits in the newly identified ncBAF complex suggests an alternative, ARID-independent mode of binding the ATPase module (likely mediated by GLTSCR1 and GLTSCR1L subunits) (Mashtalir, 2018).
Assembly of multiprotein complexes often occurs in a directed modular manner with defined and evolutionarily conserved subcomplexes. Indeed, analysis of cross-linking mass spectrometry-identified linkages within SWI/SNF complexes of two other eukaryotic species reveals evolutionary conservation of the complex modularity identified in mammalian cells. The conserved structural properties of these complexes suggest separation and divergence of complex functions. Although the ATPase domain has been implicated in nucleosome sliding, the role of the other modules and subunits in both ATPase activity and nucleosome remodeling remains poorly understood. Extensive DNA binding surfaces, particularly on the BAF core module, may play critical roles in directing forces from the ATPase-nucleosome-DNA interaction. Further studies will be required to define the role of each subunit and domain in complex targeting and in the modulation of ATPase-driven nucleosome remodeling.
Finally, the current findings suggest that BAF inter- and intra-modular interactions are altered by mutations found in many human cancers and other diseases and that these mutations disrupt the normal complex assembly pathway or subunit protein stability. A prime example of this lies in the extensively mutated ARID1A subunit, including both nonsense mutations and missense mutations that are disproportionately skewed to the C-terminal domain that we found is required for BAF complex binding (Mashtalir, 2018).
Taken together, these studies present new opportunities for structural and functional characterization of this family of mammalian chromatin remodeling complexes that exhibits outsized roles in human disease. Understanding the architecture and modular organization of mSWI/SNF complexes greatly potentiates the ability to assign density to subunits and modules using cryo-EM in efforts to achieve 3D structure, to link structure to binding and biochemical activity on chromatin, and to develop physiologically meaningful small-molecule screening strategies, collectively serving as a critical foundation in the quest to define mechanisms of mSWI/SNF-mediated chromatin remodeling in normal and disease states (Mashtalir, 2018).
To regenerate, damaged tissue must heal the wound, regrow to the proper size, replace the correct cell types, and return to the normal gene-expression program. However, the mechanisms that temporally and spatially control the activation or repression of important genes during regeneration are not fully understood. To determine the role that chromatin modifiers play in regulating gene expression after tissue damage, ablation was induced in Drosophila melanogaster imaginal wing discs, and a screen was carried out for chromatin regulators that are required for epithelial tissue regeneration. Many of these genes are shown to be important for promoting or constraining regeneration. Specifically, the two SWI/SNF chromatin-remodeling complexes play distinct roles in regulating different aspects of regeneration. The PBAP (see Polybromo) complex regulates regenerative growth and developmental timing, and is required for the expression of JNK signaling targets and the growth promoter Myc. By contrast, the BAP complex (see Osa) ensures correct patterning and cell fate by stabilizing the expression of the posterior gene engrailed. Thus, both SWI/SNF complexes are essential for proper gene expression during tissue regeneration, but they play distinct roles in regulating growth and cell fate (Tian, 2021).
Regeneration is a complex yet highly elegant process that some organisms can use to recognize, repair, and replace missing or damaged tissue. Imaginal disc repair in Drosophila is a good model system for understanding regeneration, due to the high capacity of these tissues to regrow and restore complex patterning, as well as the genetic tools available in this model organism. Regeneration requires the coordinated expression of genes that regulate the sensing of tissue damage, induction of regenerative growth, repatterning of the tissue, and coordination of regeneration with developmental timing. Initiation of regeneration in imaginal discs requires known signaling pathways such as the Reactive oxygen species (ROS), Jun N-terminal kinase (JNK), Wingless (Wg), p38, Janus kinase/signal transducer and activator of transcription (Jak/STAT), and Hippo pathways. These pathways activate many regeneration genes, such as the growth promoter Myc and the hormone-like peptide ilp8, which delays pupariation after imaginal disc damage. However, misregulation of these signals can impair regeneration. For example, elevated levels of JNK signaling can induce patterning defects in the posterior of the wing, and elevated ROS levels can suppress JNK activity and regenerative growth. While the signals that initiate regeneration have been extensively studied, regulation of regeneration gene expression in response to tissue damage is not fully understood (Tian, 2021).
Such regulation could occur through chromatin modification. In Drosophila, chromatin modifiers include the Polycomb repressive complexes PRC1 and PRC2, which can be recruited to specific locations by the Pho repressor complex (PhoRC), the activating complexes Trithorax acetylation complex (TAC1), Complex of proteins associated with Set1 (COMPASS) and COMPASS-like, the nucleosome remodeling complex (NURF), and the switch/sucrose non-fermentable (SWI/SNF) chromatin remodelers Brahma-associated proteins (BAP) and Polybromo-associated proteins (PBAP). PRC2 carries out trimethylation of histone H3 at lysine 27, recruiting PRC1 to repress transcription of nearby genes. COMPASS-like and COMPASS carry out histone H3 lysine 4 monomethylation and di- and trimethylation, respectively, thereby activating the expression of nearby genes. TAC1 acetylates histone H3 lysine 27, also supporting activation of gene transcription. NURF, BAP, and PBAP alter or move nucleosomes to facilitate binding of transcription factors and chromatin modifiers. Rapid changes in gene expression induced by these complexes may help facilitate a damaged tissue's regenerative response (Tian, 2021).
A few chromatin modifiers and histone modifications have been reported to be important for regulating regeneration of Xenopus tadpole tails, mouse pancreas and liver, zebrafish fins, and Drosophila imaginal discs. Furthermore, components of Drosophila and mouse SWI/SNF complexes regulate regeneration in the Drosophila midgut and mouse skin, liver, and ear. However, little is known about how these complexes alter gene expression, signaling, and cellular behavior to regulate regeneration. Importantly, genome-wide analysis of chromatin state after Drosophila imaginal disc damage revealed changes in chromatin around a large set of genes, including known regeneration genes . Thus, chromatin modifiers likely play a key role in regulating activation of the regeneration program. However, it is unclear whether all regeneration genes are coordinately regulated in the same manner, or whether specific chromatin modification complexes target different subsets of genes that respond to tissue damage (Tian, 2021).
To probe the role of chromatin modifiers in tissue regeneration systematically, a collection of pre-existing Drosophila mutants and RNAi lines targeting components of these complexes as well as other genes that regulate chromatin was collected, and these lines were screened for regeneration defects using the Drosophila wing imaginal disc. A spatially and temporally controllable tissue-ablation method was used that uses transgenic tools to induce tissue damage only in the wing primordium. This method ablates 94% of the wing primordium on average at the early third instar and allows the damaged wing discs to regenerate in situ. Previous genetic screens using this tissue ablation method have identified genes critical for regulating different aspects of regeneration, such as taranis, trithorax, and cap-n-collar, demonstrating its efficacy in finding regeneration genes (Tian, 2021).
Through this targeted genetic screen of chromatin regulators, it was found that mutations in Drosophila SWI/SNF components caused striking regeneration defects. The SWI/SNF complexes are conserved multi-subunit protein complexes that activate or repress gene expression by using the energy from ATP hydrolysis to disrupt histone-DNA contacts and remodel nucleosome structure and position. Brahma (Brm) is the only ATPase of the SWI/SNF complexes in Drosophila. Moira (Mor) serves as the core scaffold of the complexes. Other components contain domains involved in protein-protein interactions, protein-DNA interactions, or interactions with modified histones. There are two subtypes of SWI/SNF in Drosophila: the Brahma-associated proteins (BAP) and the Polybromo-associated BAP (PBAP) remodeling complexes. They share common core components, including Brm, Snf5-related 1 (Snr1), Mor, Brahma-associated protein 55kD (Bap55), Brahma-associated protein 60kD (Bap60), Brahma-associated protein 111kD (Bap111), and Actin, but contain different signature proteins. The PBAP complex is defined by the components Brahma-associated protein 170kD (Bap170), Polybromo, and Supporter of activation of yellow protein (Sayp) (Tian, 2021).
This study shows that the SWI/SNF complexes BAP and PBAP are required for regeneration, and that the two complexes play distinct roles. The PBAP complex is important for activation of JNK signaling targets such as ilp8 to delay metamorphosis and allow enough time for the damaged tissue to regrow, and for expression of Myc to drive regenerative growth. By contrast, the BAP complex functions to prevent changes in cell fate induced by tissue damage through stabilizing expression of the posterior identity gene engrailed. Thus, different aspects of the regeneration program are regulated independently by distinct chromatin regulators (Tian, 2021).
To address the question of how regeneration genes are regulated in response to tissue damage, a collection of mutants and RNAi lines were screened that affect a significant number of the chromatin regulators in Drosophila. Most of these mutants had regeneration phenotypes, confirming that these genes are important for both promoting and constraining regeneration and likely facilitate the shift from the normal developmental program to the regeneration program, and back again. The variation in regeneration phenotypes among different chromatin regulators and among components of the same multiunit complexes supports a previous finding that damage activates expression of genes that both promote and constrain regeneration. Such regulators of regeneration may be differentially affected by distinct mutations that affect the same chromatin-modifying complexes, resulting in different phenotypes (Tian, 2021).
This study has demonstrated that both Drosophila SWI/SNF complexes play essential but distinct roles during epithelial regeneration, controlling multiple aspects of the process, including growth, developmental timing, and cell fate. Furthermore, this work has identified multiple likely targets, including mmp1, Myc, ilp8, and en. Indeed, analysis of data from a recent study that identified regions of the genome that transition to open chromatin after imaginal disc damage showed such damage-responsive regions near Myc, mmp1, and ilp8. While previous work has suggested that chromatin modifiers can regulate regeneration, and that the chromatin near Drosophila regeneration genes is modified after damage, the results suggest that these damage-responsive loci are not all coordinately regulated in the same manner. The SWI/SNF complexes target different subsets of genes, and it will not be surprising if different cofactors or transcription factors recruit different complexes to other subsets of regeneration genes (Tian, 2021).
Is the requirement for the SWI/SNF complexes for growth and conservation of cell fate in the wing disc specific to regeneration? In contrast to tara, which is required for posterior wing fate only after damage and regeneration, loss of mor in homozygous clones during wing disc development caused loss of en expression in the posterior compartment, although this result was interpreted to mean that mor promotes rather than constrains en expression, which is the opposite of the current observations. Importantly, undamaged mor heterozygous mutant animals did not show patterning defects, while damaged heterozygous mutant animals did, indicating that regenerating tissue is more sensitive to reductions in SWI/SNF levels than normally developing tissue. Furthermore, osa is required for normal wing growth, but reduction of osa levels did not compromise growth during regeneration (Supplementary Figure S3, D-H), and instead led to enhanced regeneration. Thus, while some functions of SWI/SNF during regeneration may be the same as during development, other functions of SWI/SNF may be unique to regeneration (Tian, 2021).
SWI/SNF complexes help organisms respond rapidly to stressful conditions or changes in the environment. For example, SWI/SNF is recruited by the transcription factor DAF-16/FOXO to promote stress resistance in Caenorhabditis elegans, and the Drosophila BAP complex is required for the activation of target genes of the NF-κB signaling transcription factor Relish in immune responses. This study showed that the Drosophila PBAP complex is similarly required after tissue damage for activation of target genes of the JNK signaling transcription factor AP-1 after tissue damage. Interestingly, the BAF60a subunit, a mammalian homolog of Drosophila BAP60, directly binds the AP-1 transcription factor and stimulates the DNA-binding activity of AP-1, suggesting that this role may be conserved (Tian, 2021).
In summary, this study have demonstrated that the two SWI/SNF complexes regulate different aspects of wing imaginal disc regeneration, implying that activation of the regeneration program is controlled by changes in chromatin, but that the mechanism of regulation is likely different for subsets of regeneration genes. Future identification of all genes targeted by BAP and PBAP after tissue damage, the factors that recruit these chromatin-remodeling complexes, and the changes they induce at these loci will deepen our understanding of how unexpected or stressful conditions lead to rapid activation of the appropriate genes (Tian, 2021).
Histone acetylations are important epigenetic markers for transcriptional activation in response to metabolic changes and various stresses. Using the high-throughput SEquencing-Based Yeast replicative Lifespan screen method and the yeast knockout collection, this study demonstrate that the HDA complex, a class-II histone deacetylase (HDAC), regulates aging through its target of acetylated H3K18 at storage carbohydrate genes. In addition to longer lifespan, disruption of HDA results in resistance to DNA damage and osmotic stresses. This study shows that these effects are due to increased promoter H3K18 acetylation and transcriptional activation in the trehalose metabolic pathway in the absence of HDA. Furthermore, it was determined that the longevity effect of HDA is independent of the Cyc8-Tup1 repressor complex known to interact with HDA and coordinate transcriptional repression. Silencing the HDA homologs in C. elegans and Drosophila increases their lifespan and delays aging-associated physical declines in adult flies. Hence, this study has demonstrated that this HDAC controls an evolutionarily conserved longevity pathway (Yu, 2021).
Enhancers are essential drivers of cell states, yet the relationship between accessibility, regulatory activity, and in vivo lineage commitment during embryogenesis remains poorly understood. This study measured chromatin accessibility in isolated neural and mesodermal lineages across a time course of Drosophila embryogenesis. Promoters, including tissue-specific genes, are often constitutively open, even in contexts where the gene is not expressed. In contrast, the majority of distal elements have dynamic, tissue-specific accessibility. Enhancer priming appears rarely within a lineage, perhaps reflecting the speed of Drosophila embryogenesis. However, many tissue-specific enhancers are accessible in other lineages early on and become progressively closed as embryogenesis proceeds. This study demonstrates the usefulness of this tissue- and time-resolved resource to definitively identify single-cell clusters, to uncover predictive motifs, and to identify many regulators of tissue development. For one such predicted neural regulator, l(3)neo38, a loss-of-function mutant was generated and an essential role for neuromuscular junction and brain development was uncovered (Reddington, 2020).
Chromatin accessibility profiling is a powerful method to map the regulatory genome-the full complement of regulatory elements engaged by transcription factors (TFs) and other regulatory proteins-in a specific cell type and biological context. Accessible regions are generally measured by the sensitivity of genomic regions to the nuclease DNaseI, e.g., 'DNase-seq', or insertion of the Tn5 transposase, e.g., 'ATAC-seq', following the displacement of canonical nucleosomes by TFs and other regulatory factors. when measured over cellular differentiation, these approaches can chart the dynamics of regulatory-element usage genome wide, infer the occupancy of DNA-binding factors en masse, and identify candidate TF drivers of cell-fate decisions (Reddington, 2020).
Although an excellent method to identify the location of regulatory elements, the precise relationship between chromatin accessibility, enhancer activity, and the timing of cell-fate decisions remains unclear. Chromatin accessibility is often equated with regulatory activity, but there are several reasons why this may not be the case. First, the precise molecular determinants of chromatin accessibility as measured by methods, such as DNase-seq and ATAC-seq, and the relative importance of different TF families, chromatin-remodeling complexes, and histone variants are not well understood. Second, chromatin accessibility per se provides no information about the identity of the bound proteins, whether they have activating or repressive activity, or the function of the regulatory region (enhancer, insulator, or origin of replication). Third, the activation of at least a subset of lineage-specific enhancers occurs via a multistep process during development, being bound in precursor cells ('priming'), but only becoming activated at subsequent developmental stages following an exchange or addition of TFs. Fourth, chromatin accessibility may reflect residual TF binding that remains in a recently diverged sister lineage. Directly connecting accessibility to enhancer activity more generally has been hampered by technical limitations-the activity of a large number of enhancers is rarely measured directly in vivo, but is rather inferred from secondary information, such as histone modifications, that are themselves imperfect correlates of enhancer activity. Therefore, it remains unclear if the timing of enhancer activity correlates with the timing of accessibility and if this relationship changes over development (Reddington, 2020).
Chromatin accessibility profiling has been applied extensively to human cell lines and primary tissues, providing a comprehensive picture of regulatory landscapes in definitive cell types and highlighting the extensive differences between them. In contrast, far less is known about how regulatory landscapes diverge during embryonic development. With few exceptions, much of what is known comes from whole embryos or dissected organs or tissues, which have limited sensitivity to detect enhancers that are active in small subsets of cells due to averaging over heterogeneous cell types (Reddington, 2020).
Single-cell methods can address some of these issues. The power of single-cell ATAC-seq was recently demonstrated by combinatorial indexing (sci-ATAC-seq) to dissect cell-type-specific signatures of open chromatin starting from whole embryos (Cusanovich, 2018). This 'shotgun' approach identified thousands of previously unidentified regulatory elements and was sufficient to annotate cell clusters and predict tissue-specific enhancer activity (Cusanovich, 2018). However, a limitation of single-cell genomics is that cell identities are assigned post hoc based on prior information, e.g., using marker genes. Having high-quality tissue- and time-resolved datasets are therefore an extremely useful resource for annotating single-cell data. In addition, given the current sparsity of single-cell epigenomics data, bulk data have greater statistical power to define cell-type and time-point-specific regulatory features de novo (Reddington, 2020).
To better characterize chromatin accessibility during the development of different embryonic cell lineages, a tissue- and time-resolved chromatin accessibility atlas was generated spanning the stages when all major cell types are specified during Drosophila embryogenesis, focusing on the mesoderm and neuronal lineages. This atlas was integrated with an extensive database of curated embryonic enhancers to systematically assess the relationship between enhancer accessibility and enhancer activity in vivo during cell-fate decisions. The high resolution of this tissue- and time-resolved data facilitated the discovery of thousands of previously unidentified tissue-specific regulatory elements. This study demonstrated the value of this resource to define cell identities in sci-ATAC-seq data (Cusanovich, 2018) and as a training set to identify sequence features that are predictive of tissue-specific regulatory-element usage. This allowed identification of many potential regulators in each lineage, one of which was characterize by genetic knockout, and an essential role was demonstrated in neuromuscular junction and brain development (Reddington, 2020).
In this study most l(3)neo38CRISPR mutant embryos reached the pupal stage (74%), but then died as late (black) pupae; only 12% hatched from the pupal case compared with 90% of the control heterozygote animals. The small number of l(3)neo38CRISPRadult flies that do hatch have severely reduced motility, are barely able to walk and unable to fly (with a 'wings-up' phenotype), and fail to reproduce, suggesting defects in the neuromuscular system (Reddington, 2020).
To further characterize these defects, third instar larval brains and neuromuscular junctions were examined via immunostaining. To assess the brain, advantage was taken of a GFP reporter (3xP3-GFP) driven by binding sites for the conserved neuronal transcription factor Pax6 present in the VasaCas9 line that was used for the CRISPR deletion. In wild type (heterozygous l(3)neo38CRISPR/+) larval brains, GFP is highly expressed in the optic lobes (OL), with lower levels in the ventral nerve cord (VNC), similar to the Pax6 homolog, ey. Loss-of-function l(3)neo38CRISPR brains show the reverse, with almost no GFP expression in the OL, and high expression in the VNC, especially in the posterior part (Reddington, 2020).
To examine the development of the neuromuscular junction (NMJ), homozygous loss-of-function third instar larvae were dissected and stained with anti-HRP (horseradish peroxidase, to mark neuronal membranes) and Brp (Bruchpilot, to mark synaptic active zones). This revealed neuromuscular junctions that are both smaller and less branched in l(3)neo38CRISPR mutants. Quantifying the NMJ area at the ventral longitudinal muscles 6 and 7 in abdominal segments A3 and A4, using the HRP marker protein, revealed a significant reduction in the size of NMJs in homozygous loss-of-function mutants compared with wild-type larvae of the same age. Consistent with these neurological defects, l(3)neo38 occupies a subset of neuronal specific distal DHS, as seen using embryonic ChIP-seq data . The nearest genes of these l(3)neo38 ChIP peaks are enriched in functions related to postembryonic development and the regulation of neuron interactions and synapses (Reddington, 2020).
Drosophila is a long-standing model organism for the study of gene regulation, chromatin biology, cell-fate decisions, and general properties of embryonic development, with many fundamental biological processes being first discovered in this system. This study extended the Drosophila toolkit by generating an extensive resource of tissue-resolved chromatin accessibility during an in vivo time course of tissue development. By isolating highly pure populations of cell nuclei in a tight developmental time course, this atlas reveals the extensive dynamics that accompanies major developmental transitions in two important lineages: the nervous system and the mesoderm/muscle system. Importantly, this significantly extends knowledge beyond the previously available data collected from whole embryos and single cells from a smaller number of time points (Cusanovich, 2018). This resource can be easily searched and visualized (Reddington, 2020).
There are multiple approaches that can achieve cell-type-resolved chromatin accessibility information from developing embryos, such as the INTACT, BiTS, CaTaDA, and shotgun single-cell ATAC-seq. Each has its own strengths and weaknesses, making the choice of method dependent on technical issues. An advantage of this modified BiTS approach is that it can be applied to wild-type embryos of theoretically any species, although it is limited by the availability of high-quality antibodies against nuclear marker proteins of interest. Single-cell ATAC-seq is a very exciting alternative as it negates the need for transgenic animals or antibodies to isolate cell populations, and provides an unbiased view of cellular heterogeneity (Cusanovich, 2018). However, without prior enrichment, hundreds of thousands to potentially millions of cells are required to capture rare cell types. At present the resolution is to sparse in singe-cell epigenetic data (scATAC-seq or ChIP-seq) to provide accurate quantification, or de novo detection, of enhancers from rare cell subsets-highlighting the need for detailed studies of specific tissues/cell-types to provide deep, quantitative information on regulatory regions for targeted cell populations (Reddington, 2020).
This study demonstrates the utility of this tissue- and time-resolved atlas to uncover biological insights into regulatory element usage during embryonic development. Many promoters, even those of tissue specific genes, are open constitutively, including in tissues where the gene is not expressed. In contrast, the majority of distal sites are dynamic and tissue-specific. Thousands of these overlap characterized enhancers, and are accessible in the appropriate tissue matching the enhancers' activity-strongly suggesting that the bulk of distal elements (with high developmental variance) are new tissue-specific enhancers. Similar to promoters, this study also uncovered a subset of enhancers that are open in lineages where they are not active at early embryonic stages, and this gets progressively lower as embryogenesis proceeds. This indicates that active enhancers become closed in other lineages as cells begin to differentiate. Interestingly, the majority of these are bound by ubiquitous TFs, which suggests that they are not sufficient to activate the enhancer. Perhaps they boost the levels of the enhancers activity in the 'right' tissue, when co-bound by lineage specific TFs or act as place holders to keep these enhancers open until the embryonic stage when the lineage-specific factor is expressed (Reddington, 2020).
ATP-dependent chromatin remodelers control the accessibility of genomic DNA through nucleosome mobilization. However, the dynamics of genome exploration by remodelers, and the role of ATP hydrolysis in this process remain unclear. This study used live-cell imaging of Drosophila polytene nuclei to monitor Brahma (BRM) remodeler interactions with its chromosomal targets. In parallel, local chromatin condensation and its effect on BRM association were measured. Surprisingly, only a small portion of BRM is bound to chromatin at any given time. BRM binds decondensed chromatin but is excluded from condensed chromatin, limiting its genomic search space. BRM-chromatin interactions are highly dynamic, whereas histone-exchange is limited and much slower. Intriguingly, loss of ATP hydrolysis enhanced chromatin retention and clustering of BRM, which was associated with reduced histone turnover. Thus, ATP hydrolysis couples nucleosome remodeling to remodeler release, driving a continuous transient probing of the genome (Tilly, 2021).
An embryo experiences increasingly complex spatial and temporal patterns of gene expression as it matures, guiding the morphogenesis of its body. Using super-resolution fluorescence microscopy in Drosophila melanogaster embryos, it was observed that the nuclear distributions of transcription factors and histone modifications undergo a similar transformation of increasing heterogeneity. This spatial partitioning of the nucleus could lead to distinct local regulatory environments in space and time that are tuned for specific genes. Accordingly, transcription sites driven by different cis-regulatory regions each had their own temporally and spatially varying local histone environments, which could facilitate the finer spatial and temporal regulation of genes to consistently differentiate cells into organs and tissues. Thus, 'nuclear morphogenesis' may be a microscopic counterpart of the macroscopic process that shapes the animal body (Tsai, 2022).
During development, the embryo experiences changes in its gene regulation profiles, adapting to different regulatory needs. Early on during development, cell division occurs quickly and TFs form multi-cell gradients with uniform nuclear distributions. This stage up to gastrulation lasts ∼3 h in D. melanogaster embryos. Subsequently, the division slows down and segmentation of the body takes place. This phase lasts ∼18 h. During this time, many regulatory and signaling pathways for different cell types, tissues and organs become active, leading to a potentially complicated regulatory environment in the nucleus. The expression patterns of TFs became lineage specific and their nuclear distributions became heterogeneous. For TFs that regulate different genes during later stages of development, this spatial partitioning could separate them from early genes and provide another layer of safety beyond repression to prevent accidental cross-activation (Tsai, 2022).
As TFs interact with DNA, their general nuclear distributions could reflect changes in the chromatin environment. Starting from the beginning of zygotic gene expression, the levels of histone modifications increase at specific regions on the genome. This occurs concurrently with the establishment of TADs and with the chromatin organizing into distinct transcriptionally active or repressed regions. This study observed such increases in active and repressive histone modifications. The increase in H3K27me3, associated with the polycomb group proteins, was the most drastic and could be due to differentiating cells shutting down genomic regions. As chromatin accessibility may drive sub-nuclear compartmentalization, this change in chromatin organization could create distinct accessibility patterns that guide TFs into heterogeneous distributions. These changes in the distributions of TFs and chromatin states across development could alter the regulation of genes depending on the timing of their expression (Tsai, 2022).
To quantitatively measure temporal and spatial variations in histone environments around genes, constructs were used with MS2 reporter mRNAs. This study verified that their expression patterns matched their endogenous counterparts in the developmental stages that were observed. Nevertheless, to make specific statements regarding the spatial-temporal behaviors, regulatory properties and functions of endogenous genes, future experiments would ideally tag endogenous genes and decouple position tracking from transcriptional activity to observe both active and repressive microenvironments (Tsai, 2022).
Over time scales of hours, an increase was observed in H3K4me1 near active hbBAC transcription sites between stage 5 and 10 embryos. In contrast, the levels of H3K4me3 were high in both younger and older embryos. This suggests that the promotor is used at similar levels during both stages. In contrast, enhancers, especially those that are not immediately next to the promoter, may not have sufficient time to interact with the promoter in the early embryo due to the rapid division cycle and relative lack of structure in the nucleus. As the nuclear environment becomes more stable and heterogeneous in older embryos, longer-distance interactions could form, bringing more or different enhancers into play. For genes active during multiple developmental stages, a changing set of regulatory elements over time due to kinetic constraints may permit them to respond to different regulatory inputs and serve multiple roles using the same cis-regulator region (Tsai, 2022).
In less than 1 h, during the process of cellularization, this study observed that the histone environments at transcription sites of snaA2.2 W0.10 and rhoNEE could still vary. The upward trend of H3K4me3 for both constructs could indicate increasing transcriptional activity in general. Although both constructs share the same hbP2 promoter, there is still a difference in the time trajectory of H3K4me3, potentially stemming from their different enhancers. Notably, the rhoNEE minimal enhancer is about 300 bp while the sna A2.2 W0.10 construct contains a longer enhancer of about 2 kb from sna. The short enhancer of rhoNEE available for H3K4me1 deposition could mean that small changes in H3K4me1 level are below the detection threshold, whereas sna A2.2 W0.10 has a much more space for H3K4me1 to accumulate. Interestingly, hbBAC, which showed a clear change in H3K4me1, contains a cis-regulatory region of about 18 kb with potentially multiple enhancers. In addition, variations were observed in the rhoNEE histone environments depending on where the cell is located, consistent with the regulatory inputs of rhoNEE: the changing H3K4me1 but constant H3K4me3 suggests a shared active promoter (hbP2 in this construct) but differential engagement from enhancers, depending on whether the repressor Snail is present. Together, these results imply that histone environments around genes depend on the properties and states of cis-regulatory elements. These differences could refine expression patterns over time, as is the case for many developmental genes, where differences in local epigenetic marking could guide incorrectly expressing or silent cells into their proper state (Tsai, 2022).
In conclusion, this survey of the distribution of TFs and histone modifications in Drosophila embryos has provided evidence for the emergence of a heterogeneous nucleus during development. This transformation of the nuclear space may parallel morphogenesis where the embryo body becomes increasingly complex. Thus, this work suggests that 'nuclear morphogenesis' is a promising hypothesis that should be further tested and refined. Through investigating the interactions between nuclear environment and gene regulation beyond gastrulation, it would be possible gain insights into how the changing spatial architecture of the nucleus alters the strategies of gene regulation during later stages of embryo development. The ability to coordinate multiple lineage-, location- and time-specific environments in a single nucleus could facilitate gene regulation to ensure consistent cellular differentiation and the emergence of a robust developmental process. Spatial partitioning at both microscopic and macroscopic levels could be a common strategy in multi-cellular organisms to separate or coordinate regulatory processes that spans multiple length and time scales (Tsai, 2022).
Interactions between the genome and the nuclear pore complex (NPC) have been implicated in multiple gene regulatory processes, but the underlying logic of these interactions remains poorly defined. This study reports high-resolution chromatin binding maps of two core components of the NPC, Nup107 and Nup93, in Drosophila cells. This investigation uncovered differential binding of these NPC subunits, where Nup107 preferentially targets active genes while Nup93 associates primarily with Polycomb-silenced regions. Comparison to Lamin-associated domains (LADs) revealed that NPC binding sites can be found within LADs, demonstrating a linear binding of the genome along the nuclear envelope. Importantly, this study identified a functional role of Nup93 in silencing of Polycomb target genes and in spatial folding of Polycomb domains. These findings lend to a model where different nuclear pores bind different types of chromatin via interactions with specific NPC sub-complexes, and a subset of Polycomb domains is stabilized by interactions with Nup93 (Gozalo, 2019).
Spatial architecture in the nucleus is set up by interactions between the genome and protein components of nuclear macro-complexes and scaffolds. The most prominent nuclear scaffold is the nuclear envelope (NE), which consists of a double membrane interspersed by a variety of trans-membrane and closely associated proteins. Chromatin re-organization and gene re-positioning during cellular differentiation involves losing or gaining interactions between the genome and the NE, and such rearrangements can influence gene expression programs. For instance, the nuclear lamina, which is a filamentous protein network underlying the NE, has been extensively implicated in setting up tissue-specific genome organization by sequestering genes destined for silencing. Genome-wide mapping of Lamin-associated domains (LADs), as well as related functional studies, have led to the current view of the nuclear lamina as a compartment for stable gene repression (reviewed in van Steensel, 2017). Another major component of the NE is the nuclear pore complex (NPC), which consists of multiple copies of approximately 30 different proteins termed nucleoporins (Nups) and is responsible for selective nucleo-cytoplasmic transport. In addition to transport-related functions, NPCs and individual Nups are also involved in genome organization and gene regulation through physical interactions with the genome. Yet unlike the nuclear lamina, the functional relationship between NPCs and genome regulation appears to be considerably more varied and remains less understood (Gozalo, 2019).
Given the close proximity of nuclear pores to the underlying chromatin, it is not surprising that multiple studies have now identified binding of Nups to subsets of genes and regulatory elements in a number of species. Many of these studies have reported preferential association of particular Nups with actively transcribing genes or re-localization of genes to the NPCs during activation. These findings have led to the predominant view of the NPC as a nuclear compartment for active processes, functionally opposed to those of the nuclear lamina. However, at least in metazoan systems, this view is confounded by the reported intranuclear presence of Nups that have been classified as dynamic. The ~30 conserved Nups that comprise the NPC can be either dynamic, meaning they are able to come on and off the NPC during interphase, or stable, meaning they are core components of the NE-embedded NPC for the majority of the cell cycle. Currently, many of the reported contacts between active genes and Nups have been described for dynamic Nups, such as Nup98, Nup153, and Nup62, and can frequently occur in the nucleoplasm. Consequently, it is unclear whether genomic binding to actual NPCs is functionally distinct from intranuclear Nup binding (Gozalo, 2019).
Genomic binding to actual NPCs can be determined by mapping chromatin-binding patterns of stable Nups, which are components of the outer-ring Nup107-Nup160 and the inner-ring Nup93-Nup205 sub-complexes. Interestingly, previous studies that profiled chromatin binding of stable Nups did not identify enrichment for transcribing loci and reported prevalence of repressive chromatin. Similarly, DamID profiling of Nup98, artificially tethered to the NPC and thus used as a marker for actual NPC binding, showed no enrichment for active genes and instead exhibited high incidence of motifs for the architectural protein Su(Hw) in Drosophila cells. These studies conflict with the simplified view of the NPC as a scaffold for gene activation and highlight the complexity of NPC-genome interactions. It should be noted that the majority of the NPC genome-binding datasets, mentioned above, were produced using either the DamID technique, which tends to generate wide binding peaks, or the lower-resolution ChIP-chip approach, and thus may have given an incomplete picture of the locations and functions of NPC-genome contacts (Gozalo, 2019).
One hypothesis, which can explain this dichotomy of both active and silent regions at the NPC, is that individual stable Nups bind distinct regions of the genome and regulate distinct chromatin-associated processes. This study set out to explore this hypothesis by generating precise binding maps of stable Nups, using an optimized ChIP-seq approach. ChIP-seq maps revealed that Nup107, a core component of the outer ring sub-complex, and Nup93, a core component of the inner ring sub-complex, bind highly non-overlapping regions of the genome. Specifically, while Nup107 preferentially targets active promoters, as has been reported for other Nups, Nup93 associates primarily with silenced regions bound by Polycomb group (PcG) proteins. PcG proteins are conserved regulators of epigenetically maintained gene repression, which often bind the genome in long Polycomb (Pc) domains. In agreement with its binding pattern, this study found that Nup93 plays a functional role in the silencing and long-range interactions of Pc targets. Together, the results emphasize the concept that different sub-complexes of the nuclear pore interact with and influence distinct chromatin states, revealing a complex landscape of NE-genome interactions (Gozalo, 2019).
The results provide high-resolution chromatin binding maps of stable NPC components and offer a resource for future comparisons to a variety of genomic features. These maps and analysis contribute several insights into the nuclear organization field. First, it was found that representative members of the two core sub-complexes of the NPC, Nup107 and Nup93, bind to active and silenced regions, respectively. This differential binding helps explain the variability in previous conclusions on NPC-genome contacts and extends understanding of how NPC-genome contacts shape three-dimensional genome architecture. ChIP-seq maps of Nup107 are consistent with the predominant view of the NPC as a place for targeting active genes and suggest that this function is carried out primarily through associations of the genome with the outer-ring NPC sub-complex. Although currently it is not possible to definitively prove that all identified binding peaks of Nup107 and Nup93 represent NPC binding, comparison to LADs and immunofluorescence localization analysis, as well as DNA FISH analysis of select loci, suggest that a large proportion of these binding peaks represent regions present at the nuclear periphery, at actual NPCs (Gozalo, 2019).
Second, the findings describe a functional connection between Nup93, a conserved subunit of the inner-ring NPC sub-complex, and Polycomb complexes, which are key regulators of developmental gene silencing. ChIP-seq map of Nup93 demonstrates preferential targeting of Nup93 to a large subset of PcG domains, particularly those that exhibit the highest level of Pc binding. Importantly, it was found that lowering levels of Nup93 leads to de-repression of Pc targets in both cultured cells and fly tissues. These findings suggest that (1) a core NPC subunit is involved in the epigenetic maintenance of silencing via its chromatin binding role; and (2) a subclass of particularly stable PcG chromatin domains are targeted to the nuclear periphery, where they require Nup93 for optimal silencing. The function of the Nup93 sub-complex in gene repression appears to be highly conserved, as strikingly, the S. pombe homolog of Nup93 was found by Moazed and colleagues to be required for silencing and nuclear clustering of heterochromatin (Iglesias, 2019). Nup93 has also been previously shown to be required for the repression of the HoxA gene cluster in mammalian cells (Labade, 2016). This is in line with previous findings that another subunit of this sub-complex, Nup155, associates with histone deacetylases in mammalian cells and targets repressed heterochromatin in yeast. The role of Nup93 in PcG silencing is also potentially related to the previously reported link of Nup153 to PcG-mediated repression in mouse ES cells. In further support of this notion, several Nups have been previously identified in a genome-wide imaging screen for factors that affect nuclear distribution of PcG proteins (Gozalo, 2019).
Thus, a proposed model envisions that certain nuclear pores may interact with active chromatin via the Nup107 sub-complex, while other nuclear pores may associate with silent chromatin via the Nup93 sub-complex. This model is based on the ability to ChIP-seq distinct regions of the genome with different structural NPC components, and it is further supported by biochemical interaction data and the specificity of the functional effect of Nup93. The proposed model is consistent with previous findings in yeast, which reported the binding of a component of the Nup93 sub-complex to silent chromatin and the possible existence of this sub-complex as a type of an independent nuclear pore-related complex, present at the nuclear periphery. In this context, it remains to be determined whether some of the Nup93-PcG interactions similarly occur as an independent complex or if they are normally part of actual NPCs and whether the observed functional effect of Nup93 on PcG silencing always takes place at the NPC (Gozalo, 2019).
Interestingly, it was found that the Nup93-targeted PcG domains tend to preferentially interact with each other in nuclear space. It is intriguing that other Nups, such as Nup98 and Mlp1/2, have been previously shown to facilitate long-range contacts of transcribing genes, such as enhancer-promoter and 5'-3' loops. It appears that stabilization of long-range contacts, either at active or silent genes, may be generally promoted by NPC binding, but the nature of contacts depends on the particular Nup involved. Based on the combined results, it is hypothesized that Nup93 binding may promote stabilization of PcG domains that are destined to be highly repressed. This stabilization may involve promoting long-range interactions between Pc sites, as well as possibly helping sequester PcG domains into specific nuclear compartments, away from gene activity. The findings also suggest that in the case of PcG silencing, long-range interactions are more functionally involved in gene repression than localization to the nuclear periphery is, since de-repression of PcG targets is consistently associated with loss of long-range interactions (Gozalo, 2019).
Furthermore, the results suggest that some of the previously defined LADs are in fact interrupted or flanked with NPC-associated chromatin. In this manner, it appears that at least a fraction of mapped LADs may be complex, containing Nup-targeted sub-environments. These conclusions are also supported by the recent refined analysis of mammalian LADs, which revealed LAD interruptions that contain marks of active chromatin, termed 'Disruption in Peripheral signal' (DiPs). If such DiPs are biologically meaningful, the data would suggest that some such DiPs may be NPC-bound areas of the genome, characterized by functions distinct from the surrounding LADs. An intriguing conjecture is that positioning genes at NPCs within LADs may facilitate ready switching of transcriptional states, such that genes can shift between adjacent active and silent states, depending on incoming signals (Gozalo, 2019).
Finally, this analysis demonstrated widespread genomic binding by a non-stable Nup Elys, which is currently the only Nup with a known direct chromatin binding activity (Zierhut, 2014). Interestingly, Nup107 is almost exclusively found at Elys binding sites. It is tempting to speculate that Elys serves as a chromatin tethering Nup for the Nup107 sub-complex components in the interphase genome, much like it has been demonstrated to do post-mitotically, during NPC assembly. The reproducibility of Elys-Nup107 binding patterns further invokes the possibility that post-mitotic targeting of the Elys/Nup107 sub-complex to chromatin occurs at specific sites in the genome and as such, may participate in the correct re-establishment of chromatin states and nuclear architecture after mitosis. On the other hand, Nup93 similarly shares a large fraction of its binding sites with Elys, suggesting that Elys may carry a similar function in targeting the inner ring sub-complex to chromatin. Presently it remains unclear how this specificity of Nup93 versus Nup107 genome targeting may be established. But together, the findings support the model where different subunits of the NPC have evolved unique functions in chromatin regulation. Individual Nups appear to be able to facilitate either activating or repressive processes and to assist nuclear organization of chromatin domains and key proteins complexes (Gozalo, 2019).
Chromosomes in many organisms, including Drosophila and mammals, are folded into topologically associating domains (TADs). Increasing evidence suggests that TAD folding is hierarchical, wherein subdomains combine to form larger superdomains, instead of a sequence of nonoverlapping domains. This study examined the hierarchical structure of TADs in Drosophila. The boundaries of TADs of different hierarchical levels are characterized by the presence of different portions of active chromatin, but do not vary in the binding of architectural proteins, such as CCCTC binding factor or cohesin. The apparent hierarchy of TADs in Drosophila chromosomes is not likely to have functional importance but rather reflects various options of long-range chromatin folding directed by the distribution of active and inactive chromatin segments and may represent population average (Luzhin, 2018).
While short-read sequencing technology has resulted in a sharp increase in the number of species with genome assemblies, these assemblies are typically highly fragmented. Repeats pose the largest challenge for reference genome assembly, and pericentromeric regions and the repeat-rich Y chromosome are typically ignored from sequencing projects. This study assemble the genome of Drosophila miranda using long reads for contig formation, chromatin interaction maps for scaffolding and short reads, and optical mapping and bacterial artificial chromosome (BAC) clone sequencing for consensus validation. The assembly recovers entire chromosomes and contains large fractions of repetitive DNA, including about 41.5 Mb of pericentromeric and telomeric regions, and >100 Mb of the recently formed highly repetitive neo-Y chromosome. While Y chromosome evolution is typically characterized by global sequence loss and shrinkage, the neo-Y increased in size by almost 3-fold because of the accumulation of repetitive sequences. This high-quality assembly allows reconstruction og the chromosomal events that have led to the unusual sex chromosome karyotype in D. miranda, including the independent de novo formation of a pair of sex chromosomes at two distinct time points, or the reversion of a former Y chromosome to an autosome (Mahajan, 2018).
Heterochromatic regions of the genome are repeat-rich and poor in protein coding genes, and are therefore underrepresented in even the best genome assemblies. One of the most difficult regions of the genome to assemble are sex-limited chromosomes. The Drosophila melanogaster Y chromosome is entirely heterochromatic, yet has wide-ranging effects on male fertility, fitness, and genome-wide gene expression. The genetic basis of this phenotypic variation is difficult to study, in part because the detailed organization of the Y chromosome is not known. To study Y chromosome organization in D. melanogaster, an assembly strategy was developed involving the in silico enrichment of heterochromatic long single-molecule reads, and these reads were to create targeted de novo assemblies of heterochromatic sequences. Contigs were assigned to the Y chromosome using Illumina reads to identify male-specific sequences. Our pipeline extends the D. melanogaster reference genome by 11.9 Mb, closes 43.8% of the gaps, and improves overall contiguity. The addition of 10.6 MB of Y-linked sequence permitted study of the organization of repeats and genes along the Y chromosome. A high rate of duplication was detected to the pericentric regions of the Y chromosome from other regions in the genome. Most of these duplicated genes exist in multiple copies. The evolutionary history of one sex-linked gene family-crystal-Stellate was detailed. While the Y chromosome does not undergo crossing over, high gene conversion rates were detected within and between members of the crystal-Stellate gene family, Su(Ste), and PCKR, compared to genome-wide estimates. These results suggest that gene conversion and gene duplication play an important role in the evolution of Y-linked genes (Chang, 2018).
The Y influences expression of hundreds of genes genome-wide, possibly by sequestering key components of the heterochromatin machinery away from other positions in the genome. To test the influence of the Y chromosome on the genome-wide chromatin landscape, the genomic distribution was assayed of histone modifications associated with gene activation (H3K4me3) or heterochromatin (H3K9me2 and H3K9me3) in fruit flies with varying sex chromosome complements (X0, XY and XYY males; XX and XXY females). Consistent with the general deficiency of active chromatin modifications on the Y, this study found that Y gene dose has little influence on the genomic distribution of H3K4me3. In contrast, both the presence and the number of Y-chromosomes strongly influence genome-wide enrichment patterns of repressive chromatin modifications. Highly repetitive regions such as the pericentromeres, the dot, and the Y chromosome (if present) are enriched for heterochromatic modifications in wildtype males and females, and even more strongly in X0 flies. In contrast, the additional Y chromosome in XYY males and XXY females diminishes the heterochromatic signal in these normally silenced, repeat-rich regions, which is accompanied by an increase in expression of Y-linked repeats. Hundreds of genes were found that are expressed differentially between individuals with aberrant sex chromosome karyotypes, many of which also show sex-biased expression in wildtype Drosophila. Thus, differences in the chromatin landscape of males and females may also contribute to sex-biased gene expression and sexual dimorphisms (Brown, 2020).
Heterochromatin is associated with transcriptional repression. In contrast, several genes in the pericentromeric regions of Drosophila melanogaster are dependent on this heterochromatic environment for their expression. This study presents a comprehensive analysis of the epigenetic landscape of heterochromatic genes across all the developmental stages of Drosophila using the available histone modification and expression data from modENCODE. Heterochromatic genes were found to exhibit combinations of active and inactive histone marks that correspond to their level of expression during development. Nuclear Matrix Associated Regions (MARs) are prominently present in the intergenic regions of heterochromatic genes during embryonic stages suggesting their plausible role in pericentromeric genome organization. Taken together, this meta-analysis of the various genomic datasets suggest that the epigenomic and genomic landscape of the heterochromatic genes are distinct which could be contributing to their unusual regulatory features as opposed to the surrounding heterochromatin, which is repressive in nature (Saha, 2019).
Interphase chromatin is organized into topologically associating domains (TADs). Within TADs, chromatin looping interactions are formed between DNA regulatory elements, but their functional importance for the establishment of the 3D genome organization and gene regulation during development is unclear. Using high-resolution Hi-C experiments, this study analyzed higher order 3D chromatin organization during Drosophila embryogenesis and identified active and repressive chromatin loops that are established with different kinetics and depend on distinct factors: Zelda-dependent active loops are formed before the midblastula transition between transcribed genes over long distances. Repressive loops within polycomb domains are formed after the midblastula transition between polycomb response elements by the action of GAGA factor and polycomb proteins. Perturbation of PRE function by CRISPR/Cas9 genome engineering affects polycomb domain formation and destabilizes polycomb-mediated silencing. Preventing loop formation without removal of polycomb components also decreases silencing efficiency, suggesting that chromatin architecture can play instructive roles in gene regulation during development (Ogiyama, 2018).
This study shows that the 3D organization of the Drosophila genome is established during embryogenesis through the stepwise formation of TADs and chromatin compartments as well as of active and repressive chromatin loops anchored by Zelda and GAGA factor-dependent PREs. Moreover, this study dissected the function of PRE sequences and PRE looping interactions for the formation of polycomb domains and PcG-mediated repression (Ogiyama, 2018).
Developmental profiling shows that 3D genome folding involves multiple rapid processes, some of which occur simultaneously and others in a stepwise manner, and involves contacts at different scales of linear distances. Consistent with recent studies in mammals and Drosophila, this study observed that chromatin is largely unstructured in early embryos before ZGA (Hug, 2017, Ke, 2017). A few minutes later, at cycles 9-13, early boundaries corresponding to the Pol-II- and Zld-bound sites of genes of the first midblastula transcriptional wave and TADs begin to demarcate, albeit poorly, as previously described (Hug, 2017). Another rapid structural transition occurs within less than 20 min after mitosis 13 when early TADs become much more demarcated. This rapid structural transition is remarkable and unexpected. Physical modeling considers chromosome dynamics an important parameter in 3D genome folding regulation, and it has been proposed that chromosomes never reach equilibrium in interphase because of their relatively slow dynamic behavior in the confined nuclear space. Whereas most models used FISH data and in vivo chromatin tracking to calibrate their parameters to simulate dynamic behaviors, it will be interesting to take into account the rapid changes detected in the Hi-C data for model refinement (Ogiyama, 2018).
Another unprecedented finding is that genome compartmentalization of chromatin sharing distinct epigenetic marks occurs at different developmental stages and dynamics during Drosophila embryogenesis. Although active and PcG-repressed compartments are established soon after the major wave of ZGA, these two compartments are initially not well separated. This might reflect a higher degree of the genome switching compartments at early embryonic stages, as it has been previously observed during human ESC differentiation, and a higher degree of chromatin mobility during early development. In contrast, active and PcG-repressed compartments are well separated at the end of embryogenesis, correlating with a decrease in chromatin mobility, which might help to stabilize gene-regulatory states (Ogiyama, 2018).
These results extend previous findings (Blythe, 2016) and suggest that Zld and GAGA factor play a key role in chromatin organization during embryogenesis, not only locally but also by regulating 3D genome folding. In addition to defining a subset of TAD boundaries (Hug, 2017), Zld is required to induce long-range active gene contacts during the MBT. A second wave of active chromatin loops appears between Pol-II-bound sites after the MBT. These loops are independent of Zld but associated with classical insulator proteins, such as CTCF and GAGA factor. After MBT and the major wave of ZGA, GAGA factor acts together with Zld to determine the open chromatin structure of active gene promoters, and insulators. Another major role for GAGA factor is to set up repressive chromatin loops formed over shorter distances within polycomb domains. These loops are first observed at early cleavage cycle 14, at the end of MBT, and well ahead of the time at which Hox phenotypes can be detected in PcG mutants. Because previous studies suggested that GAGA factor can contribute to the formation of chromatin loops, this protein might directly mediate chromatin contacts or might recruit other chromatin-associated factors that mediate looping interactions. One such factor could be cohesin, which physically interacts with PcG proteins, associates with repressive looping anchor points (Eagen, 2017), and mediates looping within the engrailed and invected polycomb domain. However, the fact that GAGA factor-dependent loops are particularly strong in polycomb domains supports the hypothesis that GAGA factor might induce loops via recruitment of PRC1 proteins, such as Ph, which is able to mediate looping contacts via the oligomerization of its SAM domain. Because mutation of GAGA motifs within the PRE sequence induces not only loss of GAGA factor but also a loss of PcG recruitment, it was not possible to discriminate whether loss of PRE looping is the direct consequence of loss of GAGA factor binding or the consequence of loss of PcG binding in this case (Ogiyama, 2018).
A recent study showed that deletion of the four major PREs at the invected/engrailed gene locus, which are also involved in a repressive chromatin loop, does not substantially affect the 3D structure of this polycomb domain or the deposition of H3K27me3 (De, 2016). The authors concluded that these and other PREs in this highly complex locus might act in a redundant manner to establish repressive polycomb domains (De, 2016). In contrast, the current analysis of the simpler dac gene locus shows that disruption of a single PRE induces the loss of PRE looping interactions and that PREs act cooperatively to set up a repressive chromatin environment. Mutation of both PREs simultaneously at the dac gene locus induces the loss of all tested repressive chromatin marks, indicating that no additional PRE sequences are present at this polycomb domain. However, the two PREs are not of equivalent importance for the formation of the polycomb domain and target gene repression, and it will be exciting to investigate in the future whether the differential function of the PREs is encoded in their DNA sequence or whether it is determined by their genomic position (Ogiyama, 2018).
Unexpectedly, deletion of individual or both PREs is not sufficient to globally activate target gene expression in embryogenesis, which might be due to the absence of an essential activator outside of the dac expression domain. In contrast, changes in chromatin structure and contacts were detected at this stage, suggesting that these changes play a causal role in the observed dac gain-of-function phenotype observed at later developmental stages (Ogiyama, 2018).
The Hi-C time course data indicated that polycomb domains form gradually and are correlated with the formation of polycomb bodies and the deposition of H3K27me3. The current data also indicate that PRE looping interactions occur at the beginning of polycomb domain formation and are maintained during development. Two non-exclusive hypotheses for the importance of repressive chromatin loops can be postulated: initial PRE looping interactions might help PRE-anchored PcG complexes to contact neighboring nucleosomes and facilitate the deposition of H3K27me3 along polycomb domains. Alternatively, PRE looping contacts might contribute to establish a particular chromatin nano-compartment, enriched in PcG proteins and refractory to illegitimate activation by transcriptional components (Ogiyama, 2018).
Because deletion or mutations of PRE sequences lead to the loss of PRE looping interactions but also to the loss of PRE function, it is impossible to uncouple the importance of chromatin looping interactions from the role of PcG proteins in genome regulation. By inserting a functional gypsy insulator element between the two PREs of the dac gene locus, it was possible to block PRE interactions without affecting PRE function, allowing the demonstration that, although PRE looping contacts do not play a deterministic role in the establishment of polycomb domains and gene silencing, they are necessary to lock genes in a repressed state during development. This was, however, not due to loss of PcG proteins or of H3K27me3, suggesting that looping is rather important for the formation of a repressive nano-compartment. This function might be key to prevent improper gene activation frequently observed in cancer or other diseases (Ogiyama, 2018).
In summary, this work identified a multistep phenomenon leading to 3D genome organization in Drosophila as well as the importance and regulators of repressive chromatin loops. Future work combining epigenomics and microscopy with genome engineering should allow identifying the mechanisms by which 3D chromatin compartments fine-tune gene expression in this and other cases of gene regulation and 3D genome reprogramming (Ogiyama, 2018).
Chromatin is known to undergo extensive remodeling during nuclear reprogramming. However, the factors and mechanisms involved in this remodeling are still poorly understood and current experimental approaches to study it are not best suited for molecular and genetic analyses. This study reports on the use of Drosophila preblastodermic embryo extracts (DREX) in chromatin remodeling experiments. The results show that incubation of somatic nuclei in DREX induces changes in chromatin organization similar to those associated with nuclear reprogramming, such as rapid binding of the germline specific linker histone dBigH1 variant (Pérez-Montero, 2013) to somatic chromatin, heterochromatin reorganization, changes in the epigenetic state of chromatin, and nuclear lamin disassembly. These results raise the possibility of using the powerful tools of Drosophila genetics for the analysis of chromatin changes associated with this essential process (Satovic, 2018).
This study reports that incubation of somatic nuclei in DREX induces changes in chromatin organization similar to those associated with nuclear reprogramming. On one hand, rapid incorporation was observed of the Drosophila germline specific linker histone dBigH1 into the somatic nuclei. NT experiments performed in Xenopus and mammals showed that incorporation of the oocyte specific linker histone variants B4 and H1oo into the donor nuclei is an early event in nuclear reprogramming. B4 binding precedes loading of oocyte RNApol II and expression of a dominant negative B4 form significantly inhibits transcription of many reprogrammed genes. Along the same lines, expression of H1oo in mouse ESCs impairs differentiation although it does not improve iPSC formation. How oocyte specific H1s might contribute to nuclear reprogramming remains not well understood. Oocyte specific H1s are less positively charged than their somatic counterparts and, therefore, their interaction with DNA is weaker and condense chromatin less than somatic H1s, rendering it more accessible to chromatin modifiers, remodelers and transcription factors. In this regard, Xenopus B4 is more mobile than somatic H18 and B4-containing chromatin is more accessible to remodeling factors39. B4 binds pervasively across chromatin of the donor nuclei and, concomitantly, somatic H1s are released, suggesting competition of somatic H1s by the oocyte specific variants. However, this competition does not appear to play an important role in reprogramming since overexpression of somatic H1s does not interfere with B4 binding and subsequent activation of pluripotency genes. Moreover, in mouse fibroblasts, binding of H1oo is detected 10' after NT, while release of somatic H1s occurs later at 30' after NT20. Similarly, somatic H1s replacement can last hours in NT experiments with bovine cells. Finally, the results indicate that, upon incubation in DREX, dBigH1 binds along chromatin without affecting somatic dH1 occupancy. In fact, dH1 occupancy is significantly reduced only at short incubation times when dBigH1 binding is very low (Satovic, 2018).
The results also show that DREX induces changes in the epigenetic landscape of chromatin, which are in agreement with the global epigenetic remodeling of chromatin observed during reprogramming of somatic cells to iPSCs. In particular, increased global H3Ac was observed that is maintained throughout the incubation time course. Increased histone acetylation is observed in fully reprogrammed iPSCs40 and ESC chromatin is hyperacetylated compared to differentiated cells. It was also observed that H3K4me3 levels increased more intensively at promoters of developmentally regulated genes that are silent in S2 cells but highly expressed in early embryogenesis, suggesting their reactivation. Interestingly, pluripotency-related and developmentally regulated genes are known to acquire H3K4me3 at promoters during nuclear reprogramming. Finally, though not statistically significant, global levels of H3K4me3 and the chromatin bound promoter-proximal active RNApol IIoser form tend to increase at short incubation times. In this regard, NT experiments in Xenopus showed loading of oocyte basal transcription factors and RNApol II leading to genome-wide transcriptional reprogramming and selective activation of pluripotency genes. Notably, these results showed that increased histone acetylation induced by DREX does not require binding of dBigH1, suggesting that, at least in part, the epigenetic changes occurring during reprogramming do not depend only on the activities of the oocyte specific H1s (Satovic, 2018).
Incubation in DREX also induces profound changes in chromatin/nuclear organization. On one hand, at short incubation times, DREX induces heterochromatin reorganization since HP1a/H3K9me3 foci disassemble. A decrease in the number of HP1a foci has also been reported during reprogramming to iPSC. In this regard, chromatin of pluripotent cells is largely decondensed and heterochromatin is organized in larger and fewer domains that become smaller, more abundant and hypercondensed as cells differentiate. Interestingly, incubation in DREX did not decrease H3K9me2 occupancy at multiple heterochromatic elements, suggesting that DREX affects condensation but not the actual heterochromatin content of somatic nuclei. Oocyte specific H1s might be one of the factors contributing to heterochromatin decondensation since, in humans, H1oo is required for decondensation of sperm chromatin. At long incubation times, HP1a foci reform and extrude from nuclei. Interestingly, extrusion of heterochromatic sequences was also reported in somatic plant cells undergoing meiosis. Finally, it was also observed that DREX induces disassembly of nuclear lamin, a nuclear envelope component of differentiated cells that is absent in ESCs. Similar results were reported earlier using a Drosophila oocyte cell-free extract. Nuclear lamin disassembly is considered a marker of reprogrammed cells, since it is detected at the nuclear envelope in partial iPSCs, but not in fully reprogrammed iPSCs40. Interestingly, nuclear lamin disassembly strongly correlates with heterochromatin reorganization, which might account for the heterochromatin extrusion observed after long-term exposure to DREX (Satovic, 2018).
In summary, these results show that DREX induces several changes associates with gain of pluripotency, such as binding of the germline specific linker histone dBigH1, epigenetic remodeling, heterochromatin reorganization and nuclear lamin disassembly. However, it is highly unlikely that DREX induces full reprogramming of somatic nuclei. Nevertheless, the use of DREX offers the possibility of applying the powerful genetics techniques developed in Drosophila to the analysis of factors and mechanisms involved in chromatin remodeling during this essential process (Satovic, 2018).
The Spt-Ada-Gcn5-acetyltransferase (SAGA) chromatin-modifying complex possesses acetyltransferase and deubiquitinase activities. Within this modular complex, Ataxin-7 anchors the deubiquitinase activity to the larger complex. This study identified and characterized Drosophila Ataxin-7 (CG9866)and found that reduction of Ataxin-7 protein results in loss of components from the SAGA complex. In contrast to yeast, where loss of Ataxin-7 inactivates the deubiquitinase and results in increased H2B ubiquitination, loss of Ataxin-7 results in decreased H2B ubiquitination and H3K9 acetylation without affecting other histone marks. Interestingly, the effect on ubiquitination was conserved in human cells, suggesting a novel mechanism regulating histone deubiquitination in higher organisms. Consistent with this mechanism in vivo, this study found that a recombinant deubiquitinase module is active in the absence of Ataxin-7 in vitro. When the consequences of reduced Ataxin-7 were examined in vivo, it was found that flies exhibited pronounced neural and retinal degeneration, impaired movement, and early lethality (Mohan, 2014).
The Spt-Ada-Gcn5-acetyltransferase (SAGA) chromatin-modifying complex is a highly conserved 2-MDa protein complex comprised of ∼20 subunits. The complex is arranged in a modular fashion and contains two enzymatic activities: an acetyltransferase activity associated with the GCN5/Pcaf subunit and a deubiquitinase activity associated with the yUbp8/dNon-stop/hUsp22 subunit. In yeast, the ataxin-7 homolog is Sgf73, and studies have shown that it anchors the deubiquitinase module (which includes Sgf73, Sgf11, Sus1, and Ubp8) to the SAGA complex. Crystal structures and biochemical analysis of the yeast deubiquitinase module have shown that the N terminus of Sgf73 extends deep into the deubiquitinase module, intertwining between the components of the module to ensure an active conformation for the deubiquitinase. Without Sgf73, the deubiquitinase is inactive. The carefully orchestrated addition and removal of ubiquitin on H2B are important regulators of transcription. H2B monoubiquitination is a prerequisite for di- and trimethylation of H3K4 and H3K79, modifications associated with transcriptionally active chromatin (Mohan, 2014).
This study identified the gene product of CG9866 as the Drosophila homolog of ataxin-7 (Ataxin-7). Biochemical analysis, including affinity purification, multidimensional protein identification technology (MudPIT) proteomic analysis, and gel filtration chromatography, establish that Ataxin-7 is a stable component of the SAGA chromatin remodeling complex. Analysis of SAGA from Ataxin-7-deficient flies revealed the loss of components from the SAGA complex, consistent with a role for Ataxin-7 in anchoring the deubiquitinase module to the complex. In contrast to the increased H2Bub observed upon loss of ataxin-7 in yeast, this study observed a decrease in H2B ubiquitination in Drosophila with no associated changes in histone methylation and a reduction in the levels of H3K9 acetylation but not K3K14 acetylation. This surprising change in H2B ubiquitination was confirmed in human cells in which knockdown of human Ataxin-7 also resulted in decreased H2B ubiquitination. It is hypothesized that this decrease reflects the release of an active deubiquitinase module from SAGA and, consequently, loss of SAGA-associated regulation of the deubiquitinase activity. Consistent with this model, it was found that the deubiquitinase is enzymatically active when the complex is reconstituted in vitro even in the absence of Ataxin-7. An examination of flies with reduced expression of Ataxin-7 showed that loss of Ataxin-7 results in neural and retinal degeneration, impaired movement, and decreased life span (Mohan, 2014).
This study presents the first study of Drosophila Ataxin-7. Ataxin-7 shares primary amino acid sequence conservation with human Ataxin-7 and, accordingly, is also a member of the SAGA chromatin-modifying complex. SAGA subunits are lost in the absence of Ataxin-7, resulting in a global decrease in the levels of H2B ubiquitination and H3K9 acetylation without affecting H3K14ac, H3K4me2/3, or H3K79me3. Because Ataxin-7 associates with the intact SAGA complex and loss of Ataxin-7 results in fragmentation of the complex, this study explored whether decreased levels of H2B ubiquitination were due to release of an active deubiquitinase module from SAGA. Indeed, the deubiquitinase module assembled in vitro from purified Non-stop, E(y)2, and Sgf11 is enzymatically active and is unaffected by the presence or absence of Ataxin-7. In vivo, disruption of Ataxin-7 expression leads to severe neural and retinal degeneration, limited life span, and defective locomotion. These defects are at least in part due to elevated deubiquitinase activity, since loss of one copy of the Non-stop deubiquitinase suppresses the lethality of Ataxin-7 mutants (Mohan, 2014).
These results suggest a more elaborate mode of regulation of histone ubiquitination in higher eukaryotes than that found in yeast. Studies in Saccharomyces cerevisiae showed that the deubiquitinase module comprised of Sgf73, Ubp8, Sgf11, and Sus1 is arranged so that each member of the module is in contact with the other three, and these contacts establish an enzymatically active module. Truncation of the Sgf73 N terminus led to an enzymatically inactive module. In contrast, in Drosophila and in human cells, the presence of Ataxin-7 is not necessary for deubiquitinase activity, and, instead, loss of Ataxin-7 results in increased deubiquitination and reduced levels of H2B ubiquitination (Mohan, 2014).
Interestingly, it wa found that this decrease in H2B ubiquitination does not coincide with a decrease in H3K4me2/3 or H3K79me3. Previously, it was shown that H2B ubiquitination was necessary for recruitment of methyltransferases to place these marks, and reduction of the Drosophila Bre1 E3 ubiquitin ligase results in loss of both H2B ubiquitination and H3 methylation. If indeed H2B ubiquitination is required for H3 methylation, then the mark would have been placed and then removed post-methylation by the mistargeted deubiquitinase module. This suggests that methylation may not be affected by the loss of Ataxin-7 because the deubiquitinase acts after ubiquitination-dependent methylation has occurred. It is also possible that a high level of H2B ubiquitination is not required to target methyltransferases. Recently, it was shown in yeast that loss of Chd1 results in reductions in H2B ubiquitination without a corresponding loss of H3K4 or H3K79 trimethylation, suggesting that very low levels of H2B ubiquitination are enough to target methyltransferases. Alternatively, it is possible that another mechanism exists for targeting methyltransferases when levels of ubiquitinated H2B are low or absent. During muscle differentiation, the RNF20 ubiquitin ligase is depleted in differentiated myotubes, resulting in the absence of H2B ubiquitination, yet trimethylation of H3K4 and H3K79 is detected on chromatin lacking H2B ubiquitination. Recently, it was shown that transcription factors such as p53 along with p300 can recruit the SET1 histone methyltransferase complex independent of H2B ubiquitination (Mohan, 2014).
Mutation of different SAGA subunits does not necessarily result in changes in expression of the same sets of genes. Disrupting expression of subunits in the acetyltransferase module versus the deubiquitinase module affects a curiously divergent set of genes, indicating that different genes have varying requirements for each catalytic activity of the complex. A dissociated active deubiquitinase module may play a role in this sophisticated regulation, since a free ubiquitin protease module could act on ubiquitinated chromatin independent of SAGA recruitment and regulation. In yeast lacking Sgf73, Sus1 is released from SAGA but is still recruited to genes, albeit at a reduced level. Moreover, early reports identified USP22 as an H2Aub deubiquitinase acting on polycomb-regulated genes. In principal, SAGA may release subcomplexes to participate in diverse functions. It was recently shown in S. cerevisiae that the proteasome is capable of pulling the enzymatically active DUB module, including Sgf73, Sgf11, and Sus1, from SAGA, indicating that separation of SAGA may normally occur. Further investigation into the modularity of SAGA and its implications for transcriptional regulation in vivo may aid in understanding how this complex might provide a sophisticated mechanism for chromatin modification and gene regulation. In addition to direct chromatin modification, Non-stop and its homologs have also been shown to act on nonhistone proteins, and it is possible that nonhistone targets play a significant role in SCA7 pathology (Mohan, 2014).
SCA7 and other polyglutamine expansion diseases have divergent pathologies. These differences suggest that the function of the expanded protein is critical to the etiology of each disease. Unfortunately, there is currently a limited understanding of the molecular functions of the wild-type proteins and what role loss or gain of function might play in the characteristic patterns of neural degeneration found upon polyglutamine expansion. In initial studies examining the wild-type function of other SCA proteins, loss of the wild-type protein did not entirely phenocopy the polyglutamine expansion disease. For example, Ataxin-1 knockout mice do not show cerebellar or brainstem degeneration but do exhibit cognitive defects and ataxia. The observations of this study suggest that loss of Ataxin-7 function may play a role in SCA7 disease progression. Polyglutamine-expanded Ataxin-7 is resistant to proteolysis, accumulating in cells. At the same time, the wild-type protein is only produced from one genomic copy, and the resulting protein is subjected to normal proteolysis. Furthermore, the Ataxin-7 N terminus extends into the deubiquitinase module, and this region is subject to polyglutamine tract expansion. Polyglutamine-expanded Ataxin-7 is present in the larger SAGA complex, but the deubiquitinase module was not specifically analyzed in this model. It will be interesting to examine whether the deubiquitinase module is also recruited to the complex under these conditions and, if so, whether it is enzymatically active. If the deubiquitinase module does not form around the expanded Ataxin-7, tracking the localization of this module may be insightful. It will also be interesting to examine the potential regulation of the deubiquitinase module independent of SAGA under wild-type conditions. Understanding what might trigger release of the module from the larger complex will be critical to understanding the role of SAGA in neural and retinal stability (Mohan, 2014).
Imaging and chromosome conformation capture studies have revealed several layers of chromosome organization, including segregation into megabase-sized active and inactive compartments, and partitioning into sub-megabase domains (TADs). It remains unclear, however, how these layers of organization form, interact with one another and influence genome function. This study shows that deletion of the cohesin-loading factor Nipbl in mouse liver leads to a marked reorganization of chromosomal folding. TADs and associated Hi-C peaks vanish globally, even in the absence of transcriptional changes. By contrast, compartmental segregation is preserved and even reinforced. Strikingly, the disappearance of TADs unmasks a finer compartment structure that accurately reflects the underlying epigenetic landscape. These observations demonstrate that the three-dimensional organization of the genome results from the interplay of two independent mechanisms: cohesin-independent segregation of the genome into fine-scale compartments, defined by chromatin state; and cohesin-dependent formation of TADs, possibly by loop extrusion, which helps to guide distant enhancers to their target gene (Schwarzer, 2017).
The three-dimensional organization of chromosomes is tightly related to their biological function. Genome-wide chromosome conformation capture (Hi-C) maps have revealed key features of the 3D organization of metazoan chromosomes, including compartmentalization, TADs, and interaction peaks. Compartmentalization is visible as a characteristic checkerboard pattern of contact enrichment both in cis and in trans between megabase-sized genomic intervals of the same type, reflecting spatial segregation of transcriptionally active (type A) and inactive (type B) chromatin. TADs appear as squares of enriched contact frequency with sharp boundaries; they usually span hundreds of kilobases and do not necessarily exhibit any checkering. TADs are thought to contribute to gene expression, notably by promoting or preventing interactions between promoters and distant regulatory elements. Finally, peaks (often termed loops are visible as focal enrichments in contact frequency between pairs of loci, often at the corners of TAD squares. Despite a lack of supporting mechanistic experiments, compartments, TADs and peaks are typically assumed to constitute hierarchical levels of chromosome folding; however, the connection between them remains poorly understood (Schwarzer, 2017).
Architectural proteins, notably cohesin and the transcriptional insulator CTCF, are believed to have crucial roles in chromatin organization during interphase. Cohesin and CTCF co-localize at TAD boundaries and the bases of Hi-C peaks, but their roles have not been fully defined. The recently proposed loop extrusion model yields predictions that are consistent with previous experimental results. In this model, TADs emerge from the progressive extrusion of chromatin loops by a protein complex (for example, cohesin) until it dissociates from chromatin or reaches a boundary element and CTCF depletion support the proposed role of CTCFs as boundary elements, but the role of cohesin in interphase chromatin organization and the process of loop extrusion remains unclear, as experimental depletions of cohesin have shown limited impact on chromatin organization (Schwarzer, 2017).
To identify the role of cohesin in interphase chromatin, Nipbl (yeast homolog: Scc2, Drosophila homolog: Nipped-B), which is necessary for loading of cohesin onto chromatin, was deleted. As the turnover period of chromosome-bound cohesin is about 20 minutes, constant loading is required for its presence on DNA. Efficient deletion of Nipbl was achieved in non-dividing hepatocytes by using a liver-specific, tamoxifen-inducible Cre driver, which circumvents the lethality of Nipbl+/- mice and the essentiality of cohesin in dividing cells. Ten days after tamoxifen injection, Nipbl expression was greatly reduced, leading to displacement of cohesin from the chromatin fraction to the soluble nuclear fraction, which indicates loss of cohesin from chromosomes. This strong depletion of chromatin-bound cohesin was also observed by calibrated chromatin immunoprecipitation and sequencing (ChIP-seq) for RAD21 and SMC3 both genome-wide. A conservative estimate indicates at least a fourfold to sixfold decrease in chromatin-associated cohesin. The liver showed no particular pathological signs compared to control mice. Hepatocytes showed no sign of cell death or proliferation (Schwarzer, 2017).
To assess the consequences of Nipbl depletion and cohesin loss on chromosome organization, tethered chromatin conformation capture (a variant of Hi-C) was performed on purified hepatocytes from wildtype, tamoxifen control (TAM) and ΔNipbl mice. For each of these three conditions, two biological replicates were generated; five out of six replicates produced more than 25 million interactions at separations of over 10 kb, on par with other primary tissue Hi-C result. The contact maps obtained from each biological replicate showed extensive similarities, allowing pooling of the two replicate data sets to generate Hi-C maps for the three different conditions. Hi-C maps were compared for ΔNipbl and control cells by examining compartments, TADs, peaks, and global scaling of the contact probability P(s) (Schwarzer, 2017).
The Hi-C maps reveal that Nipbl deletion has a striking effect on genome organization, contrasting with the very mild changes reported in previous cohesin depletion experiments. Compared to wild-type and TAM control samples, ΔNipbl cells show genome-wide disappearance of local TAD patterns but persistence of type A-type B compartmentalization. Disappearance of TADs in ΔNipbl cells is widespread and can be seen in individual maps, as well as on the composite map constructed by averaging the ΔNipbl Hi-C map around locations of domain boundaries detected in wild-type maps. Some local organization is retained in regions with higher activity (A compartments), where cohesin is about threefold more abundant than in inactive regions (B compartments) in TAM cells. It was shown that these structures are not residual or new TADs, but unmasked small compartments. TAD-associated peaks of contact enrichment also disappear in ΔNipbl maps, showing up to fourfold reduction in contact frequency, notably between convergent CTCF sites. Insulation and directionality of the contact footprint of CTCF sites are also absent in ΔNipbl cells. However, no effect of Nipbl deletion was observed on CTCF occupancy, demonstrating that the loss of TADs and the CTCF contact footprint in ΔNipbl cells is not due to loss of or changes in CTCF occupancy. This finding strengthens the idea that CTCF and cohesin have distinct roles in shaping chromosome architecture (Schwarzer, 2017).
The changes seen in ΔNipbl cells cannot be attributed to altered gene expression, since TAD patterns vanished equally in regions where gene expression was unchanged and in those where it was upregulated or downregulated. This major reorganization of chromatin architecture is also reflected in plots of contact frequency P(s) as a function of genomic distance s. Loss of chromatin-associated cohesin led to disappearance of the first regime, producing a single decay of contact probability across the whole range. This observation suggests that the first scaling regime reflects the compaction of the genome associated with TADs. This was confirmed by calculating P(s) separately within and between wild-type TAD intervals. In control cells, P(s) within TADs decreases more slowly than P(s) between TADs. In ΔNipbl cells, the within-TAD P(s) curve collapses to the between-TAD P(s) curve, indicating that the characteristic enrichment of contacts within TADs is lost and that chromatin folding becomes more uniform and decompacted, consistent with decompaction observed by imaging upon Nipbl reduction (Schwarzer, 2017).
Next, the effects of Nipbl depletion were simulated in a model of loop extrusion, by reducing the number of extruding cohesins. For each simulated concentration of extruding cohesins, Hi-C maps and P(s) were calculated within and between TADs. In these simulations, eightfold depletion of extruding cohesins was required to achieve agreement with the experimental data, manifested by (1) noticeable disappearance of TAD and corner peak enrichments; (2) loss of the P(s) ~ s-0.7 regime in the scaling; and (3) decompaction of chromatin. Together, these analyses and the observed effects of Nipbl deletion indicate that cohesin has a central role in the local compaction of chromosomes, and support the idea that this effect is mediated by the production of dynamic populations of extruded chromatin loops between boundary elements, which form TAD and corner peak patterns in interphase Hi-C maps (Schwarzer, 2017).
The compartmentalization of chromatin was examined in ΔNipbl cells. As noted earlier, in contrast to the marked loss of TADs, compartmentalization still exists. Moreover, closer examination of Hi-C data and compartment tracks reveals the emergence of series of shorter compartmental intervals in ΔNipbl cells, with small B-like regions appearing inside A regions. This finer compartmentalization is reflected in the shorter autocorrelation length of the compartment track: 150 kb in ΔNipbl cells versus around 500 kb in wild-type and TAM cells. The finer compartmentalization explains most of the remaining or new domains and boundaries seen in ΔNipbl Hi-C maps. These emerging B-like regions possess the hallmarks of compartmentalization: they are visible as local depressions in the ΔNipbl compartment track and they show preferential interactions with other B-regions in both far cis- and trans-chromosomal maps. As their diagonal squares lack both corner peaks and enriched borders, and exhibit mutual checkering, it is concluded that these intervals do not represent newly formed TADs. By contrast, predominantly B-type regions of the genome do not show similar fragmentation in ΔNipbl cells, despite a complete loss of TAD patterns. Importantly, the appearance of finer compartmentalization upon Nipbl deletion shows that wild-type TADs can span regions of intrinsically opposing compartment type. Together, these observations defy the common notion of TADs simply being the building blocks of larger compartmental segments; instead, it is concluded that TADs and compartments represent two independent, potentially antagonistic types of chromosomal organization (Schwarzer, 2017).
Notably, it was found that the compartmentalization profile of the ΔNipbl Hi-C map reflects local transcriptional activity and chromatin state better than that of the wild-type Hi-C map. The compartment track of ΔNipbl cells shows a stronger correlation with tracks of activity-associated epigenetic marks (for example, acetylation of H3K27 (H3K27ac), trimethylation of H3K4 (H3K4me3), DNase hypersensitivity, transcription factor binding), smoothed over a wide range of window sizes. To understand the relationship between epigenetic state and the change in compartment status, the compartment tracks were compared to the mouse liver chromatin state segmentation simplified into three state categories: active, repressed and inert. Whereas inert regions are relatively unaffected by Nipbl deletion, regions of repressed and active chromatin further diverge in their compartment status, producing local peaks in the compartment track in active regions, and local B-like depressions in repressed regions. Furthermore, regions of facultative lamin-B1 association are enriched in regions that show a reduction in compartment signal (from A-like to B-like), while those showing lamin-B1 association across different mouse cell lines are primarily B-type in both wild-type and mutant cells. These changes in compartmentalization cannot be attributed to changes in expression or activity marks (H3K27ac and H3K4me3), which are largely unperturbed in ΔNipbl cells at the scales relevant to compartmentalization. In summary, absence of cohesin enhances the compartmentalization of active and inactive chromatin as follows: (1) A and B regions, as detected by Hi-C, form fewer contacts between one another; (2) a finer compartment division emerges; (3) and this finer compartment structure corresponds better to the local functional states of the genome, even when considering those observed in wild-type cells. The fact that no effect on activity marks indicates that cohesin and TADs do not play a role in the maintenance of epigenetic state, though this does not rule out possible roles in its establishment. A recent study has observed a similar preservation of epigenetic state upon CTCF depletion (Schwarzer, 2017).
These results indicate that chromatin has an intrinsic tendency to segregate into compartments based on the local epigenetic landscape and transcriptional activity, and that Nipbl and cohesin activity interfere with this clear subdivision by bringing together and mixing loci with opposing states (Schwarzer, 2017).
Next the effect of Nipbl deletion and disappearance of TADs on transcription were examined. About a thousand genes show significant changes in expression in ΔNipbl cells. Gene ontology enrichment analysis does not give a strong indication of preferential effect on a biological function, reflecting possibly indirect and adaptive transcriptional changes (Schwarzer, 2017).
While peaks of H3K27ac (and H3K4me3) at the promoters of affected genes change in coherence with expression changes, distal peaks (marking active distant enhancers) are mostly unaffected, indicating that although transcriptional changes did occur, the regulatory potential of the cells was mostly unperturbed. There is so far no reliable way to identify a priori genes for which distal regulatory interactions are essential. However, it was noticed that downregulated genes were surrounded by a larger intergenic space (defined by the distance separating the transcription start sites (TSSs) of their immediate neighbours) than upregulated or unaffected ones and that transcriptional changes were concentrated within regions that formed larger TADs in wild-type cells. This characteristic genomic context of transcriptional alterations is consistent with defective long-range regulatory interactions in ΔNipbl cells (Schwarzer, 2017).
Closer examination of RNA sequencing (RNA-seq) tracks revealed widespread upregulation of exogenic (intergenic or antisense intragenic) transcription in ΔNipbl cells. While genes are more often downregulated than up-regulated in ΔNipbl hepatocytes, a clear opposite trend was observed in exogenic regions, where transcription is more frequently upregulated than reduced. Using a conservative approach, this study found 1,107 non-genic transcripts or transcribed regions that showed at least an eightfold increase in transcription in ΔNipbl cells; among these, 232 corresponded to non-coding RNAs, which were not or barely detected in control samples, and often not annotated. The new transcription is often bi-directional and occurs either at small pre-exiting H3K4me3 peaks, which are likely to correspond to poised promoters, or at active H3K27ac enhancers. Several examples were seen of reciprocal expression changes (that is, downregulation of a gene being followed by upregulation of an adjacent gene or of a new non-coding transcript), but often, new non-coding transcription arose without measurable impact on surrounding genes. While the chromatin profile suggests that enhancers retain their normal activity and therefore regulatory potential, this rise in intergenic transcription initiated in the vicinity of distal regulatory elements suggests that Nipbl deletion impairs enhancer communication: with a reduced range of contact owing to the absence of TADs, some enhancers may not reach their target promoters and instead transfer their activity to nearby alternative, sometimes cryptic, targets (including themselves) (Schwarzer, 2017).
Overall, the findings provide insights into the mechanisms that generate the 3D organization of the interphase genome and their relation to gene expression. The data show that cohesin is essential for the formation of TADs. It is possible that in previous studies that did not report such drastic effects cohesin depletion was insufficient to achieve substantial loss of TADs. The simulations carried out in this study suggest that TADs would still be pronounced at twofold cohesin depletion, and that approximately eightfold depletion is required to induce loss of TADs; this is close to what was observed in ΔNipbl samples. It is, however, also possible that Nipbl may affect cohesin activity at several levels, that is, not only as a loader but also by facilitating ATP hydrolysis and loop extrusion (Rhodes, 2017), which could further affect TAD formation. This could also account for why a recent deletion of Mau2 (also known as Scc4), a co-factor of NIPBL, showed a milder effect on TAD formation (Schwarzer, 2017).
While this manuscript was under review, several preprints reported direct perturbations to cohesin, producing results consistent with this study and thus providing additional support that loss of cohesin is responsible for the effects that were observed. Single-nucleus Hi-C for cohesin (Rad21, also known as Scc1) knockout zygotes demonstrated complete loss of TADs and associated peaks (Gassler, 2017). However, the emergence of fine compartmentalization was not reported, probably owing to the limited number of nuclei studied. Two other preprints reporting degron-mediated depletion of RAD21 in human cell lines observed concordant effects on TADs, peaks, and compartmentalization to the current study. In addition, one of these studies reported that cohesin depletion resulted in the emergence of 'loops' enriched in super-enhancers. However, unlike the sharp, cis-only focal Hi-C peaks previously referred to as loops, these patterns manifest as grids of larger patches of contact enrichment appearing both in cis and in trans, and hence cannot be loops. Indeed, such features were observed in the current data using a new Hi-C browser, and it was reported earlier that these patches simply appear to be strong compartmental interactions between very active regions and tend to line up well with transcribed genes (Schwarzer, 2017).
The results challenge the classic picture of genome architecture in which TADs, peaks and compartments represent well-defined hierarchically folded entities, with individual TADs combining to form compartmental regions. The observations show that there are at least two mechanisms of independent origin whose overlapping action organizes mammalian interphase chromatin. The first mechanism, global spatial segregation and compartmentalization of the genome into active and inactive compartments, is achieved by a cohesin-independent mechanism, which acts globally and at varying scales, including scales smaller than previously appreciated. This compartmentalization is, however, suppressed by the action of a second, cohesin-dependent, mechanism that compacts chromatin locally, independently of its status, resulting in formation of TAD patterns in Hi-C maps. Notably, the independence of the two mechanisms is supported by the absence of compartments, despite the existence of TADs, in maternal zygotic pronuclei. Local compaction can be achieved by a cohesin-driven and energy-dependent loop extrusion mechanism, akin to the motor function recently observed in another SMC complex, condensin (Terakawa, 2017; Nuebler, 2017). demonstrated that active loop extrusion in a polymer model of chromatin can indeed lead to reduced compartmentalization and failure of short compartmental segments to segregate, as observed here (Schwarzer, 2017).
The co-existence of two processes with different modes and scales of action can help to explain the difficulties in the field in delineating and unambiguously classifying the different features of Hi-C maps, leading to a confusing plethora of denominations and definitions. The current data clearly demonstrate the existence of local, cohesin-dependent, self-interacting domains identifiable as TADs. The experimental ability to distinguish between the two modes of chromosome organization should enable investigations of the process(es) that govern their formation and maintenance, as well as characterization of their relationships to gene expression. In this respect, while compartmental segregation may facilitate the maintenance of regulatory interactions, the ability of cohesin to counteract segregation and bring regions of different activities together may play an essential role in initiating changes in gene expression driven by distant enhancers (Schwarzer, 2017).
Despite an abundance of new studies about topologically associating domains (TADs), the role of genetic information in TAD formation is still not fully understood. This study used HiCExplorer to annotate >2800 high-resolution (570 bp) TAD boundaries in Drosophila melanogaster. Eight DNA motifs enriched at boundaries were identified, including a motif bound by the M1BP protein, and two new boundary motifs. In contrast to mammals, the CTCF motif is only enriched on a small fraction of boundaries flanking inactive chromatin while most active boundaries contain the motifs bound by the M1BP or Beaf-32 proteins. Boundaries can be accurately predicted using only the motif sequences at open chromatin sites. It is proposed that DNA sequence guides the genome architecture by allocation of boundary proteins in the genome. Finally, an interactive online database is presented to access and explore the spatial organization of fly, mouse and human genomes (Ramirez, 2018).
How the DNA packs into the nucleus and coordinates functional activities is a long-standing question in biology. Recent studies have shown that the genome of different organisms is partitioned into chromatin domains, usually called topologically associated domains (TADs), which are invariable between cell types and evolutionary conserved in related species (Ramirez, 2018).
To understand TAD formation, researchers had focused on the proteins found at TAD boundaries. In mammalian cells, the CCCTC-binding factor (CTCF) protein has been shown to be enriched at chromatin loops, which also demarcate a subset of TAD boundaries (referred to as 'loop domains'). A proposed mechanism, based on the extrusion of DNA by cohesin, suggests that the DNA-binding motif of CTCF and its orientation determine the start and end of the loop. In line with this hypothesis, deletions of the CTCF DNA-motif effectively removed or altered the loop or caused changes in gene~enhancer interactions that lead to developmental abnormalities in mouse embryos. Additionally, acute depletion of CTCF leads to loss-of-TAD structure on CTCF containing boundaries. However, CTCF-cohesin loops only explain a fraction (<39%) of human TAD boundaries, while plants and bacteria lack CTCF homologs but also show TAD-like structures. Thus, it is possible that additional factors are involved in the formation of TADs (Ramirez, 2018).
In contrast to mammals, the genetic manipulation tools available in flies have allowed the characterization of several proteins that, like CTCF, are capable of inhibiting enhancer-promoter interactions. Throughout this paper these proteins are refered to as 'insulator proteins' and their binding motifs as 'insulators' or 'insulator motifs'. In flies, apart from CTCF, the following DNA-binding insulator proteins have been associated to boundaries: Boundary Element Associated Factor-32 (Beaf-32), Suppressor of Hairy-wing (Su(Hw)), and GAGA factor (GAF). Also, Zest white 5 (Zw5) has been proposed to bind boundaries. These insulator proteins recruit co-factors critical for their function, such as Centrosomal Protein-190 (CP190) and Mod(mdg4)12. Recently, novel insulator proteins have been described as binding partners of CP190: the zinc finger protein interacting with CP190 (ZIPIC), Pita which appear to have human homologs and localizes to TAD boundaries, and the Insulator binding factors 1 and 2 (Ibf1 and Ibf2). Except for CP190 and Mod(mdg4), all previously characterized boundary associated proteins bind to specific DNA motifs, suggesting that the 3D conformation of chromatin can be encoded by these motifs (Ramirez, 2018).
This study sought to identify the DNA encoding behind TAD boundaries in flies. First, software (HiCExplorer) was developed to obtain boundary positions at 0.5 kilobase resolution based on published Hi-C sequencing data from Drosophila melanogaster Kc167 cell line. Using these high-resolution TAD boundaries, eight significantly enriched DNA-motifs were identified. Five of these motifs are known to be bound by the insulator proteins: Beaf-32, CTCF, the heterodimer Ibf1 and Ibf2, Su(Hw) and ZIPIC. A large fraction of boundaries contain the motif bound by the motif-1 binding protein (M1BP), a protein associated to constitutively expressed genes. This motif has recently been found at boundaries. The two remaining DNA-motifs have not been associated to boundaries before. Surprisingly, it was found that depletion of Beaf-32 has no major effect on chromosome organisation, while the depletion of M1BP leads to cell arrest in M-phase and dramatically affects the Hi-C results. Using machine learning methods based on the acquired DNA-motif information, boundaries were accurately distinguish from non-boundaries and TAD boundaries that were missed when using only Hi-C data were identified. The results suggest that the genome architecture of flies can be explained predominantly by the genetic information. The methods for Hi-C data processing, TAD calling and visualization were implemented into an easy to use tool called HiCExplorer. To facilitate exploration of available Hi-C data, an interactive online database was provided containing processed high-resolution Hi-C data sets from fly, mouse and human genome (Ramirez, 2018).
This study used high resolution (DpnII restriction enzyme) and deeply sequenced (~246 million reads) Hi-C data to map the genomic positions of TAD boundaries within ~600 bp in D. melanogaster. This analysis revealed a larger number of TADs, including many small active TADs (23 kb mean length), that were absent in previous reports. TAD size, boundary strength, chromatin marks, gene orientation, and transcription at the TADs were characterized. Motif calling was performed at boundaries, validating the presence of known insulators, along with M1BP motif, which recently has also been shown to be associated to boundaries and core promoter motif 6 and motif 8, which have not been associated to boundaries before. Using different machine learning methods, this study found that DNA motifs and open chromatin are sufficient to accurately predict a major fraction of fly boundaries. Finally, a set of useful tools and a resource is presented for visualization and annotation of TADs in different organisms (Ramirez, 2018).
This study verifies various properties of fly boundaries indicated in previous publications. Most boundaries associate with promoters and active chromatin (Hou, 2012) and known insulator proteins are enriched at boundaries. A comprehensive set of core promoter motifs are detected at boundaries, including the newly discovered M1BP motif, and motifs which have been associated to housekeeping gene expression. However, some of the results contradict previous observations. For example, it was find that genes at boundaries have higher expression and lower variability of expression throughout fly development. This in line with (Hug, 2017; Ulianov, 2016) but in contrast with Hou (2012), who suggest that gene density and not the transcriptional state is important for boundary formation. Unlike Hou this study found that genes at boundaries tend to be divergently transcribed. In contrast to various earlier studies, CTCF does not appear to be a major boundary associated insulator in flies. This study also shows that the number of insulator motifs at boundaries correlates very little with boundary strength (Ramirez, 2018).
Most of these differences are due to the increased resolution of detected boundaries and the combined analysis of DNA motifs with ChIP-seq data, rather than ChIP-Seq peaks alone. This study shows that correlating boundaries with ChIP-Seq peaks alone is not a good measure when it comes to determinants of boundary formation. Many DNA-binding proteins show co-localization in ChIP-Seq data without presence of the corresponding DNA motifs. This is possible due to cross-linking artifacts and indirect binding, which is, in fact, aggravated at boundaries, which tend to contact each other in 3D space (Ramirez, 2018).
Another argument for considering motifs is the contradicting case of CTCF at boundaries. In contrast to earlier studies based on CTCF ChIP-seq, this study found that the CTCF motif is rarely associated to boundaries. This difference is caused by the quality of the ChIP-seq data that can produce spurious peaks. For example, a significant enrichment of CTCF at boundaries was observed in the ChIP-Seq data from another study. On the other hand, ChIP-seq data sets show significant enrichment if ChIP-seq peaks are only considered that contain the CTCF motif. For CTCF, and in general for ChIP-seq experiments in flies, 'phantom peaks' are known to occur at active promoters. Thus, to avoid misleading results the current analyses are based on motif presence when possible and for ChIP-Seq data sets, significance threshold along with motif binding affinity are used for analysis (instead of taking a significance cutoff alone) (Ramirez, 2018).
This study observed that boundary strength is associated with the chromatin states of flanking TADs and particular motif combinations, but is not associated with the number of co-occurring boundary motifs. Boundary strength is higher between active and inactive/PcG TADs while is lower at boundaries separating two TADs within the same state (e.g., active-active, inactive-inactive). Boundaries containing Beaf-32 are stronger when present together with either motif 6, Pita, or ZIPIC motif while weaker with motif 8. Although, the mechanism by which combinations of insulators alter the boundary strength still remains unclear, an association was observed of Nup98 with Pita motif, motif 6, and CTCF, suggesting that association with nuclear pore proteins may result in stronger boundaries. Nup98 has now been shown to be functionally important in mediating enhancer-promoter looping in the Drosophila genome (Ramirez, 2018).
The current results indicate that the two sets of boundary motifs (promoter and non-promoter) participate in the compartmentalization of different types of chromatin. Boundaries containing core promoter motifs are either flanking, or surrounded by active chromatin regions. In contrast, the boundaries containing non-promoter motifs tend to be within or at the borders of inactive or repressed chromatin. This finding is in line with previous reports showing an enrichment of CTCF at the borders of H3K27me3 domains and an enrichment of Beaf-32 in active chromatin. This indicates that insulator proteins might serve different functions guided by the DNA sequence. For example, this study observed that GAF motif, whose presence is negatively associated with TAD boundaries, is rather detected alone at 'loop domains' (Ramirez, 2018).
These analyses indicate that the depletion of the well-studied insulator protein Beaf-32, has no significant effect on the chromosome conformation. However, in Drosophila melanogaster, both the Beaf-32 and DREF proteins bind exactly the same DNA motif. Thus, the current results, as well as others point out that DREF, a protein that unlike Beaf-32 is conserved in humans, might have a more prominent role in genome organization than previously thought (Ramirez, 2018).
On the other hand, cells under M1BP knockdown grow slower in culture and get arrested in M-Phase, probably because M1BP is a transcription factor of constitutively expressed genes. Since M1BP depleted cells show cell cycle defects, it is difficult to separate the direct role of M1BP at boundaries from the indirect effects caused by deregulation of thousands of genes. To study the direct role of M1BP at boundaries, it would be useful to perform either deletion of M1BP motif on boundaries using CRISPR, as shown for CTCF in mammals or through acute and complete depletion of M1BP9 (Ramirez, 2018).
This study presents evidence that the DNA sequence contains features that can guide the formation of higher order chromosome organisation. The association of boundary types with a combination of motifs, and the fact that boundaries can be predicted using DNA sequence alone, in absence of any information about associated protein or histone marks leads the authors to propose a DNA-guided chromatin assembly model. In this model, the boundary elements are recognized by their proteins, which help loading TAD assembly factors onto chromatin. Promoter and non-promoter boundaries can thus have different mechanisms of formation. DNA motifs at inactive regions can attract proteins that may establish TAD domains by setting up barriers for chromatin marks. Although overall barrier activity of insulator proteins have been controversial, it is plausible that the barrier mechanism is active only at a subset of boundaries (like those of inactive TAD domains). DNA motifs at gene promoters can associate with core-promoter proteins which then guide the assembly of Pol-II pre-initiation complex. The pre-initiation complex can then recruit condensins. Once recruited, condensins can perform loop extrusion independent of Pol-II transcriptional activity, leading to emergence of TADs. Condensins can also remain associated to chromatin during mitosis, to re-establish TADs after the cell division. In general, the results indicate that active transcription and chromosome conformation are related. Future studies investigating the association of Pol-II pre-initiation complex and condensin activity on gene promoters would advance understanding of mechanism of TAD formation (Ramirez, 2018).
Eukaryotic chromosomes are spatially segregated into topologically associating domains (TADs). Some TADs are attached to the nuclear lamina (NL) through lamina-associated domains (LADs). This study identified LADs and TADs at two stages of Drosophila spermatogenesis -Â in bamΔ86 mutant testes which is the commonly used model of spermatogonia (SpG) and in larval testes mainly filled with spermatocytes (SpCs). This study found that initiation of SpC-specific transcription correlates with promoters' detachment from the NL and with local spatial insulation of adjacent regions. However, this insulation does not result in the partitioning of inactive TADs into sub-TADs. It was also revealed an increased contact frequency between SpC-specific genes in SpCs implying their de novo gathering into transcription factories. In addition, the specific X chromosome organization was uncovered in the male germline. In SpG and SpCs, a single X chromosome is stronger associated with the NL than autosomes. Nevertheless, active chromatin regions in the X chromosome interact with each other more frequently than in autosomes. Moreover, despite the absence of dosage compensation complex in the male germline, randomly inserted SpG-specific reporter is expressed higher in the X chromosome than in autosomes, thus evidencing that non-canonical dosage compensation operates in SpG (Ilyin, 2022).
The locations of chromatin loops in Drosophila were determined by Hi-C (chemical cross-linking, restriction digestion, ligation, and high-throughput DNA sequencing). Whereas most loop boundaries or 'anchors' are associated with CTCF protein in mammals, loop anchors in Drosophila were found most often in association with the polycomb group (PcG) protein Polycomb (Pc), a subunit of polycomb repressive complex 1 (PRC1). Loops were frequently located within domains of PcG-repressed chromatin. Promoters located at PRC1 loop anchors regulate some of the most important developmental genes and are less likely to be expressed than those not at PRC1 loop anchors. Although DNA looping has most commonly been associated with enhancer-promoter communication, the results indicate that loops are also associated with gene repression (Eagen, 2017).
Active and inactive genes are folded differently and located in different regions of the nucleus, but the molecular basis of chromosome folding and nuclear architecture remains to be determined. One folding paradigm is the formation of protein-mediated DNA loops, which are most commonly associated with enhancer-promoter communication. On the other hand, gene repression is most often associated with heterochromatin formation and chromosome condensation (Eagen, 2017).
Chromosome folding can be revealed by chemical cross-linking, followed by restriction digestion, ligation, and high-throughput DNA sequencing. Such 'Hi-C' analysis has revealed intrachromosomal folding on multiple length scales. From lengths of 1 kb, the smallest so far examined, to hundreds of kilobases, two features are observed, loops and so-called topologically associating domains, or 'TADs' (which have also been referred to as A/B domains, physical domains, topological domains, or contact domains). Loops bring a pair of loci into close physical proximity; TADs represent genomic intervals in which all pairs of loci exhibit an enhanced frequency of contact, and correspond to stably condensed chromosomal regions. On a larger scale, extending to whole chromosomes, TADs interact with one another to form 'compartments' (Eagen, 2017).
Hi-C, on the smallest length scale, based on the most extensive sequencing, is needed for unambiguous identification of chromatin loops and has been reported thus far only for the mouse and human genomes. This limitation has also been overcome Drosophila by Hi-C analysis at subkilobase resolution. An unanticipated correlation was found with results of ChIP-seq analysis in Drosophila, with an important functional correlate (Eagen, 2017).
The locations of loop anchors in Drosophila determined in this study are notable both for correlations with ChIP-seq data and for the lack thereof. The lack of correlation with locations of CTCF protein was unexpected, inasmuch as most loop anchors in mammals are associated with CTCF protein, apparently bound to CTCF sequence motifs in a convergent orientation. There are evidently multiple patterns of protein association with loop anchors in metazoans. The association of loop anchors in Drosophila with Pc protein is noteworthy because it points to a role of looping not only in gene activation, as widely observed in the past, but in gene repression as well (Eagen, 2017).
Regions of PcG-repressed chromatin ('PcG domains') that are separated by hundreds of kilobases to megabases are known to be in enhanced spatial proximity, but details of their internal organization have only been investigated by averaging over many PcG domains. The high resolution of the Hi-C contact maps revealed chromatin loops within individual PcG domains, giving insight into their internal organization. PRC1 is known to compact nucleosome arrays in vitro. Knockdown of the PRC1 subunit, Polyhomeotic (Ph), in vivo decompacts PcG-repressed chromatin, and Ph that is unable to polymerize impairs the ability of PRC1 to form clusters. Together with these findings, the results suggest that PRC1-bound chromatin loops within PcG-repressed domains either establish or maintain a condensed state (Eagen, 2017).
Previous analyses by 3C have pointed to associations of PcG proteins with chromatin loops for the Bithorax complex (BX-C) in S2 cells; for inv and en in BG3 and Sg4 cells; and for an embryonic, pupae, and adult transgenic reporter system. The Hi-C data are, however, at higher resolution and genome-wide. Higher resolution allowed more comprehensive analysis, such as the unambiguous identification of loops and the segmentation of ANT-C into a series of TADs with one or two homeotic gene promoters per TAD. Genome-wide analysis revealed both the pervasive nature of Pc protein association and the absence of significant CTCF protein association, despite conservation of CTCF from Drosophila to human (Eagen, 2017).
A report by Cubeñas-Potts (2017) on Drosophila chromatin loops in Kc cells appeared while this manuscript was in preparation. Cubenas-Potts noted an enrichment of cohesin but a lack of Drosophila CTCF at loop anchors, consistent with the current observations. Cubenas-Potts did not mention Pc, but the current analysis of their data revealed an enrichment of loop anchors at Pc ChIP peaks and an enrichment of Pc ChIP peaks at loop anchors. A likelihood was found of repression of promoters at Pc-bound loop anchors, especially for developmental genes; Cubenas-Potts et al. observed an enrichment of active developmental enhancers at loop anchors, possibly because these are among the many loop anchors not bound by Pc, or because the chromatin at these loop anchors is bivalent, bound by nonhistone proteins and histone posttranslational modifications associated with both gene activation and repression (Eagen, 2017).
The occurrence of PRC1 at loop anchors could reflect a role in loop formation similar to that proposed for CTCF in mammals, wherein cohesion complexes extrude loops, in a process halted upon reaching bound CTCF. Consistent with this model, a large majority (72.8%) of Drosophila loop anchors are bound by the Rad21 subunit of cohesin. Regardless of whether PRC1 performs such a role, additional proteins must be involved, because PRC1 is present at only 26% of Drosophila loop anchors (Eagen, 2017).
In the nucleus of eukaryotic cells, genomic DNA associates with numerous protein complexes and RNAs, forming the chromatin landscape. Through a genome-wide study of chromatin-associated proteins in Drosophila cells, five major chromatin types were identified as a refinement of the traditional binary division into hetero- and euchromatin. These five types were given color names in reference to the Greek word chroma. They are defined by distinct but overlapping combinations of proteins and differ in biological and biochemical properties, including transcriptional activity, replication timing, and histone modifications. This work assesses the evolutionary relationships of chromatin-associated proteins and present an integrated view of the evolution and conservation of the fruit fly Drosophila melanogaster chromatin landscape. Homology prediction across a wide range of species was combined with gene age inference methods to determine the origin of each chromatin-associated protein. This provides insight into the evolution of the different chromatin types. The results indicate that for the euchromatic types, YELLOW and RED, young associated proteins are more specialized than old ones; and for genes found in either chromatin type, intron/exon structure is lineage-specific. Next, evidence is provided that a subset of GREEN-associated proteins is involved in a centromere drive in D. melanogaster. The results on BLUE chromatin support the hypothesis that the emergence of Polycomb Group proteins is linked to eukaryotic multicellularity. In light of these results, how the regulatory complexification of chromatin links to the origins of eukaryotic multicellularity is discussed (Parey, 2019).
Eukaryotic chromosomes are organized in multiple scales, from nucleosomes to chromosome territories. Recently, genome-wide methods identified an intermediate level of chromosome organization, topologically associating domains (TADs), that play key roles in transcriptional regulation. However, these methods cannot directly examine the interplay between transcriptional activation and chromosome architecture while maintaining spatial information. This paper presents multiplexed, sequential imaging approach (Hi-M) that permits simultaneous detection of chromosome organization and transcription in single nuclei. This allowed unveiling of the changes in 3D chromatin organization occurring upon transcriptional activation and homologous chromosome unpairing during awakening of the zygotic genome in intact Drosophila embryos. Excitingly, the ability of Hi-M to explore the multi-scale chromosome architecture with spatial resolution at different stages of development or during the cell cycle will be key to understanding the mechanisms and consequences of the 4D organization of the genome (Cardozo Gizzi, 2019).
How the nuclear lamina (NL) impacts on global chromatin architecture is poorly understood. This study shows that NL disruption in Drosophila S2 cells leads to chromatin compaction and repositioning from the nuclear envelope. This increases the chromatin density in a fraction of topologically-associating domains (TADs) enriched in active chromatin and enhances interactions between active and inactive chromatin. Importantly, upon NL disruption the NL-associated TADs become more acetylated at histone H3 and less compact, while background transcription is derepressed. Two-colour FISH confirms that a TAD becomes less compact following its release from the NL. Finally, polymer simulations show that chromatin binding to the NL can per se compact attached TADs. Collectively, these findings demonstrate a dual function of the NL in shaping the 3D genome. Attachment of TADs to the NL makes them more condensed but decreases the overall chromatin density in the nucleus by stretching interphase chromosomes (Ulianov, 2019).
The nuclear lamina (NL) is a meshwork of lamins and lamin-associated proteins lining the nuclear envelope (NE). Several lines of evidence support the idea that the NL is a platform for the assembly of the repressive compartment in the nucleus. In mammals, nematode and Drosophila, the lamina-associated chromatin domains (LADs) contain mostly silent or weakly expressed genes. Activation of tissue-specific gene transcription during cell differentiation is frequently associated with translocation of loci from the NL to the nuclear interior. The expression level of a reporter gene is ~5-fold lower when it is inserted into LADs compared to inter-LADs. Artificial tethering of weakly expressed reporter genes to the NL results in their downregulation thus indicating that contact with the NL may cause their repression. Accordingly, many transcriptional repressors, including histone deacetylases (HDACs) are linked to the NL. The high throughput chromosome conformation capture (Hi-C) technique has revealed the spatial segregation of open (DNase I-sensitive) and closed (DNase I-resistant) chromatin into two well-defined compartments. Importantly, in mammalian cells, the DNase I-resistant compartment is strongly enriched with NL contacts. Moreover, a whole-genome DNase I-sensitivity assay in Drosophila S2 cells indicated that LADs constitute the densely packed chromatin. Additionally, super-resolution microscopy studies in Kc167 cells show that inactive chromatin domains (including Polycomb (Pc)-enriched regions) are more compact than active ones (Ulianov, 2019).
The newly developed single-cell techniques demonstrate that LADs, operationally determined in a cell population, may be located either at the NL or in the nuclear interior in individual cells. Surprisingly, the positioning of LADs in the nuclear interior barely affects the inactive state of their chromatin. This raises the question as to whether contact with the NL makes the chromatin in LADs compact and inactive. However, few studies directly address this issue. It has been shown that lamin Dm0 knock-down (Lam-KD) in Drosophila S2 cells decreases the compactness of a particular inactive chromatin domain. Accordingly, the accessibility of heterochromatic and promoter regions has been shown to increase upon Lam-KD in Drosophila S2R+ cells. However, the impact of the NL on the maintenance of the overall chromatin architecture remains mostly unexplored (Ulianov, 2019).
This study shows that upon loss of all lamins, the density of peripheral chromatin is decreased in Drosophila S2 cells leading to the slight overall chromatin compaction. At the same time, chromatin in LADs becomes less tightly packed which correlates with the enhancement of initially weak level of histone H3 acetylation and background transcription in these regions (Ulianov, 2019).
Using immunostaining and FISH experiments, Lam-KD in Drosophila S2 cells was shown to lead to a slight reduction in total chromatin volume and, as a result, an increase in chromatin packaging density. However, the stronger compaction of chromatin is not homogeneous and depends on the epigenetic state and scale. Hi-C analysis clearly indicates two opposite trends in chromatin behaviour. The contact frequency in the active chromatin increases over short distances (i.e., within the 'active' TADs and the inter-TADs) and decreases over long distances (i.e., within the A compartment). Whereas in the inactive chromatin it, inversely, decreases over short distances (i.e. within the TADs mostly corresponding to LADs), but increases at the chromosomal scale (Ulianov, 2019).
A model is suggested explaining general chromatin stretching as well as the condensation of inactive chromatin in TADs mediated by the NL. If chromatin mobility is constrained by its tethering to the NL, then the release from this tethering will lead to chromatin shrinkage due to macromolecular crowding and inter-nucleosomal interactions. Therefore counterintuitively, the NL appears not to restrict chromatin expansion but provides an anchoring surface necessary to keep interphase chromosomes slightly stretched. At the same time, inactive chromatin may become additionally condensed due to the deacetylation by HDACs, linked to the NL, and/or mechanically, due to chromatin binding with the NL (Ulianov, 2019).
A recently published study analysed the 3D genome organization upon NL disruption in mouse embryonic stem cells (mESCs). It is interesting to compare the current results from Drosophila with those from mice. Upon loss of all lamins, the general TAD profile is still preserved in both species, however, intra- and inter-TAD interactions are altered. Strikingly, upon loss of all lamins, a fraction of NL-attached TADs becomes less condensed in both species. However, in contrast to Drosophila S2 cells, this is not accompanied by a general detachment of chromatin from the NE in mESCs. Additionally, distant interactions within the inactive chromatin are mostly increased in both species upon lamin loss. Finally, while some genes located at the nuclear periphery and in the nuclear interior have changed their expression both in mESCs and in Drosophila S2 cells, an increase in the background transcription upon lamin loss is detected specifically in Drosophila LADs, and this was not reported for mESCs44. Taken together, these findings indicate that both in mammals and Drosophila the NL not only makes nearby chromatin more compact and repressed, but also affects chromatin interactions and gene expression in the nuclear interior (Ulianov, 2019).
The diversity of mechanisms of chromatin attachment to the NL in Drosophila and mammals may explain the differences in chromatin behaviour in response to the lack of all lamins. For example, it was shown that LBR and PRR14 proteins participate in the tethering of the H3K9-methylated chromatin to the NE in mammals. Whereas in mammalian ESCs LADs are strongly enriched with the H3K9me2/32, in Drosophila Kc167 and, likely, in S2 cells this modification is not present in LADs. Accordingly, the results indicate that LBR is not required to keep chromatin at the nuclear periphery in S2 cells. Therefore, the removal of all lamins may not be sufficient to detach all LADs from the NE in mESCs, but can release LADs in Drosophila S2 cells (Ulianov, 2019).
In conclusion, using different approaches this study revealed that NL disruption in Drosophila S2 cells leads to general chromatin compaction, accompanied by the impaired spatial segregation of total chromatin into active and inactive types, and the decompaction of a fraction of NL-attached TADs linked to partial derepression of their chromatin. Importantly, the observed phenomena may be related to the abnormal expression of genes in lamin-associated diseases (Ulianov, 2019).
The genetic material of eukaryotes is segregated into transcriptionally active euchromatin and silent heterochromatin compartments. The spatial arrangement of chromatin compartments evolves over the course of cellular life in a process that remains poorly understood. The latest nuclear imaging experiments reveal a number of dynamical signatures of chromatin that are reminiscent of active multi-phase liquids. This includes the observations of viscoelastic response, coherent motions, Ostwald ripening, and coalescence of chromatin compartments. There is also growing evidence that liquid-liquid phase separation of protein and nucleic acid components is the underlying mechanism for the dynamical behavior of chromatin. In order to dissect the organizational and dynamical implications of chromatin's liquid behavior, this study has devised a phenomenological field-theoretic model of the nucleus as a multi-phase condensate of liquid chromatin types. Employing the liquid chromatin model of the Drosophila nucleus, an extensive set of simulations with an objective to shed light on the dynamics and chromatin patterning observed in the latest nuclear imaging experiments. Simulations reveal the emergence of experimentally detected mesoscale chromatin channels and spheroidal droplets which arise from the dynamic interplay of chromatin type to type interactions and intermingling of chromosomal territories. In addition, coherent motions of chromatin domains observed in displacement correlation spectroscopy measurements which are explained within the framework of this model was quantitatively reproduced by phase separation of chromatin types operating within constrained intra-chromosomal and inter-chromosomal boundaries. Finally, this study illuminates the role of heterochromatin-lamina interactions in the nuclear organization by showing that these interactions enhance the mobility of euchromatin and indirectly introduce correlated motions of heterochromatin droplets (Laghmach, 2021).
The three-dimensional organization of chromatin contributes to transcriptional control, but information about native chromatin distribution is limited. Imaging chromatin in live Drosophila larvae, with preserved nuclear volume, revealed that active and repressed chromatin separates from the nuclear interior and forms a peripheral layer underneath the nuclear lamina. This is in contrast to the current view that chromatin distributes throughout the nucleus. Furthermore, peripheral chromatin organization was observed in distinct Drosophila tissues, as well as in live human effector T lymphocytes and neutrophils. Lamin A/C up-regulation resulted in chromatin collapse toward the nuclear center and correlated with a significant reduction in the levels of active chromatin. Physical modeling suggests that binding of lamina-associated domains combined with chromatin self-attractive interactions recapitulate the experimental chromatin distribution profiles. Together, these findings reveal a novel mode of mesoscale organization of peripheral chromatin sensitive to lamina composition, which is evolutionary conserved (Amiad-Pavlov, 2021).
Robust chromatin organization within the nucleus has been linked to the control of gene transcription, yet information about the 3D chromatin distribution under physiological conditions in a living organism is limited. This study has revealed the 3D distribution of chromatin in muscle nuclei of live Drosophila larvae at high resolution. This analysis demonstrates a novel mode of chromatin organization at the nuclear mesoscale of fully differentiated cells, wherein a high density of chromatin, including both active and repressed regions, is distributed at the nuclear periphery. Chromatin density decreases sharply as a function of distance to the center of nucleus, giving rise to a chromatin-devoid region at the center of the nucleus. This novel mode of nuclear-scale chromatin architecture was detectable only in unfixed tissue within a live organism and preservation of nuclear volume was critical for that. The peripheral chromatin organization was sensitive to lamin A/C levels and contributed to maintaining adequate levels of the active epigenetic mark H3K9ac. Model simulations show that a peripheral chromatin organization was obtained when lamina-associated domains (LADs) and chromatin attractive interactions dominate the entropic tendency to distribute the chromatin uniformly throughout the nucleus. Peripheral chromatin organization was observed not only in larval muscles but also in other live Drosophila larval tissues, as well as in live human effector T lymphocytes and neutrophils, suggesting an evolutionary conserved mode of chromatin organization at the nuclear periphery (Amiad-Pavlov, 2021).
A key experimental feature that allowed viewing nuclear partitioning was preservation of nuclear volume. Previous global chromatin organization studies were performed primarily on fixed preparation or on live flat cultured cells, which might obscure the native spatial chromatin distribution. Specifically, prevailing fixation methods may cause cell and nuclear volume changes, often at the Z axis, while the dimensions in the imaging X-Y plane are less affected, possibly due to robust nuclear-cytoskeletal interactions in this plane. Furthermore, cells grown on rigid surfaces are relatively spread and flat, with 40% to 50% less water content relative to cells grown on soft matrices. As nuclear and cytoplasmic volumes were shown to be interconnected and adapt to external signals, through tight control of their mutual water content (23, 25, 55), preservation of nuclear volume within the tissue (as in the current experimental system) appears to be critical. This was also deduced from model simulations where nuclear volume was found to be directly linked to chromatin mesoscale organization. Significantly, changes in nuclear volume and shape were shown to have critical functional effects such as altered chromatin organization, gene expression, and DNA synthesis (Amiad-Pavlov, 2021).
Although a few previous studies already reported specific cases of peripheral chromatin distribution, it was considered as a transient stage that correlated with the differentiation state of the cell, for example, in the transition from 8- to 20-cell stages of in vitro fertilized bovine embryos, T cell differentiation, or during myeloid cell differentiation. The reported alternations in chromatin organization were accompanied by volume changes; however, the link between these two parameters was not explored. the current data suggest that chromatin peripheral organization is common to a wide range of differentiated tissues. Furthermore, chromatin to nucleus volume fraction is a major regulator of chromatin mesoscale organization, as indicated by experimental and model results. Drosophila myonuclei are polyploid, where DNA copy number was shown to correlate with nuclear size. This study demonstrate a constant chromatin/nucleus volume ratio (0.31) for a wide range of muscle nuclei volumes suggesting that it is a tightly regulated feature. A constant chromatin volume fraction in polyploid nuclei is not unique to larval myonuclei and was also found in Drosophila embryos as well as in plants and fish (Amiad-Pavlov, 2021).
A recent study challenged the prevailing view that repressed and active chromatin compartments partition radially within the nuclear volume, i.e., repressed chromatin predominates at the nuclear periphery and active chromatin prevails near the nuclear center. In agreement with this report, this study did not observe a clear radial separation between active and repressed chromatin distributions and also detected active chromatin at the nuclear peripheral compartment (Amiad-Pavlov, 2021).
The nuclear lamina controls radial chromatin organization by its dynamic tethering of chromatin to the nuclear periphery through LADs. Because the nucleus is thought to be a mechanosensitive organelle in which the close proximity of the chromatin to the nuclear envelope might sensitize the chromatin to both mechanical inputs and changes in the nuclear lamina composition, a peripheral chromatin organization might couple between mechanical inputs and transcriptional regulation. This is especially relevant for fully differentiated nondividing cells, in which the program of gene transcription has been well established; however, because this analysis did not include undifferentiated cells, it remains unclear whether chromatin peripheral organization also takes place in all cell types. In addition, the global chromatin detachment from the periphery and its condensation toward the center that was observed upon elevated lamin C levels agrees with previous reports on reduced chromatin-lamina tethering following up- or down-regulation of lamin C, as well as with observations in cells with laminopathy-associated mutations. These chromatin structural changes were accompanied by chromatin decondensation and were linked to transcriptional regulation. The current data support a link between 3D chromatin distribution and transcriptional regulation as a significant reduction was observed in the active, H3K9ac levels in the nuclei overexpressing lamin C-GFP. The centrally located, condensed chromatin might prompt reduced availability to enzymes (such as acetyltransferase), leading to a net decrease in acetylated H3K9, predicted to induce reduction in gene transcription. Chromatin regions that are already densely packed (such as repressed chromatin) might be less affected from the collapse of chromatin to the center as observed with the preserved H3K9me3 levels in nuclei overexpressing lamin C-GFP (Amiad-Pavlov, 2021).
The simulations of chromatin distribution dynamics involved only a small number of parameters extracted from the literature, including the percentage of LADs per chromosome, the tendency of chromatin to effectively self-attract and the extent of nuclear confinement. The similarity between the simulation and the experimental results suggests that these are among the most critical factors in determination of the partitioning of chromatin on the nuclear scale. It further suggests that a balance between chromatin-lamina interactions through LADs, entropy, and chromatin self-association may represent the major driving forces for the nuclear partitioning observed experimentally. The physical model suggests that the experimentally observed mesoscale chromatin collapse from the periphery to the nucleus interior upon lamin C OE might be driven by reduced lamina-LAD interactions. It has been shown that both lamin A/C and B types compete with one another on similar LAD-binding sites. Although lamin B contacts with LAD exclusively at the nuclear periphery because its localization is restricted to the nuclear membrane due to its farnesylation, lamin A/C was shown to interact with LADs at the nuclear periphery and at the nucleoplasm. This study demonstrated that in contrast to chromatin collapse following lamin C OE, lamin Dm0 (lamin B homolog) OE did not result in chromatin detachment from the lamina and the peripheral chromatin organization was preserved. This might be due to the nucleoplasmic localization of lamin C or due to distinct association of each of the lamin types with other nuclear membrane components affecting the recruitment and preservation of chromatin-lamina interactions at the periphery (Amiad-Pavlov, 2021).
In summary, this study reveals a novel mode of nuclear mesoscale chromatin organization in fully differentiated cells in which chromatin density is high at the nuclear periphery and undetectable in the nuclear center, creating an effectively central chromatin-devoid region. Simulation of chromatin organization based on LAD binding and chromatin self-attraction recapitulated the experimental observations, supported that preservation of nuclear volume is critical and predicted that changing each of these parameters may disrupt peripheral chromatin density profile. Experimental OE of lamin C disrupted the peripheral chromatin organization, supporting the basic assumptions of the model, and had functional consequence of reduced active epigenetic mark (Amiad-Pavlov, 2021).
Chromatin-based transcriptional silencing is often described as a stochastic process, largely because of the mosaic expression observed in position effect variegation (PEV), where a euchromatic reporter gene is silenced in some cells as a consequence of juxtaposition with heterochromatin. High levels of variation in PEV phenotypes are commonly observed in reporter stocks. To ascertain whether background mutations are the major contributors to this variation, it was asked how much of the variation is determined by genetic variants segregating in the population. Using selective breeding of a fourth chromosome PEV reporter line, 39C-12, two inbred lines were isolated exhibiting contrasting degrees of variegation (A1: low expression, D1: high expression). Within each inbred population, remarkable similarity is observed in the degree of variegation: 90% of the variation between the two inbred lines in the degree of silencing can be explained by genotype. Further analyses suggest that this result reflects the combined effect of multiple independent trans-acting loci. The results indicate that background genetic variants play the major role in determining the variable degrees of PEV commonly observed in laboratory stocks. Interestingly, not only does the degree of variegation become consistent in inbred lines, the patterns of variegation also appear similar. Combining these observations with the spreading model for local heterochromatin formation, an augmented stochastic model is proposed to describe PEV in which the genetic background drives the overall level of silencing, working with the cell lineage-specific regulatory environment to determine the on/off probability at the reporter locus in each cell. This model acknowledges cell type-specific events in the context of broader genetic impacts on heterochromatin formation (Wang, 2019).
The Drosophila genome encodes three BEN-solo proteins including Insensitive (Insv), Elba1 and Elba2 that possess activities in transcriptional repression and chromatin insulation. A fourth protein-Elba3-bridges Elba1 and Elba2 to form an ELBA complex. This paper reports comprehensive investigation of these proteins in Drosophila embryos. Common and distinct binding sites for Insv and ELBA and their genetic interdependencies are assessed. While Elba1 and Elba2 binding generally requires the ELBA complex, Elba3 can associate with chromatin independently of Elba1 and Elba2. It was further demonstrated that ELBA collaborates with other insulators to regulate developmental patterning. Finally, this study found that adjacent gene pairs separated by an ELBA bound sequence become less differentially expressed in ELBA mutants. Transgenic reporters confirm the insulating activity of ELBA- and Insv-bound sites. These findings define ELBA and Insv as general insulator proteins in Drosophila and demonstrate the functional importance of insulators to partition transcription units (Ueberschar, 2019).
Proper gene regulation requires coordinated activities of distinct classes of cis- and trans-regulators. Insulators (or boundary elements) are a special type of cis elements that constrain enhancer-promoter interactions and set chromatin boundaries. Historically, boundary or enhancer-blocking activities of newly identified insulators were mostly tested on a one-on-one basis in transgenic lines or genetically dissected for individual loci. Recent advances in genomics and chromatin structure capture techniques allowed more systematic identification of insulators and also assigned new properties to them in chromatin architecture organization (Ueberschar, 2019).
The activity of insulator elements depends upon their associated factors. The zinc-finger protein CTCF seems to be the only insulator protein conserved between vertebrates and invertebrates. In addition to its established roles as an insulator in chromatin organization, long-range regulatory element looping and enhancer segregating, several of the early studies on mammalian CTCF indicated that it functions in transcriptional repression. More than a dozen proteins have been shown to have insulator function in Drosophila. According to the combinatory co-occupancy patterns of the five insulator proteins CP190, BEAF-32, CTCF, Su(Hw), and Mod(mdg4), Drosophila insulators were divided into two classes. Class I insulators are mainly bound by CP190, BEAF-32, and CTCF in active chromatin regions proximal to promoters, while class II insulators are mostly bound by Su(Hw) located in distal intergenic loci. However, at the functional level, how these factors cooperate remains unclear (Ueberschar, 2019).
The BEN (BANP, E5R, and NAC1) domain is a recently recognized domain present in a variety of metazoan and viral proteins. Several BEN-containing proteins including mammalian BANP/SMAR1, NAC1, BEND3, and the C isoform of Drosophila Mod(mdg4) have chromatin-associated functions and have been linked to transcriptional silencing. The BEN domain has been shown to possesses an intrinsic sequence-specific DNA-binding activity. Mammalian RBB, a BEN and BTB domain protein, binds to and directly represses expression of the HDM2 oncogene through interaction with the nucleosome remodeling and deacetylase (NuRD) complex. Drosophila Insv binds to a palindromic motif, TCCAATTGGA and its variants (TCYAATHRGAA), and represses genes in the nervous system. Two other Drosophila BEN proteins, Elba1 and Elba2, along with the adaptor protein Elba3, are assembled in a heterotrimeric complex (ELBA) and associate with the asymmetric site 'CCAATAAG' in the Fab-7 insulator. elba1 and elba3 are closely linked in the genome and specifically expressed during the mid-blastula transition, which restricts ELBA activity to this early developmental window. Interestingly, the genes encoding Insv and Elba2 are also arranged next to each other in the genome, even though their gene products show different tissue specificity in later developmental stages (Ueberschar, 2019).
Most of the BEN-domain proteins contain other characterized motifs. However, Insv, Elba1, Elba2, and several mammalian homologs, such as BEND5 and BEND6, harbor only one BEN domain and lack other known functional domains. Thus, this sub-class is referred to as BEN-solo factors. Previous work demonstrated that Insv and ELBA BEN-solo factors share common properties, e.g., binding to the palindromic sites as homodimers and repressing reporter genes in cultured cells, but also display distinct activities, e.g., Insv being the only one that interacts with Notch signaling and its inability to bind to the asymmetric site. Interestingly, the Fab-7 insulator requires ELBA for its early boundary activity, but also needs Insv in later development (Ueberschar, 2019).
It remains to be determined how the ELBA factors regulate gene expression and embryogenesis. This study has comprehensively characterized the three Drosophila BEN-solo factors and the adapter protein Elba3, by analyzing DNA-binding preferences (symmetric versus asymmetric), chromatin binding inter-dependence (homodimers versus heterotrimeric complex) and mechanisms in gene regulation (repressor versus insulator). ChIP-seq (chromatin immunoprecipitation followed by deep sequencing) analyses show that ELBA and Insv bind many common and distinct genomic regions. Unexpectedly, Elba3 associates with chromatin even in the absence of its DNA-binding partners Elba1 and Elba2. ChIP-nexus (chromatin immunoprecipitation experiments with nucleotide resolution through exonuclease, unique barcode, and single ligation) assay distinguishes asymmetric heterotrimeric binding pattern of Elba1 and Elba2 from symmetric homodimer pattern of Insv. Although all four factors repress transcription, only the ELBA factors genetically interact with GAF and CP190 and are required for embryonic patterning. Finally, this study shows that adjacent genes separated by ELBA binding are less differentially expressed in the ELBA mutants. Insv-associated adjacent genes do not show such a global effect, despite individual loci relying on Insv insulation. And ELBA- and Insv-bound elements block enhancer-promoter interaction in transgenic reporters. Collectively, these findings indicate a role of ELBA and Insv as general insulators in partitioning transcription units in Drosophila (Ueberschar, 2019).
The BEN domains of Elba1, Elba2, and Insv share similar amino acid sequences and identical protein-DNA interaction sites. However, their DNA-binding activities seem to be complex. When expressed in cultured cells, all of these factors display high affinity to the palindromic site, while only the ELBA complex is able to bind the asymmetric site. In vitro translated proteins of Elba1 and Elba2 can bind to both types of motifs when additional bridging factor is present. ChIP-seq analyses confirm that in vivo Elba1 and Elba2 target the genome only through forming a heterotrimeric complex with Elba3. This suggests that the affinity of Elba1 and Elba2 binding to DNA is weak and needs to be enhanced by additional factors. The ChIP-seq analyses also demonstrate that Elba3 shows broader binding to the genome and is able to target many genomic loci in the absence of Elba1 and Elba2. The Elba3 protein does not have any known functional motif and not even a predictable DNA-binding domain. One potential factor that can bring Elba3 to chromatin is Insv. Indeed, the Elba3 peaks that are independent of Elba1/2 overlap more with Insv peaks. However, Insv is unlikely the only co-factor, as many of the Elba1/2-independent peaks do not overlap with Insv sites. Other insulator proteins with DNA-binding property, such as CP190 and GAF, are candidates that can bring Elba3 to the genome given that these factors co-occupy many genomic loci and display genetic interaction (Ueberschar, 2019).
This study used a high-resolution ChIP-nexus approach and confirmed that Insv and the ELBA factors associate with both types of DNA motifs. The ChIP-nexus analyses also provided evidence that Elba1 and Elba2 in the ELBA complex bind to DNA in an asymmetric configuration. Intriguingly, at some of the loci, such as the asymmetric sites in Fab-7 and Parp, Elba1 and Elba2 show '+' versus '-' strand preference. The genomic loci with asymmetric binding are expected to represent weak association of ELBA with DNA, as strong DNA binding would allow equal pull down of the subunits with the antibody against any of the three components. There are many other loci showing symmetric read distribution for Elba1 and Elba2. These sites either mediate strong binding of the complex or symmetric binding of Elba1 and Elba2 (e.g., as homodimers). Insv binding is always symmetric, suggesting that it binds to the sites as homodimers. This evidence shows ChIP-nexus can be a powerful tool to resolve binding symmetry by a heterotrimeric complex. It will be of interest to understand how the BEN domains have evolved in DNA-binding affinity and sequence specificity across species. Previous and current work well exemplifies the approach to determine the molecular properties of a less studied DNA-binding protein family (Ueberschar, 2019).
The activity of ELBA in the early embryo was examined by RNAi knockdown experiments, where it was shown to influence early boundary activity of the HS1 element. However, the effect of complete loss of ELBA in embryonic development has not been investigated. Loss-of-function mutants for the ELBA genes were generated, and the genes were found to be dispensable for viability. This is not surprising as other chromatin insulator proteins, such as dCTCF35 and BEAF-3236, are not required for viability. One possibility is that Drosophila utilizes multiple backup mechanisms to ensure boundary fidelity. Indeed, when one copy of CP190 or GAF is removed, loss of ELBA led to drastic developmental consequences in this sensitized background. Despite both ELBA and Insv associating with known insulator proteins, such as CP190, BEAF-32, and GAF, ELBA showed strong genetic interactions with CP190 and GAF in viability and early embryonic patterning, while insv did not. It is possible that Insv is less needed during the embryonic stage and/or that another unknown factor compensates for its joint function with CP190 and GAF. In support of the first possibility, insv is required for maintaining segmentation of adult flies when the GAF sites are mutated from Fab-725 (Ueberschar, 2019).
Genes in the Drosophila genome are more compact than in vertebrates. There is a need to partition dense transcription units to ensure enhancer specificity. Thanks to many years of genetic studies in Drosophila, a list of individual genomic loci were identified that separate enhancers or promoters. Insulator proteins such as GAF, CTCF, CP190, and BEAF-32 were found to mediate these activities. It was shown that BEAF-32 binds to sequences in between closely apposed genes with a head-to-head configuration (divergent). This study showed that the neighboring genes that are differentially expressed in wild type become more equally expressed in ELBA mutant embryos. In this case, all three types of promoter configurations, divergent, tandem, and convergent, have similar requirement for ELBA-dependent insulation. Together with the evidence that genomic elements bound by ELBA and Insv are sufficient to block enhancer-promoter interactions in transgene assays, the results suggest ELBA is required to ensure the autonomous regulation of linked transcription units (Ueberschar, 2019).
New properties have been assigned to insulators recently, especially in chromatin organization and long-range cis-element interactions. This work focused on the functions of ELBA and Insv in active chromatin regions because of their enrichment in close proximity to active promoters. However, enrichment of ELBA and Insv was detected in several known elements that could mediate long-range interactions, such as the homie-nhomie and scs and scs' loci. Future studies will be needed to determine the roles of ELBA and Insv in chromatin organization (Ueberschar, 2019).
Chromatin 3D structure plays a crucial role in regulation of gene activity. Previous studies have envisioned spatial contact formations between chromatin domains with different epigenetic properties, protein compositions and transcription activity. This leaves specific DNA sequences that affect chromosome interactions. The Drosophila melanogaster polytene chromosomes are involved in non-allelic ectopic pairing. The mutant strain agnts3, a Drosophila model for Williams-Beuren syndrome, has an increased frequency of ectopic contacts (FEC) compared to the wild-type strain Canton-S (CS). Ectopic pairing can be mediated by some specific DNA sequences. Using Homology Segment Analysis software, this study estimated the correlation between FEC and frequency of short matching DNA fragments (FMF) for all sections of the X chromosome of Drosophila CS and agnts3 strains. With fragment lengths of 50 nucleotides (nt), CS showed a specific FEC-FMF correlation for 20% of the sections involved in ectopic contacts. The correlation was unspecific in agnts3, which may indicate the alternative epigenetic mechanisms affecting FEC in the mutant strain. Most of the fragments that specifically contributed to FMF were related to 1.688 or 372-bp middle repeats. Thus, middle repetitive DNA may serve as an organizer of ectopic pairing (Zhuravlev, 2021).
Chromosomes are organized into high-frequency chromatin interaction domains called topologically associating domains (TADs), which are separated from each other by domain boundaries. The molecular mechanisms responsible for TAD formation are not yet fully understood. In Drosophila, it has been proposed that transcription is fundamental for TAD organization while the participation of genetic sequences bound by architectural proteins (APs) remains controversial. This study investigated the contribution of domain boundaries to TAD organization and the regulation of gene expression at the Notch gene locus in Drosophila. Deletion of domain boundaries was found to result in TAD fusion and long-range topological defects that are accompanied by loss of APs and RNA Pol II chromatin binding as well as defects in transcription. Together, these results provide compelling evidence of the contribution of discrete genetic sequences bound by APs and RNA Pol II in the partition of the genome into TADs and in the regulation of gene expression in Drosophila (Arzate-Mejia, 2020).
This study provides evidence that discrete genetic sequences occupied by APs and RNA Pol II are potent chromatin insulators that actively partition the genome into Topological Domains. Furthermore, partial disruption or complete removal of the domain boundaries alter genome topology, transcription, and the recruitment of APs and RNA Pol II (Arzate-Mejia, 2020).
Whether domain boundaries are autonomous discrete genetic elements mediating the formation of TADs is a subject of intense debate. The collection of CRISPR-Cas9 mediated deletions of domain boundaries at the Notch locus provide evidence on the existence of autonomous genetic elements bound by APs and Pol II that act as chromatin insulators essential for TAD formation (Arzate-Mejia, 2020).
A 300-bp sequence comprising the entire intergenic region between kirre and Notch is a modular chromatin insulator constituting a domain boundary. Non-overlapping portions of the intergenic region, with binding sites for specific APs and RNA Pol II, act as discrete modules that restrain interactions of the kirre and the Notch genes, with the removal of all modules necessary for TAD fusion. The topological effects observed upon boundary deletion are remarkably consistent with cytological data from the Notch mutant facet-strawberry (faswb) where deletion of a ~0.9-kb region spanning the 5' region of Notch results in loss of an interband and fusion of the 3C7 band containing Notch with the upstream band. Also, reporter assays in transgenic flies and cytological evidence support an autonomous role for the 5' intergenic region of Notch as a chromatin insulator as the ectopic insertion of this sequence is sufficient and necessary to split a band into two, forming an interband in polytene chromosomes. In the case of the intragenic enhancer boundary, deletion of a ~2-kb region results in a dramatic increase of ectopic interactions between Notch domains and loss of a ~1-Mb domain downstream of Notch. Therefore, evidence from cytological studies in the fly and the in nucleus Hi-C data from CRISPR mutants conclusively demonstrates that domain boundaries are essential for TAD formation (Arzate-Mejia, 2020).
Recent reports have suggested a prominent role for transcription as the main driver for domain organization in Drosophila. Also, a role for RNA Pol II in mediating domain formation has been recently proposed. The data provide important observations that support a role for RNA Pol II in boundary activity and therefore in TAD formation in Drosophila. First, re-analysis of public Hi-C data from early stages of Drosophila embryogenesis suggests that the 5' boundary of Notch is established before Zygotic Genome Activation (nuclear cycle 13), and therefore, before transcription at the locus. The appearance of TAD boundaries at Notch strongly correlates with the early acquisition of chromatin accessibility (nuclear cycle 11) and with the binding of proteins like RNA Pol II (nuclear cycle 12), the general transcription factor TBP, and the pioneering factor Zelda. Second, transcription inhibition early in development results in a decrease in intra-domain interactions within the Notch locus. However, the boundaries and the domains at Notch are still detected, which correlates with the retention of RNA Pol II at domain boundaries suggesting that RNA Pol II is key for TAD formation. In support of this, it was observed that deletion of the Promoter Proximal Region of Notch (5pN-Δ102) resulted in a major decrease in transcription within the D1 domain (>80%) but just in discrete topological changes mainly detected as a reduction in intra-domain interactions for the D1 domain, consistent with the topological effects observed at the locus upon transcription inhibition. Importantly, loss of the Promoter Proximal Region of Notch resulted in the reduction but not loss of RNA Pol II binding, which implies that the remaining RNA Pol II could be sufficient to sustain boundary activity. In support of this, the fusion of the D1 domain of Notch with the upstream TAD correlates with complete loss of RNA Pol II at the 5' end of Notch. A similar trend was observed when removing the intronic boundary of Notch, with the fusion of Notch domains strongly correlating with loss of RNA Pol II binding at the region adjacent to the deleted boundary and in exon 6. Furthermore, the formation was observed of a new TAD spanning the full Notch locus despite a significant loss of transcription along the gene. Therefore, although transcription plays a role in mediating intra-domain interactions, the data suggest that discrete, accessible genomic sequences occupied by RNA Pol II, could have a major role in shaping Drosophila genome organization independent of transcription (Arzate-Mejia, 2020).
Architectural Proteins in Drosophila can mediate long-range interactions, however their role in shaping TADs has remained elusive. The data suggest a role of architectural protein binding sites (APBSs) in boundary activity in part throughout RNA Pol II recruitment. For example, non-overlapping regions of the 5' boundary have a differential effect on RNA Pol II recruitment, which correlates with the presence of different APBSs. In particular, it was observed that deletion of a ~200-bp region containing just a CTCF motif (5pN-Δ183) have a stronger effect in RNA Pol II recruitment than deletion of the Promoter Proximal Region (5pN-Δ102) which contains a binding site for M1BP.Then, in this case the CTCF DNA-binding motif seems important to either directly or indirectly recruit RNA Pol II. In support of this, mutation of the CTCF motif results in loss of binding of nuclear proteins. It esd also observed that loss of both CTCF and M1BP binding sites in the 5pN-Δ343 mutant correlates with the maximal decrease of CTCF and RNA Pol II occupancy, complete loss of insulation and TAD fusion, implying that domain boundaries can be resilient to the loss of RNA Pol II binding through the presence of multiple APBSs. In support of a role for DNA-binding APs in boundary activity through RNA Pol II recruitment, it has been reported that depletion of the pioneering factor Zelda results in loss of RNA Pol II recruitment, deficient local insulation and fusion of adjacent TADs (Arzate-Mejia, 2020).
TAD boundaries can block unspecific regulatory communication, however, their role in gene regulation has been recently subject to intense debate. The data support that TADs have an important function in gene regulation in Drosophila. It was found that deletions spanning the 5' boundary of Notch, consistently results in loss of transcription within the D1 domain likely as a combination of reduced RNA Pol II occupancy at the 5' end of the gene, loss of insulation between adjacent TADs and gain of ectopic interactions. Deletion of the B2 boundary also results in a reduction in Notch transcription. Interestingly, boundary disruption leads to loss of RNA Pol II binding at exon 6 and at the 5' region of Notch, suggesting that it influences RNA Pol II recruitment to the Notch, locus probably by direct physical interaction. Then, in this case, reduction in transcription could be a consequence of disrupting physical interactions between regulatory elements that affect RNA Pol II recruitment, rather than the consequence of insulation loss. Furthermore, loss of the mega-domain due to deletion of the B2 boundary affects gene regulation of the genes located within the domain. Therefore, the evidence shows that disruption of TAD organization by alteration of boundaries impacts gene expression (Arzate-Mejia, 2020).
Finally, an important observation from these experiments is that deletion of TAD boundaries and accompanying changes in gene transcription as well as changes in the recruitment of CTCF and RNA Pol II at domain boundaries, do not abolish the intra-TAD specific organization of Notch since subdomains are preserved despite TAD fusion. These suggest that additional mechanisms contribute to folding the genome into smaller domains, possibly by aggregation of regions with similar chromatin features (Arzate-Mejia, 2020).
In conclusion, the data demonstrate the existence of discrete genetic sequences with boundary activity that influence genome organization into Topological Associated Domains and the regulation of gene expression. Other domain boundaries with a similar chromatin composition and APs occupancy could behave similarly (Arzate-Mejia, 2020).
Based on the results, a mechanism is proposed for boundary formation through the binding of APs that results in recruitment of RNA Pol II. In such a model, a boundary is robust to APs depletion as far as RNA Pol II binding is maintained. Finally, it is envisioned that genome organization in Drosophila is dependent on two mechanisms: one driven by self-association of regions with similar transcriptional or epigenetic profiles and one that partitions the genome into interaction domains driven by genetic elements acting as chromatin insulators (Arzate-Mejia, 2020).
Drosophila Myb (Dm-Myb) encodes a protein that plays a key role in regulation of mitotic phase genes. This study further refine its role in the context of a developing tissue as a potentiator of gene expression required for proper RNA polymerase II (RNA Pol II) function and efficient H3K4 methylation at promoters. In contrast to its role in gene activation, Myb is also required for repression of many genes, although no specific mechanism for this role has been proposed. This study now reveals a critical role for Myb in contributing to insulator function, in part by promoting binding of insulator proteins BEAF-32 and CP190 and stabilizing H3K27me3 Polycomb-group (PcG) domains. In the absence of Myb, H3K27me3 is markedly reduced throughout the genome, leading to H3K4me3 spreading and gene derepression. Finally, Myb is enriched at boundaries that demarcate chromatin environments, including chromatin loop anchors. These results reveal functions of Myb that extend beyond transcriptional regulation (Santana, 2020).
This study identified four novel roles for the Drosophila Myb oncoprotein. First, it was shown that Myb acts as a potentiator of target gene transcription, not merely as an epigenetic factor that maintains activated expression. Potentiation is associated with increased H3K4me3 as well as recruitment and pause release/elongation of RNA Pol II. Next, it was shown that in addition to binding promoter potentiator sites, Myb binds to insulator sites genome wide, is necessary to promote the binding of BEAF-32 and CP190 at Myb-shared sites, and is required for the function of at least a subset of insulators. Third, it was shown that Myb plays a critical role in formation and/or stabilization of H3K27me3 domains, preventing silent genes within these domains from becoming derepressed, in stark contrast to Myb's well-known role as an activator of gene expression. Finally, this study showed that Myb is enriched at TAD boundaries and chromatin loop anchors (Santana, 2020).
Similar to what has previously been reported, an enrichment was identified of cell cycle genes directly regulated by Myb, with the majority being G2/M but also several S phase genes (consistent with studies on B-Myb, supporting its role in S phase). Notably, a recent study reported that Cyclin A (the cyclin that regulates aspects of both S phase and G2/M) directly binds to Myb and is required for expression of many of its targets (Rotelli, 2019). Further supporting a role for Myb in transcriptional regulation, this study found that Myb is required for optimal levels of H3K4 promoter methylation, with Myb playing a key role in RNA Pol II dynamics. Analysis of the promoter regions of Myb-potentiated targets in Myb mutants revealed two distinct classes of targets: one that requires Myb for efficient recruitment of RNA Pol II and both that use Myb for efficient RNA Pol II pause release/elongation. Myb may use NURF to establish a nucleosomal-free region that allows recruitment of RNA Pol II, similar to what has been observed with GAF at several targets. Further suggesting a critical role for Myb in transcriptional regulation, the failure to potentiate key targets in Myb mutants (e.g., okr and Rad9 for S phase; Cap-D2 for chromosome condensation; mad2, Mps1, Pen, dgt6, and msd5 for M phase) likely explains the varied phenotypes observed in Myb mutants (including increased mitotic index, partially condensed chromosomes, aneuploidy, and S phase defects) (Santana, 2020).
Recent studies have shown that several components of dREAM can bind insulator sites, and RNAi depletion of Mip40, Mip130, and E2f2 resulted in impairment of enhancer-blocking function at specific sites. Furthermore, components of dREAM (Myb, Mip120, Mip130, Rbf1, E2f2, and DP) were shown to directly interact with dCTCF and/or CP190. Interestingly, double knockdown of dCTCF and CP190 resulted in loss of dREAM (Mip40, Mip120, Mip130, and E2f2) at some shared sites. The results presented in this study further extend these studies and demonstrate that Myb directly contributes to boundary insulator function (including sites located at divergently paired genes). Intriguingly, in many cases where one gene of a divergently paired genes (DPGs) is transcribed and the other is silent, it was found that Myb has no role in expression of the activated gene; rather, Myb prevents inappropriate activation of the silent gene. This cannot be explained by loss of the primary repressive arm of the Myb complex (E2f2/Rbfs) because knockdown of E2f2 or Rbf1/2 in Kc167 cells led to derepression of only 17 out of the 96 genes upregulated when Myb is knocked down. Furthermore, out of the 46 genes upregulated in SL2 cells upon knockdown of E2f2 or Rbf1/2, only 3 are upregulated in Myb mutant wing discs. Finally, the absence of E2f2 in 3rd instar larvae led to upregulation of genes from four DPGs; yet, these genes are not upregulated in Myb mutant wing discs. Of the insulator sites shown in this study to require Myb, the vast majority (89%) contain Myb but not E2f2 consensus sites, similar to what was observed in Kc167 cells for genes requiring Myb for repression (Santana, 2020).
Consistent with Myb playing a role in insulators that separate different chromatin neighborhoods, this study found a striking correlation between Myb occupancy and active, null, PcG H3K27me3 (all 99%), and APBS TAD boundaries (>93%). TAD boundaries have been associated with chromatin looping, and studies on CTCF and CP190 in vertebrates and invertebrates, respectively, have proposed or confirmed a role for these insulator factors in tethering the chromatin loops. Further work has shown that chromatin loop anchors can also exist within TADs. It is particularly noteworthy that Myb occupies the majority of chromatin loop anchor points identified in another study Eagen (2017), in addition to significant overlaps with several other chromatin factors (including dCTCF, Rad21, and Pc) at anchor points. Further work is needed to determine whether Myb plays a role in tethering chromatin loops, although it is intriguing to speculate that the loss of the H3K27me3 mark in Myb or dCTCF mutants could result from disrupting H3K27me3 loops, which, in turn, might diminish that region's ability to associate with an H3K27me3 nuclear subcompartment. The observation that dCTCF has been shown to directly interact with Myb further suggests that these two chromatin factors can collaborate, and a remarkable 73% of the anchors bound by dCTCF are also bound by Myb (similar to the Rad21-bound anchors, for which 75% are also occupied by Myb). It is postulated that B-Myb (the most closely related vertebrate Myb family member to Dm-Myb) and CTCF might be playing similar collaborative roles in humans (Santana, 2020).
Although a role for Myb in transcriptional activation has been recognized for some time, its role in insulator function is particularly intriguing, as this activity appears to be separate from its transcriptional role. Indeed, the chromatin changes observed at many DPGs and other insulator elements strongly support the direct role of Myb in promoting a barrier function between chromatin states (in part by promoting binding of BEAF-32/ CP190). However, unlike previously described examples of barrier insulator loss, which results in heterochromatin spreading into euchromatin, the opposite scenario were observed in Myb mutants, namely the spreading of H3K4me3, which may be enabled by the significant loss of the H3K27me3 mark. This is the very definition of barrier insulator function, namely, maintaining a discrete separation of chromatin environments. Whether Myb is playing a role in chromatin loop formation is unclear at this point, but its overlap with the majority of loop anchor points identified in at least one study in flies is intriguing. Further work will be needed to address Myb's role at the loop anchors and whether this role relates to stabilization of H3K27me3 domains and/or barrier insulator function. For example, directed mutagenesis of Myb-dependent insulator sites by using tools, such as CRISPR-Cas9, to specifically target Myb, dCTCF, or BEAF-32 sites can provide additional insight into the respective roles of these insulator factors. In addition, assay for transposase-accessible chromatin using sequencing (ATAC-seq) or chromosome conformation capture (3C)-type techniques performed in both controls and Myb mutants can be used to determine whether important chromatin/ 3D structure changes might underlie the altered transcriptional (Santana, 2020).
Heterochromatin is a key architectural feature of eukaryotic genomes crucial for silencing of repetitive elements. During Drosophila embryonic cellularization, heterochromatin rapidly appears over repetitive sequences, but the molecular details of how heterochromatin is established are poorly understood. This study mapped the genome-wide distribution of H3K9me3-dependent heterochromatin in individual embryos of Drosophila miranda at precisely staged developmental time points. Canonical H3K9me3 enrichment was found to be established prior to cellularization and matures into stable and broad heterochromatin domains through development. Intriguingly, initial nucleation sites of H3K9me3 enrichment appear as early as embryonic stage 3 over transposable elements (TEs) and progressively broaden, consistent with spreading to neighboring nucleosomes. The earliest nucleation sites are limited to specific regions of a small number of recently active retrotransposon families and often appear over promoter and 5' regions of LTR retrotransposons, while late nucleation sites develop broadly across the entirety of most TEs. Interestingly, early nucleating TEs are strongly associated with abundant maternal piRNAs and show early zygotic transcription. These results support a model of piRNA-associated co-transcriptional silencing while also suggesting additional mechanisms for site-restricted H3K9me3 nucleation at TEs in pre-cellular Drosophila embryos (Wei, 2021).
One of the drawbacks of Hi-C (high-throughput chromosomes conformation capture) analysis and interpretation is the presence of systematic biases, such as different accessibility to enzymes, amplification, and mappability of DNA regions, which all result in different visibility of the regions. Iterative correction (IC) is one of the most popular techniques developed for the elimination of these systematic biases. IC is based on the assumption that all chromatin regions have an equal number of observed contacts in Hi-C. In other words, the IC procedure is equalizing the experimental visibility approximated by the cumulative contact frequency (CCF) for all genomic regions. However, the differences in experimental visibility might be explained by biological factors such as chromatin openness, which is characteristic of distinct chromatin states. This study shows that CCF is positively correlated with active transcription. It is associated with compartment organization, since compartment A demonstrates higher CCF and gene expression levels than compartment B. Notably, this observation holds for a wide range of species, including human, mouse, and Drosophila. Moreover, the CCF state for syntenic blocks were tracked between human and mouse; it is concluded that active state assessed by CCF is an intrinsic property of the DNA region, which is independent of local genomic and epigenomic context. These findings establish a missing link between Hi-C normalization procedures removing CCF from the data and poorly investigated and possibly relevant biological factors contributing to CCF (Samborskaia, 2020).
Mammalian and Drosophila genomes are partitioned into topologically associating domains (TADs). Although this partitioning has been reported to be functionally relevant, it is unclear whether TADs represent true physical units located at the same genomic positions in each cell nucleus or emerge as an average of numerous alternative chromatin folding patterns in a cell population. This study used a single-nucleus Hi-C technique to construct high-resolution Hi-C maps in individual Drosophila genomes. These maps demonstrate chromatin compartmentalization at the megabase scale and partitioning of the genome into non-hierarchical TADs at the scale of 100 kb, which closely resembles the TAD profile in the bulk in situ Hi-C data. Over 40% of TAD boundaries are conserved between individual nuclei and possess a high level of active epigenetic marks. Polymer simulations demonstrate that chromatin folding is best described by the random walk model within TADs and is most suitably approximated by a crumpled globule build of Gaussian blobs at longer distances. Prominent cell-to-cell variability was observed in the long-range contacts between either active genome loci or between Polycomb-bound regions, suggesting an important contribution of stochastic processes to the formation of the Drosophila 3D genome (Ulianov, 2021).
Homologous chromosomes colocalize to regulate gene expression in processes including genomic imprinting, X-inactivation, and transvection. In Drosophila, homologous chromosomes pair throughout development, promoting transvection. The 'button' model of pairing proposes that specific regions along chromosomes pair with high affinity. This study identified buttons interspersed across the fly genome that pair with their homologous sequences, even when relocated to multiple positions in the genome. A majority of transgenes that span a full topologically associating domain (TAD) function as buttons, but not all buttons contain TADs. Additionally, buttons are enriched for insulator protein clusters. Fragments of buttons do not pair, suggesting that combinations of elements within a button are required for pairing. Pairing is necessary but not sufficient for transvection. Additionally, pairing and transvection are stronger in some cell types than in others, suggesting that pairing strength regulates transvection efficiency between cell types. Thus, buttons pair homologous chromosomes to facilitate cell-type-specific interchromosomal gene regulation (Viets, 2019).
Chromosomes are organized in a complex manner in the nucleus. In higher organisms, they localize to distinct territories. Regions of chromosomes interact to form compartments, which are segregated based on gene expression states. Chromosomes are further organized into topologically associating domains (TADs), regions of self-association that are hypothesized to isolate genes into regulatory domains and ensure their activation by the correct cis-regulatory elements. TADs vary in size from ~100 kb in Drosophila melanogaster to ~1 Mb in mammals. Disruptions of nuclear organization, such as alteration of TAD structure and localization of genes to incorrect nuclear compartments, can have major effects on gene expression at some loci in cis. However, it is unclear how elements interact between chromosomes to organize chromatin and regulate gene expression in trans (Viets, 2019).
One key aspect of nuclear architecture involves the tight colocalization, or 'pairing,' of homologous chromosomes to facilitate regulatory interactions between different alleles of the same gene. In Drosophila, homologous chromosomes are paired in somatic cells. This stable pairing provides an excellent paradigm to study the mechanisms driving interactions between chromosomes (Viets, 2019).
It is poorly understood how homologous chromosomes come into close physical proximity. Classical studies proposed a 'zipper' model, in which all regions of the genome have an equal ability to pair based on sequence homology, and chromosome pairing begins at the centromere and proceeds distally to the telomeres. Studies of pairing initiation during development led to a shift toward the 'button' model, which proposes that regions of high pairing affinity are interspersed along chromosome arms and come together through a random walk (Viets, 2019).
The nature of the high-affinity 'buttons' that bring homologous chromosomes together is unclear. Many elements, including insulators, Polycomb response elements (PREs), and heterochromatic repeats, drive looping interactions in cis along the same chromosome arm or clustering between non-homologous sequences on different chromosomes. However, only a handful of small DNA elements, including the gypsy retrotransposon and the Fab-7, Mcp, and TMR regions of the Abd-B locus, are known to drive pairing between homologous sequences on different chromosomes. As three of these elements are within the same locus, the sequence and structural features that contribute to genome-wide pairing are unknown. The scarcity of small DNA elements that are known to drive pairing suggests that combinations of elements and/or higher order chromatin structures are required to button homologous chromosomes together (Viets, 2019).
Pairing facilitates a gene-regulatory mechanism known as transvection, in which two different mutant alleles interact between chromosomes to rescue gene expression. Transvection has been described for a number of Drosophila genes. With the exceptions of Abd-B and certain transgenes containing the Homie, gypsy, Mcp, TMR, and Fab-7 sequences, transvection requires homologous chromosome pairing and is disrupted by chromosome rearrangements. DNA elements such as insulators and PREs contribute to transvection and similar phenomena at many loci, but it is unclear if the same DNA elements are always involved in both pairing and transvection or if pairing and transvection are mechanistically separable (Viets, 2019).
Homologous chromosome pairing occurs more strongly in some cell types than in others. Pairing occurs in 15%-30% of nuclei in the early embryo, gradually increases throughout embryonic development, and reaches a peak of 90%-95% by the third instar larval stage. Similarly, transvection efficiency varies widely among cell types. However, a direct link between the level of pairing and the strength of transvection in a given cell type has not been established (Viets, 2019).
This study has developed a method to screen for DNA elements that pair, and has identified multiple button sites across the Drosophila genome, allowing examination of features that determine button activity. Hi-C was conducted on Drosophila larval eye discs and TADs were identified. ∼80% of transgenes encompassing an entire TAD function as buttons, but not all buttons contain TADs. Additionally, it was found that buttons are enriched for insulator protein clusters. Moreover, buttons drive pairing with their endogenous loci when placed at different positions in the genome. By testing mutant alleles and transgenes of the spineless gene, this study shows that pairing and transvection are mechanistically separable and cell-type-specific (Viets, 2019).
The mechanisms driving chromosome pairing and transvection have remained a mystery of fly genetics since their initial discoveries. Through development of a FISH assay to test regions of the genome for pairing ability, this study provides support for the button model of pairing initiation and offers evidence of general features, TADs and clusters of insulators, which contribute to pairing. Consistent with these findings, recent work examining steady-state interactions between paired chromosomes indicates that tightly associated regions are enriched for insulator proteins. Additionally, homologous TADs have been observed to interact between paired chromosomes using super-resolution microscopy. Moreover, homologous chromosome pairing initiates at the same embryonic stage as TAD formation. However, this relationship is complicated, as it is also the stage when zygotic genome activation occurs (Viets, 2019).
This work provides an important complement to other recent work examining steady-state interactions between homologous chromosomes after pairing has already been established. Whereas these studies allow genome-wide detection of tightly paired regions, the assay provides a direct functional examination of button regions for pairing ability, enabling testing of specific hypotheses about the factors that contribute to pairing. The results reveal a link between the presence of TADs and the ability to pair (Viets, 2019).
This work is also relevant to the study of nuclear architecture in general. The role of TADs in genome function remains unclear. While it is generally thought that TADs isolate genes into regulatory domains to ensure their activation by the correct cis-regulatory elements, only a few studies have identified a role for TADs in maintaining proper gene activity. Studies that dissolve or rearrange TADs find relatively minor effects on gene expression, suggesting that TADs may play additional roles beyond the maintenance of transcriptional activity. This work identifies one such additional role for these poorly understood chromatin structures (Viets, 2019).
The data suggest that intact buttons comprising complete TADs and combinations of insulators are sufficient to drive pairing and that smaller fragments of buttons do not have strong pairing activity. A model is proposed in which complex networks of insulator elements within TADs bring homologous chromosomes together. Pairing may initiate at insulator-enriched TAD boundaries, bringing the interiors of TADs into close physical proximity as a result. Alternatively, each TAD may bind a unique 'code' of insulator proteins, allowing it to find its homologous partner. Unique insulator codes might also facilitate the formation of button-specific chromatin conformations, allowing each button to create its own microcompartment and pair with its homolog. Therefore, while TADs act as regions of association in cis, the results suggest that TADs may also act as regions of self-association in trans (Viets, 2019).
Furthermore, pairing was found to be necessary but not sufficient for transvection, and distinct elements are required for these processes: TADs and clusters of insulators play a role in pairing, while a single insulator element facilitates transvection to the endogenous spineless locus. Canonical studies of transvection indicate that disruptions of transvection are caused by disruptions in pairing. The data clearly indicate that transvection can also be disrupted by the loss of small DNA elements, even when pairing is preserved. Consistent with these findings using endogenous alleles, insulators play roles in transvection between transgenes (Viets, 2019 and references therein).
Pairing and transvection are stronger in certain cell types than in others, suggesting that tighter pairing in a given cell type enables more efficient transvection. While a number of studies have noted differences in the level of pairing between cell types or in the efficiency of transvection between cell types, the study provides a link between the level of pairing in a given tissue and the efficiency of transvection in that tissue. This cell-type-specific pairing may be partially facilitated by differences in insulator levels or by tissue-specific factors that recruit insulator proteins. Additionally, larval photoreceptors are post-mitotic, while the antennal disc is still undergoing cell division. Frequent cell division in the antennal disc may disrupt pairing interactions between transgenes and their endogenous loci (Viets, 2019).
In addition to differences in pairing efficiency, differences in the function of cis-regulatory elements between cell types may contribute to cell-type-specific transvection. Certain enhancers have a stronger capacity to act between chromosomes than others. An eye-specific enhancer for a gene might have a stronger transvection capacity than the antenna-specific enhancer for the same gene, facilitating more efficient transvection in the eye. Additionally, enhancer action between chromosomes is often more efficient in the absence of a promoter in cis. Therefore, certain enhancers, such as those in the antenna, may only perform transvection in specific promoter-mutant backgrounds. However, the current data suggest that the strength of pairing also plays a major role in determining transvection efficiency between cell types. Variation in levels of pairing or transvection across cell types has been observed for a number of loci, suggesting that differences in pairing between cell types may be a general mechanism regulating gene expression (Viets, 2019).
Together, these findings suggest a general mechanism in which TADs and insulators contribute to homologous chromosome pairing and interchromosomal gene regulation across organisms to facilitate processes including X-inactivation and imprinting (Viets, 2019).
Abdusselamoglu, M. D., Landskron, L., Bowman, S. K., Eroglu, E., Burkard, T., Kingston, R. E. and Knoblich, J. A. (2019). Dynamics of activating and repressive histone modifications in Drosophila neural stem cell lineages and brain tumors. Development 146(23). PubMed ID: 31748204
Abraham, A., Villanyi, Z., Zsindely, N., Nagy, G., Szabo, A., Bodai, L., Henn, L. and Boros, I. M. (2022). Despite its sequence identity with canonical H4, Drosophila H4r product is enriched at specific chromatin regions. Sci Rep 12(1): 5007. PubMed ID: 35322122
Ahmad, K. and Henikoff, S. (2002). Histone H3 variants specify modes of chromatin assembly. Proc. Natl. Acad. Sci. 99 Suppl 4: 16477-84. 1217744
Akan, I., Love, D. C., Harwood, K., Bond, M. R. and Hanover, J. A. (2016). Drosophila O-GlcNAcase deletion globally perturbs chromatin O-GlcNAcylation. J Biol Chem [Epub ahead of print]. PubMed ID: 26957542
Alberti, S., Gladfelter, A. and Mittag, T. (2019). Considerations and challenges in studying liquid-liquid phase separation and biomolecular condensates. Cell 176(3): 419-434. PubMed ID: 30682370
Alecki, C., Chiwara, V., Sanz, L. A., Grau, D., Arias Perez, O., Boulier, E. L., Armache, K. J., Chedin, F. and Francis, N. J. (2020). RNA-DNA strand exchange by the Drosophila Polycomb complex PRC2. Nat Commun 11(1): 1781. PubMed ID: 32286294
Alexiadis, A., Delidakis, C. and Kalantidis, K. (2017). Snipper, an Eri1 homologue, affects histone mRNA abundance and is crucial for normal Drosophila melanogaster development. FEBS Lett [Epub ahead of print]. PubMed ID: 28626879
AlHaj Abed, J., Erceg, J., Goloborodko, A., Nguyen, S. C., McCole, R. B., Saylor, W., Fudenberg, G., Lajoie, B. R., Dekker, J., Mirny, L. A. and Wu, C. T. (2019). Highly structured homolog pairing reflects functional organization of the Drosophila genome. Nat Commun 10(1): 4485. PubMed ID: 31582763
Allahyar, A., Vermeulen, C., Bouwman, B. A. M., Krijger, P. H. L., Verstegen, M., Geeven, G., van Kranenburg, M., Pieterse, M., Straver, R., Haarhuis, J. H. I., Jalink, K., Teunissen, H., Renkens, I. J., Kloosterman, W. P., Rowland, B. D., de Wit, E., de Ridder, J. and de Laat, W. (2018). Enhancer hubs and loop collisions identified from single-allele topologies. Nat Genet 50(8): 1151-1160. PubMed ID: 29988121
Amankwaa, B., Schoborg, T. and Labrador, M. (2022). Drosophila insulator proteins exhibit in vivo liquid-liquid phase separation properties. Life Sci Alliance 5(12). PubMed ID: 35853678
Amiad-Pavlov, D., Lorber, D., Bajpai, G., Reuveny, A., Roncato, F., Alon, R., Safran, S. and Volk, T. (2021). Live imaging of chromatin distribution reveals novel principles of nuclear architecture and chromatin compartmentalization. Sci Adv 7(23). PubMed ID: 34078602
Amodeo, A. A., Jukam, D., Straight, A. F. and Skotheim, J. M. (2015). Histone titration against the genome sets the DNA-to-cytoplasm threshold for the Xenopus midblastula transition. Proc Natl Acad Sci U S A 112(10): E1086-1095. PubMed ID: 25713373
Andreyeva, E. N., et al. (2017). Regulatory functions and chromatin loading dynamics of linker histone H1 during endoreplication in Drosophila. Genes Dev 31(6): 603-616. PubMed ID: 28404631
Arzate-Mejia, R. G., Josue Cerecedo-Castillo, A., Guerrero, G., Furlan-Magaril, M. and Recillas-Targa, F. (2020). In situ dissection of domain boundaries affect genome topology and gene transcription in Drosophila. Nat Commun 11(1): 894. PubMed ID: 32060283
Ayer, D. E., Kretzner, L. and Eisenman, R. N. (1993). Mad: a heterodimeric partner for Max that antagonizes Myc transcriptional activity. Cell 72: 211-222. 8425218
Bag, I., Chen, S., Rosin, L. F., Chen, Y., Liu, C. Y., Yu, G. Y. and Lei, E. P. (2021). M1BP cooperates with CP190 to activate transcription at TAD borders and promote chromatin insulator activity. Nat Commun 12(1): 4170. PubMed ID: 34234130
Baldi, S., Krebs, S., Blum, H. and Becker, P. B. (2018). Genome-wide measurement of local nucleosome array regularity and spacing by nanopore sequencing. Nat Struct Mol Biol 25(9): 894-901. PubMed ID: 30127356
Barbieri, M., Chotalia, M., Fraser, J., Lavitas, L. M., Dostie, J., Pombo, A. and Nicodemi, M. (2012). Complexity of chromatin folding is captured by the strings and binders switch model. Proc Natl Acad Sci U S A 109: 16173-16178. PubMed ID: 22988072
Baumann, D. G. and Gilmour, D. S. (2017). A sequence-specific core promoter-binding transcription factor recruits TRF2 to coordinately transcribe ribosomal protein genes. Nucleic Acids Res 45(18): 10481-10491. PubMed ID: 28977400
Bellec, M., Dufourt, J., Hunt, G., Lenden-Hasse, H., Trullo, A., Zine El Aabidine, A., Lamarque, M., Gaskill, M. M., Faure-Gautron, H., Mannervik, M., Harrison, M. M., Andrau, J. C., Favard, C., Radulescu, O. and Lagha, M. (2022). The control of transcriptional memory by stable mitotic bookmarking. Nat Commun 13(1): 1176. PubMed ID: 35246556
Benedetti, F., Dorier, J., Burnier, Y. and Stasiak, A. (2014). Models that include supercoiling of topological domains reproduce several known features of interphase chromosomes. Nucleic Acids Res 42: 2848-2855. PubMed ID: 24366878
Bernues, J., Izquierdo-Boulstridge, A., Reina, O., ..., Climent-Canto, P. and Azorin, F. (2022). Lysine 27 dimethylation of Drosophila linker histone dH1 contributes to heterochromatin organization independently of H3K9 methylation. Nucleic Acids Res 50(16): 9212-9225. PubMed ID: 36039761
Berloco, M., Fanti, L., Breiling, A., Orlando, V. and Pimpinelli, S. (2001). The maternal effect gene, abnormal oocyte (abo), of Drosophila melanogaster encodes a specific negative regulator of histones. Proc Natl Acad Sci U S A 98(21): 12126-12131. PubMed ID: 11593026
Black, K, L., Petruk, S., Fenstermaker, T. K., Hodgson, J. W., Caplan, J. L., Brock, H. W. and Mazo, A. (2016). Chromatin proteins and RNA are associated with DNA during all phases of mitosis. Cell Discov 2: 16038. PubMed ID: 27807477
Blythe, S. A. and Wieschaus, E. F. (2016). Establishment and maintenance of heritable chromatin structure during early Drosophila embryogenesis. Elife 5. PubMed ID: 27879204
Boettiger, A. N., Bintu, B., Moffitt, J. R., Wang, S., Beliveau, B. J., Fudenberg, G., Imakaev, M., Mirny, L. A., Wu, C. T. and Zhuang, X. (2016). Super-resolution imaging reveals distinct chromatin folding for different epigenetic states. Nature [Epub ahead of print]. PubMed ID: 26760202
Benner, L., Castro, E. A., Whitworth, C., Venken, K. J. T., Yang, H., Fang, J., Oliver, B., Cook, K. R. and Lerit, D. A. (2019). Drosophila heterochromatin stabilization requires the zinc-finger protein Small ovary. Genetics. PubMed ID: 31558581
Breuer, M. and Ohkura, H. (2015). A negative loop within the nuclear pore complex controls global chromatin organization. Genes Dev 29: 1789-1794. PubMed ID: 26341556
Brown, E. J., Nguyen, A. H. and Bachtrog, D. (2020). The Drosophila Y chromosome affects heterochromatin integrity genome-wide. Mol Biol Evol. PubMed ID: 32211857
Bughio, F., Huckell, G. R. and Maggert, K. A. (2019). Monitoring of switches in heterochromatin-induced silencing shows incomplete establishment and developmental instabilities. Proc Natl Acad Sci U S A 116(40): 20043-20053. PubMed ID: 31527269
Cardozo Gizzi, A. M., Cattoni, D. I., Fiche, J. B., Espinola, S. M., Gurgo, J., Messina, O., Houbron, C., Ogiyama, Y., Papadopoulos, G. L., Cavalli, G., Lagha, M. and Nollmann, M. (2019). Microscopy-based chromosome conformation capture enables simultaneous visualization of genome organization and transcription in intact organisms. Mol Cell 74(1): 212-222.e215. PubMed ID: 30795893
Cartwright, T. N., Harris, R. J., Meyer, S. K., Mon, A. M., Watson, N. A., Tan, C., Marcelot, A., Wang, F., Zinn-Justin, S., Traktman, P. and Higgins, J. M. G. (2022). Dissecting the roles of Haspin and VRK1 in histone H3 phosphorylation during mitosis. Sci Rep 12(1): 11210. PubMed ID: 35778595
Chang, C. H. and Larracuente, A. M. (2018). Heterochromatin-enriched assemblies reveal the sequence and organization of the Drosophila melanogaster Y chromosome. Genetics. PubMed ID: 30420487
Chereji, R. V., Bryson, T. D. and Henikoff, S. (2019). Quantitative MNase-seq accurately maps nucleosome occupancy levels. Genome Biol 20(1): 198. PubMed ID: 31519205
Chetverina, D., Vorobyeva, N. E., Mazina, M. Y., Fab, L. V., Lomaev, D., Golovnina, A., Mogila, V., Georgiev, P., Ziganshin, R. H. and Erokhin, M. (2022). Comparative interactome analysis of the PRE DNA-binding factors: purification of the Combgap-, Zeste-, Psq-, and Adf1-associated proteins. Cell Mol Life Sci 79(7): 353. PubMed ID: 35676368
Chopra, V. S., Cande, J., Hong, J. W. and Levine, M. (2009). Stalled Hox promoters as chromosomal boundaries. Genes Dev 23(13): 1505-1509. PubMed ID: 19515973
Ciabrelli, F., Comoglio, F., Fellous, S., Bonev, B., Ninova, M., Szabo, Q., Xuereb, A., Klopp, C., Aravin, A., Paro, R., Bantignies, F. and Cavalli, G. (2017). Stable Polycomb-dependent transgenerational inheritance of chromatin states in Drosophila. Nat Genet 49(6): 876-886. PubMed ID: 28436983
Climent-Canto, P., Carbonell, A., Tatarski, M., Reina, O., Bujosa, P., Font-Mateu, J., Bernues, J., Beato, M. and Azorin, F. (2020). The embryonic linker histone dBigH1 alters the functional state of active chromatin. Nucleic Acids Res. PubMed ID: 32103264
Cuartero, S., Fresan, U., Reina, O., Planet, E. and Espinas, M. L. (2014). Ibf1 and Ibf2 are novel CP190-interacting proteins required for insulator function. EMBO J 33(6): 637-647. PubMed ID: 24502977
Cubeñas-Potts, C., Rowley, M. J., Lyu, X., Li, G., Lei, E. P. and Corces, V. G. (2017). Different enhancer classes in Drosophila bind distinct architectural proteins and mediate unique chromatin interactions and 3D architecture. Nucleic Acids Res 45(4): 1714-1730. PubMed ID: 27899590
Cusanovich, D. A., Reddington, J. P., Garfield, D. A., Daza, R. M., Aghamirzaie, D., Marco-Ferreres, R., Pliner, H. A., Christiansen, L., Qiu, X., Steemers, F. J., Trapnell, C., Shendure, J. and Furlong, E. E. M. (2018). The cis-regulatory dynamics of embryonic development at single-cell resolution. Nature 555(7697): 538-542. PubMed ID: 29539636
Cusanovich, D. A., Hill, A. J., Aghamirzaie, D., Daza, R. M., Pliner, H. A., Berletch, J. B., Filippova, G. N., Huang, X., Christiansen, L., DeWitt, W. S., Lee, C., Regalado, S. G., Read, D. F., Steemers, F. J., Disteche, C. M., Trapnell, C. and Shendure, J. (2018). A single-cell atlas of in vivo mammalian chromatin accessibility. Cell 174(5): 1309-1324 e1318. PubMed ID: 30078704
Czermin, B., Schotta, G., Hulsmann, B. B., Brehm, A., Becker, P. B., Reuter, G. and Imhof, A. (2001). Physical and functional association of SU(VAR)3-9 and HDAC1 in Drosophila EMBO Rep. 2: 915-919. 11571273
De, S., Mitra, A., Cheng, Y., Pfeifer, K. and Kassis, J. A. (2016). Formation of a polycomb-domain in the absence of strong polycomb response elements. PLoS Genet 12(7): e1006200. PubMed ID: 27466807
De, S., Cheng, Y., Sun, M. A., Gehred, N. D. and Kassis, J. A. (2019). Structure and function of an ectopic Polycomb chromatin domain. Sci Adv 5(1): eaau9739. PubMed ID: 30662949
de Lima, L. G., Hanlon, S. L. and Gerton, J. L. (2020). Origins and Evolutionary Patterns of the 1.688 Satellite DNA Family in Drosophila Phylogeny. G3 (Bethesda). PubMed ID: 32934018
DeLuca, S. Z., Ghildiyal, M., Pang, L. Y. and Spradling, A. C. (2020). Differentiating Drosophila female germ cells initiate Polycomb silencing by regulating PRC2-interacting proteins. Elife 9. PubMed ID: 32773039
Dias, G. B., Heringer, P., Svartman, M. and Kuhn, G. C. (2015). Helitrons shaping the genomic architecture of Drosophila: enrichment of DINE-TR1 in alpha- and beta-heterochromatin, satellite DNA emergence, and piRNA expression. Chromosome Res [Epub ahead of print]. PubMed ID: 26408292
Dufourt, J., Trullo, A., Hunter, J., Fernandez, C., Lazaro, J., Dejean, M., Morales, L., Nait-Amer, S., Schulz, K. N., Harrison, M. M., Favard, C., Radulescu, O. and Lagha, M. (2018). Temporal control of gene expression by the pioneer factor Zelda through transient interactions in hubs. Nat Commun 9(1): 5194. PubMed ID: 30518940
Duronio, R. J. and Marzluff, W. F. (2017). Coordinating cell cycle-regulated histone gene expression through assembly and function of the Histone Locus Body. RNA Biol 14(6): 726-738. PubMed ID: 28059623
Eagen, K. P., Hartl, T. A. and Kornberg, R. D. (2015). Stable Chromosome Condensation Revealed by Chromosome Conformation Capture. Cell 163: 934-946. PubMed ID: 26544940
Eagen, K. P., Aiden, E. L. and Kornberg, R. D. (2017). Polycomb-mediated chromatin loops revealed by a subkilobase-resolution chromatin interaction map. Proc Natl Acad Sci U S A 114(33): 8764-8769. PubMed ID: 28765367
Erceg, J., Pakozdi, T., Marco-Ferreres, R., Ghavi-Helm, Y., Girardot, C., Bracken, A. P. and Furlong, E. E. (2017). Dual functionality of cis-regulatory elements as developmental enhancers and Polycomb response elements. Genes Dev 31(6): 590-602. PubMed ID: 28381411
Erceg, J., AlHaj Abed, J., Goloborodko, A., Lajoie, B. R., Fudenberg, G., Abdennur, N., Imakaev, M., McCole, R. B., Nguyen, S. C., Saylor, W., Joyce, E. F., Senaratne, T. N., Hannan, M. A., Nir, G., Dekker, J., Mirny, L. A. and Wu, C. T. (2019). The genome-wide multi-layered architecture of chromosome pairing in early Drosophila embryos. Nat Commun 10(1): 4486. PubMed ID: 31582744
Espinola, S. M., Gotz, M., Bellec, M., Messina, O., Fiche, J. B., Houbron, C., Dejean, M., Reim, I., Cardozo Gizzi, A. M., Lagha, M. and Nollmann, M. (2021). Cis-regulatory chromatin loops arise before TADs and gene activation, and are independent of cell fate during early Drosophila development. Nat Genet 53(4): 477-486. PubMed ID: 33795867
Falahati, H., Pelham-Webb, B., Blythe, S. and Wieschaus, E. (2016). Nucleation by rRNA dictates the precision of nucleolus assembly. Curr Biol 26(3): 277-285. PubMed ID: 26776729
Farago, A., Urmosi, A., Farkas, A. and Bodai, L. (2021). The histone replacement gene His4r is involved in heat stress induced chromatin rearrangement. Sci Rep 11(1): 4878. PubMed ID: 33649489
Filion, G. J., van Bemmel, J. G., Braunschweig, U., Talhout, W., Kind, J., Ward, L. D., Brugman, W., de Castro, I. J., Kerkhoven, R. M., Bussemaker, H. J. and van Steensel, B. (2010). Systematic protein location mapping reveals five principal chromatin types in Drosophila cells. Cell 143: 212-224. PubMed ID: 20888037
Floc'hlay, S., Wong, E., Zhao, B., Viales, R. R., Thomas-Chollier, M., Thieffry, D., Garfield, D. A. and Furlong, E. E. M. (2020). Cis-acting variation is common across regulatory layers but is often buffered during embryonic development. Genome Res. PubMed ID: 33310749
Fresan, U., Rodriguez-Sanchez, M. A., Reina, O., Corces, V. G. and Espinas, M. L. (2020). Haspin kinase modulates nuclear architecture and Polycomb-dependent gene silencing. PLoS Genet 16(8): e1008962. PubMed ID: 32750047
Fujioka, M., Mistry, H., Schedl, P. and Jaynes, J. B. (2016). Determinants of chromosome architecture: Insulator pairing in cis and in trans. PLoS Genet 12: e1005889. PubMed ID: 26910731
Fujioka, M., Nezdyur, A. and Jaynes, J. B. (2021). An insulator blocks access to enhancers by an illegitimate promoter, preventing repression by transcriptional interference. PLoS Genet 17(4): e1009536. PubMed ID: 33901190
Fursova, N. A., Mazina, M. Y., Nikolenko, J. V., Vorobyova, N. E. and Krasnov, A. N. (2020). Drosophila Zinc Finger Protein CG9890 Is Colocalized with Chromatin Modifying and Remodeling Complexes on Gene Promoters and Involved in Transcription Regulation. Acta Naturae 12(4): 114-119. PubMed ID: 33456983
Fursova, N. A., Turberfield, A. H., Blackledge, N. P., Findlater, E. L., Lastuvkova, A., Huseyin, M. K., Dobrinic, P. and Klose, R. J. (2021). BAP1 constrains pervasive H2AK119ub1 to control the transcriptional potential of the genome. Genes Dev 35(9-10): 749-770. PubMed ID: 33888563
Galan, S., Machnik, N., Kruse, K., Diaz, N., Marti-Renom, M. A. and Vaquerizas, J. M. (2020). CHESS enables quantitative comparison of chromatin contact data and automatic feature extraction. Nat Genet 52(11): 1247-1255. PubMed ID: 33077914
Gassler, J., Brandao, H. B., Imakaev, M., Flyamer, I. M., Ladstatter, S., Bickmore, W. A., Peters, J. M., Mirny, L. A. and Tachibana, K. (2017). A mechanism of cohesin-dependent loop extrusion organizes zygotic genome architecture. EMBO J 36(24): 3600-3618. PubMed ID: 29217590
Ghavi-Helm, Y., Jankowski, A., Meiers, S., Viales, R. R., Korbel, J. O. and Furlong, E. E. M. (2019). Highly rearranged chromosomes reveal uncoupling between genome topology and gene expression. Nat Genet 51(8): 1272-1282. PubMed ID: 31308546
Ghotbi, E., Ye, P., Ervin, T., Kum, A., Benes, J. and Jones, R. S. (2021). Polycomb-group recruitment to a Drosophila target gene is the default state that is inhibited by a transcriptional activator. Sci Adv 7(29). PubMed ID: 34272248
Gisselbrecht, S. S., Barrera, L. A., Porsch, M., Aboukhalil, A., Estep, P. W., Vedenko, A., Palagi, A., Kim, Y., Zhu, X., Busser, B. W., Gamble, C. E., Iagovitina, A., Singhania, A., Michelson, A. M. and Bulyk, M. L. (2013). Highly parallel assays of tissue-specific enhancers in whole Drosophila embryos. Nat Methods 10(8): 774-780. PubMed ID: 23852450
Gisselbrecht, S. S., Palagi, A., Kurland, J. V., Rogers, J. M., Ozadam, H., Zhan, Y., Dekker, J. and Bulyk, M. L. (2019). Transcriptional silencers in Drosophila serve a dual role as transcriptional enhancers in alternate cellular contexts. Mol Cell. 77(2):324-337 PubMed ID: 31704182
Gotz, M., Messina, O., Espinola, S., Fiche, J. B. and Nollmann, M. (2022). Multiple parameters shape the 3D chromatin structure of single nuclei at the doc locus in Drosophila. Nat Commun 13(1): 5375. PubMed ID: 36104317
Gozalo, A., Duke, A., Lan, Y., Pascual-Garcia, P., Talamas, J. A., Nguyen, S. C., Shah, P. P., Jain, R., Joyce, E. F. and Capelson, M. (2019). Core components of the nuclear pore bind distinct states of chromatin and contribute to Polycomb repression. Mol Cell. PubMed ID: 31784359
Hafer, T. L., Patra, S., Tagami, D. and Kohwi, M. (2022). Enhancer of trithorax/polycomb, Corto, regulates timing of hunchback gene relocation and competence in Drosophila neuroblasts. Neural Dev 17(1): 3. PubMed ID: 35177098
Harmston, N., Ing-Simmons, E., Tan, G., Perry, M., Merkenschlager, M. and Lenhard, B. (2017). Topologically associating domains are ancient features that coincide with Metazoan clusters of extreme noncoding conservation. Nat Commun 8(1): 441. PubMed ID: 28874668
Hassig, C. A., et al. (1997). Histone deacetylase activity is required for full
transcriptional repression by mSin3A. Cell 89 (3): 341-347. PubMed ID: 9150133
Herzog, V. A., Lempradl, A., Trupke, J., Okulski, H., Altmutter, C., Ruge, F., Boidol, B., Kubicek, S., Schmauss, G., Aumayr, K., Ruf, M., Pospisilik, A., Dimond, A., Senergin, H. B., Vargas, M. L., Simon, J. A. and Ringrose, L. (2014). A strand-specific switch in noncoding transcription switches the function of a Polycomb/Trithorax response element. Nat Genet 46(9): 973-981. PubMed ID: 25108384
Hou, C., Li, L., Qin, Z. S. and Corces, V. G. (2012). Gene density, transcription, and insulators contribute to the partition of the Drosophila genome into physical domains. Mol Cell 48(3): 471-484. PubMed ID: 23041285
Hou, L., Wang, L., Berg, A., Qian, M., Zhu, Y., Li, F. and Deng, M. (2012). Comparison and evaluation of network clustering algorithms applied to genetic interaction networks. Front Biosci (Elite Ed) 4: 2150-2161. PubMed ID: 22202027
Hu, X., Li, M., Hao, X., Lu, Y., Zhang, L. and Wu, G. (2021). The Osa-Containing SWI/SNF Chromatin-Remodeling Complex Is Required in the Germline Differentiation Niche for Germline Stem Cell Progeny Differentiation. Genes (Basel) 12(3). PubMed ID: 33806269
Hug, C. B., Grimaldi, A. G., Kruse, K. and Vaquerizas, J. M. (2017). Chromatin architecture emerges during zygotic genome activation independent of transcription. Cell 169(2): 216-228. PubMed ID: 28388407
Hung, R. J., Hu, Y., Kirchner, R., Liu, Y., Xu, C., Comjean, A., Tattikota, S. G., Li, F., Song, W., Ho Sui, S. and Perrimon, N. (2020). A cell atlas of the adult Drosophila midgut. Proc Natl Acad Sci U S A 117(3): 1514-1523. PubMed ID: 31915294
Hur, W., Kemp, J. P., Jr., Tarzia, M., Deneke, V. E., Marzluff, W. F., Duronio, R. J. and Di Talia, S. (2020). CDK-regulated phase separation seeded by histone genes ensures precise growth and function of Histone Locus Bodies. Dev Cell 54(3): 379-394. PubMed ID: 32579968
Hwang, K. K., Eissenberg, J. C. and Worman, H. J. (2001). Transcriptional repression of euchromatic genes by Drosophila heterochromatin protein 1 and histone modifiers. Proc. Natl. Acad. Sci. 98: 11423-11427. 11562500
Ilyin, A. A., Kononkova, A. D., Golova, A. V., Shloma, V. V., Olenkina, O. M., Nenasheva, V. V., Abramov, Y. A., Kotov, A. A., Maksimov, D. A., Laktionov, P. P., Pindyurin, A. V., Galitsyna, A. A., Ulianov, S. V., Khrameeva, E. E., Gelfand, M. S., Belyakin, S. N., Razin, S. V. and Shevelyov, Y. Y. (2022). Comparison of genome architecture at two stages of male germline cell differentiation in Drosophila. Nucleic Acids Res 50(6): 3203-3225. PubMed ID: 35166842
Ing-Simmons, E., Vaid, R., Bing, X. Y., Levine, M., Mannervik, M. and Vaquerizas, J. M. (2021). Independence of chromatin conformation and gene regulation during Drosophila dorsoventral patterning. Nat Genet 53(4): 487-499. PubMed ID: 33795866
Jacobs, S. A., Taverna, S. D., Zhang, Y., Briggs, S. D., Li, J., Eissenberg, J. C., Allis, C. D. and Khorasanizadeh, S. (2001). Specificity of the HP1 chromo domain for the methylated N-terminus of histone H3. EMBO J. 20: 5232-5241. 11566886
Jain, S. U., Rashoff, A. Q., Krabbenhoft, S. D., Hoelper, D., Do, T. J., Gibson, T. J., Lundgren, S. M., Bondra, E. R., Deshmukh, S., Harutyunyan, A. S., Juretic, N., Jabado, N., Harrison, M. M. and Lewis, P. W. (2020). H3 K27M and EZHIP Impede H3K27-Methylation Spreading by Inhibiting Allosterically Stimulated PRC2. Mol Cell 80(4): 726-735. PubMed ID: 33049227
Jefferies, G., Somers, J., Lohrey, I., Chaturvedi, V., Calabria, J., Marshall, O. J., Southall, T. D., Saint, R. and Murray, M. J. (2020). Maintenance of Cell Fate by the Polycomb Group Gene Sex Combs Extra Enables a Partial Epithelial Mesenchymal Transition in Drosophila. G3 (Bethesda). PubMed ID: 33051260
Jost, D., Carrivain, P., Cavalli, G. and Vaillant, C. (2014). Modeling epigenome folding: formation and dynamics of topologically associated chromatin domains. Nucleic Acids Res 42: 9553-9561. PubMed ID: 25092923
Jung, Y.L., Kang, H., Park, P.J. and Kuroda, M.I. (2016). Correspondence of Drosophila Polycomb Group proteins with broad H3K27me3 silent domains. Fly (Austin) [Epub ahead of print]. PubMed ID: 26940990
Kang, H., Jung, Y. L., McElroy, K. A., Zee, B. M., Wallace, H. A., Woolnough, J. L., Park, P. J. and Kuroda, M. I. (2017). Bivalent complexes of PRC1 with orthologs of BRD4 and MOZ/MORF target developmental genes in Drosophila. Genes Dev 31(19): 1988-2002. PubMed ID: 29070704
Kapur, I., Boulier, E. L. and Francis, N. J. (2022). Regulation of Polyhomeotic Condensates by Intrinsically Disordered Sequences That Affect Chromatin Binding. Epigenomes 6(4). PubMed ID: 36412795
Kasinathan, B., Colmenares, S. U., McConnell, H., Young, J. M., Karpen, G. H. and Malik, H. S. (2020). Innovation of heterochromatin functions drives rapid evolution of essential ZAD-ZNF genes in Drosophila. Elife 9. PubMed ID: 33169670
Ke, Y., Xu, Y., Chen, X., Feng, S., Liu, Z., Sun, Y., Yao, X., Li, F., Zhu, W., Gao, L., Chen, H., Du, Z., Xie, W., Xu, X., Huang, X. and Liu, J. (2017). 3D chromatin structures of mature gametes and structural reprogramming during mammalian embryogenesis. Cell 170(2): 367-381 e320. PubMed ID: 28709003
Khan, M. H. F., Akhtar, J., Umer, Z., Shaheen, N., Shaukat, A., Munir, M. S., Mithani, A., Anwar, S. and Tariq, M. (2021). Kinome-Wide RNAi Screen Uncovers Role of Ballchen in Maintenance of Gene Activation by Trithorax Group in Drosophila. Front Cell Dev Biol 9: 637873. PubMed ID: 33748127
Kharchenko, P. V., Alekseyenko, A. A., Schwartz, Y. B., Minoda, A., Riddle, N. C., et al. (2011). Comprehensive analysis of the chromatin landscape in Drosophila melanogaster. Nature 471(7339): 480-5. PubMed ID: 21179089
Khoroshko, V. A., Pokholkova, G. V., Levitsky, V. G., Zykova, T. Y., Antonenko, O. V., Belyaeva, E. S. and Zhimulev, I. F. (2020). Genes Containing Long Introns Occupy Series of Bands and Interbands In Drosophila melanogaster polytene Chromosomes. Genes (Basel) 11(4). PubMed ID: 32290448
King, T., Johnson, J. E. and Bateman, J. R. (2019). Position effects influence transvection in Drosophila melanogaster. Genetics. PubMed ID: 31611231
Koreski, K. P., Rieder, L. E., McLain, L. M., Chaubal, A., Marzluff, W. F. and Duronio, R. J. (2020). Drosophila Histone Locus Body assembly and function involves multiple interactions. Mol Biol Cell: mbcE20030176. PubMed ID: 32401666
Kuhn, T. M., Pascual-Garcia, P., Gozalo, A., Little, S. C. and Capelson, M. (2019). Chromatin targeting of nuclear pore proteins induces chromatin decondensation. J Cell Biol. PubMed ID: 31366666
Laghmach, R., Di Pierro, M. and Potoyan, D. A. (2021). The interplay of chromatin phase separation and lamina interactions in nuclear organization. Biophys J. PubMed ID: 34653387
Laprell, F., Finkl, K. and Muller, J. (2017). Propagation of Polycomb-repressed chromatin requires sequence-specific recruitment to DNA. Science 356(6333):85-88. PubMed ID: 28302792
Le, T. B., Imakaev, M. V., Mirny, L. A. and Laub, M. T. (2013). High-resolution mapping of the spatial organization of a bacterial chromosome. Science 342: 731-734. PubMed ID: 24158908
Lee, H. G., Kahn, T. G., Simcox, A., Schwartz, Y. B. and Pirrotta, V. (2015). Genome-wide activities of Polycomb complexes control pervasive transcription. Genome Res 25: 1170-1181. PubMed ID: 25986499
Li, Z., Guo, X., Huang, H., Wang, C., Yang, F., Zhang, Y., Wang, J., Han, L., Jin, Z., Cai, T. and Xi, R. (2020). A Switch in Tissue Stem Cell Identity Causes Neuroendocrine Tumors in Drosophila Gut. Cell Rep 33(9): 108459. PubMed ID: 33264608
Liao, Y., Zhang, X., Chakraborty, M. and Emerson, J. J. (2021). Topologically associating domains and their role in the evolution of genome structure and function in Drosophila. Genome Res 31(3): 397-410. PubMed ID: 33563719
Liaw, G. J. (2022). Polycomb repressive complex 1 initiates and maintains tailless repression in Drosophila embryo. Biochim Biophys Acta Gene Regul Mech 1865(2): 194786. PubMed ID: 35032681
Loubiere, V., Papadopoulos, G. L., Szabo, Q., Martinez, A. M. and Cavalli, G. (2020). Widespread activation of developmental gene expression characterized by PRC1-dependent chromatin looping. Sci Adv 6(2): eaax4001. PubMed ID: 31950077
Lu, X., Li, J. M., Elemento, O., Tavazoie, S. and Wieschaus, E. F. (2009). Coupling of zygotic transcription to mitotic control at the Drosophila mid-blastula transition. Development 136(12): 2101-2110. PubMed ID: 19465600
Luzhin, A. V., Flyamer, I. M., Khrameeva, E. E., Ulianov, S. V., Razin, S. V. and Gavrilov, A. A. (2018). Quantitative differences in TAD border strength underly the TAD hierarchy in Drosophila chromosomes. J Cell Biochem. PubMed ID: 30260021
Mahajan, S., Wei, K. H., Nalley, M. J., Gibilisco, L. and Bachtrog, D. (2018). De novo assembly of a young Drosophila Y chromosome using single-molecule sequencing and chromatin conformation capture. PLoS Biol 16(7): e2006348. PubMed ID: 30059545
Marshall, O. J. and Brand, A. H. (2017). Chromatin state changes during neural development revealed by in vivo cell-type specific profiling. Nat Commun 8(1): 2271. PubMed ID: 29273756
Martin, R. L., Maiorano, J., Beitel, G. J., Marko, J. F., McVicker, G. and Fondufe-Mittendorf, Y. N. (2017). A comparison of nucleosome organization in Drosophila cell lines. PLoS One 12(6): e0178590. PubMed ID: 28570602
Mashtalir, N., D'Avino, A. R., Michel, B. C., Luo, J., Pan, J., Otto, J. E., Zullow, H. J., McKenzie, Z. M., Kubiak, R. L., St Pierre, R., Valencia, A. M., Poynter, S. J., Cassel, S. H., Ranish, J. A. and Kadoch, C. (2018). Modular organization and assembly of SWI/SNF family chromatin remodeling complexes. Cell 175(5): 1272-1288. PubMed ID: 30343899
McKowen, J. K., Avva, S., Maharjan, M., Duarte, F. M., Tome, J. M., Judd, J., Wood, J. L., Negedu, S., Dong, Y., Lis, J. T. and Hart, C. M. (2022). The Drosophila BEAF insulator protein interacts with the polybromo subunit of the PBAP chromatin remodeling complex. G3 (Bethesda) 12(11). PubMed ID: 36029240
Messina, G., Prozzillo, Y., Monache, F. D., Santopietro, M. V. and Dimitri, P. (2022). Evolutionary conserved relocation of chromatin remodeling complexes to the mitotic apparatus. BMC Biol 20(1): 172. PubMed ID: 35922843
Mir, M., Stadler, M. R., Ortiz, S. A., Hannon, C. E., Harrison, M. M., Darzacq, X. and Eisen, M. B. (2018). Dynamic multifactor hubs interact transiently with sites of active transcription in Drosophila embryos. Elife 7. PubMed ID: 30589412
Mohan, R. D., Dialynas, G., Weake, V. M., Liu, J., Martin-Brown, S., Florens, L., Washburn, M. P., Workman, J. L. and Abmayr, S. M. (2014). Loss of Drosophila Ataxin-7, a SAGA subunit, reduces H2B ubiquitination and leads to neural and retinal degeneration. Genes Dev 28(3): 259-272. PubMed ID: 24493646
Munden, A., Benton, M. L., Capra, J. A. and Nordman, J. T. (2022). R-loop Mapping and Characterization During Drosophila Embryogenesis Reveals Developmental Plasticity in R-loop Signatures. J Mol Biol 434(13): 167645. PubMed ID: 35609632
Nagy, L., et al. (1997). Nuclear receptor repression mediated by a complex
containing SMRT, mSin3A, and histone deacetylase. Cell 89 (3): 373-380. PubMed ID: 9150137
Ninova, M., Chen, Y. A., Godneeva, B., Rogers, A. K., Luo, Y., Fejes Toth, K. and Aravin, A. A. (2020a). Su(var)2-10 and the SUMO pathway link piRNA-guided target recognition to chromatin silencing. Mol Cell 77(3): 556-570. PubMed ID: 31901446
Ninova, M., Godneeva, B., Chen, Y. A., Luo, Y., Prakash, S. J., Jankovics, F., Erdelyi, M., Aravin, A. A. and Fejes Toth, K. (2020b). The SUMO ligase Su(var)2-10 controls hetero- and euchromatic gene expression via establishing H3K9 trimethylation and negative feedback regulation. Mol Cell 77(3): 571-585 e574. PubMed ID: 31901448
Nishioka, K., Wang, X. F., Miyazaki, H., Soejima, H. and Hirose, S. (2018). Mbf1 ensures Polycomb silencing by protecting E(z) mRNA from degradation by Pacman. Development 145(5). PubMed ID: 29523653
Nixon, K. C. J., et al. (2019). A syndromic neurodevelopmental disorder caused by mutations in SMARCD1, a core SWI/SNF subunit needed for context-dependent neuronal gene regulation in flies. Am J Hum Genet. PubMed ID: 30879640
Nuebler, J., Fudenberg, G., Imakaev, M., Abdennur, N. and Mirny, L. A. (2018). Chromatin organization by an interplay of loop extrusion and compartmental segregation. Proc Natl Acad Sci U S A 115(29): E6697-E6706. PubMed ID: 29967174
Ogiyama, Y., Schuettengruber, B., Papadopoulos, G. L., Chang, J. M. and Cavalli, G. (2018). Polycomb-dependent chromatin looping contributes to gene silencing during Drosophila development. Mol Cell 71(1): 73-88. PubMed ID: 30008320
Oudelaar, A. M., Davies, J. O. J., Hanssen, L. L. P., Telenius, J. M., Schwessinger, R., Liu, Y., Brown, J. M., Downes, D. J., Chiariello, A. M., Bianco, S., Nicodemi, M., Buckle, V. J., Dekker, J., Higgs, D. R. and Hughes, J. R. (2018). Single-allele chromatin interactions identify regulatory hubs in dynamic compartmentalized domains. Nat Genet 50(12): 1744-1751. PubMed ID: 30374068
Palmateer, C. M., Moseley, S. C., Ray, S., Brovero, S. G. and Arbeitman, M. N. (2021). Analysis of cell-type-specific chromatin modifications and gene expression in Drosophila neurons that direct reproductive behavior. PLoS Genet 17(4): e1009240. PubMed ID: 33901168
Pan, L., Xie, W., Li, K.L., Yang, Z., Xu, J., Zhang, W., Liu, L.P., Ren, X., He, Z., Wu, J., Sun, J., Wei, H.M., Wang, D., Xie, W., Li, W., Ni, J.Q. and Sun, F.L. (2015). Heterochromatin remodeling by CDK12 contributes to learning in Drosophila. Proc Natl Acad Sci U S A [Epub ahead of print]. PubMed ID: 26508632
Parey, E. and Crombach, A. (2019). Evolution of the Drosophila melanogaster chromatin landscape and its associated proteins. Genome Biol Evol 11(3): 660-677. PubMed ID: 30689829
Paul, J. (1972). General theory of chromosomes structure and gene activation in eukaryotes. Nature 238: 444-446. PubMed ID: 4561852
Rajpurkar, A. R., Mateo, L. J., Murphy, S. E. and Boettiger, A. N. (2021). Deep learning connects DNA traces to transcription to reveal predictive features beyond enhancer-promoter contact. Nat Commun 12(1): 3423. PubMed ID: 34103507
Pérez-Montero, S., Carbonell, A., Morán, T., Vaquero, A. and Azorín, F. (2013). The embryonic linker histone H1 variant of Drosophila, dBigH1, regulates zygotic genome activation. Dev Cell 26: 578-90. PubMed ID: 24055651
Perino, M., van Mierlo, G., Karemaker, I. D., van Genesen, S., Vermeulen, M., Marks, H., van Heeringen, S. J. and Veenstra, G. J. C. (2018). MTF2 recruits Polycomb Repressive Complex 2 by helical-shape-selective DNA binding. Nat Genet. PubMed ID: 29808031
Postika, N., Metzler, M., Affolter, M., Muller, M., Schedl, P., Georgiev, P. and Kyrchanova, O. (2018). Boundaries mediate long-distance interactions between enhancers and promoters in the Drosophila Bithorax complex. PLoS Genet 14(12): e1007702. PubMed ID: 30540750
Rach, E. A., Winter, D. R., Benjamin, A. M., Corcoran, D. L., Ni, T., Zhu, J. and Ohler, U. (2011). Transcription initiation patterns indicate divergent strategies for gene regulation at the chromatin level. PLoS Genet 7(1): e1001274. PubMed ID: 21249180
Ramírez, F., Lingg, T., Toscano, S., Lam, K. C., Georgiev, P., Chung, H. R., Lajoie, B. R., de Wit, E., Zhan, Y., de Laat, W., Dekker, J., Manke, T. and Akhtar, A. (2015). High-affinity sites form an interaction network to facilitate spreading of the MSL complex across the X chromosome in Drosophila. Mol Cell 60: 146-162. PubMed ID: 26431028
Ramirez, F., Bhardwaj, V., Arrigoni, L., Lam, K. C., Gruning, B. A., Villaveces, J., Habermann, B., Akhtar, A. and Manke, T. (2018). High-resolution TADs reveal DNA sequences underlying genome organization in flies. Nat Commun 9(1): 189. PubMed ID: 29335486
Ramirez, F., Bhardwaj, V., Arrigoni, L., Lam, K. C., Gruning, B. A., Villaveces, J., Habermann, B., Akhtar, A. and Manke, T. (2018). High-resolution TADs reveal DNA sequences underlying genome organization in flies. Nat Commun 9(1): 189. PubMed ID: 29335486
Rao, S., Ahmad, K. and Ramachandran, S. (2021). Cooperative binding between distant transcription factors is a hallmark of active enhancers. Mol Cell. PubMed ID: 33705711
Rao, S. S., Huntley, M. H., Durand, N. C., Stamenova, E. K., Bochkov, I. D., Robinson, J. T., Sanborn, A. L., Machol, I., Omer, A. D., Lander, E. S. and Aiden, E. L. (2014). A 3D map of the human genome at kilobase resolution reveals principles of chromatin looping. Cell 159(7): 1665-1680. PubMed ID: 25497547
Rea, S., Eisenhaber, F., O'Carroll, D., Strahl, B. D., Sun, Z. W., Schmid, M., Opravil, S., Mechtler, K., Ponting, C. P., Allis, C. D. and Jenuwein, T. (2000). Regulation of chromatin structure by site-specific histone H3 methyltransferases. Nature 406: 593-599. 10949293
Reddington, J. P., Garfield, D. A., Sigalova, O. M., Karabacak Calviello, A., Marco-Ferreres, R., Girardot, C., Viales, R. R., Degner, J. F., Ohler, U. and Furlong, E. E. M. (2020). Lineage-Resolved Enhancer and Promoter Usage during a Time Course of Embryogenesis. Dev Cell. PubMed ID: 33171098
Reddington, J. P., Garfield, D. A., Sigalova, O. M., Karabacak Calviello, A., Marco-Ferreres, R., Girardot, C., Viales, R. R., Degner, J. F., Ohler, U. and Furlong, E. E. M. (2020). Lineage-resolved enhancer and promoter usage during a time course of embryogenesis. Dev Cell. PubMed ID: 33171098
Reinig, J., Ruge, F., Howard, M. and Ringrose, L. (2020). A theoretical model of Polycomb/Trithorax action unites stable epigenetic memory and dynamic regulation. Nat Commun 11(1): 4782. PubMed ID: 32963223
Renschler, G., Richard, G., Valsecchi, C. I. K., Toscano, S., Arrigoni, L., Ramirez, F. and Akhtar, A. (2019). Hi-C guided assemblies reveal conserved regulatory topologies on X and autosomes despite extensive genome shuffling. Genes Dev. PubMed ID: 31601616
Rhodes, J., Mazza, D., Nasmyth, K. and Uphoff, S. (2017). Scc2/Nipbl hops between chromosomal cohesin rings after loading. Elife 6. PubMed ID: 28914604
Riahi, H., Brekelmans, C., Foriel, S., Merkling, S. H., Lyons, T. A., Itskov, P. M., Kleefstra, T., Ribeiro, C., van Rij, R. P., Kramer, J. M. and Schenck, A. (2019). The histone methyltransferase G9a regulates tolerance to oxidative stress-induced energy consumption. PLoS Biol 17(3): e2006146. PubMed ID: 30860988
Richards, E. J. and Elgin, S. R. C. (2002). Epigenetic codes for heterochromatin formation and silencing: rounding up the usual suspects. Cell 108: 489-500. 11909520
Rieder, L. E., Jordan, W. T., 3rd and Larschan, E. N. (2019). Targeting of the dosage-compensated male X-chromosome during early Drosophila development. Cell Rep 29(13): 4268-4275. PubMed ID: 31875538
Ringrose, L., Rehmsmeier, M., Dura, J. M. and Paro, R. (2003). Genome-wide prediction of Polycomb/Trithorax response elements in Drosophila melanogaster. Dev. Cell 5: 759-771. 14602076
Rosin, L. F., Nguyen, S. C. and Joyce, E. F. (2018). Condensin II drives large-scale folding and spatial partitioning of interphase chromosomes in Drosophila nuclei. PLoS Genet 14(7): e1007393. PubMed ID: 30001329
Rotelli, M. D., Policastro, R. A., Bolling, A. M., Killion, A. W., Weinberg, A. J., Dixon, M. J., Zentner, G. E., Walczak, C. E., Lilly, M. A. and Calvi, B. R. (2019). A Cyclin A-Myb-MuvB-Aurora B network regulates the choice between mitotic cycles and polyploid endoreplication cycles. PLoS Genet 15(7): e1008253. PubMed ID: 31291240
Rowley, M. J., Nichols, M. H., Lyu, X., Ando-Kuri, M., Rivera, I. S. M., Hermetz, K., Wang, P., Ruan, Y. and Corces, V. G. (2017). Evolutionarily conserved principles predict 3D chromatin organization. Mol Cell 67(5): 837-852. PubMed ID: 28826674
Rowley, M. J., Lyu, X., Rana, V., Ando-Kuri, M., Karns, R., Bosco, G. and Corces, V. G. (2019). Condensin II counteracts Cohesin and RNA Polymerase II in the establishment of 3D chromatin organization. Cell Rep 26(11): 2890-2903. PubMed ID: 30865881
Rozenwald, M. B., Galitsyna, A. A., Sapunov, G. V., Khrameeva, E. E. and Gelfand, M. S. (2020). A machine learning framework for the prediction of chromatin folding in Drosophila using epigenetic features. PeerJ Comput Sci 6: e307. PubMed ID: 33816958
Saha, P., Sowpati, D. T. and Mishra, R. K. (2019). Epigenomic and genomic landscape of Drosophila melanogaster heterochromatic genes. Genomics 111(2): 177-185. PubMed ID: 29432976
Sahu, B., Hartonen, T., Pihlajamaa, P., Wei, B., Dave, K., Zhu, F., Kaasinen, E., Lidschreiber, K., Lidschreiber, M., Daub, C. O., Cramer, P., Kivioja, T. and Taipale, J. (2022). Sequence determinants of human gene regulatory elements. Nat Genet. PubMed ID: 35190730
Salzler, H. R., Tatomer, D. C., Malek, P. Y., McDaniel, S. L., Orlando, A. N., Marzluff, W. F. and Duronio, R. J. (2013). A sequence in the Drosophila H3-H4 Promoter triggers histone locus body assembly and biosynthesis of replication-coupled histone mRNAs. Dev Cell 24(6): 623-634. PubMed ID: 23537633
Samata, M., Alexiadis, A., Richard, G., Georgiev, P., Nuebler, J., Kulkarni, T., Renschler, G., Basilicata, M. F., Zenk, F. L., Shvedunova, M., Semplicio, G., Mirny, L., Iovino, N. and Akhtar, A. (2020). Intergenerationally maintained histone H4 lysine 16 acetylation is instructive for future gene activation. Cell. PubMed ID: 32502394
Samborskaia, M. D., Galitsyna, A., Pletenev, I., Trofimova, A., Mironov, A. A., Gelfand, M. S. and Khrameeva, E. E. (2020). Cumulative contact frequency of a chromatin region is an intrinsic property linked to its function. PeerJ 8: e9566. PubMed ID: 32864204
Santana, J. F., Parida, M., Long, A., Wankum, J., Lilienthal, A. J., Nukala, K. M. and Manak, J. R. (2020). The Dm-Myb oncoprotein contributes to insulator function and stabilizes repressive H3K27me3 PcG domains. Cell Rep 30(10): 3218-3228. PubMed ID: 32160531
Satovic, E., Font-Mateu, J., Carbonell, A., Beato, M. and Azorin, F. (2018). Chromatin remodeling in Drosophila preblastodermic embryo extract. Sci Rep 8(1): 10927. PubMed ID: 30026552
Schwartz, Y. B., et al. (2006). Genome-wide analysis of Polycomb targets in Drosophila melanogaster. Nat. Genet. 38(6): 700-5. 16732288
Schwarzer, W., Abdennur, N., Goloborodko, A., Pekowska, A., Fudenberg, G., Loe-Mie, Y., Fonseca, N. A., Huber, W., C, H. H., Mirny, L. and Spitz, F. (2017). Two independent modes of chromatin organization revealed by cohesin removal. Nature 551(7678): 51-56. PubMed ID: 29094699
Sexton, T., Yaffe, E., Kenigsberg, E., Bantignies, F., Leblanc, B., Hoichman, M., Parrinello, H., Tanay, A. and Cavalli, G. (2012). Three-dimensional folding and functional organization principles of the Drosophila genome. Cell 148(3): 458-472. PubMed ID: 22265598
Sengupta, A. K., Kuhrs, A. and Müller, J. (2004). General transcriptional silencing by a Polycomb response element in Drosophila. Development 131: 1959-1965. 15056613
Shevelyov, Y. Y., Lavrov, S. A., Mikhaylova, L. M., Nurminsky, I. D., Kulathinal, R. J., Egorova, K. S., Rozovsky, Y. M. and Nurminsky, D. I. (2009). The B-type lamin is required for somatic repression of testis-specific gene clusters. Proc Natl Acad Sci U S A 106: 3282-3287. PubMed ID: 19218438
Skrajna, A., Yang, X. C., Dadlez, M., Marzluff, W. F. and Dominski, Z. (2018). Protein composition of catalytically active U7-dependent processing complexes assembled on histone pre-mRNA containing biotin and a photo-cleavable linker. Nucleic Acids Res. PubMed ID: 29529248
Smith, M. B. and Weiler, K. S. (2010). Drosophila D1 overexpression induces ectopic pairing of polytene chromosomes and is deleterious to development. Chromosoma 119: 287-309. PubMed ID: 20127347
Stadler, M. R., Haines, J. E. and Eisen, M. (2017). Convergence of topological domain boundaries, insulators, and polytene interbands revealed by high-resolution mapping of chromatin contacts in the early Drosophila melanogaster embryo. Elife 6. PubMed ID: 29148971
Stow, E. C., Simmons, J. R., An, R., Schoborg, T. A., Davenport, N. M. and Labrador, M. (2022). A Drosophila insulator interacting protein suppresses enhancer-blocking function and modulates replication timing. Gene 819: 146208. PubMed ID: 35092858
Sureka, R., Wadhwa, R., Thakur, S. S., Pathak, R. U. and Mishra, R. K. (2018). Comparison of Nuclear Matrix and Mitotic Chromosome Scaffold proteins in Drosophila S2 cells - Transmission of hallmarks of nuclear organization through mitosis. Mol Cell Proteomics. PubMed ID: 29991507
Sureka, R., Avvaru, A. K., Sowpati, D. T., Pathak, R. U. and Mishra, R. K. (2022). Structural and developmental dynamics of Matrix associated regions in Drosophila melanogaster genome. BMC Genomics 23(1): 725. PubMed ID: 36284304
Szabo, Q., Jost, D., Chang, J. M., Cattoni, D. I., Papadopoulos, G. L., Bonev, B., Sexton, T., Gurgo, J., Jacquier, C., Nollmann, M., Bantignies, F. and Cavalli, G. (2018). TADs are 3D structural units of higher-order chromosome organization in Drosophila. Sci Adv 4(2): eaar8082. PubMed ID: 29503869
Tauc, H. M., Rodriguez-Fernandez, I. A., Hackney, J. A., Pawlak, M., Ronnen Oron, T., Korzelius, J., Moussa, H. F., Chaudhuri, S., Modrusan, Z., Edgar, B. A. and Jasper, H. (2021). Age-related changes in polycomb gene regulation disrupt lineage fidelity in intestinal stem cells. Elife 10. PubMed ID: 33724181
Tian, Y. and Smith-Bolton, R. K. (2021). Regulation of growth and cell fate during tissue regeneration by the two SWI/SNF chromatin-remodeling complexes of Drosophila. Genetics 217(1): 1-16. PubMed ID: 33683366
Tilly, B. C., Chalkley, G. E., van der Knaap, J. A., Moshkin, Y. M., Kan, T. W., Dekkers, D. H., Demmers, J. A. and Verrijzer, C. P. (2021). In vivo analysis reveals that ATP-hydrolysis couples remodeling to SWI/SNF release from chromatin. Elife 10. PubMed ID: 34313222
Torosin, N. S., Anand, A., Golla, T. R., Cao, W. and Ellison, C. E. (2020). 3D genome evolution and reorganization in the Drosophila melanogaster species group. PLoS Genet 16(12): e1009229. PubMed ID: 33284803
Tsai, A. and Crocker, J. (2022). Nuclear morphogenesis: forming a heterogeneous nucleus during embryogenesis. Development 149(4). PubMed ID: 35142344
Tsai, A., Alves, M. R. and Crocker, J. (2019). Multi-enhancer transcriptional hubs confer phenotypic robustness. Elife 8. PubMed ID: 31294690
Tschiersch, B., Hofmann, A., Krauss, V., Dorn, R., Korge, G. and Reuter, G. (1994). The protein encoded by the Drosophila position-effect variegation suppressor gene Su(var)3-9 combines domains of antagonistic regulators of homeotic gene complexes. EMBO J. 13: 3822-3831. 7915232
Tyler, J. K., et al. (1996). The p55 subunit of Drosophila chromatin assembly factor 1 is homologous to a Histone deacetylase-associated protein. Mol. Cell. Biol. 16: 6149-6159. 8887645
Ueberschar, M., Wang, H., Zhang, C., Kondo, S., Aoki, T., Schedl, P., Lai, E. C., Wen, J. and Dai, Q. (2019). BEN-solo factors partition active chromatin to ensure proper gene activation in Drosophila. Nat Commun 10(1): 5700. PubMed ID: 31836703
Ulianov, S. V., Khrameeva, E. E., Gavrilov, A. A., Flyamer, I. M., Kos, P., Mikhaleva, E. A., Penin, A. A., Logacheva, M. D., Imakaev, M. V., Chertovich, A., Gelfand, M. S., Shevelyov, Y. Y. and Razin, S. V. (2015). Active chromatin and transcription play a key role in chromosome partitioning into topologically associating domains. Genome Res. PubMed ID: 26518482
Ulianov, S. V., Doronin, S. A., Khrameeva, E. E., Kos, P. I., Luzhin, A. V., Starikov, S. S., Galitsyna, A. A., Nenasheva, V. V., Ilyin, A. A., Flyamer, I. M., Mikhaleva, E. A., Logacheva, M. D., Gelfand, M. S., Chertovich, A. V., Gavrilov, A. A., Razin, S. V. and Shevelyov, Y. Y. (2019). Nuclear lamina integrity is required for proper spatial organization of chromatin in Drosophila. Nat Commun 10(1): 1176. PubMed ID: 30862957
Ulianov, S. V., Zakharova, V. V., Galitsyna, A. A., Kos, P. I., Polovnikov, K. E., Flyamer, I. M., Mikhaleva, E. A., Khrameeva, E. E., Germini, D., Logacheva, M. D., Gavrilov, A. A., Gorsky, A. S., Nechaev, S. K., Gelfand, M. S., Vassetzky, Y. S., Chertovich, A. V., Shevelyov, Y. Y. and Razin, S. V. (2021). Order and stochasticity in the folding of individual Drosophila genomes. Nat Commun 12(1): 41. PubMed ID: 33397980
Umer, Z., Akhtar, J., Khan, M. H. F., Shaheen, N., Haseeb, M. A., Mazhar, K., Mithani, A., Anwar, S. and Tariq, M. (2019). Genome-wide RNAi screen in Drosophila reveals Enok as a novel trithorax group regulator. Epigenetics Chromatin 12(1): 55. PubMed ID: 31547845
Viets, K., Sauria, M. E. G., Chernoff, C., Rodriguez Viales, R., Echterling, M., Anderson, C., Tran, S., Dove, A., Goyal, R., Voortman, L., Gordus, A., Furlong, E. E. M., Taylor, J. and Johnston, R. J. (2019). Characterization of button loci that promote homologous chromosome pairing and cell-type-specific interchromosomal gene regulation. Dev Cell. PubMed ID: 31607649
Voigt, S. and Kost, L. (2021). Differences in temperature-sensitive expression of PcG-regulated genes among natural populations of Drosophila melanogaster.
G3 (Bethesda) 11(9). PubMed ID: 34544136
Vorobyeva, N. E., Erokhin, M., Chetverina, D., Krasnov, A. N. and Mazina, M. Y. (2021). Su(Hw) primes 66D and 7F Drosophila chorion genes loci for amplification through chromatin decondensation. Sci Rep 11(1): 16963. PubMed ID: 34417521
Wang, S. H. and Elgin, S. C. R. (2019). The impact of genetic background and cell lineage on the level and pattern of gene expression in position effect variegation. Epigenetics Chromatin 12(1): 70. PubMed ID: 31722719
Wei, K. H., Chan, C. and Bachtrog, D. (2021). Establishment of H3K9me3-dependent heterochromatin during embryogenesis in Drosophila miranda. Elife 10. PubMed ID: 34128466
Wilky, H., Chari, S., Govindan, J. and Amodeo, A. A. (2019). Histone concentration regulates the cell cycle and transcription in early development. Development 146(19). pii: dev177402. PubMed ID: 31511251
Wright, D. and Schaeffer, S. W. (2022). The relevance of chromatin architecture to genome rearrangements in Drosophila. Philos Trans R Soc Lond B Biol Sci 377(1856): 20210206. PubMed ID: 35694744
Xu, B., Li, X., Gao, X., Jia, Y., Li, F. and Zhang, Z. (2021). Decoding nucleosome positions with ATAC-seq data at single-cell levelBiorxiv
Yao, B., Li, Y., Wang, Z., Chen, L., Poidevin, M., Zhang, C., Lin, L., Wang, F., Bao, H., Jiao, B., Lim, J., Cheng, Y., Huang, L., Phillips, B. L., Xu, T., Duan, R., Moberg, K. H., Wu, H. and Jin, P. (2018). Active N(6)-methyladenine demethylation by DMAD regulates gene expression by coordinating with Polycomb Protein in neurons. Mol Cell 71(5): 848-857.e846. PubMed ID: 30078725
Yao, Y., Li, X., Wang, W., Liu, Z., Chen, J., Ding, M. and Huang, X. (2018). MRT, functioning with NURF complex, regulates lipid droplet size. Cell Rep 24(11): 2972-2984. PubMed ID: 30208321
Yarosh, W. and Spradling, A. C. (2014). Incomplete replication generates somatic DNA alterations within Drosophila polytene salivary gland cells. Genes Dev 28(16): 1840-1855. PubMed ID: 25128500
Zabidi, M. A., Arnold, C. D., Schernhuber, K., Pagani, M., Rath, M., Frank, O. and Stark, A. (2015). Enhancer-core-promoter specificity separates developmental and housekeeping gene regulation. Nature 518(7540): 556-559. PubMed ID: 25517091
Zhimulev, I. F. (1999). Genetic organization of polytene chromosomes. Adv Genet 39: 1-589. PubMed ID: 10319426
Zhimulev, I. F., Belyaeva, E. S., Makunin, I. V., Pirrotta, V., Volkova, E. I., Alekseyenko, A. A., Andreyeva, E. N., Makarevich, G. F., Boldyreva, L. V., Nanayev, R. A. and Demakova, O. V. (2003). Influence of the SuUR gene on intercalary heterochromatin in Drosophila melanogaster polytene chromosomes. Chromosoma 111: 377-398. PubMed ID: 12644953
Zhimulev, I. F., Zykova, T. Y., Goncharov, F. P., Khoroshko, V. A., Demakova, O. V., Semeshin, V. F., Pokholkova, G. V., Boldyreva, L. V., Demidova, D. S., Babenko, V. N., Demakov, S. A. and Belyaeva, E. S. (2014). Genetic organization of interphase chromosome bands and interbands in Drosophila melanogaster. PLoS One 9: e101631. PubMed ID: 25072930
Zielke, T., Glotov, A. and Saumweber, H. (2015). High-resolution in situ hybridization analysis on the chromosomal interval 61C7-61C8 of Drosophila melanogaster reveals interbands as open chromatin domains. Chromosoma [Epub ahead of print]. PubMed ID: 26520107
Yu, R., Cao, X., Sun, L., Zhu, J. Y., Wasko, B. M., Liu, W.,