Polycomb: Biological Overview | Evolutionary Homologs | Regulation | Protein interactions | Developmental Biology | Effects of Mutation | References

Gene name - Polycomb

Synonyms -

Cytological map position - 78C9

Function - transcription factor

Keywords - Polycomb group

Symbol - Pc

FlyBase ID:FBgn0003042

Genetic map position - 3-47.1

Classification - chromo domain

Cellular location - nuclear



NCBI links: Precomputed BLAST | Entrez Gene | UniGene |

Recent literature
Du, J., Zhang, J., He, T., Li, Y., Su, Y., Tie, F., Liu, M., Harte, P. J. and Zhu, A. J. (2016). Stuxnet facilitates the degradation of Polycomb protein during development. Dev Cell 37: 507-519. PubMed ID: 27326929
Summary:
Polycomb-group (PcG) proteins function to ensure correct deployment of developmental programs by epigenetically repressing target gene expression. Despite the importance, few studies have been focused on the regulation of PcG activity itself. This study reports a Drosophila gene, stuxnet (stx), that controls Pc protein stability. Heightened stx activity leads to homeotic transformation, reduced Pc activity, and de-repression of PcG targets. Conversely, stx mutants, which can be rescued by decreased Pc expression, display developmental defects resembling hyperactivation of Pc. Biochemical analyses provide a mechanistic basis for the interaction between stx and Pc; Stx facilitates Pc degradation in the proteasome, independent of ubiquitin modification. Furthermore, this mode of regulation is conserved in vertebrates. Mouse stx promotes degradation of Cbx4, an orthologous Pc protein, in vertebrate cells and induces homeotic transformation in Drosophila. These results highlight an evolutionarily conserved mechanism of regulated protein degradation on PcG homeostasis and epigenetic activity.
Loubiere, V., Delest, A., Thomas, A., Bonev, B., Schuettengruber, B., Sati, S., Martinez, A. M. and Cavalli, G. (2016). Coordinate redeployment of PRC1 proteins suppresses tumor formation during Drosophila development. Nat Genet [Epub ahead of print]. PubMed ID: 27643538
Summary:
Polycomb group proteins form two main complexes, PRC2 and PRC1, which generally coregulate their target genes. This study shows that PRC1 components act as neoplastic tumor suppressors independently of PRC2 function. By mapping the distribution of PRC1 components and trimethylation of histone H3 at Lys27 (H3K27me3) across the genome, a large set of genes were identified that acquire PRC1 in the absence of H3K27me3 in Drosophila larval tissues. These genes massively outnumber canonical targets and are mainly involved in the regulation of cell proliferation, signaling and polarity. Alterations in PRC1 components specifically deregulate this set of genes, whereas canonical targets are derepressed in both PRC1 and PRC2 mutants. In human embryonic stem cells, PRC1 components colocalize with H3K27me3 as in Drosophila embryos, whereas in differentiated cell types they are selectively recruited to a large set of proliferation and signaling-associated genes that lack H3K27me3, suggesting that the redeployment of PRC1 components during development is evolutionarily conserved.
Goto, M., Toda, N., Shimaji, K., Suong, D. N., Vo, N., Kimura, H., Yoshida, H., Inoue, Y. H. and Yamaguchi, M. (2016). Polycomb-dependent nucleolus localization of Jumonji/Jarid2 during Drosophila spermatogenesis. Spermatogenesis 6(3): e1232023. PubMed ID: 28144496
Summary:
Drosophila Jumonji/Jarid2 (dJmj) has been identified as a component of Polycomb repressive complex 2. However, it is suggested that dJmj has both PRC-dependent and -independent roles. Subcellular localization of dJmj during spermatogenesis is unknown. Immunocytochemical analyses was performed with specific antibodies to dJmj and tri-methylation at lysine 27 on histone H3 (H3K27me3). Interestingly, dJmj exclusively localizes at nucleolus in the late growth stage. Examination of the dJmj localization in various Polycomb group (PcG) mutant lines at the late growth stage allowed identification of some PcG genes, including Polycomb (Pc), to be responsible for dJmj recruitment to nucleolus. In addition, size of nucleolus was decreased in some of these mutant lines. In a mutant of testis-specific TAF homolog (tTAF) that is responsible for nucleolus localization of Pc, dJmj signals were detected not only at nucleolus but also on the condensed chromatin in the late growth stage. Duolink In situ Proximity ligation assay clarified that Pc interacts with dJmj at nucleolus in the late growth stage. Furthermore, the level of H3K27me3 decreased in nuclei at this stage. Taken together, it is concluded that tTAF is responsible for recruitments of dJmj to nucleolus in the late growth stage that appears to be mediated by Pc. Compartmentalization of dJmj in nucleolus together with some of PcG may be necessary to de-repress the expression of genes required to cellular growth and proliferation in the following meiotic divisions.

BIOLOGICAL OVERVIEW

What are the targets of Polycomb-group (Pc-G) proteins and how do they regulate gene silencing? Pc-G proteins are usually considered to be inhibitors of homeotic genes, since Pc-G mutants were originally identified on the basis of their causing expression of homeotic genes in unusual locations, areas in which they normally would not be expressed. This so-called ectopic expression is attributed to the failure of proper gene silencing in Pc-G mutants. In fact bithorax complex contains Pc-G response elements (PREs) that are involved in initiating and maintaining silencing of the whole bithorax complex (Busturia, 1993).

Pc-G genes also regulate gap genes giant and knirps, restructuring their transcription to the posterior half of the embryo. This silencing is inititated by high levels of Hunchback protein in the anterior portion of the embryo, and maintained by Pc-G action (Pelegri, 1994).

The action of Pc-G proteins is thought to be mediated through chromatin. Chromatin is a combination of DNA and protein that derives its name from a staining reaction with dyes. About a third of the DNA in chromosomes is held in heterochromatin, a type of chromatin with which few genes are associated. Placing active genes next to heterochromatin results in their inactivation or variation in expression, an effect termed position effect variation. It is thought that heterochromatin is capable of spreading to active genes, resulting in their silencing (Spofford, 1976).

These classic biological observations have taken on a new and more literal meaning recently with the discovery of common sequence domains in Pc-G genes. Polycomb itself has a domain called the chromodomain, which is shared with HP1, a heterochromatin-associated protein of Drosophila (Paro, 1991). Many of the Pc-G proteins, including Polycomb itself, are unable to bind DNA.

Considering that they lack the ability to bind DNA, how do Pc-G proteins function to establish gene silencing? It is thought that the initial repression of a gene is carried out by transcription factors that possess the ability to recognize DNA. In the cases of giant and knirps, for example, the maternal protein Hunchback represses their expression in the anterior part of the embryo (Pelegri, 1994). Pc-G proteins provide a mechanism whereby initial repression becomes permanent. They carry out this role by assembling at the site of initial repression and forming a multiprotein complex involved in modifying chromatin to promote gene silencing. Polycomb itself has this ability to self associate (Franke, 1992).

Thus many of the roles of Pc-G proteins are played out not in association with DNA but in association with each other and with other chromosomal proteins, especially histones, the principle scaffolding protein of chromosomes. Interest in the silencing of gene expression has shifted from a concern about DNA-protein interaction to an emphasis on protein-protein interaction, and the maintenance and modification of chromatin structure (Reviewed by Orlando, 1995, Pirrotta, 1995 and Simon, 1995).

Trithorax group (trx-G) genes are able to reverse the inactivating effects of chromatin. These proteins are thought to function as transcriptional activators, removing the block on gene expression put in place by arrays of inactivating proteins.

Cooperativity, specificity, and evolutionary stability of polycomb targeting in Drosophila

Metazoan genomes are partitioned into modular chromosomal domains containing active or repressive chromatin. In flies, Polycomb group (PcG) response elements (PREs) recruit Pho and other DNA-binding factors and act as nucleation sites for the formation of Polycomb repressive domains. The sequence specificity of PREs is not well understood. This study used comparative epigenomics and transgenic assays to show that Drosophila domain organization and PRE specification are evolutionarily conserved despite significant cis-element divergence within Polycomb domains, whereas cis-element evolution is strongly correlated with transcription factor binding divergence outside of Polycomb domains. Cooperative interactions of PcG complexes and their recruiting factor Pho stabilize Pho recruitment to low-specificity sequences. Consistently, Pho recruitment to sites within Polycomb domains is stabilized by PRC1. These data suggest that cooperative rather than hierarchical interactions among low-affinity sequences, DNA-binding factors, and the Polycomb machinery are giving rise to specific and strongly conserved 3D structures in Drosophila (Schuettengruber, 2014).

Comparative epigenomics was used to demonstrate that Polycomb domains are an extremely well conserved feature of the genome during fly evolution. In fact, the evolutionary profile of epigenomic domain organization in embryos of five Drosophila species indicates a complete lack of divergence of H3K27me3- marked Polycomb domains in syntenic regions. A similar high conservation of the H3K27me3 pattern across Drosophila species was recently described. Polycomb domains typically harbor several PH-marked PREs, and a comparative analysis showed that these are also highly conserved and the few loci that show a divergence of PRC1 occupancy patterns are not correlated with overall domain divergence. Likewise, the binding of PHO and DSP1 is highly conserved (to a degree at least as strongly, and possibly more strongly, than binding of individual factors), but even cases of diverged factor occupancies are usually not correlated with overall PRE divergence. In marked contrast, the sequences underlying PREs and Polycomb domains are diverging extensively, and sequence-based prediction of PREs across Drosophila species suggested that divergence of PREs could occur frequently. However, neither ChIP-seq experiments nortransgenic reporter assays support this dynamic behavior. Instead, such sequence divergence is buffered by the epigenetic targeting mechanisms to maintain Polycomb domains. It is suggested that the multilayered organization uses redundancy and cooperativity to facilitate the remarkable Polycomb domain conservation. This is occurring both in cis, where several TFs collaborate to define a regulatory element even when the underlying sequence is imperfect, and at the domain level, where several PREs participate to define the PcG domain structure and possibly stabilize each other (Schuettengruber, 2014).

Although PREs are associated with several known sequence features (such as GAGA- and PHO-binding motifs) in a statistically significant way, these features are not sufficient to distinguish many PREs from the genomic background and from other PHO- or DSP1-bound active chromatin elements. There are many possible explanations for this lack of specificity, including the existence of additional, yet-to-be-characterized sequence-specific recruiting factors; the involvement of nucleosome positioning; transcription of non-coding RNAs; or imperfect modeling of the sequence specificity of the known factors. The data presented in this study, however, introduce a new perspective that can help resolve this conundrum. In contrast to previous hypotheses, the data show that even when strong binding sites are lacking, PHO and DSP1 may bind PREs directly through weak (but highly nonrandom) motifs. Remarkably, sequence affinities that are completely nonspecific on a genomic scale (possibly defining millions of spurious sites) are still highly informative for predicting the binding intensity within the context of a PRE. The strong correlation of PHO binding with weak but nonrandom motifs makes it unlikely that binding to these sites represents indirect binding via interaction/looping with strong binding sites. The data show that in order to understand PRE sequence specificity, multiple potential binding sites with variable affinities and fidelities must be taken into account, and their cooperative interaction must be considered in the context of the PRE chromosomal landscape. This idea is compatible with the evolutionary constraints on PRE sequences, which has been demonstrated in this study to affect a spectrum of binding affinities rather than to conserve classical binding sites alone (Schuettengruber, 2014).

What might be the molecular mechanism that allows the specific binding of weak sites in the context of Polycomb domains? One possibility is that cooperative binding of TFs at PREs supports their occupancy of weak motifs. Indeed, this study found that PHO and DSP1 are bound jointly at PREs (with weak underlying sequence motifs), whereas at other regions of the genome where the factors bind alone, they are usually associated with strong sequence motifs. This observation is in agreement with the recently proposed 'TF collective model,' according to which combinatorial TF binding occurs with little or no apparent sequence motifs for at least a subset of the bound factors (Schuettengruber, 2014).

In addition, it was shown that transient interactions of DNA-binding proteins with weak affinity sites are stabilized by the presence of the PcG proteins themselves. A similar observation of a positive feedback of PRC1 on PHO binding was recently reported (Kahn, 2014) and is further supported by the fact that cooperative binding of PHO and Polycomb to PREs can occur even in vitro. In vivo, long-range contacts involving remote PREs within the same (or even a different) Polycomb domain may contribute to this process. Clustering of multiple flanking PREs in the 3D space of the nucleus might generate Polycomb compartments characterized by high concentrations of PcG proteins as well as their recruiting DNA-binding proteins. In this scenario, loss of occupancy following the dissociation of any of these factors from DNA may be more easily replenished by the concentrated stock of factor within a Polycomb compartment compared with individual binding sites present elsewhere in the genome. This may push the equilibrium toward increased PHO and DSP1 binding to low-affinity sites and partially reduce the evolutionary pressure to maintain the nucleotidic sequence of recruiter motifs at PREs. Structural long-range effects may also inhibit PcG recruitment in cases where active enhancers and TSSs are in proximity to a candidate PRE sequence. This analysis suggests that H3K4me3-marked loci are also highly conserved, but the low-affinity PHO- or DSP1-binding sites in them are completely uncorrelated with occupancy of these factors, further supporting a model of highly organized and cooperative epigenomic organization (Schuettengruber, 2014).

In conclusion, the data indicate that sequence conservation collaborates with 3D chromatin architecture to maintain an exceptional evolutionary stability of Polycomb-regulated loci in fly genomes. This phenomenon highlights the contribution of chromosome domains and their particular looping structures to epigenomic specificity and genome evolution. Hi-C analysis in mammals has revealed that topological domains are a strikingly conserved feature between the mouse and human genomes. The current data raise the possibility that, beyond combinatorial contributions by TF-binding sites in close proximity, the confinement of regulatory elements within TADs and their frequent DNA contacts constitute significant driving forces that also affect DNA sequence evolution in these and possibly many other specie (Schuettengruber, 2014).

Propagation of Polycomb-repressed chromatin requires sequence-specific recruitment to DNA

Epigenetic inheritance models posit that during Polycomb repression, Polycomb Repressive Complex 2 (PRC2) propagates histone H3K27 tri-methylation (H3K27me3) independently of DNA sequence. This study shows that insertion of Polycomb Response Element (PRE) DNA into the Drosophila genome creates extended domains of H3K27me3-modified nucleosomes in the flanking chromatin and causes repression of a linked reporter gene. After excision of PRE DNA, H3K27me3 nucleosomes become diluted with each round of DNA replication and reporter gene repression is lost, whereas in replication-stalled cells, H3K27me3 levels stay high and repression persists. Hence, H3K27me3-marked nucleosomes provide a memory of repression that is transmitted in a sequence-independent manner to daughter strand DNA during replication. In contrast, propagation of H3K27 tri-methylation to newly incorporated nucleosomes requires sequence-specific targeting of PRC2 to PRE DNA (Laprell, 2017).

The ability of certain histone-modifying enzymes to bind to the modification they generated has led to models where such enzymes might propagate modified chromatin domains by a positive feedback loop, independently of the underlying DNA sequence. Two paradigms of chromatin states have been proposed to be maintained by such an epigenetic inheritance mechanism: constitutive heterochromatin with histone H3 lysine 9 di- and tri-methylation (H3K9me2/3) generated by Suv39/Clr4 enzymes, and Polycomb-repressed chromatin marked with H3K27me3 by PRC2. In both chromatin states, these histone modifications are essential for repressing gene transcription. To date, there is compelling evidence that H3K9me2/3- and H3K27me3-modified nucleosomes are transmitted to daughter strand DNA during replication. However, the steps required to propagate these modifications are much less understood. Fission yeast Clr4 has the capacity to propagate ectopically induced H3K9me2/3 domains over many cell divisions by an H3K9me2/3-based positive feedback loop but only in cells mutated for H3K9me2/3 demethylase activity. In the case of PRC2, allosteric activation of the enzyme induced by binding to H3K27me3 has been proposed to be the foundation for propagating H3K27me3 chromatin. In mammalian cells, transient DNA-tethering of PRC2 generates short ectopic H3K27me3 domains that were at least partially maintained for several cell divisions after release of DNA-tethered PRC2. However, in Drosophila, where PRC2 and other Polycomb group (PcG) protein complexes are targeted to PREs, repression imposed by insertion of PRE DNA next to a reporter gene was lost upon excision of PRE DNA. This study investigated how insertion and excision of PRE DNA at ectopic sites in Drosophila affects binding of PcG proteins and H3K27me3 at the molecular level (Laprell, 2017).

Two previously described strains were analyzed that each carried a single copy of the >PRE>dppWE-TZ reporter gene, integrated at different chromosomal locations. >PRE>dppWE-TZ contains a 1.6 kilobase (kb) DNA fragment of the bxd PRE from the HOX gene Ultrabithorax (Ubx), flanked by FRT recombination sites (>PRE>) to permit excision of PRE DNA by Flp-mediated recombination. Adjacent to the >PRE> cassette, the construct contains a reporter gene comprising the wing imaginal disc enhancer from decapentaplegic (dpp) (E), linked to the hsp70 TATA box minimal promoter (T) and LacZ sequences encoding β-galactosidase (Z) . In the presence of the >PRE> cassette, the transgene was silenced and no β-galactosidase activity could be detected in wing imaginal discs of >PRE>dppWE-TZ transgenic animals. In contrast, >dppWE-TZ transgenic animals, generated by excision of the >PRE> cassette in the germ line, showed strong β-galactosidase expression in the characteristic pattern driven by the dpp enhancer. The observation that silencing of the intact >PRE>dppWE-TZ reporter gene is lost in mutants lacking PRC2 function, prompted determination of the H3K27 methylation profile and binding of PcG proteins across the transgene. In both lines, the transgene had inserted into a genomic location normally devoid of H3K27me3 and PcG protein binding. Chromatin immunoprecipitation (ChIP) assays were performed on batches of wing imaginal discs from >PRE>dppWE-TZ and the corresponding >dppWE-TZ transgenic animals, and the immunoprecipitates were analyzed by quantitative real-time PCR (qPCR). For qPCR, primer pairs were used that selectively amplified transgene sequences and sequences in the genomic regions flanking the transgene insert. As controls, primer pairs were used amplifying sequences at the endogenous bx PRE in Ubx that are known to be bound by PcG proteins (C2) or enriched for H3K27me3 (C1 and C3) and at two regions elsewhere in the genome (C4 and C5) without PcG protein binding or H3K27me3 (Laprell, 2017).

The PRC1 subunits Polycomb (Pc), Polyhomeotic (Ph) and the PRC2 subunit E (z) were specifically enriched at the transgene PRE in animals carrying >PRE>dppWE-TZ and, as expected, no binding was detected in >dppWE-TZ animals. In both >PRE>dppWE-TZ transgenic lines, H3K27me3 was present at high levels across a domain that extended about 4-5 kb to either side of the >PRE> cassette, spanning almost the entire construct. No enrichment of H3K27me3 was detectable at the >dppWE-TZ transgene. At >PRE>dppWE-TZ, PRC2 thus tri-methylates H3K27 across a chromatin interval that spans about 8-10 kb (Laprell, 2017).

To estimate to what extent nucleosomes at the >PRE>dppWE-TZ transgene are tri-methylated at H3K27, the H3K27me2 profile was determined. H3K27me2 levels across the >PRE>dppWE-TZ transgene were much lower than at C4 and C5 and comparable to the levels at Ubx (regions C1-C3) that is repressed and predominantly tri-methylated at H3K27 in wing imaginal discs. Conversely, across >dppWE-TZ, H3K27me2 levels were much higher and comparable to those seen at C4 and C5. This suggest that the nucleosomes across the >PRE>dppWE-TZ transgene are predominantly tri-methylated at H3K27 (Laprell, 2017).

Excision of the >PRE> cassette from >PRE>dppWE-TZ transgenic animals by heat-shock induced expression of Flp during larval development results in appearance of β-galactosidase expression in the dpp pattern 12 hours after the heat shock. Efficiency of PRE excision was measured and it was found that 8 hours after a single 1-hour heat shock, excision had occurred in about 95% of wing imaginal disc cells. The delayed increase of β-galactosidase expression over time suggests a gradual rather than abrupt loss of repression. ChIP analyses were performed on chromatin prepared from batches of entire wing imaginal discs dissected from >PRE>dppWE-TZ transgenic animals 12, 32 or 56 hours after Flp-induction. This allowed monitoring the consequences of PRE excision in cells that had undergone at least one (+12 hours), at least two (+32 hours), or more than four (+56 hours) cell divisions. 12 hours after Flp-induction, H3K27me3 levels were at least two-fold reduced across the entire transgene and further reduced by at least two-fold at the 32 hours time point. 56 hours after Flp-induction, H3K27me3 levels across the transgene were nearly as low as in >dppWE-TZ animals derived from >dppWE-TZ germ cells. The histone H3 profile was unaltered at all time points, suggesting that PRE excision does not cause global disruption of nucleosome occupancy across the transgene. The loss of H3K27me3 after PRE excision suggests that PRC2 is unable to propagate H3K27me3 across the >dppWE-TZ transgene in the absence of PRE DNA (Laprell, 2017).

In parallel, Pc protein binding was monitored after PRE excision. Pc, unlike Ph or other PRC1 subunits, is not only bound at PREs but also associates with the chromatin flanking PREs likely reflecting its interaction with H3K27me3-modified nucleosomes. 12 hours after PRE excision, Pc binding at the transgene was already almost reduced to background levels (Laprell, 2017).

The H3K27me3 profile at the >PRE>dppWE-UZ transgene that contains a 4.1 kb fragment of the Ubx promoter instead of the hsp70 minimal promoter was then analyzed. At >PRE>dppWE-UZ, the H3K27me3 domain spans about 12 kb and is thus about 4 kb longer than at >PRE>dppWE-TZ. Nevertheless, after PRE excision, H3K27me3 at dppWE-UZ was lost at a rate comparable to that seen at dppWE-TZ. Ubx promoter DNA thus does not enable H3K27me3 propagation. It is concluded that even at a domain that spans 12 kb and therefore comprises about 60 nucleosomes, PRC2 is unable to propagate H3K27me3 in the absence of PRE DNA (Laprell, 2017).

The H3K27me3 profile and reporter gene repression was then analyzed after PRE excision in animals in which DNA replication had been blocked. Larvae were reared in liquid medium containing Aphidicolin, an inhibitor of DNA polymerases A and D, which resulted in a complete block of DNA replication in imaginal discs. In larvae reared in Aphidicolin-containing medium, Flp-induced PRE excision from >PRE>dppWE-TZ was as efficient as under normal growth conditions but 12 hours after excision, H3K27me3 levels at the transgene were undiminished compared to +PRE control larvae. In larvae reared in liquid medium without Aphidicolin, PRE excision resulted in the expected two-fold reduction of H3K27me3 levels after 12 hours. Together, this suggests that the loss of H3K27me3 nucleosomes after PRE excision in proliferating cells reflects their dilution as they become transmitted to DNA daughter strands during replication. Unlike under normal growth conditions, Aphidicolin-treated larvae lacked detectable β-galactosidase expression 12 hours after PRE excision. When these animals were permitted to recover in medium lacking Aphidocolin, they resumed DNA replication and began expressing β-galactosidase. If DNA replication is blocked and H3K27me3 levels stay high, repression is thus also sustained in the absence of PRE DNA, possibly by PRC1 (Laprell, 2017).

Finally, PRE excision was induced from >PRE>dppWE-TZ in larvae that were hemizygous for UtxΔ, a null mutation in the single H3K27me3 demethylase in Drosophila. 12 hours after Flp-induction, H3K27me3 levels at the transgene were reduced about two-fold, like in wild-type animals. This suggest that demethylation of H3K27me3 by Utx does not contribute to the disappearance of H3K27me3 from transgene chromatin after PRE excision (Laprell, 2017).

These results lead to the following conclusions. First, PRE cis-regulatory DNA provides the genetic basis not only for generating but also for propagating H3K27me3-modified chromatin. This argues against a simple epigenetic model where PRC2 binding to parental H3K27me3 nucleosomes after replication would suffice to propagate H3K27 tri-methylation in daughter strand chromatin. PRC2 needs to be recruited to PRE DNA first, before allosteric activation through interaction with H3K27me3 nucleosomes in flanking chromatin may then facilitate methylation of newly incorporated nucleosomes. Secondly, following PRE excision and replication, parental H3K27me3 nucleosomes remain associated with the same underlying DNA in daughter cells and thus provide epigenetic memory. However, while in replication-stalled cells high H3K27me3 levels permit to sustain repression also in the absence of PRE DNA, their dilution in proliferating cells is accompanied with loss of repression after one cell division. H3K27me3 nucleosomes therefore only appear to provide short-term epigenetic memory of the repressed state. Hence, DNA targeting of PRC2 after replication to replenish H3K27me3 is critical to preserve repression (Laprell, 2017).

Drosophila HOX and other large-size PcG target genes often contain multiple PREs and H3K27me3 domains that span dozens of kilobases. Deletion of single PREs from these genes typically results in only minor diminution of the H3K27me3 profile and misexpression is less severe than misexpression of the native genes in PcG mutants. Furthermore, when the same >PRE> cassette that was used in this study was excised from a Ubx-LacZ reporter gene with more extended Ubx upstream regulatory sequences, repression was lost with a longer delay\, suggesting that additional elements with PRE properties in those Ubx sequences permitted to sustain repression through more cell divisions. The evolution of PRE DNA sequences and of their frequency and arrangement within target genes may thus ultimately determine stability and heritability of H3K27me3 chromatin and PcG repression (Laprell, 2017).


GENE STRUCTURE

cDNA clone length - 2.5 kb

Bases in 5' UTR - 107

Exons - two - The intron is centered in the chromo domain (Paro, 1993).

Bases in 3' UTR - 1170


PROTEIN STRUCTURE

Amino Acids - 390

Structural Domains

The PC protein exhibits homology to the heterochromatin associated protein HP1 (Platero, 1995). The homology is confined to a 37 amino acid domain in the N-terminal part of the two proteins. This region is termed the chromo domain, standing for chromatin organization modifier (Paro, 1991 and Messmer, 1992). Carboxy-terminal truncations of the PC protein do not affect chromosomal binding of PC. However, mutations affecting only the chromo domain (including in vitro generated deletions, as well as point mutations) abolish chromosomal binding. Thus the chromo domain is important for the function of PC and it is absolutely required for binding of PC protein to chromatin. Some of the nuclear patterns generated by the mutated forms of the fusion proteins suggest that the chromo domain could be involved in a packaging mechanism, essential for compacting chromosomal proteins within heterochromatin or heterochromatin-like complexes (Messmer, 1992).

Progressive polycomb assembly on H3K27me3 compartments generates polycomb bodies with developmentally regulated motion

Polycomb group (PcG) proteins are conserved chromatin factors that maintain silencing of key developmental genes outside of their expression domains. Recent genome-wide analyses showed a Polycomb (PC) distribution with binding to discrete PcG response elements (PREs). Within the cell nucleus, PcG proteins localize in structures called PC bodies that contain PcG-silenced genes, and it has been recently shown that PREs form local and long-range spatial networks. The nuclear distribution of two PcG proteins, PC and Polyhomeotic (PH) was examined in this study. Thanks to a combination of immunostaining, immuno-FISH, and live imaging of GFP fusion proteins, it was possible to analyze the formation and the mobility of PC bodies during fly embryogenesis as well as compare their behavior to that of the condensed fraction of euchromatin. Immuno-FISH experiments show that PC bodies mainly correspond to 3D structural counterparts of the linear genomic domains identified in genome-wide studies. During early embryogenesis, PC and PH progressively accumulate within PC bodies, which form nuclear structures localized on distinct euchromatin domains containing histone H3 tri-methylated on K27. Time-lapse analysis indicates that two types of motion influence the displacement of PC bodies and chromatin domains containing H2Av-GFP. First, chromatin domains and PC bodies coordinately undergo long-range motions that may correspond to the movement of whole chromosome territories. Second, each PC body and chromatin domain has its own fast and highly constrained motion. In this motion regime, PC bodies move within volumes slightly larger than those of condensed chromatin domains. Moreover, both types of domains move within volumes much smaller than chromosome territories, strongly restricting their possibility of interaction with other nuclear structures. The fast motion of PC bodies and chromatin domains observed during early embryogenesis strongly decreases in late developmental stages, indicating a possible contribution of chromatin dynamics in the maintenance of stable gene silencing (Cheutin, 2012).

This study showed that PC bodies co-localize with H3K27me3 and form small nuclear domains of heterogeneous intensity. Surprisingly, PC bodies are found in DAPI poor regions, often adjacent to DAPI and histone-dense euchromatic regions. This result thus indicates that PC bodies are not among the most condensed chromatin portions of the euchromatic part of the genome. This localization of PC bodies is consistent with a previous study with electron microscopy, which has shown that PC is concentrated in the perichromatin compartment of the mammalian nucleus. In contrast, these data are in apparent contrast with a series of papers reporting PcG protein-dependent chromatin condensation. PcG complexes have been shown to compact chromatin in vitro and reduce DNA accessibility in vivo. Moreover, recent works show that PcG proteins are required to maintain compaction of Hox loci in mammalian embryonic stem cells and of the mouse Kcnq1 imprinted cluster. In those studies, condensation has been addressed by measuring either the compaction of nucleosomal fibers in electron microscopy, or the distance between close genomic loci by FISH. It is difficult to relate in vitro data to the current in vivo analysis. In particular, FISH analyses do not directly distinguish between a truly dense 3D organization and other types of conformations, such as a multi-looped architecture that would not necessarily induce an increase in chromatin density. Therefore, PcG target chromatin is probably organized in higher-order 3D structures that involve nucleosome-nucleosome and protein-protein interactions, but the net density of DNA (as seen by DAPI) or histones (as seen by tagged-histone microscopy) is not particularly high in these structures (Cheutin, 2012).

Earlier studies indicated that PcG proteins rapidly exchange between the nucleoplasm and PC bodies, suggesting that PC bodies consist of a local transient accumulation of PcG proteins in the cell nucleus. Earlier studies have detected the same number of PC bodies inside the nucleus as the number of bands observed on polytene chromosomes, suggesting that PC bodies are formed by PcG proteins binding to their target chromatin. The observed colocalization of PcG target genes with PC bodies in diploid cells confirms this view. An alternative scenario posits that PC bodies could form nucleation sites onto which PcG-target genes move to become silenced. Two lines of evidence from this work suggest the first scenario to be closer to reality. Firstly, it was found that the amount of PC within a PC body depends on the linear size of the genomic region coated by PC and H3K27me3. Secondly, the higher enrichment of PC in PC bodies after homologous chromosome pairing strongly suggests that PC bodies are the nuclear counterparts of linear genomic domains identified in genome-wide studies rather than nuclear structures to which Polycomb target genes have to be localized for their silencing (Cheutin, 2012).

In the head of embryos, where the Antp and Abd-B genes are silenced, they localize in large PC bodies in all cell nuclei. In contrast, loci where PC coating is restricted to smaller genomic regions do not always localize within PC bodies in interphase cell nuclei. Interestingly, time-lapse imaging shows that large PC bodies are stable structures that can be visualized in all frames of time series, whereas small PC bodies are apparently less stable because they are not visible in all of the frames. One possible explanation for the lack of colocalization between PC target genes and PC bodies is that small genomic regions may not be coated by PC in every cell. Alternatively, the amount of PC within the PC body in which small genomic regions localize might be too small to be directly observed, and only become visible when several small PC bodies interact together. For instance a previous study showed that a transgene containing only two copies of a PRE could be detected in about 50% of cell nuclei (Cheutin, 2012).

Intense PC bodies can be visualized during entire time-lapse experiments, allowing the study of their motion. The interpretation of these time-lapse experiments is not straightforward because the MSD of PC bodies only weakly correlates with the MSC. Interestingly, tracks of PC bodies are mainly composed of narrow angles. The analysis of the motion of chromatin domains containing H2Av-GFP gave similar results, but gave unambiguous evidence for the coordinated motion of several chromatin domains. By using the Lac repressor/lac operator system, two components of chromatin motion in early G2 Drosophila spermatocyte nuclei have been reported: a short range motion which occurs in approximately 0.5 µm radius domains, and long-range motion confined to a large, chromosome-sized domain. Another study has also identified a two-regime motion of a chromatin locus inside mammalian nucleus by using a two-photon microscope, which provides high spatial and temporal resolution. This work indicated that chromatin loci undergo apparent constrained diffusion during long periods, interrupted by jumps of 150 nm lasting less than 2 s. However, none of these previous works reported any coordinated motion of adjacent chromatin domains, and therefore they both described the motion of chromatin as being consistent with a random walk (Cheutin, 2012).

In tracking experiments, it was realized that the fast regime of motion is tightly constrained within volumes much smaller than chromosome territories. This suggests that any given locus will normally explore a restricted three-dimensional environment in the cell nucleus. Since this applies generally to chromatin at all developmental stages, one can deduce that each genomic locus is likely to locate in the vicinity of neighboring loci in the three-dimensional nuclear space. The prediction is thus that each locus should most frequently contact other loci that are in its linear neighborhood along the chromosome. This behavior matches the results observed in chromosome conformation capture on chip (4C) experiments, where each 4C bait had most contacts within few hundred kb to a few Mb of surrounding chromatin. Thus, the current results provide a possible scenario for the explanation of these results obtained from large cell populations. Recent studies showed that homeotic gene clusters form an extensive network of contacts with other PcG target loci. This is consistent with the observation of multiple PC body collisions that can be stable for prolonged times in the nucleus. In contrast, the fact that PC intensity correlates with the linear extension of genomic PC and H3K27me3 domains suggests that PC-mediated associations are relatively rare, at least during embryogenesis (Cheutin, 2012).

The slower regime of long-range motion depends on coordinated large-scale chromatin movements that were not documented before. This may depend on the tools used in previous studies. Time-lapse experiments performed by using the Lac repressor/lac operator system only follow one or a few points inside the cell nucleus, limiting the probability to observe coordinated motions, especially in species containing many chromosomes. In contrast, this study followed many chromatin domains inside Drosophila nuclei and long-range coordinated motions were easily identified when at least two distinct nuclear structures moved simultaneously with a similar trajectory. This motion is directional and chromatin domains and PC bodies can cover up to 1 µm in 10 sec. Different objects having coordinated motion probably belong to the same structure, which suggests that the ensemble of chromatin domains and PC bodies displaying a similar coordinated motion forms a single higher-order nuclear structure. This kind of motion is perfectly consistent with the observation of a chromosome territory, which implies that chromosomes form distinct nuclear structures in interphase cells. A displacement of an entire chromosome, or of a chromosome arm, or a large part thereof, would induce the coordinated motion of all chromatin domains and PC bodies associated to the corresponding chromosome portion (Cheutin, 2012).

The few association and dissociation events of PC bodies observed during this work are related to long-range coordinated motion events that affect both chromatin domains and PC bodies. Therefore, gene kissing depending on PcG proteins could rely on large scale chromatin movements which lead to transient fusion of PC bodies, and may be in turn specifically stabilized by interactions among PcG proteins. Moreover, the association and dissociation of PC bodies seems to be developmentally regulated, because dynamic associations and dissociations were observed during early embryogenesis, but are strongly reduced later in development (Cheutin, 2012).

Condensed chromatin domains and PC bodies move in confined volumes much smaller than chromosome territories. This highly constrained motion prevents chromatin domains from dispersing inside the cell nucleus and can explain why chromosomes form chromosome territories in interphase cells. This movement within highly confined volumes implies that some forces prevent chromatin from diffusing within entire chromosome territories. Interestingly, it was shown before that chromatin loci localized in peri-nucleolar areas or within heterochromatin move less than the ones included in euchromatin, and it was concluded that association of chromatin loci with different nuclear compartments induces specific constraints on their motion. Another time-lapse experiment performed on one Drosophila locus flanking a large block of heterochromatin showed that random association of this locus with pericentric heterochromatin is quite stable and decreases its motion. The motion of larger chromatin structures such as heterochromatin or euchromatin domains cannot be addressed by tracking single loci. By analyzing structures larger than individual chromatin loci, the motion of both bulk chromatin domains and of PC bodies seems to be influenced by their respective local enrichment of histone and PC proteins. Therefore, one key determinant of the motion constraint is an inner property of these structures, which is coherent with the concept of self-organization (Cheutin, 2012).

The most dramatic change of PC body motion occurs during embryogenesis when nuclear volumes strongly decrease, concomitant with a decrease in bulk chromatin motion. Comparison of chromatin motion between early and late G2 Drosophila spermatocytes or between undifferentiated and differentiated cells of eye imaginal discs indicated that the volume in which chromatin loci move decreases during differentiation. However, because of the particularly rapid motion of chromatin domains and PC bodies during early embryogenesis, the slowdown of chromatin motion occurring during embryogenesis is higher than the ones previously described during differentiation. Interestingly, the reduction of the volume of constraint during developmental progression suggests a correlation between the flexibility of chromatin structures and the potential for cell differentiation (Cheutin, 2012).

It is interesting to note that the motion of PC bodies appears less sensitive to temperature than chromatin domains in late embryos, suggesting that Polycomb proteins may specifically buffer environmental effects such as temperature change. This buffering may be an important determinant of the stability of Polycomb-dependent gene silencing during development. During this work, no other fundamental difference was observed between the motion of condensed chromatin domains and of PC bodies. This apparent absence in specificity is coherent with data implying that PC bodies form molecularly specialized chromatin regions, but suggests that the molecular identity of these structures is not the main determinant of their motion. Interestingly, a previous study has shown that the artificial Mx1-YFP nuclear body exhibits a very similar mobility compared with Promyelocytic leukemia and Cajal bodies. Although being molecularly different, no specific motion of these nuclear bodies was observed, indicating that the motion of nuclear bodies mainly depends on structural issues such as their size and the nuclear volume. During fly embryogenesis, PC bodies and condensed chromatin domains move similarly, but PC bodies move in a larger volume than chromatin domains. To explain this difference, one might argue that condensed chromatin domains would form much larger structures than PC bodies. This is difficult to ascertain until the identity of these DAPI- and histone-dense regions is better understood. Genome-wide analysis of chromatin components has recently identified five different types of chromatin in Drosophila cells, among which three contained silent genes (Filion, 2010). In addition to heterochromatin and Polycomb-repressed chromatin, a third type of silent chromatin was uncovered, which is composed of very large genomic domains encompassing half of the genomic euchromatin. It is proposed that this silent chromatin portion of the genome is physically manifested as the DAPI- and histone-dense chromatin that this study has identified to be distinct from PC bodies (Cheutin, 2012).


Polycomb: Evolutionary Homologs | Regulation | Protein interactions | Developmental Biology | Effects of Mutation | References

date revised:  5 August 2016

Home page: The Interactive Fly © 1995, 1996 Thomas B. Brody, Ph.D.


The Interactive Fly resides on the
Society for Developmental Biology's Web server.