twist
The first step in the differentiation of the Drosophila mesoderm is the activation of two regulatory
genes, twist (twi) and snail (sna), in ventral regions of early embryos. DL and TWI directly activate sna expression. Site-directed mutagenesis of DL- and
TWI-binding sites within defined regions of the sna promoter suggest that the two proteins
function multiplicatively to ensure strong, uniform expression of sna, particularly in ventral-lateral regions where there are diminishing
amounts of DL. These results are consistent with the possibility that the sharp sna borders are
formed by multiplying the shallow DL gradient and the steeper TWI gradient (Ip, 1992b).
In mutants of snail or twist, transcription of folded gastrulation is normal in the posterior midgut primordium but almost completely eliminated on the ventral side. Maternal-effect ventralizing mutations that expand the expression of twist and snail also expand the domain of fog transcription. In embryos from torpedoQY mutant mothers, Twist protein expression extends farther laterally, but the ventral furrow is usually split into two narrow ventrolateral invaginations by an unknown patterning mechanism. In this case, fog is transcribed in two separate ventrolateral stripes (Costa, 1994).
During Drosophila gastrulation, morphogenesis occurs as a series of cell shape changes and cell
movements that probably involve adhesive interactions between cells. The dynamic aspects of cadherin-based cell-cell adhesion were examined in the morphogenetic events to
assess the contribution of such activity to morphogenesis. Shotgun and Cadherin-N show complementary expression
patterns in the presumptive ectoderm and mesoderm at the mRNA level. Switching of
cadherin expression from the Shotgun to the CadN type in the mesodermal germ layer occurs downstream
of the mesoderm-determination genes twist and snail. These dynamic aspects of cadherin-based cell-cell adhesion appear to be
associated with the following: (1) initial establishment of the blastoderm epithelium; (2) acquisition of
cell motility in the neuroectoderm; (3) cell sheet folding, and (4) epithelial to mesenchymal conversion
of the mesoderm. These observations suggest that the behavior of the Shotgun-catenin adhesion
system may be regulated in a stepwise manner during gastrulation to perform successive
cell-morphology conversions. Also discussed are the processes responsible for loss of epithelial cell polarity and
elimination of preexisting Shotgun-based epithelial junctions during early mesodermal
morphogenesis (Oda, 1998).
Expression of mef2 is
dependent on the mesodermal determinants twist and snail but independent of the
homeobox-containing gene tinman, which is required for visceral muscle and heart development (Lilly, 1994).
The function of the Drosophila mef2 gene, a member of the MADS box supergene family of
transcription factors, is critical for terminal differentiation of the three major muscle cell types, namely
somatic, visceral, and cardiac. During embryogenesis, mef2 undergoes multiple phases of expression,
which are characterized by initial broad mesodermal expression, followed by restricted expression in
the dorsal mesoderm, specific expression in muscle progenitors, and sustained expression in the
differentiated musculatures. Evidence is presented that temporally and spatially specific
mef2 expression is controlled by a complex array of cis-acting regulatory modules that are responsive
to different genetic signals. Functional testing of approximately 12 kb of 5' flanking region of the mef2
gene shows that the initial widespread mesodermal expression is achieved through a 280-bp
twist-dependent enhancer. Subsequent dorsal mesoderm-restricted mef2 expression is mediated
through a 460-bp dpp-responsive regulatory module, which involves the function of the Smad4 homolog
Medea and contains several binding sites for Medea and Mad. Mef2 expressing cells are coincident with those expressing Tinman. Notably, both Mef2 and Tinman expression are in four of six cardioblasts that are present per hemisegment. The complete overlap between the two expression patterns suggests that the activity of this enhancer element could be dependent on tinman function or under similar regulatory controls as is tinman. The cardiac enhancer that functions at later stages also drives mef2 expression in the caudal visceral mesoderm as well as in the somatic mesoderm. Taken together, the presented data suggest that specific expression of
the mef2 gene in myogenic lineages in the Drosophila embryo is the result of multiple genetic inputs
that act in an additive manner on distinct enhancers in the 5' flanking region (Nguyen, 1999).
The homeobox-containing gene tinman (msh-2),
is expressed in the mesoderm primordium; this expression requires the function of the
mesoderm determinant twist. Later in development, as the first mesodermal subdivisions are
occurring, expression of tinman becomes limited to the visceral mesoderm and the heart (Bodmer, 1993).
tinman encodes a homeodomain transcription factor required for
the development of the dorsal mesoderm and its derivatives in the Drosophila embryo. Genetic
analyses indicate that tinman resides downstream of the mesodermal determinant twist, which encodes
a basic helix-loop-helix-type transcription factor. However, the regulation of tinman by twist remains
poorly understood. Using expression assays in cultured cells and transgenic flies, it has been shown that two
distinct clusters of E-box regulatory sequences, present upstream of the tinman gene, mediate tinman
expression in the visceral mesoderm. These elements are conserved between the Drosophila
melanogaster and Drosophila virilis tinman genes and serve as binding sites for Twist (E1 cluster located from -1134 to -1101) and
Tinman (E2 cluster located from -868 to -831) proteins. In cultured cells, Twist and Tinman binding results in the activation of tinman
gene expression. In transgenic animals, the E1 and E2 clusters are functionally connected; both
elements are required for tinman activation in cells of the visceral mesoderm and also for tinman
repression in cells of the somatic musculature. These results demonstrate that tinman is a direct
transcriptional target for Twist and its own gene product in visceral mesodermal cells, supporting the
idea that twist and tinman function in the subdivision of the mesoderm during Drosophila embryogenesis (Lee, 1997).
The Drosophila tinman homeobox gene has a major role in early mesoderm patterning: it determines
the formation of visceral mesoderm, heart progenitors, specific somatic muscle precursors and glia-like
mesodermal cells. These functions of tinman are reflected in its dynamic pattern of expression, which
is characterized by initial widespread expression in the trunk mesoderm, then refinement to a broad
dorsal mesodermal domain, and finally restricted expression in heart progenitors. Each of these phases of expression is driven by a discrete enhancer element, the first being active in
the early mesoderm, the second in the dorsal mesoderm and the third in cardioblasts. Surprisingly, all of these elements are located at positions downstream of the transcription start site. Element B(1800 bp) is located in the first intron; a second enhancer element, D (about 350bp), is located about 2 kb downstream of the 3' end of tin and activates gene expression in the dorsal portion of the mesoderm. Element D is active between stage 11 and early stage 12 of embryogenesis. A third element, C (300bp) is active in the dorsal vessel. This element activates expression from stage 12 on, in four out of six cardioblasts per hemisegment. Finally, an element A (about 500 bp), located in the 5' portion of the first intron, activates tin in the anterior tip of the head. After invagination of the stomodeum, the bulk of these tin expressing cells form the roof of the pharynx (Yin, 1997).
The early-active enhancer element is a direct target of twist, a gene necessary for tinman activation that encodes a basic
helix-loop-helix (bHLH) protein. This 180 bp enhancer
includes three E-box sequences that bind Twist protein in vitro and are essential for enhancer activity
in vivo. Ectodermal misexpression of twist causes ectopic activation of this enhancer in ectodermal
cells, indicating that twist is the only mesoderm-specific activator of early tinman expression. The 180 bp enhancer also includes negatively acting sequences. Binding of Even-skipped to these sequences appears to reduce twist-dependent activation in a periodic fashion, thus producing a striped tinman pattern in the early mesoderm. In addition, these sequences prevent activation of tinman by twist in a defined portion of the head mesoderm that gives rise to hemocytes.
This repression requires the function of buttonhead, a head-patterning gene: buttonhead is necessary for normal activation of the hematopoietic differentiation gene serpent in the
same area. The second expression domain, restricting tin mRNA expression in the dorsal mesoderm, is triggered by Dpp-mediated induction events. Together, these results show that tinman is controlled by an array of discrete enhancer elements that are activated successively by differential genetic inputs, as well as by closely linked activator and repressor binding sites within an early-acting enhancer, which restricts twist activity to specific areas within the twist expression domain (Yin, 1997).
Ras signaling elicits diverse outputs, yet how Ras specificity is generated remains incompletely understood. Wingless and
Decapentaplegic confer competence for receptor tyrosine kinase-mediated induction of a subset of Drosophila muscle and cardiac progenitors by acting
both upstream of and in parallel to Ras. In addition to regulating the expression of proximal Ras pathway components, Wg and Dpp coordinate the direct effects
of three signal-activated transcription factors (dTCF, Mad, and Pointed that function in the Wg, Dpp, and Ras/MAPK pathways, respectively) and two tissue-restricted transcription factors (Twist and
Tinman) on even-skipped, a progenitor identity gene enhancer. The integration of Pointed with the combinatorial effects of dTCF, Mad, Twist, and Tinman
determines inductive Ras signaling specificity in muscle and heart development (Halfon, 2000).
Cell fate specification in the somatic mesoderm of the Drosophila embryo has been examined as a model for dissecting the molecular basis of combinatorial
signaling involving receptor tyrosine kinases (RTKs). The somatic musculature and the cells that compose the heart develop from specialized cells called progenitors. Each progenitor divides asymmetrically to produce two founder cells that possess
information that specifies individual muscle fate and that seed the formation of multinucleate myofibers. The focus of this study has been a small subset of somatic mesodermal cells that express the transcription factor Even skipped. Eve is expressed in the progenitors and
founders of both the dorsal muscle fiber DA1 and a pair of heart accessory cells, the Eve pericardial cells or EPCs. Since eve is the earliest known marker for these cells and is required for their formation, eve is referred to here as a progenitor identity gene (Halfon, 2000).
Given that the eve MHE recapitulates early mesodermal Eve expression, a determination was made of whether this enhancer contains binding sites for candidate signal-dependent and mesoderm-specific transcription factors. Focus was placed on two mesoderm-specific factors, Tin and Twi, as well as the nuclear factors that act downstream of Wg (dTCF), Dpp (Mad) and Ras (Pnt, Yan). A computer-based search of the MHE sequence has suggested the presence of potential binding sites for each of these transcription factors. Gel-shift assays confirm that these putative sites actually bind the relevant factors. This analysis establishes the existence of one binding site for dTCF, six for Mad, two for Twist, and four each for Tin and Pnt. Since Yan binds to each of the Pnt sites, these are referred to as Ets sites (Halfon, 2000).
To ascertain whether these in vitro binding sites have in vivo functional significance, the sites were mutated, both singly and in combination, within the context of the entire MHE. All mutagenesis was by base substitution so as not to affect the spacing between other potential cis-regulatory elements. The ability of the mutated MHEs to drive reporter gene expression was tested in transgenic embryos and this expression was compared to that of endogenous Eve. Of the six Mad sites, only Mad4, 5, and 6 are critical for MHE function when inactivated singly or in combination. Mutation of the single dTCF site or of individual binding sites for Twi, Tin, or the Ets factors also lead to loss of reporter gene expression in some, but not all, Eve-expressing cells, with some mutant sites associated with a more severe loss than others. Of note, both the EPC and DA1 lineages are affected equally by all of the mutations. In addition, the activity level in those Eve-expressing cells that do maintain reporter gene expression is on average lower than that seen with the wild-type MHE. In contrast to the single site mutants, mutation of the two Twi, all four Tin, or all four Ets sites completely eliminate MHE activity. It is concluded that binding sites for two tissue-specific and three signal-responsive transcription factors are required for full activity of the MHE in both the muscle and the heart lineages (Halfon, 2000).
The finding that the three Wg-dependent factors, dTCF, Twi, and Tin, that directly regulate eve could explain why activated Ras is incapable of bypassing Wg in the induction of Eve progenitors. Therefore attempts were made to rescue Eve expression in wg mutant embryos by ectopically expressing Twi and Tin together with activated Ras. However, Eve progenitors were not recovered by this manipulation, perhaps due to the direct requirement of dTCF for eve MHE activity. While activated Arm can supply the missing downstream Wg transcription factor in this rescue experiment, Arm alone is capable of fully rescuing not only the Eve progenitors but also all of the Wg-dependent factors that regulate the MHE, including Twi, Tin, and the RTK/Ras pathway components. Thus, the combined effects of the MHE transcription factors could not be further evaluated in the absence of Wg signaling. Nevertheless, the rescue and enhancer mutagenesis data strongly support the involvement of Wg as a mesodermal competence determinant both upstream of the Ras pathway and directly (via dTCF) as well as indirectly (via Twi and Tin) in the transcriptional response to inductive RTK signaling (Halfon, 2000).
Since mutation of any single transcription factor binding site in the MHE causes only a partial loss of enhancer activity, it was considered whether different sites might function together synergistically. To test this possibility, binding site mutations for two different activators were combined. Simultaneous mutation of the dTCF and Twi1 sites led to reporter gene expression in approximately 5-fold fewer cells than would be expected from the additive independent effects of each mutation. A similar, though slightly less robust, synergy was observed when the dTCF and Ets3 mutations were combined (Halfon, 2000).
An assessment was made of whether ectopic coexpression of individual transcription factors or upstream signals would lead to cooperative effects on endogenous Eve expression. As previously reported, ectopic Wg has no effect on Eve expression at late stage 11, activated Ras1 induces extra Eve progenitors, and ectopic Wg plus activated Ras1 cause a lateral expansion of the progenitor clusters. When Twi is expressed using a twi-Gal4 driver, a few Eve-positive cells develop at ectopic positions. The magnitude of this effect is increased by coexpression of Wg and Twi, and even more so by coexpression of Twi with activated Ras1. The latter effect strikingly resembles that of Wg plus activated Ras1. With the simultaneous ectopic expression of Wg, Twi, and activated Ras1, Eve progenitors form an almost continuous anteroposterior stripe confined to the dorsal mesoderm. These results demonstrate a synergistic induction of Eve progenitors by various combinations of Wg, Twi, and activated Ras1 that parallels the synergistic loss of MHE activity seen by mutating the dTCF, Twi, and Ets binding sites. Taken together, these loss- and gain-of-function findings suggest that dTCF, Twi, and Pnt cooperate at the MHE to synergistically regulate Eve transcription and, by extension, to induce the specification of Eve progenitor fates (Halfon, 2000).
It is concluded that Wg and Dpp coordinate a series of signal-activated (dTCF and Mad) and mesoderm-specific (Twi and Tin) transcription factors in a temporal and spatial pattern that facilitates cooperation with the Ras transcriptional effector Pnt. The synergistic integration of these five transcription factors by a single enhancer generates a specific developmental response to Ras/MAPK signaling. Moreover, Wg and Dpp exert proximal effects in this signaling network by enabling Ras/MAPK activation through the regulated localized expression of upstream components of the RTK signal transduction machinery. A model governing the acquisition of developmental competence, signal integration and response specificity in this system is presented. Wg and Dpp provide competence through the regulation of tissue-specific transcription factors (Tin and Twi), signal-responsive transcription factors (Mad and dTCF), and proximal components of the RTK/Ras pathways (Htl, Hbr, and Rho). The Ras signaling cascade leads to activation of the inductive transcription factor, Pnt, and inactivation of the Yan repressor. While a direct role for Mad in regulating Tin expression has been demonstrated, Wg regulation of Tin, Twi, Htl, Hbr, and Rho may be either direct or indirect. Dpp has additional effects on the proximal RTK factors. The five transcriptional activators assemble at and are integrated by the MHE, where they function synergistically to promote eve expression. Specificity of the response to inductive RTK/Ras signaling derives from the combinatorial effects of the tissue-restricted and signal-activated transcription factors that converge at the MHE. In the absence of inductive signaling, Yan would repress eve by binding to the Ets sites. Since eve is a muscle and heart identity gene, the regulatory mechanisms are inferred to have a more general function in determining progenitor fates. Additional complexity attendant upon the control of RTK activity in this system derives from positive feedback regulation of the Ras/MAPK cascade and from reciprocal regulatory interactions between the Ras and Notch pathways (Halfon, 2000).
The ventral nervous system defective/NK-2 gene may be repressed at several sites, by different genes: in the mesodermal anlage
by Snail, in mesectodermal cells by Single-minded, and in the lateral
neuroectodermal and/or dorsal epidermal anlagen, repression may be mediated indirectly by
Decapentaplegic. Twist either activates vnd gene in the posterior portion of the embryo or is a
coactivator with Dorsal (Mellerick, 1995).
Primary neurogenesis in the central nervous system of insects and vertebrates occurs in three dorsoventral domains on either side of the neuroectoderm. Among the three dorsoventral domains of the Drosophila neuroectoderm, the medial and lateral columns express the zinc-finger gene escargot (esg), whereas the intermediate column does not. esg expression was examined as a probe to investigate the mechanism of neuroectoderm patterning. The effect of dorsoventral patterning genes on esg expression was studied. decapentaplegic, snail and twist repress esg expression outside the neuroectoderm. The expression of esg in the intermediate column is normally repressed, but is de-repressed when Egfr activity is either elevated or reduced. A neurogenic enhancer of esg was identified, and shown to be separable into a distal region that promotes ubiquitous expression in the neuroectoderm and a proximal region that represses the intermediate expression. It is concluded that decapentaplegic, snail, twist and an activator all act through the distal region to initiate transcription of esg in the neuroectoderm. It is proposed that the combination of opposing gradients of Egfr and its ligand creates a peak of Egfr activity in the intermediate column, where Egfr represses esg transcription through the proximal repressor region. These two kinds of regulation establish the early esg expression that prefigures the neuroectoderm patterning (Yagi, 1997).
The fibroblast growth factor (FGF)/receptor system is thought to mediate various developmental
events in vertebrates. DFR1, known as Heartless, and Breathless proteins
contain two and five immunoglobulin-like domains, respectively, in the extracellular region, and a
split tyrosine kinase domain in the intracellular region. In early embryos, DFR1 mRNA expression,
requiring both twist and snail proteins, is specific to mesodermal primordium and invaginated
mesodermal cells. At later stages, putative muscle precursor cells and cells in the central nervous
system (CNS) express DFR1. breathless expression occurs in endodermal precursor cells, CNS
midline cells and certain ectodermal cells such as those of trachea and salivary duct. FGF-receptor
homologs in Drosophila would thus appear essential for generation of mesodermal and
endodermal layers, invaginations of various types of cells, and CNS formation (Shishido, 1993).
rhomboid (rho) encodes a putative transmembrane receptor that is required for the differentiation of
the ventral epidermis.
Dorsal acts in concert with basic helix-loop-helix (b-HLH) proteins, possibly including Twist, to activate rhomboid in both lateral and ventral regions. Expression is blocked in ventral regions (the
presumptive mesoderm) by Snail, which is also a direct target of the DL morphogen (Ip, 1992a).
The twist gene is a good candidate for regulating expression of buttonless. buttonless is expressed in dorsal median cells, mesodermal cells that are arranged as a single pair within each segment along the dorsal midline, just above the central nervous system. There are a total of 6 consensus binding sites for TWI within a 300 base pair region of genomic sequence immediately upstream of the btn structural gene (Chiang, 1994).
In an attempt to identify genes that are involved in Drosophila embryonic cardiac
development, a gene was cloned whose function is required late in
embryogenesis to control heart rate and muscular activity. This gene has been named held
out wings (how) because hypomorphic mutant alleles produce adult animals that have lost
their ability to fly; they keep their wings on the horizontal, at a 90ƒ angle from the body axis. In
contrast to the late phenotype observed in null mutants, the How protein is expressed early
in the invaginating mesoderm; this expression is apparently under the control of twist.
When the different mesodermal lineages segregate, the expression of How becomes
restricted to the myogenic lineage, including the cardioblasts and probably all the myoblasts.
Antibodies directed against the protein demonstrate that How is localized to the nucleus.
how encodes a protein containing one KH-domain that has been implicated in binding
RNA. how is highly related to the mouse quaking gene that plays a role in
myelination and that could serve to link a signal transduction pathway to the control of
mRNA metabolism. The properties of the how gene described herein suggest that this gene
participates in the control of expression of as yet unidentified target mRNAs coding for
proteins essential to cardiac and muscular activity (Zaffran, 1997).
Zygotic expression of modifier of variegation modulo depends on
the activity of genes which pattern the embryo along dorsoventral and anteroposterior axes and
specify diversified morphogenesis. dorsal and the mesoderm-specific genes twist and snail direct
modulo expression in the presumptive mesoderm. The homeotic genes Sex combs reduced and
Ultrabithorax positively regulate the gene in the ectoderm of parasegment 2 and abdominal
mesoderm (Graba, 1995).
Drosophila has a single MEF2 gene, DMEF2, that is alternatively
spliced to produce different transcripts; it is expressed in the mesodermal
primordium before gastrulation. The mechanisms responsible for the subsequent
subdivision of the mesoderm are unknown. However, DMEF2 may play a role in this
important event. Experiments show that it is a downstream target for twist
and that its early expression pattern modulates as the mesoderm organizes into cell
groupings with distinct fates. DMEF2 is also expressed in both the segregating
primordia and the differentiated cells of the somatic, visceral and heart musculature. It
is the only known gene expressed throughout
differentiation in these three main types of muscle (Taylor, 1995).
MEF2 is a MADS-box transcription factor required for muscle development in Drosophila. The bHLH transcription factor Twist directly regulates Mef2 expression in adult somatic muscle precursor cells via a 175-bp enhancer located 2245 bp upstream of the transcriptional start site.
Within this element, a single evolutionarily conserved E box is essential for enhancer activity. Twist
protein can bind to this E box to activate Mef2 transcription. Ectopic expression of twist results in
ectopic activation of the wild-type 175-bp enhancer. Twist activation of Mef2 transcription via this enhancer is required for normal adult muscle development; reduction in Twist function results in phenotypes similar to those observed in Mef2 mutant adults. The requirement of Twist for adult muscle development is seen in defective development of dorsal longitudinal indirect flight muscles (DLMs). When Twist function is reduced during the larval stage, Mef2 is not expressed in the adult muscle precursor cells, resulting in abnormal patterning of the adult somatic muscles, most strikingly in the DLMs. Hypomorphic Mef2 mutant adults show very similar defects, indicating that this phenotype, resulting from loss of twist expression, likely occurs
through a requirement to maintain normal Mef2 levels. The 175-bp enhancer is also active in the embryonic mesoderm, indicating that this enhancer functions at multiple times during development, and that its function is
dependent on the same conserved E box. In embryos, a reduction in Twist function also strongly
reduces Mef2 expression. Since there is only weak expression from the 175-bp enhancer early in embryogenesis, Twist alone cannot be responsible for activating Mef2 early on. It is likely that the genomic region governing the earliest expression of Mef2 is either outside this enhancer or overlapping it. The activity of the 175-bp enhancer increases during embryogenesis until stage 12, where it is expressed in segmentally repeating groups of cells corresponding to the unfused and undifferentiated myoblasts of the larval somatic muscles. At this stage, twist and Mef2 are coexpressed in these cells. These findings define a novel transcriptional pathway required for skeletal muscle development and identify Twist as an essential and direct regulator of Mef2 expression in the somatic mesoderm (Cripps, 1998a).
The basic helix-loop-helix transcription factor Twist is required for normal development of larval and adult somatic muscles in Drosophila. Adult flies
normally have six pairs of dorsal longitudinal indirect flight muscles (DLMs), whereas when Twist function is reduced, only three pairs of DLMs are
formed. Although twist is expressed in precursors of adult muscles throughout the larval and early pupal stages, it is demonstrated that Twist function is
required only during the late larval stage for DLM patterning. In wild-type flies, this is just prior to the time when three pairs of persistent larval muscle
fibers split longitudinally to form templates for the six pairs of DLMs. These larval muscle fibers do not express twist, but the adult muscle precursors to which they fuse are found to express twist. Thus Twist function in the adult muscle precursor cells is required for splitting to occur in the muscles to which the myoblasts fuse. By examining sections at various times during pupal development, it has been found that
splitting of the larval muscles does not occur in twist mutants, indicating that Twist function is required to induce major changes in the larval templates prior
to differentiation. The function of Twist in larval muscle splitting is likely mediated by myocyte enhancer factor-2 (MEF2) since in Mef2 hypomorphic
mutants splitting is also reduced and Mef2 expression is dependent on Twist. These findings define specific roles for Twist and MEF2 during pupal
myogenesis and demonstrate that these transcription factors function in adult muscle precursor cells to regulate downstream factors controlling muscle cell
splitting and morphogenesis (Cripps, 1998b).
Both twist and snail are required for the mesodermal activation of the novel zinc-finger homeodomain gene zfh-1. ZFH-1 contains one homeodomain and nine C2H2 zinc fingers. In twist mutants, all the early mesodermal anti-AFH-1 staining posterior to the cephalic furrow is lost, although expression anterior to this furrow, as well as the later expression in the developing CNS is unaffected. A similar result is obtained in snail mutant embryos except that the early mesodermal staining anterior to the the cephalic furrow is also lost. In both mutants, the later sites of presumptive mesodermal zfh-1 expression are also devoid of staining, although this may be due to the failure of mutant embryos to reach later developmental stages (Lai, 1991).
The transcription factor Twist initiates Drosophila mesoderm development, resulting in the formation of heart, somatic muscle, and other cell types. Using a Drosophila embryo sorter, enough homozygous twist mutant embryos were isolated to perform DNA microarray experiments. Transcription profiles of twist loss-of-function embryos, embryos with ubiquitous twist expression, and wild-type embryos were compared at different developmental stages. The results implicate hundreds of genes, many with vertebrate homologs, in stage-specific processes in mesoderm development. One such gene, gleeful/lame duck, related to the vertebrate Gli genes, is essential for somatic muscle development and sufficient to cause neural cells to express a muscle marker (Furlong, 2001).
Formation of muscles during embryonic development is a complex process that requires coordinate actions of many genes. Somatic, visceral, and heart muscle are all derived from mesoderm progenitor cells. The Drosophila twistgene, which encodes a bHLH transcription factor, is essential for multiple steps of mesoderm development: invagination of mesoderm precursors during gastrulation, segmentation, and specification of muscle types. The role of twist in mesoderm development has been conserved during evolution, perhaps because it controls conserved regulatory mesoderm genes. For example, tinman and dMef2 are regulated by Twist in flies and are highly conserved in sequence and function in vertebrates (Furlong, 2001).
In Drosophila, somatic muscle forms from progenitor cells that divide to become muscle founder cells. Founder cells acquire unique identities controlled by transcription factors including Krüppel, S59, vestigial, and apterous. Each of the 30 body wall muscles in an abdominal hemisegment is initiated by a single founder cell and has unique attachments and innervations. To further clarify mechanisms underlying founder cell specification, myoblast fusion, and muscle patterning, Drosophila mutants together with microarrays of cDNA clones were used (Furlong, 2001).
twist mutant embryos develop no mesoderm. The population of mRNA species isolated from twist homozygous embryos were compared with that of stage-matched wild-type embryos. Drosophila lethal mutations are maintained as heterozygotes, in trans to balancer chromosomes. A twist mutation was established in trans to a balancer chromosome carrying a transgene encoding green fluorescent protein (GFP). Embryos were collected from wild-type and twist/GFP-balancer fly stocks at specific developmental stages. The twist/GFP-balancer collections contain a mixed population of embryos: one-quarter twist homozygotes lacking GFP; half heterozygotes with one copy of GFP, and one-quarter homozygous for the balancer chromosome with two copies of GFP. Homozygous twist mutant embryos were separated from their siblings using an embryo sorter. Putative homozygous twist embryos were assessed by immunostaining with an antibody to dMef2. More than 99% of the selected embryos had the twist phenotype (Furlong, 2001).
Three different periods of mesoderm development were analyzed: stages 9-10, 11, and 11-12. During stage 9-10, the earliest time GFP is detectable in the balancer embryos, mesoderm cells migrate dorsally and become specified as somatic, visceral, cardiac, and fat body mesoderm. twist and its direct targets tinman and dMef2 are expressed throughout stage 9-10 mesoderm. The middle period contains stage 11 embryos and is a transition between the first period (stage 9-10) and the third period (late stage 11-12). During late stage 11 and stage 12, myoblast fusion begins and twist expression remains prominent in only a subset of the somatic muscle cells (Furlong, 2001).
For each developmental period, three independent embryo collections, embryo sortings, and microarray hybridizations were conducted. The microarrays used for the analysis contained over 8500 cDNAs corresponding to 5081 unique genes plus a variety of controls. Each embryonic RNA sample was compared with a reference sample, which contains RNA made from all stages of the Drosophila life cycle and allows direct comparisons among all the experiments (Furlong, 2001).
To determine how transcription was affected by the twist mutation, SAM (significance analysis of microarrays) analysis was used. Genes that are normally highly expressed in mesoderm should have lower transcript levels in twist homozygotes. Genes in other tissues whose expression depends on signals from the mesoderm might also have reduced expression. Transcripts of 130 genes, the 'Twist-low' group, were significantly lower in twist mutants than in wild type. Conversely, cells that would have formed mesoderm may take on other fates in the absence of twist, such as neuroectoderm; therefore, many transcript levels could increase in twist mutants. Genes whose transcription is repressed by signals from the mesoderm would also be enriched in twist mutants. One hundred fifty genes, called the 'Twist-high' group, have increased levels of RNA in twist mutant embryos (Furlong, 2001).
In total, 280 of ~5000 genes had significant changes in transcript levels, with 10 false positives. The genes on the array include 15 previously characterized mesoderm-specific genes, all of which were significantly reduced in twist mutant embryos. The arrays also contain genes known to be
transcribed in both mesoderm and other cell types. Significant changes in expression were detected for many of these genes (Furlong, 2001).
To ectopically express twist, a dominant gain-of-function mutation of the maternal gene Toll (Toll10B) was used. Activated Toll induces the expression of twist and snail in early embryos and of immune response genes in older embryos. Thus, the effects of Toll10B on gene expression reflect the activities of Twist as well as Snail and Dorsal, or their combined actions. Toll10B embryos are essentially bags of mesoderm; epidermal structures are absent or greatly reduced, and they have been used successfully in subtractive hybridization screens to identify mesoderm genes. The gene transcription profile of Toll10B embryos was compared with that of wild-type embryos during four periods of development, using the reference sample to normalize experiments. The earliest period, stage 5, is when twist is initially expressed in presumptive mesoderm. The other three periods analyzed were those used in the twist mutant analysis: stages 9-10, 11, and 11-12 (Furlong, 2001).
Data from loss- and gain-of-function experiments, combined with careful staging, yield a useful picture of genes that are likely to be required for mesoderm specification and muscle differentiation. Of 360 identified mesoderm genes, 273 have not been the focus of developmental studies. The predicted proteins encode transcription factors, signal transduction molecules, kinases, and pioneer proteins. The stage at which each gene is active is one criterion for assigning possible functions. Another key criterion will be finding a mutant phenotype. As a pilot, this additional step was undertaken for the gene CG4677 (LD47926). Changes in CG4677 transcript levels were also observed in a Toll10B subtractive hybridization screen (Furlong, 2001).
CG4677 is transcribed in the visceral mesoderm at stages 10-13 and the somatic mesoderm during stages 11-13. This gene encodes a C2H2 zinc finger transcription factor with high sequence similarity to vertebrate Gli proteins, the gene has been named gleeful (gfl). Mammalian Gli proteins act downstream of Hedgehog signaling proteins to control target gene transcription (Furlong, 2001).
The role of gfl in mesoderm development was assessed by disrupting its function using RNA interference. Injection of a double-stranded RNA (dsRNA) control sequence had no effect on mesoderm development. In contrast, gfl dsRNA injection causes severe loss and disorganization of somatic muscle cells, whereas heart and visceral muscle were unaffected. A similar phenotype was seen in Df(3R)hh homozygous embryos: the deficiency removes gfl but not the nearby hedgehog gene (Furlong, 2001).
To determine whether gfl can induce muscle cell development, a UAS-gfl transgenic fly strain was generated. Ectopic expression of gfl using an en-GAL4 driver results in lethality and induction of ectopic dMEF2 expression in the ventral nerve cord. Remarkably, Gfl is sufficient to induce expression of a muscle gene in neuronal cells. Previous studies have shown an essential role for Sonic hedgehog signaling in the formation of slow muscle in avian and zebrafish embryos. gfl may be performing a similar role in Drosophila somatic muscle development (Furlong, 2001).
Determining the position of ventro-lateral neuroectoderm versus dorsal non neural ectoderm is controlled by
maternal (dorsal) and zygotic genes (dpp, sog, brk, sna, twi). SoxNeuro (SoxN) expression is specifically affected in these
mutants. dl mutants lack early SoxN expression. Embryos mutants for dpp show a dorsal expansion of SoxN expression, as also observed when
misexpressing sog by means of the Gal4 system. Inversely, misexpressing dpp early in embryogenesis leads to severe reduction of SoxN expressing-cells, as observed in sog mutants and sog, brk double mutants. Finally, twi mutants are characterized by a ventral expansion of SoxN expression into the presumptive mesoderm. These experiments are
consistent with a role for the D/V patterning genes in the
control of SoxN expression, with SoxN being negatively
regulated dorsally and ventrally by dpp and mesoderm
genes, and positively by sog and brk in the neuroectodermal region. A similar situation has been reported
in Xenopus, with SoxD, an essential mediator of neural
induction, being negatively regulated by BMP4 and positively by chordin (the vertebrate homologs of dpp and sog, respectively) (Cremazy, 2000).
The patterning of the Drosophila mesoderm requires Wingless.
Little is known about how Wg provides patterning information to the mesoderm,
which is neither an epithelium nor contains the site of Wg production. By
studying specification of muscle founder cells as marked by the lineage-specific
transcription factor Slouch, it was asked how mesodermal cells interpret the steady
flow of Wg. Through the manipulation of place, time and amount of Wg signaling,
it has been observed that Slouch founder cell cluster II is more sensitive to Wg
levels than the other Slouch-positive founder cell clusters. To specify Slouch
cluster I, Wg signaling is required to maintain high levels of the myogenic
transcriptional regulator Twist. However, to specify cluster II, Wg not only
maintains high Twist levels, but also provides a second contribution to activate
Slouch expression. This dual requirement for Wg provides a paradigm for
understanding how one signaling pathway can act over time to create a diverse
array of patterning outcomes (Cox, 2005).
In wg mutant embryos, the heart and approximately half the body
wall muscles are lost. One subset of these Wg-dependent body wall muscles can be
visualized using an antibody to the NK-homeodomain protein Slouch (S59). Slouch
expression arises in a precise, stereotypic pattern during embryonic
development. It is
first expressed in a single progenitor cell during early stage 11 of embryonic
development; this cell divides to give rise to two founder cells (Ia and Ib)
which together form cluster I (cI). During late stage 11, two additional
Slouch-positive progenitors appear at a different ventral location and divide
sequentially to form four founder cells that make up cluster II. Still
later, at stage 12, a single progenitor arises dorsally and divides to give
rise to cluster III. These muscle founder cells contain all the information
needed to create a particular subset of muscles and contribute to the
stereotypic set of larval muscles in each abdominal segment. After stage 12,
Slouch expression is maintained in a subset of these founder cells
that give rise, in the final muscle pattern, to muscle VT1 (from cI), VA2
(from cII) and DT1 (from cluster III).
Maintenance of Slouch expression in these founder cells is crucial to the
development of these muscles; removal of slouch leads to complete
(VT1) and partial muscle transformations (VA2; DT1). In this
study, focus was placed on the role of Wg in patterning the Slouch muscle founder
cells. For simplicity, focus was placed solely on two ventral Slouch clusters (I and
II), which develop independently and arise in a similar position along the dorsoventral axis but
have different anterior-posterior positions within each abdominal
hemisegment (Cox, 2005).
Through the manipulation of the amount and time of exposure to Wg signaling
in the Drosophila mesoderm, it is shown that Slouch founder cell
cII requires more Wg signaling than its neighbor, cI. Because cII arises in
the mesoderm beneath the source of Wg signal, it was initially thought that the
sensitivity that was detected would be due to Wg acting as a classic morphogen.
Specifically, during stage 11, Wg would directly elicit
concentration-dependent responses, leading to Slouch cI specification at low
levels and cII at higher levels. Instead, the data suggest an alternative
mechanism underlying this sensitivity. For Slouch cI, Wg signaling through
Twist is sufficient for fate specification. However, for Slouch cII, a second,
Twist-independent Wg signal is also necessary (Cox, 2005).
It has been shown that wg mutants fail to maintain high
levels of Twist.
Overexpression of Twist leads to expanded somatic mesodermal fates at the
expense of other mesodermal fates, such as heart and gut muscle. Conversely,
decreasing Twist levels leads to a reduction in somatic mesodermal fate, while
heart and gut muscle remain largely unaffected. These
findings underscore the importance of high Twist levels for the proper
implementation of somatic muscle fate. Because loss of high Twist levels leads
to loss of muscle founder cells, including all Slouch-positive clusters of
founder cells, it has always appeared that each Slouch cluster requires the
same amount of Wg signal (relayed through Twist) to assume its particular
fate. In this study, the requirement for Wg in maintaining high
Twist levels was uncoupled from the later role of Wg in specifying cII fate. The fact that
Twist specifically rescues Slouch cI in a wg mutant background
suggests that Slouch cII requires an additional, Twist-independent
contribution from Wg for proper patterning. Consistent with these results, wg
hypomorphs were found that provided sufficient signaling to maintain
high Twist levels during early mesoderm development and therefore pattern cI,
but that do not pattern cII. Temperature-shift experiments using wg
temperature-sensitive alleles have shown that Slouch cII specification and
engrailed expression in the ectoderm require Wg expression at later
stages of embryonic development. Thus, the absence of Slouch cII in the
different wg alleles, in hh mutant embryos and in a Twist
rescued wg mutant embryo, all suggest that proper patterning requires
not only an earlier Wg-dependent regulation of Twist, but also an additional
Wg contribution to specify its identity (Cox, 2005).
Manipulations of Wg signaling also revealed two additional aspects of
Wg signaling to the mesoderm. (1) It was found that the mesoderm, in general,
has a different threshold for Wg signaling when compared with the ectoderm.
Conditions that completely rescue the ventral ectoderm and epidermis
(wgPE6 at the permissive temperature) fail to completely
rescue the mesoderm. (2) It was found that different mesodermal targets respond
differently to Wg signaling. For example, expression of the
DeltaNTcf (dominant negative form of Pangolin) has mild effects
on Twist but significant effects on Slouch cII.
Although it is predicted that TCF binds slouch regulatory regions
directly, it was found that Wg regulates Twist both directly through TCF and
indirectly through the pair-rule gene sloppy-paired. Whether or not
the difference in Wg regulation of
twist and slouch is due to the structure of the regulatory
regions, additional factors that integrate on these promoters in these
contexts and the activity of the Arm/dTCF complex remains to be uncovered (Cox, 2005).
This study also underscores the contribution that other factors make to
position the Slouch clusters: ectopic Wg expression in the mesoderm does not
produce uniform Slouch expression. This aspect of Wg signaling is reflected in other tissues
such as the epidermis. The
size of Slouch cII could not be further enlarged beyond that seen when Wg signaling was
initially increased. This suggests a
prepatterning mechanism, perhaps involving the activity of the pair-rule genes
that have been shown to be responsible for segmentation of the mesoderm, as well as
the integration of other signal transduction pathways, such as EGF/FGF and
Notch signaling. The data suggest that Wg signaling then works on this
prepattern to regulate the domain of Slouch expression (Cox, 2005).
The effect of Wg on muscle patterning is similar to
that described for even-skipped muscle progenitor specification; that
is, Wg signaling (in collaboration with such signals as Decapentaplegic) is
first required to set up a region of 'competence' through activation of
mesoderm-specific factors such as Twist and Tinman. Wg then later cooperates
with these intrinsic factors to induce the expression of even-skipped
in dorsal muscle progenitors, much as would be suggested for Slouch cII. However, the
observations suggest an important variation of Wg signaling in mesodermal
patterning. In the case of Slouch patterning, Wg creates temporal as well as
spatial diversity, while in patterning eve it only acts temporally.
Wg signaling contributes to the expression of Slouch in its two discrete
ventral patches by two distinctive mechanisms: through the regulation of an
upstream transcription regulator (Twist), which is sufficient for one domain
of expression; and through the cooperation of this factor with a second,
temporally distinct Wg input for the second domain of expression. The
expression of the same gene but at two different times and places, through two
Wg-dependent means, gives insight into how an organism may generate diverse
tissues in response to the same signal (Cox, 2005).
Work carried out in the wing imaginal disc suggests that Wg acts as a
morphogen. In this tissue, Wg protein was visualized in a graded
distribution and it appears to activate multiple target genes directly, in a
concentration-dependent manner. Based on these criteria, Wg has been labeled as a classical
morphogen. However, careful inspection of the molecular mechanisms underlying
Wg activation of both short- and long-range targets in the wing have revealed
that the pattern of Wg expression changes during wing imaginal disc
development, and that Wg collaborates with other pathways to set up the
expression of these genes. These studies have cast doubt on whether Wg is a
true morphogen in this tissue (Cox, 2005).
Investigating the molecular mechanisms that govern patterning of
the embryonic mesoderm, similarly suggests that Wg does not act on Slouch
clusters I and II as a classical morphogen. Wg does not
activate cI directly, but instead maintains high levels of Twist,
which sets up a somatic mesodermal competency domain that is sufficient to
create cI. Additional Wg is then required later to pattern cII. It can be
argued that Wg acts as a morphogen to regulate Twist expression (at low
levels), and then to control Slouch expression (at high levels) within cells
of cII. However, the precise regulation and dependence of Slouch clusters I
and II on Wg within both the dorsoventral and anteroposterior axes suggest
that there must be additional patterning information available to properly
place these two cell types. As more putative morphogens are held up to the
lens of molecular biology, it will be interesting to see whether there are
unexpected, new twists in the molecular underpinnings of morphogens (Cox, 2005).
The maternal Toll signaling pathway sets up a nuclear gradient of the
transcription factor Dorsal in the early Drosophila embryo. Dorsal
activates twist and snail, and the Dorsal/Twist/Snail network
activates and represses other zygotic genes to form the correct expression
patterns along the dorsoventral axis. An essential function of this patterning
is to promote ventral cell invagination during mesoderm formation, but how the
downstream genes regulate ventral invagination is not yet known. wntD
(FlyBase name: Wnt8) is shown to be a member of the Wnt family. The expression
of wntD is activated by Dorsal and Twist, but the expression is much
reduced in the ventral cells through repression by Snail. Overexpression of WntD
in the early embryo inhibits ventral invagination, suggesting that the
de-repressed WntD in snail mutant embryos may contribute to inhibiting
ventral invagination. The overexpressed WntD inhibits invagination by
antagonizing Dorsal nuclear localization, as well as twist and
snail expression. Consistent with the early expression of WntD at the
poles in wild-type embryos, loss of WntD leads to posterior expansion of nuclear
Dorsal and snail expression, demonstrating that physiological levels of
WntD can also attenuate Dorsal nuclear localization. The de-repressed WntD in
snail mutant embryos contributes to the premature loss of snail
expression, probably by inhibiting Dorsal. Thus, these results together
demonstrate that WntD is regulated by the Dorsal/Twist/Snail network, and is an
inhibitor of Dorsal nuclear localization and function. The closest homologs of
Drosophila WntD, vertebrate Wnt8 proteins, regulate mesoderm patterning, neural
crest cell induction, neuroectoderm patterning, and axis formation (Hoppler,
1998; Lekven, 2001; Lewis, 2004; Popperl, 1997). These vertebrate Wnt8 proteins
may transmit the signal through the canonical pathway, but the exact mechanism
remains unclear. So far, the downstream mediators of Drosophila WntD signaling
are not known (Ganguly, 2005).
A second study (Gordon, 2005) confirms and extends Ganguly (2005) by
inducing a mutation in wntD by homologous replacement. The Gordon study
shows that WntD acts as a feedback inhibitor of the NF-kappaB homologue Dorsal,
during both embryonic patterning and the innate immune response to infection.
wntD expression is under the control of Toll/Dorsal signalling, and
increased levels of WntD block Dorsal nuclear accumulation, even in the absence
of the IkappaB homologue Cactus. The WntD signal is independent of the common
Wnt signalling component Armadillo. By engineering a gene knockout, this study
shows that wntD loss-of-function mutants have immune defects and exhibit
increased levels of Toll/Dorsal signalling. Furthermore, the wntD mutant
phenotype is suppressed by loss of zygotic dorsal (Gordon, 2005).
To identify novel components in the dorsoventral pathway, a microarray assay
was carried out using embryos derived from gain-of-function and loss-of-function
mutants of the Toll pathway. Among the novel genes identified, the expression
and function of wntD was analyzed because the Wnt family of secreted
proteins regulates patterning, cell polarity and cell movements. The results
show that wntD is activated by Dorsal and Twist but repressed by Snail.
Increased expression of WntD in wild-type early embryos inhibits ventral
invagination. Thus, wntD is the first Snail target gene shown to have an
interfering function in mesoderm invagination. The overexpressed WntD blocks
invagination by inhibiting Dorsal nuclear localization. Loss-of-function
analyses also show that physiological levels of WntD can attenuate Dorsal
nuclear localization and function. Therefore, wntD is a novel downstream
gene of the Dorsal/Twist/Snail network and can feed back to inhibit Dorsal
(Ganguly, 2005).
The dynamic pattern of wntD expression in the early embryo is a
combined result of activation by Dorsal/Twist and repression by Snail.
Overexpressed WntD negatively regulates Dorsal nuclear localization, leading to
an inhibition of ventral cell invagination. Physiological levels of WntD can
also negatively regulate Dorsal, since loss of WntD leads to detectable
expansion of both Dorsal nuclear localization and snail expression in the
posterior regions. Furthermore, de-repressed WntD expression in the ventral
region of snail mutant embryos can also attenuate Dorsal function.
However, the loss of WntD could not rescue the invagination defect of the
snail mutant embryo, suggesting that in the snail mutant embryo
there are other de-repressed genes that can interfere with ventral invagination
(Ganguly, 2005).
The wntD loss-of-function phenotype correlates with the expression of
wntD at the poles of pre-cellular blastoderms. wntD is also
expressed a bit later in the mesectoderm, and weakly in the mesoderm. Because
WntD can inhibit Dorsal, one speculation is that WntD in the early mesectoderm
may help to establish the sharp snail expression at the
mesectoderm-neuroectoderm boundary. However, no changes were detected in the
Dorsal protein gradient or snail pattern in the trunk regions of the
Df(3R)l26c embryos. It is speculated that the timing of early expression
of wntD, which may have additional input from the Torso pathway at the poles, is important for the feedback inhibition of Dorsal. By the time of cellularization, the Dorsal protein gradient is well established. This
well-established Dorsal gradient activates the wntD gene in the trunk
regions, but the subsequently translated WntD protein may not be capable of
exerting a strong negative-feedback effect on the already formed Dorsal
gradient. This timing argument is supported by the results of
WntD-overexpression experiments. The use of maternal nanos-Gal4 caused a strong inhibition of Dorsal nuclear localization and of ventral invagination, whereas the use of zygotic promoters did not result in a significant phenotype (Ganguly, 2005).
Snail acts as a transcriptional repressor for at least 10 genes in the
ventral region where mesoderm arises. In snail mutant embryos, all of
these target genes are de-repressed in the ventral cells, concomitant with
severe ventral invagination defects. However, no direct evidence has been
reported on whether these de-repressed genes interfere with invagination. This study showed for the first time that a target gene of Snail, namely wntD, can block ventral invagination when overexpressed. If de-repressed WntD is solely responsible for inhibiting ventral invagination, it would be expected that, in the snail;Df(3R)l26c double-mutant embryos, ventral invagination would appear again. No rescue of ventral invagination was detected in the double-mutant embryos, suggesting that wntD is not the only de-repressed target gene that inhibits invagination. Nonetheless, the de-repressed WntD can attenuate Dorsal function, and may contribute to the ventral invagination defect (Ganguly, 2005).
Although hundreds of evolutionarily conserved microRNAs have been discovered, the functions of most remain unknown. This study describes the embryonic spatiotemporal expression profile, transcriptional regulation, and loss-of-function phenotype of Drosophila miR-1 (DmiR-1). DmiR-1 RNA is highly expressed throughout the mesoderm of early embryos and subsequently in somatic, visceral, and pharyngeal muscles, and the dorsal vessel. The expression of DmiR-1 is controlled by the Twist and Mef2 transcription factors. DmiR-1KO mutants, generated using ends-in gene targeting, die as small, immobilized second instar larvae with severely deformed musculature. This lethality is rescued when a DmiR-1 transgene is expressed specifically in the mesoderm and muscle. Strikingly, feeding is what triggers DmiR-1KO-associated paralysis and death; starved first instar DmiR-1KO larvae are essentially normal. Thus, DmiR-1 is not required for the formation or physiological function of the larval musculature, but is required for the dramatic post-mitotic growth of larval muscle (Sokol, 2005).
MiR-1 is an evolutionarily conserved miRNA whose tissue-specific expression pattern also appears to be phylogenetically conserved. The worm and fly genomes each possess a single miR-1 gene (Cel-miR-1 and DmiR-1, respectively) while the zebrafish, mouse, and human genomes each contain two miR-1 loci (Dre-miR-1-1 and Dre-miR-1-2, Mmu-miR-1-1 and Mmu-miR-1-2, and Hsa-miR-1-1 and Hsa-miR-1-2, respectively). Northern blot analysis indicates that miR-1 is expressed specifically in mouse and human heart and skeletal muscle. This result has been further confirmed using three different in vivo techniques to examine miRNA expression; a murine Mmu-miR-1-1 and Mmu-miR-1-2 'sensor' transgene is repressed specifically in the adult heart; promoter fusion constructs to Mmu-miR-1-1 and Mmu-miR-1-2 are both expressed in cardiac and skeletal muscle precursor cells; in situ hybridization using locked nucleic acid (LNA) probes detected Dre-miR-1-1 and Dre-miR-1-2 expression in muscle. Two recent studies suggest that miR-1 might possess a variety of in vivo functions. Microarray analysis of HeLa cells transfected with Hsa-miR-1 indicates that miR-1 functions to maintain muscle cell identity by repressing the expression of nonmuscle genes. Overexpression analysis of Mmu-miR-1 indicates that it may regulate the proliferation of cardiomyoctes by controlling the expression of the Hand2 transcription factor (Sokol, 2005 and references therein).
To directly assess the in vivo function of miR-1, the expression pattern, transcriptional regulation, and loss-of-function phenotype of Drosophila miR-1 (DmiR-1) were each examined. As in zebrafish, mice, and humans, DmiR-1 is specifically expressed in muscle cells. DmiR-1 expression is regulated by the promesodermal transcription factor Twist and the promyogenic transcription factor Mef2. Muscles form normally in DmiR-1KO mutant embryos and function normally in first instar DmiR-1KO mutant larvae. However, when larval growth is initiated by feeding, DmiR-1KO mutant larvae become paralyzed, arrest their growth and ultimately die as small, second instar larvae with massively disrupted somatic musculature (Sokol, 2005).
In situ hybridization was performed using probes to detect the full-length primary transcript (pri-DmiR-1) from which the DmiR-1 21mer is processed. Pri-DmiR-1 is initially detected ventrally in the presumptive mesoderm of stage 5 embryos in a pattern similar to that of the snail and twist gene products. Direct comparison of the pri-DmiR-1 expression pattern along the anterior-posterior axis with the snail and twist mRNA pattern in the cellular blastoderm indicates that pri-DmiR-1 shares the sharp posterior border of snail RNA but its anterior boundary is posterior to that of both twist and snail RNA. Pri-DmiR-1 is expressed in the ventral-most cells of the cellular blastoderm, in a swath that is 18-20 cells wide. During gastrulation and after invagination of the ventral cells of the cellular blastoderm, pri-DmiR-1 is expressed exclusively in mesodermal cells. After subdivision of the mesoderm, pri-DmiR-1 expression is observed in the cephalic mesoderm and the primordia of the visceral and somatic musculature. After germ-band retraction, pri-DmiR-1 was detected in the pharyngeal and visceral musculature, the myocardial cells of the dorsal vessel and in the segmentally repeated clusters of mesodermal cells that give rise to the somatic muscles. Notably, pri-DmiR-1 was restricted to muscle progenitors and was not expressed in other mesodermal derivatives such as the fat body, gonadal mesoderm, and midline glia (Sokol, 2005).
The expression pattern of the mature 21-nt form of DmiR-1 was directly assayed using a 21-nt digoxigenin-labeled LNA oligonucleotide probe complementary to DmiR-1. The similarity between the two expression patterns confirmed that DmiR-1 RNA is expressed specifically in the mesoderm and its muscle cell derivatives. However, there were two key differences between the expression patterns. (1) While pri-DmiR-1 is detected in a punctate, nuclear-staining pattern, DmiR-1 21mer is cytoplasmic. This is consistent with the rapid processing of miRNA transcripts into mature miRNAs in conjunction with transport to the cytoplasm. (2) While robust expression in the presumptive mesoderm is detected with the pri-DmiR-1 probes, the DmiR-1 21mer was first faintly detected later at gastrulation. This could reflect stage-specific differences in probe accessibility. Alternatively, pri-DmiR-1 may be transcribed beginning at the cellular blastoderm stage but not processed until gastrulation (Sokol, 2005).
Four lines of evidence are presented that DmiR-1 is essential for myofiber function during larval development: (1) consistent with the muscle- and heart-specific expression of miR-1 in zebrafish, mouse, and humans, zygotically expressed DmiR-1 is expressed in most, if not all, the myogenic cells of the larval muscle system; (2) the genetic removal of zygotic DmiR-1 results in a highly penetrant and temporally specific locomotion defect -- first instar DmiR-1KO larvae become increasingly lethargic and they eventually die as small immobilized, second instar larvae; (3) the body wall muscles of second instar DmiR-1KO mutant larvae are massively disrupted, presumably causing the locomotion defec; (4) the larval DmiR-1KO mutant phenotypes, gradual paralysis, and death, can be rescued when DmiR-1 transcript is expressed specifically in muscle cells (Sokol, 2005).
Strikingly, the DmiR-1KO larval phenotype is only manifest after feeding. Newly hatched DmiR-1KO larvae that have not yet begun to feed are nearly wild type in their body wall contractions, dorsal vessel contractions, and excretion. Furthermore, first instar DmiR-1 mutant larvae that are cultured with only sucrose for an energy source but no nutritional supplement to support growth, and consequently do not proceed through larval development to the second instar, are essentially wild type in their movement and longevity. This growth-dependent larval muscle phenotype of DmiR-1KO animals is in contrast to previously described mutations that more generally disrupt larval muscle function. For example, mutations in the alpha-actinin and ryanodine receptor genes, both of which are expressed throughout the larval musculature, cause growth-independent phenotypes; muscle cell function is strongly compromised even in newly hatched larvae. From this, it is concluded that DmiR-1 function and hence the regulation of gene expression by the DmiR-1 miRNA, is critical for maintaining muscle integrity during the dramatic, post-mitotic growth in muscle mass of wild-type larvae (Sokol, 2005).
Wild-type newly hatched Drosophila larvae encounter a developmental decision: Larvae that are fed a sucrose-only diet arrest development as first instars and can remain so, vigorously searching for food, for up to 2 weeks. By contrast, larvae that are fed a nutritional food source grow dramatically, increasing their body mass ~200-fold in 4 d. For most terminally differentiated larval-specific tissues, including the gut, epidermis, fat body, trachea and salivary glands, larval growth is accomplished by expansion of cell size rather than by cell proliferation. For these cells, feeding triggers a specialized cell cycle, the endocycle, in which cells undergo rounds of DNA replication without division. In contrast, the precursors of adult structures, including imaginal disc cells and neuroblast cells, grow via conventional diploid cell cycles (Sokol, 2005).
Little is known about the cell biology of larval muscle growth. While neither cell division nor nuclear division takes place, each myofiber expands at least 100-fold in size, presumably also involving endocyclic DNA replication. The data do not distinguish whether the DmiR-1KO mutation disrupts entry into the endocycle or some other step in muscle cell growth. Genes that act throughout the larva to control entry into the endocycle have been identified as mutations that cause growth arrest after feeding. But unlike DmiR-1KO, endocycle mutants display wild-type larval behavior and locomotion. Hence the DmiR-1KO phenotype cannot be explained as simply a consequence of muscle cells failing to enter the endocycle (Sokol, 2005).
Evidence is presented that places DmiR-1 within the established mesodermal and myogenic transcriptional networks. Twist and Dorsal are known to coregulate a number of mesodermal genes and two weak Dorsal-binding sites have been observed in the DmiR-1 enhancer/lacZ fusion transgenes tested here. However, the finding that ectopic Twist in a gastrulation defective mutant background is sufficient to direct DmiR-1 expression indicates that Twist alone, without Dorsal, is sufficient for DmiR-1 expression. Although Twist activates DmiR-1 transcription throughout the embryonic mesoderm it is nevertheless found that embryos entirely depleted of DmiR-1 appear normal. The expression of DmiR-1 in larval muscle, where it carries out its critical function, is likely to be maintained by factors downstream of Twist, particularly Mef2. So, what might be the role for Twist-mediated embryonic expression of DmiR-1? The early expression of DmiR-1 throughout the mesoderm could be gratuitous for the embryo per se but nevertheless could reflect a role for Twist-activated DmiR-1 at a later developmental stage. For example, Twist expression is maintained during larval development in clusters of undifferentiated cells that will give rise to adult myoblasts and ultimately to adult muscles. Interestingly, the Notch signaling pathway down-regulates Twist expression and thereby promotes the formation of embryonic somatic muscle founder cells at the expense of adult myoblast progenitor cells. Perhaps Twist-activated expression of DmiR-1 counteracts Notch signaling and hence helps maintain adult myoblast progenitor cells during embryogenesis and larval development (Sokol, 2005).
MicroRNAs (miRNAs) regulate posttranscriptional gene activity by binding to specific sequences in the 3' UTRs of target mRNAs. A number of metazoan miRNAs have been shown to exhibit tissue-specific patterns of expression. This study investigated the possibility that localized expression is mediated by tissue-specific enhancers, comparable to those seen for protein-coding genes. Two miRNA loci in Drosophila melanogaster are investigated, the mir-3096 polycistron (8-miR) and the mir-1 gene. The 8-miR locus contains a cluster of eight distinct miRNAs that are transcribed in a common precursor RNA. The 8-miR primary transcript displays a dynamic pattern of expression in early embryos, including repression at the anterior and posterior poles. An 800-bp 5' enhancer was identified that recapitulates this complex pattern when attached to a RNA polymerase II core promoter fused to a lacZ-reporter gene. The miR-1 locus is specifically expressed in the mesoderm of gastrulating embryos. Bioinformatics methods were used to identify a mesoderm-specific enhancer located ~5 kb 5' of the miR-1 transcription unit. Evidence is presented that the 8-miR enhancer is regulated by the localized Huckebein repressor, whereas miR-1 is activated by Dorsal and Twist. These results provide evidence that restricted activities of the 8-miR and miR-1 miRNAs are mediated by classical tissue-specific enhancers (Biemar, 2005).
The 8-miR complex is located between two predicted protein-coding genes, CG15125 and CG11018, in the 56E region on the right arm of chromosome 2. To determine the approximate transcription start site of the 8-miR transcription unit, 5' RACE was used. Several independent experiments were carried out, and RACE products corresponding to two different start sites were isolated several times. Consensus sequences for both an initiator and a TATA box are appropriately spaced upstream of the identified start sites. The alignment of this genomic interval with the corresponding regions of the most divergent Drosophilids indicates strong conservation of each of the individual miRNAs within the 8-miR complex (Biemar, 2005).
A digoxigenin-labeled 8-miR antisense RNA probe was hybridized to staged embryos to determine the expression profile of the precursor transcript during development. Expression is initially detected in all of the nuclei of precellular embryos. As expected, staining is restricted to nuclei and not seen in the cytoplasm. The first indication of differential spatial regulation occurs at the midpoint of cellularization, when 8-miR transcripts are lost at the posterior pole. By the completion of cellularization, this loss in staining expands and there is also reduced expression in anterior regions. Staining persists at the anterior tip but is lost from subterminal regions of the anterior pole (Biemar, 2005).
During gastrulation there is both dorsal-ventral and anterior-posterior modulation of the 8-miR-staining pattern. Staining is first lost from the presumptive mesoderm and neurogenic ectoderm in ventral and lateral regions. There are transient stripes of 8-miR expression in the dorsal ectoderm, but they rapidly give way to a single band of staining in central regions. By the onset of the rapid phase of germband elongation, staining is essentially lost except for residual expression at the anterior tip and dorsal ectoderm (Biemar, 2005).
The early loss of staining at the posterior pole suggests that Huckebein (Hkb) might repress 8-miR transcription in the early embryo. To investigate this possibility, colocalization assays were done with snail, which is selectively expressed in the presumptive mesoderm of cellularizing and gastrulating embryos. The posterior border of the snail pattern is established by the localized Hkb repressor. The 8-miR pattern displays a similar posterior border, and there is an expansion of both the snail and 8-miR patterns in hkb-/hkb- mutant embryos (Biemar, 2005).
Further evidence for repression by Hkb was obtained by analyzing torso dominant (torD) mutants. tor encodes a receptor tyrosine kinase that is normally activated only at the poles, where it is required for the localized expression of tailless (tll) and hkb. torD encodes a constitutively activated form of the receptor tyrosine kinase that results in expanded expression of hkb and tll at the poles. This expansion in Hkb causes a severe shift in the posterior border of both the snail and 8-miR expression patterns. The identification of a sequence-specific transcriptional repressor, Hkb, as a likely regulator of 8-miR expression suggests that the dynamic staining pattern is probably controlled at the level of de novo transcription (Biemar, 2005).
Direct support for this possibility was obtained by the identification of an 8-miR enhancer. An ~800-bp genomic DNA fragment extending from the miR-3 region of the 8-miR complex to the predicted start site of CG11018 was attached to a lacZ-reporter gene containing the minimal eve promoter sequence. The resulting fusion gene recapitulates most aspects of the endogenous 8-miR expression pattern. In particular, lacZ transcripts are initially detected throughout precellular embryos but sequentially lost from the posterior pole and anterior regions during cellularization. At the onset of gastrulation, expression is diminished in ventral regions, and the staining detected in the dorsal ectoderm exhibits segmental modulation. Thus, the 5' 8-miR enhancer contains repression elements that mediate silencing by Hkb (and possibly Tll) at the termini in response to Tor signaling (Biemar, 2005).
The preceding analysis provides evidence that cell-specific enhancers regulate miRNA gene expression, as seen for protein coding genes. Further support was obtained by analyzing a second miRNA that displays localized expression in the early Drosophila embryo, miR-1. The mir-1 gene is highly conserved in different animal groups and displays localized expression in a variety of mesodermal lineages, including cardiac mesoderm in vertebrates. The Drosophila mir-1 gene is first expressed in the presumptive mesoderm during the final phases of cellularization. Expression persists in differentiating mesodermal tissues during gastrulation, germband elongation, and segmentation. Mutant embryos that contain the constitutively activated Toll10B receptor display ubiquitous expression of miR-1, concomitant with the transformation of all of the tissues into mesoderm (Biemar, 2005).
Whole-genome tiling arrays were used to obtain an estimate of the miR-1 transcription unit. These high-density oligonucleotide arrays contain 25-nt oligomers spaced on average every 36 bp and cover the entire nonrepetitive Drosophila genome, from one end of each chromosome to the other. Total RNA was extracted from three different mutant strains. Embryos derived from pipe-/pipe- females lack Toll-signaling activity and thereby lack a Dorsal nuclear gradient. As a result, genes normally activated by high, intermediate, and low levels of the gradient are silent, and there is a loss of mesoderm and neurogenic ectoderm. Instead, genes that are repressed by the Dorsal gradient, and normally restricted to the dorsal ectoderm, are now expressed throughout the embryo, causing the transformation of mesoderm and neurogenic ectoderm into dorsal ectoderm. Previous microarray assays have shown that genes expressed in the dorsal ectoderm are overexpressed in mutant embryos derived from pipe-/pipe- embryos. As expected, such mutants display little or no expression of the miR-1 transcription unit. Similarly, embryos derived from Tollrm9/Tollrm10 mutants contain weak Toll signaling and low levels of nuclear Dorsal everywhere. These low levels are insufficient for the activation of mesoderm genes, but are sufficient for the activation of neurogenic genes and the repression of dorsal ectoderm genes. Again, these mutants fail to express miR-1. Toll10B embryos contain strong, ubiquitous Toll signaling and high levels of Dorsal, which activate mesoderm genes throughout the embryo. These embryos display strong expression of the miR-1 transcription unit. The tiling array suggests that the gene is ~2.9 kb in length. The mature, processed miRNA is located roughly in the center of the inferred transcription unit (Biemar, 2005).
The early expression of the miR-1 primary transcript in the mesoderm raises the possibility that the gene might be regulated by the Dorsal gradient. Approximately one-half of all Dorsal-target enhancers also contain binding sites for the basic helix-loop-helix Twist activator. A 50-kb interval encompassing the miR-1 locus was surveyed for clusters of Dorsal and Twist binding sites. The best cluster was identified ~5 kb upstream of the miR-1 start site. There are a total of three Dorsal- and four Twist-binding sites contained over an interval of ~1.1 kb in this distal 5' region (Biemar, 2005).
A genomic DNA fragment encompassing these sites was attached to a lacZ-reporter gene and expressed in transgenic embryos. The reporter gene exhibits localized expression in the ventral mesoderm, beginning at the onset of gastrulation. Expression persists during germband elongation. These observations suggest that miR-1 is directly activated by Dorsal and Twist. However, lacZ transcripts expressed from the miR-1::lacZ transgene are detected somewhat later than the endogenous miR-1 primary transcript, which first appears before the completion of cellularization. It is conceivable that the miR-1 locus contains a second enhancer that directs earlier expression (Biemar, 2005).
The preceding analysis provides evidence that dynamic patterns of miRNA gene expression are controlled by tissue-specific enhancers, and not by the differential processing of miRNA precursor RNAs. Both the 8-miR and miR-1 enhancers produce authentic patterns of lacZ-reporter gene expression when attached to the core promoter region of the eve gene. The 8-miR enhancer appears to be regulated by the Hkb repressor, whereas miR-1 is activated by Dorsal and Twist (Biemar, 2005).
The miR-1 enhancer is somewhat unusual among 'type 1' Dorsal target enhancers, in that it contains a large number of Snail repressor sites. Type 1 enhancers are activated by high levels of the Dorsal gradient in the ventral mesoderm. Previous studies have identified six such enhancers. They all contain multiple low-affinity Dorsal binding sites, but essentially lack Snail repressor sites. The general absence of Snail sites permits activation of type 1 genes in the ventral mesoderm where there are high levels of the repressor. An exception is the type 1 intronic enhancer that regulates Heartless (Htl), one of the two FGF receptor genes in the Drosophila genome (Biemar, 2005).
The htl intronic enhancer is ~800 bp in length and contains two low-affinity Dorsal binding sites and two optimal Twist sites. Each Twist site overlaps a Snail repressor site, but the enhancer nonetheless activates lacZ-reporter gene expression in the presumptive mesoderm before the completion of cellularization. The htl enhancer fails to mediate expression in the neurogenic ectoderm because it lacks the arrangement of optimal Dorsal and Twist sites required for activation by intermediate levels of the Dorsal gradient (type 2 enhancers) (Biemar, 2005).
The miR-1 enhancer contains three weak Dorsal sites, four optimal Twist sites (CACATGT; Kate Senger, unpublished results cited in Biemar, 2005), and five Snail repressor sites (three of the sites overlap the optimal Twist sites and two occur at separate sites). Perhaps the relative increase in the number of Snail repressor sites in the miR-1 enhancer (vs. the htl enhancer) causes late onset of miR-1::lacZ transgene expression. The Snail repressor is transiently expressed in the ventral mesoderm during cellularization but disappears after invagination. It is during the time when Snail levels subside that the miR-1 enhancer first becomes active (Biemar, 2005).
Previous studies have emphasized the importance of the Snail repressor in defining spatially localized patterns of gene expression. Dorsal target genes activated by intermediate (type 2) and low (type 3) levels of the gradient contain Snail repressor sites that keep the genes off in the ventral mesoderm and restricted to the neurogenic ectoderm. The present identification of the distal miR-1 enhancer raises the possibility that Snail also influences the timing of gene expression (Biemar, 2005).
The ventral midline is a source of localized signals that help pattern the nerve cord. For example, a transmembrane protease encoded by rhomboid (rho) produces a secreted source of the EGF ligand Spitz. Sim also leads to the expression of slit, which encodes a secreted repellant that binds the Roundabout receptor and inhibits the growth of axonal projections across the midline (Zinzen, 2006b).
Sim target genes are highly conserved in A.mel., and in situ hybridization assays reveal that they are similarly expressed in the ventral midline of the developing honeybee nerve cord. However, their expression is significantly broader in A.mel. than in D.mel., 5–6 cells versus 1–2 cells, respectively. Evidence is presented that this broader midline is due to divergent regulation of sim expression. In A.mel., sim is regulated solely by Twi and does not depend on Notch signaling, whereas Notch is responsible for restricting sim to single rows of cells in the early D.mel. embryo (e.g., Bardin, 2006; De Renzis, 2006). It is proposed that the acquisition of Notch dependence at the sim locus is sufficient to account for restricted expression of sim and the narrow midline in D.mel (Zinzen, 2006b).
The ventral midline in D.mel. embryos encompasses just 1–2 cells that express signaling molecules such as rho and slit. In contrast, orthologous genes are expressed in 5–6 cells in the honeybee embryo. Notably, the initial expression pattern of A.mel. sim is expanded, and the sim-staining pattern remains broad after convergence of the midline following the spreading of neurogenic ectoderm over the mesoderm. In addition to expression in the ventral midline, sim staining is also detected in more lateral clusters of cells exhibiting segmental periodicity in A.mel. embryos; these might be neurons or glial cells migrating away from the midline (Zinzen, 2006b).
Previous studies suggest that sim functions as a 'master control gene' to direct differentiation of the ventral midline in D.mel. To determine whether the expanded sim pattern in honeybees can account for the broadening of the midline, whether ectopic sim expression is sufficient to induce transcription of target genes such as slit and rho. The D.mel. sim-coding sequence was placed under the control of the eve stripe 2 enhancer (eve.2) and expressed in transgenic embryos. There is transient sim expression in the stripe 2 domain of early (stages 5–7) embryos in addition to the endogenous pattern (mesectoderm) in the presumptive ventral midline (Zinzen, 2006b).
The initial sim expression pattern is established by a distal 5′ enhancer that contains linked Dl-, Twi-, and Suppressor of Hairless [Su(H)]-binding sites. Expression is maintained by a separate autoregulatory enhancer containing Sim/Tango-binding sites; Tango is a ubiquitous bHLH-PAS transcription factor that forms heterodimers with Sim. Though the eve stripe 2 enhancer mediates transient activation, autoregulation maintains expression of the endogenous sim gene in the ventral neurogenic ectoderm of advanced-stage embryos, but not in the mesoderm or dorsal ectoderm (Zinzen, 2006b).
Ectopic sim expression leads to the induction of various target genes, including rho, slit, sog, and the transcription factor otd. These results provide evidence that ectopic sim expression is sufficient to expand the ventral midline in D.mel. In principle, the altered midline seen in the honeybee embryo could be explained by a change in sim regulation. The distal 5′ enhancer that establishes sim expression is the most likely site of change, since the autoregulatory enhancer merely maintains expression within the limits of the established pattern (Zinzen, 2006b).
To determine the basis for the distinct sim expression patterns in flies and honeybees, it was necessary to isolate the early sim enhancer from A.mel. However, the identification of homologous enhancers is complicated by the rapid turnover of noncoding DNA sequences in insect genomes. For example, the 5′ flanking regions of the sim loci in D.mel. and A.gam. lack simple sequence homology, even though they belong to the same order (Diptera). Nonetheless, it was possible to identify the early sim enhancer in A.gam. based on the clustering of Dl-, Twi-, and Su(H)-binding sites. The D.mel. and A.gam. enhancers are located in similar positions relative to the sim transcription unit (Zinzen, 2006b).
A.mel. is a member of the order Hymenoptera and is highly divergent from D.mel. Computational methods used for the in silico identification of the A.gam. sim enhancer were further developed to ensure the accurate identification of the sim enhancer in A.mel. The current method (ClusterDraw2) employs position-weighted matrices (PWMs) to identify binding motif clusters (Zinzen, 2006b).
The efficacy of the method was tested by surveying ~50 kb genomic intervals encompassing the sim loci of D.mel. and A.gam.. PWMs of Dl, Twi, Snail, and Su(H) were used in various combinations and individually. The best binding site clusters coincide exactly with the known sim enhancers (Zinzen, 2006b).
ClusterDraw2 was used to survey a ~50 kb genomic DNA interval encompassing the sim locus of A.mel.. The best prediction occurs in the 5′ flanking region of the gene, similar to the locations of the fly and mosquito enhancers. However, while the D.mel. and A.gam. sim enhancers contain several optimal Su(H)-binding sites, the A.mel. cluster lacks such sites, but contains several high-scoring Twi sites. This is consistent with the possibility that A.mel. sim is regulated by Twi alone, rather than by the combination of Twi+Notch (Zinzen, 2006b).
A 2.2 kb genomic DNA fragment encompassing the predicted A.mel. sim enhancer directs lateral stripes of lacZ expression in transgenic D.mel. embryos. A similar pattern was obtained with a 471 bp fragment containing the predicted Twi-binding sites. This pattern encompasses 3–4 cells on either side of the presumptive mesoderm, similar to the expression of the endogenous A.mel. sim gene, but distinct from the single-row sim patterns in D.mel. and A.gam. (Zinzen, 2006b).
The fly, honeybee, and mosquito sim enhancers were crossed into various genetic backgrounds to determine the basis for their distinct expression patterns. The D.mel. m5/8 enhancer was also examined. It is located within the Enhancer of split (E(spl)) complex, where it controls the expression of the m5 and m8 genes within the mesectoderm. The m5/8 enhancer directs lacZ expression in a pattern that is virtually identical to that produced by the D.mel. sim enhance (Zinzen, 2006b).
Transgenic D.mel. embryos carrying an eve.2::NICD fusion gene exhibit ectopic Notch signaling in the eve stripe 2 domain. The m5/8-lacZ transgene is strongly induced in the neurogenic ectoderm and dorsal ectoderm, but not in the mesoderm, where the Sna repressor is present. The D.mel. sim-lacZ transgene displays only modest ectopic induction by the eve.2::NICD transgene; this induction appears as a 'pyramid' limited to ventral regions of the neurogenic ectoderm. This pyramid coincides with the intersection of ectopic Notch signaling and the endogenous Twi gradient. The different patterns ('pyramid' versus 'column') seen for the sim and m5/8 enhancers appear to reflect activation by Notch+Twi or regulation by Notch alone, respectively. The m5/8 enhancer contains an SPS (Su(H) Paired Site) motif, and it has been suggested that the endogenous m8 gene is activated solely by Notch signaling (Zinzen, 2006b).
The A.mel. sim enhancer is not activated by the eve.2::NICD transgene, consistent with the absence of Su(H) sites in this enhancer. To determine whether it is activated by Twi, the lacZ fusion gene was crossed into embryos carrying an hsp83::twi-bcd-3′UTR transgene that produces high levels of Twi transcripts at the anterior pole. The resulting ectopic anteroposterior Twi protein gradient induces intense expression of the lacZ reporter gene directed by the A.mel. sim enhancer. In contrast, neither the D.mel. sim enhancer nor the m5/8 enhancer is induced by this ectopic gradient. Finally, the D.mel. sim enhancer is inactive in mutant embryos derived from germline clones lacking Su(H) activity, whereas the honeybee sim enhancer is fully active. Thus, unlike the D.mel. sim enhancer, the A.mel. enhancer does not rely on Notch signaling (Zinzen, 2006b).
The preceding analysis suggests that the D.mel. sim enhancer is activated by Twi and Notch signaling, whereas the A.mel. sim enhancer is activated solely by Twi. These distinct modes of regulation are reflected by the composition of binding sites in the different enhancers. The A.mel. enhancer contains several optimal Twi sites, but it lacks unambiguous Su(H) sites. In contrast, the D.mel. enhancer contains several optimal Su(H) sites, but just one optimal Twi site. Both enhancers contain binding sites for the Sna repressor, which inhibits expression in the mesoderm (Zinzen, 2006b).
sim regulation was examined in the mosquito, A.gam., to determine whether the midline of ancestral dipterans might have been regulated solely by Notch signaling, as seen for the fly m5/8 enhancer. The A.gam. genome contains a clear ortholog of the sim gene, expressed in a single row of cells in the mesectoderm, similar to the pattern seen in D.mel. The A.gam. sim enhancer directs sporadic expression within the mesectoderm of transgenic D.mel. embryos, but it is strongly induced by the eve.2::NICD transgene. This response is similar to that obtained with the D.mel. m5/8 enhancer, but it is distinct from the 'pyramid' pattern seen for the D.mel. sim enhancer (Zinzen, 2006b).
To determine whether the sim loci of other drosophilids are regulated by Twi+Notch, as seen in D.mel., or Notch alone, sim enhancers from D. pseudoobscura (D.pse.) and D. virilis (D.vir.) were tested in transgenic eve.2::NICD D.mel. embryos. Surprisingly, these enhancers behave like the A.gam. sim enhancer: they are expressed throughout the neurogenic ectoderm and dorsal ectoderm (“column”) in response to Notch signaling, rather than the “pyramid” pattern indicative of Notch+Twi regulation. These observations suggest that the evolution of sim regulation is highly dynamic, although there is no obvious difference in the number or quality of Su(H) and Twi sites in the different drosophilid enhancers. Perhaps a subtle shift in the organization of binding sites distinguishes regulation by Notch alone versus Notch+Twi (Zinzen, 2006b).
Specification of muscle identity in Drosophila is a multistep process: early positional information defines competence groups termed promuscular clusters, from which muscle progenitors are selected, followed by asymmetric division of progenitors into muscle founder cells (FCs). Each FC seeds the formation of an individual muscle with morphological and functional properties that have been proposed to reflect the combination of transcription factors expressed by its founder. However, it is still unclear how early patterning and muscle-specific differentiation are linked. This question was addressed using Collier (Col; also known as Knot) expression as both a determinant and read-out of DA3 muscle identity. Characterization of the col upstream region driving DA3 muscle specific expression revealed the existence of three separate phases of cis-regulation, correlating with conserved binding sites for different mesodermal transcription factors. Examination of col transcription in col and nautilus (nau) loss-of-function and gain-of-function conditions showed that both factors are required for col activation in the 'naive' myoblasts that fuse with the DA3 FC, thereby ensuring that all DA3 myofibre nuclei express the same identity programme. Together, these results indicate that separate sets of cis-regulatory elements control the expression of identity factors in muscle progenitors and myofibre nuclei and directly support the concept of combinatorial control of muscle identity (Dubois, 2007).
col belongs to the class of Drosophila regulatory genes
with numerous introns, large amounts of flanking sequence and multiple
expression sites. During embryogenesis, col is expressed in the
MD2/PS0 head region, the somatic DA3 muscle, precursor cells of the lymph
gland, a small set of multidendritic (md) neurons of the peripheral nervous
system and specific neurons of the central nervous system (CNS). A lacZ reporter transgene (P{5col::lacZ}, abbreviated P5cl, contains 5 kb of
col upstream DNA, which faithfully reproduced col
transcription both in the MD2/PS0 and the DA3 muscle, starting at the
progenitor stage and not in promuscular cluster(s). To identify the missing cis-regulatory information, a longer construct was tested containing the entire 9 kb region separating col from CG10200, the next predicted upstream gene. In
addition to the head and DA3 muscle, P9cl expression reproduced
col expression in md neurons and a subset of neurons in the CNS. A
DNA fragment located further upstream, between CG10200 and the next predicted
gene CG10202, was independently shown to drive col expression in the
anteroposterior organiser of the wing imaginal disc
(Hersh, 2005). However, neither this construct nor P9cl reproduced Col expression in
promuscular clusters. The col transcription unit is immediately flanked at its 3' end by another gene, BEAF32, making rather unlikely the presence of cis-regulatory
elements within this region. However, it contains ten different introns, of total length around 30 kb, the cis-regulatory content of which remains to be assessed (Dubois, 2007).
To delineate more precisely the CRM driving col expression in the
DA3 muscle, a series of constructs was tested containing 2.6, 2.3, 1.6 and 0.9
kb of DNA upstream of the col transcription start site, respectively. P2.6cl retained the information necessary for col expression in MD2/PS0 and
the DA3 progenitor and muscle, although it was noted that P2.6cl expression in muscle progenitors was less robust than P9cl. P2.3cl was also activated in
MD2/PS0 at stage 6 and the DA3 muscle. However, unlike P9cl or
P2.6cl, P2.3cl was not activated in the DA3/DO5 progenitor but only
at the FC stage; ectopic lacZ expression was observed in clusters of neuroectodermal cells at embryonic stage 11). This difference indicated that cis-regulatory elements required for col expression in the DA3/DO5 progenitor reside
between positions -2.6 and -2.3 and act separately from those required for
expression in the DA3 FC and muscle. P1.6cl was active only in
MD2/PS0, whereas no expression at all could be detected with P0.9cl. Together, expression data from this series of reporter constructs allowed the mapping of the CRM required for col-specific expression in the DA3/DO5 muscle progenitor and DA3 FC/myofibre to a DNA fragment between positions -2.6 and -1.6 upstream of the col
transcription start (Dubois, 2007).
Advantage was taken of the recently available genome sequences of several
Drosophila species to search for conserved motifs in the col
upstream DNA, as it has often proven to be effective to identify functionally
important cis-regulatory elements. Among these species, D. virilis (D. vir) is the most distant from D. melanogaster (D. mel). It was first verified that Col expression in D. vir was similar to that in D. mel embryos and could infer from this that the regulatory information controlling col
transcription in the DA3 muscle lineage has been conserved. Sequence
comparison of 9 kb of the col upstream region between D. mel, D.
vir and four other Drosophila species, D. yakuba, D.
ananassae, D. pseudoobscura and D. mojavensis revealed numerous
stretches of high sequence conservation, of sizes up to 100 bp. Ten conserved motifs of size >20 bp, numbered 1 to 10 from 5' to 3', were found in the same order and at the
same relative position between position -2.6 and the start of transcription in
all six Drosophila species. To test the
relevance of this conservation, lacZ reporter constructs were created
containing either D. vir or D. mel DNA (Dubois, 2007).
P.3.4clvir corresponds to D. mel P2.6cl, whereas
P3.4-1.3clvir and P2.6-0.9cl are truncated
versions covering motifs 1 to 10. All four reporter genes showed expression in
the DA3 muscle, starting at the progenitor stage, confirming the evolutionary
conservation of a DA3-muscle-specific CRM. A Gal4 driver line
containing only the -2.6 to -1.6 region (P2.6-1.6cG), harbouring only
motifs 1 to 7, was also specifically expressed in the DA3 muscle. This confirmed that
the DA3 muscle CRM is located between positions -2.6 and - 1.6. It was noticed,
however, that expression of P2.6-1.6cG was weaker and more sporadic
than P2.6-0.9cl, suggesting the existence of cis-regulatory
element(s) between positions -1.6 and -0.9 contributing to robust DA3 muscle
expression. The conserved motifs 1 to 10 were searched for consensus
binding sites of known TFs that could account for col activation in
the DA3 muscle. This identified a binding site for the mesodermal basic
helix-loop-helix (bHLH) protein Twi (within motif 2), correlating well with the position of the muscle progenitor cis-element and a potential EBF/Col-binding site within motif 7. Further visual inspection of the sequence alignments identified other conserved TF-binding sites, including one Mef2-binding site within the -1.6 to -0.9 fragment and one consensus binding site for Nau (Huang, 1996; Kophengnavong, 2000). In contrast, the position of the Mef2 site correlated well with the requirement of the -1.6 to -0.9 fragment for robust DA3 muscle expression. The presence of a Nau-binding site was particularly intriguing since Nau is required for DA3 muscle formation. Potential binding sites for other TFs could be found in the DA3 CRM, but the annotation to the conserved sites. The relative paucity of known TF-binding sites in the conserved sequence motifs found in the DA3 muscle CRM leaves largely open the question of the roles of these motifs in col regulation (Dubois, 2007).
Functional dissection of the DA3 muscle CRM present in the col
upstream region showed that col expression in the DA3 FC can be
separated from its expression in the DA3/D05 progenitor and the promuscular
cluster. It thus revealed the existence of three steps in the transcriptional
control of muscle identity. That col expression in the DA3/D05 progenitor could
be uncoupled from that in promuscular clusters was in apparent contradiction
with the previous conclusion from pioneering studies on Eve expression in
dorsal muscle progenitors that this expression issued from Eve activation in
promuscular clusters. Restriction of Eve expression to progenitors was
considered a secondary step, mediated by N-signalling during progenitor
selection by lateral inhibition. To reconcile these data and this model, it is proposed that the muscle DA3 CRM is active only in the DA3/D05 progenitor because it lacks some
positively acting cis-elements necessary to counteract N-mediated repression
of col transcription. It has been shown that col
transcription is repressed by N during the progenitor selection process.
It is also noted that a Twi-binding site is present in the 'progenitor' subdomain
of the DA3 CRM. The functional importance of this site is supported by its in vivo occupancy in 4- to 6-hour-old embryos when selection of the DA3/DO5 progenitor takes place (Sandmann, 2007). Together, Twi in vivo binding and the col/P2.6cl/P2.3cl expression data suggest that Twi activity contributes to col expression in the DA3/DO5 progenitor but may not be sufficient to override N
repression of col transcription before progenitor selection.
Additional binding sites for Twi present in the col upstream region,
between positions -8.7 and -8.3, are also bound by Twi in vivo
(Sandmann, 2007) and probably contribute to the robustness of P9cl expression in
progenitor cells, but the question of which cis-regulatory elements mediate
col activation in promuscular clusters remains open. From Eve
expression studies, a computational framework has been developed to identify other FC-specific genes (Estrada, 2006; Philippakis, 2006). This framework, named Codefinder, integrates transcriptome data and clustering
of combinations of binding sites for five different TFs (Pnt, dTCF, Mad, Twi
and Tin). col/kn was selected by Codefinder owing to the presence of
five clusters of binding sites, four of which are located within introns
(Philippakis, 2006). It remains to be determined which of these could be responsible for col activation in promuscular clusters, but it is interesting to note that another
in vivo Twi-binding site in 4-6-hour-old embryos correlates with the
3'-most cluster (Sandmann, 2007). In addition to Twi, conserved binding sites for Nau and
Mef2 are found within the DA3 CRM. The Mef2 binding site is located in a
region required for robust DA3-muscle expression of a reporter gene. A direct control of col transcription by Mef2 during the muscle fusion process is further supported by the recent finding (Sandmann, 2006) that Mef2 binds in vivo to the col upstream region between 6 and 8 hours of embryonic development (Dubois, 2007).
Detailed analysis of col auto-activation revealed a reiterative,
two-step process: import of pre-existing Col protein in the fusion competent myoblast nuclei that incorporate into the growing DA3 myofibre precedes activation of col
transcription. This process ensures that all incorporated FCM nuclei acquire the same identity. Nau is required for maintaining col transcription in the DA3 muscle
precursor and this control is probably direct. The presence of a putative
EBF-binding site in the DA3 muscle CRM also correlates with the Col
requirement for maintaining its own transcription beyond the FC stage.
Thus, despite the failure to detect strong Col binding to this
site in vitro, it appears to be essential for col auto-regulation in
vivo. This suggests that in vivo binding is potentiated by one or more
specific co-factor(s) present in the DA3 muscle. One co-factor is probably
Nau, as the ability of Col to activate its own transcription in newly
recruited fusion competent myoblasts is dependent upon Nau activity. Nau is not
sufficient, however, as many muscles containing both Nau and Col proteins do
not activate col transcription. Interestingly, mouse
EBF (also known as Ebf1 and Olf1 - Mouse Genome Informatics) and E2A (Tcfe2a -
Mouse Genome Informatics), a bHLH protein of the same subgroup as MyoD, have
been shown to act on the same target promoter and synergistically upregulate
transcription of B-lymphocyte-specific genes, although no direct physical
interaction between EBF and E2A could be found in vitro. This suggested that
functional interaction of EBF and E2A, similar to Col and Nau, requires yet
another factor. Taking into account the restricted pattern of ectopic
col activation in hs-col conditions, it is hypothesised that Vg
could be another component of the DA3 combinatorial identity. However, we
found that Vg is not required for DA3 muscle specification, leaving open the
question of which factor may bridge Col and Nau functions (Dubois, 2007).
Unlike col or P2.6cl, P2.3cl is expressed in the DA3 FC and muscle
precursor but not the DA3/DO5 progenitor, showing that col
transcription in the progenitor and muscle precursor is under separate
control. These two phases of col regulation are intimately linked,
however, as Col is required for activating its own transcription in the nuclei
of FCM recruited by the DA3 FC. This regulatory cascade may explain how
pre-patterning of the somatic mesoderm and muscle identity are
transcriptionally linked in the Drosophila embryo. As discussed
above, the ability of Col to auto-regulate depends upon the presence of Nau, another muscle identity TF. Col and Nau act as obligatory co-factors fo maintenance/activation of Col expression in all nuclei of the DA3 muscle, thus bringing to light a clear case of combinatorial coding of muscle identity (Dubois, 2007).
Synergistic interactions between the maternal regulatory factor Dorsal (DL) and basic helix-loop-helix
(bHLH) activators, are essential for initiating differentiation of the mesoderm and neuroectoderm in
the early Drosophila embryo. DL-bHLH interactions mediating gene
expression in the neuroectoderm and mesoderm are fundamentally distinct. Sharp
on/off patterns of gene expression in the presumptive mesoderm do not require linkage of Dorsal and other bHLH binding sites. Analysis of minimal and synthetic promoter elements suggests that DL and bHLH activators, such as Twist, might interact with different rate-limiting components of the transcription complex.
(Szymanski, 1995).
The establishment of mesoderm and neuroectoderm in the early Drosophila embryo relies on
interactions between the Dorsal morphogen and basic-helix-loop-helix (bHLH) activators. Dorsal and the bHLH activator Twist synergistically activate transcription in cell culture and in vitro from a promoter containing binding sites for both factors. Somewhat surprisingly, a region of
Twist outside the conserved bHLH domain is required for the synergy. The two N-terminal Gln-rich regions of Twist appear to mediate synergistic activation by Dorsal and Twist and are also required for binding to Dorsal in vitro. In Dorsal, the rel homology
domain appears to be sufficient for synergy. The interaction between Dorsal and Twist does not appear to be of sufficient strength to yield cooperative binding to DNA. It is suggested that the interaction betweein Dorsal and Twist induces a conformational change in one of the factors that enables it to efficiently activate transcription (Shirokawa, 1997).
The basic helix-loop-helix transcription factor Twist
regulates a series of distinct cell fate decisions within the
Drosophila mesodermal lineage. These twist functions are
reflected in its dynamic pattern of expression, which
is characterized by initial uniform expression during
mesoderm induction, followed by modulated expression at
high and low levels in each mesodermal segment, and
finally restricted expression in adult muscle progenitors.
Two distinct partner-dependent functions for Twist were found t