RNA binding proteins

Drosophila genes listed by biochemical function

RNA binding proteins and proteins involved in post-transcriptional regulation

The architecture of pre-mRNAs affects mechanisms of splice-site pairing
The RNA-binding protein Rasputin/G3BP enhances the stability and translation of its target mRNAs
Necessity of Flanking Repeats R1' and R8' of Human Pumilio1 Protein for RNA Binding
The translational repressor Glorund uses interchangeable RNA recognition domains to recognize Drosophila nanos

abstrakt

DEAD-box subfamily ATP-dependent helicase protein

Activity-regulated cytoskeleton associated protein 1

RNA-binding protein - mediates intercellular RNA transfer - forms capsid-like structures that bind arc1 mRNA in neurons - loaded into extracellular

vesicles that are transferred from motorneurons to muscles - synaptic plasticity at the neuromuscular junction - trans-synaptic mRNA transport

Adar

double-stranded RNA adenosine deaminase

alan shepard

regulates insulator activity, neuronal remodeling during metamorphosis and gravitaxis

apontic

novel bZIP transcription factor and RNA-binding protein

Argonaute 1

PAZ domain protein involved in post-transcriptional gene silencing - mutants exhibit defects in the embryonic nervous system

armitage

RNA helicase involved in posttranscriptional gene silencing - mutations disrupt mRNA translational silencing of oskar in the oocyte and silencing of Stellate in male germ cells

arrest

ribonucleoprotein-type RNA-binding protein

Ars2

RNA-binding protein - key component of Drosophila antiviral immunity - interacts with Dcr-2 - required for siRNA-mediated silencing -

plays an essential role in miRNA-mediated silencing

Ataxin-2

multi-functional protein that binds DEAD box helicases of the Me31B family that associated with Argonaute and microRNA function -

required for microRNA function and synapse-specific long-term olfactory habituation -

assembles with polyribosomes and poly(A)-binding protein, a key regulator of mRNA translation

Ataxin-2 binding protein 1 (common alternative name: Rbfox)

RNA-binding protein - homolog of an autism-susceptibility gene - targets pumilio mRNA for destabilization

and translational silencing, thereby promoting germ cell development, oogenesis

aubergine

related to eukaryotic translation initiation factor 2C - involved in post-transcriptional gene silencing

bag of marbles

functions as a translation repressor by interfering with translation initiationg

benign gonial cell neoplasm

cofactor of Bag of marbles that directly inhibits Pumilio repression of Nanos mRNA activity

to promote differentiation of germ line stem cells

Bicaudal C

RNA-binding protein that regulates expression of specific germline mRNAs by controlling their poly(A)-tail length

bicoid

transcription factor - homeodomain - RNA binding protein

bicoid stability factor (common alternative name: Lrpprc1, Leucine-rich pentatricopeptide repeat domain-containing protein 1

functions in mRNA stability and post-transcriptional control of gene expression - has multiple roles in gene expression in

mitochondria - reader of the N6-methyladenosine (m6A) modification of RNA - required for progression through oogenesis and viability.

boule

RRM motif protein involved in spermatogenesis

brain tumor

involved in post-transcriptional regulation of Hunchback mRNA

bruno (preferred name: arrest)

ribonucleoprotein-type RNA-binding protein

cabeza

conserved neurally expressed nuclear RNA binding protein whose mutants exhibit reduced climbing abilities of adult flies and

anatomical defects in presynaptic terminals of motoneurons in third instar larvae - - negatively regulates the EGFR signaling pathway

required for determination of cone cell fate in the eye

crooked neck

mRNA splicing factor that participates in the assembly and control of the splicing machinery

cup

translational repressor - represses oskar translation - physically interacts with Bruno

Dicer-1

ribonuclease III family, double-stranded RNA domain binding domain, DEAD/DEAH box helicase, PAZ domain -

an enzyme involved in degrading RNA - involved in double-stranded RNA interference (RNAi) and post-transcriptional gene regulation (PTGS)

Dicer-2

DEAD/DEAH box helicase - mutants are defective in processing small interfering RNAs

DISCO Interacting Protein 1

double-stranded RNA binding protein - regulates the abundance of stable intronic sequence RNAs (sisRNAs) - controls

germline stem cell self-renewal - involved in innate immunity - prevents transcriptional activation by Ubx

egalitarian

RNA binding protein - participates along with BicaudalD in transport of mRNA during oogenesis - salivary gland morphogenesis

Eukaryotic initiation factor 4E

binds to the mRNA 5' cap thus controlling a crucial step in translation initiation - required for cell growth -

promotes dedifferentiation of neuroblasts back to a stem cell-like state thus functioning as an oncogene -

a target of Ago2 in translational repression - functions as a splice factor for msl-2 and Sxl pre-mRNAs

embryonic lethal abnormal vision (common alternative name: elav)

RNA binding protein

exuperantia

a core component of a large protein complex involved in localizing mRNAs both within nurse cells and the developing oocyte

fandango (common alternative name: faint sausage)

a subunit of the spliceosome-activating Prp19 complex (NineTeen complex), which is essential for efficient pre-mRNA splicing, regulates the efficiency of splicing

of zygotic transcripts and their abundance - required for normal cellularization, tracheal cell migration, and epidermal morphogenesis in the embryo, ortholog

of yeast SYF1 and human XAB2, important for spliceosome stabilization and activation, peripheral nervous system, central nervous system, axonal pathfinding

female sterile (3) homeless (preferred name: spindle E)

DE-H family of RNA-dependent ATPases

female sterile (1) Yb (common alternative name: Yb)

tudor domain RNA helicase, post-transcriptional gene regulation - regulates transposon silencing via the piRNA pathway -

component of the Yb body, a site for Piwi-associated RNA biogenesis, regulates Piwi transport to the nucleus

Fmr1

KH domain RNA-binding protein - homolog of mammalian Fragile X mental retardation gene - represses Futsch translation - mutants have synaptic structural defects

4EHP

Eukaryotic initiation factor 4E - a cap binding protein that inhibits hunchback, caudal and bicoid mRNA translation

found in neurons

an ELAV family RNA binding protein that works as a post-transcriptional regulator - Fne is present in the cytoplasm of all neurons - promotes bypass of proximal

polyadenylation signals in nascent transcripts - Lack of fne produces fusion of the mushroom body β-lobes and altered male courtship behaviour - Fne acquires

a mini-exon, generating a new protein able to translocate to the nucleus and rescue ELAV-mediated alternative polyadenylation and alternative splicing

gawky

RNA-binding protein - glycine-tryptophan (GW) repeat protein required for P-body integrity -

promotes mRNA deadenylation, decapping and degradation

glorund

RNA-binding protein - represses nanos (nos) translation and uses its quasi-RNA recognition motifs to recognize both G-tract and structured

UA-rich motifs within the nos translational control element - recruits dFMRP to inhibit nanos translation elongation

half pint (preferred name: poly-U-binding splicing factor)

RNA recognition motif protein - functions in both constitutive and alternative splicing - required during oogenesis

hangover

a nuclear RNA binding protein that regulates dunce transcript levels, bouton growth at the neuromuscular

junction, and cellular stress in response to ethanol, heat and paraquat

Helicase at 25E

DExH/D RNA helicase required in the nucleus for mRNA export - required in the cytoplasm for remodeling

the transacting factors that dictate mRNA cytoplasmic destination

hephaestus

polypyrimidine tract binding protein - controls alternative splicing - required to attenuate Notch activity after ligand-dependent activation during wing development

held out wings

KH motif

hiiragi (also known as poly(A) polymerase)

involved in both nuclear and cytoplasmic polyadenylation of mRNA

IGF-II mRNA-binding protein

zipcode-binding protein - regulator of RNA transport in oocytes and neurons - regulates aging of the Drosophila testis stem-cell niche -

contributes to the localized expression of gurken mRNA in the oocyte - counteracts endogenous small interfering RNAs to stabilize mRNAs

Inducer of meiosis 4

methyltransferase, internal modification of mRNA - regulates alternative splicing and Notch

signaling - regulates sex determination and dosage compensation - modulates neural functions

Inositol-requiring enzyme-1

protein kinase, endonuclease, a regulator of the unfolded protein response, leading to the activation

of the transcription factor Xbp1 - regulated Ire1-dependent decay - regulation of RNA splicing - photoreceptor differentiation and rhabdomere morphogenesis

Inverse regulator a (common alternative name: Pcf11)

RNA binding motif protein - dismantles elongation complexes by a Pol II C-terminal domain (CTD) dependent mechanism - forms a bridge between the CTD and RNA

let-7 (preferred name: microRNA encoding gene let-7)

encodes an RNA species involved in translational silencing of target mRNAs

lin-28

cold shock and RNA-binding protein - regulator of developmental timing - regulator of microRNA maturation

loquacious

a component of a functional pre-miRNA processing complex - stimulates and directs pre-microRNA processing activity

mago nashi

novel protein involved in oogenesis - mutation results from the failure of nuclear migration to the anterodorsal cortex during oogenesis - part of the

exon junction complex, which is required for post-transcriptional processes such as pre-mRNA splicing, RNA localization

and nonsense-mediated decay - involved in germline development, germplasm assembly and photoreceptor differentiation

maelstrom

HMG box protein - spindle class protein - potential regulator of RNA processing or subcellular localization

maleless

DEAH-box subfamily ATP-dependent helicase

maternal expression at 31B

A DEAD-box helicase, part of a ribonuclear protein complex, that restricts translation of oocyte-localizing RNAs -

in neurons Me31B acts to promote translational repression and/or mRNA degradation in response to miRNAs

Meiosis regulator and mRNA stability factor 1

post-transcriptional effector domain that recruits CCR4-NOT deadenylase complex to shorten target mRNA poly-A tails and suppress

their translation - ensures proper oocyte maturation by regulating nanos expression - transition from meiosis I to II is compromised mutant oocytes

mei-P26

a conserved translational regulator that facilitate the switch from proliferation to differentiation - associates with miRNA

pathway components to represses the translation of target mRNAs - cooperates with Bam, Bgcn and Sxl

to promote early germline development in the Drosophila ovary - a target of Vasa in promoting stem cell differentiation

modulo

RRM-containing domain - modifier of PEV promoting chromatin compaction and inactivation - controls cellular growth rate downstream of dMYC

musashi

RNP-1 and RNP-2 motifs

muscleblind

RNA splice factor involved in terminal muscle and eye differentiation - mutants model features of myotonic dystrophy

nanos

RNA binding protein in oocyte - zinc finger protein

Negative elongation factor E

RNA-binding protein - along with other factors, NELF causes polymerase to pause in the promoter proximal region of heat shock genes

Nuclear polyadenosine RNA-binding 2

poly(A) RNA binding protein - functions in cytoplasmic control of neuronal mRNAs in conjunction with the fragile X protein ortholog dFMRP - patterns axon projection in the developing brain

Nucleolar protein at 60B

pseudouridylate synthase - enzymatically modifies ribosomal RNA - required for maintenance of germ-line stem cells

off-schedule

functions as translation initiation factor eIF4G during spermatogenesis to coordinate the initiation of meiotic division and differentiation

orb

RRM - RNA binding protein

orb2

cytoplasmic polyadenylation element-binding protein - RNA binding protein - forms amyloid-like oligomers enriched in the synaptic membrane - critical for the persistence of long-term memory

oskar

novel - assembles germ plasm - probably does not bind RNA directly

Pabp2

poly(A) binding protein - functions in nuclear polyadenylation - cytoplasmic PABP2 acts to shorten the poly(A) tails of specific mRNAs

Painting of fourth

RNA-binding protein that increases transcription output from chromosome 4, targets specific loci on the X chromosome

partner of drosha

double-stranded RNA-binding protein - essential for the biogenesis of canonical miRNAs - required for imaginal disc growth

P-element somatic inhibitor

splice factor that regulates the thermosensitive alternative splicing of timeless (tim) - AGO1 interacts with Psi to repress Myc

transcription and inhibit developmental cell and tissue growth - Psi interacts with the mediator complex to modulate MYC transcription - Psi interaction

with the U1 small nuclear ribonucleoprotein complex (snRNP) controls male courtship behavior by regulation splicing

for fruitless - regulates splicing of the P-element transposase pre-mRNA by binding a pseudo-splice site upstream of the authentic splice site

pelota

RNA binding motif protein - controls germ-line stem cell self-renewal by repressing a differentiation pathway, possibly through regulating translation

Pcf11 (preferred name: Inverse regulator a)

RNA binding motif protein - dismantles elongation complexes by a Pol II C-terminal domain (CTD) dependent mechanism - forms a bridge between the CTD and RNA

pitchoune

DEAD-box RNA helicase

polyA-binding protein

RNA-binding protein involved in translational regulation and nonsense-mediated mRNA decay

poly-U-binding splicing factor

RNA recognition motif protein - functions in both constitutive and alternative splicing - required during oogenesis

pre-mRNA processing factor 40

splice factor - regulates histone mRNA expression by modulating transcription - constituent of histone locus body, a chromatin-associated

nuclear body that associates with replication-dependent histone gene clusters - regulates alternative splicing of Neurexin IV

pumilio

novel - binds hunchback mRNA

rasputin

component of stress granules - interacts with several protein partners under both stress and non-stress conditions including Caprin, FMR1 and

Lingerer - Sec16, a component of the endoplasmic reticulum exit site, is a Rasputin interactor and stabilizer - a positive regulator

of orb in oogenesis - FMR1, Rasputin and Caprin act together with the UBA protein Lingerer to restrict tissue growth

r2d2

double-stranded RNA-binding protein - bridges the initiation and effector steps of the Drosophila RNAi pathway

by facilitating siRNA passage from Dicer, which carrys out the initiation step, to RISC, which carrys out the effector step

Salsa

a helicase - a component of the NineTeen Complex (NTC), also known as Pre-mRNA-processing factor 19 (Prp19) complex - regulates distinct spliceosome

conformational changes necessary for splicing, rate-limiting for splicing of a subset of small first introns during oogenesis, including the first intron of gurken

sans fille (also known as U1AsnRNP)

splicing factor - sex determination

Sex lethal

RNA binding and splicing

smaug

RNA binding protein - repressor of NOS translation

spenito

mRNA-binding factor - m(6)A methyltransferase complex component - sex determination - fat body - counteracts Split ends

function in fat regulation - stimulates axon outgrowth during neurosecretory cell remodeling - promotes Wingless signaling

spindle E

DE-H family of RNA-dependent ATPases

split ends

codes for RRM motif RNA-binding protein - mutants have defects in Notch and Egf receptor signaling resulting defects in cell-fate and in axon guidance

Spt6

transcription elongation factor implicated in RNA processing and degradation of improperly processed pre-mRNA

squid

hnRNP D homolog

staufen

Double stranded RNA binding protein

Stem-loop binding protein

required for replication-dependent histone mRNA metabolism - 3' end cleavage of replication-dependent histone pre-mRNAs - promotes a structural rearrangement of the

catalytically active U7-dependent processing complex, resulting in juxtaposition of an endonuclease with the cleavage site in the pre-mRNA substrate - oogenesis

survival motor neuron

RNA binding protein involved, along with Gemins, in the assembly of the small nuclear ribonucleoproteins that constitute the spliceosome -

neuromuscular junction protein required in both neurons and muscle for normal junctional morphology

Syncryp

mRNA binding protein that regulates localized translation during synaptic plasticity in the neuromuscular junction - regulates synaptic

output through regulation of retrograde BMP signaling - regulates of localized transcripts during axis specification

TAR DNA-binding protein-43 homolog

RNA-binding protein - regulation of synaptic efficacy and motor control - regulation of Futsch activity

at the neuromuscular junction - regulation of the robustness of neuronal specification through microRNA-9a -

a model for Amyotrophic lateral sclerosis (ALS), often referred to as Lou Gehrig's Disease

thoc5

a component of the conserved THO/TREX (transcription/export) complex involved in processing of nascent RNAs and mRNA export

from nucleus - splicing-independent loading on nascent RNA - piRNA biogenesis - male meiosis- regulation of p53 and PI3K/AKT signaling

trailer hitch

conserved protein involved in mRNA localization that interfaces with the secretory pathway to promote efficient protein trafficking in the cell

transformer

RNA splice factor - Sex determination - productively spliced in the presence of Sex lethal

transformer 2

RNA splice factor - along with Transformer directs female specific splicing of doublesex RNA

Tropomyosin I

cytoskeletal element - intracellular transport - links kinesin-1 in a strongly inhibited state to oskar mRNA - binds RNA

via its alternative cargo binding domain - an unusual product of the Tm1 locus, Tm1-I/C, resembles an intermediate

filament protein in some respects - Arp2/3 complex and cofilin, in turn, regulate the binding of tropomyosin to actin filaments

twin

degrades mRNA poly(A) tails - CCR4 componenet of an enzyme complex catalyzing mRNA deadenylation

U1snRNP preferred name: sans fille

splicing factor - sex determination

Upstream of N-ras

RNA-binding protein that interacts with Rox RNAs to repress dosage compensation complex formation in female

and promote its assembly on the male X chromosome

U2 small nuclear riboprotein auxiliary factor 50

splicing factor required for the ATP-dependent association of U2 snRNP with pre-mRNA branchpoints - forms a heterodimer

with the small splice factor U2af38 - U2af50 interacts with the intronic 3' polypyrimidine tract - the small subunit functions

in recognition of the 3' AG dinucleotide

vasa

maternal - RNA helicase

virilizer

transmembrane protein regulating Sex lethal splicing

vreteno

Tudor domain protein involved in regulation of Piwi-interacting RNAs (piRNAs) in gonadal tissues thus regulating mobile genetic elements

ypsilon schachtel

cold-shock domain protein involved in post-transcriptional regulation of Oskar mRNA

The architecture of pre-mRNAs affects mechanisms of splice-site pairing

The exon/intron architecture of genes determines whether components of the spliceosome recognize splice sites across the intron or across the exon. Using in vitro splicing assays, this study demonstrates that splice-site recognition across introns ceases when intron size is between 200 and 250 nucleotides. Beyond this threshold, splice sites are recognized across the exon. Splice-site recognition across the intron is significantly more efficient than splice-site recognition across the exon, resulting in enhanced inclusion of exons with weak splice sites. Thus, intron size can profoundly influence the likelihood that an exon is constitutively or alternatively spliced. An EST-based alternative-splicing database was used to determine whether the exon/intron architecture influences the probability of alternative splicing in the Drosophila and human genomes. Drosophila exons flanked by long introns display an up to 90-fold-higher probability of being alternatively spliced compared with exons flanked by two short introns, demonstrating that the exon/intron architecture in Drosophila is a major determinant in governing the frequency of alternative splicing. Exon skipping is also more likely to occur when exons are flanked by long introns in the human genome. Interestingly, experimental and computational analyses show that the length of the upstream intron is more influential in inducing alternative splicing than is the length of the downstream intron. It is concluded that the size and location of the flanking introns control the mechanism of splice-site recognition and influence the frequency and the type of alternative splicing that a pre-mRNA transcript undergoes (Fox-Walsh, 2005).

Pre-mRNA splicing is an essential process that accounts for many aspects of regulated gene expression. Of the ~25,000 genes encoded by the human genome, >60% are believed to produce transcripts that are alternatively spliced. Thus, alternative splicing of pre-mRNAs can lead to the production of multiple protein isoforms from a single pre-mRNA, exponentially enriching the proteomic diversity of higher eukaryotic organisms. Because regulation of this process can determine when and where a particular protein isoform is produced, changes in alternative-splicing patterns modulate many cellular activities (Fox-Walsh, 2005).

The spliceosome assembles onto the pre-mRNA in a coordinated manner by binding to sequences located at the 5' and 3' ends of introns. Spliceosome assembly is initiated by the stable associations of the U1 small nuclear ribonucleoprotein particle with the 5' splice site, branch-point-binding protein/SF1 with the branch point, and U2 snRNP auxiliary factor with the pyrimidine tract. ATP hydrolysis then leads to the stable association of U2 snRNP at the branch-point and functional splice-site pairing (Fox-Walsh, 2005).

Intron size has been correlated with rates of evolution and the regulation of genome size. The exon/intron architecture has also been shown to influence splice-site recognition. For example, increasing the size of mammalian exons results in exon skipping. However, the same enlarged exons are included when the flanking introns are small. Thus, splice-site recognition is more efficient when introns or exons are small. Because, in the human genome, the majority of exons are short and introns are long, it is expected that the vast majority of splice sites in the human genome are recognized across the exon. Lower eukaryotes have a genomic architecture that is typified by small introns and flanking exons with variable length, suggesting that splice-site recognition occurs across the intron. Consistent with this model, expansion of small introns in yeast or Drosophila causes loss of splicing, cryptic splicing, or intron retention. Taken together, these observations suggest that splice sites are recognized across an optimal nucleotide length (Fox-Walsh, 2005).

It is unknown whether splice-site recognition across the intron or across the exon results in similar efficiencies of spliceosomal assembly and/or splice-site pairing. This study demonstrates that splice-site recognition across the intron ceases when the intron reaches a length between 200 and 250 nt. Because splice-site recognition is more efficient across the intron, alternative splicing is less likely for exons flanked by short introns. This influence is supported experimentally and by computational analyses of Drosophila and human alternative-splicing databases. It is concluded that the size and location of the flanking introns control the mechanism of splice-site recognition and influence the frequency and the type of alternative pre-mRNA splicing (Fox-Walsh, 2005).

Previous studies have suggested that genes with small introns tend to be recognized across the intron, and genes with large introns are recognized across the exon. To determine the distance at which recognition of splice sites switches from cross-intron interactions to cross-exon interactions occurs, advantage was taken of an in vitro kinetic splicing assay that was originally used to demonstrate that exonic splicing enhancers (ESEs), discrete sequences within exons that promote both constitutive and regulated splicing, activate both splice sites of an exon simultaneously (Lam, 2002). A number of pre-mRNAs were designed with intron lengths ranging from 120 to 425 nt. Within each set, the pre-mRNAs differ only in the presence or absence of a well characterized 13-nt ESE derived from the Drosophila doublesex and Drosophila fruitless pre-mRNAs. Each pre-mRNA harbors the same weak 5' and 3' splice sites that require the activities of ESEs for recognition in their natural context (Tian, 1992; Lam, 2003). Because splicing factors present in HeLa cell nuclear extracts activate the ESEs used (Lam, 2003), the presence of functional or mutant enhancer elements within each test substrate determine its splicing efficiency. If the splice sites are recognized across the exon, it is expected that the activation of the splice sites on each exon constitutes a different step during spliceosomal assembly, because the ESE located on each exon will only aid in the recognition of its weak splice site. Thus, the activities of the separate ESEs are expected to display synergistic kinetics, because the activation of each ESE accelerates an independent step during spliceosomal assembly. However, if the splice sites are recognized across the intron, the ESE located on each exon will aid in the recognition of both weak splice sites, because the recruited spliceosomal components define the entire intron within one step. In this scenario, the activities of the separate ESEs are expected to display additive kinetics, because the activation of each ESE accelerates the same rate-limiting step during spliceosomal assembly (Fox-Walsh, 2005).

In vitro splicing assays were performed with each of the four pre-mRNA sets over a 3-h time course to determine the apparent rates of splicing. Pre-mRNAs with an intron size of 120 nt display additive kinetics. Using Drosophila nuclear extract (Kc), it was possible to demonstrate additive kinetics for substrates containing the 120-nt intron; however, it was not possible to detect sufficient splicing for the substrates containing longer introns. These results are consistent with in vitro studies demonstrating that splicing of pre-mRNAs with long introns is supported in HeLa nuclear extract but not in Kc extract. The kinetics of pre-mRNAs containing an intron 200 nt or less in length are additive. This behavior indicates that the spliceosomal components required for the recognition of both splice sites are recruited to the intron simultaneously. However, constructs with introns >200 nt demonstrate synergistic kinetics. It is concluded that the change from splice-site recognition across the intron to splice-site recognition across the exon occurs when the intronic length is between 200 and 250 nt (Fox-Walsh, 2005).

The kinetic analysis demonstrates that the upstream 5' splice site and the downstream 3' splice site are recognized simultaneously across introns <200 nt. Significantly, in the absence of ESEs, splice-site recognition across the intron is a much more efficient process than splice-site recognition across the exon. Thus, splice-site recognition across the intron may be able to rescue the inclusion of internal exons harboring weak splice sites. To test this hypothesis, a series of pre-mRNA substrates containing three exons was designed for in vitro splicing analysis in which the internal exon contains splice sites that are insufficiently recognized in the absence of ESEs. The four substrates generated differed only in their ability to be recognized across each intron by changing the length of the intron from <200 to >250 nt, thus permitting or discouraging splice-site recognition across the intron. As expected, the internal exon is predominantly excluded when flanked by two long introns. However, significant inclusion of the internal exon is observed if one of the flanking introns is short enough to support splice-site recognition across the intron. In fact, two short introns increase exon inclusion ~30 times greater than two long introns (Fox-Walsh, 2005).

To estimate the fractions of splice sites that may be recognized through cross-intron interactions, the flanking-intron lengths were recored for every internal exon within the human and Drosophila genomes. Genome information was obtained from the Alternative Splicing Database (ASD), which contains information about the exon/intron structure and EST-verified alternative-splicing events of several thousand genes. Within the human genome, many exons are flanked by at least one short intron, creating two separate populations, separated roughly by the intron length that is proposed to represent the transition of splice-site recognition from across the intron to across the exon. As expected from previous intron-length analyses, a very different distribution is seen in the Drosophila genome, where ~85% of exons are flanked by at least one short intron. An overlay of the Drosophila and human genomes demonstrates that the minimum intron length in the human genome is at the same location that demarcates the maximum intron length of the major Drosophila exon population. This difference in genome constraint may reflect specific compositional variations between the Drosophila and human spliceosomes (Fox-Walsh, 2005).

Because splice-site recognition across the intron rescues exon inclusion, how intron length influences alternative splicing within the Drosophila and human genomes was investigated. To do so, the flanking-intron information of each exon was correlated with exon-skipping and alternative-splice-site-activation events reported in the ASD to compute the probability that an exon is involved in alternative splicing, without taking into consideration the contributions made by splice-site signal strength and splicing enhancers or silencers. Thus, the correlation simply tests whether the influence of the exon/intron architecture on alternative splicing is significant enough to be detectable amid all other splicing determinants. Computational analysis of the Drosophila genome supports a significant role for intron length in defining the likelihood of alternative splicing. A striking influence of the exon/intron architecture is observed for simple exon-skipping events. Exons flanked by very long introns are up to 90-fold more likely to be skipped than exons that are flanked by two short introns. Significantly, the most drastic increase in the probability of alternative splicing (>10-fold) was observed when the length of flanking introns increased from 225 to 525 nt. In agreement with the experimental results, a greater probability that an exon is alternatively spliced was observed when the upstream intron is long. This polarity could be the consequence of coupling pre-mRNA splicing to transcription by RNA polymerase II. Even in the category of alternative 5' or 3' splice-site activation, alternative splicing is up to 10-fold more likely for exons that are flanked by long introns. It is concluded that, in Drosophila, exon skipping is a rare event for exons flanked by short introns and that the length of the upstream intron is of greater importance than the length of the downstream intron in determining whether an exon will be involved in exon skipping (Fox-Walsh, 2005).

Within the human genome, a similar correlation between the exon/intron architecture and the probability of exon skipping is observed; however, the ~5-fold maximal variance calculated is significantly lower than that observed for Drosophila. As for Drosophila, the length of the upstream intron is more important in determining the frequency of alternative splicing. In the case of alternative 5' or 3' splice-site usage, the opposite distribution of alternative splicing is seen in the human genome. The activation of alternative splice sites is less likely if the flanking introns are long. It is concluded that exon/intron architecture influences the frequency and type of alternative splicing that an exon may undergo in the Drosophila and human genomes (Fox-Walsh, 2005).

These experiments support the existence of two different mechanisms for splice-site recognition, splice-site recognition across the intron, and splice-site recognition across the exon. Splice-site recognition across the intron ceases when the intron size reaches the threshold length of >200 nt. Importantly, splice-site recognition across the intron is more efficient and increases the inclusion of exons with weak splice sites. These results demonstrate that the distance between splice sites affects efficient spliceosomal assembly. Presumably, the pairing of cross-exon-defined splice sites requires the interaction between two sets of pre-spliceosomes across an intron of variable length. In contrast, splice-site recognition across the intron already identifies the splice sites that will be paired. It is also possible that the kinetics of splice-site pairing are slowed because longer introns associate with an increased number of hnRNP proteins. HnRNP proteins coat nascent pre-mRNAs and are thought to interfere with the splicing reaction. Therefore, larger introns may reduce splicing by decreasing the relative concentration of splicing components through competition with hnRNPs (Fox-Walsh, 2005).

Additive kinetics of splice-site activation demonstrate that splice-site recognition across the intron is achieved through the recruitment of a multicomponent complex that contains components of the splicing machinery required for 5' and 3' splice-site definition. Interestingly, the activation of a single ESE results in a significant increase in splicing activity, suggesting that ESEs influence splice-site activation of adjacent exons. As anticipated from ESE distance/activity correlations, this effect depends on intron length. Given the unique combination of splice sites and cis-acting elements, it is possible that the precise transition from splice-site recognition across the intron to splice-site recognition across the exon may vary for different substrates. The presence of strong splice sites and enhancers or silencers could modulate the cross-intron recognition by increasing or decreasing the strength of interaction between spliceosomal components and the pre-mRNA (Fox-Walsh, 2005).

The observation that increasing exon length decreases exon inclusion suggests that similar distance limitations exist for splice-site recognition across the exon. Approximately 80% of human exons are <200 bp in length, the average being 170 bp. Importantly, exon length is tightly distributed when compared with intron length. These results demonstrate that maintaining exon size in the human genome is more important to the architecture and evolution of a gene than is maintaining intron size. In contrast to the human genome, exon size varies much more than intron size in yeast. The maximum intron length of 182 nt lies well within the size limitations of splice-site recognition across the intron. Taken together, these considerations support the notion that the majority of splice sites in higher eukaryotes are recognized across the exon, whereas lower eukaryotes employ splice-site recognition across the intron (Fox-Walsh, 2005).

It is well established that several types of exon and intron elements influence splice-site choice. The most prominent include the exon/intron junction signals and splicing enhancers and silencers. The results show that the exon/intron architecture is an additional parameter that affects the efficiency of splice-site recognition and alternative pre-mRNA splicing. When compared in otherwise isogenic test substrates, splice-site recognition across the intron rescues the inclusion of a weak internal exon by >10-fold. Even though the computational analysis ignores the contributions made by variable splice sites, enhancers, and silencers, a striking increase in the probability of alternative splicing is observed for Drosophila exons, whose splice sites are recognized across the exon. Thus, the exon/intron architecture in Drosophila is a major determinant in governing the probability of alternative splicing. Within the human genome, a qualitatively similar trend was observed for exon-skipping events but with a reduced magnitude. One major difference between the Drosophila and human gene architecture is intron length. Human genes are dominated by long introns (87% of introns are >250 nt), whereas short introns are much more common in Drosophila (66% are <250 nt). One possible explanation for the small intron size in Drosophila could be the pressure to maintain a constrained genome size in these fast-replicating organisms (Fox-Walsh, 2005).

Alternative splicing is extensive in both species, supporting the argument that both species benefit from expanded proteomes generated from alternative splicing. However, genome analysis suggests that there are significant differences in the weight of the mechanisms by which alternative splicing can be induced. In Drosophila, intron length is a major determinant in promoting alternative splicing patterns. In the human, additional mechanisms of controlling alternative splicing may have gained more influence on intron expansion to maintain balanced levels of alternative splicing (Fox-Walsh, 2005).

The RNA-binding protein Rasputin/G3BP enhances the stability and translation of its target mRNAs

G3BP RNA-binding proteins are important components of stress granules (SGs). This study analyze the role of the Drosophila G3BP Rasputin (RIN) in unstressed cells, where RIN is not SG associated. Immunoprecipitation followed by microarray analysis identifies over 550 mRNAs that copurify with RIN. The mRNAs found in SGs are long and translationally silent. In contrast, it was found that RIN-bound mRNAs, which encode core components of the transcription, splicing, and translation machinery, are short, stable, and highly translated. RIN was shown to be associated with polysomes and evidence was provided for a direct role for RIN and its human homologs in stabilizing and upregulating the translation of their target mRNAs. It is proposed that when cells are stressed, the resulting incorporation of RIN/G3BPs into SGs sequesters them away from their short target mRNAs. This would downregulate the expression of these transcripts, even though they are not incorporated into stress granules (Laver, 2020).

Post-transcriptional regulation (PTR) plays a key role in the control of gene expression in all cell types. PTR is achieved by RNA-binding proteins (RBPs) and small RNAs, such as microRNAs (miRNAs), which act as specificity factors that modulate the interaction of mRNAs with the cellular machinery that localizes, translates, and degrades mRNAs (Laver, 2020).

PTR is particularly important in early animal embryos, where maternally provided mRNAs and proteins control developmental events prior to transcriptional activation of the embryo's genome. In several model animals, including Drosophila, the transfer of control from maternal products to those synthesized by the embryo's own genome -- the maternal-to-zygotic transition (MZT) -- is very rapid, occurring over a matter of hours, and thus facilitating studies of the mechanisms and functions of PTR. For example, it has been shown that the Drosophila RBP Smaug (SMG), which binds to specific stem-loop structures in its target mRNAs to repress their translation and trigger their degradation, is essential for repression and clearance of hundreds of maternal mRNAs and for timely activation of the zygotic genome. SMG is not the only negative regulator of maternal transcripts in Drosophila; additional RBPs (e.g., brain tumor) or miRNAs (e.g., miR-309) function in maternal mRNA clearance (Laver, 2020).

PTR also serves as a rapid response to cellular stress. Under stress conditions, cells shut down translation of many mRNAs while upregulating transcription and/or translation of sets of protein chaperones that maintain basal cellular integrity. Repression occurs, at least in part, in membraneless organelles known as stress granules (SGs). SGs are thought to contain transcripts that are stalled in translation initiation, and recent global analyses have shown that SGs are enriched for long transcripts (Laver, 2020).

An important component of SGs is the RBP G3BP, which is conserved throughout eukaryotes. Mammals have two genes, G3BP1 and G3BP2, whereas in Drosophila there is a single gene, Rasputin (rin). In human cells, G3BPs are necessary for SG formation, and if overexpressed, they are sufficient to induce SGs even in the absence of stress. RIN is necessary for SG formation in the Drosophila S2 tissue culture cell line, and although RIN or G3BP overexpression can induce SGs in human cells, this is not the case in S2 cells. RIN and G3BPs interact with several of the same protein partners under both stress and non-stress conditions; these include Caprin (CAPR), FMRP (FMR1 in Drosophila), and UPA2 (Lingerer [LIG] in Drosophila) (Laver, 2020 and references therein).

The roles of RIN/G3BPs in unstressed cells have received less attention than their roles upon stress. Multiple functions have been attributed to G3BPs, including transcript destabilization and repression (e.g., c-myc, BART, CTNNB1, PMP22, HIV-1, and miR-1), transcript stabilization (e.g., Tau and SART3), subcellular transcript localization (e.g., Twist1), and transcript sequestration into virus- induced foci (e.g., HIV-1). In Drosophila, mutations in the rin gene cause severe defects in oogenesis, mutant females lay few eggs, and those that are laid fail to hatch. rin mutations can also result in tissue patterning and growth defects. Despite RIN's essential function in the fly life cycle, there have been no analyses of the RIN-bound transcriptome or RIN's global role in gene regulation, nor are the molecular mechanisms that underlie RIN function known (Laver, 2020).

To better understand the function of RIN/G3BP in unstressed cells, a global analyses of the RIN-associated proteome and transcriptome was carried out in early Drosophila embryos. Using an anti-RIN synthetic antibody that was isolated from a phagedisplayed library of fragments antigen binding (Fab), RIN was immunoprecipitated and then mass spectrometry (IP-MS) was carried out to identify RIN's partner proteins. Interactions were found with several RBPs previously shown to interact with G3BP/RIN (e.g., CAPR, FMR1, and LIG), consistent with IP of a biologically relevant RIN-containing complex. RNA-dependent interactions with RIN were found for both small and large ribosomal subunit proteins, suggesting that RIN may be polysome associated, a fact that that was confirmed using polysome gradients. By coimmunoprecipitating RIN together with bound mRNAs followed by microarray analysis, hundreds of in vivo target transcripts were identified in embryos that are characterized by two features: they are short and enriched for a binding motif that was previously identified in vitro. RIN-associated mRNAs are enriched for Gene Ontology (GO) terms for core components of the transcription, splicing, and translation machinery as well as of mitochondria. RIN's endogenous targets in early embryos are more stable and more highly translated than co-expressed unbound transcripts. Their shorter length and higher rates of translation contrast with the behavior of mRNAs associated with SGs. Consistent with a role for RIN as a positive regulator of transcript stability, in rin mutants, the abundance of several highly bound target mRNAs is reduced relative to controls. Using a heterologous RNA-binding domain to tether RIN, G3BP1, or G3BP2 to a luciferase reporter mRNA in S2 tissue culture cells, it was confirmed that RIN/G3BP increases the stability and/or translation of bound transcripts in the absence of stress (Laver, 2020).

The data support a conserved function for G3BP proteins as potentiators of the translation and stability of their target transcripts. It is speculated that stress-dependent recruitment of G3BPs/RIN into SGs may serve as a mechanism to downregulate gene expression both directly, by removing RIN from its endogenous target mRNAs, as well as indirectly, through reduced transcription, splicing, and translation (Laver, 2020).

Two features, short transcript length and the motif bound by the RIN RRM in vitro, are predictive of RIN binding. That short transcripts in general might be more likely to contain the motif is excluded by the fact that RIN's target mRNAs are enriched for the motif compared with length-matched, co-expressed, unbound mRNAs. Thus, it is proposed that each feature separately contributes to RIN target mRNA binding, with shortness playing a greater role than the motif (Laver, 2020).

Although it is unclear how RIN is able to measure transcript length, there is a precedent for the differential behavior and regulation of short versus long mRNAs. For example, in general, short mRNAs are more highly translated than long mRNAs, and this is thought to reflect the fact that short mRNAs have a higher affinity for the cap-binding complex. It has been proposed that this higher affinity is related to the possibility that short mRNAs are able to form a closed-loop structure more readily than longer mRNAs. It is noted that a recent study that calls the closed-loop model into question was unable to test short mRNAs because of technical limitations. Thus, even if long mRNAs do not form a stable closed loop, it remains possible that short mRNAs do (Laver, 2020).

The ribosome-associated protein RACK1 has been shown to be required for the efficient translation of short but not long mRNAs. Although this study has shown that RIN is associated with polysomes in early embryos, Drosophila RACK1 is not on the list of RIN protein interactors and neither is there a significant overlap between the RIN protein interaction network and a set of protein interactors previously identified for RACK1. Thus, the mechanisms by which RIN recognizes short mRNAs may differ from those that have been identified previously (Laver, 2020).

Another striking feature of RIN-bound mRNAs is that they are depleted for SREs, the binding sites of SMG, which destabilizes and translationally represses its target mRNAs. This could suggest that the high translational efficiencies and stability of RIN-target mRNAs is simply a result of a lack of SREs. However, this study has provided evidence that RIN can exert its effects in situations where the SMG protein is not present (i.e., during oogenesis, in embryos older than 3 h, and in S2 cells). Thus, it is concluded that most RIN-bound mRNAs in the early embryo are upregulated directly by RIN and indirectly through a lack of SREs (Laver, 2020).

Because the target mRNAs of RIN are enriched for GO terms related to multiple levels of the core gene expression machinery-transcription, splicing, and translation-RIN may directly potentiate the expression of its bound target transcripts and, in so doing, also indirectly upregulate gene expression globally. As an example of how RIN/G3BP's direct and indirect effects might converge on the same cellular process, the production of cytoplasmic ribosomes is considered. Metabolic labeling with radioactive amino acids has shown that cytoplasmic ribosomal protein (cRP) synthesis increases after fertilization, peaks at 3-4 h, and subsequently decreases. Recent ribosome occupancy-based measurements have confirmed that the translational efficiency of cRP mRNAs increases in early embryos relative to mature oocytes. This study has shown that RIN potentiates target mRNA stability and translation and that cRP mRNAs are highly enriched among RIN's targets; thus, the potentiation of cRP mRNAs is expected to be a direct effect of RIN (Laver, 2020).

cRP mRNAs are also regulated by their conserved 50-terminal oligopyrimidine (50TOP) motifs. Two RBPs, La and Larp1, have been implicated in regulation of the stability and/or translation of 50TOP mRNAs by this motif. This study shows that the Drosophila orthologs of both of these RBPs-LA and LARP-associate with RIN in an RNA-dependent manner. This could reflect the fact that LA, LARP, and RIN co-bind cRP mRNAs and, thus, that this class of mRNAs is subject to multiple direct mechanisms that potentiate its expression. Noteworthy is the fact that, of the RIN target mRNAs, only the cRP transcripts carry 50TOP motifs and, as such, they represent a distinct class of mRNAs (Laver, 2020).

Necessity of Flanking Repeats R1' and R8' of Human Pumilio1 Protein for RNA Binding

Human Pumilio (hPUM see Drosophila Pumilio) is a structurally well-analyzed RNA-binding protein that has been used recently for artificial RNA binding. Structural analysis revealed that amino acids at positions 12, 13, and 16 in the repeats from R1 to R8 each contact one specific RNA base in the eight-nucleotide RNA target. The functions of the N- and C-terminal flanking repeats R1' and R8', however, remain unclear. This study reports how the repeats contribute to overall RNA binding. The first step was to prepare three mutants in which R1' and/or R8' were deleted and then analyzed RNA binding using gel shift assays. The assays showed that all deletion mutants bound to their target less than the original hPUM, but that R1' contributed more than R8', unlike Drosophila PUM. It was further investigated which amino acid residues of R1' or R8' were responsible for RNA binding. With detailed analysis of the protein tertiary structure, this study found a hydrophobic core in each of the repeats. All hydrophobic amino residues in each core to alanine were mutated during this process. The gel shift assays with the resulting mutants revealed that both hydrophobic cores contributed to the RNA binding: especially the hydrophobic core of R1' had a significant influence. In the present study, it was demonstrated that the flanking R1' and R8' repeats are indispensable for RNA binding of hPUM and suggest that hydrophobic R1'-R1 interactions may stabilize the whole hPUM structure (Nakamura, 2021).

The translational repressor Glorund uses interchangeable RNA recognition domains to recognize Drosophila nanos

The Drosophila melanogaster protein Glorund (Glo) represses nanos (nos) translation and uses its quasi-RNA recognition motifs (qRRMs) to recognize both G-tract and structured UA-rich motifs within the nos translational control element (TCE). It has been shown previously that each of the three qRRMs is multifunctional, capable of binding to G-tract and UA-rich motifs, yet if and how the qRRMs combine to recognize the nos TCE remained unclear. This study determined solution structures of a nos TCEI_III RNA containing the G-tract and UA-rich motifs. The RNA structure demonstrated that a single qRRM is physically incapable of recognizing both RNA elements simultaneously. In vivo experiments further indicated that any two qRRMs are sufficient to repress nos translation. interactions of Glo qRRMs were probed with TCEI_III RNA using NMR paramagnetic relaxation experiments. The in vitro and in vivo data support a model whereby tandem Glo qRRMs are indeed multifunctional and interchangeable for recognition of TCE G-tract or UA-rich motifs. This study illustrates how multiple RNA recognition modules within an RNA-binding protein may combine to diversify the RNAs that are recognized and regulated (Warden, 2023).

Specific recognition of target RNAs by RNA-binding proteins is essential for gene control and often achieved by modular combination of multiple RNA-binding domains. Tandem RNA-binding domains may arise from duplication to allow recognition of longer linear RNA sequences or more complex structures than a single domain can accomplish. For example, the tandem zinc fingers of Tristetraprolin each recognize a UAUU sequence within the UUAUUUAUU AU-rich element (ARE). Duplicated domains also may diverge to recognize distinct sequence or structural elements. In the case of HuD protein, its two N-terminal RNA recognition motif (RRM) domains both recognize UU elements in AREs, but RRM1 also interacts with four additional 5' flanking nucleotides. Although amino acid sequences of RNA-binding domains can indicate whether they retain the characteristic RNA recognition features of that domain family, it remains impossible to predict most domain-specific variations. Hence, the specificity of RNA-binding domains must be determined experimentally. Each study expands knowledge of sequence features that drive specificity and improves prediction methods to identify target RNAs (Warden, 2023).

During Drosophila development, translational control of nanos (nos) mRNA is essential for the formation of the anterior-posterior body axis of the embryo. This process begins in the ovary when nos mRNA is transferred to oocytes from nurse cells during late oogenesis and translationally repressed until it is localized to the posterior. The translational repression of unlocalized nos mRNA is mediated by a translational control element (TCE) in the 3' UTR of nos mRNA. nos TCE RNA is composed of one stem (TCEI) and two stem loop structures (TCEII and TCEIII). During oogenesis, binding of TCEI and TCEIII by the protein Glorund (Glo) controls Nos expression. This protein:RNA interaction is essential to repress unlocalized nos mRNA during oogenesis, which ultimately ensures the formation of functional anterior structures in the fly embryo. Repression of nos initiated by Glo during oogenesis is also maintained in the early embryo by another RNA-binding protein, Smaug, which recognizes TCE stem loop II (Warden, 2023).

Glo is an RNA-binding protein belonging to the heterogenous nuclear ribonucleoprotein F/H (hnRNP F/H) family. Mammalian hnRNP F and H are predominantly nuclear and best known as regulators of alternative splicing. Additionally, hnRNP F and H1 are abundant in regenerating axons, where they bind to mRNAs involved in axonal growth and are thought to regulate their axonal transport and/or translation. Notably, mammalian hnRNP F and H proteins are upregulated in a variety of cancers and are thought to contribute to pathogenicity in part by altering translation. Like hnRNPs F/H, Glo contains three tandem quasi-RNA recognition motifs (qRRMs). It was previously demonstrated that the Glo qRRMs are multifunctional, each capable of binding to G-tract and structured UA-rich features of the nos TCE. The G-tract sequence is found in TCEI and the structured UA-rich motif in TCEIII. Both features are essential for TCE function and Glo recognition, yet the affinity for G-tract binding is substantially higher than UA-rich motif binding. Disruption of either RNA motif results in modestly increased Nos protein, but mutation of both RNA motifs dramatically increases Nos protein levels. Moreover, engineered mutations that preferentially disrupt Glo qRRM binding to either the G-tract or UA-rich motif demonstrate that nos repression requires both modes of Glo-TCE recognition in vivo. How the tandem Glo qRRMs combine to recognize the G-tract and UA-rich motifs of TCE RNA remained unanswered (Warden, 2023).

This study presents a high-resolution solution structure of an RNA containing the elements recognized by Glo: nos TCEI and TCEIII, which is called in this study TCEI_III. The divide-and-conquer method, a well-established and effective approach combining structural biology methods, was used to build a larger macromolecular model from high-resolution structures of its components. This divide-and-conquer method allowed this study to avoid the technical difficulties of crystallizing structured RNAs without engineered crystal contacts such as tetraloop/tetraloop insertions. First a solution structure of TCEIII RNA was determined by NMR. Then the TCEIII structure along with NMR data for TCEI_III RNA and overall shape information from small-angle X-ray scattering (SAXS) were used to produce the hybrid TCEI_III model. This model suggested that more than one Glo qRRM is necessary for TCE recognition, and in vivo experiments indicated that a minimum of two qRRMs is sufficient for nos repression. Furthermore, NMR paramagnetic relaxation experiments (PREs) were used to probe the interactions between the G-tract and UA-rich motifs of TCE RNA and individually-labeled Glo qRRMs. Our in vitro and in vivo data are consistent with a model where the Glo qRRMs are indeed multifunctional and interchangeable for recognition of nos TCE G-tract or UA-rich motifs (Warden, 2023).

Combinations of RNA recognition modules generate the diverse specificities of RNA-binding proteins needed to control gene expression. RRM or KH domains often occur in tandem and together generate distinct RNA recognition specificity to select target mRNAs. Although much information is available about the specificities of RNA-binding domains, including high-throughput screening of sequence specificity, there relatively little information on how multiple RNA-binding modules bring their individual specificities to RNA target selection. This study demonstrates that at least two of the three multifunctional Glo qRRMs are required to recognize the nos TCE RNA, one for the TCEIII UA-rich stem and one for the TCEI G-tract. Therefore, the efficiency of having two RNA-binding modes in one qRRM is not utilized: one domain does not simultaneously recognize both RNA features. It was also found that the three qRRMs are interchangeable: any two of the three Glo qRRMs are sufficient in vivo to repress nos translation and maintain viability. Although qRRMs 1 and 2 are closely spaced with 8 residues between the C-terminal α helix of qRRM1 and the N-terminal β strand of qRRM2, this length linker appears sufficient to bridge the distance between G-tract and UA-rich elements. Alternatively, the C-terminal α helix of qRRM1 could unfold upon RNA binding, as seen for the RRM of Nop15 when it binds to pre-rRNA. Finally, it does not matter which RNA feature is recognized by the two qRRMs. This striking redundancy of the Glo qRRMs seems excessive, as some specialization would have been expected to have evolved (Warden, 2023).

Several evolutionary benefits are suggested from having three redundant multifunctional qRRMs in a single protein. (i) Functional redundancy protects crucial gene regulation. Repression of nos translation during oogenesis is essential to ensure correct anterior patterning in the embryo, yet mutation of one qRRM avoids a biological disruption as nos expression is normal in embryos. Similarly, although the target RNA(s) are not known, Glo with two intact qRRMs supports viability to adulthood. (ii) Redundancy gives Glo the potential to recognize a variety of target RNAs with different combinations of G-tract and UA-rich features. In addition, the G-tracts may be either single-stranded or base-paired. Previous work has demonstrated that Glo qRRMs bind to short single-stranded G-tracts, and this study observed visible imino proton resonances for the G-tract sequence (G87, G88, and G89) in the TCEI_III RNA, indicating that the G-tract remains base paired in the presence of Glo. Therefore, Glo qRRMs appear to differ from hnRNP F qRRMs that disrupt RNA structures when binding to G-tracts. Although the in vivo and in vitro evidence indicates the importance of base pairing for TCE activity, it will require additional structural studies to determine how Glo qRRMs specifically recognize duplex RNAs. (iii) Functionally redundant qRRMs with different spacing between the qRRMs adds additional opportunities for distinct RNA recognition. The linker between Glo qRRMs 1 and 2 is short (∼8 aa) whereas qRRMs 2 and 3 are separated by a long linker (∼240 aa). Although the distance between qRRMs does not affect nos regulation as any two qRRMs are sufficient, the spacing may alter specificity for other RNAs. For example, qRRMs 1 and 2 would require RNAs with G-tract and/or UA-rich motifs near each other whereas the long linker between qRRMs 2 and 3 should make that two-domain combination less restrictive in arrangement of G-tract and/or UA-rich motifs in target RNAs. To appreciate the range of specificity and roles of its multifunctional qRRMs, additional RNAs that are controlled by Glo will need to be identified (Warden, 2023).

Adding to its multifunctionality, Glo also binds to other proteins. Glo was recently shown to interact with Drosophila Fragile X mental retardation protein (dFMRP) to stall ribosomes and decrease the translation elongation rate on nos mRNA. Although this study has not identified differences in RNA recognition, Glo qRRMs do have different protein binding specificity. Glo interacts with dFMRP via qRRM2, but qRRM1 and qRRM3 do not directly interact with dFMRP. It is tempting to speculate that interaction between qRRM2 and dFMRP alters RNA recognition, and that other protein interactions could modify RNA recognition by the other qRRMs to generate specialization (Warden, 2023).

References

Fox-Walsh, K. L., et al. (2005). The architecture of pre-mRNAs affects mechanisms of splice-site pairing. Proc. Natl. Acad. Sci. 102(45): 16176-81. PubMed ID: 16260721

Lam, B. J. and Hertel, K. J. (2002). A general role for splicing enhancers in exon definition. RNA 8(10): 1233-41. PubMed ID: 12403462

Laver, J. D., Ly, J., Winn, A. K., Karaiskakis, A., Lin, S., Nie, K., Benic, G., Jaberi-Lashkari, N., Cao, W. X., Khademi, A., Westwood, J. T., Sidhu, S. S., Morris, Q., Angers, S., Smibert, C. A. and Lipshitz, H. D. (2020). The RNA-binding protein Rasputin/G3BP enhances the stability and translation of its target mRNAs. Cell Rep 30(10): 3353-3367. PubMed ID: 32160542

Nakamura, K., Nakao, T., Mori, T., Ohno, S., Fujita, Y., Masaoka, K., Sakabayashi, K., Mori, K., Tobimatsu, T. and Sera, T. (2021). Necessity of Flanking Repeats R1' and R8' of Human Pumilio1 Protein for RNA Binding. Biochemistry 60(40): 3007-3015. PubMed ID: 34541851

Tian, M. and Maniatis, T. (1992). Positive control of pre-mRNA splicing in vitro. Science 256(5054): 237-40. PubMed ID: 1566072

Warden, M. S., DeRose, E. F., Tamayo, J. V., Mueller, G. A., Gavis, E. R. and Hall, T. M. T. (2023). The translational repressor Glorund uses interchangeable RNA recognition domains to recognize Drosophila nanos. Nucleic Acids Res. PubMed ID: 37427795

Drosophila genes listed by biochemical function

The Interactive Fly resides on the
Society for Developmental Biology's Web server.