ReviewNoncoding RNAs

Outside the coding genome, mammalian microRNAs confer structural and functional complexity

See allHide authors and affiliations

Sci. Signal.  17 Mar 2015:
Vol. 8, Issue 368, pp. re2
DOI: 10.1126/scisignal.2005813


Protein-coding genes are regulated posttranscriptionally by microRNAs (miRNAs). These small noncoding RNAs add substantial complexity to the molecular networks that underlie development, physiology, and disease. Many miRNAs exhibit unique gene structures, frequent functional redundancy, and complex biogenesis regulation, which contribute to their diversity and versatility. In this review, which contains 5 figures, 2 tables, and 159 references, we discuss how these unique features of miRNAs provide insights into the functional complexity of mammalian genomes.


MicroRNAs (miRNAs) comprise a class of small, regulatory noncoding RNAs (ncRNAs) with pivotal roles in posttranscriptional gene regulation. Since their initial discovery in 1993, numerous miRNAs have been identified in mammalian genomes, many of which play important roles in diverse cellular processes in development and disease. These small ncRNAs regulate the expression of many protein-coding genes posttranscriptionally, thus adding a substantial complexity to the molecular networks underlying physiological development and disease. In part, this complexity arises from the distinct gene structures, the extensive genomic redundancy, and the complex regulation of the expression and biogenesis of miRNAs. These characteristics contribute to the functional robustness and versatility of miRNAs and provide important clues to the functional significance of these small ncRNAs. The unique structure and function of miRNAs will continue to inspire many to explore the vast noncoding genome and to elucidate the molecular basis for the functional complexity of mammalian genomes.


In comparative genomic studies, the number of protein-coding genes within a given genome does not correlate well with the developmental and pathological complexity of the organism (1). With the identification of numerous transcripts from the noncoding genome, which greatly exceeds protein-coding genes in number and diversity, recent studies have led to a reassessment of genomic information content (2, 3). For example, more than 80% of the human genome can be transcribed, yet only less than 2% of our genome contains protein-coding capacity (2). It is increasingly clear that the diversity of the noncoding genome correlates well with the functional complexity of a given organism (1). The vast noncoding transcriptome contains numerous noncoding RNAs (ncRNAs) that act as integral components of the molecular networks in development and disease (4). Although we are only starting to understand the realm of ncRNA biology, a frequent mode of action for ncRNAs is to form RNA-protein complexes to regulate gene expression at the level of gene transcription, RNA processing, RNA degradation, and protein translation (5). It is most likely that the base pairing between ncRNAs and other nucleic acids confers the specificity of such gene regulation (5).

One of the most studied classes of ncRNAs is microRNAs (miRNAs), a class of small regulatory ncRNAs with pivotal roles in posttranscriptional gene regulation (69). Nascent transcripts from miRNA genes contain one or multiple stem-loop structure(s), which are processed sequentially by the microprocessor complex (DGCR8/Drosha) and Dicer to yield mature miRNA duplexes (10). One strand of the mature miRNA duplex is incorporated into the effector complex, RISC (RNA-induced silencing complex), which recognizes multiple mRNA targets through imperfect base pairing, and mediates posttranscriptional repression through combined mechanisms of mRNA degradation and translational repression (11).

Since the identification of the first miRNA using forward genetics assays in worms (12, 13), more than 2000 miRNAs have been identified in human cells to date, regulating nearly all essential cellular processes in development and disease. The biogenesis of miRNAs, the regulation of posttranscriptional gene regulation by miRNAs, and the biological functions of specific miRNAs are the subject of multiple extensive reviews (911, 1417), which we will not reiterate here.

Despite the relatively short history of the miRNA field, both miRNA antagonists and miRNA mimics have exhibited therapeutic potential (1820) and recently entered clinical trials for treating human diseases. Nevertheless, efforts to characterize mammalian miRNA functions in vivo have not been straightforward because many individual miRNA knockout mouse strains do not exhibit strong developmental phenotypes (21). Here, we will focus on the particular functionality of mammalian miRNAs conferred by their distinct gene structure, genomic organization, and expression regulation, and will discuss the challenge we face to elucidate the functional significance of these small ncRNAs.

Functional Importance of miRNAs in Mammalian Development and Physiology

miRNAs exhibit unique gene structure and genome organization that distinguish them from most protein-coding genes in the mammalian genomes. One of the central questions in the miRNA field is to what extent these small ncRNAs play an essential or important role in development and disease. Targeted deletion of key miRNA biogenesis machineries in mice, including Dgcr8 and Dicer, invariably leads to profound developmental defects in many cell types and contexts (2224). Mutations of miRNA biogenesis machineries have also been identified in pathological conditions, such as several cancer types. In addition, it is increasingly clear that numerous signaling pathways that establish cell fate and/or function—including the Hippo pathway, the transforming growth factor β (TGF-β) pathway, estrogen receptor (ER) signaling, the p38 mitogen-activated protein kinase (MAPK) pathway, and the p53-dependent DNA damage response—converge on controlling global or specific miRNA biogenesis (2527). This regulation of miRNA synthesis in part involves a direct interaction between microprocessor-associated helicase proteins, p68 and p72, and a component in each of the aforementioned signaling pathways (2527). Thus, these findings clearly indicate the functional importance of miRNAs as a whole in diverse developmental and pathological processes.

The initial, systematic functional characterization of individual miRNAs was performed in Caenorhabditis elegans, in which most individual miRNA knockout worms are viable without obvious developmental defects (28). A subsequent study has focused on redundant miRNA families and generated mutant worm strains deficient for all members of each family (29). Yet 12 of 15 miRNA family mutations fail to generate obvious developmental defects (29), leading some to question the extent of functional importance of redundant miRNA families. The story is somewhat different in mammals; mouse knockout studies have so far been performed on a limited number of miRNAs. Although individual miRNA knockout mice often do not yield obvious developmental phenotype (Tables 1 and 2), emerging evidence has implicated the functional importance of redundant miRNA families in normal development and stress response. Of the eight mammalian miRNA families with essential developmental functions, six exhibit extensive genomic redundancy, and their functional importance can only be revealed when most or all of the redundancy is removed (Table 1). The redundant, polycistronic miRNAs are particularly important. Although only limited effort so far has been dedicated to study their functions in vivo by removing all or most paralogous loci, an essential developmental function has been revealed in all such studies (3033).

Table 1 Mouse knockout phenotype for redundant miRNAs.

A summary of the phenotypes observed for mouse knockout studies to date examining miRNAs that exhibit redundancy.

View this table:
Table 2 Mouse knockout phenotype for nonredundant miRNAs.

A summary of the phenotypes observed for mouse knockout studies to date examining miRNAs that have no known redundancy.

View this table:

Complex Functional Interactions by Polycistronic miRNAs

Unlike eukaryotic mRNAs that are mainly monocistronic, a substantial portion of miRNA genes tend to be organized in clusters to generate polycistronic precursors. Twenty-five percent of human miRNAs and 31% of mouse miRNAs are predicted to be derived from polycistronic precursors (34), where a single transcript contains multiple stem-loop structures and yields multiple mature miRNAs. Polycistronic miRNA components are co-regulated transcriptionally, yet each encoded miRNA is individually processed by the biogenesis machinery and has a distinct sequence, target specificity, and functional readout. Among the polycistronic miRNAs, some contain a tandem of miRNA homologs working in concert, likely generated by local duplication of an existing miRNA locus during evolution. Others consist of miRNAs with different mature sequences and biological functions, potentially harboring a complex mode of interaction. Still others contain both homologous and heterologous miRNA components, conferring both a redundancy and a versatility that are rarely seen for single miRNA genes.

The small size of miRNAs and the imperfect base pairing for their target recognition enable each miRNA to regulate many mRNA targets (9). miRNA polycistrons further expand the functional capacity of individual miRNAs, yielding a greater capacity for gene regulation through the functional coordination of multiple cotranscribed miRNAs. This internal interaction among miRNA polycistronic components could constitute a unique molecular basis for a pleiotropic function in development and disease. Indeed, for the reported miRNAs that, when deleted in mice, lead to essential developmental and physiological phenotypes, a significant portion are polycistronic miRNAs (Table 1).

Many polycistronic mammalian miRNAs are conserved among vertebrate genomes. When aligning the homologous miRNA polycistrons, their conserved structures are centered on regions that encode the precursor miRNA (pre-miRNA), whereas the regions between pre-miRNA sequences are often divergent. Polycistronic miRNAs are often redundant within a particular genome, likely resulting from genomic duplication events during evolution (35). Among the paralogous miRNA polycistrons, not only is sequence conserved for most miRNA components, but the organization of the polycistronic components is also largely preserved (36). Thus, both the individual miRNA components and their genomic organization are likely to have an impact on their functions.

Cooperative interactions within polycistronic miRNAs

The polycistronic miRNAs bear striking structural resemblance to prokaryotic operons, in which multiple protein-coding genes under the same transcriptional control act cooperatively to regulate a specific biological process. This analogy has led many to speculate that multiple cotranscribed polycistronic miRNA components could also act synergistically to achieve functional robustness. Not surprisingly, polycistronic miRNAs that solely contain homologous components are likely to exhibit cooperative interactions to confer functional robustness, and such homologous miRNAs frequently share the same seed sequences and thus the same or similar target specificity and biological functions (Fig. 1A). For example, the miR-221/222 dicistron contains two homologous miRNAs, miR-221 and miR-222, each containing the same seed region. Overexpression of miR-221/222 has been observed in a large number of malignancies. Both miRNAs cooperatively promote proliferation and block TRAIL [tumor necrosis factor (TNF)–related apoptosis-inducing ligand]–induced apoptosis by activating the PI3K (phosphatidylinositide 3-kinase)–AKT pathway, and they promote the cell cycle by repressing tumor suppressor genes such as those encoding p27, p57, PTEN (phosphatase and tensin homolog), and TIMP3 (tissue inhibitor of matrix metalloproteinase 3) (37, 38). Similarly, miR-221 and miR-222 act cooperatively to repress expression of the gene encoding the c-Kit receptor (CD117) in germ cells to maintain the undifferentiated state of mammalian spermatogonia (39), as well as in erythroleukemia cells to inhibit erythropoiesis (40). Polycistronic miRNAs containing heterologous components can also exhibit a similar cooperative interaction because these cotranscribed miRNAs with distinct mature sequences can still target the same mRNA(s) at different sites or target different components in the same pathway (Fig. 1A).

Fig. 1 Polycistronic miRNAs harbor complex functional interactions among its components.

(A) Cooperative functional interaction occurs within a polycistronic miRNA. Some polycistronic miRNAs contain a tandem of homologous components that frequently share the same seed sequences, and thus, they often have the same targets and biological function (right panel). Other miRNA clusters consist of heterologous miRNAs that act on different target sites to synergically regulate one biological process (left panel). (B) miRNA polycistrons can harbor a functional antagonism among the encoded components. (C) Schematic illustration of the miR-19:miR-92 antagonism in regulating the oncogenic cooperation between miR-17-92 and c-Myc. Whereas miR-19 miRNAs repress c-Myc–induced apoptosis to promote oncogenesis, miR-92 exhibits an opposite effect to promote c-Myc–induced apoptosis.

Despite the cooperative interactions among the polycistronic miRNA components, the extent of the functional contribution of each miRNA can be variable in a cell type– and context-dependent manner. For example, some miRNA polycistrons could harbor one functionally dominant miRNA component that acts cooperatively with the remaining component(s). The evolutionarily conserved miR-143/145 dicistron encodes miR-143 and miR-145, which exhibit no sequence homology. Although these two miRNAs likely act on different target sites, they share a surprising number of common targets for actin dynamics, cytoskeletal function, and injury-induced phenotypic switching of smooth muscle cells (41, 42). Deficiency of miR-145, but not of miR-143, perturbs the formation of actin stress fibers in mice and blocks vascular smooth muscle cell response to vessel injury (42). Mice deficient for miR-143/145 largely phenocopy miR-145−/− mice, yet with a slightly stronger manifestation on a subset of the defects (42). These findings support a cooperative mode of interactions between miR-143 and miR-145, with miR-145 as the dominant player. Similarly, the dicistronic miR-144/451 miRNA also harbors a component, miR-451, that is functionally dominant in erythroid differentiation. During differentiation through this lineage, miR-451 plays a major regulatory role, at least in part by down-regulating the chaperone protein 14-3-3ζ, which coordinates signal transduction downstream of hematopoietic growth factor receptor signaling (43).

Antagonistic interactions within polycistronic miRNAs

Whereas cooperation is the predominant mode of interaction among polycistronic miRNA components, some miRNA polycistrons also harbor a functional antagonism. Such functional antagonism does not exist in isolation, nor does it manifest as a stereotypical mechanism; instead, it is cell type– and context-dependent, often contributing to the functional complexity of miRNA polycistrons. The first characterized example involves two muscle-specific miRNA clusters, miR-1-1/miR-133a-2 and miR-1-2/133a-1. In mouse C2C12 myoblast cells, miR-1 promotes differentiation by inhibiting myoblast proliferation through the repression of HDAC4 (histone deacetylase 4), whereas miR-133a counteracts this effect by promoting myoblast proliferation through the regulation of SRF (serum response factor) (44). Another well-characterized miRNA polycistron that harbors internal antagonism is miR-17-92, a potent oncomir cluster that encodes six mature miRNAs belonging to four families (miR-17/20, miR-18, miR-19a/19b, and miR-92) (45). In a mouse model of Burkitt’s lymphoma, where miR-17-92 exhibits potent cooperation with the transcription factor c-Myc to promote B cell lymphomagenesis, the miR-19 family components are both necessary and sufficient to mediate the oncogenic effect of miR-17-92 (45, 46), whereas the miR-92 component acts as an internal inhibitor to dampen this oncogenic activity (47) (Fig. 1C). This effect of miR-92 is, at least in part, mediated by enhancing c-Myc dosage to impose a strong coupling between excessive proliferation and p53-dependent apoptosis. The miR-19:miR-92 antagonistic interaction converges on the regulation of cell survival during B cell transformation, with miR-19 promoting cell survival and miR-92 enhancing apoptosis upon aberrant c-Myc up-regulation (47) (Fig. 1C). In this case, functional crosstalk between oncogene and tumor suppressor pathways is hardwired into the unique polycistronic miRNA structure, limiting detrimental oncogenic signaling in cells with inappropriate miR-17-92 induction.

A number of mechanisms could regulate the extent of the antagonism within a miRNA polycistron and the intricate balance it confers. For example, the miR-19: miR-92 antagonism is disrupted during c-Myc–induced oncogenesis because of a greater increase of miR-19 over miR-92 (47). Because polycistronic miRNA components are co-regulated transcriptionally, their differential expression regulation under a specific context is possibly mediated through differential miRNA biogenesis and/or miRNA turnover. Multiple discoveries have shed light on the identification of miRNA-specific posttranscriptional regulations that open a plethora of mechanisms by which individual miRNAs can be differentially expressed (Fig. 2). Emerging evidence has demonstrated that both the tertiary structure of miRNA precursors and trans-acting RNA binding proteins could have an impact on the canonical miRNA biogenesis, resulting in cell type– and context-dependent, differential miRNA production from different polycistronic components encoded by the same precursor (4851). Additionally, a number of RNA binding proteins are also identified to regulate miRNA stability via a sequence-dependent manner (52, 53). Thus, although polycistronic miRNA components are co-regulated transcriptionally, differential expression clearly modulates their relative abundance, yielding unusual functional complexity and robustness through their functional interaction.

Fig. 2 Diverse mechanisms regulate miRNA-specific biogenesis.

A schematic illustration summarizes the biogenesis regulators of specific miRNAs, with positive regulators on the left and negative regulators on the right (26, 50, 52, 8798, 144159). miRNAs whose biogenesis is controlled by each regulator are also listed.

Cell type– and context-dependent versatility of polycistronic miRNA functions

In a polycistronic miRNA, the functional readout of the individual miRNA components is cell type– and context-dependent. For example, miR-17-92 components exhibit opposing effects on the regulation of angiogenesis in two different cell types (5456). In colon cancer cells that overexpress MYC, miR-18 and miR-19 synergistically induce angiogenesis by repressing TSP1 and CTGF, respectively (54). In endothelial cells, however, miR-17/20 and miR-92 cooperatively mediate antiangiogenic effects by down-regulating Janus kinase (JAK) and the integrin subunit α5, respectively (55, 56). Thus, two nonoverlapping subsets of miR-17-92 components generate opposing effects on a single biological process (angiogenesis) in two different cell types (Fig. 3A). Additionally, it is not uncommon that a specific biological context also determines which polycistronic component(s) plays a key function. The oncogenic potential of miR-17-92 is evident both in cooperation with c-Myc and in cooperation with RAS oncogene. Whereas miR-19 mediates the oncogenic collaboration with c-Myc through the repression of the tumor suppressor PTEN (46), the miR-17/20 miRNAs are responsible for cooperating with RAS by repressing the cyclin-dependent kinase inhibitor p21 (57). This is due to the distinct effects of miR-19 and miR-17/20 on repressing c-Myc–induced apoptosis and bypassing RAS-induced senescence, respectively (46, 57).

Fig. 3 Cell type– and context-dependent versatility of polycistronic miRNA functions.

(A) The miR-17-92 functions on angiogenesis depend on cell types. miR-18a and miR-19 repress the abundance of several secreted molecules in tumor cells (54), thereby promoting angiogenesis through a non–cell-autonomous mechanism. In contrast, in endothelial cells, miR-17/20 and miR-92a suppress the abundance of JAK and the integrin subunit α5, respectively, thus inducing an antiangiogenic effect through a cell-autonomous mechanism (55, 56). (B) The mode of functional interaction between miR-1 and miR-133 depends on the biological contexts. Although miR-1 and miR-133 synergistically promote differentiation from embryonic stem cells to mesoderm, they exhibit a functional antagonism in regulating the muscle-specific differentiation from mesoderm (58).

Clearly, a distinct molecular pathway can act downstream of each polycistronic miRNA component, constituting the molecular basis of a pleiotropic effect. The exact mode of functional interaction within miRNA polycistrons reflects the crosstalk among these downstream signaling pathways, and could vary greatly depending on cell types and biological context. One of the best-characterized examples is the cardiomyocyte-enriched miR-1/133a polycistron, which regulates embryonic stem (ES) cell differentiation into cardiomyocytes (58). It is known that the myogenic differentiation factors, including MyoD, SRF, and myocyte enhancer factor (MEF2), work in part through miR-1 and miR-133a to control skeletal and cardiac muscle cell specification. For example, MEF2 directly regulates miR-1/133a expression through a tissue-specific enhancer within an intron located between the miR-1 and miR-133a coding regions (59). The cotranscribed miR-1 and miR-133a components exhibit different modes of interaction in a stage-specific manner during ES cell differentiation to cardiomyocytes (Fig. 3B). In the first phase, when ES cells differentiate into mesoderm, miR-1 and miR-133a cooperate to suppress ectodermal and endodermal cell fates (58). In the second phase, when the embryonic mesoderm is differentiated into cardiomyocytes, miR-1 and miR-133a antagonize each other, with miR-1 promoting and miR-133a repressing this process (58). This internal antagonism is, at least in part, due to miR-1 regulation of the Notch ligand Delta and the transcription factor Hand2 (58, 60). Thus, the polycistronic gene structure of a miRNA can confer cell type– and context-dependent crosstalk among multiple downstream pathways, conferring unusual versatility and plasticity.

Functional Robustness Conferred by miRNAs

To date, deletion of most individual miRNAs in mice failed to cause an obvious defect; however, important phenotypes often emerge when compound miRNA knockout mice are generated, thereby partially or completely removing the redundancy in the mouse genome (Tables 1 and 2). These findings suggest that mammals are generally resistant to moderate perturbation of individual miRNA expression. The unique gene structures, genomic organization, and expression regulation of miRNAs all contribute to the molecular basis for their functional robustness; thus, careful experimental strategies are required to elucidate their biological significance. Here, we will discuss how genomic redundancy, polycistronic gene structure, and expression regulation all contribute to the unusual functional robustness of some miRNAs.

Genomic redundancy of miRNAs

The redundancy of miRNA families is more extensive in mammalian genomes than in invertebrate genomes. This redundancy comes not only from polycistronic miRNAs that contain homologous components but also from multiple paralogous miRNA loci in a given genome (34). Among the limited miRNA families conserved among invertebrate and vertebrate genomes, expansion throughout higher-order genomes is frequently observed. The let-7 family contains four members in worms, yet there are 14 different Let-7 miRNAs in mice (61). Similarly, miR-34 family only has one member in worms, and there are six copies of miR-34 miRNAs in mammals (30). This increased miRNA redundancy during evolution likely reflects the functional importance of such miRNAs in mammalian development and physiology. In mouse knockout studies, a significant portion of miRNA deficiency phenotypes results from single or compound deletion of redundant miRNA family member(s) (Table 1). From a limited number of studies, these redundant miRNA loci appear not to be simply functional replicates of each other. Instead, they can undergo distinct expression regulation and engage complex feedback mechanisms to trigger functional compensation and confer robustness.

The complete removal of all members of a redundant miRNA family through the generation of knockout mice is a time-consuming endeavor. Thus, most functional studies to date have focused on mice with the deletion of one miRNA or a subset of homologous miRNAs. Nevertheless, a wide spectrum of developmental and physiological phenotypes has emerged from such investigations (Tables 1 and 2). The miR-200 family contains five homologous miRNAs located in two polycistronic clusters, miR-200a/200b/429 and miR-200c/141. Deletion of miR-200b and miR-429 alone within miR-200a/200b/429 results in female infertility due to ovulatory failure (62). The miR-124 family encodes three identical, neuron-specific miRNAs, including miR-124-1, miR-124-2, and miR-124-3. Mice deficient for miR-124-1 alone exhibit small brain size, aberrant axonal sprouting in dentate gyrus granule cells, and increased cell death in retinal cones (63). miR-15a/16-1, which is frequently deleted in B cell chronic lymphocytic leukemia (CLL), is part of a redundant miRNA family with two distinct loci. miR-15a/16-1 deletion in mice causes indolent, B cell–autonomous, clonal lymphoproliferative disorders, recapitulating the phenotypical spectrum of human CLL patients (64). Protein kinase Cα (PKCα)–dependent signaling represses miR-15a, and this regulation, at least in some cells, is part of a feed-forward loop that promotes DNA synthesis because miR-15a modulates cyclin E expression (65). Despite these compelling observations, the essential function of such redundant miRNA families is yet to be better revealed because the typical experimental approaches underestimate the extent of the phenotype because of functional compensation from the remaining miRNA homolog(s).

There are only a limited number of studies in which redundant miRNAs are studied systematically in knockout mice (30, 31, 66, 67). Not surprisingly, synergistic cooperation could occur among redundant miRNA paralogs, whereby only the complete removal of all redundant family members will yield an evident phenotype. For example, miR-133a-1 and miR-133a-2 encode identical, muscle-specific miRNAs, located within the miR-1-2/133a-1 and miR-1-1/133a-2 miRNA dicistrons, respectively. Deficiency of miR-133a-1 or miR-133a-2 alone fails to cause any specific phenotype, whereas miR-133a double knockout mice exhibit lethal ventricular-septal defects in embryos or neonates with 50% penetrance (67).

In addition to a synergistic interaction among redundant miRNA loci, some families contain multiple loci that have different functional contributions to a given biological process. Among the three individual loci encoding members of the miR-17-92 family (miR-17-92, miR-106a-363, and miR-106b-25), miR-17-92 clearly has the most prominent developmental role because the single deletion of miR-17-92, but not that of the other paralogs, exhibits an obvious developmental phenotype, including postnatal lethality, lung hypoplasia, skeletal defects, and aberrant apoptosis during pro- to pre-B transition (31, 68). Although miR-106b-25 deletion alone does not display any obvious defects, it clearly enhances the developmental phenotype of miR-17-92−/− mice, with the combined deletion producing embryonic lethality at embryonic day 15.5 (E15.5) with severe lung and heart defects (31). In contrast, deletion of miR-106a-363, either alone or in combination with its paralogs, did not yield any obvious phenotype, possibly because of its low or undetectable expression level (31).

An even more complex interaction is found in redundant miRNAs, where different subsets of paralogs preferentially regulate distinct biological processes. The best-studied example is the let-7 family of miRNAs in worms. Let-7 was initially identified through forward genetic screen in worms as an important regulator for the larval-to-adult transition (69). Three additional let-7 family members exist in worms (miR-48, miR-84, and miR-241). Whereas let-7 predominantly promotes the larval-to-adult transition through the down-regulation of hbl-1, lin-41, and daf-12, miR-48, miR-84, and miR-241 collectively repress hbl-1 and regulate the early larval developmental timing at the L2-to-L3 transition (6975). Sequence differences in the 3′ end of the let-7 family miRNAs could contribute to the functional specificity of different let-7 loci (75). The functional specificity of each let-7 locus is due, at least in part, to the differential temporal and special expression regulation. let-7 itself is enriched in late larval stages to regulate larva-to-adult transition; in comparison, all other let-7 paralogs peak during early larval stages, thus predominantly promoting early larval developmental progression (69, 75). Because the mammalian let-7 family contains more than a dozen paralogs, it is likely to have even more complex interactions to regulate multiple distinct biological processes.

The monocistronic and polycistronic miRNAs often exhibit different patterns of redundancy. For monocistronic miRNAs, the manifestation of genomic redundancy is straightforward because the related loci each encode a single miRNA that is similar in mature sequences. However, for polycistronic miRNAs, genomic redundancy can be more complicated (Fig. 4). In some cases, individual miRNA loci contain different numbers of the homologous miRNAs. A well-characterized example is the miR-34/449 miRNA family, whose three paralogs in mammals contain one, two, and three miR-34/449 miRNAs, respectively (30) (Fig. 4) . In other cases, redundant polycistronic miRNAs only contain a subset of components that are conserved. One of the best example is the miR-17-92 family, where only three of six miR-17-92 components are conserved in its paralog miR-106b-25 (31, 76) (Fig. 4). It has been speculated that such paralogous miRNA polycistrons arise from genome duplication and, subsequently, can be subjected to local deletion/duplication of specific component(s) (35). Thus, it is conceivable that the molecular composition of miRNA paralogs could also contribute to their functional specificity.

Fig. 4 Distinct genomic organization of paralogous polycistronic miRNA loci.

miR-1-1/133a-2, miR-1-2/133a-1, and miR-206/133b are strictly conserved paralogs. miR-34a, miR-34b/34c, and miR-449 are paralogs with distinct gene structures. They contain homologous miRNAs that are present in different copy numbers. miR-17-92, miR-106a-363, and miR-106b-25 are also paralogs that harbor distinct gene structures. They are composed of both homologous and heterologous miRNAs, and only a subset of components is conserved between the paralogs.

Paralogous miRNA loci provide functional redundancy not only by regulating overlapping targets but also by providing compensatory expression. This is clearly illustrated by the miR-34/449 miRNA family, in which the loss of function of all three family members (miR-34a, miR-34b/34c, and miR-449) results in strong motile ciliogenesis defects (30, 77) (see further discussion below), yet mice deficient for individual loci display no such phenotype (78, 79). In addition to the functional redundancy conferred by sequence homology, compensatory expression among miR-34/449 family members also reinforces this redundancy. A recent study reported that miR-449 was induced in miR-34b/34c–deficient mice, whereas miR-34b and miR-34c induction was observed in miR-449–deficient mice (77). Although the precise signals that mediate this intricate compensatory expression regulation are not known, the identification of this feedback response highlights the extent of functional redundancy in miRNA genes.

Predominant cell type–specific expression of particular miRNA families

In addition to genomic redundancy, miRNA expression regulation also plays an important role in their functional robustness. Some miRNA families exhibit predominant, cell type–specific expression patterns. Predominantly expressed miRNAs can constitute a significant portion of total expressed miRNAs, ranging from 30 to 70% as reported in the literature. This is in sharp contrast to protein-coding genes, where the most abundant transcript often constitutes 1 to 2% of the total transcriptome in a given cell type (Fig. 5A).

Fig. 5 The dominant expression of miRNAs in specific cell types.

(A) Distribution of the most abundant miRNA and mRNA in the transcriptome. miR-290-295 and its homologs, which constitute ~70% of total expressed miRNAs in ES cells (81). In contrast, the most abundant mRNA in ES cells constitutes 1.3% of the total transcriptome (Risso et al., personal communication). (B) Dominantly expressed miRNA families in heart tissue, hepatocytes, and ciliated epithelia (GEO: GSM539871) (84, 85).

Previous studies have identified a number of such predominantly expressed, tissue-specific miRNAs. miR-122 is among the first characterized tissue-specific miRNAs, accounting for nearly 70% of all expressed miRNAs in hepatocytes (80). In mouse ES cells, components from the miR-290-295 polycistron make up ~70% of total miRNA populations, and their related homologs in the mouse genome constitute yet another 10% of miRNA expression (Fig. 5A) (8183). In frog ciliated epithelia, miR-34/449 family miRNAs, encoded by three individual genomic loci, collectively account for half of all expressed miRNAs (84). In striated muscle in heart, the two homologous murine miR-1/133 dicistrons generate more than 40% of expressed miRNA (Fig. 5B) (85). Finally, the miR-124 miRNA and its variants constitute 25 to 48% of total expressed miRNAs in a variety of regions in the brain (80). Although predominantly expressed miRNAs exist in specific cell types, it is worth noting that the exact percentages reported may vary from different studies. These numbers are calculated on the basis of miRNA sequencing experiments, in which the percentage of an miRNA among the total expressed miRNAs is determined not only by its abundance but also by its cloning efficiency (86). Nevertheless, it is clear that miRNAs from a single family could constitute the majority of expressed miRNAs in specific cell types.

It remains largely unknown how a single miRNA family predominates among all expressed mature miRNAs. In addition to transcriptional regulation, it is speculated that regulation of miRNA biogenesis also plays an important role to yield differential production of different miRNAs posttranscriptionally (Fig. 2). At least in some well-studied examples, RNA binding proteins can promote or inhibit the processing of specific miRNAs at different steps of the canonical miRNA biogenesis pathway (50, 8790) (Fig. 2). For example, the RNA binding protein TDP-43 (TAR DNA-binding protein 43) associates with terminal loop sequences in a small set of miRNAs and promote their processing by favoring their interaction with both Drosha in the nucleus and Dicer in the cytoplasm (89). Some RNA binding proteins, such as the RNA binding proteins RBFOX3 (RNA binding protein fox-1 homolog 3) and hnRNPA1 (heterogeneous nuclear ribonucleoprotein A1), exhibit opposite effects on the biogenesis of different miRNAs, acting as positive regulators for some but as negative regulators for others (50, 91, 92). In addition to RNA binding proteins, miRNA biogenesis can also be regulated by ADAR deaminases through RNA editing (93); by classical DNA binding proteins, such as p53, SMAD, and BRCA1, through direct interactions (9496); or by ncRNAs (97, 98). Together, the plethora of mechanisms underlying miRNA biogenesis is likely to contribute to the predominant expression of specific miRNA families in certain cell types.

Predominant expression and genomic redundancy confer functional robustness

In many cases, this cell type–specific, predominant miRNA expression pattern is associated with genomic redundancy and is likely to constitute an important molecular basis to confer functional robustness. Not surprisingly, the deletion of an entire family of such predominantly expressed miRNAs often gives rise to important developmental phenotypes (Table 1). One of the most characterized examples comes from studies of the miR-34/449 miRNA family that consists of six homologous miRNAs at three genomic loci, miR-34a, miR-34b/34c, and miR-449. The genomic redundancy of miR-34/449 miRNAs and their predominant expression in multiciliated epithelia (nearly 50% of expressed miRNAs) suggest functional significance to ciliogenesis (Fig. 5B) (84). Consistently, deletion of all three miR-34/449 loci in mice leads to impaired motile ciliogenesis in multiple cell types, giving rise to partial postnatal lethality, strong respiratory dysfunction, and male/female infertility (30). Not all miR-34/449 loci are equally enriched in ciliated epithelia, and their functional importance in ciliogenesis seems to be correlated with the extent of their expression, rather than their target specificity (30). Consistent with the more dominant expression of miR-34b/c and miR-449 polycistrons in multiciliated cells, one functional allele of miR-34b/34c or miR-449 is largely sufficient to support proper motile ciliation in otherwise miR-34/449–deficient cells (30). In contrast, having two intact miR-34a alleles in the absence of miR-34b/34c and miR-449 could not sustain normal ciliogenesis (30). The different functions among the three miR-34/449 genes could result from differential regulation of their expression in ciliated epithelia, particularly at the level of mature miRNAs.

Given the extensive redundancy exhibited by some miRNA families, it could be technically challenging to generate a knockout model to remove the entire family of miRNAs that collectively exhibit a predominant expression pattern. Nevertheless, deleting one or a subset of loci could render important insights in some cases. Deletion of miR-290-295 impairs the cell proliferation and differentiation potential of ES cells in vitro and results in embryonic lethality with incomplete penetrance (33). It is curious why removing miR-290-295, which makes up nearly 70% ES cell–expressed miRNAs, fails to generate a stronger phenotype that resembles Dicer- or Dgcr8-null ES cells. It turns out that miRNAs homologous to miR-290-295 components, including miR-17, miR-20, and miR-302 miRNAs, constitute another 10% of ES cell–expressed miRNAs to provide functional compensation (81). Similarly, the miR-124 family, including the most highly expressed miRNAs in neurons, consists of three genomic loci that yield three, identical mature miR-124 species. miR-124-1 knockout mice exhibit defective cell survival and axon outgrowth of dentate gyrus neurons and retinal cones, a relatively mild neuronal phenotype (63). It is conceivable that the complete removal of all miR-124 loci would generate a stronger neuronal defect. For cell type–specific, predominantly expressed miRNA families, functional redundancy can be particularly robust because the redundant miRNA loci could each confer a nearly complete functional compensation owing to their high-level expression.

Whereas most dominantly expressed miRNAs play an essential role in normal development, some regulate pathways important for physiologic homeostasis. For example, miR-122 makes up nearly 70% of miRNA expression in hepatocytes (Fig. 5B), yet mice with constitutive deletion or liver-specific deletion do not generate an overt developmental phenotype in liver (99). Nevertheless, miR-122 still regulates important physiological responses, particularly in response to various stresses. miR-122 is regulated in a circadian manner by orphan nuclear receptor REV-ERBα and appears to regulate the circadian expression of a large number of transcripts in murine hepatocytes (100). Additionally, as miR-122−/− mice age, a number of hepatic phenotypes also start to emerge, including steatohepatitis, liver fibrosis, and hepatocellular carcinoma, indicating that miR-122 constitutes an important checkpoint for fat metabolism and cell proliferation in hepatocytes (101, 102). This effect of miR-122 is mediated, at least in part, by its repression of the chemokine (C-C motif) ligand 2 (CCL2) and Klf6 (101, 102). Finally, miR-122 plays a noncanonical role in the life cycle of HCV by promoting viral replication (103). Vidigal et al. also published a review summarizing the phenotypes of miRNA knockouts in mice (104).

Concluding Remarks

The complexity of the miRNA transcriptome and the functional interplay both within its own members and with protein-coding genes underscore the vast functionality of the noncoding gene landscape. Beyond the structure-function relationships intrinsic to miRNAs, their specialized genomic organization and expression regulation add additional layers of complexity to their biological functions. Additionally, transcriptional and epigenetic control, miRNA biogenesis, and miRNA turnover, although incompletely understood, likely collaborate to establish highly dynamic miRNA output during tissue specification and maintenance. The important roles that miRNAs play in mammalian development and disease will continue to drive investigations to explore the vast noncoding genome and understand the developmental and physiologic complexities resulting from the intricate interactions between ncRNAs and protein-coding genes.


Acknowledgments: We would like to thank the members of the He laboratory for their help and inspiring discussions, particularly A. Modzelewski for carefully proofreading the text. Funding: L.H. acknowledges an R01 and an R21 grant from the National Cancer Institute (R01 CA139067 and 1R21CA175560-01), a California Institute for Regenerative Medicine New Faculty Award (RN2-00923-1), a Tobacco-Related Disease Research Program research grant (21RT-0133), and a research scholar award from the American Cancer Society (ACS, 123339-RSG-12-265-01-RMC). V.O. is supported by a Leukemia and Lymphoma Society special fellow award (LLS, 3423–3513). A.C.M. acknowledges an R01 grant from the NIH (R01HL098608) and an R01 grant from the National Heart, Lung, and Blood Institute (R01HL098608). Competing interests: The authors declare that they have no competing interests.
View Abstract

Navigate This Article