Molecular Biology Terms Starting With R

R

Molecular Biology Glossary: R

Molecular BiologyMolecular Biology TechniquesGene RegulationRNA RegulationGene Expression

Regulatory Region

/ REG-yoo-luh-tor-ee REE-jun /  ·  Latin: regulare (to rule) + regio (area)

Molecular BiologyIntermediate
Also known as:Cis-Regulatory Element

Regulatory Region is a DNA sequence that controls the expression of nearby or distant genes by providing binding sites for transcription factors, chromatin remodeling complexes, and other regulatory proteins.

Regulatory regions include core promoters where RNA polymerase binds, proximal elements within a few hundred base pairs of the transcription start site, and distal enhancers that can function hundreds of kilobases away to increase transcription rates 10-fold or more. Insulators such as CTCF-binding sites define domain boundaries and prevent enhancers from activating inappropriate genes, while locus control regions coordinate expression of gene clusters such as the beta-globin locus. Open chromatin signatures, conservation across species, and histone modifications like H3K4me3 at promoters or H3K27ac at active enhancers help researchers identify these sequences.

Mutations in regulatory regions cause many human diseases without altering any protein-coding sequence.

Did you know?

Genome-wide association studies have mapped thousands of disease-associated variants to regulatory DNA rather than to protein-coding exons. One analysis of GWAS data found that roughly 93 percent of trait-associated single-nucleotide variants fall outside protein-coding regions, most of them in predicted regulatory sequences.

Common misconception

Regulatory DNA only controls genes located immediately next to it. Enhancers can activate genes located hundreds of kilobases away by looping through three-dimensional space to contact a promoter.

Example in nature

In developing red blood cells of humans, a locus control region located roughly 6 to 22 kilobases upstream of the beta-globin gene cluster coordinates the sequential activation of embryonic, fetal, and adult globin genes. This single regulatory region drives expression levels high enough to produce the roughly 280 million hemoglobin molecules packed into each mature red blood cell.

Replication Fork

/ rep-lih-KAY-shun fork /  ·  Latin: replicare (to fold back) + fork

Molecular BiologyIntermediate
Also known as:Replication Bubble

Replication Fork is the Y-shaped structure formed at an active origin of replication where the two parental DNA strands are separated and new complementary strands are synthesized bidirectionally.

At each replication fork, helicase unwinds the double helix, single-strand binding proteins stabilize the exposed strands, and primase synthesizes RNA primers to initiate synthesis on both strands. DNA polymerase synthesizes the leading strand continuously in the direction of fork movement and the lagging strand discontinuously as Okazaki fragments, each roughly 100 to 200 nucleotides long in eukaryotes, moving away from the fork. Fork progression reaches about 50 base pairs per second in human cells, compared with roughly 1,000 base pairs per second in Escherichia coli.

Stalled forks can collapse to produce double-strand breaks that are repaired by homologous recombination.

Did you know?

Human cells fire approximately 30,000 to 50,000 replication origins per S phase, allowing the roughly 6.4 billion base pairs of the diploid genome to be duplicated in 6 to 8 hours. Without this large number of origins, replication from a single fork at 50 base pairs per second would take decades.

Common misconception

Both DNA strands are copied in exactly the same continuous way. One strand is made continuously and the other in short fragments called Okazaki fragments, because the two strands run antiparallel and DNA polymerase can only synthesize in the 5-prime to 3-prime direction.

Example in nature

Hydroxyurea stalls replication forks in mammalian cells by depleting deoxyribonucleotide pools. Treatment can reduce dNTP levels by more than 90 percent within hours, activating the ATR checkpoint to stabilize forks before they collapse into double-strand breaks.

Reverse Primer

/ reh-VERS PRY-mer /  ·  Latin reversus, turned back; Latin primarius, first in rank

Molecular Biology TechniquesIntermediate
Also known as:reverse oligonucleotide primerdownstream primerantisense primer

Reverse Primer is a short synthetic oligonucleotide that anneals to the antisense strand of double-stranded DNA at the downstream end of a target sequence, priming DNA synthesis toward the forward primer during PCR.

In PCR, the reverse primer and forward primer flank the target sequence from opposite strands, directing DNA polymerase to synthesize new strands toward each other and amplify the region between them exponentially. The reverse primer sequence is complementary to the antisense strand, meaning it shares the same sequence as the sense strand but is oriented 3-prime to 5-prime relative to that strand. Primer length typically ranges from 18 to 25 nucleotides, and melting temperatures for the forward and reverse primers are matched within 2 to 5 degrees Celsius to ensure both anneal efficiently during the same PCR cycle.

Self-complementarity within or between primers must be avoided to prevent hairpin formation or primer-dimer artifacts that reduce amplification efficiency.

Did you know?

Researchers designing reverse primers for RT-PCR sometimes deliberately span an exon-exon junction so the primer cannot bind genomic DNA, which would lack that junction. This design strategy distinguishes amplification from mRNA-derived cDNA from any contaminating genomic DNA in the sample.

Common misconception

Forward and reverse primers bind the same DNA strand in the same direction. They bind opposite strands with their 3-prime ends facing each other, so that polymerase extension from each primer moves toward the other.

Example in nature

When researchers amplify a specific exon of the human BRCA1 gene for sequencing, the reverse primer is designed to bind the antisense strand roughly 200 to 500 base pairs downstream of the forward primer. The two primers together define an amplicon of that precise length, which is then verified by gel electrophoresis.

Reverse Transcription

/ reh-VERS tran-SKRIP-shun /  ·  Latin: reversus + transcribere

Molecular BiologyIntermediate
Also known as:cDNA Synthesis

Reverse Transcription is the synthesis of complementary DNA from an RNA template by the enzyme reverse transcriptase.

Reverse transcriptase, encoded by retroviruses such as HIV, first synthesizes a DNA-RNA hybrid duplex using the viral RNA genome as a template. Its associated RNase H activity then degrades the RNA strand of that hybrid, and a second DNA strand is synthesized to produce a complete double-stranded cDNA. In molecular biology, reverse transcription is combined with PCR in the RT-PCR technique to detect and quantify specific RNA sequences by converting them to cDNA before amplification.

Retrotransposons, which make up a substantial fraction of many eukaryotic genomes, also use reverse transcription to copy themselves into new genomic locations.

Did you know?

Howard Temin and David Baltimore independently discovered reverse transcriptase in 1970, a finding that overturned the assumption that genetic information could only flow from DNA to RNA. Both researchers shared the Nobel Prize in Physiology or Medicine in 1975 for this work.

Building Blocks of Nucleic Acids →
Common misconception

RNA can never be copied into DNA. Reverse transcriptase enzymes perform exactly this reaction, and retroviruses depend on it to integrate their genomes into host chromosomes.

Are Enzymes Proteins? →
Example in nature

HIV uses reverse transcription to convert its single-stranded RNA genome into double-stranded DNA inside an infected CD4-positive T cell. This DNA copy is about 9,700 base pairs long and is integrated into a host chromosome by viral integrase.

Riboswitch

/ RY-boh-switch /  ·  Latin ribosum (ribose-based); Old English swician, to turn

Gene RegulationAdvanced
Also known as:RNA regulatory elementaptamer-based switch

Riboswitch is a segment of messenger RNA that folds into distinct secondary structures depending on whether a specific small molecule is bound, and this conformational change directly controls transcription termination or translation initiation without requiring any protein regulator.

Riboswitches are found predominantly in bacteria, where they are embedded in the 5-prime untranslated regions of mRNAs encoding metabolic enzymes. Each riboswitch contains an aptamer domain that binds its target ligand with high specificity and an expression platform whose folding state determines whether the downstream gene is expressed. When intracellular concentrations of a metabolite such as adenosylcobalamin rise above a threshold, ligand binding stabilizes a terminator hairpin that halts transcription before the coding sequence is reached.

More than 40 distinct riboswitch classes have been identified in bacteria, each recognizing a different small molecule including amino acids, nucleotides, metal ions, and enzyme cofactors.

Did you know?

Riboswitches have also been discovered in some eukaryotes, including fungi. The TPP riboswitch controlling thiamine biosynthesis genes in the fungus Neurospora crassa (a bread mold) regulates splicing of a precursor mRNA rather than transcription termination, showing that the same RNA-based sensing mechanism can be rewired to control different steps of gene expression.

Common misconception

Riboswitches are a type of protein that binds mRNA to regulate gene expression. They are segments of the mRNA molecule itself, requiring no protein partner to sense a ligand and change gene output.

Example in nature

In Bacillus subtilis, the lysine riboswitch in the 5-prime UTR of the lysC gene binds free lysine when cellular concentrations exceed roughly 1 millimolar. Binding induces a transcription-terminating hairpin that reduces lysC mRNA production by more than 10-fold, preventing the cell from overproducing the amino acid.

Ribozyme

/ RY-boh-zym /  ·  English: ribonucleic acid + enzyme

Molecular BiologyAdvanced
Also known as:Catalytic RNA

Ribozyme is an RNA molecule that catalyzes a chemical reaction, most commonly the cleavage or formation of phosphodiester bonds, demonstrating that proteins are not the only biological molecules capable of enzymatic activity.

The discovery of ribozymes by Thomas Cech and Sidney Altman in the early 1980s, work that earned them the 1989 Nobel Prize in Chemistry, overturned the long-held assumption that all biological catalysts are proteins. Cech showed that the group I intron of Tetrahymena thermophila (a ciliated protozoan) could splice itself out of a precursor RNA without any protein assistance. The most biologically significant ribozyme is the ribosome itself: the peptidyl transferase center that forms peptide bonds during translation is composed of ribosomal RNA, not protein, meaning every living cell continuously uses RNA catalysis to build its proteome.

Some ribozymes, such as the hammerhead and hairpin ribozymes found in plant viroids, cleave RNA strands with rate enhancements of up to 10 million-fold over the uncatalyzed reaction.

Did you know?

Researchers have engineered artificial ribozymes in the laboratory that can catalyze reactions not found in nature, including carbon-carbon bond formation. One synthetic ribozyme developed by the Szostak laboratory in 2001 could copy short RNA templates, supporting the idea that a self-replicating RNA molecule could have existed before protein enzymes evolved.

Are Enzymes Proteins? →
Common misconception

All enzymes are proteins. Certain RNA molecules also catalyze chemical reactions, and the ribosome's peptide bond-forming activity is carried out entirely by ribosomal RNA.

Example in nature

Plant viroids, such as the avocado sunblotch viroid, contain hammerhead ribozyme sequences that cleave the viroid RNA during rolling-circle replication. The hammerhead ribozyme in this viroid cleaves a specific phosphodiester bond with a rate constant of roughly 1 per minute under physiological magnesium concentrations, generating the unit-length viroid genome.

Translation Biology →

RISC

/ RISK /  ·  Scientific term used in rna regulation.

RNA RegulationAdvanced

RISC is a ribonucleoprotein complex that uses a short single-stranded RNA guide to locate complementary messenger RNA molecules and silence them by cleavage or translational repression.

The core of RISC is an Argonaute protein that binds a guide RNA strand of 21 to 23 nucleotides derived from small interfering RNA or microRNA processing. When the guide sequence matches a target mRNA with full complementarity, the Argonaute slicer domain cleaves the mRNA at a position precisely 10 nucleotides from the guide’s 5 end, marking it for rapid degradation. Partial complementarity, which is typical of most animal microRNA-target interactions, instead recruits GW182 scaffold proteins and deadenylase complexes that shorten the mRNA’s poly-A tail and block ribosome access.

RISC-mediated silencing defends plants, animals, and fungi against RNA viruses and fine-tunes developmental gene expression programs across hundreds of target transcripts simultaneously. Caenorhabditis elegans provided the first genetic evidence for this pathway when Andrew Fire and Craig Mello demonstrated in 1998 that injected double-stranded RNA silenced matching genes far more potently than single-stranded RNA alone.

Did you know?

Human cells contain four Argonaute paralogs, AGO1 through AGO4, but only AGO2 retains the catalytic residues needed for direct mRNA cleavage; the other three paralogs silence targets exclusively through translational repression and deadenylation, meaning that slicer-independent silencing is the predominant mode in human cells.

Common misconception

RISC edits the DNA sequence of genes it targets. RISC acts exclusively on RNA, either cleaving target mRNA or blocking its translation, without altering the underlying genomic DNA sequence.

Building Blocks of Nucleic Acids →
Example in nature

In the nematode Caenorhabditis elegans, RISC loaded with let-7 microRNA represses the lin-41 mRNA during larval development by binding multiple partially complementary sites in its 3-prime untranslated region. This repression shifts the animal from larval to adult cell fates at a precise developmental stage, and worms lacking functional let-7 repeat larval molts indefinitely rather than completing the transition, a phenotype first described in 2000.

RNA Interference

/ ar-en-ay in-ter-FEER-ents /  ·  English: RNA + interference

Molecular BiologyIntermediate
Also known as:RNAiPost-Transcriptional Gene Silencing

RNA Interference is a conserved cellular mechanism in which double-stranded RNA triggers sequence-specific silencing of gene expression by directing degradation or translational repression of complementary messenger RNA.

The pathway begins when the enzyme Dicer cleaves long double-stranded RNA or pre-microRNA hairpins into short duplexes of approximately 21 to 23 nucleotides. One strand of each duplex loads into the RNA-induced silencing complex, where it guides RISC to complementary mRNA targets. Perfect complementarity between the guide and target triggers Argonaute-mediated mRNA cleavage, while partial complementarity recruits deadenylases and translational repressors that reduce protein output without destroying the transcript immediately.

Andrew Fire and Craig Mello discovered the mechanism in Caenorhabditis elegans in 1998 and received the Nobel Prize in Physiology or Medicine in 2006 for the work. Researchers now exploit the pathway therapeutically: inclisiran, approved by the FDA in 2021, uses small interfering RNA to silence PCSK9 mRNA in liver cells and lower LDL cholesterol in patients with cardiovascular disease.

Did you know?

Plants use RNA interference as a primary antiviral defense, and some plant viruses have evolved suppressor proteins that block Dicer or RISC to escape silencing. Tobacco mosaic virus encodes the p19 suppressor protein, which sequesters 21-nucleotide small interfering RNA duplexes before they can load into RISC, allowing the virus to accumulate in leaf tissue despite an active plant immune response.

Common misconception

Gene silencing always requires a change to the DNA sequence. RNA interference reduces gene output by targeting RNA molecules, leaving the genomic DNA sequence completely unchanged.

Building Blocks of Nucleic Acids →
Example in nature

In the nematode Caenorhabditis elegans, feeding worms bacteria engineered to produce double-stranded RNA matching any chosen gene reliably silences that gene throughout the animal's body. This technique, called feeding RNAi, has been used to screen nearly every one of the roughly 20,000 C. elegans genes for developmental and behavioral phenotypes, with some genome-wide screens completed in under two weeks.

RNA Polymerase II

/ ar-en-AY poh-LIM-er-ays TOO /  ·  Scientific term used in gene expression.

Gene ExpressionAdvanced

RNA Polymerase II is the eukaryotic enzyme that transcribes protein-coding genes and most non-coding RNA genes from a DNA template into precursor RNA molecules.

RNA Polymerase II moves along the template DNA strand in the 3-to-5 direction, adding ribonucleotides to the growing RNA chain in the 5-to-3 direction at approximately 20 to 40 nucleotides per second in mammalian cells. General transcription factors assemble at core promoter elements such as the TATA box to position the enzyme at the transcription start site, and the Mediator complex relays signals from enhancer-bound activators to the polymerase. Shortly after synthesis begins, capping enzymes add a 7-methylguanosine cap to the 5 end of the nascent RNA, protecting it from exonuclease degradation and facilitating ribosome recognition.

Promoter-proximal pausing, in which the polymerase stalls 20 to 60 nucleotides downstream of the start site, is a widespread regulatory checkpoint that controls the rate of productive elongation across thousands of human genes. Beyond messenger RNA, RNA Polymerase II transcribes long non-coding RNAs, microRNA precursors, and small nuclear RNAs that are processed into spliceosomal components.

Did you know?

The carboxy-terminal domain of the largest RNA Polymerase II subunit contains 52 tandem repeats of a seven-amino-acid sequence in humans, and the phosphorylation state of these repeats changes at each stage of transcription, recruiting different RNA processing factors to the elongating polymerase so that capping, splicing, and polyadenylation are coupled directly to transcription rather than occurring as separate downstream events.

Common misconception

One RNA polymerase performs every transcription job in eukaryotic nuclei. Eukaryotic cells use three distinct nuclear RNA polymerases: RNA Polymerase I transcribes ribosomal RNA precursors, RNA Polymerase II transcribes protein-coding and most non-coding genes, and RNA Polymerase III transcribes transfer RNAs and 5S ribosomal RNA.

Example in nature

In differentiating human erythroid progenitor cells, RNA Polymerase II transcribes the beta-globin gene at a rate that produces tens of thousands of beta-globin mRNA molecules per cell over the course of terminal differentiation. The nascent transcript spans approximately 1,600 nucleotides of coding sequence embedded within a roughly 1,500-nucleotide pre-mRNA that is then spliced to remove two introns before export to the cytoplasm.

RNA Secondary Structure

/ ar-en-ay SEK-on-dair-ee STRUK-cher /  ·  Ribonucleic Acid; Latin secundarius; Latin structura

RNA BiologyAdvanced
Also known as:RNA foldinghairpin loopstem-loop

RNA Secondary Structure is the set of base-paired regions and unpaired loops that form when complementary sequences within a single RNA strand fold back and hydrogen-bond with each other, producing a characteristic two-dimensional shape that influences the RNA's function.

Unlike double-stranded DNA, a single RNA strand can pair with itself wherever complementary sequences occur within the same molecule, forming stem-loops, hairpins, internal loops, and bulges stabilized by Watson-Crick and wobble base pairs. These structures are thermodynamically stable under physiological conditions and are encoded directly by the RNA sequence, meaning that two RNAs with different sequences but similar folding patterns can perform equivalent functions. Ribosomal RNA provides the most dramatic example: the 16S rRNA of Escherichia coli folds into more than 50 distinct helical stems that together form the structural scaffold of the small ribosomal subunit, with individual stems conserved across billions of years of evolution.

Regulatory RNA structures called riboswitches in bacterial mRNA leaders change conformation when a small molecule binds, toggling transcription termination or translation initiation on or off without any protein intermediary. Computational tools such as the Zuker algorithm predict minimum free-energy secondary structures from sequence alone, though experimental methods including SHAPE chemistry are needed to confirm folding in living cells.

Did you know?

The hepatitis delta virus ribozyme, found in the genome of a human liver pathogen, folds into a double-pseudoknot structure that catalyzes self-cleavage of the viral RNA during replication. This reaction proceeds without any protein cofactor, demonstrating that RNA structure alone can generate catalytic activity comparable to protein enzymes.

Common misconception

RNA is always a floppy, unstructured strand. RNA molecules fold into stable, precisely defined secondary structures that are as functionally important as the sequence itself, and disrupting these structures by mutation typically abolishes RNA function even when the encoded protein sequence is unchanged.

Example in nature

Transfer RNA from all organisms folds into a cloverleaf secondary structure containing four stem-loops: the acceptor stem, the D-loop, the anticodon loop, and the T-psi-C loop. The anticodon loop, positioned at the bottom of the cloverleaf, spans exactly three unpaired nucleotides that base-pair with a complementary codon on mRNA during translation, and the acceptor stem at the top carries the amino acid attached by aminoacyl-tRNA synthetase.

Building Blocks of Proteins →

RNA-Dependent DNA Polymerase

/ ar-en-AY dih-PEN-dent dee-en-AY poh-LIM-er-ays /  ·  Scientific term used in molecular genetics.

Molecular GeneticsAdvanced

RNA-Dependent DNA Polymerase is an enzyme that synthesizes a new DNA strand by reading an RNA template, a reaction used by retroviruses and retrotransposons to convert RNA genetic information back into DNA.

This enzyme, universally called reverse transcriptase, was discovered independently by Howard Temin and David Baltimore in 1970 and overturned the assumption that genetic information flows only from DNA to RNA to protein. Retroviral reverse transcriptase carries three catalytic activities in a single polypeptide: RNA-dependent DNA polymerase activity to copy the RNA genome into a complementary DNA strand, RNase H activity to degrade the RNA strand of the resulting RNA-DNA hybrid, and DNA-dependent DNA polymerase activity to synthesize the second DNA strand. HIV reverse transcriptase introduces approximately one error per 10,000 nucleotides copied because it lacks a proofreading exonuclease, generating the genetic diversity that underlies rapid drug resistance.

The enzyme requires a primer to begin synthesis; HIV uses a host transfer RNA base-paired to the viral RNA genome as its primer. Antiretroviral drugs including tenofovir and emtricitabine inhibit reverse transcriptase and form the backbone of modern HIV treatment regimens.

Did you know?

Reverse transcriptase is not exclusive to viruses: human LINE-1 retrotransposons, which make up roughly 17 percent of the human genome, encode their own reverse transcriptase and use it to copy LINE-1 RNA back into DNA that can insert at new genomic locations, a process that has shaped the size and structure of the human genome over millions of years.

Building Blocks of Nucleic Acids →
Common misconception

Polymerases always use DNA as their template. RNA-dependent DNA polymerase reads an RNA strand as its template, producing a DNA product, which is the reverse of the direction described by the original central dogma.

Are Enzymes Proteins? →
Example in nature

HIV reverse transcriptase copies the approximately 9,700-nucleotide viral RNA genome into double-stranded DNA within hours of viral entry into a CD4-positive T cell. The DNA copy is then transported into the nucleus and integrated into a host chromosome, where it can persist for years in long-lived cellular reservoirs.

RNA-Dependent RNA Polymerase

/ ar-en-ay deh-PEN-dent ar-en-ay poh-LIM-er-ays /  ·  Ribonucleic Acid; Latin dependere; Greek polys, many; Greek meros; -ase

Molecular VirologyAdvanced
Also known as:RdRpRNA replicaseviral RNA polymerase

RNA-Dependent RNA Polymerase is an enzyme that synthesizes RNA using an RNA template, a reaction used by many RNA viruses during genome replication and transcription.

Most RNA viruses that replicate RNA genomes through RNA intermediates encode or package an RNA-dependent RNA polymerase, because uninfected human cells do not normally copy RNA from RNA. Positive-sense RNA viruses such as poliovirus translate the polymerase after entry, while negative-sense RNA viruses such as influenza must package polymerase complexes inside the virion so transcription can begin immediately. Retroviruses are a major exception because they use reverse transcriptase to copy RNA into DNA, and hepatitis delta virus exploits host RNA polymerase activity in an unusual RNA-templated replication strategy.

RNA-dependent RNA polymerases generally have lower fidelity than cellular DNA polymerases, although coronaviruses partially offset this limitation with nsp14 exonuclease proofreading. These enzymes are major antiviral drug targets, including the SARS-CoV-2 polymerase targeted by remdesivir.

Did you know?

Coronavirus polymerase complexes are unusually elaborate for RNA viruses. SARS-CoV-2 uses nsp12 as the catalytic polymerase together with cofactors nsp7 and nsp8, while nsp14 proofreading helps preserve a genome of about 30,000 nucleotides.

Are Enzymes Proteins? →
Common misconception

All polymerases copy DNA templates. RNA-dependent RNA polymerases copy RNA templates into RNA products, a reaction central to the replication of many RNA virus genomes.

Building Blocks of Nucleic Acids →
Example in nature

Poliovirus RNA-dependent RNA polymerase, called 3Dpol, replicates the approximately 7,500-nucleotide viral genome in infected human intestinal cells. Within about 8 hours, one infected cell can produce thousands of viral RNA copies and release large numbers of progeny virions.

How Do Viruses Reproduce? →