Molecular Biology Terms Starting With D
Molecular Biology Glossary: D
Dicer
/ DY-ser / · English: to dice (cut into small pieces)
Dicer is an RNase III endonuclease that cleaves long double-stranded RNA and pre-microRNA hairpin precursors into short double-stranded fragments of approximately 21 to 23 nucleotides that initiate RNA interference pathways.
Dicer recognizes pre-miRNA hairpins through a PAZ domain that anchors the 3-prime end of the precursor, positioning the RNase III catalytic domains to cut both strands approximately 22 nucleotides from that anchor point, generating duplexes with characteristic 2-nucleotide 3-prime overhangs. One strand of the resulting small RNA duplex is loaded into the RNA-induced silencing complex (RISC), where it guides sequence-specific recognition of complementary mRNA targets, typically achieving 5- to 20-fold repression of target transcript levels. Beyond miRNA processing, Dicer also cleaves long double-stranded RNA derived from viruses or transposons into small interfering RNAs (siRNAs) that direct cleavage of matching transcripts.
Dicer knockout mice die at approximately embryonic day 7.5 due to failure of miRNA-mediated regulation across developmental gene networks, demonstrating that Dicer activity is required for early mammalian development. Mutations in the human DICER1 gene are associated with a tumor predisposition syndrome that includes pleuropulmonary blastoma, ovarian Sertoli-Leydig cell tumors, and multinodular goiter.
In the nematode Caenorhabditis elegans, the Dicer ortholog DCR-1 was among the first proteins shown to be required for RNA interference in an animal, a finding reported by Knight and Bass in 2001. C. elegans lacking DCR-1 cannot silence transposons in the germline, leading to transposon mobilization and genome instability across generations.
RNA interference happens without enzymes. Dicer is one key enzyme that processes double-stranded RNA precursors into the small RNAs that guide silencing complexes to their targets.
Are Enzymes Proteins? →In the fruit fly Drosophila melanogaster, two distinct Dicer proteins divide the labor of small RNA biogenesis: Dicer-1 processes pre-miRNA hairpins into mature miRNAs, while Dicer-2 processes long double-stranded RNA into siRNAs. Each protein is approximately 200 kilodaltons in size, and genetic loss of either one causes distinct, non-overlapping defects in gene silencing.
DNA Binding Domain
/ dee-en-AY BYN-ding doh-MAYN / · Scientific term used in gene regulation.
DNA Binding Domain is the discrete structural region of a protein that recognizes and contacts specific DNA sequences or structural features, typically within the major groove of the double helix.
Common DNA binding domain architectures include zinc fingers, which use a zinc ion to stabilize a finger-shaped projection that inserts into the major groove; helix-turn-helix motifs, in which one alpha helix makes sequence-specific contacts with bases; and basic leucine zippers, in which a leucine-rich dimerization region positions a basic DNA-contacting segment. Each domain type achieves specificity through precise hydrogen bonds and van der Waals contacts between amino acid side chains and the edges of base pairs exposed in the major groove. A single zinc finger contacts approximately 3 base pairs, and proteins such as the transcription factor Sp1 chain together three zinc fingers to recognize a 9-base-pair GC-rich sequence with high affinity.
The modular nature of these domains has made them the basis of engineered DNA-binding proteins, including zinc finger nucleases used in early gene-editing experiments.
Researchers have engineered arrays of zinc finger domains to target virtually any chosen DNA sequence, a strategy that preceded CRISPR-based editing. Each engineered finger in the array recognizes a specific 3-base-pair triplet, and linking six fingers together produces a protein that binds an 18-base-pair sequence, long enough to specify a unique site in the human genome.
An entire protein surface contacts DNA equally during binding. Sequence-specific recognition is concentrated in defined structural domains, while other regions of the same protein mediate transcriptional activation, cofactor recruitment, or signal sensing.
The glucocorticoid receptor's zinc finger DNA binding domain contacts a specific 15-base-pair hormone response element in the promoters of target genes. Two receptor monomers dimerize through their DNA binding domains and together span both half-sites of the palindromic response element, separated by exactly 3 base pairs.
DNA Binding Protein
/ dee-en-AY BYN-ding PROH-teen / · Scientific term used in gene regulation.
DNA Binding Protein is any protein that forms noncovalent interactions with DNA to carry out functions including transcription regulation, replication, repair, and genome packaging.
These proteins contact DNA through hydrogen bonds to base edges, electrostatic interactions between positively charged amino acids and the negatively charged phosphate backbone, and van der Waals contacts with the sugar-phosphate scaffold. Some DNA binding proteins recognize specific sequences: the lac repressor of Escherichia coli binds a 21-base-pair operator sequence with an affinity roughly 1,000-fold greater than its affinity for nonspecific DNA. Others bind without sequence preference; histone proteins wrap approximately 147 base pairs of DNA around an octamer core to form nucleosomes, compacting the roughly 2-meter human genome into a nucleus about 6 to 10 micrometers in diameter.
DNA repair proteins such as XPA and RPA recognize structural distortions rather than specific sequences, binding single-stranded or damaged DNA to initiate repair.
The single-stranded DNA binding protein of bacteriophage T4, called gene 32 protein, was one of the first DNA binding proteins characterized biochemically, in work by Bruce Alberts and colleagues in the early 1970s. Gene 32 protein coats single-stranded DNA with such high cooperativity that a single molecule binding to DNA dramatically increases the probability that neighboring molecules will also bind, a property that proved important for understanding how replication forks are stabilized.
Every DNA binding protein recognizes a specific DNA sequence. Many DNA binding proteins, including histones, single-stranded DNA binding proteins, and some repair factors, bind DNA without any sequence preference, relying instead on backbone contacts or structural features of the DNA.
During replication of the bacteriophage T4 genome, the gene 32 single-stranded DNA binding protein coats exposed single-stranded regions at replication forks, with each protein monomer covering approximately 7 nucleotides. Roughly 10,000 copies of gene 32 protein are present in a single infected Escherichia coli cell, enough to coat all single-stranded DNA generated during phage replication.
Difference Between Prokaryotic and Eukaryotic Cells →DNA Double Helix
/ dee-en-ay DUB-ul HEE-liks / · Deoxyribonucleic Acid; Latin duplus, double; Greek helix, spiral
DNA Double Helix is the three-dimensional structure of DNA in which two antiparallel polynucleotide strands wind around a common axis in a right-handed spiral, held together by hydrogen bonds between complementary base pairs.
Each strand consists of a repeating sugar-phosphate backbone with one of four bases, adenine, thymine, guanine, or cytosine, attached to each deoxyribose sugar. Adenine pairs with thymine through two hydrogen bonds, while guanine pairs with cytosine through three hydrogen bonds, a difference that makes GC-rich regions of DNA more thermally stable than AT-rich regions. The two strands run antiparallel, meaning the 5-prime end of one strand aligns with the 3-prime end of its partner.
James Watson and Francis Crick published the double helix model in 1953, drawing on X-ray diffraction data from Rosalind Franklin and Maurice Wilkins; the structure explained both how genetic information is stored and how it can be faithfully copied. Each complete helical turn spans approximately 3.4 nanometers and contains 10 base pairs, giving the helix a uniform diameter of 2 nanometers.
The B-form double helix described by Watson and Crick is the predominant structure under physiological conditions, but DNA can also adopt A-form and Z-form helices depending on sequence and environment. Z-DNA, which winds in a left-handed direction, forms transiently in GC-rich sequences during transcription and may influence gene regulation, though its precise biological roles are still being investigated.
DNA is a flat, ladder-like structure. The molecule is a three-dimensional right-handed helix with precise geometric constraints, including a major groove approximately 2.2 nanometers wide and a minor groove approximately 1.2 nanometers wide, each recognized by different classes of DNA binding proteins.
The bacterium Thermus aquaticus, which lives in hot springs at temperatures near 75 degrees Celsius, maintains a functional double helix under conditions that would denature most DNA. Its genome is enriched in GC base pairs, which form three hydrogen bonds each and raise the melting temperature of the helix, allowing the organism to replicate its DNA at temperatures lethal to most other organisms.
DNA Helicase
/ dee-en-ay HEL-ih-kays / · DNA + Greek: helikos (spiral) + -ase
DNA Helicase is an enzyme that separates the two strands of the DNA double helix by breaking hydrogen bonds between base pairs, using energy from ATP hydrolysis to drive directional movement along the DNA.
The major replicative helicase in eukaryotes is the MCM2-7 complex, a ring-shaped hexamer that is loaded onto origins of replication during G1 phase and encircles one DNA strand while translocating to displace the complementary strand. Single-strand binding proteins stabilize the separated strands behind the helicase, and topoisomerases relieve the torsional stress that builds ahead of the moving fork. The bacteriophage T7 gene 4 helicase unwinds DNA at a rate of approximately 130 base pairs per second at 37 degrees Celsius, a rate comparable to eukaryotic replicative helicases.
Mutations in specialized helicases cause Werner syndrome, Bloom syndrome, and Fanconi anemia, all characterized by genomic instability and elevated cancer risk.
The MCM2-7 helicase complex is loaded onto human DNA origins in roughly twice the number needed for replication, creating a reserve of dormant helicases that can be activated if a replication fork stalls. This excess licensing, established during G1 phase, provides a backup mechanism that helps cells complete genome duplication even when individual forks encounter obstacles such as DNA damage or tightly bound proteins.
DNA strands separate spontaneously during replication because the hydrogen bonds between bases are weak. Thermal energy alone is insufficient to unwind DNA at the replication fork at physiological temperatures; helicase enzymes use ATP hydrolysis to drive strand separation directionally and at rates compatible with cell division.
Are Enzymes Proteins? →During replication of the Escherichia coli chromosome, the DnaB helicase unwinds approximately 1,000 base pairs per second at the replication fork. The entire 4.6-million-base-pair genome is replicated from a single origin in roughly 40 minutes under optimal growth conditions, a rate that requires continuous helicase activity throughout the process.
DNA Ligase
/ dee-en-ay LY-gays / · DNA + Latin: ligare (to bind) + -ase
DNA Ligase is an enzyme that seals single-stranded nicks in double-stranded DNA by forming a phosphodiester bond between adjacent 3-prime hydroxyl and 5-prime phosphate termini, making it indispensable for DNA replication and repair.
During lagging-strand replication, DNA ligase joins Okazaki fragments by sealing the nicks left after RNA primer removal and gap filling by DNA polymerase. In base excision repair and nucleotide excision repair, ligase is the final enzyme that restores the intact phosphodiester backbone after the damaged region has been excised and resynthesized. Bacteriophage T4 DNA ligase, which can also join blunt-ended DNA fragments, is one of the most widely used enzymes in molecular cloning, where researchers use it to insert foreign DNA into plasmid vectors.
Unlike the bacterial enzyme, T4 DNA ligase requires ATP as a cofactor rather than NAD+, a distinction that affects how ligation reactions are set up in the laboratory.
Nick sealing by DNA ligase depends on a cofactor to activate the enzyme before it can form a phosphodiester bond. Bacteria use NAD+ for this activation step, while human DNA ligases and bacteriophage T4 ligase use ATP, and the AMP released during the reaction becomes transiently linked to the enzyme itself as part of the catalytic mechanism.
DNA ligase copies DNA. It joins DNA ends after other enzymes have copied or repaired the strand.
Are Enzymes Proteins? →In fluorescence-based ligation assays, researchers use human DNA ligase I to seal nicks in reconstituted Okazaki fragment substrates in vitro. Under optimized conditions, a single nick-sealing reaction can occur in less than 1 second, fast enough to keep pace with lagging-strand maturation during replication.
DNA Probe
/ dee-en-ay PROHB / · Deoxyribonucleic Acid; Latin probare, to test
DNA Probe is a short, labeled single-stranded DNA molecule designed to hybridize with a specific complementary sequence in a sample, allowing researchers to detect the presence, location, or abundance of a target gene or transcript.
When a sample is denatured to separate DNA into single strands and then incubated with a probe, the probe binds its complementary target through Watson-Crick base pairing in a process called hybridization. Probes carry detectable labels, most commonly radioactive isotopes such as phosphorus-32, fluorescent dyes, or chemiluminescent groups, so that bound probe can be visualized after unbound probe is washed away. Southern blotting, fluorescence in situ hybridization (FISH), and microarray platforms all depend on probe hybridization to identify specific sequences within complex genomic or transcriptomic samples.
Probe length and GC content determine hybridization stringency; probes of 20 to 50 nucleotides are common in diagnostic applications because they balance specificity with efficient binding kinetics.
Probes can hybridize to RNA targets as well as DNA targets. In Northern blotting, a DNA probe binds complementary mRNA sequences to reveal which transcripts are present in a tissue sample, and the same hybridization chemistry underlies in situ hybridization experiments that map gene expression patterns within intact tissue sections.
Building Blocks of Nucleic Acids →A DNA probe cuts DNA like an enzyme. Probes detect sequences through complementary binding, not enzymatic cleavage.
Are Enzymes Proteins? →FISH uses fluorescent DNA probes to bind specific chromosome regions in intact human cells. Probes targeting the HER2 locus on chromosome 17 are routinely used in breast cancer diagnostics, where amplification of that locus appears as clusters of fluorescent signals rather than the normal two dots per cell.
DNA Repair
/ dee-en-ay reh-PAIR / · DNA + Latin: reparare (to make ready again)
DNA Repair is the collection of cellular mechanisms that detect and correct chemical damage or sequence errors in DNA, preserving genome integrity across cell generations.
Human cells sustain an estimated 10,000 to 100,000 DNA lesions per cell per day from sources including reactive oxygen species generated by normal metabolism, ultraviolet radiation, and spontaneous hydrolysis. Base excision repair corrects small oxidized or alkylated bases, nucleotide excision repair removes bulky helix-distorting lesions such as UV-induced pyrimidine dimers, and mismatch repair corrects replication errors that escape polymerase proofreading. Double-strand breaks, the most dangerous lesion type, are resolved by homologous recombination in dividing cells or by non-homologous end joining throughout the cell cycle.
Inherited defects in these pathways cause recognizable human diseases: loss of nucleotide excision repair causes xeroderma pigmentosum, loss of mismatch repair underlies Lynch syndrome, and mutations in BRCA1 or BRCA2 impair homologous recombination and sharply elevate breast and ovarian cancer risk.
The 2015 Nobel Prize in Chemistry was awarded to Tomas Lindahl, Paul Modrich, and Aziz Sancar for mapping the molecular mechanisms of base excision repair, mismatch repair, and nucleotide excision repair, respectively. Lindahl's work began in the 1970s when he demonstrated that DNA is chemically unstable under physiological conditions, overturning the assumption that the molecule was inherently stable inside cells.
DNA damage always becomes a mutation. Many lesions are corrected before the damaged strand is copied, so the original sequence is restored without any permanent change.
The bacterium Deinococcus radiodurans (commonly called the "radiation-resistant bacterium") can survive doses of ionizing radiation exceeding 5,000 gray, roughly 1,000 times the lethal dose for humans, by reassembling its shattered chromosome through an unusually efficient double-strand break repair system. Researchers have counted more than 100 double-strand breaks per genome in irradiated cells of this organism, all of which are accurately rejoined within hours.
DNA Supercoiling
/ dee-en-ay SOO-per-koy-ling / · Deoxyribonucleic Acid; Latin super, above; Old French coiller, to coil
DNA Supercoiling is the introduction of additional helical turns into, or the removal of turns from, the DNA double helix, producing either positively supercoiled DNA that is overwound or negatively supercoiled DNA that is underwound relative to the relaxed B-form helix.
The relaxed B-form double helix has one helical turn every 10.5 base pairs. When RNA polymerase or a replication fork advances along DNA, it generates positive supercoils ahead of the moving complex and negative supercoils behind it, because the strands cannot freely rotate around each other in a chromosome. Topoisomerases resolve this tension: type I topoisomerases cut one strand and allow controlled rotation to relax supercoils, while type II topoisomerases cut both strands and pass a second duplex through the break.
In bacteria, DNA gyrase, a type II topoisomerase, actively introduces negative supercoils using ATP hydrolysis, maintaining the chromosome in a negatively supercoiled state that lowers the energy cost of strand separation during transcription and replication. Fluoroquinolone antibiotics such as ciprofloxacin kill bacteria by trapping gyrase in a covalent complex with cut DNA, preventing religation and causing lethal double-strand breaks.
Eukaryotic cells package DNA into nucleosomes partly to manage supercoiling. Each nucleosome wraps approximately 147 base pairs of DNA around a histone octamer, and this wrapping constrains negative supercoils, reducing the topological stress that would otherwise accumulate in long chromosomal loops.
DNA is stored as a relaxed straight helix. Chromosomal DNA is almost always supercoiled, and bacteria maintain a specific level of negative supercoiling that is actively regulated by topoisomerases.
The circular chromosome of Escherichia coli (E. coli) is maintained at a superhelical density of approximately negative 0.06, meaning about 6 negative supercoils per 100 helical turns. DNA gyrase continuously replenishes this supercoiling as transcription and replication consume it, and inhibiting gyrase with nalidixic acid halts bacterial growth within minutes.
E-coli →Downstream Sequence
/ DOWN-streem SEE-kwens / · English: downstream + sequence
Downstream Sequence is a DNA or RNA region located on the 3-prime side of a reference point such as a transcription start site or coding sequence, oriented in the same direction as transcription or translation proceeds.
By convention, downstream refers to the direction of transcription from the 5-prime to 3-prime end of the coding strand, with increasing nucleotide position numbers assigned as one moves further downstream. Regulatory elements found downstream of coding sequences include 3-prime untranslated regions, polyadenylation signals, and transcription terminator sequences that govern the end of transcription and the processing of the mRNA 3-prime end. The canonical polyadenylation signal, AAUAAA in mammals, sits approximately 10 to 30 nucleotides upstream of the cleavage site, which is itself a downstream element relative to the stop codon.
In cell signaling, the term downstream is borrowed metaphorically to describe later steps in a cascade, such as kinases or transcription factors that receive signals from upstream receptors, a usage that has no physical relationship to DNA orientation.
The position numbering system used to describe downstream sequences assigns positive integers to nucleotides 3-prime of a reference point and negative integers to nucleotides 5-prime of it, with the reference point itself designated +1. This convention, established for transcription start sites, means that the first transcribed nucleotide is +1 and the nucleotide immediately before it is -1, with no position zero in the standard system.
Downstream always means physically lower on a chromosome diagram. Whether a sequence is downstream depends entirely on the orientation of the gene being referenced, not on its position on the page or chromosome map.
In the human beta-globin gene, a polyadenylation signal located downstream of the stop codon directs cleavage and polyadenylation of the pre-mRNA. The signal lies roughly 900 nucleotides beyond the coding region, and mutations in it can reduce mature beta-globin mRNA enough to cause beta-thalassemia.
