Genetics Terms Starting With G

G

Genetics Glossary: G

Molecular GeneticsPopulation GeneticsEpigeneticsTransmission Genetics

Gene

/ JEEN /  ·  Greek: genos (birth, origin)

Molecular GeneticsIntro
Also known as:Genetic Locus

Gene is a specific sequence of DNA that encodes the information needed to produce a functional RNA molecule or protein and serves as the basic unit of heredity.

Genes occupy specific positions called loci on chromosomes and can exist in multiple variant forms called alleles. The total number of protein-coding genes in the human genome is approximately 20,000, a surprisingly small fraction of the roughly 3.2 billion base pairs of total DNA. Beyond protein-coding genes, the genome contains thousands of genes that encode functional non-coding RNAs with regulatory roles, including microRNAs and long non-coding RNAs.

Did you know?

The human genome contains roughly 20,000 protein-coding genes, a similar number to the roundworm Caenorhabditis elegans, demonstrating that organism complexity is not simply a function of gene number.

Common misconception

A gene is not simply any segment of DNA. The term specifically refers to a discrete functional unit that is transcribed and contributes to a molecular or phenotypic outcome.

Example in nature

The BRCA1 gene encodes a protein central to DNA double-strand break repair, and individuals carrying certain loss-of-function mutations face a lifetime breast cancer risk exceeding 70 percent. Different BRCA1 mutations can disrupt DNA repair through distinct molecular mechanisms, producing varying degrees of cancer risk.

Gene Editing

/ jeen ED-ih-ting /  ·  English: gene + editing

Molecular GeneticsAdvanced
Also known as:Genome Editing

Gene Editing is the precise modification of DNA sequences within a genome using engineered molecular tools to insert, delete, or replace specific nucleotide sequences.

Modern gene editing uses programmable nucleases such as CRISPR-Cas9, TALENs, and zinc finger nucleases to cut DNA at targeted sequences. The cell repairs the resulting break through pathways that can knock out a gene, correct a mutation, or insert new sequences at the cut site. Gene editing holds therapeutic promise for treating inherited disorders, cancers, and infectious diseases by correcting disease-causing mutations directly in patient cells.

Did you know?

CRISPR-based gene editing has been used to eliminate the HIV genome from infected cells in laboratory models, representing a potential path toward a functional cure.

Most Deadliest Bacterial Diseases →
Common misconception

Gene editing and gene therapy are not the same. Gene therapy introduces a functional gene copy alongside the defective one, while gene editing directly corrects or disables the target DNA sequence.

Example in nature

In 2023, the first CRISPR gene editing therapy received regulatory approval for sickle cell disease, modifying patients' own bone marrow stem cells to reactivate fetal hemoglobin production. Clinical trial data showed that more than 90 percent of treated patients experienced no severe pain crises in the year following the procedure.

Gene Expression

/ jeen ek-SPRESH-un /  ·  Latin: expressio (a pressing out)

Molecular GeneticsIntro

Gene Expression is the process by which information encoded in a gene is used to produce a functional product, either a protein or a functional RNA molecule.

Gene expression involves two main steps: transcription, in which the DNA sequence is copied into RNA, and translation, in which the mRNA is decoded to assemble a protein. Expression is tightly regulated in time, space, and magnitude, so different cell types produce distinct sets of proteins from the same genome. Misregulation of gene expression is a central feature of cancer, where oncogenes become overexpressed and tumor suppressor genes are silenced.

Did you know?

Every cell in the human body contains the same DNA, yet a neuron and a liver cell look and function completely differently because they express different subsets of their genes.

Discover Kupffer Cells →
Common misconception

Gene expression does not always result in protein production. Many genes are transcribed into non-coding RNAs that regulate other genes without ever being translated into protein.

Example in nature

Insulin is produced only in pancreatic beta cells because the insulin gene is expressed exclusively in those cells, even though the gene is present in every cell of the body. Cell-specific transcription factors, including PDX1, bind regulatory regions of the insulin gene and activate transcription only in the pancreatic beta cell environment.

Gene Flow

/ JEEN floh /  ·  English: gene + flow

Population GeneticsIntermediate
Also known as:MigrationAllele Flow

Gene Flow is the transfer of alleles from one population to another through the movement and successful reproduction of individuals or gametes between populations.

Gene flow homogenizes allele frequencies between populations, counteracting the diverging effects of genetic drift and natural selection. High levels of gene flow between populations reduce genetic differentiation and can prevent speciation, while restricted gene flow allows populations to diverge over time. Pollen dispersal in plants and migration in animals are the primary mechanisms of gene flow in nature, and pollen grains of some pine species travel more than 600 kilometers on wind currents.

Did you know?

In humans, gene flow has been so extensive throughout history that all modern human populations share the vast majority of their genetic variation, with only minor differences in allele frequencies.

Common misconception

Gene flow does not only occur through migration of whole organisms. In plants, pollen carried by wind or pollinators can transfer alleles across vast distances between otherwise separated populations.

Example in nature

The spread of antibiotic resistance alleles from clinical bacterial strains into environmental bacterial populations through horizontal gene transfer is a form of gene flow with serious public health consequences. A single resistance plasmid can transfer between bacterial cells in under an hour, rapidly shifting allele frequencies across geographically distant populations.

Gene Pool

/ JEEN pool /  ·  English: gene + pool

Population GeneticsIntro

Gene Pool is the complete collection of all alleles present in all individuals of an interbreeding population at a given time.

The gene pool represents the total genetic diversity available within a population for selection and inheritance. A large, diverse gene pool provides raw material for adaptation to environmental change, while a restricted gene pool limits evolutionary potential. The composition of the gene pool changes over time through mutation, selection, genetic drift, and gene flow, with each force shifting allele frequencies in distinct ways.

Did you know?

When a species is reduced to a very small population, the loss of alleles from the gene pool can be irreversible even if population numbers later recover, permanently limiting genetic diversity.

Common misconception

The gene pool does not include alleles from different species. It refers only to the alleles present within a single interbreeding population of the same species.

Example in nature

Conservation programs for the Florida panther (Puma concolor coryi) introduced eight Texas pumas in 1995 to expand the gene pool and restore genetic diversity, successfully reversing signs of inbreeding depression. Within one generation, kinked tails and heart defects, both markers of inbreeding, dropped sharply in frequency among Florida panther kittens.

Genetic Code

/ jeh-NET-ik kohd /  ·  Greek: genesis + Latin: codex (tablet)

Molecular GeneticsIntro
Also known as:Triplet Code

Genetic Code is the set of rules by which nucleotide triplets in messenger RNA are translated into specific amino acids during protein synthesis.

The code consists of 64 codons, three of which are stop signals and 61 of which specify amino acids. Because 61 codons encode only 20 amino acids, most amino acids are specified by more than one codon, making the code degenerate but not ambiguous. The genetic code is nearly universal across all life, with only minor variations found in mitochondria and some microorganisms such as Mycoplasma species.

Did you know?

The genetic code was fully deciphered between 1961 and 1966 through the work of Marshall Nirenberg, Har Gobind Khorana, and Robert Holley, who shared the Nobel Prize in Physiology or Medicine in 1968.

Building Blocks of Proteins →
Common misconception

The genetic code is not the same as the genome. The genome is the full DNA sequence of an organism, while the genetic code is the universal translation table mapping codons to amino acids.

Example in nature

The codon AUG serves as the universal start signal and encodes methionine in virtually all organisms, providing the anchor point from which every protein sequence is read. Because the code is shared so broadly, human insulin has been produced commercially since 1982 by inserting the human insulin gene into Escherichia coli, which reads and translates the human codons accurately.

Genetic Drift

/ jeh-NET-ik drift /  ·  Greek: genesis + Old Norse: drift

Population GeneticsIntermediate
Also known as:Random Genetic DriftAllelic Drift

Genetic Drift is the random change in allele frequencies in a population from one generation to the next because of chance events in reproduction rather than natural selection.

Genetic drift has a greater effect in small populations because random sampling error is proportionally larger when fewer individuals reproduce. Drift can cause alleles to become fixed or lost entirely regardless of their fitness effects, and in a population of size N, a neutral allele fixes with a probability equal to its current frequency. Over many generations, drift reduces genetic diversity within populations and increases genetic differentiation between isolated populations.

Did you know?

In small populations, genetic drift can cause harmful alleles to become fixed and beneficial alleles to be lost purely by chance, acting against the direction of natural selection.

Common misconception

Genetic drift is not the same as natural selection. Selection is a non-random process driven by fitness differences, while drift is entirely random and independent of an allele's effect on survival or reproduction.

Example in nature

The cheetah (Acinonyx jubatus) shows such extreme genetic uniformity that skin grafts between unrelated individuals are rarely rejected, a consequence of genetic drift following a severe population bottleneck estimated to have occurred roughly 10,000 years ago. Random allele loss during that bottleneck reduced heterozygosity across the cheetah genome to levels far below those seen in other large felids.

Genome

/ JEE-nohm /  ·  Greek: genes (birth) + -ome (complete set)

Molecular GeneticsIntro

Genome is the complete set of genetic material in an organism, including all of its DNA sequences, protein-coding genes, regulatory elements, and non-coding regions.

In eukaryotes, the genome is organized into chromosomes and includes both the nuclear genome and the much smaller mitochondrial genome. The human genome contains about 3.2 billion base pairs, yet only about 1.5 percent of it encodes proteins. Whole-genome sequencing has revealed that a large fraction consists of transposable elements, repetitive sequences, and regulatory non-coding DNA.

These non-coding regions are not genetic “junk”; many regulate when and where genes are expressed.

Did you know?

The marbled lungfish (Protopterus aethiopicus) holds the record for the largest known animal genome, at roughly 130 billion base pairs, more than 40 times the size of the human genome, yet it encodes no more biological complexity.

Common misconception

The genome is not the same as the transcriptome or proteome. The genome is the static blueprint in DNA, while the transcriptome and proteome represent the dynamic subset of genes expressed as RNA and protein at a given time.

Building Blocks of Nucleic Acids →
Example in nature

Comparing the genomes of humans and chimpanzees (Pan troglodytes) reveals about 98.7 percent sequence identity, yet the two species differ dramatically in cognition, language, and behavior. Small sequence differences and changes in gene regulation can still produce large effects on phenotype.

Genomic Imprinting

/ jeh-NOH-mik im-PRIN-ting /  ·  Greek: genesis + Old French: empreindre (to impress)

EpigeneticsAdvanced
Also known as:Parental Imprinting

Genomic Imprinting is an epigenetic phenomenon in which a gene is expressed exclusively from either the maternally or paternally inherited chromosome, determined by the parent of origin.

Imprinted genes are marked during gamete formation by sex-specific epigenetic modifications, usually DNA methylation, that are maintained after fertilization. Loss of normal imprinting patterns causes several human developmental disorders, including Prader-Willi syndrome and Angelman syndrome, which arise from abnormalities in the same chromosomal region depending on which parental copy is affected. About 100 genes in the human genome are subject to genomic imprinting, and many of them influence fetal growth and brain development.

The insulin-like growth factor 2 gene (IGF2) is a well-studied example, expressed only from the paternal chromosome in humans and mice.

Did you know?

Genomic imprinting is thought to have evolved through a conflict between maternal and paternal genomes over the allocation of resources to offspring. The paternal genome "favors" greater resource extraction from the mother, while the maternal genome "favors" more restrained fetal growth, a tension reflected in the opposing effects of paternally and maternally expressed imprinted genes.

Common misconception

Genomic imprinting does not mean only one copy of a gene is present. Both copies are inherited normally, but one is epigenetically silenced based on which parent contributed it.

Example in nature

Prader-Willi syndrome results from loss of the paternally expressed copy of chromosome 15q11-q13, while Angelman syndrome results from loss of the maternally expressed copy of the same region. A deletion of identical size on the same chromosome produces two entirely different clinical disorders depending solely on which parent transmitted the affected chromosome.

Genomics

/ jeh-NOH-miks /  ·  Greek: genesis + -ics (study of)

Molecular GeneticsIntermediate

Genomics is the study of the structure, function, evolution, mapping, and editing of complete genomes, encompassing all DNA sequences within an organism.

Genomics emerged as a field following advances in high-throughput DNA sequencing that made it feasible to determine complete genome sequences. Comparative genomics reveals evolutionary relationships and identifies conserved functional elements by comparing genomes across species. Applied genomics underpins precision medicine, crop improvement, and microbial biotechnology by linking genotype to phenotype at genome scale.

The first complete genome of a free-living organism, the bacterium Haemophilus influenzae, was published in 1995 and spanned roughly 1.8 million base pairs.

Did you know?

The first complete genome sequenced was that of bacteriophage phi-X174 in 1977 by Frederick Sanger's team, covering just 5,386 base pairs. That single sequencing run required months of manual work that modern instruments now complete in seconds.

Biological Databases →
Common misconception

Genomics is not the same as genetics. Genetics studies the inheritance and variation of specific genes, while genomics takes a comprehensive whole-genome view of all genetic material simultaneously.

Example in nature

Metagenomics, a branch of genomics, sequences DNA directly from environmental samples without culturing organisms, revealing microbial diversity invisible to traditional microbiology. A single liter of ocean water can contain genetic material from thousands of distinct microbial species, most of which have never been grown in a laboratory.

Genotype

/ JEE-noh-typ /  ·  Greek: genos (birth) + typos (type)

Transmission GeneticsIntro

Genotype is the specific combination of alleles an organism carries at one or more genetic loci, representing its genetic constitution irrespective of physical appearance.

Genotype is distinguished from phenotype, which is the observable physical or biochemical expression of those alleles. Two organisms can share the same phenotype but carry different genotypes, as when a homozygous dominant individual and a heterozygous individual both display the dominant trait. Molecular tools such as PCR and DNA sequencing determine genotypes directly, independent of phenotypic observation.

In snapdragons (Antirrhinum majus), for example, flower color phenotype alone cannot distinguish homozygous red from heterozygous red plants without genetic testing.

Did you know?

Genotyping has revealed that roughly one in 25 people of Northern European descent carries a single recessive allele for cystic fibrosis without showing any symptoms, because one functional copy of the CFTR gene is sufficient for normal chloride transport.

Common misconception

Genotype and phenotype are not interchangeable terms. Genotype refers to the alleles present in the DNA, while phenotype refers to the observable traits those alleles produce in a specific environment.

Example in nature

Genetic testing of prospective parents can determine whether both carry a recessive allele for cystic fibrosis, predicting a one-in-four chance that each child will be affected. The risk calculation depends on Mendelian inheritance for that recessive condition.

Germline Mutation

/ JERM-line myoo-TAY-shun /  ·  German Keimbahn, germ track; Latin mutare, to change

Molecular GeneticsIntermediate
Also known as:heritable mutationconstitutional mutationgerm cell mutation

Germline Mutation is a genetic alteration that occurs in reproductive cells such as sperm or egg, or their precursors, making it heritable and present in every cell of the resulting offspring.

Because germline mutations are transmitted to offspring, they are the source of heritable genetic diseases such as cystic fibrosis, Huntington’s disease, and hereditary breast cancer, as well as the raw material for evolutionary change across generations. Germline mutations arise spontaneously through replication errors, environmental mutagen exposure, or transposon activity; the mutation rate is approximately one in 100 million bases in humans, equivalent to about 30 to 100 new mutations per person per generation. Unlike somatic mutations, germline mutations are subject to natural selection and can spread through a population if they confer a fitness advantage.

Paternal age is a significant factor: older fathers contribute more de novo germline mutations because sperm stem cells accumulate replication errors over decades.

Did you know?

De novo germline mutations, arising fresh in a single individual rather than inherited from a parent, account for a significant fraction of cases of autism spectrum disorder, schizophrenia, and intellectual disability. Studies estimate that each year of paternal age adds roughly one to two additional de novo mutations to the offspring's genome.

Common misconception

Germline mutations are not always harmful. The vast majority are neutral, and a small fraction are beneficial, providing the genetic variation that natural selection acts on to drive adaptive evolution.

Example in nature

The CCR5-delta32 deletion, a germline mutation present in about 10 percent of Northern Europeans, arose thousands of years ago and confers substantial resistance to HIV infection in homozygous carriers. Because it is transmitted through the germline, each carrier can pass the protective allele to offspring.

Guanine

/ GWAH-neen /  ·  Spanish: guano (dung, where it was first isolated)

Molecular GeneticsIntro

Guanine is a purine nitrogenous base that pairs with cytosine in both DNA and RNA through three hydrogen bonds, the strongest base-pairing interaction among the four standard nucleobases.

Guanine is one of the five main nucleobases found in nucleic acids, alongside adenine, cytosine, thymine, and uracil. Its three-hydrogen-bond pairing with cytosine makes GC-rich regions of DNA more thermally stable than AT-rich regions, a property exploited in PCR primer design. Guanine-rich sequences can also fold into four-stranded G-quadruplex structures found at telomeres and gene promoters, where they influence genome stability and transcription.

These G-quadruplex structures have attracted attention as targets for anti-cancer drugs because many oncogene promoters are guanine-rich.

Did you know?

Guanosine triphosphate, the guanine-containing nucleotide, donates energy in several key biochemical reactions, including signal transduction through G-proteins and elongation during protein synthesis at the ribosome. Unlike ATP, which powers most cellular work, GTP is specifically required by tubulin during microtubule polymerization.

Translation Biology →
Common misconception

Guanine is not the same as guanosine. Guanosine is the nucleoside formed when guanine bonds to a ribose sugar, while guanine refers only to the base itself.

Example in nature

Regions of the KRAS gene promoter containing guanine-rich sequences form G-quadruplex structures that suppress transcription, and small molecules designed to stabilize these structures have shown anti-tumor activity in cell culture studies. KRAS mutations drive roughly 25 percent of all human cancers, making its promoter G-quadruplex a target of active drug development.