Genetics and Heredity Explained: DNA, Genes, Inheritance, and the Human Genome
A comprehensive guide to genetics and heredity — how DNA encodes genetic information, how genes are inherited through Mendelian and non-Mendelian patterns, what the Human Genome Project revealed, genetic variation, mutations, and the genetic basis of human traits and disease.
The Molecular Basis of Heredity
Every living organism inherits its characteristics from its parents through a molecule called deoxyribonucleic acid (DNA). DNA is a double helix — two complementary strands of nucleotides (adenine, thymine, guanine, cytosine) wound around each other — first described structurally by James Watson and Francis Crick in 1953, building on X-ray crystallography data from Rosalind Franklin and Maurice Wilkins.
A human cell contains approximately 3.2 billion base pairs of DNA, distributed across 46 chromosomes (23 pairs) in the nucleus. If stretched end-to-end, the DNA from a single human cell would be about 2 meters long — packaged through extraordinary compaction into a nucleus about 6 micrometers in diameter.
Genes: The Units of Heredity
A gene is a segment of DNA that encodes a functional product — usually a protein, but also functional RNA molecules. The human genome contains approximately 20,000–25,000 protein-coding genes, which collectively occupy only about 1.5% of the total genome. The remaining 98.5% includes regulatory sequences, non-coding RNA genes, repetitive elements, and regions whose functions remain incompletely understood (once dismissively called "junk DNA," much of which is now known to have regulatory roles).
The central dogma of molecular biology describes the flow of genetic information: DNA → RNA → Protein. Transcription converts DNA into messenger RNA (mRNA); translation converts mRNA into protein using the ribosome and transfer RNAs. The genetic code — the mapping of three-nucleotide codons to amino acids — is nearly universal across all life on Earth, evidence of common ancestry.
Mendelian Inheritance
Gregor Mendel, an Austrian monk, discovered the fundamental rules of inheritance through pea plant experiments in the 1860s — published in 1866 and largely ignored until rediscovery in 1900. Mendel's laws describe how discrete heritable units (now called genes) are passed from parents to offspring:
- Law of Segregation: Each individual carries two copies of each gene (one from each parent); these copies separate during gamete formation, so each gamete carries only one copy
- Law of Independent Assortment: Genes on different chromosomes assort independently during gamete formation (now known to apply to genes far apart on the same chromosome, with exceptions for linked genes)
Alleles are different versions of the same gene. When both alleles are identical, the individual is homozygous; when different, heterozygous. Dominant alleles produce their phenotype even when one copy is present; recessive alleles require two copies. Blood type (ABO gene with A, B, and O alleles) illustrates codominance (A and B are both expressed in AB individuals) and incomplete dominance.
| Inheritance Pattern | Description | Example |
|---|---|---|
| Autosomal dominant | One mutant copy sufficient for disease; 50% offspring affected | Huntington's disease, Marfan syndrome |
| Autosomal recessive | Two mutant copies required; carriers (one copy) unaffected | Cystic fibrosis, sickle cell disease, PKU |
| X-linked recessive | Gene on X chromosome; males (XY) more often affected | Hemophilia, Duchenne muscular dystrophy, red-green color blindness |
| Mitochondrial | Inherited through maternal mitochondrial DNA | Leber hereditary optic neuropathy |
| Multifactorial/polygenic | Multiple genes + environment | Height, intelligence, most common diseases |
The Human Genome Project
The Human Genome Project (HGP) was an international scientific effort launched in 1990 to sequence the complete human genome. Declared complete in April 2003 (with a few gaps), it produced the first reference human genome sequence — a 3.2 billion base pair sequence that remains the foundation of modern genetics and medicine.
Key findings of the HGP:
- Humans have far fewer protein-coding genes than expected: ~20,000–25,000, similar to a roundworm's ~19,000 and far fewer than a rice plant's ~32,000
- The vast majority of the genome (~98.5%) is non-coding
- Any two humans are ~99.9% identical at the DNA sequence level — human genetic diversity is remarkably low compared to other species
- About 8% of the human genome consists of endogenous retroviruses — remnants of ancient viral infections integrated into our ancestors' genomes
The cost of sequencing a human genome has plummeted from ~$3 billion (HGP) to under $200 today (2025), following a cost curve faster than Moore's Law. This democratization has enabled large-scale population genomics, direct-to-consumer genetic testing (23andMe, AncestryDNA), and genomic medicine.
Genetic Variation and Mutations
Genetic variation arises from mutations — changes to the DNA sequence. Mutations can be:
- Single nucleotide polymorphisms (SNPs): Single base changes; most common form of variation; ~10 million SNPs in the human genome
- Insertions/deletions (indels): Addition or removal of nucleotides; can cause frameshift mutations that alter the reading frame of a gene
- Copy number variations (CNVs): Duplications or deletions of large chromosomal segments; contribute to disease and evolution
- Chromosomal abnormalities: Trisomy 21 (Down syndrome), deletion of chromosome 5p (Cri-du-chat syndrome)
Most mutations are neutral (they don't affect protein function) or are repaired by sophisticated DNA repair machinery. Beneficial mutations are subject to positive natural selection; harmful mutations are purified from the population over generations. The balance between mutation (introducing variation) and selection (filtering it) is fundamental to evolution.
The Genetic Basis of Complex Traits
Most medically and socially important human traits — intelligence, height, susceptibility to heart disease, diabetes, schizophrenia — are polygenic: influenced by hundreds to thousands of genetic variants, each with tiny individual effects, plus environmental factors. Genome-wide association studies (GWAS) have identified thousands of such variants.
Polygenic risk scores (PRS) — algorithms that aggregate many small-effect variants — can predict disease risk for conditions like coronary artery disease, type 2 diabetes, and some cancers. However, their predictive power is limited, they perform differently across ancestry groups (most GWAS were conducted on European-ancestry individuals), and they capture only a fraction of the total heritability ("missing heritability" problem).
The emerging field of CRISPR-Cas9 gene editing, developed by Jennifer Doudna and Emmanuelle Charpentier (2020 Nobel Prize in Chemistry), enables precise editing of specific DNA sequences — with applications in treating genetic diseases (sickle cell disease cures via CRISPR were approved in 2023), agriculture, and basic research.
Related Articles
biology
How Evolution Works: Natural Selection, Genetics, and the Evidence for Common Descent
A comprehensive explanation of evolution by natural selection — Darwin's theory, the mechanisms of evolutionary change, the modern evolutionary synthesis, evidence from fossil records and genetics, and common misconceptions.
8 min read
biology
What Are Viruses? Structure, Replication, Classification, and How They Cause Disease
A comprehensive explanation of viruses — what they are, their structural components, the viral replication cycle, how they are classified, the difference between DNA and RNA viruses, how viruses cause disease, and how vaccines work against them.
8 min read
biology
What Is Natural Selection? Darwin's Theory Explained
Understand natural selection — the mechanism of evolution proposed by Charles Darwin, how it drives adaptation, and the evidence supporting it.
8 min read
biology
How Ecosystems Work: Energy Flow, Nutrient Cycles, and Biodiversity
A comprehensive guide to how ecosystems function — energy flow through trophic levels, nutrient cycling, food webs, ecological succession, and the factors that maintain ecosystem stability and resilience.
8 min read