A focused review of molecular genetics for child neurology — emphasizing trinucleotide repeats, tissue-specific splicing, and the molecular consequences of variant types on gene function.
Tags: Basic Genetics
The human genome contains approximately 3.2 billion base pairs organized into 46 chromosomes, but only ~1.5% encodes protein. The remaining ~98.5% includes regulatory elements, non-coding RNA genes (20,000–25,000), introns, and repetitive sequences — much of which is increasingly recognized as functionally relevant. The genetic code is a degenerate triplet code: 64 codons specify 20 amino acids plus 3 stop signals, with degeneracy partially buffering synonymous substitutions.
Key Points
A three-tier fidelity system (base selection, proofreading, mismatch repair) reduces the replication error rate to ~1 in 10⁹–10¹⁰ per base per division. Despite this, the germline accumulates ~60–70 de novo SNVs per generation (~1–2 per genome per cell division), providing the substrate for both evolution and de novo genetic disease. Trinucleotide repeat expansions — a major class of neurological disease — arise from replication slippage at tandem repeat sequences, with expansion size often increasing across generations (anticipation).
Key Points
Splicing — the removal of introns and ligation of exons — is clinically the most important step in mRNA processing. Splicing is directed by conserved consensus sequences at the 5' splice donor (GT) and 3' splice acceptor (AG) sites flanking each intron. Approximately 10–15% of disease-causing variants affect splicing, making splice prediction a critical skill in variant interpretation.
Key Points
Mature mRNA is exported to the cytoplasm where ribosomes translate it codon-by-codon into a polypeptide chain. The AUG start codon is recognized by the 43S pre-initiation complex and Met-tRNA. Elongation proceeds until a stop codon (UAA, UAG, or UGA) is encountered, triggering release of the completed polypeptide. Post-translational modifications — phosphorylation, glycosylation, ubiquitination — determine protein localization, activity, and stability.
Key Points
Genetic variants are classified by their molecular nature and predicted effect on gene function. Understanding variant type is the first step in variant interpretation: it determines which ACMG/AMP evidence criteria apply, whether nonsense-mediated decay is expected, and whether the variant is likely to cause loss of function or a gain-of-function effect. Not all variants of the same class have the same functional impact — context is everything.
Key Points
1. A synonymous variant (c.300G>A, p.Thr100=) is identified in a patient with an unexplained genetic condition. Which of the following is the most accurate statement about this variant?
Synonymous variants — those that do not change the encoded amino acid — are not inherently benign. Nucleotide changes within exons can disrupt exonic splicing enhancer (ESE) sequences, causing exon skipping even without altering the amino acid. Additionally, some synonymous changes affect mRNA stability or translation efficiency. If a synonymous variant in a disease gene is identified in an unsolved patient, RNA-level studies (RT-PCR, RNA sequencing) should be performed to rule out a splicing effect before applying a benign classification.
2. A boy with intellectual disability and a CGG repeat expansion of 650 repeats in the 5' UTR of FMR1 has absent FMRP on immunocytochemistry. His carrier mother (CGG repeat: 85) has a normal FMRP level. Which mechanism best explains the difference between full mutation and premutation alleles?
Full mutation alleles (>200 CGG repeats) in FMR1 trigger hypermethylation of the repeat and the adjacent promoter CpG island, silencing transcription and abolishing FMRP production. Premutation alleles (55-200 repeats) remain unmethylated and are actively transcribed — in fact, they produce elevated FMR1 mRNA levels. This creates a paradox: premutation carriers have normal FMRP but excess mRNA, which causes distinct diseases (FXTAS, FXPOI) through an RNA gain-of-function mechanism. The CGG repeat is in the 5' UTR, not the coding sequence, so neither allele alters FMRP's amino acid sequence.
3. Alternative splicing of a gene produces both a 'short' isoform (expressed in muscle) and a 'long' isoform (expressed in brain). A patient has a splice site variant that disrupts exon inclusion in the brain isoform only. This variant most likely causes:
Tissue-specific alternative splicing can restrict the functional impact of a variant to a particular organ. If the pathogenic splice site variant only affects the brain-specific isoform, then only brain tissue lacks the normal protein product — explaining why some variants in ubiquitously expressed genes cause pure neurological phenotypes. This principle is clinically important: a 'splice-disrupting' variant identified in a brain-expressed isoform may be missed if only the reference (muscle) transcript is analyzed.
4. A pathogenic variant is identified as c.247C>T (p.Arg83Ter) in exon 3 of a gene with 10 exons. Which statement best predicts the molecular consequence?
A nonsense (stop-gain) variant introducing a premature termination codon (PTC) in exon 3 of a 10-exon gene is predicted to trigger nonsense-mediated decay (NMD). NMD degrades mRNAs when the PTC is located >50–55 nucleotides upstream of the final exon-exon junction. Since there are 7 more exons downstream of the PTC, this variant clearly meets the NMD rule. NMD prevents production of truncated proteins, typically resulting in effective loss-of-function — the basis for applying the PVS1 criterion.
5. Which of the following variant descriptions is consistent with a frameshift mutation?
A frameshift variant results from an insertion or deletion that is not a multiple of 3 nucleotides, disrupting the reading frame downstream of the variant. c.412_413del is a 2-nucleotide deletion — not divisible by 3 — which shifts the reading frame (indicated by 'fs' in the protein notation) and creates a premature stop codon 12 amino acids downstream (Ter12). Missense (Arg138Trp), nonsense (Arg138Ter), and splice-site variants (c.412+1G>A) are distinct variant classes with different molecular consequences.