A focused review of molecular genetics for child neurology — emphasizing trinucleotide repeats, tissue-specific splicing, and the molecular consequences of variant types on gene function.
Tags: Basic Genetics
The human genome contains approximately 3.2 billion base pairs organized into 46 chromosomes, but only ~1.5% encodes protein. The remaining ~98.5% includes regulatory elements, non-coding RNA genes (20,000–25,000), introns, and repetitive sequences — much of which is increasingly recognized as functionally relevant. The genetic code is a degenerate triplet code: 64 codons specify 20 amino acids plus 3 stop signals, with degeneracy partially buffering synonymous substitutions.
Key Points
A three-tier fidelity system (base selection, proofreading, mismatch repair) reduces the replication error rate to ~1 in 10⁹–10¹⁰ per base per division. Despite this, the germline accumulates ~60–70 de novo SNVs per generation (~1–2 per genome per cell division), providing the substrate for both evolution and de novo genetic disease. Trinucleotide repeat expansions — a major class of neurological disease — arise from replication slippage at tandem repeat sequences, with expansion size often increasing across generations (anticipation).
Key Points
Splicing — the removal of introns and ligation of exons — is clinically the most important step in mRNA processing. Splicing is directed by conserved consensus sequences at the 5' splice donor (GT) and 3' splice acceptor (AG) sites flanking each intron. Approximately 10–15% of disease-causing variants affect splicing, making splice prediction a critical skill in variant interpretation.
Key Points
Mature mRNA is exported to the cytoplasm where ribosomes translate it codon-by-codon into a polypeptide chain. The AUG start codon is recognized by the 43S pre-initiation complex and Met-tRNA. Elongation proceeds until a stop codon (UAA, UAG, or UGA) is encountered, triggering release of the completed polypeptide. Post-translational modifications — phosphorylation, glycosylation, ubiquitination — determine protein localization, activity, and stability.
Key Points
Genetic variants are classified by their molecular nature and predicted effect on gene function. Understanding variant type is the first step in variant interpretation: it determines which ACMG/AMP evidence criteria apply, whether nonsense-mediated decay is expected, and whether the variant is likely to cause loss of function or a gain-of-function effect. Not all variants of the same class have the same functional impact — context is everything.
Key Points
1. A patient with epilepsy has a variant at the canonical splice donor site of exon 5 in SCN1A (c.803+1G>A). SpliceAI predicts a high probability of exon 5 skipping. Which functional study would provide definitive evidence of the splicing effect?
RT-PCR (reverse transcription PCR) of patient mRNA is the definitive functional study for splice-site variants. It directly demonstrates the effect on the mature transcript — exon skipping, intron retention, or cryptic splice site activation. Gel electrophoresis reveals abnormal transcript sizes, and sequencing confirms the aberrant junction. Genomic DNA sequencing identifies the variant but cannot show its effect on splicing. In silico tools like SpliceAI predict the effect but do not constitute definitive evidence.
2. A 65-year-old man presents with progressive tremor, ataxia, and cognitive decline. His grandson was diagnosed with Fragile X syndrome. Genetic testing of the grandfather reveals 95 CGG repeats in FMR1. This clinical presentation is best explained by:
FMR1 premutation carriers (55-200 CGG repeats) produce elevated FMR1 mRNA that is thought to be toxic through an RNA gain-of-function mechanism. In older male carriers, this can cause FXTAS — a progressive neurodegenerative disorder characterized by intention tremor, cerebellar ataxia, and cognitive decline. Unlike full mutation Fragile X syndrome (>200 repeats, gene silencing, absent FMRP), premutation alleles are actively transcribed and produce normal FMRP but excess toxic mRNA.
3. A child with a neurodevelopmental disorder has a variant in a ubiquitously expressed gene, yet the phenotype is restricted to the central nervous system. Which molecular mechanism best explains this tissue-specific effect?
Alternative splicing generates tissue-specific isoforms from the same gene. If a variant disrupts an exon that is included only in the brain-specific isoform, other tissues with different splicing patterns are unaffected. This is a well-established mechanism in neurogenetics — for example, variants in brain-specific exons of SCN1A and DYNC1H1 cause purely neurological phenotypes despite the genes being expressed in multiple tissues. Variant interpretation must consider the relevant tissue-specific transcript.
4. A nonsense variant (p.Gln450Ter) is identified in the last exon of a 12-exon gene. Unlike most nonsense variants, this one is predicted to escape nonsense-mediated decay (NMD). Why?
Nonsense-mediated decay (NMD) requires the premature termination codon (PTC) to be more than 50-55 nucleotides upstream of the final exon-exon junction. Variants in the last exon — or within the last ~55 nt of the penultimate exon — escape NMD because no downstream exon-exon junction exists to trigger the surveillance pathway. These NMD-escaping variants produce stable truncated proteins, which may exert dominant-negative effects rather than causing simple loss-of-function through haploinsufficiency.
5. A genetic report notes that a recurrent pathogenic variant in a disease gene falls at a CpG dinucleotide, which is flagged as a mutation hotspot. A trainee asks what makes CpG sites prone to mutation. Which molecular process accounts for their elevated mutation rate?
CpG dinucleotides have a mutation rate approximately 10-fold higher than the genome average because cytosine at CpG sites is frequently methylated to 5-methylcytosine, which spontaneously deaminates to thymine. The resulting G:T mismatch may not be fully repaired before the next round of replication, permanently converting the CpG to TpG. This mechanism makes CpG sites the most common location for recurrent pathogenic missense and nonsense variants in human disease genes.