rs4728142 — IRF5 Promoter Indel Tag (CGGGG insertion)
Enhancer variant ~5 kb upstream of IRF5 that promotes ZBTB3-mediated chromatin looping to the IRF5 alternative promoter, driving overexpression of IRF5-short transcripts and increasing risk for lupus, Sjögren's syndrome, inflammatory bowel disease, and other autoimmune conditions; the leading causal candidate for the 5' IRF5 risk signal and a tag for the CGGGG promoter insertion that creates an extra Sp1 binding site
Details
- Gene
- IRF5
- Chromosome
- 7
- Risk allele
- A
- Consequence
- Regulatory
- Inheritance
- Additive
- Clinical
- Risk Factor
- Evidence
- Strong
- Chip coverage
- v3 v4 v5
Population Frequency
Ancestry Frequencies
Related SNPs
Category
Immune & GutSee your personal result for IRF5
Upload your DNA data to find out which genotype you carry and what it means for you.
Upload your DNA dataWorks with 23andMe, AncestryDNA, and other DNA test exports. Results in under 60 seconds.
IRF5 rs4728142 — The Promoter Enhancer Switch: Causal Driver of Interferon Overactivation
Interferon Regulatory Factor 5 (IRF5) is the molecular ignition switch for the type I interferon
response — the branch of innate immunity responsible for fighting viral infections by triggering
widespread pro-inflammatory signaling. When IRF5 is overactive, the same cascade that defends
against viruses begins attacking the body's own tissues, driving systemic lupus erythematosus
(SLE), Sjögren's syndrome, systemic sclerosis, and other autoimmune diseases. rs4728142 sits
approximately 5 kilobases upstream of IRF5's transcription start sites, in a region now established
as an active enhancer11 active enhancer
An enhancer is a regulatory DNA element that boosts transcription of a
nearby gene; it doesn't have to be immediately adjacent — enhancers can act from tens of thousands
of base pairs away via DNA looping for the gene.
The A allele of rs4728142 has recently been confirmed as the leading causal variant for the 5'
IRF5 risk signal — not merely a tag for some other functional change, but the variant that directly
rewires how the IRF5 gene is regulated.
The Mechanism
rs4728142 operates through a newly described chromatin-looping mechanism. The A allele alters
the binding affinity of ZBTB3 (Zinc Finger and BTB Domain-Containing Protein 3)22 ZBTB3 (Zinc Finger and BTB Domain-Containing Protein 3)
A transcription
factor that recognizes specific DNA sequences and can recruit chromatin-remodeling complexes;
ZBTB3 belongs to the BTB-POZ family involved in gene regulation and cell fate decisions.
When ZBTB3 binds the rs4728142-A allele, it orchestrates a short-range chromatin loop that
physically connects the upstream enhancer region to the alternative (short-transcript) promoter
of IRF5. The consequence: the IRF5 gene preferentially produces its shorter isoforms at the expense
of the full-length transcript, and total IRF5 expression is markedly elevated. In lymphoblastoid
cells from European ancestry individuals, the A allele is associated with a 1.5-fold increase in
IRF5 mRNA33 1.5-fold increase in
IRF5 mRNA
Bonferroni-corrected P=0.0004 in CEU cell lines; smaller but significant effects also
seen in Asian-ancestry cells (1.2-fold, P=0.006).
In SLE patient monocytes, CRISPR-based interference with the enhancer function at this locus
attenuated production of lupus-associated cytokines, confirming the causal chain from variant
to expression to inflammatory output.
The rs4728142-A allele is in strong linkage disequilibrium44 linkage disequilibrium
LD measures how often two nearby
variants are inherited together; r²>0.6 means the two variants co-occur on the same chromosome
much more often than chance would predict (r²>0.6)
with the CGGGG insertion-deletion polymorphism in the IRF5 promoter. The CGGGG indel sits
64 base pairs upstream of the exon 1A transcription start site; the 4-copy insertion allele (4R)
creates an additional binding site for the transcription factor Sp1, which was confirmed by
electrophoretic mobility shift assays. The 4R allele increases IRF5 mRNA expression in peripheral
blood mononuclear cells and amplifies interferon-inducible gene expression (MX1, IFITM1) in
Sjögren's patients. rs4728142 and the CGGGG indel together provide complementary molecular
explanations for why the 5' region of IRF5 elevates interferon tone: the CGGGG indel creates
an extra Sp1 site at the promoter; rs4728142 directs an enhancer loop to that promoter.
The Evidence
The breadth of disease associations for rs4728142 is exceptional even among major autoimmune risk loci. The variant has been independently associated with six autoimmune conditions across multiple populations.
In systemic lupus erythematosus, a large meta-analysis of IRF5 variants across nine
populations found rs4728142 among the strongest independent signals P=1.34×10⁻⁸,
OR=1.22 (95% CI 1.14–1.30)55 P=1.34×10⁻⁸,
OR=1.22 (95% CI 1.14–1.30)
Pooled analysis across European, Asian, and admixed populations
from multiple case-control studies, placing it
among the most replicated non-HLA genetic risk factors for lupus.
In primary Sjögren's syndrome, the CGGGG indel (in LD with rs4728142-A) was found to
fully explain the IRF5-Sjögren's association in two independent Norwegian and British cohorts,
yielding OR=2.00 (95% CI 1.5–2.7, P=6.6×10⁻⁶)66 OR=2.00 (95% CI 1.5–2.7, P=6.6×10⁻⁶)
Norheim et al. 2012, Annals of the Rheumatic
Diseases; OR of 2.00 for carrying the 4R CGGGG allele vs the 3R allele.
Carriers of the 4R allele had measurably higher IRF5 mRNA in both peripheral blood and salivary
gland epithelial cells — the tissue most directly relevant to Sjögren's pathology.
In inflammatory bowel disease, the CGGGG indel was the lead signal in an independent IBD
GWAS analysis, with OR=1.81 for overall IBD (P=1.9×10⁻⁵) and OR=2.42 specifically for
ulcerative colitis (P=5.3×10⁻⁸)77 OR=1.81 for overall IBD (P=1.9×10⁻⁵) and OR=2.42 specifically for
ulcerative colitis (P=5.3×10⁻⁸)
Eames et al. 2007, Human Molecular Genetics; replication
in a second cohort OR=1.59. The UC association
is among the strongest single-gene effects in IBD genetics.
In multiple sclerosis, rs4728142 showed independent association across Spanish, Swedish, and Finnish cohorts (combined P<0.001, OR≈1.14), with allele-specific protein binding on EMSA suggesting direct transcription factor recruitment at the variant site rather than passive LD.
In systemic sclerosis, the three-SNP IRF5 haplotype incorporating rs4728142 reached
OR=1.75 (P=9.04×10⁻²²)88 OR=1.75 (P=9.04×10⁻²²)
Dieudé et al. 2013, PLOS ONE; five-country European study including
Spain, Germany, Netherlands, Italy, and UK, exceeding
the effect of rs10488631 alone and confirming additive contributions from the 5' and 3' haplotype
blocks.
Practical Implications
The A allele at rs4728142 acts as an immune-system volume dial stuck at a higher setting. The overexpressed IRF5 protein amplifies both the initial interferon alarm response and subsequent pro-inflammatory cytokine cascades (TNF-α, IL-6, IL-12). For most carriers this translates to modest increases in autoimmune disease susceptibility across multiple organ systems — not a predetermined path to disease, but a meaningful shift in the immune threshold.
Because rs4728142 affects the IRF5 enhancer rather than the coding sequence, its effects are partly modifiable by context: the immune system's activation state, the presence of viral triggers (particularly Epstein-Barr virus, strongly implicated in SLE pathogenesis), and environmental modulators of interferon tone including vitamin D. The discovery that CRISPR interference with this enhancer can attenuate lupus-associated cytokine production in patient monocytes indicates that this regulatory region is being studied as a potential therapeutic target.
Population stratification is striking at this locus: European and Latino populations carry the A allele at approximately 44% frequency — nearly half the population — while East Asian populations carry it at only 13%. This mirrors the population-level differences in autoimmune disease prevalence and suggests that variation at IRF5 contributes to the well-documented ethnic disparities in lupus and related conditions.
Interactions
rs4728142 functions within the IRF5 three-block haplotype architecture alongside rs2004640 (exon 1B splice site) and rs10488631 (3' haplotype tag). The 5' block tagged by rs4728142 and the 3' block tagged by rs10488631 operate independently and additively: individuals carrying both the rs4728142-A risk allele and the rs10488631-C risk allele carry contributions from both haplotype blocks, with combined effects substantially exceeding those of either alone. The three-block SSc haplotype data illustrating OR=1.75 for the combined haplotype versus OR=1.63 for rs10488631 alone quantifies this additive contribution.
Downstream of IRF5, STAT4 (rs7574865) amplifies cellular responsiveness to the interferons that IRF5 drives. IRF5 variants increase interferon production; STAT4 variants increase cellular sensitivity to that interferon. Studies in primary Sjögren's syndrome and SLE demonstrate striking additive effects: with all five IRF5 + STAT4 risk alleles, OR for Sjögren's reaches 6.78. rs4728142 risk carriers with concurrent STAT4 rs7574865 risk alleles thus face a compounded immune activation burden — elevated production and elevated responsiveness to what is produced.
The autoimmune pleiotropic nature of rs4728142 — associated with lupus, Sjögren's, IBD, MS, SSc, and RA — means the variant likely exerts a shared molecular mechanism (IRF5 overactivation) that manifests as different diseases depending on which other genetic and environmental factors are present. Individuals carrying this risk allele who develop symptoms in one autoimmune domain should be aware of overlapping risk across organ systems.
Genotype Interpretations
What each possible genotype means for this variant:
No rs4728142 risk allele; IRF5 enhancer operates at standard activity without ZBTB3-driven promoter looping
You carry two copies of the G allele at rs4728142, meaning you do not carry the A allele that drives ZBTB3-mediated chromatin looping to the IRF5 alternative promoter. Your IRF5 gene expression at this enhancer locus is at baseline, without the upregulation associated with the A allele. This is the most common genotype globally: approximately 33% of people carry GG, though this varies substantially by ancestry — it is more common in East Asian populations where the A allele frequency is only 13%. Among Europeans, approximately 31% carry this genotype. You do not carry the tag for the CGGGG promoter 4R insertion at this locus, meaning your IRF5 promoter lacks the additional Sp1 binding site associated with elevated interferon output.
One copy of the IRF5 risk allele at this enhancer; modestly elevated IRF5 expression and increased autoimmune susceptibility
As a heterozygous AG carrier, you have one standard-regulation IRF5 copy (G allele) and one copy regulated by the risk enhancer architecture (A allele). The A allele drives ZBTB3 binding, chromatin looping to the alternative promoter, and preferential transcription of IRF5-short isoforms. Expression studies in European lymphoblastoid cells showed a 1.5-fold increase in IRF5 mRNA associated with the A allele — as a heterozygote, your effect is intermediate. The A allele also tags the CGGGG 4R insertion in the IRF5 promoter (r²>0.6), which independently creates an additional Sp1 binding site at the exon 1A promoter and has been shown to elevate IRF5 mRNA in patient PBMCs.
The clinical consequence of intermediate IRF5 upregulation depends substantially on which other IRF5 haplotype blocks you carry (rs2004640, rs10488631) and whether you also carry risk alleles in the STAT4 gene (rs7574865), which amplifies downstream responsiveness to the interferons IRF5 drives.
Two copies of the IRF5 enhancer risk allele; elevated IRF5 expression with substantially increased susceptibility to lupus, Sjögren's syndrome, ulcerative colitis, and related autoimmune conditions
The AA genotype at rs4728142 places both IRF5 gene copies under the control of the ZBTB3-mediated enhancer-looping mechanism. ZBTB3 preferentially binds the A allele and directs a chromatin loop from the upstream enhancer to the IRF5 alternative (short-transcript) promoter — this is now established as the causal mechanism at this locus based on integrative functional genomics and CRISPR validation in SLE patient monocytes. The result is overactivation of IRF5, excess pro-inflammatory cytokine production, and a lowered threshold for type I interferon responses.
The disease associations of this variant span an unusually wide range of autoimmune conditions. Per-allele odds ratios for lupus (OR≈1.22), Sjögren's syndrome (4R CGGGG OR=2.00 for carriers vs non-carriers), and ulcerative colitis (OR≈2.42 for the 4R allele) represent among the strongest non-HLA genetic effects in autoimmune disease. For AA homozygotes, the additive or multiplicative effects of two risk alleles place susceptibility at the higher end of the IRF5-associated risk spectrum.
The breadth of disease associations (lupus, Sjögren's, ulcerative colitis, systemic sclerosis, multiple sclerosis, ankylosing spondylitis, psoriasis) means that AA individuals face a broadly elevated IRF5-mediated immune risk — the specific disease that develops depends on other genetic factors, environmental exposures, and which tissues are first targeted. This also means that if one autoimmune condition is diagnosed, monitoring for others is warranted.
Key References
Integrative functional genomics study identifying rs4728142-A as the causal SLE variant at the IRF5 locus: affects ZBTB3 binding, chromatin status, and IRF5 expression; CRISPR interference attenuated cytokine production in lupus patient monocytes
rs4728142-A allele drives ZBTB3-mediated chromatin looping between an upstream enhancer and the IRF5 alternative promoter, promoting IRF5-short transcript overexpression and M1 macrophage polarization
Comprehensive IRF5 haplotype study identifying the CGGGG promoter indel (in LD with rs4728142, r²>0.6) as a novel 5 bp risk polymorphism for SLE — the 4× allele creates an additional Sp1 binding site and increases IRF5 mRNA expression in PBMCs of SLE patients
CGGGG 4R allele (in LD with rs4728142-A) associated with OR=2.00 (95% CI 1.5–2.7, P=6.6×10⁻⁶) for primary Sjögren's syndrome; 4R carriers had higher IRF5 mRNA in PBMCs and salivary gland epithelial cells
CGGGG indel associated with IBD (OR=1.81 overall; OR=2.42 for ulcerative colitis); allele-specific Sp1 protein binding confirmed by EMSA; rs4728142 in strong LD with the indel
rs4728142 associated with multiple sclerosis in three European populations (combined P<0.001); allele-specific protein binding demonstrated by EMSA; r²>0.6 with CGGGG indel
rs4728142-A (labeled SNP 1) associates with 1.5-fold increased IRF5 mRNA expression in European lymphoblastoid cells (P=0.0004, Bonferroni-corrected), confirming functional effect on transcription
Three-SNP IRF5 haplotype including rs4728142 associated with systemic sclerosis at OR=1.75 (P=9.04×10⁻²²) — stronger than rs10488631 alone, demonstrating additive effects across IRF5 haplotype blocks