http://www.nature.com/cgi-taf/DynaPage.taf?file=/ni/journal/v2/n9/special_full/ni0901-802_r.html
Volume 2 No 9 - September 2001
Special Focus: Autoimmunity
Nature Immunology 2, 802 - 809 (2001)
Amy Wanstrat & Edward Wakeland
Center for Immunology, University
of Texas Southwestern Medical Center, Department of Immunology, 5323 Harry
Hines Blvd., Dallas, TX 75390, USA.
Correspondence should be addresed
to E Wakeland http://www.nature.com/email_response/email.taf?address=edward.wakeland%40utsouthwestern.edu
Susceptibility to complex autoimmune diseases (AIDs) is a multigenic phenotype affected by a variety of genetic and environmental or stochastic factors. After over a decade of linkage analyses, the identification of non-major histocompatibility complex (non-MHC) susceptibility alleles has proved to be difficult, predominantly because of extensive genetic heterogeneity and possible epistatic interactions among the multiple genes required for disease development. Despite these difficulties, progress has been made in elucidating the genetic mechanisms that influence the inheritance of susceptibility, and the pace of gene discovery is accelerating. An intriguing new finding has been the colocalization of several AID susceptibility genes in both rodent models and human linkage studies. This may indicate that several susceptibility alleles affect multiple AIDs, or alternatively that genomic organization has resulted in the clustering of many immune system genes. The completion of the human genome sequence, coupled with the imminent completion of the mouse genome, should yield key information that will dramatically enhance the rate of gene discovery in complex conditions such as AID susceptibility.
Complex autoimmune diseases (AIDs) are chronic conditions initiated by a loss of immunologic tolerance to self-antigens. Clinical disease generally manifests as a result of damage induced in one or more organ systems via the inappropriate activation of immune-mediated inflammation. Collectively, AID is estimated to affect 4–5% of the population, females generally having a higher disease incidence than males (1). Six of the most common AIDs are rheumatoid arthritis (RA), Graves' disease, insulin-dependent diabetes mellitus or type I (autoimmune) diabetes (IMD) pernicious anemia, systemic lupus erythematosus (SLE) and multiple sclerosis (MS); collectively they represent about 50% of all AIDs. Roughly 1 in 30 individuals is afflicted with some type of autoimmune disease, thus making autoimmunity a major health problem in modern medicine.
Although the pathogenic mechanisms responsible for the initiation of autoimmunity remain poorly understood, a variety of classic studies have clearly demonstrated that genetic predisposition is a major factor in disease susceptibility. The most potent genetic influence on susceptibility to autoimmunity is the major histocompatibility complex (MHC), which has been known for over two decades to affect susceptibility to a variety of AIDs. We will focus here on non-MHC susceptibility genes; for a review of MHC genes and autoimmunity see (2).
Information obtained via linkage studies of AIDs in both humans and rodents has begun to elucidate the role of genetics in disease predisposition. Here, we will provide an overview of the genetic mechanisms affecting the inheritance of susceptibility and discuss the progress that has been made toward identifying non-MHC genes and genetic pathways that contribute to AID susceptibility. The identification of these non-MHC susceptibility genes is expected to provide insights into the mechanisms that mediate disease pathogenesis and possibly identify new targets for the development of therapeutic strategies.
Genetic predisposition to AIDs
The powerful impact of genetic predisposition
on susceptibility to autoimmunity was first identified by the analysis
of disease concordance rates in monozygotic twins. The monozygotic disease
concordance rate ranges from about 15% for RA (3) to a fairly robust 57%
for SLE (4) (Table 1). Comparisons of these high concordance rates with
disease incidence in the general population predict that genetic predisposition
is the dominant factor in AID susceptibility. The dramatic decrease in
the concordance rate of siblings compared with that of monozygotic twins
supports the presence of multiple genes contributing to the genetic predisposition.
Finally, the calculation of ls
for these diseases, which are the ratio of the risk of disease recurrence
among the siblings of affected individuals to disease incidence in the
general population, also supports a potent role for genetic predisposition
in disease susceptibility. Table 1 shows ls
for various AIDs, which range from about 10 for RA to as high as 40 for
SLE (4).
Table 1. Relative
risks in autoimmune disease The powerful influence of genetic
predisposition on AID susceptibility was initially interpreted as indicating
that genome-wide linkage analysis would allow the identification of many
potent non-MHC AID susceptibility genes. This expectation fostered the
development of international coalitions focused on collecting large cohorts
of multiplex families afflicted with specific AID and utilizing state-of-the-art
technologies to scan their genomes for the locations of susceptibility
genes (5-15). After over a decade of such analyses, the inheritance of
AID susceptibility has proved to be highly complex and not readily amenable
to genetic analysis in heterogeneous populations.
The consistent observation throughout
these genome scans of AIDs has been the detection of many genomic segments
exhibiting a weak statistical association with disease susceptibility.
Individual genomic intervals are in general associated with susceptibility
to AID, with lod scores ranging from 2.0 to occasionally approaching 5.0
(5-15). For comparison, a fully penetrant Mendelian disease locus would
be detected with a lod score approaching 30 by the analysis of 100 affected
sibling pairs, which would be a small sample size for most characterizations
of AID susceptibility genetics. In addition, mapping studies by separate
investigators working on the same AID frequently do not detect an association
to the same genomic regions, thus raising an issue of reproducibility for
many of the associations reported. The prevailing situation in most AIDs
is therefore that susceptibility is only modestly associated with any specific
non-MHC locus, despite the potent role for genetic predisposition in disease
susceptibility.
The resolution of this apparent paradox
lies in the complexity of AID genetics. The inheritance of AID susceptibility
is multifactorial, which means that susceptibility arises from the combined
impact of multiple contributing susceptibility genes, each potentially
interacting with a poorly defined array of environmental and/or stochastic
factors. In such a complex system, sample size rapidly limits the feasibility
of obtaining statistically significant associations. In addition, the detection
of these loci is complicated by two factors that commonly influence the
inheritance of multifactorial traits: genetic heterogeneity and epistasis.
Genetic heterogeneity
Genetic heterogeneity refers to the
presence of multiple combinations of genes within the genome that are capable
of causing a similar or identical disease phenotype. Genetic heterogeneity
is a common feature of many genetic systems in both humans and animal models.
It simply reflects the fact that many genes participate in the development
of complex phenotypes and that different combinations of genetic abnormalities
can lead to a similar outcome.
Indications of a significant degree
of genetic heterogeneity in AID susceptibility are apparent from linkage
studies in both human populations and animal models. Classic association
studies for candidate susceptibility genes in both IMD and SLE have, for
example, frequently observed significant variations between ethnic groups
in the disease association of specific alleles and disease phenotypes (16,
17). Linkage analyses have detected an increase in statistical associations
with specific genomic intervals only in specific ethnic groups (6, 18).
These results are readily explained by variations in the genetic basis
for predisposition to AID between ethnic groups, although definitive results
must await the identification and analysis of specific susceptibility alleles.
Comparisons of the genomic locations
of susceptibility genes in separate mouse models of IMD, SLE, experimental
allergic encephalomyelitis (EAE, an animal model of MS) and collagen-induced
RA also clearly indicate that the genomic locations of many susceptibility
alleles vary between models. Although attention has generally focused on
the colocalization of susceptibility genes among AID-prone strains (see
below), most of the genomic segments detected are not shared between different
animal models, even with the same AID (19-26). A recent linkage analysis
of the lupus-prone BXSB strain found, for example, that only two of five
intervals overlapped with intervals detected in linkage studies performed
in other strain combinations. This indicated that lupus susceptibility
was being mediated in BXSB by loci that were not involved in disease susceptibility
in NZM2410, NZB/NZW or MRL/lpr strains (26). These results are representative
of findings in other AID models and indicate that susceptibility is mediated
predominantly by a heterogeneous array of genes in murine models of AID.
Epistatic interactions
Epistasis is classically defined
as a genetic interaction in which the genotype at one locus affects the
phenotypic expression of the genotype at another locus. Evidence consistent
with epistatic interactions among susceptibility alleles has been reported
in both animal models and human linkage studies of AID (27-30). Synergism
between susceptibility alleles is, for example, clearly seen in a recent
analysis of the congenic strains B6.Sle1, B6.yaa, and B6.Sle1/yaa B6.Sle1
and B6.yaa are B6-congenic mice that carry the Sle1 and yaa susceptibility
genes for systemic autoimmunity, respectively. Each strain spontaneously
produces nonpathogenic autoantibodies to nuclear antigens but fails to
develop severe autoimmunity, having a normal lifespan and developing little
or no glomerulonephritis. However, when these two susceptibility alleles
are combined in the B6.Sle1/yaa bicongenic strain, a severe systemic autoimmunity
develops, which culminates in fatal glomerulonephritis with an incidence
of 70% by 9 months of age (30). This is an example of epistasis between
two susceptibility alleles leading to a greater increase in disease severity
than would be predicted by simply adding together their individual phenotypes.
A second type of epistasis, in which
the autoimmune phenotypes of susceptibility alleles are suppressed by epistatic
modifiers, has also been detected in an animal model of SLE (31). Suppressive
modifiers were detected via the analysis of the disease phenotype mediated
by Sle1, Sle2 and Sle3 when introgressed onto different genetic backgrounds.
These three genes (or gene clusters, see below) are the primary genes responsible
for susceptibility to lupus nephritis in the NZM2410 lupus-prone mouse
(32). When they are introgressed from NZM2410 onto the B6 background, the
resultant B6.Sle1/Sle2/Sle3 triple congenic strain develops fatal lupus
nephritis with a penetrance approaching 100% in both genders at 9 months
of age. Although all three of these susceptibility alleles are derived
from the NZW genome, NZW exhibits very benign autoimmune phenotypes that
develop only in females older than 12 months (33). Thus, the expression
of Sle1, Sle2 and Sle3 is significantly suppressed in NZW mice.
A linkage analysis designed to detect
these suppressive modifiers in the NZW genome found four separate loci
that accounted for the suppression of lupus susceptibility in NZW mice
(31). These results indicate that the disease mediated by susceptibility
genes can be fully suppressed by other "modifying" genes in the genome,
resulting in the development of a relatively normal immune phenotype, despite
the presence of highly potent autoimmune disease alleles. The presence
of similar suppressive modifiers in AID in humans has not been demonstrated,
although it is reasonable to predict that similar genetic interactions
will affect disease predisposition in humans.
A review of the extensive linkage
studies that have accumulated in animal models of AID over the past 10
years suggests that epistatic interactions may be a common element in the
complex inheritance of disease susceptibility. Some susceptibility alleles
are, for example, inherited from the genome of the "normal" parental strain
rather than the autoimmune-prone strain, suggesting that normal strains
often carry alleles that enhance disease susceptibility when integrated
into a permissive genome (32, 34, 35). Similarly, several targeted gene
disruptions have been reported to mediate autoimmune phenotypes, but only
when crossed into a specific inbred strain or carried on a specific, mixed
background (36-38). Earlier analogous findings demonstrated that spontaneous
mutations of Fas and Fas ligand lost most or all of their autoimmune phenotypes
when crossed onto specific genetic backgrounds (39, 40). These phenotypic
variations likely represent epistatic interactions between susceptibility
alleles and other unknown loci present in the genomes of various inbred
mouse strains, although proof of this will require further linkage studies
and possibly congenic strain construction.
Thus, epistasis appears to significantly
affect the inheritance of susceptibility to AID. The data generated in
animal models suggest that the potency of many susceptibility alleles is
strongly dependent upon genomic context, as a result of complex interactions
with other susceptibility alleles and suppressive modifiers. Although the
extent of epistasis in human AID remains to be determined, it is reasonable
to predict that epistatic interactions are a consequence of the many functional
polymorphisms influencing immune recognition and responsiveness, and will
therefore be a component of AID susceptibility in most species.
Environment, stochastic events
and disease penetrance
Although genetic predisposition is
the major factor dictating AID susceptibility, various data indicate that
environmental or stochastic factors are also involved (41-43). A role for
nongenetic factors in the initiation of disease was classically predicted
by the incomplete concordance of disease expression among monozygotic twin
pairs. Although concordance can be high in AID, it does not approach 100%
(Table 1). Thus, genetically predisposed individuals may or may not develop
autoimmune disease, contingent upon other elements affecting their health.
A similar interpretation can be drawn from the incomplete penetrance of
spontaneous disease in inbred strains prone to the development of diabetes
or lupus (32, 44-46).
The strongest data supporting a role
for environmental factors come from epidemiologic studies of MS (47-49).
Interestingly, the incidence of MS is distributed in a nonrandom fashion
geographically, individuals at higher latitudes being at greater risk.
In addition, some studies have demonstrated an higher disease incidence
in certain locations, suggesting that disease risk is increased significantly
by local environment. The biomedical basis for these observations is unknown,
although some data have implicated microbial infections. Viral infections
have also been implicated as environmental triggers in several other AID.
Susceptibility to SLE has been, for example, associated with positive seroconversion
for Epstein-Barr virus (EBV) antibodies in a retrospective study of a large
cohort of patients and matched controls (50, 51). The strong impact of
EBV infection on B cell activation makes this association intriguing, although
more direct evidence will be required to establish a clear link between
EBV and SLE.
Spontaneous disease incidence in
AID-prone inbred strains would appear to be an excellent model in which
to test the potential role of environmental triggers in the initiation
of AID. In general, inbred strains of rodents that are susceptible to spontaneous
AID, such as nonobese diabetic (NOD), MRL/lpr, BXSB and NZM2410, all exhibit
an incomplete penetrance of disease. Because the environment in which mice
are housed is highly controlled, including environments that are specific
pathogen free (SPF), this argues that environmental variables are not necessary
for the observed incomplete disease penetrance. The incidence of diabetes
dramatically increases in colonies of NOD mice housed under SPF conditions
rather than conventional caging, indicating that risk increases in this
inbred strain when microbial infection is limited (52).
The incomplete penetrance of disease
in AID-prone animal models has led to the hypothesis that disease can be
initiated in highly susceptible animals by stochastic events that occur
during the normal functioning of the immune system (53). In this regard,
many multifactorial phenotypes exhibit incomplete penetrance that is similar
in nature to that of AID in spontaneous models, this being commonly attributed
to chance or stochastic events that occur during development or normal
physiology (54). In the context of autoimmunity, a reasonable prediction
is that genetic predisposition will lead to the creation of an immune system
that contains significant autoimmune potential, but that disease is initiated
only when specific T and/or B cells interact with a specific self-antigen
being presented appropriately in a stimulatory microenvironment. The probability
of this "stochastic event" occurring will then dictate the relative risk
for genetically identical individuals in a controlled environment.
Models of inheritance of disease
susceptibility
The inheritance of multifactorial
traits such as AID susceptibility is a complex process. Multifactorial
inheritance was first described and modeled as a "threshold liability"
in an analysis of polydactyly in guinea pigs (55). It was proposed that
the penetrance of polygenic, qualitative phenotypes would increase in relation
to the number of susceptibility genes present in the genome of an individual.
A hypothetical model of the inheritance
of AID can be proposed (Fig. 1). The x axis of this graph defines increasing
disease liability, the y axis represents the "threshold", which delineates
the point at which individuals will develop disease. Genetic predisposition
places individuals at some point along the x axis, based on the degree
of susceptibility dictated by their genomes. Environmental and stochastic
events will then increase or decrease their liability, depending on their
life experience. These environmental factors are arbitrarily depicted as
a normal distribution of liability around the mean location dictated by
genetic predisposition. The exact distribution could potentially take any
shape.
Figure 1. Threshold
liabilities in autoimmune disease.
In this model, only
individuals located to the right of the disease threshold line will develop
disease. The x axis represents increasing liability to disease, individuals
being located on the x axis based on the degree of their predisposition
to disease. An incremental increase in the number of susceptibility alleles
progressively increases liability to disease, resulting in movement toward
the disease threshold at the right on the x axis. The disease liability
introduced by environmental and stochastic effects is represented by the
normal distribution curve around the location of individuals with specific
degrees of genetic predisposition for disease.
The inheritance of susceptibility would then be determined by the cumulative content of disease susceptibility that an individual inherits. The relationship presented in Fig. 1 shows a simplified example of additive inheritance, in which each additional susceptibility allele that an individual inherits results in an incremental movement toward the disease threshold on the liability axis. The genomes of disease-prone inbred mouse strains such as NOD or NZM2410 contain enough susceptibility genes that their genetic predisposition places them beyond the disease threshold (Fig. 1). As a result, only a small fraction of individual mice in these strains will fail to develop disease, depending on the stochastic processes that occur during the normal functioning of their immune systems. Consistent with this model, AID-prone inbred strains contain many susceptibility genes, and disease penetrance decreases when the number of susceptibility genes is decreased via congenic strain construction (44).
The inheritance of disease susceptibility (Fig. 1) is presented in an extremely simplistic additive fashion; each additional susceptibility allele incrementally moves an individual an equivalent distance further toward the disease threshold. In reality, the process of inheritance is much more complex. As discussed above, epistatic interactions would be predicted to modify the incremental movement of individuals along the axis in a complex fashion that would not be additive. Thus, the position of an individual along the liability axis would be dependent upon the interactive consequences of all the susceptibility and suppressive alleles present in his or her genome.
In this regard, attempts to model the inheritance of AID susceptibility have often focused on distinguishing "additive" inheritance from "multiplicative" models (56). Linkage analyses in test crosses of AID-prone inbred strains have consistently found that relative risk increases in proportion to the number of active susceptibility genes present in the genome (29, 32, 57, 58), but the goodness of fit for additive versus multiplicative models has not been established (59). Given the extensive genetic heterogeneity observed in AID inheritance, it is reasonable to predict that both models will be in some circumstances correct.
Future prospects for linkage analysis
Susceptibility to AID is among the most complex genetic systems currently being investigated. Several strategies to cope with difficulties resulting from genetic heterogeneity and epistasis are currently being employed by investigators. The subdivision of patient populations into more homogeneous groups based on an analysis of closely associated component phenotypes (such as the presence of specific autoantibodies or clinical features within a specific disease) may limit some elements of genetic heterogeneity. An alternative approach has been to focus on disease inheritance in isolated human subpopulations with limited ethnic heterogeneity, a strategy that was successfully employed for the identification of AIRE as the gene responsible for autoimmune polyglandular syndrome (type 1) (60, 61). Although there has been some success in the more complex analysis of multifactorial AID susceptibility in isolated Scandinavian populations18, the value of this approach for the analysis of AID is controversial (62). Overall, a key element in the ultimate success of linkage analysis in AID will be the continued expansion of well characterized AID patient populations and families.
The recent publication of an almost complete nucleotide sequence of the human genome has provided detailed physical and molecular maps of the majority of human linkage groups (63). In addition, a growing database of single-nucleotide polymorphisms (SNPs), together with an improving technology for their detection, will greatly enhance the statistical power of linkage analysis (64, 65). It is likely that a database identifying several SNPs for every gene in the human genome will be available within the next 2 or 3 years. Armed with these resources, many scientists are hoping to advance our knowledge of complex polygenic disease etiology by quickly identifying the genes involved in such traits.
Although the analysis of SNPs will undoubtedly provide new insights, the accuracy and sensitivity of association studies will nonetheless continue to be impeded by epistasis, incomplete penetrance and genetic and phenotypic heterogeneity. An additional factor affecting the accuracy of SNP analysis will be the poorly defined nature of genetic disequilibrium at the genomic level in human populations. That is, for strong linkage associations to be achieved, a SNP must be so tightly associated with a disease-causing allele that it can be used as a marker to identify the disease allele among individuals in outbred populations. The degree of linkage disequilibrium in populations for closely linked markers is now a major research focus in genetics. Current data suggest that the distance of tight association appears to be highly variable and that disequilibrium may persist over regions of less than 1 kb in certain parts of the genome. As a result, thousands of SNPs are likely to be required to scan the entire genome, which will lead to a significant increase in the cost of genetic analysis and diminish the statistical power for the detection of associations. Despite these difficulties and technical issues, the analysis of SNPs should dramatically improve the accuracy and power of linkage analysis in human populations.
Identifying disease genes
The ultimate goal of linkage analysis is the identification of genes and genetic pathways that mediate disease susceptibility. Once all the genes in the human genome have been identified and encoded in a database, the process of susceptibility gene identification will simplify to the determination of a precise genomic location. For monogenic diseases, linkage analysis often defines the genomic location of disease alleles with sufficient precision to limit the number of positional candidate genes to about a dozen. As discussed above, however, linkage analysis of AID has not yielded robust statistical correlations with any specific loci, so the precision with which susceptibility alleles are positioned is extremely poor, resulting in genomic segments that contain hundreds or even thousands of positional candidate genes. This has complicated the final stage of disease gene identification and has been the major deterrent to the identification of human susceptibility alleles. Transmission disequilibrium testing (TDT) has been the best strategy for narrowing the interval size and potentially identifying disease genes in human genetics (66). This strategy has been used to identify and exclude several strong candidates in AID (67-69).
Investigators working on human AIDs have accumulated evidence supporting the identification of a handful of susceptibility alleles. The first non-MHC susceptibility allele to be identified was a variable number tandem repeat polymorphism in the promoter of the insulin gene that affects susceptibility to diabetes (70, 71). Several studies have clearly delineated multiple functional consequences of this polymorphism on the expression of both insulin and a closely linked insulin-like growth factor gene (72-74).
The exact mechanism by which these expression variations mediate susceptibility to IMD remains to be elucidated, but variations in the degree of expression in the thymus have been implicated (73, 75). Deficiencies in the complement component C1q are associated with SLE in a unique subset of families (76). The degree of penetrance of SLE in the absence of C1q is in excess of 90%, suggesting that a deficiency of this complement component is sufficient to mediate disease in a monogenic fashion. The precise molecular mechanism remains to be elucidated, although a role for C1q in early B cell development and the clearance of apoptotic cell bodies have been postulated (77, 78). In this regard, several different complement deficiencies appear to potentiate SLE, strongly implicating this system in at least some pathogenic pathways to systemic autoimmunity (79, 80).
A frame-shift variant and two missense
mutations in NOD2 have more recently been associated with susceptibility
to Crohn's disease by TDT and case-control analysis (81, 82). NOD2 is located
in the pericentromeric region of chromosome 16, a site previously shown
in multiple linkage studies to contain a Crohn's susceptibility gene. The
discovery of a chain-truncating frame-shift mutation in the NOD2 gene,
coupled with the knowledge that NOD2 activates nuclear factor-Bk
(NF-Bk) and confers
responsiveness to bacterial lipopolysaccharides, a pathway long thought
to be affected in Crohn's disease, strongly supports this inactive allele
of NOD2 as a susceptibility allele in Crohn's disease. Finally, AIRE was
recently identified by linkage and positional candidate analysis of affected
Scandinavian populations60, 61.
Linkage studies in animal models
have generally yielded statistical associations similar to those detected
in human family studies; as a result, genomic locations have been equally
imprecise. The most promising strategy for gene identification in animal
models has been the classic strategy of congenic dissection, an approach
pioneered by George Snell over 50 years ago (83). Congenic dissection separates
the multiple genes mediating a polygenic autoimmune disease into a collection
of congenic strains, each carrying one susceptibility gene on a single
genetic background. Subsequent analysis of the component phenotypes expressed
in each of the resultant congenic strains potentially allows a detailed
characterization of the disease component contributed by each susceptibility
gene in the original mouse strain and provides a phenotype to utilize for
fine-mapping. Several investigators are following this strategy to characterize
individual genes in animal models of IMD, SLE, MS and RA (28, 30, 84-89).
The identification of individual
susceptibility genes is still ongoing, but several susceptibility alleles
have been fine-mapped into extremely small congenic intervals, some of
which contain strong positional candidates (84, 86, 90). The candidacy
of Il2 as Idd3 is the most extensively investigated in positional candidate
analysis to date (91, 92). The Il2 allele in NOD mice differs from that
in C57BL/10 mice by a complex mutation that results in a change in glycosylation
that has been postulated to affect the in vivo function of interleukin
2 (IL-2) (92). Although these structural changes in IL-2 are intriguing,
conclusive evidence of a functional impact on the immune system by this
Il2 polymorphism has not yet been reported. Congenic analysis has identified
b2-microglobulin
as a candidate for the Idd13 locus on chromosome 2 (90). A functional polymorphism
between the alleles in NOR and NOD mice potentially affects MHC class I
function and has been postulated to influence susceptibility to diabetes.
Finally, Sle1 on chromosome 1 and Idd9 on chromosome 4 have each been dissected
into multiple susceptibility loci via the creation of congenic recombinant
chromosomes; several interesting positional candidates are in the process
of being characterized (86, 93, 94).
Are susceptibility genes shared
between AIDs?
Detailed analyses of linkage data
in humans and rodent models have provided support for the hypothesis that
common genes or genetic pathways may contribute to immune dysregulation
and susceptibility to multiple AID. The basic data supporting this hypothesis
are the observed colocalization of susceptibility loci in genome-wide scans
in both mouse and human studies (9, 95-97). Twelve separate non-MHC autoimmune
susceptibility clusters have been identified in independent human and mouse
genome scans (Fig. 2). These findings are intriguing and provide support
for the idea that susceptibility to multiple AID may have some common susceptibility
alleles or pathways. One caveat in interpretation is that the precision
of this analysis is dependent upon the accuracy of allele placement by
linkage studies, which is quite imprecise with multifactorial traits such
as susceptibility to AID.
Figure 2. Chromosomal
susceptibility regions associated with autoimmune disorders detected in
human and mouse genome scans.
Susceptibility regions
are shown at their approximate positions. Quantitative trait loci (QTLs)
for mouse models of SLE are designated Sle, Lbw, Nba (NZB/NZW), Ldrm (MRL/lpr)
and Bxs (BXSB). QTLs in MS mouse models are Eae and Tmevd (for Theiler's
murine encephalomyelitis virus–induced demyelinating disease) and for IMD
are Idd. QTLs for the RA mouse model are Cia (for collagen-induced arthritis).
QTLs locations for Crohn's disease were based on a study of mouse inflammatory
bowel disease model induced by dextran sulfate sodium. QTL locations are
based on linkage maps available at the Mouse Genome Database (http://www.informatics.jax.org/
). Relative chromosomal sizes and syntenic relationships are based on human-mouse
homology maps available at the National Center for Biotechnology website
(http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Genome). An alternative interpretation of
the genetic colocalization of susceptibility alleles is that many immunologic
genes are loosely clustered in mammalian genomes. In this interpretation,
the colocalization of susceptibility alleles reflects disease associations
with different genes in the same cluster, rather than a common allele.
Data emerging from fine-mapping studies of susceptibility intervals from
several animal models of AID support this alternative interpretation. The
majority of the susceptibility segments that have undergone fine-mapping
analysis in animal models have been found to contain more than one susceptibility
locus (86, 90, 94, 98-102). Sle1, for example, was recently split into
four separate susceptibility loci, each contributing a unique aspect of
the "SLE1" phenotype originally defined in congenic dissection. Fine-mapping
studies of Idd10, Idd13, Idd5, Idd1 and Idd9 have resulted in the detection
of a cluster of susceptibility loci within each of these congenic intervals.
The frequent detection of the genomic
clustering of susceptibility genes may be interpreted in two ways. It is
possible that the frequency of apparent clustering represents an ascertainment
bias introduced by the relatively weak statistical power of the linkage
analysis of multifactorial traits. That is, linkage is detectable only
in regions of the genome that contain, by chance, several closely linked
susceptibility alleles, the combined phenotypic impact of which yields
a strong signal.
Alternatively, the detection of genomic
clusters may represent an organizational feature of mammalian genomes in
which genes involved in fundamental immune system pathways are occasionally
clustered in specific regions, similar to the gene clusters associated
with cytokine production (96). Data from the Human Genome Project has suggested
the presence of functionally related gene clusters throughout the genome
(103). The Cia3 locus on murine chromosome 6 may provide an example of
such clustering. This region is syntenic to regions on both human chromosome
12p12–p13 and rat chromosome 4 that contain susceptibility loci for IMD,
SLE, MS and RA, and contains a cluster of attractive candidate genes, including
Tnfrsf1a, I15ra, Cd4, Cd27, Tgfa and Bphs (87).
Thus, data supporting the organization
of susceptibility alleles into genomic clusters continue to accumulate.
These may, however, be interpreted as a mundane statistical anomaly or
an intriguing new insight into the organization of mammalian genomes. Although
a complete understanding must await a comparison of the disease alleles
identified in multiple AIDs, it is reasonable to conclude that linkage
colocalization reflects an important feature of the genetics of AID susceptibility.
Certainly, regions of the genome that contain clusters of susceptibility
genes warrant a top priority in future genomic analyses.
Future prospects
A crucial goal for future efforts
in the genetics of AID will be the transition from linkage analysis and
modeling into gene identification and disease pathway analysis. This process
has been impeded by several factors, including the complexity of the mode
of inheritance and the recently discovered genomic clustering of susceptibility
genes. Success in the identification of susceptibility alleles in human
populations will probably await the development of more powerful analytical
procedures and larger patient populations. The extensive development of
SNP technology may also facilitate gene identification, although SNPs alone
may not suffice to overcome the complexities introduced into the analysis
by epistasis and genetic heterogeneity.
The potential for success in AID
gene identification in animal models is, on the other hand, excellent.
Congenic dissection is a powerful tool that allows the characterization
of phenotypes conferred by the individual genetic components of a polygenic
disease, and standard fine-mapping procedures have been used to successfully
narrow the susceptibility interval to as little as 800–1000 kb (84, 86).
The completion of the Human Genome Project and the soon-to-be-completed
mouse project will provide quality molecular and physical maps for both
of these mammalian genomes, which will greatly facilitate positional cloning
efforts by providing rapid access and a precise localization of markers
and positional candidate genes. In animal models, definitive gene involvement
can be obtained by in vivo complementation using bacterial artificial chromosome
(BAC) transgenic technologies (104, 105). Targeted mutagenesis in BACs
and/or embryonic stem cells can also help definitively to identify specific
positional candidates as susceptibility genes.
Finally, the use of gene expression
microarrays to identify genes whose expression is modified by AID or specific
AID susceptibility alleles has the potential to revolutionize mapping strategies
for complex traits. In theory, gene expression analysis can be used to
delineate a plethora of component phenotypes in individuals with disease
and their relatives, potentially providing a variety of new mapping strategies
for complex traits. In addition, this technology should identify genes
that are dysregulated because of the susceptibility allele, thus providing
new insights into disease mechanisms and expanding the array of potential
targets for the development of therapeutic strategies. This technology
may also identify molecular expression phenotypes that may improve the
identification of individuals who are at risk of developing disease, affording
them the opportunity of preventive health care measures. Thus, although
the analysis of multifactorial traits and AID in particular has been challenging,
recent technical developments support an optimistic view of future developments.
References:
©
2001 Nature Publishing Group