Sequence of a Complete Chicken BG Haplotype Shows Dynamic Expansion and Contraction of Two Gene Lineages with Particular Expression Patterns

Download PDF České info

Many immune genes are multigene families, presumably in response to pathogen variation. Some multigene families undergo expansion and contraction, leading to copy number variation (CNV), presumably due to more intense selection. Recently, the butyrophilin family in humans and other mammals has come under scrutiny, due to genetic associations with autoimmune diseases as well as roles in immune co-regulation and antigen presentation. Butyrophilin genes exhibit allelic polymorphism, but gene number appears stable within a species. We found that the BG homologues in chickens are very different, with great changes between haplotypes. We characterised one haplotype in detail, showing that there are two single BG genes, one on chromosome 2 and the other in the major histocompatibility complex (BF-BL region) on chromosome 16, and a family of BG genes in a tandem array in the BG region nearby. These genes have specific expression in cells and tissues, but overall are expressed in either haemopoietic cells or tissues. The two singletons have relatively stable evolutionary histories, but the BG region undergoes dynamic expansion and contraction, with the production of hybrid genes. Thus, chicken BG genes appear to evolve much more quickly than their closest homologs in mammals, presumably due to increased pressure from pathogens.

Published in the journal: . PLoS Genet 10(6): e32767. doi:10.1371/journal.pgen.1004417
Category: Research Article
doi: https://doi.org/10.1371/journal.pgen.1004417

Summary

Introduction

Many of the genes involved in immunity are part of multigene families. In some families, each gene is conserved for a specific function dedicated to a particular outcome, in others allelic polymorphism and copy number variation allow rapid evolution in response to new challenges, and in still other families both kinds of genes are found. Some well-characterised examples for adaptive immunity include genes of the major histocompatibility complex (MHC). For example, both the MHC class I and class II genes of humans and other higher apes have been relatively stable over 10 million years (My), whereas these genes have undergone many changes including extreme copy number variation (CNV) in monkeys [1], [2]. Further examples out of many are the genes encoding natural killer (NK) receptors, which not only undergo enormous CNV, but even use different structural families to carry out similar functions [3], [4]. Understanding the forces involved in this complex interplay of genomic structure, biological function and evolution is one of the challenges of modern genetics, with intense theoretical and experimental interest over many decades [for example: 5–22].

The regions in and around the mammalian MHC also include genes involved in innate immunity, such as the family of butyrophilin (and butyrophilin-like) genes for which an important role in the immune response is emerging. These genes are members of the B7 gene superfamily, many members of which are involved in immune co-regulation [23]–[26]. Some butyrophilin molecules function as inhibitory co-regulators, some may be involved in recognition of stress responses by γδ T cells, while others seem to have more specialised functions (such as synthesis of milk fat globules) and the functions of still others are as yet unknown [23]–[34]. Most importantly, butyrophilin genes have strong genetic associations with a variety of diseases in humans [35]–[49]. These genes encode transmembrane glycoproteins with two extracellular immunoglobulin (Ig)-like domains (one or four for butyrophilin-like molecules), and a few cytoplasmic heptad repeats followed by a B30.2 (or PRY-SPRY) domain [23]–[25], [34], [50], [51]. In humans, one of these genes is located in the MHC and the others in the extended MHC region, while in mouse some of these genes have been translocated elsewhere [23]–[25], [51], [52]. However, within each species the number and kinds of butyrophilin (and butyrophilin-like) genes seem to be fixed.

The SKINT genes are another multigene family within the B7 superfamily for which important roles in immune responses are emerging [24], [25], [53]–[55]. These genes have an extracellular V-like region related to butyrophilins and other members of the B7 superfamily, but have at least three transmembrane regions followed by short cytoplasmic tail. The SKINT1 gene is responsible for selection of a population of γδ T cells which become located specifically in mouse skin. Around the SKINT1 gene (located on a non-MHC chromosome) are several other SKINT genes and pseudogenes, the exact number of which varies between mouse strains. The single member of this family in humans is a pseudogene. Thus the SKINT family provides an example of B7 superfamily genes which appear to be evolving more rapidly than the butyrophilins.

Instead of butyrophilin genes, a related family of BG genes is found in and near the chicken MHC on chromosome 16. Indeed the chicken MHC was discovered as a serological blood group (the “B locus”) determining the highly polymorphic BG antigen on erythrocytes [56]–[59]. It is now clear that there is a multigene family of BG genes, with one gene in the MHC (the BF-BL region of the B locus) and an unknown number of BG genes in the nearby BG region of the B locus. It is also clear that BG genes are expressed, not only on erythrocytes, but with a wide tissue distribution and a number of associated immunological phenomena [60]–[68]. BG genes encode disulfide-linked dimers, each chain having a single extracellular Ig-like region (part of the V domain family) and a long cytoplasmic tail composed of many heptad repeats which presumably form an alpha helical coiled-coil. However, there is one chicken butyrophilin-like gene, Tvc-1, in the chicken genome which was described as the receptor for avian leukosis virus subgroup C, located on chromosome 28 [69].

Thus, chicken BG genes might be derived from ancestral butyrophilin genes, and perhaps have similarly important functions. Despite much speculation concerning associated immunological functions (reviewed in [59]), there are only two clear indications of functions for BG genes. One fortuitous discovery was the “zipper protein”, originally described as a soluble cytoplasmic protein which turned out to be the tail of a BG protein, and which has a role in controlling actin-myosin interaction in intestinal epithelial cells [70]. The other important study re-examined chickens that had been used to show that the BF-BL region (and not the BG region) determined resistance to the tumours caused by Marek's disease virus (an oncogenic herpesvirus) and Rous sarcoma virus (an acutely transforming retrovirus). By single nucleotide polymorphism (SNP) analysis and resequencing of genomic DNA, the authors found that a retroviral insertion into the 3′UTR of the single BG gene of the BF-BL region, the 8.5 or BG1 gene, correlated with resistance to the tumours. Moreover, they presented evidence that an immuno-receptor tyrosine-based inhibitory motif (ITIM) present in the cytoplasmic tail might be important to BG1 function [71]. Thus, the cytoplasmic tail has been identified as important in the two best studied examples of a functional effect for any BG gene, opening the question of what role the high level of polymorphism in the extracellular region might play.

It has become clear that the BG multigene family is quite complex in comparison to butyrophilin genes, and that an understanding of the true functions of particular BG genes will only be possible once a detailed picture of genomics and expression is available. In this paper, we provide the genomic organisation of the BG genes in the B12 haplotype, determine cell and tissue expression for each gene of the B12 haplotype, compare the B12 haplotype in detail with a red junglefowl haplotype used for the whole genome shotgun (WGS) sequence and at less resolution with five other haplotypes, and then consider what the data may mean in terms of multigene family evolution. The results take us to a new level of understanding, from which more detailed analyses can be launched.

Results

There are 14 BG genes in the B12 haplotype of C line chickens: One on chromosome 2, another in the MHC on chromosome 16 along with a cluster of 12 in the BG region

A cosmid library constructed from the genomic DNA of a CB congenic chicken line (B12 haplotype on a CC inbred chicken line background) had previously been used to define contigs, one of which (cluster I) was the BF-BL region (the classical MHC of the chicken) and three others (clusters II–IV) were later recognised as the Rfp-Y region (a region of non-classical MHC genes) [72]–[74]. We screened this library with a BG cDNA probe and picked 50 colonies, which were grown up. Analysis by Southern blot allowed the cosmid clones to be grouped into several contigs, but already many of the cosmids had suffered deletion of the BG genes, and in retrospect others had lost portions of their sequences.

Three authentic contigs were eventually defined by extensive restriction double digest mapping, subcloning, limited sequencing after PCR, and comparison to genomic DNA by Southern blot (Figure S1). One of these contigs corresponded to cluster I (the chicken MHC, or BF-BL region) which we had already shown contained a BG gene provisionally named the 8.5 gene (later renamed BG1). We fully sequenced the 8.5 gene (accession number KC963427, [59]), and later the whole of the cluster I contig (accession number AL023516, [73]). The other two contigs (named cluster V and VI) each contained six BG genes, which were given a variety of provisional names (now renamed BG2 through BG13). In addition, a related region without BG genes was found in each cluster. Three representative cosmids covering most of clusters V and VI were fully sequenced by standard shotgun techniques, confirming the presence of the six BG genes in each cluster along with a small region containing genes for a kinesin motor, a C-type lectin-like receptor and an unidentified protein (Figure 1, Figure S2, accession number KC955130).

Fig. 1. Fourteen BG genes of the B12 haplotype are present as two singletons (BG0 on chromosome 2, and BG1 in the BF-BL region or classical MHC on chromosome 16) and a cluster of twelve genes in the BG region on chromosome 16 (BG2-BG13, all in the same transcriptional orientation but separated into clusters V and VI by a region containing a kinesin motor protein gene, a C-type lectin gene, and an unassigned gene called LOC4255771).

We were concerned whether we had cloned all of the BG genes from the CB chicken. Screening revealed one additional B12 BG sequence (accession number KC955131) from one of our caecal tonsil cDNA libraries, which was called CTBG (and which we will now rename BG0). BLAST analysis of the chicken WGS sequence (www.ensembl.org, release 2.1) showed that this gene is present on chromosome 2 (positions 100590000–100600000), a different chromosome from the chicken MHC on chromosome 16 (Figure S3).

Using the partial sequences of all the genes identified at the time, we had designed potentially universal primers, and performed RT-PCR on a variety of cells and tissues (Figure 2, Figure S4). We found all of the genes from the cosmids expressed (except one, BG2, which we later realised had a single nucleotide change compared to the 3′ end of one of the primers). In addition, we found our supposedly universal primers did not amplify BG0, but specific primers showed that it has a wide if not ubiquitous tissue distribution.

Fig. 2. Individual BG genes of the B12 haplotype have striking expression patterns, as assessed by RT-PCR from cells and tissues using what were expected to be “universal primers” followed by cloning and sequencing.

In addition, we found several BG sequences from cDNA isolated from B12 haplotype chickens of the CB congenic line, but not B12 haplotype chickens from the parent C line. The congenic line CB (B12) was derived from the C line (which contains both B4 and B12 haplotypes) by backcrossing with the highly inbred CC (B4) line. The additional sequences (I, II, IIIa and IV) were eventually found to be BG genes from the B4 haplotype (Figure S4) that presumably were acquired during the backcrossing to produce the CB congenic chicken line.

Finally, to ascertain the relative location of the clusters, we used some cosmid clones as probes in metaphase and fibre-fluorescence in situ hybridisation (fibre-FISH) of chromosomes from B12 splenocytes stimulated with the mitogen concanavalin A (Figure 3). We found a single large cluster of BG genes defined by hybridisation with cosmids from clusters V and VI, separated from the chicken MHC as detected by a cluster I cosmid. In some fibres, hybridisation corresponding to a BG gene was found at the end of the cluster I towards the BG cluster, which oriented the end of the MHC with BG1 toward the BG cluster. Comparison of the hybridisation pattern of the cluster V and VI probes with hybridisation expected by relative nucleotide sequence identity showed that cluster V and VI are contiguous, with cluster V closest to cluster I.

**Fig. 3. The two cosmid clusters are contiguous with the orientation cluster VI-cluster V, followed by the TRIM and BF-BL regions, as assessed by fibre-FISH and sequence comparison.**

This organisation of the BG and MHC clusters was confirmed at the level of DNA sequence (Figure S5). Several BACs that span the BF-BL region through the TRIM region to some unidentified BG genes have been isolated from CB chickens [75]. Amplification from these BACs identified four BG genes located at one end of the cluster V contig, with the outermost being BG2, followed by BG3, BG4 and BG5. Similarly, amplification between BG8 of cluster VI and BG7 of cluster V physically linked these two clusters, with 1047 nucleotides of DNA between them.

Thus, we found 14 BG genes present in the B12 haplotype (as defined by the parent C line). There are two singleton genes, BG0 present on chromosome 2 (in the chicken WGS sequence) and BG1 found in the BF-BL region of the chicken MHC on chromosome 16. Upstream of the BG1 gene is a region containing TRIM genes among others, upstream of which is the BG region, the sequence of which is 99,274 nucleotides long. There are 12 BG genes located in this BG region, all in the same transcriptional orientation, and split into two clusters by the presence of kinesin, lectin and other genes.

Each BG gene has very specific cell and tissue expression, with one group expressed in haemopoietic cells and another group expressed in tissues

As mentioned above, we performed reverse transcriptase-PCR (RT-PCR) on a variety of cells and tissues with what we thought at the time were universal primers (which however turn out not to amplify BG0 or BG2). For each cell and tissue, we cloned the PCR products and counted the number of clones from several independent amplifications, a method used successfully for assessing the relative expression of MHC genes [76], [77]. With this simple assay, we found truly striking patterns of expression for each of the analysed genes, with only a few genes expressed in each cell type and restricted patterns even for tissues (Figure 2). To provide additional support for this approach, we developed specific RT-qPCR assays for two haemopoietic and two tissue BG genes, and found that the results with spleen, bone marrow, liver and duodenum confirm our expectations based on the data from the approach of amplifying, cloning and sequencing (Figure S6).

In the blood cells of chickens with the B12 haplotype, T cells strongly express BG9 and express weakly BG12. B cells also express BG9 and BG12, but in addition strongly express BG7 and weakly express BG8 and BG13. In contrast, thrombocytes do not express BG9, but more evenly express BG5 and BG11 along with BG7, BG8 and BG13 (like B cells) and BG12 (like both B and T cells). Macrophages isolated from blood, whether or not treated with agents like lipopolysaccharide (LPS) and interferon-gamma (INFγ), strongly express BG7 (like B cells) and BG9 (like T and B cells). Dendritic cells developed in culture with IL4 and GM-CSF from precursors in the blood strongly express BG9 (like B cells, T cells and macrophages).

BG molecules were originally discovered as blood group antigens on erythrocytes, which do not contain appropriate messenger RNA. However, bone marrow from phenylhydrazine-treated chickens should be enriched compared to normal bone marrow in erythroid precursor cells which have RNA for the BG molecules found on erythrocytes. On this basis, erythrocyte BG molecules of the B12 haplotype may include BG7 (like B cells, thrombocytes and macrophages), BG8 (like B and thrombocytes), BG11 (like thrombocytes) and BG12 (like B cells, T cells and thrombocytes). Also found but not enriched were BG5 (like thrombocytes) and BG13 (like T and B cells).

Primary lymphoid organs such as thymus and bursa are the source of mature peripheral T and B cells, respectively. In these organs, precursor lymphoid cells undergo complex differentiation and selection events dependent on a variety of other cell types, including one or more types of so-called stromal cells. These stromal cells, at least in the thymus, are known to include both haemopoietic and non-haemopoietic cell types. As might be expected, the expression of BG genes in these organs is complex. Comparison of the expression of BG genes in thymus and bursa with the small lymphoid cells suggest that (at least some of) the precursor T cells (thymocytes) express BG9 and BG12 like T cells. At least some of the precursor B cells in the bursa may express BG9 like B cells, but there was no evidence of expression of BG7 (which was strongly expressed in B cells), BG12 or BG13 (more weakly expressed in B cells), so these latter three may be differentiation antigens in the B cell lineage. Comparison of stromal cell populations prepared by gradient centrifugation from precursor cells or by killing most of the rapidly dividing precursor cells with cyclophosphamide suggests that in the B12 haplotype the various stromal cells of thymus may express BG3, BG7, BG8, BG9, B12 and BG13, while the stromal cells of bursa may express BG3, BG4, BG9 and BG10.

In other tissues, the expression patterns were complex, which may be the result of a single cell type expressing several BG molecules, or may reflect the presence of multiple cell types each of which expresses certain BG molecules. Notably, we only detected BG1 expression in intestine (adult duodenum but not embryonic enterocytes), even though the original 8.5 genomic fragment was found to hybridise to RNA from chicken thymus, liver, a T cell line and a B cell line [72]. Also, we found intestinal expression of BG10, which has a nearly identical sequence of the cytoplasmic tail to the previously identified zipper protein (Figure S3), a protein described to regulate actin-myosin interaction in the intestinal epithelium [70]. Duodenum strongly expressed BG1, BG3, BG4, BG6 and BG10. By contrast, embryonic enterocytes expressed only BG4. Brain strongly expressed BG9 (if macrophage-like microglia are the source, then they differ from peripheral macrophages which strongly express both BG7 and BG9), while kidney strongly expressed BG10 (like stromal cells of thymus and bursa).

Overall, we were able to discern two types of expression, one primarily in haemopoietic cells and the other primarily in tissues (presumably from other cell types, at least some of which we expect to be epithelial/stromal cells). Haemopoietic BG genes include BG5, BG7, BG8, BG9, BG11, BG12 and BG13; while the tissue BG genes include BG1, BG3, BG4, BG6 and BG10. As described below, the assignments of haemopoietic -⁠ and tissue-type BG genes correlate perfectly with the relationship of the presumed promoter and 5′UTR of these genes.

All the BG genes have similar structure, but there are many hybrid genes with the 5′ end determining cell and tissue expression

In order to better define these 14 BG genes, we compared them with published cDNA clones [38], [44], all of which were from other haplotypes, so we could not be sure whether we were comparing a cDNA with the appropriate gene. Using CLUSTLx, we found the same general organisation for every BG gene in the B12 haplotype (Figure 1): a first exon composed of roughly 200 nucleotide 5′UTR followed by a short signal sequence, a second exon encoding the immunoglobulin variable-like (Ig V-like) extracellular domain, a third exon encoding a connecting peptide and transmembrane region, a large number of small exons encoding 7 (or sometimes 8) amino acids which altogether would result in a cytoplasmic region with the potential to produce a coiled-coil, followed by an exon of the 3′UTR.

We then compared the predicted intron-exon structure with our authentic cDNAs of BG0 isolated from a B12 library or amplified from transfectants with BG1, BG10 and BG11 genes of the B12 haplotype (Figure S7). The exons encoding the V-like region, transmembrane region and the cytoplasmic domains were perfectly predicted, with the exceptions due to alternative splicing or read-through in the cytoplasmic exons (and one predicted cytoplasmic exon in BG10 that was not found). Similarly, the end of the first exon and the beginning of the last exon were perfectly predicted. For BG0, the WGS sequence showed two 3′ ends, but they turned out to be due to mis-assembly in the genome sequence (Figure S8). For BG1, BG10 and BG11, the location of the primers for amplification of the cDNA preclude assignment of the very 5′and 3′ ends of the genes. However, sequence comparison strongly suggests that 5′ of all the BG genes start at roughly the same place. Moreover, the 3′ ends seem clear from the conserved location of the single polyadenylation site in all genes.

In order to determine the relationship of the 14 BG genes, we performed phylogenetic analysis by neighbour joining (NJ) on the whole genes as well as portions of the genes. Dendrograms of the whole genes show two groups of paralogues, one for BG genes expressed in haemopoietic cells and the other for BG genes expressed in tissue BG genes in the BG region (Figure 4). As will become clear below, the topology of these trees depend on the relative amount of sequence from different parts of the gene.

Fig. 4. Phylogenetic analysis reveals six kinds of BG genes in the B12 haplotype: The two singletons each separately, and the twelve BG genes of the BG region in four groups indicating the presence of hybrid genes.

The dendrograms of the presumed promoter (as defined by the sequence of the first 500 nucleotides upstream of the 5′UTR) and the 5′UTR (as defined by sequence similarity to published cDNA sequences) showed the same topology as the whole genes, a topology with long branches and strongly supported by the bootstrap values (Figure 5). Comparison of these trees to the expression patterns in Figure 2 indicates that the promoter region (and possibly the 5′UTR) of a BG gene is the primary determinant(s) of cell and tissue-specific expression.

Fig. 5. Phylogenetic analysis of nucleotide sequences for different regions of BG genes indicates separate evolutionary histories, consistent with recombination and/or deletion leading to hybrid genes in the BG region.

There is almost no sequence identity between the two groups of promoters out to 1000 nucleotides before the 5′UTR. However, the analyses were carried out with 500 nucleotides corresponding to the proximal promoters, because the distal promoters of two BG genes contain sequence brought in from neighbouring genes (Figure S9). The distal promoter region of BG1 includes a duplication of a portion of the promoter region in between the neighbouring BNK and Blec genes, followed by a region of sequence of unknown evolutionary origin, and finally the proximal promoter region that is similar to other tissue BG genes. Similarly, the distal promoter of the BG9 gene in the B12 haplotype is unlike the consensus BG genes, and appears to have been derived from a hypothetical protein gene which is present in the red junglefowl haplotype but which has been deleted in the B12 haplotype (as shown below). This distal promoter sequence contains several brain-specific transcription factor binding sites (Figure S9), consonant with the expression in brain of BG9 in the B12 haplotype.

The sequences of the 5′UTR of the BG genes also fall into two groups (Figure 6), with a large specific deletion in the haemopoietic BG genes compared to the tissue BG genes. It seems likely that the difference is a true deletion, since two 27 nucleotide direct repeats are found in the tissue 5′UTRs which upon recombination would yield the deletion found in haemopoietic 5′UTRs. The simplest interpretation is that all the haemopoietic BG genes in the B12 haplotype are descended from a single ancestor, but it is also possible that concerted evolution between BG genes could lead to the same result.

**Fig. 6. Sequence alignments for the 5'UTR of B12 BG genes, showing the separation into genes expressed in hemopoietic cells and in tissues.**

In contrast to the unambiguous dendrograms of the 5′ end, the region corresponding to the signal sequence and the extracellular V-like region formed dendrograms with completely different topologies, which had short branches and were generally poorly supported by the bootstrap values (Figure 5). However, there are three groups each consisting of very similar V sequences: BG3, BG4 and BG5; BG7 and BG11; and BG8, BG9 and BG12. The last group of genes also share a deletion in intron 1, resulting in an intron of 113–144 nucleotides for BG8, BG9 and BG12 compared to 352–354 nucleotides for all the other BG genes. All the signal sequences and V-like regions have the expected sequence features described for BG genes, including the lack of N-linked glycosylation sites in the extracellular domain. This means that, contrary to almost every other type I membrane protein, all BG molecules lack N-linked glycans (as previously shown for BG molecules from erythrocytes, [60]), a curious property that has not yet been explained.

The dendrograms of the connecting peptide/transmembrane exon also yielded a tree (Figure 5) with short branches and low bootstrap values, but are separated into two broad groups. The sequences of these regions (Figure 7) are virtually identical among the BG genes, with a helical wheel depiction suggesting a flattened side for interaction between the two chains. In addition, some polar residues are found in most sequences, which in transmembrane regions can indicate specific interaction with polar residues of other chains. For all but three of these BG genes, the polar residues include two basic amino acids (histidine and lysine) near the start of the transmembrane region, but there is a well-supported group of three BG genes (BG8, BG9 and BG12) with hydrophobic leucine and polar threonine in those positions. All three are haemopoietic genes in which the V-like regions also form a group, perhaps indicating relatively recent duplication events. One gene (BG2) has a proline in the transmembrane region, which is most unusual. Finally, at the end of the transmembrane there is a tyrosine in all of the BG sequences except BG6 and BG7 which have a cysteine perhaps indicating a palmitylation site, and BG8, BG9 and BG12 which have a histidine.

Fig. 7. Sequence relationships for the connecting peptide to transmembrane region of B12 BG genes show two groups, those which have histidine and lysine near the N-terminus of the transmembrane region, and those with a leucine and threonine (arrows).

The cytoplasmic tail is encoded by small exons, 21 (or sometimes 24) nucleotide exons long. The predicted number of such exons varies between BG genes in the B12 haplotype, ranging from 13 in the BG1 (8.5) gene to 36 in the BG10 (zipper protein-like) gene, with a mean number of 26 (Figure S2). Out of 358 total exons, we identified 57 different groups of nucleotide sequences (Figure S10). Removing exons present only once among in the 14 BG genes, we could discern clear patterns, particularly if the first roughly 20% of the exons were removed from analysis. The dendrogram (Figure 4, Figure S10) based on this last 80% (including exons present only once in the 14 BG genes) shows two groups separated by long branches and with strong bootstrap support, along with separate branches for BG0 and BG1. However, the dendrogram has a different topology than the whole gene, the promoter and the 5′UTR, and much like that for the 3′UTR (as shown below).

Some cDNAs show unspliced (or retained) introns for which the sequence remains in frame, and we suggested [78] that these extra stretches coding for protein have the potential for interesting functions. The original cDNAs for BG1 (8.5) have a long stretch of contiguous C-terminal sequence (accession numbers KC955132 to KC955136, [59]), but our analysis shows that this region is in fact due to what was originally an unspliced intron, because the flanking 21 nucleotide repeat exons can be identified which have apparently reasonable splice sites (Figure 2, Figure S11). Interestingly, this region has recently been identified as containing a functional immunoreceptor tyrosine-based inhibitory motif (ITIM) [71], fulfilling our original prediction. We examined all of the sequences for the possibility of unspliced introns with in-frame sequence, and found between one and five per gene (Figure S11). We found ITIMs in translated intron sequences of six other genes (BG3, BG6, BG8, BG9, BG12 and BG13), but translation of all of these introns gave stop codons almost immediately after the ITIM, which would lead to truncated cytoplasmic tails (Figure S11).

The 3′UTRs range from 465 to 481 nucleotides in length, encoded by BG13 and BG11 respectively. Dendrograms (Figure 5) show two groups with the same topology as the cytoplasmic exons, with long branches and good bootstrap support.

Overall, the 5′ end of the gene clearly defines two groups that reflect the tissue distribution, the 3′ end defines two different groups, and the region in between does not fall into simple groups. Phylogenetic trees constructed by Bayesian analysis and by Maximum Parsimony (MP) give comparable topologies as the NJ method (Figures S12 and S13), and AU and SH tests after MP analysis provide statistical support for the presence of the two groups at the 5′ end and the two groups at the 3′ end, but no clear groups in between (Figure S13). This result is most easily explained by the presence of hybrid genes (in the sense used in reference [20]) formed by recombination between the two ends, in which the middle of some (and maybe all) genes has been so randomised by recombination that no phylogenetic signal is left. Neighbour network analysis by SplitsTree, a Phi test and an automated partitioning algorithm all support a history of extensive recombination across the BG genes, with independent histories for the 5′UTR, the V-like region and the 3′UTR (Figures S14 and S15). Recombination is certainly a plausible explanation for the sequence relationships found, since the 12 BG genes in the BG region are all close together in the same transcriptional orientation, so hybrid genes could be produced either by unequal crossing-over (through interchromosomal recombination, also known as non-allelic homologous recombination or NAHR) or by deletion (through intrachromosomal recombination) during meiosis. One of the consequences of such unequal crossing-over or deletion is expansion and contraction of this part of the multigene family, leading to copy number variation (CNV) in the BG region.

In this view, haemopoietic genes have either their original haemopoietic 3′ end or a tissue 3′ end, and tissue genes have either their original tissue 3′ end or a haemopoietic 3′ end. Unfortunately, with the data at our disposal, we cannot be absolutely sure which is which, so for the time being we will refer to the 3′ ends as type 1 and 2. Thus, BG8, BG9, BG12 and BG13 might be pure haemopoietic genes, BG5, BG7 and BG11 might be haemopoietic genes with a tissue 3′ end, BG2 and BG10 might be pure tissue genes, and BG3, BG4 and BG6 might be tissue genes with a haemopoietic 3′ end (Figure 8). Alternatively, BG7 and BG11 might be pure haemopoietic genes, BG5, BG8, BG9, BG12 and BG13 might be haemopoietic genes with a tissue 3′ end, BG3, BG4 and BG6 might be pure tissue genes, and BG2 and BG10 might be tissue genes with a haemopoietic 3′ end (Figure S16).

**Fig. 8. The presence of hybrid BG genes in the B12 haplotype shows no obvious pattern, consistent with a random process of recombination in the centre of the genes.**

Definition of BG genes in a red junglefowl haplotype and comparison with the B12 and other haplotypes shows evidence of expansion and contraction of the multigene family through deletion of genes and swapping of whole BG clusters

The WGS sequence was created from a chicken of the UCD001 line, an inbred red junglefowl line with the BQ haplotype, closely related to the standard B21 haplotype in experimental lines of chickens derived from egg layers [79]. Other than BG0, BG1, BG2 and BG10 (with BG10 being zipper protein-like), no BG genes were correctly identified by ENSEMBL in this genome sequence.

By using BLAST to probe with a 3′UTR sequence, seven BG genes arranged in tandem and in the same transcriptional orientation were identified on a supercontig (covering contigs 318.1 to 318.6) in the contiguous sequence for chromosome 16 (Figure S17). The automatic annotation programme GENSCAN utilised by ENSEMBL apparently did not recognise the 5′ ends of these BG genes, and therefore they were only predicted as producing a single long transcript. The position and orientation of this cluster was verified by comparison to a BAC contig from the same chicken [80], from which the first two BG genes as well as a lectin-like gene, a kinesin gene and the intervening downstream TRIM region had been sequenced (Figure 9). However, in the portion for which there is only the WGS sequence, there are gaps in assembly that raise the possibility of an additional two BG genes.

Fig. 9. Comparison of cosmid cluster VI from the B12 haplotype with the BQ haplotype from a red junglefowl, showing regions of virtual identity separated by two large indels, one in the middle of the sequences and the other where the red junglefowl haplotype (but not the B12 haplotype) continues into the TRIM region.

Direct sequence comparison of the red junglefowl sequence from the BACs, the red junglefowl sequence from the WSG sequence and the B12 sequence from the cosmids (Figure 9) shows that there has been a precise replacement of the BG11 gene in the B12 haplotype with at least four genes in the red junglefowl haplotype, with 99.98 and 98.90% sequence identity between the two haplotypes on the left side and the right side, respectively, of the breakpoints. Moreover, the red junglefowl sequence goes directly into the TRIM region after the lectin-kinesin gene pair, whereas the B12 sequence has two additional BG genes after the lectin-kinesin gene pair and no indication of the TRIM region. There are also some deletions in the WGS sequence compared to the BAC sequence, which may reflect differences in the exact haplotypes or in sequence assembly. However, this comparison strongly supports the notion that recombination leads to strong differences between BG haplotypes.

In addition, at least nine red junglefowl BG genes arranged in tandem were identified in the bin “chromosome 16 random”, which consists of contigs predicted to be on chromosome 16 but not assembled with the contiguous portions of the WSG sequence (Figure S17). The order of these genes is not known, but on the basis of fibre-FISH they form another cluster, located next to the first red junglefowl cluster (Figure 10). Thus, there appears to be in the neighbourhood of 18 BG genes in the BG region of the red junglefowl haplotype compared to 12 BG genes in the B12 haplotype, demonstrating CNV for the BG region.

Fig. 10. The BG regions of six haplotypes are located in the same orientation from the BF-BL region, but vary in size and composition, as assessed by fibre-FISH using probes corresponding to the cosmids cG43 from BG cluster V (red), cG24 from BG cluster VI (green) and c4.5 from BF-BL cluster I (white).

Phylogenetic comparison of these two BG clusters from the red junglefowl haplotype with the B12 haplotype showed that the first red junglefowl cluster is highly related to cluster VI from the B12 haplotype, but the second red junglefowl cluster is not closely related to any of the other clusters (Figure S18). Fibre-FISH shows that the two red junglefowl clusters are contiguous, based on their length and hybridisation to the B12 clusters (Figure 10), and the evidence from comparison to the reported BAC sequence locates and orientates the first cluster next to the TRIM region. Thus, the order of the clusters in B12 is BG cluster VI-BG cluster V-TRIM region-BF/BL region whereas the order of the clusters in red junglefowl is Second BG cluster-First BG cluster (related to cluster VI)-TRIM region-BF/BL region. This remarkable result is most easily explained by large-scale expansion and contraction events in the BG region, with whole clusters swapping in and out.

To test whether the differences between the B12 and red junglefowl sequences were due to one of them being an outlying variant compared to most MHC haplotypes, we performed fibre-FISH on an additional five haplotypes (B2, B4, B15, B19 and the true B21 haplotype). It is apparent that the order of the BG, TRIM and BF-BL regions is stable, but that the BG regions vary in size and order of BG genes (Figure 10). Thus it would appear that the expansion and contraction of the BG genes in the BG region is a general phenomenon.

Discussion

For the first time in the study of BG genes, we have an understanding of the genomic organisation of a complete BG haplotype, coupled with a comparison to other BG haplotypes and a determination of cell and tissue expression. Two overarching points emerge among the many new findings, which together portray BG genes as a much more dynamic and complex genetic system than their closest mammalian homologues, the butyrophilin genes.

The first major point that we establish in this paper is the very specific cell and tissue expression for each of the BG genes, which overall form two groups (along with one gene that may have a more ubiquitous tissue distribution), strongly supported by the phylogenetic analysis of the presumed promoter regions. Although BG molecules were first discovered as a polymorphic antigen on erythrocytes, it has been clear for some years that there is a multigene family of BG genes, at least some of which were expressed in other cell types, including thrombocytes, B and T cells, bursal and thymic stromal cells, and intestinal cells [63], [65], [78], [80]–[82]. However, there has never been a complete list of BG genes for a haplotype, nor a comprehensive analysis of which genes are expressed in which cells and tissues.

In this paper, we examine all the BG genes of the B12 haplotype both by sequence and expression analyses and find that some BG genes are expressed in one or another cell of the haemopoietic lineage while other BG genes are expressed in tissues, likely from non-haemopoietic lineages. These assignments are strengthened by the fact that the 5′ ends (putative promoter and 5′UTR) of the genes from the BG region also fall into two groups which fit exactly with the presumed cell and tissue distributions (with the exception of the singleton BG genes, discussed below). Interestingly, the haemopoietic genes of the B12 haplotype all have a deletion within the 5′UTR, which almost certainly arose by recombination between two 27 nucleotide direct repeats found in all tissue BG genes. These data might be interpreted to suggest that all haemopoietic BG genes descended from a single BG gene, with the tissue BG genes being ancestral.

Within these broad categories of haemopoietic and tissue BG genes, the specificity of expression of particular BG genes in a single cell type is remarkable, with some genes changing expression during differentiation. For instance, only one BG gene in the B12 haplotype is strongly expressed in T cells sorted from peripheral blood. In contrast, two BG genes are strongly expressed in B cells sorted from peripheral blood, but one of these was not found in bursa, the primary lymphoid organ for the production of B cells. Changes in expression during differentiation are also suggested for the BG3 gene, which is strongly expressed in T and B cells, thymus and bursa, macrophages and dendritic cells, but not thrombocytes nor bone marrow from which all haemopoietic lineages are thought to originate. Interestingly, the BG3 gene is also strongly expressed in brain, and at least one transcription factor binding site specific for neurones is found in the putative promoter of BG3. Expression of particular genes may also change during activation of a cell type, but for macrophages a number of strong stimuli failed to affect the two strongly-expressed BG genes, BG3 and BG13. Overall, much more work needs to be done to explore the complex expression patterns of genes from the BG region.

Of the two genes located outside of the BG region, BG0 has an apparently ubiquitous tissue distribution while BG1 is expressed in intestine and kidney. The 5′ regions of these two genes are different from the other BG genes; the BG1 promoter is in fact partly composed of inverted pieces of the promoters of neighbouring genes.

The second major point that we establish in this paper is the presence of BG genes with different evolutionary histories, some relatively stable and others changing rapidly. As mentioned above, there has long been evidence for multiple BG genes, many of which were located to the BG region and one located to the BF-BL region [62]–[64], [67], [78], [80]. In this paper, we report two single BG genes with relatively stable evolutionary histories compared to the many BG genes located in a cluster of clusters, for which there is dynamic expansion and contraction and thus a much more complex evolutionary history.

The singletons located outside of the BG region are the newly described BG0 gene on chromosome 2, and the long-studied BG1 gene located in the BF-BL region. Although they have similar intron -⁠ exon structures and both have promoters different from other BG genes, many other features differ between these genes. We found BG0 as a cDNA from B12 caecal tonsil (a gut-associated lymphoid tissue), and comparison with available sequences showed that it was nearly identical to a fragmentary cDNA clone reported as isolated from B21 embryonic erythrocytes [67]. Specific PCR amplification shows that it has a wide, perhaps even ubiquitous, tissue distribution, and is present in every haplotype examined as a nearly monomorphic transcript (Chattaway, Salomonsen and Kaufman, unpublished). These properties suggest that BG0 has a biological function that is stable and well-conserved, perhaps homeostatic. In contrast, BG1 was originally described as expressed in liver, thymus, a T cell line and a B cell line [72], but our tissue distribution shows that it is well-expressed in thymus, intestine and kidney. Orthologous sequences are present in every haplotype examined, but there is allelic sequence variation throughout the gene, including expansion and contraction of the cytoplasmic exons [71], [83](Chattaway, Salomonsen and Kaufman, unpublished). The properties of BG1 suggest a function that is under some selection to change, perhaps in response to changes in pathogens (as has been suggested by genetic evidence, [71]). However, another possibility that has not been ruled out is that the genetic variation is due to hitch-hiking on nearby MHC genes which are under strong selection for variation [84], [85].

At a descriptive theoretical level, the presence of BG genes as singletons outside the BG region is best understood as a consequence of the “birth and death model” of multigene family evolution [17], [21], for which it has been noted that single gene duplicates can arise by replicative translocation [15]. Moreover, such theoretical considerations suggest that evolution of new functions is likelier in such singletons compared to a tightly-linked multigene family [15], [19]. More recent evolutionary dynamics of these BG singletons is more likely to be governed by a model of divergent evolution for alleles [17], [21].

In contrast, the multigene family of BG genes present in the BG region is organised as a cluster of clusters, and is undergoing significant expansion and contraction. The identification of clusters is based on the presence of non-BG genes at or near the end of each proposed cluster, a kinesin motor gene, a C-type lectin-like gene and an unclassified open reading frame. These genes in a characteristic order are found near the end of each of the B12 cosmid clusters V and VI, and were also identified at the junction of the BG region with the TRIM region in the red junglefowl haplotype used for the WGS sequence [36], [54]. These genes may be considered as “framework genes” in the sense originally proposed by Amadou [86], in which nearly single-copy genes with stable functions flank regions of expanding and contracting multigene families of genes with various functions.

In this paper, we present five pieces of evidence consistent with significant expansion and contraction leading to CNV of the BG region by recombination and deletion: presence of cDNA sequences derived from the B4 haplotype in congenic B12 chickens, presence of apparent hybrid genes in the B12 haplotype by phylogenetic analysis of sequence, deletion of genes evident from sequence comparisons of the proximal cluster of BG genes in the B12 and red junglefowl haplotype, apparent swapping of clusters by comparison of B12 and red junglefowl haplotypes by sequence and fibre-FISH, and differences in length and specific hybridisation patterns by fibre-FISH between six BG region haplotypes. Such change within the BG region is consistent with early biochemical evidence of recombination based on analysis by two-dimensional gel electrophoresis [80], [87].

At a descriptive theoretical level, the appearance of the BG multigene family might be best explained by the “birth and death” model [17], [21], in which duplication leads to multiple gene copies out of which some genes may be retained, while others become nonfunctional. At the moment, there is no obvious evidence for homogenisation of BG genes, so the question does not arise whether there is birth and death followed by purifying selection or concerted evolution by ongoing sequence exchange resulting in homogenisation [14], [15], [17]. However, one of the hallmarks of the “birth and death” model is considered to be the presence of pseudogenes [17], [21], and there is no evidence that any of the BG genes examined are non-functional, given that they all have long open reading frames and they are all expressed at the RNA level. One possibility is that in other BG haplotypes there are pseudogenes that have yet to be described. Another possibility is that the maintenance of the BG multigene family in the BG region is based on a so-called “mixed model” [17], [21], perhaps with sequence exchange repairing any pseudogenes. A third possibility is that a new theoretical model should be considered. In any case, the current evidence suggests that changes in BG genes within the BG region occur by unequal crossing-over and deletion without the obvious appearance of pseudogenes. The importance of recombination in the evolution of these BG genes seems clear, but at the moment there is no evidence to suggest how fast it might be in comparison with other multigene families [22], although the appearance of B4 BG genes in the congenic CB chicken line during back-crossing suggests that it could occur over a few generations. Whatever the speed of such recombination, without selection the number of BG genes would likely gradually reduce to one [9], so this suggests that selection for function is also an important part of BG evolution.

Such rapid evolution based on various outcomes of recombination is not easily reconciled with models for simple specified functions of all the BG molecules encoded by the BG region. We have found that the 5′ and 3′ ends of BG genes can often be switched to make hybrid genes, with the phylogenetic signal of the middle portion of the gene apparently scrambled. However, we also found that the 5′ end of the genes determine cell and tissue expression. It has long been postulated that the extracellular V-like region in the middle of the gene is involved in some immunological or cell-cell interaction function, and the cytoplasmic tail at the 3′ end in interactions with the cytoskeleton [59]. Indeed, there is direct biochemical evidence that the cytoplasmic tail of zipper protein (similar to the BG10 gene of the B12 haplotype) can regulate actin-myosin interaction in the intestinal brush border, and an implication that variation in the number of cytoplasmic exons of BG1 is involved in resistance to certain viruses [70], [71]. The conundrum is how there can be a stable function based on the middle or 3′ end of a gene, when expression of this gene can suddenly be switched by recombination bringing in a new 5′ end to another cell or tissue, presumably a random genetic event. The resolution of this apparent conundrum is one of the next important tasks.

One possibility is that there are BG genes within the BG region that are relatively stable with important and specific functions, and that between them there are expansions and contractions of genes whose expression and function can change rapidly. The existence of such stable “framework genes” flanking regions of genomic change has been particularly well-characterised for the killer inhibitory receptor (KIR) gene cluster which encodes receptors on natural killer cells [3], [20], [88], [89]. Indeed, the KIR genes show other similarities to the properties of the BG genes, including “tail-swapping” in which the inhibitory and activating 3′ ends of genes with different 5′ extracellular regions are switched [90]. If this model of genomic evolution of the BG region is correct, we expect to find some orthologous genes in every BG haplotype, whose alleles will carry out similar functions and be expressed in similar cells and tissues.

Identifying and characterising such framework genes in the BG region is another of the next important tasks. As an example, the sequence of the zipper protein characterised in an unknown chicken line is nearly identical with BG10 of the B12 haplotype, suggesting that this may be a framework gene with a highly conserved function and cell expression. In fact, the zipper protein was discovered as a protein which can control actin-myosin interactions in intestinal brush borders [70], likely to be a homeostatic control for which rapid evolution would not be advantageous.

Another possibility is that there is sufficient functional redundancy between different BG proteins to relieve the selective pressure to maintain the expression of individual genes. If this was the case, perhaps the same function could be achieved in different haplotypes by structurally and functionally similar but not necessarily orthologous genes. At a descriptive level, this kind of evolution might be considered “subfunctionalisation”, in which newly duplicated genes share and then partition the functions of the original gene [14]. It has been noted on theoretical grounds that tight linkage increases the probability of subfunctionalisation at the expense of neofunctionalisation [15]. Unravelling which genes in different haplotypes may serve the same and different functions is a third important task.

At least some of the BG genes that undergo rapid evolutionary change will almost certainly have important functions as well, characterised by the need for diversity and rapid response to changing environmental conditions. The likeliest scenario is a molecular arms race with pathogens, in which the diversity of such BG genes is driven by the ever-changing variation in (certain) pathogens. As mentioned above, resistance and susceptibility to two oncogenic viruses have recently been ascribed to a retroviral insertion in the 3′UTR of the BG1 gene [71]. A fourth important task is thus to understand the mechanisms of how such BG genes enable an effective response to a pathogen.

Materials and Methods

Animals

Samples were taken from experimental chicken lines kept at the Basel Institute for Immunology in Switzerland, the Institute for Animal Health in the UK, the Ludwig Maximilians University in Germany, and the University of Cambridge in the UK. The origins and derivations of the chicken lines used in this work are described in some detail [77 and references cited therein]. All animal work was conducted according to the relevant national guidelines in place at the time of the research. Most recently, approval for animal research was through UK Home Office Licenses and local Ethics Committees at the Institute for Animal Health at Compton, and the University of Cambridge.

Isolation and sequencing of BG genes (Figures 1, S1 and S2)

Standard molecular biology techniques used in this paper are described in detail [91], [92], which typically are referenced along with modifications in the accompanying citations given below. Southern blotting using BG cDNA probes [78] identified the transcribed fragment 8.5 from various cosmid clones of the line CB (B12) chicken [72] as a BG gene (subsequently named BG1). A 6.5 kB fragment from cosmid cβ12 [72] cut with Nru I and Hind III was isolated and cloned into Bluescript (cβ12NK6.5BS2a clone), subclones were sequenced as described [93] by dideoxynucleotide technology using ³²P-labelled ATP, some portions after isolation of single strands, and the sequence was deposited in Genbank (accession number KC963427).

A cosmid library from a line CB (B12) chicken [72] was screened with cDNA clones G1, G5, G7 and G8 [53] labelled with ³²P by nick-translation. The 50 positive plaques were picked and grown up for isolation of DNA, which was analysed by Southern blot using various BG cDNA probes isolated from a H.B19 (B19) chicken spleen library [78]. Double-digestion restriction maps and limited sequencing were used to group the cosmids into clusters [59], with extraneous sequence of chimeric inserts determined by genomic Southern blots. DNA from cosmids cG43, cG3 and cG24 was sheared, cloned and sequenced using commercial fluorescent dye reaction kits followed by capillary electrophoresis at the Sanger Institute. The clusters were linked using the data from Figure S5C in Cambridge, and the whole sequence deposited in Genbank (accession number KC955130).

Polymerase chain reaction

Amplification from DNA and cDNA was carried out using different commercial kits at different times during the work described in this paper. Typically, the amplifications were carried out with commercial kits using manufacturer's instructions (sometimes optimised for Mg, dimethylsulfoxide or polyethyleneglycol concentration), with 2–5 min at 94–96°C for initial denaturation, followed by 30 cycles of 0.6–1 min at 94–96°C for denaturation, 1–2 min at a lower temperature (depending on the primers) for annealing and 1–5 min (depending on amplicon length) at 72°C for extension, and ending with 10 min at 72°C for final extension followed by storage at 4°C.

Isolation of BG cDNA for comparison to genomic sequences (Figures S3 and S7)

For BG0, clones were isolated from a cDNA library in Basel. Briefly, RNA was isolated from caecal tonsil of a CB chicken and cloned into λZAP vector to make the CT-2 library (much as described in [94]), which was screened with BG cDNA probes as above. Phage clones were picked and converted by plasmid clones by in vivo excision with a helper phage VCSMI3 in BB4 cells. DNA was prepared by the CTAB miniprep method, and the length of insert was determined by digestion and Southern blotting using BG cDNA [78] and genomic clones from the BG1 gene as probes, with clones 45A, 47B and 47C eventually picked for full sequencing using a T3 thermocycler (Biometra), a fluorescent dye terminator kit and a 373A DNA sequencer (both Applied Biosystems).

For BG1, BG10 and BG11, cDNA clones were isolated from transfectants in Basel. Briefly, mouse L cells with inactivated thymidine kinase gene (Ltk^- cells) were transfected by standard calcium phosphate procedures [91], [92] with clones cβ12NK6.5BS2a, cG13 and cG22₂. Cells expressing BG genes were selected by neomycin, then enriched by fluorescence-activated cell sorting and cloned by limiting dilution followed by screening using flow cytometry, in both cases using a pool of mAb to BG including BG2, 3, 4, 5, 6 and 9 [65]. Total RNA was prepared from clones 1.4 and 1.7 (cβ12NK6.5BS2a containing BG1 gene), 6N.39 (cG13 containing BG10 and BG11) and 8N.37 (cG22₂ containing BG10 and BG11) using the FastTrack kit (Life Technologies) and cDNA was prepared using Superscript reverse transcriptase (Stratagene). BG cDNA was amplified using a T3 thermocycler, a commercial Taq polymerase kit (Roche), with an annealing temperature of 45°C. BG1 cDNAs were amplified using reverse primer 2773 containing a Not I site (ATATATgcggccgcCTYTGGTTTCTTTCTTCCAATTGG) based on the cDNA clones G3 and G8, and a series of forward primers each containing a Sal I site, 2960 (TATATgtcgacTGGCAGAATTACTGGTGCC), 2961 (TATATgtcgacCTGGTGATAGCAGAGACCC) and 2962 (TATATgtcgacGGTAGAAGCTGGGC) designed based on the genomic sequence (accession number KC963427) to establish the 5′ end of the BG1 transcript. BG10 and BG11 cDNAs were amplified with forward primer 8081 containing an Nru I site (CACGTtcgcgaCCATGSNSTTYNYATYRRGMTGC) and reverse primer 2774 (sequence above). Amplified fragments were cut with appropriate restriction enzymes, gel-purified and cloned into Bluescript plasmid, with BG1 clones PCRX1, 3 and 5 from transfectant 1.7 and PCRX11, 29, 31 and 35 from transfectant 1.4, BG10 clones 34, 42, 47 and 50, and BG11 clones JK29, 32, 37, 45 and 49 eventually chosen for full sequencing using a T3 thermocycler (Biometra), fluorescent dye terminators kit and a 373A DNA sequencer (both Applied Biosystems).

Preparation, amplification and sequencing of cDNA from cells and tissues (Figures 2 and S4)

The cDNA preparations from T cell, B cell and thrombocyte populations purified by flow cytometry from blood of CB chickens in Basel were described in previous publications [76], [77].

The cDNA preparations from macrophages were prepared from blood monocytes isolated from CB chickens in Munich, essentially as described [95]. Briefly, leucocytes from heparinised blood were separated by centrifugation through Ficoll-Paque. Cells at the interface were washed twice in PBS, adjusted to 2×10⁷ cells/ml in RPMI 1640 supplemented with 10% FBS, 100 U/ml penicillin and 100 µg/ml streptomycin, and then incubated on 90-mm cell culture Petri dishes at 5% CO₂ and 40°C. After 48 hours non-adherent cells were removed by vigorous washing. The remaining cells were over 98% positive for KUL01, a macrophage-specific monoclonal antibody [96], and were further incubated in the same medium under identical conditions with or without activation. Cells were either stimulated with LPS from E. coli (O127,B8; Sigma) at a final concentration of 10 µg/ml or with cell culture supernatant of chicken INFγ expressing COS cells [97] at a final dilution of 1∶500 or with a combination of both. After 24 hours cells were washed with PBS and harvested into peqGOLD TriFast (Peqlab, Erlangen, Germany) and RNA extracted according to the manufacturer's protocol. In order to confirm macrophage activation cell nitric oxide release into cell culture supernatant was quantified by Griess reaction [98].

All other cells and tissues were from C-B12 chickens in Compton, with total RNA isolated in TRIzol and cDNA made with Superscript III (Invitrogen). Bone marrow from untreated and chickens rendered anaemic with phenylhydrazine was isolated as described [78]. Dendritic cells were derived from bone marrow and grown in chicken IL4 and GM-CSF for 7 days as described [99]. Intestinal enterocytes from embryonic day 14–15 embryos were isolated from the PBS-45% Percoll interface as described [100], [101]. Lymphocyte depletion was achieved by daily intramuscular injection of 3 mg cyclophosphamide each of the first 4 days after hatching [102]. Thymus and bursa tissues were disrupted by passing through a nylon sieve, and lymphocytes separated from stromal cells by 70%–45% discontinuous Percoll gradient with a PBS overlay [97].

Amplifications in Copenhagen were performed using a GeneAmp PCR System 2700 thermal cycler (Applied Biosystems) using either the High Fidelity PCR Enzyme kit (Fermentas #K0192) or Long PCR Enzyme kit (Fermentas #K0182). Samples of cDNA (usually 1 µl of a standard prep) were amplified using the commercial buffers with Mg concentration optimised, with 20–30 nmol of each primer, 55°C for annealing and 1 min for extension. For the data in Figure 2, the primers used for most BG genes were LP F1 (CCA GWT TCR CCC TYC CCT GGA GGA C), LP F2 (CTC CTG CCT TAT CTC RTG GCT CTG CAC), TM R1 (GAC ARA TGA CCC AMC SAG AWK TGT G) and TM R2 (CAC AGC CAG AGC CAC YKT CCA G), used in all four forward and reverse primer combinations. For BG2, the primers used were LP F1 and LP F2 as above, 43A R1 (GACAAATGACCCAGCCAGAGGAATTATG) and 43 R2 (CACAGCCAGAGCCACCTTCCAAG). For BG0, the primers used were LP F1 as above, CTBG F2 (CTC CTG GCT TAC CTC GTG GCT CTC AAC), CTBG R1 (CGA ATG ACG CAA ACA AAA GTG TGA G) and CTBG R2 (CCA CAG CCA GAG CCA CCT TCC AGG), used in all four forward and reverse primer combinations. Amplicons were isolated from TBE agarose gels using QIAquick Gel Extraction kit (QIAGEN 28706) and cloned using TOPO TA cloning kit (Invitrogen). Plasmid minipreps with right insert size after Eco RI digestion were sequenced using Big Dye Terminator reagent and an ABI automatic sequencer (Applied Biosystems).

Reverse transcriptase-quantitative polymerase chain reaction (RT-qPCR) of cDNA from tissues (Figure S6)

Approximately 100 mg of spleen, liver, duodenum and bone marrow were collected from two 10 week old female C-B12 chickens from the Institute for Animal Health. Tissues were collected into 500 µl RNA-Later (Ambion) and stored at −20°C. RNA was isolated from ∼100 mg homogenised tissue using the Nucleospin RNA II kit (Machery-Nagel), and 1 µg RNA was converted to cDNA using oligo-dT primer and the Verso cDNA synthesis kit (Thermo Scientific), both according to manufacturer's instructions.

Forward and reverse primers for qPCR were TGTGCTGTGCAAGATGAT and TTCCAGGGATGGATGATG for BG4, TGTGTTGTGCAAGATGAC and GAAAAGCAATGATGACAA for BG7, TGTGCTGTGCAAGATGGT and ACGATCTGGGAAAAGGGG for BG10, and GGACGATCTGGGAAAAGA and TATGCAGAAGCTGTGGTGA for BG11. Targets were amplified over 40 cycles (initial enzyme activation 15 min 95°C, then 40 cycles of 15 s at 94°C, 30 s at 60°C and 30 s at 72°C) using 10 pmol each primer in AbsoluteBlue qPCR Mix (Thermo Scientific). Samples were compared to a 5-point standard curve, and normalised to cyclophilin A and a reference spleen sample. Fluorescence data were collected and analysed using an iCycler (BioRad) with subsequent analyses preformed in Microsoft Excel.

Further PCR, cloning and sequencing

Amplifications in Cambridge were performed using a DNA Engine Tetrad 2 Peltier thermocycler (BioRad) using three commercial kits with buffers supplied and according to manufacturer's instructions.

For the experiment in Figure S5B, fragments were amplified from BACs P1(26)F6, 34 and P1(186)B6 [52] using 2 ng DNA, 0.6 mM each of the primers uc74 (CTCCTGCCTTATCTCRTGGCTCTGCAC), and uc76 (CACAGCCAGAGCCACYKTCCAG), with 1 U Velocity polymerase (Bioline BIO-21098) in 1×GC rich buffer containing 2 mM MgCl₂ and 0.04 mM of each dNTP, with the annealing step being 2 min at 53°C.

For the experiment in Figure S5C, genomic DNA was isolated from erythrocytes using a salting-out procedure, as described [103]. Fragments were amplified from 2 ng line C-B12 genomic DNA using 0.2 mM each primer uc244 (F10: TTGGGGAAATAGTGTGACCG) with uc250 (R18: GGAGGGATCAGGAGGGAGC) or uc248 (R11: GGGGGGAAGAATTTAGGGAT) with 0.5 U recombinant Taq DNA polymerase (Invitrogen 10342020) in 1× Invitrogen PCR reaction buffer, 2 mM added MgCl₂ and 0.25 mM of each dNTP. After initial denaturation, there were 5 cycles of 0.75 min at 95°C, 0.5 min at 60°C and 1.5 min at 72°C, followed by 30 cycles of 0.75 min at 95°C, 1 sec at 60°C and 1.5 min at 72°C, followed by a final extension step.

For the experiment in Figure S8, fragments were amplified from 1 µl (concentration unknown) line N genomic DNA using 0.4 mM of each primer [Pair 1: uc511 (BG0_P1F, TGCCCAGGGATGATTGTGAGGCT) and uc512 (BG0_P1R, TGCAGAACTGGGTGAGTCGTTCC); Pair 2: uc513 (BG0_P2F, TGCCCAGGGATGATTGTGAGGC) and uc514 (BG0_P2R, TGCAGAACTGGGTGAGTCGTTCCT); Pair 3: uc515 (BG0_P3F, GCCCAGGGATGATTGTGAGGCT) and uc516 (BG0_P3R, GCAGAACTGGGTGAGTCGTTCCT)], with 20 U Phusion polymerase (New England Biolabs #M0530S) in 1× HiFi buffer (containing 1.5 mM MgCl₂), 1.5% polyethylene glycol and 0.04 mM of each dNTP. After initial denaturation, there were 5 cycles of 0.75 min at 95°C, 0.5 min at 59.8°C and 1.5 min at 72°C, followed by 30 cycles of 0.75 min at 95°C, 1 sec at 59.8°C and 1.5 min at 72°C, followed by a final extension step.

The amplicons were cloned using the CloneJET kit (Fermentas) following manufacturer's instructions, and sequenced using commercial fluorescent dye kits followed by capillary electrophoresis at the DNA Sequencing Facility of the Department of Biochemistry, University of Cambridge (www2.bioc.cam.ac.uk/~pflgroup/DNA_Facility).

Sequence data was analyzed using CLC DNA workbench (version 5.7.1, www.clcbio.com/). Alignments were performed using ClustalX (www.clustal.org/) and MAFFT (http://mafft.cbrc.jp/alignment/software/). Some phylogenetic trees were created using the neighbour joining (NJ) method implemented by ClustalX with 1000 bootstrap seeds. Other trees were created using a Bayesian approach implemented by MrBayes [104] (version 3.1.2, http://mrbayes.sourceforge.net/), with the GTR substitution model used with gamma-distributed rate variation across sites, and with the MCMC analysis using one chain, 20000 generations and sampling every 100 generations. Still other trees were created maximum parsimony (MP) phylogenies, using PAUP 4.0b10 for Unix [105] (www.paup.csit.fsu.edu/). AU and SH tests [106] on MP trees were performed in CONSEL version 0.2 (www.is.titech.ac.jp/~shimo/prog/consel/). Finally, other trees were created using neighbour network analysis implemented by SplitsTree4 [107], [108] (http://www.splitstree.org/), and tested for significance using a Phi test for recombination [109]. Phylogenetic trees were visualized using Dendroscope (http://ab.inf.uni-tuebingen.de/software/dendroscope/). After removal of gaps in the alignment of all 14 BG genes by G-blocks using default stringent parameters [110], [111], an automated partitioning analysis performed using SAGUARO [112] (www.sourceforge.net/projects/saguarogw/) gave cacti which were handled with PHYLIP version 1∶3.68-2 [113] from the Ubuntu repositories (www.launchpad.net/ubuntu/lucid/+source/phylip/1∶3.68-2). Dotplots were created with a wordsize of 150, using dottup from the EMBOSS package (version 6.1.0-5, http://emboss.sourceforge.net/). Helical wheels were created using pepwheel from the EMBOSS package (version 6.1.0-5, http://emboss.sourceforge.net/). Read-through exons were found using an in-house Visual BASIC script, and the ITIMs by an in-house PERL script, both written by J. Chattaway.

The 21 nucleotide repeats were grouped using an in-house rule based clustering algorithm (J. Chattaway). The clustering algorithm first created a distance matrix between the nucleotide sequences of all the 21 nucleotide repeats. Then, the algorithm created a list of 21 nucleotide repeats that were at least 80% identical to each repeat, and compared the lists and placed repeats that were similar to the same sets of repeats into the same group, again with an 80% identity threshold. The number of resulting groups was small enough to be checked manually; some groups were merged or split, and each group was given a number as a label, in order from the largest to the smallest group.

Fibre-FISH (Figures 3 and 10)

The cell lines IS2 (MHC haplotype B2), UG5 (B13), TG15 (B15), IS19 (B19), and TG21 (B21) are reticuloendotheliosis virus (REV) transformed cell lines [114], [115], [116]. Primary B12 splenocytes were isolated from the spleen of a line C-B12 chicken (IAH, Compton; now rebranded as The Pirbright Institute) and stimulated with concanavalin A (ConA, Sigma) for 48 h. All cells were cultured at 37°C, 5% CO₂ in RPMI 1640 supplemented with 10% FCS, L-glutamine and kanamycin (all from GIBCO/Invitrogen).

Fibre-FISH was performed at Sanger Institute as described [117], except the fibres were not treated with acetic acid and pepsin. Approximately 10 ng of each cosmid was used for amplification by the GenomePlex Whole Genome Amplification (WGA) 2 kit (Sigma) according to manufacturer's instructions, and probes created by amplification using a modified WGA3 kit (Sigma) [118], with cosmid c4.5 labelled with digoxigenin-11-dUTP (Roche); cG24 with biotin-16-dUTP (Roche), and cG43 with dinitrophenyl (DNP)-11-dUTP (Perkin-Elmer). Hybridisation was carried out as described [119] except that the probes were allowed to bind overnight. Detection and imaging was carried out as described [118] with the DNP-11-dUTP probe detected using a 1∶200 dilution of rabbit anti-DNP IgG and a 1∶200 dilution of AlexaFluor 488 donkey anti-rabbit IgG (both from Molecular Probes, part of Invitrogen).

Supporting Information

Zdroje

1. DoxiadisGGM, OttingN, de GrootNG, NoortR, BontropRE (2000) Unprecedented polymorphism of Mhc-DRB region configurations in rhesus macaques. J Immunol 164 : 3193–3199.

2. DoxiadisGGM, de GrootN, OttingN, BlokhuisJH, BontropRE (2011) Genomic plasticity of the MHC class I A region in rhesus macaques: extensive haplotype diversity at the population level as revealed by microsatellites. Immunogenetics 63 : 73–83.

3. JiangW, JohnsonC, JayaramanJ, SimecekN, NobleJ, et al. (2012) Copy number variation leads to considerable diversity for B but not A haplotypes of the human KIR genes encoding NK cell receptors. Genome Res 22 : 1845–1854.

4. LanierLL (2005) NK cell recognition. Annu Rev Immunol 23 : 225–274.

5. SturtevantAH (1925) The effects of unequal crossing over at the bar locus in Drosophila. Genetics 10 : 117–147.

6. HaldaneJBS (1933) The part played by recurrent mutation in evolution. Am Nat 67 : 5–19.

7. MüllerHJ (1935) The origination of chromatin deficiencies as minute deletions subject to insertion elsewhere. Genetica 17 : 237–252.

8. Huxley J (1942) Evolution: the modern synthesis. London: Allen and Unwin.

9. WalshJB (1987) Persistence of tandem arrays: implications for satellite and simple-sequence DNAs. Genetics 115 : 553–567.

10. OtaT, NeiM (1994) Divergent evolution and evolution by the birth-and-death process in the immunoglobulin VH gene family. Mol Biol Evol 11 : 469–482.

11. NeiM, GuX, SitnikovaT (1997) Evolution by the birth-and-death process in multigene families of the vertebrate immune system. Proc Natl Acad Sci U S A 94 : 7799–7806.

12. FreemanJL, PerryGH, FeukL, RedonR, McCarrollSA, et al. (2006) Copy number variation: new insights in genome diversity. Genome Res 16 : 949–961.

13. ForceA, LynchM, PickettFB, AmoresA, YanYL, PostlethwaitJ (1999) Preservation of duplicate genes by complementary, degenerative mutations. Genetics 151 : 1531–1545.

14. LynchM, ConeryJS (2000) The evolutionary fate and consequences of duplicate genes. Science 290 : 1151–1155.

15. LynchM, O'HelyM, WalshB, ForceA (2001) The probability of preservation of a newly arisen gene duplicate. Genetics 159 : 1789–1804.

16. RooneyAP, PiontkivskaH, NeiM (2002) Molecular evolution of the nontandemly repeated genes of the histone 3 multigene family. Mol Biol Evol 19 : 68–75.

17. NeiM, RooneyAP (2005) Concerted and birth-and-death evolution of multigene families. Annu Rev Genet 39 : 121–152.

18. FreemanJL, PerryGH, FeukL, RedonR, McCarrollSA, et al. (2006) Copy number variation: new insights in genome diversity. Genome Res 16 : 949–961.

19. HughesT, LiberlesDA (2007) The pattern of evolution of smaller-scale gene duplicates in mammalian genomes is more consistent with neo -⁠ than subfunctionalisation. J Mol Evol 65 : 574–588.

20. TraherneJA, MartinM, WardR, OhashiM, PellettF, et al. (2010) Mechanisms of copy number variation and hybrid gene formation in the KIR immune gene complex. Hum Mol Genet 19 : 737–751.

21. Eirín-LópezJM, RebordinosL, RooneyAP, RozasJ (2012) The birth-and-death evolution of multigene families revisited. Genome Dyn 7 : 170–196.

22. KatjuV, BergthorssonU (2013) Copy-number changes in evolution: rates, fitness effects and adaptive significance. Front Genet 4 : 273.

23. HenryJ, MillerMM, PontarottiP (1999) Structure and evolution of the extended B7 family. Immunol Today 20 : 285–288.

24. AfracheH, GouretP, AinoucheS, PontarottiP, OliveD (2012) The butyrophilin (BTN) gene family: from milk fat to the regulation of the immune response. Immunogenetics 64 : 781–794.

25. Abeler-DörnerL, SwamyM, WilliamsG, HaydayAC, BasA (2012) Butyrophilins: an emerging family of immune regulators. Trends Immunol 33 : 34–41.

26. ArnettHA, EscobarSS, VineyJL (2009) Regulation of costimulation in the era of butyrophilins. Cytokine 46 : 370–375.

27. NguyenT, LiuXK, ZhangY, DongC (2006) BTNL2, a butyrophilin-like molecule that functions to inhibit T cell activation. J Immunol 176 : 7354–7360.

28. ArnettHA, EscobarSS, Gonzalez-SuarezE, BudelskyAL, SteffenLA, et al. (2007) BTNL2, a butyrophilin/B7-like molecule, is a negative costimulatory molecule modulated in intestinal inflammation. J Immunol 178 : 1523–1533.

29. SmithIA, KnezevicBR, AmmannJU, RhodesDA, AwD, et al. (2010) BTN1A1, the mammary gland butyrophilin, and BTN2A2 are both inhibitors of T cell activation. J Immunol 184 : 3514–3525.

30. YamazakiT, GoyaI, GrafD, CraigS, Martin-OrozcoN, et al. (2010) A butyrophilin family member critically inhibits T cell activation. J Immunol 185 : 5907–5914.

31. BasA, SwamyM, Abeler-DörnerL, WilliamsG, PangDJ, et al. (2011) Butyrophilin-like 1 encodes an enterocyte protein that selectively regulates functional interactions with T lymphocytes. Proc Natl Acad Sci U S A 108 : 4376–4381.

32. HarlyC, GuillaumeY, NedellecS, PeignéC-M, MönkkönenH, et al. (2012) Key implication of CD277/butyrophilin-3 (BTN3A) in cellular stress sensing by a major human γδ T-cell subset. Blood 120 : 2269–2279.

33. PalakodetiA, SandstromA, SundaresanL, HarlyC, NedellecS, et al. (2012) The molecular basis for modulation of human Vγ9Vδ2 T cell responses by CD277/butyrophilin-3 (BTN3A)-specific antibodies. J Biol Chem 287 : 32780–32790.

34. VavassoriS, KumarA, WanGS, RamanjaneyuluGS, CavallariM, et al. (2013) Butyrophilin 3A1 binds phosphorylated antigens and stimulates human γδ T cells. Nat Immunol 14 : 908–916.

35. ValentonyteR, HampeJ, HuseK, RosenstielP, AlbrechtM, et al. (2005) Sarcoidosis is associated with a truncating splice site mutation in BTNL2. Nat Genet 37 : 357–364.

36. SzyldP, JagielloP, CsernokE, GrossWL, EpplenJT (2006) On the Wegener granulomatosis associated region on chromosome 6p21.3. BMC medical genetics 7 : 21.

37. SpagnoloP, SatoH, GruttersJC, RenzoniEA, MarshallSE, et al. (2007) Analysis of BTNL2 genetic polymorphisms in British and Dutch patients with sarcoidosis. Tissue Antigens 70 : 219–227.

38. SatoH, SpagnoloP, SilveiraL, WelshKI, du BoisRM, et al. (2007) BTNL2 allele associations with chronic beryllium disease in HLA-DPB1*Glu69-negative individuals. Tissue Antigens 70 : 480–486.

39. KonnoS, TakahashiD, HizawaN, HattoriT, TakahashiA, et al. (2009) Genetic impact of a butyrophilin-like 2 (BTNL2) gene variation on specific IgE responsiveness to Dermatophagoides farinae (Der f) in Japanese. Allergol Int 58 : 29–35.

40. VikenMK, BlomhoffA, OlssonM, AkselsenHE, PociotF, et al. (2009) Reproducible association with type 1 diabetes in the extended class I region of the major histocompatibility complex. Genes Immun 10 : 323–333.

41. PathanS, GowdyRE, CooneyR, BecklyJB, HancockL, et al. (2009) Confirmation of the novel association at the BTNL2 locus with ulcerative colitis. Tissue Antigens 74 : 322–329.

42. HsuehK-C, LinY-J, ChangJ-S, WanL, TsaiF-J (2010) BTNL2 gene polymorphisms may be associated with susceptibility to Kawasaki disease and formation of coronary artery lesions in Taiwanese children. Eur J Pediatr 169 : 713–719.

43. LianY, YueJ, HanM, LiuJ, LiuL (2010) Analysis of the association between BTNL2 polymorphism and tuberculosis in Chinese Han population. Infect Genet Evol 10 : 517–521.

44. YamadaY, NishidaT, IchiharaS, SawabeM, FukuN, et al. (2011) Association of a polymorphism of BTN2A1 with myocardial infarction in East Asian populations. Atherosclerosis 215 : 145–152.

45. OrozcoG, BartonA, EyreS, DingB, WorthingtonJ, et al. (2011) HLA-DPB1-COL11A2 and three additional xMHC loci are independently associated with RA in a UK cohort. Genes Immun 12 : 169–175.

46. HoribeH, KatoK, OguriM, YoshidaT, FujimakiT, et al. (2011) Association of a polymorphism of BTN2A1 with hypertension in Japanese individuals. Am J Hypertens 24 : 924–929.

47. YoshidaT, KatoK, HoribeH, OguriM, FukudaM, et al. (2011) Association of a genetic variant of BTN2A1 with chronic kidney disease in Japanese individuals. Nephrology 16 : 642–648.

48. HiramatsuM, OguriM, KatoK, YoshidaT, FujimakiT, et al. (2011) Association of a polymorphism of BTN2A1 with type 2 diabetes mellitus in Japanese individuals. Diabet Med 28 : 1381–1387.

49. OguriM, KatoK, YoshidaT, FujimakiT, HoribeH, et al. (2011) Association of a genetic variant of BTN2A1 with metabolic syndrome in East Asian populations. J Med Genet 48 : 787–792.

50. HeidHW, WinterS, BruderG, KeenanTW, JaraschED (1983) Butyrophilin, an apical plasma membrane-associated glycoprotein characteristic of lactating mammary glands of diverse species. Biochim Biophys Acta 728 : 228–238.

51. StammersM, RowenL, RhodesD, TrowsdaleJ, BeckS (2000) BTL-II: a polymorphic locus with homology to the butyrophilin gene family, located at the border of the major histocompatibility complex class II and class III regions in human and mouse. Immunogenetics 51 : 373–382.

52. RhodesDA, StammersM, MalcherekG, BeckS, TrowsdaleJ (2001) The cluster of BTN genes in the extended major histocompatibility complex. Genomics 71 : 351–362.

53. BoydenLM, LewisJM, BarbeeSD, BasA, GirardiM, et al. (2008) Skint1, the prototype of a newly identified immunoglobulin superfamily gene cluster, positively selects epidermal gammadelta T cells. Nat Genet 40 : 656–662.

54. BarbeeSD, WoodwardMJ, TurchinovichG, MentionJJ, LewisJM, et al. (2011) Skint-1 is a highly specific, unique selecting component for epidermal T cells. Proc Natl Acad Sci U S A 108 : 3330–3335.

55. TurchinovichG, HaydayAC (2011) Skint-1 identifies a common molecular mechanism for the development of interferon-γ-secreting versus interleukin-17-secreting γδ T cells. Immunity 35 : 59–68.

56. BrilesWE, McGibbonWH, IrwinMR (1950) On multiple alleles effecting cellular antigens in the chicken. Genetics 35 : 633–652.

57. SchiermanLW, NordskogAW (1961) Relationship of blood type to histocompatibility in chickens. Science 134 : 1008–1009.

58. VilhelmováM, MiggianoVC, PinkJR, HálaK, HartmanováJ (1977) Analysis of the alloimmune properties of a recombinant genotype in the major histocompatibility complex of the chicken. Eur J Immunol 7 : 674–679.

59. KaufmanJ, SkjødtK, SalomonsenJ (1991) The B-G multigene family of the chicken major histocompatibility complex. Crit Rev Immunol 11 : 113–143.

60. SalomonsenJ, SkjødtK, CroneM, SimonsenM (1987) The chicken erythrocyte-specific MHC antigen. Characterization and purification of the B-G antigen by monoclonal antibodies. Immunogenetics 25 : 373–382.

61. GotoR, MiyadaCG, YoungS, WallaceRB, AbplanalpH, et al. (1988) Isolation of a cDNA clone from the B-G subregion of the chicken histocompatibility (B) complex. Immunogenetics 27 : 102–109.

62. MillerMM, AbplanalpH, GotoR (1988) Genotyping chickens for the B-G subregion of the major histocompatibility complex using restriction fragment length polymorphisms. Immunogenetics 28 : 374–379.

63. MillerMM, GotoR, YoungS, LiuJ, HardyJ (1990) Antigens similar to major histocompatibility complex B-G are expressed in the intestinal epithelium in the chicken. Immunogenetics 32 : 45–50.

64. KaufmanJ, SalomonsenJ, SkjødtK, ThorpeD (1990) Size polymorphism of chicken major histocompatibility complex-encoded B-G molecules is due to length variation in the cytoplasmic heptad repeat region. Proc Natl Acad Sci U S A 87 : 8277–8281.

65. SalomonsenJ, DunonD, SkjødtK, ThorpeD, VainioO, et al. (1991) Chicken major histocompatibility complex-encoded B-G antigens are found on many cell types that are important for the immune system. Proc Natl Acad Sci U S A 88 : 1359–1363.

66. SalomonsenJ, ErikssonH, SkjødtK, LundgreenT, SimonsenM, et al. (1991) The “adjuvant effect” of the polymorphic B-G antigens of the chicken major histocompatibility complex analyzed using purified molecules incorporated in liposomes. Eur J Immunol 21 : 649–658.

67. Miller MM (1991) The Major Histocompatibility Complex of the Chicken. Phylogenesis of Immune Functions. Boca Raton FL: CRC Press. pp. 151–169.

68. Döhring C, Riegert P, Salomonsen J, Skjødt K, Kaufman J (1993) The extracellular Ig V-like regions of the polymorphic B-G antigens of the chicken Mhc lack structural features expected for antibody variable regions. Avian Immunology in Progress: Institut National de la Recherche Agronomique. pp. 145–152.

69. EllederD, StepanetsV, MelderDC, SeniglF, GerykJ, et al. (2005) The receptor for the subgroup C avian sarcoma and leukosis viruses, Tvc, is related to mammalian butyrophilins, members of the immunoglobulin superfamily. J Virol 79 : 10408–10419.

70. BikleDD, MunsonS, KomuvesL (1996) Zipper protein, a B-G protein with the ability to regulate actin/myosin 1 interactions in the intestinal brush border. J Biol Chem 271 : 9075–9083.

71. GotoRM, WangY, TaylorRL, WakenellPS, HosomichiK, et al. (2009) BG1 has a major role in MHC-linked resistance to malignant lymphoma in the chicken. Proc Natl Acad Sci U S A 106 : 16740–16745.

72. GuillemotF, BillaultA, PourquiéO, BéharG, ChausséAM, et al. (1988) A molecular map of the chicken major histocompatibility complex: the class II beta genes are closely linked to the class I genes and the nucleolar organizer. The EMBO journal 7 : 2775–2785.

73. KaufmanJ, MilneS, GöbelTW, WalkerBA, JacobJP, et al. (1999) The chicken B locus is a minimal essential major histocompatibility complex. Nature 401 : 923–925.

74. MillerMM, GotoR, BernotA, ZoorobR, AuffrayC, BumsteadN, BrilesWE (1994) Two Mhc class I and two Mhc class II genes map to the chicken Rfp-Y system outside the B complex. Proc Natl Acad Sci USA 91 : 4397–4401.

75. RubyT, Bed'HomB, WittzellH, MorinV, OudinA, et al. (2005) Characterisation of a cluster of TRIM-B30.2 genes in the chicken MHC B locus. Immunogenetics 57 : 116–128.

76. WallnyH-J, AvilaD, HuntLG, PowellTJ, RiegertP, et al. (2006) Peptide motifs of the single dominantly expressed class I molecule explain the striking MHC-determined response to Rous sarcoma virus in chickens. Proc Natl Acad Sci U S A 103 : 1434–1439.

77. ShawI, PowellTJ, MarstonDA, BakerK, van HaterenA, et al. (2007) Different evolutionary histories of the two classical class I genes BF1 and BF2 illustrate drift and selection within the stable MHC haplotypes of chickens. J Immunol 178 : 5744–5752.

78. KaufmanJ, SalomonsenJ, SkjødtK (1989) B-G cDNA clones have multiple small repeats and hybridize to both chicken MHC regions. Immunogenetics 30 : 440–451.

79. International Chicken Genome SequencingConsortium (2004) Sequence and comparative analysis of the chicken genome provide unique perspectives on vertebrate evolution. Nature 432 : 695–716.

80. MillerMM, GotoR, AbplanalpH (1984) Analysis of the B-G antigens of the chicken MHC by two-dimensional gel electrophoresis. Immunogenetics 20 : 373–385.

81. ShiinaT, BrilesWE, GotoRM, HosomichiK, YanagiyaK, et al. (2007) Extended gene map reveals tripartite motif, C-type lectin, and Ig superfamily type genes within a subregion of the chicken MHC-B affecting infectious disease. J Immunol 178 : 7162–7172.

82. MillerMM, GotoR, YoungS, ChirivellaJ, HawkeD, et al. (1991) Immunoglobulin variable-region-like domains of diverse sequence within the major histocompatibility complex of the chicken. Proc Natl Acad Sci U S A 88 : 4377–4381.

83. HosomichiK, MillerMM, GotoRM, WangY, SuzukiS, et al. (2008) Contribution of mutation, recombination, and gene conversion to chicken MHC-B haplotype diversity. J Immunol 181 : 3393–3399.

84. ShiinaT, OtaM, ShimizuS, KatsuyamaY, HashimotoN, et al. (2006) Rapid evolution of majorhistocompatibility complex class I genes in primates generates new disease alleles in humans via hitchhiking diversity. Genetics 173 : 1555–1570.

85. van Oosterhout (2009) A new theory of MHC evolution: beyond selection on the immune genes.Proc Biol Sci. 276 : 657–665.

86. AmadouC (1999) Evolution of the Mhc class I region: the framework hypothesis. Immunogenetics 49 : 362–367.

87. MillerMM, GotoR, BrilesWE (1988) Biochemical confirmation of recombination within the B-G subregion of the chicken major histocompatibility complex. Immunogenetics 27 : 127–132.

88. WilsonMJ, TorkarM, HaudeA, MilneS, JonesT, et al. (2000) Plasticity in the organization and sequences of human KIR/ILT gene families. Proc Natl Acad Sci U S A 97 : 4778–4783.

89. CanavezF, YoungNT, GuethleinLA, RajalingamR, KhakooSI, et al. (2001) Comparison of chimpanzee and human leukocyte Ig-like receptor genes reveals framework and rapidly evolving genes. J Immunol 167 : 5786–5794.

90. Abi-RachedL, ParhamP (2005) Natural selection drives recurrent formation of activating killer cell immunoglobulin-like receptor and Ly49 from inhibitory homologues. J Exp Med 201 : 1319–1332.

91. Maniatis T, Fritsch E, Sambrook J (1982) Molecular Cloning. A Laboratory Manual. Cold Spring Harbor Laboratory, Cold Spring Harbor, New York.

92. Sambrook J, Fritsch EF, Maniatis T (1989) Molecular Cloning: Vol-3: A Laboratory Manual: Appendixes: CSH Laboratory Press.

93. RiegertP, AndersenR, BumsteadN, DöhringC, Dominguez-SteglichM, et al. (1996) The chicken beta 2-microglobulin gene is located on a non-major histocompatibility complex microchromosome: a small, G+C-rich gene with X and Y boxes in the promoter. Proc Natl Acad Sci U S A 93 : 1243–1248.

94. SalomonsenJ, MarstonD, AvilaD, BumsteadN, JohanssonB, et al. (2003) The properties of the single chicken MHC classical class II alpha chain (B-LA) gene indicate an ancient origin for the DR/E-like isotype of class II molecules. Immunogenetics 55 : 605–614.

95. PeckR, MurthyKK, VainioO (1982) Expression of B-L (Ia-like) antigens on macrophages from chicken lymphoid organs. J Immunol 129 : 4–5.

96. MastJ, GoddeerisBM, PeetersK, VandesandeF, BerghmanLR (1998) Characterisation of chicken monocytes, macrophages and interdigitating cells by the monoclonal antibody KUL01. Vet Immunol Immunopathol 61 : 343–357.

97. WeiningKC, SchultzU, MünsterU, KaspersB, StaeheliP (1996) Biological properties of recombinant chicken interferon-gamma. Eur J Immunol 26 : 2440–2447.

98. DingM, ZhangM, WongJL, RogersNE, IgnarroLJ, et al. (1998) Antisense knockdown of inducible nitric oxide synthase inhibits induction of experimental autoimmune encephalomyelitis in SJL/J mice. J Immunol 160 : 2560–2564.

99. WuZ, RothwellL, YoungJR, KaufmanJ, ButterC, et al. (2010) Generation and characterization of chicken bone marrow-derived dendritic cells. Immunology 129 : 133–145.

100. GoodmanT, LefrançoisL (1988) Expression of the gamma-delta T-cell receptor on intestinal CD8+ intraepithelial lymphocytes. Nature 333 : 855–858.

101. SalomonsenJ, SørensenMR, MarstonDA, RogersSL, CollenT, et al. (2005) Two CD1 genes map to the chicken MHC, indicating that CD1 genes are ancient and likely to have been present in the primordial MHC. Proc Natl Acad Sci U S A 102 : 8668–8673.

102. LinnaTJ, FrommelD, GoodRA (1972) Effects of early cyclophosphamide treatment on the development of lymphoid organs and immunological functions in the chickens. Int Arch Allergy Appl Immunol 42 : 20–39.

103. KaufmanJ, AndersenR, AvilaD, EngbergJ, LambrisJ, et al. (1992) Different features of the MHC class I heterodimer have evolved at different rates. Chicken B-F and beta 2-microglobulin sequences reveal invariant surface residues. J Immunol 148 : 1532–1546.

104. RonquistF, HuelsenbeckJP (2003) MrBayes 3: Bayesian phylogenetic inference under mixed models. Bioinformatics 19 : 1572–1574.

105. Swofford DL (2003) PAUP*. Phylogenetic Analysis Using Parsimony (*and Other Methods). Version 4. Sinauer Associates, Sunderland, Massachusetts.

106. ShimodairaH, HasegawaM (2001) CONSEL: for assessing the confidence of phylogenetic tree selection. Bioinformatics 17 : 1246–1247.

107. HusonDH (1998) SplitsTree: analyzing and visualizing evolutionary data. Bioinformatics 14 : 68–73.

108. HusonDH, BryantD (2006) Application of Phylogenetic Networks in Evolutionary Studies. Mol Biol Evol 23 : 254–267.

109. BruenT, PhillipeH, BryantD (2006) A quick and robust statistical test to detect the presence of recombination. Genetics 172 : 2665–2681.

110. CastresanaJ (2000) Selection of conserved blocks from multiple alignments for their use in phylogenetic analysis. Molec Biol Evol 17 : 540–552.

111. TalaveraG, CastresanaJ (2007) Improvement of phylogenies after removing divergent and ambiguously aligned blocks from protein sequence alignments. Systemat Biol 56 : 564–577.

112. ZamaniN, RussellP, LantzH, HoeppnerMP, MeadowsJRS, et al. (2013) Unsupervised genome-wide recognition of local relationship patterns. BMC genomics 14 : 347.

113. Felsenstein J (2005) PHYLIP (Phylogeny Inference Package) version 3.6. Distributed by the author. Department of Genome Sciences, University of Washington, Seattle

114. MarmorMD, BenatarT, RatcliffeMJ (1993) Retroviral transformation in vitro of chicken T cells expressing either alpha/beta or gamma/delta T cell receptors by reticuloendotheliosis virus strain T. J Exp Med 177 : 647–656.

115. MaccubbinD, SchiermanL (1986) MHC-restricted cytotoxic response of chicken T cells: expression, augmentation, and clonal characterization. J Immunol 136 : 12–16.

116. Walker BA, Hunt LG, Sowa AK, Skjødt K, Göbel TW, et al The dominantly expressed class I molecule of the chicken MHC is explained by coevolution with the polymorphic peptide transporter (TAP) genes. Proc Natl Acad Sci U S A 108 : 8396–8401.

117. KorbelJO, UrbanAE, AffourtitJP, GodwinB, GrubertF, et al. (2007) Paired-End Mapping Reveals Extensive Structural Variation in the Human Genome. Science 318 : 420–426.

118. PerryGH, YangF, Marques-BonetT, MurphyC, FitzgeraldT, et al. (2008) Copy number variation and evolution in humans and chimpanzees. Genome Res 18 : 1698–1710.

119. YangF, MüllerS, JustR, Ferguson-SmithMA, WienbergJ (1997) Comparative chromosome painting in mammals: human and the Indian muntjac (Muntiacus muntjak vaginalis). Genomics 39 : 396–401.