Physical and Radiation Hybrid Mapping Reveal Gene Order and Orientation | |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| |
Abstract |
|---|
|
|
|---|
The human surfactant protein (SP) A locus has been assigned to chromosome 10q22-q23 and consists of
two very similar genes, SP-A1 and SP-A2, as well as a truncated pseudogene. SP-A belongs to the family
of collagenous C-type lectins along with mannose binding protein (MBP) and SP-D, both of which have
also been mapped to the long arm of chromosome 10. In this article we report the relative location and orientation of each of the SP-A and SP-D genomic sequences. Characterization of two overlapping genomic clones revealed that the SP-A pseudogene lies in a reverse orientation 15 kb away from the 5' side of SP-A1. This finding was verified by the amplification of the entire SP-A pseudogene/SP-A1 intergenic region
using long-range polymerase chain reaction. The relative location of SP-A2 and SP-D was then ascertained by testing a number of sequence tagged sites against the Stanford TNG3 and G3 radiation hybrid
panels. The radiation hybrid mapping data showed that both SP-A2 and SP-D are on the 5' side of SP-A1
at approximate distances of 40 kb and 120 kb, respectively. The SP-A and SP-D loci were also oriented relative to the centromere, with the overall order being: centromere
SP-D
SP-A2
pseudogene
SP-A1
telomere.
| |
Introduction |
|---|
|
|
|---|
Pulmonary surfactant is a developmentally and hormonally regulated lipoprotein complex synthesized and secreted by alveolar type II cells. Surfactant is essential for normal lung function in that it prevents alveolar collapse at low lung volumes (i.e., expiration). A deficiency of pulmonary surfactant in prematurely born infants leads to respiratory distress syndrome, a significant cause of neonatal morbidity.
Pulmonary surfactant contains four nonserum proteins: surfactant protein (SP) A, B, C, and D. SP-A, the most abundant surfactant protein, is a highly modified 30 to 35 kD sialyglycoprotein (1, 2) that contains both collagenous (3) and carbohydrate binding domains (4, 5). The presence of these domains places SP-A in the family of collagenous C-type lectins along with SP-D (6) and mannose binding protein (MBP) (7). SP-A has been shown to be involved in the metabolism (8), structure (9), and function (10) of pulmonary surfactant, whereas the role of SP-D in surfactant physiology remains less defined. Both SP-A and SP-D appear to participate in local host defense and inflammatory processes of the lung (11), which is in keeping with their classification as C-type lectins (12).
The human SP-A locus consists of two very similar but nonidentical genes, SP-A1 (13) and SP-A2 (14), each approximately 5 kb in length. This is in contrast to mice (15), rats (16), rabbits (17), and dogs (18), where SP-A exists as a single-copy gene. It is thought that two SP-A1 gene products and one SP-A2 gene product associate through their collagenous domains to form heterotrimers (19). The human SP-A locus also contains a truncated and nonfunctional pseudogene (20) that shares 85% similarity with the SP-A 3' untranslated region (3'UTR). The human SP-A locus has been assigned to chromosome 10q22-q23 (21, 22) near the loci for SP-D (6, 22) and MBP (7). The clustering of SP-A1, SP-A2, SP-D, and MBP to the same cytogenetic region suggests that this area of chromosome 10 is involved in the evolution of collagenous C-type lectins (6, 7, 22). Recently, it has been found that SP-A1 and SP-A2 are in linkage disequilibrium, indicating close physical association (23).
The regulation of SP-A gene expression is under developmental and hormonal control and in the human appears to be quite complex (24). A number of 5' splicing variants and alleles have been described for each gene (23, 29- 31) that may explain the wide variation in SP-A mRNA levels seen among humans (32, 33). Moreover, there is evidence that the two SP-A genes are regulated differentially (34; our unpublished observations). Toward a more complete understanding of the mechanisms by which SP-A1 and SP-A2 are differentially and/or coordinately regulated, we sought to better characterize the human SP-A locus.
In this study, we used a combination of physical mapping, radiation hybrid mapping (at both medium and high resolutions), and long-range polymerase chain reaction (PCR) to determine the organization of the human SP-A locus. Radiation hybrid mapping was also used to determine the relative location of the SP-D locus and to orient both the SP-A and SP-D loci with respect to the centromere.
| |
Materials and Methods |
|---|
|
|
|---|
Genomic Clones
A bacteriophage EMBL3 human genomic library had been previously (35) constructed by subcloning partially Sau3A digested HeLa genomic DNA into the SalI site of the EMBL3 vector. This library was screened with a 750-bp BamHI/EcoRI fragment of human SP-A1 cDNA. Two positive EMBL3 clones, each with insert sizes of approximately 16 kb, were characterized in this study: H7 and H3. P1 and PAC (36) human genomic libraries were screened by Genome Systems, Inc. (St. Louis, MO) using PCR with SP-A-specific primers supplied by our laboratory. One SP-A1-positive clone (P1 267-5F) and one SP-A2-positive clone (PAC 307-L19) with insert sizes of 40 kb and 80 kb, respectively, were identified in this manner. P1 and PAC clones are designated according to clone addresses as suggested by the supplier. Primers generated from the ends of P1 267-5F were used to rescreen the PAC human genomic library (Genome Systems, Inc.) in order to perform a genomic walk from the 3' side of SP-A1. This process was repeated several times with each new genomic clone obtained from the walk. In doing so, PAC clones 144-5O, 23-I13, and 18-N24 were obtained; together these clones form a contig that spans approximately 250 to 300 kb on the 3' side of SP-A1.
Restriction Digests and Southern Hybridizations
Approximately 1 to 2 µg of cloned DNA was digested with 1 to 5 U of EcoRI, HindIII, or SacI (New England Biolabs, Beverly, MA). Digestion products were subjected to electrophoresis through a 1% agarose gel and transferred under vacuum (Pharmacia LKB, Uppsala, Sweden) onto Genescreen (NEN, Boston, MA) nylon membrane in 1 N NaOH after depurination in 0.1 N HCl. Membranes were then baked at 80°C under vacuum. Southern hybridizations were performed using probes that were either PCR products specific for the SP-A 3'UTR or purified restriction fragments. Probes were radiolabeled with [32P]dCTP using random primer labeling (Prime-it II kit; Stratagene, La Jolla, CA). Approximately 5 × 105 to 1 × 106 cpm of probe per milliliter of hybridization solution were used in Southern hybridizations. Prehybridization was performed at 42°C in 1 ml/10 cm2 of 90% (vol/vol) Hybrisol I (Oncor, Gaithersburg, MD), 5× Denhardt's solution (0.01% each of bovine serum albumin [BSA], Ficoll, and polyvinylpyrrolidone), and 2 µg/ml denatured fish sperm DNA. Probe was added in combination with 2 µg/ml denatured fish sperm DNA to the prehybridization solution. After 14 to 18 h of hybridization at 42°C, the membranes were washed twice in 1× SSC (0.15 mM NaCl, 0.015 mM sodium citrate [pH 7.5])/0.5% sodium dodecyl sulfate (SDS) at 25°C for 20 min and then once each in 0.5× SSC/0.5% SDS at 50°C and then at 65°C for 20 min. Membranes were then exposed to X-ray film (Kodak, Rochester, NY).
Sequencing of Genomic Clones
In order to confirm the identity of sequences within our genomic clones, to develop long-range PCR primers, and to generate primers used in amplifying sequence tagged sites (STSs) for radiation hybrid mapping (see the following), the EMBL3, P1, and PAC clones were partially sequenced. Sequencing primers were derived from the published sequences for SP-A1 (13), SP-A2 (14), and the SP-A pseudogene (20) or from vector sequences. Primers MBL (5'-GCTTATCTGCTTCTCATAGAGTCTTGC-3') and MBR (5'-ATAACGATCATATACATGGTTCTCTCC-3') are specific for the left and right arms of the EMBL3 vector, respectively, and were used to sequence the insert ends of clones H3 and H7. T7 and SP6 primers were used in the sequencing of insert ends of the P1 and PAC clones. Sequencing of the EMBL3 genomic clones was performed using the f-mol cycle sequencing kit (Promega, Madison, WI) and consisted of 35 cycles with denaturation at 94°C for 30 s, annealing at 42°C for 30 s, and extension at 72°C for 1 min. P1 and PAC DNA was precipitated with polyethylene glycol in 1.5 M NaCl and resuspended in H2O before sequencing. Sequencing reactions of the P1 and PAC clones consisted of 35 cycles of denaturation at 94°C for 30 s followed by annealing at 50°C for 30 s and extension at 72°C for 1 min. All sequencing reactions were run on 6% PAGE/7 M urea gels that were dried under vacuum and exposed to X-ray film (Kodak). Primers to be used in long-range PCR and radiation hybrid mapping were designed using the software OLIGO v4.0 (National Biosciences Inc., Plymouth, MN).
Long-range PCR
Primers used in long-range PCR are described in Table 1. The protocol for long-range PCR was modified from that previously described (37). Briefly, the reaction mixture contained 20 mM Tris-HCl (pH 8.6), 150 µg/ml BSA, 16 mM (NH4)2SO4, 3.5 mM MgCl2, 250 µM of each dNTP, 30 mM Tris base, 8% glycerol, 100 to 200 ng of template DNA, and 100 ng of each primer. After an initial denaturation of 1 min, 2.5 U of AmpliTaq (Applied Biosystems, Foster City, CA) combined with 0.15 U of Pfu DNA polymerase (Stratagene) were added. All long-range PCRs were performed using thin-walled tubes (USA/Scientific Plastics, Ocala, FL) in a total volume of 100 µl. Cycle conditions depended on the primers used, but typically reactions consisted of 30 to 35 cycles of an 8-s denaturation at 94°C, a 30-s annealing at 58 to 62°C, and a 4- to 16-min extension at 68°C.
|
Radiation Hybrid Mapping
Radiation hybrid (RH) mapping is a technique used to construct high-resolution genomic maps and is described in detail by Cox and colleagues (38). RH mapping was performed using the whole-genome Stanford TNG3 (high resolution) and G3 (medium resolution) radiation hybrid panels, both of which were purchased from Research Genetics (Huntsville, AL). A number of STSs were tested against the 90 human/hamster hybrid DNAs of the TNG3 panel and the 83 human/hamster hybrid DNAs of the G3 panel. PCR amplifications were carried out with sets of primers that amplify STSs identified by sequencing the ends of our genomic clones and/or were known to span the SP-A and SP-D loci. Information on the STSs amplified in radiation hybrid mapping is shown in Table 2. We also used two sets of primers that amplify expressed sequence tags (ESTs), WI-6610 and WI-11484, that have been placed on the Whitehead Institute/MIT Center for Genome Reseach (WICGR) low-resolution (Genebridge 4 panel) radiation hybrid map. Mapping data for these ESTs are available from WICGR via the World Wide Web at http://www-genome.wi.mit.edu. PCRs were performed in 96-well plates using a TwinBlock easycycler (ERICOMP, Inc., San Diego, CA) in 25 µl of 1.5 mM MgCl2, 0.05 mM of each dNTP, and 1× PCR buffer (10 mM Tris-HCl [pH 8.3], 50 mM KCl), 50 ng of each primer, and 2.5 U of AmpliTaq DNA polymerase (Applied Biosystems). Cycle conditions depended on the primers used, but generally 30 to 35 cycles of a 30-s denaturation at 94°C, a 30-s annealing at 56 to 60°C, and a 30-s extension at 72°C were used. PCRs were performed in duplicate. The scoring of clones as positive, negative, or ambiguous for an STS was accomplished by visualization of PCR products on ethidium-stained agarose gels. Two-point and multipoint analyses of the radiation hybrid data were performed using the statistical software package RHMAP v3.0 (39), obtained via the World Wide Web at http://www.sph.umich.edu/group/statgen/software. This software package consists of three statistical programs: RH2PT, RHMAXLIK, and RHMINBRK.
|
|
| |
Results |
|---|
|
|
|---|
Analysis of Two Overlapping Genomic Clones Reveals Close Proximity of the SP-A Pseudogene to SP-A1
Initial characterization of the EMBL3 genomic clones revealed that H7 contained the entire SP-A pseudogene and about 14 kb of flanking sequences. H3 was found to contain only the 5' half of SP-A1 (approximately bp 1-1800) in addition to 13 kb of 5' flanking sequence. Among the fragments produced by digestion of H3 and H7 with EcoRI are two that are common to both clones: a 1.2-kb and a 7.4-kb fragment (Figure 1A, lanes 2 and 3). Likewise, digestion with HindIII yielded 1.8-kb, 2.2-kb, and 3.5-kb fragments from both clones (not shown). These fragments do not come from the EMBL3 vector; thus, it appeared that there was an overlap between H7 and H3. In order to verify this overlap, the 1.2-kb EcoRI fragment from H3 was isolated, radiolabeled, and then used as a probe in Southern hybridizations. Figure 1B shows that the 1.2-kb EcoRI fragment from H3 hybridizes to the same size EcoRI fragment of H7 (lane 2), supporting an overlap between the clones. An overlap between H3 and H7 indicates that the SP-A pseudogene lies upstream of SP-A1 because H3 only contains the 5' half of SP-A1. An overlap between H7 and the clone P1 267-5F (which contains the entire SP-A1 gene) was also found, supporting close physical proximity of the SP-A pseudogene and SP-A1.
|
Orientation of the SP-A Pseudogene Relative to SP-A1
The enzyme SacI cuts within the SP-A pseudogene once at bp 646 (20) and digestion of clone H7 with SacI yields two fragments with approximate sizes of 30 kb and 15 kb (Figure 1A, lane 1). SacI does not cut within the EMBL3 left or right vector arms, which are 20 kb and 9 kb in length, respectively. Thus, the overlap between H7 and H3 must reside within the 30-kb SacI fragment (because this is the only SacI fragment large enough to contain the 1.2-kb and 7.4-kb EcoRI fragments as well as an EMBL3 vector arm). This is confirmed by the hybridization of the 1.2-kb EcoRI fragment from H3 to the 30-kb SacI fragment of H7 (Figure 1B, lane 1). A PCR-generated SP-A 3'UTR fragment derived from sequences downstream of the pseudogene SacI site served as probe in the Southern hybridization shown in Figure 1C. The SP-A 3'UTR probe is shown to hybridize to a 2.4-kb EcoRI fragment in Figure 1C (lane 2); this fragment corresponds to the majority of the SP-A pseudogene in addition to 3' flanking sequences. There is no hybridization of this probe to the H3 digests in Figure 1C (lane 3) because this clone only contains the 5' half of SP-A1 (no 3'UTR sequences). The 3'UTR probe hybridizes to the 15-kb SacI fragment of H7 (Figure 1C, lane 1) and not to the 30-kb SacI fragment, indicating that the SP-A pseudogene is in a reverse orientation relative to SP-A1. A partial restriction map of the SP-A pseudogene and SP-A1 intergenic region (IGR) is shown in Figure 2.
|
Long-range PCR Amplification of the SP-A Pseudogene/SP-A1 IGR
In order to verify the results obtained from the analysis of the EMBL3 clones, we attempted to amplify the IGR between the SP-A pseudogene and SP-A1 from genomic DNA using long-range PCR. Primers (Figure 2 and Table ) were generated as described in MATERIALS AND METHODS. Long-range PCR using the primer pair 413/291 resulted in a 3.2-kb band from both genomic DNA (Figure 3A, lane 1) and H7 (not shown). Long-range PCR using the primer pair 416/262 resulted in a 3.5-kb band in both genomic DNA (Figure 3A, lane 2) and clone H3 (not shown). Taken together, these PCR amplifications verify the overlap between clones H3 and H7. The entire IGR between the SP-A pseudogene and SP-A1 was then amplified using the primer pair 262/68 (Figure 3A, lane 3), which resulted in a 16-kb band, 14 kb of which comes from the SP-A pseudogene/SP-A1 IGR. We were also able to amplify the IGR using the primer pair 262/291 (not shown). These long-range PCR amplifications confirm not only the distance between the SP-A pseudogene and SP-A1, but also their relative orientation. In order for the 16-kb PCR fragment to be produced from genomic DNA, the SP-A pseudogene must be in an opposite orientation to that of SP-A1. EcoRI digestion of 262/68 long-range PCR products amplified in different reactions from two unrelated individuals is shown in Figure 3B, lanes 1 through 5. These digestions yielded the expected 1.2-kb and 7.4-kb fragments in addition to a 4.5-kb fragment from the SP-A1 end of the 262/68 PCR product, a 2.6-kb fragment from the 5' side of the SP-A pseudogene, and a 0.5-kb fragment from the pseudogene end of the 262/68 PCR product. These digestions confirm the identity of the 262/68 long-range PCR product as being the SP-A pseudogene/SP-A1 IGR.
|
High-resolution Radiation Hybrid Mapping of the SP-A and SP-D Loci
We tested a number of STSs against the high-resolution
TNG3 radiation hybrid panel. A description of these STSs
is shown in Table . The mean rate of retention (i.e., the
percentage of radiation hybrid clones found to be positive)
for the STSs in this panel was approximately 20%. Two-point analysis of the TNG3 radiation hybrid data was performed using the program RH2PT and is presented in
Table 3. Distances are represented as centirays 50,000 (cR50,000). Using the previously established relative location and orientation of the SP-A pseudogene and SP-A1
as an anchor, the two-point analysis shows that SP-A2 is
on the 5' side of SP-A1, with the SP-A pseudogene located
between the two functional genes. SP-D also appears to be
on the 5' side of SP-A1 and is closer to SP-A2 than to the SP-A pseudogene or SP-A1. Multipoint analysis was performed using the program RHMAXLIK and supports the
two-point analysis in that the most likely order was found
to be: SP-D
SP-A2-pseudogene
SP-A1 (1,183 times
more likely than any other order tested when using an
equal retention probability model).
|
A direct comparison of cR50,000 to physical distance can be made because the distance in kilobases has already been established between the SP-A pseudogene and SP-A1 (about 17 kb between the two STSs used). From the distances in centirays 50,000 shown in Table , we obtain a value of 1 cR50,000 ~ 1.5 kb. Assuming that this correlation holds true for the entire chromosome, we can estimate the distances in kilobases between the rest of the STSs shown in Table . Using the correlation of 1 cR50,000 ~ 1.5 kb, it appears that SP-A2 and SP-D are approximately 40 kb and 120 kb away from SP-A1, respectively. During the characterization of our genomic clones, PAC 307-L19 was identified as containing the entire SP-A2 gene, 2 kb of 5' flanking, and approximately 80 kb of 3' flanking sequences. This clone does not contain any SP-A1, SP-A pseudogene, or SP-D sequences nor does it overlap with any of the genomic clones mentioned previously. Taking these observations into account, SP-A2 must lie in a transcriptionally opposite direction than SP-A1 in order for these two genes to be no more than 40 kb apart.
Support for the relative location of SP-A2 and SP-D also comes from the contig (described in MATERIALS AND METHODS) generated by genomic walking on the 3' side of SP-A1. This PAC contig spans approximately 250 to 300 kb and contains no SP-A2 or SP-D sequences, indicating that these genes are on the 5' side of SP-A1.
Orientation of the SP-A and SP-D Loci Relative to the Centromere
We were able to orient the SP-A and SP-D loci with respect to the centromere by testing several STSs against the G3 medium-resolution hybrid panel. A description of these STSs is shown in Table . The mean rates of retention for these STSs in the G3 panel were dichotomous: WI-6610, SP-D, SP-A1, pseudogene, and SP-A2 had a retention rate of about 10%, and 530/531 and WI-11484 had a retention rate of about 24%. Results of the subsequent two-point analysis for the radiation hybrid data are presented in Table 4. Distances are represented as centirays 10,000 and data for markers found to have significant linkage (LOD score > 3) are shown. We cannot determine the relative order of the members of the SP-A locus because of the lower resolution in this radiation hybrid panel. However, the data support the relative locations of SP-A2 and SP-D as determined by the TNG3 high-resolution panel (i.e., SP-D is further away from SP-A1 than from SP-A2).
|
We can also conclude from the results in Table that
the SP-D locus is the closest to the EST WI-6610. WI-6610
has been placed on the WICGR low-resolution (Genebridge 4) radiation hybrid map at a distance of 457.75 cR3,000 from the top of the chromosome 10 linkage map.
This EST is closer to the centromere than the SP-A locus
(identified on the WICGR map as WI-7219 and located at
462.46 cR3,000 from the top of the chromosome 10 linkage
map). Thus, it appears that SP-D is closer to the centromere
than the SP-A locus and the overall order is: centromere
SP-D
SP-A2
pseudogene
SP-A1
telomere. This orientation with respect to the centromere is supported by the linkage (LOD > 3) of an STS (530/531) found approximately 300 kb from the 3' side of SP-A1 and the EST WI-11484. WI-11484 has been mapped distal to SP-A at 467.61 cR3,000 from the top of the chromosome 10 linkage group.
The overall organization of the SP-A and SP-D loci at low,
medium, and high resolutions is displayed in Figure 4.
| |
Discussion |
|---|
|
|
|---|
Pulmonary surfactant consists of four proteins: SP-A, SP-B, SP-C, and SP-D. Both SP-A and SP-D are collagenous C-type lectins that are involved in local host defense and inflammatory processes of the lungs (reviewed in reference 11). The human SP-A locus consists of two functional genes, SP-A1 and SP-A2 (14), and a pseudogene (20). The human SP-A and SP-D loci have been localized to 10q22-q23 by both in situ hybridization (21) and somatic cell hybrid analysis (6, 22). In addition, the SP-A locus has been placed on the WICGR radiation hybrid map at 462.64 cR3,000 from the top of the chromosome 10 linkage group (there is no information regarding the location of SP-D on this map). These mapping techniques, however, lack the resolving power to determine the exact order and distance of the members of these two loci. In the present study, we mapped the SP-A and SP-D loci at a high resolution and ordered these loci relative to the centromere. This was accomplished by using not only conventional physical mapping techniques (i.e., the analysis of overlapping clones) but newer ones such as long-range PCR and high-resolution (at around 10 kb) radiation hybrid mapping.
Radiation hybrid mapping (38) is a technique that uses
X-rays of a specific dose (10,000 rads for the G3 panel and
50,000 rads for the TNG3 panel) to randomly fragment the
chromosomes of a human donor cell line. The higher the
dose of ionizing radiation, the smaller the fragments produced. The donor cell line is fused to a recipient cell line
(generally TK
CHO cells), and hybrid clones are selected
and tested for the presence or absence of STSs. The closer
two STSs are together, the greater the likelihood that they
will be found in the same hybrid clone. Statistical analysis
using the software package RHMAP v3.0 (39) can then be
used to determine the best estimate of physical distance,
which is expressed in centirays specific for the X-ray dose
used to construct the radiation hybid panel. In this study,
we used both medium- and high-resolution radiation hybrid panels (G3 and TNG3, respectively) to order the
members of the SP-A and SP-D loci.
Once the relative order and orientation of the SP-A pseudogene and SP-A1 were established through physical mapping and long-range PCR, attempts were made to determine the relative location of SP-A2 by chromosomal walking. However, these attempts were undermined by the failure to detect any SP-A2 sequences 3' of SP-A1 (up to a distance of about 250 to 300 kb) and the inability to obtain true overlapping clones of H7 (on the 3' side of the SP-A pseudogene). Analysis of two SP-A-positive yeast artificial chromosomes (YACs) indicated the presence of SP-A1 and SP-A2, but not the SP-A pseudogene. This initially led us to believe that SP-A2 was on the 3' side of SP-A1 (40). However, we have subsequently concluded that either one or both YACs may contain internal deletions. Therefore, in the present study, we show the utility of high-resolution radiation mapping to determine gene order, especially when overlapping or high molecular weight clones (i.e., P1 artificial chromosomes [PACs] and YACs) are unavailable or suffer from deletions and/or chimerisms.
A direct comparison can be made between centirays 50,000 and kilobases because some of the STSs used in the high-resolution radiation hybrid mapping had also been physically mapped (namely, the SP-A pseudogene and SP-A1). We are able to estimate that 1 cR50,000 approximates 1.5 kb. Using this correlation, we can estimate that SP-A1 and SP-A2 (which are 26.4 cR50,000 apart; Table ) are separated by about 40 kb. Similarly, SP-A1 and SP-D are estimated to be 120 kb apart, supporting earlier findings that placed the two loci within 430 kb (22). These conclusions are based on the assumption that the correlation between centirays 50,000 and kilobases remains constant throughout the genome. In fact, when chromosome 2 STSs are tested against the TNG3 panel, we do get a similar correlation of 1 cR50,000 ~ 1.4 kb (41). Furthermore, previous studies using lower-resolution radiation hybrid panels have shown a relatively constant relationship between centirays and physical distance (38), as in the case with the G3 panel, where 1 cR10,000 ~ 30 kb. Two-point analysis of the TNG3 radiation hybrid data showed that the distance between SP-A1 and SP-A2 is 26.4 cR50,000. However, when this distance is taken as the sum of the distance between SP-A2 and the pseudogene (12.6 cR50,000) plus the distance between the pseudogene and SP-A1 (11.3 cR50,000), it becomes 23.9 cR50,000. Such inconsistencies have been reported before (38) and do not appear to change our general conclusions.
The orientation of SP-A2 is found to be opposite to
that of SP-A1, i.e., the two genes are in a "head to head"
arrangement. This conclusion is made based on the analysis of an SP-A2-positive genomic clone (PAC 307-L19)
that contains the SP-A2 gene, 2 kb of 5' flanking sequence,
and about 80 kb of 3' flanking sequence. This clone does
not, however, contain any SP-A1, SP-A pseudogene, or
SP-D sequences. In order for SP-A1 and SP-A2 to be 40 kb (26.4 cR50,000; Table ) apart without PAC 307-L19 containing any SP-A1 sequences, the two genes must be in opposite transcriptional orientations. Of course, this conclusion is also based on the assumption that PAC 307-L19 is
not chimeric. Although chimerism rates in P1 and PAC
(36) clones are far lower than in YACs, the possibility still
exists. A "head to head" arrangement of SP-A1 and SP-A2 raises the possibility that the two genes may share cis-acting regulatory elements. Characterization of the loci for
other "head to head" genes has yielded information on
shared regulatory regions; this is particularly true for the
1 and
2 chains of type IV collagen, two genes whose products, like SP-A1 and SP-A2, form heterotrimers (42).
To orient the SP-A and SP-D loci relative to the centromere, we used the medium-resolution G3 radiation hybrid panel. The combined use of medium- and high-resolution radiation hybrid mapping produces the overall map:
centromere
SP-D
SP-A2
pseudogene
SP-A1
telomere. Although we are unable to determine the relative
order of the members of the SP-A locus with the G3 panel
alone, we can conclude that SP-D is closer to SP-A2 than it
is to SP-A1. The distance in centirays between SP-D and
SP-A1 is 1.4 times the distance between SP-D and SP-A2
in both the medium- and high-resolution panels. This similarity in distance proportions between the two panels supports the validity of our map. However, the physical distance between STSs suggested by using the G3 panel are
greater than those of the TNG3. For example, assuming 1 cR10,000 ~ 30 kb, SP-A1 and SP-D are about 640 kb apart
and SP-A1 and the SP-A pseudogene are 200 kb apart; the
latter value is incorrect as shown by physical mapping. The
expansion of physical distance when using the G3 panel
could be a result of the relatively low rates of retention for
the SP-A and SP-D loci (compare 10% in the G3 panel to
a mean of 20% in the TNG3 panel).
Although the SP-A and SP-D loci have been previously mapped close to the locus for MBP (22), we did not find significant evidence for linkage between MBP and any member of the SP-A or SP-D loci, even when using the G3 medium-resolution panel. In fact, there was no linkage found between MBP or either of the ESTs, WI-6610 and WI-11484. Analysis of the radiation hybrid data for the MBP STS using the Stanford Radiation Hybrid Mapping Server (http://shgc-www.stanford.edu) localized MBP to within 42.3 cR10,000 (about 1,200 kb) of a genetic marker (D103567) that maps 79 centimorgans (cM) from the top of the chromosome 10 linkage group (the SP-A locus would be found between 106 cM and 119 cM; see Figure 4, low resolution). These findings suggest that the MBP and SP-D/SP-A loci are separated by a rather large distance (about 25 to 35 cM or 25,000 to 35,000 kb).
The SP-A genomic sequences seem to have arisen by a process of gene duplication. Sequence analysis of the three SP-A genomic sequences suggests that an ancestral SP-A sequence duplicated, giving rise to SP-A1 and SP-A2. Subsequent to this event, another duplication produced the SP-A pseudogene from SP-A2 (45). In this scenario, the SP-A pseudogene is closer in evolution to SP-A2 than it is to SP-A1. The fact that SP-A2 and the SP-A pseudogene are in the same relative orientation suggests that the two are closer in evolution (i.e., arisen from the same ancestral sequence) than SP-A1 and SP-A pseudogene. One can also see a parallel between physical distance and evolutionary relationships: of the C-type lectins found on chromosome 10, MBP is the farthest away from SP-A in both physical and evolutionary (22) terms. Similarly, SP-D is more related to SP-A in evolution (22) and is also physically closer to the SP-A locus than MBP. Recently, two SP-A genes, one SP-A1-like and the other SP-A2-like, have been identified in the baboon (46). The presence of two SP-A genes in the baboon indicates that the duplication of the primordial SP-A sequence into SP-A1 and SP-A2 occurred more than 26.5 million yr ago (46). It would be of interest to know whether the baboon has a SP-A pseudogene-like sequence. Absence of such a sequence would indicate that the pseudogene arose after the separation of apes from the lineage that led to humans.
Elucidation of the organization of other gene clusters has often provided information on the regulation of gene expression (42, 47). For example, the genes of the globin cluster are developmentally regulated in a manner that parallels their spatial organization (47). Thus, it is our hope that a detailed characterization of the human SP-A locus will not only give evidence of evolutionary relationships but also lead to a more complete understanding of the mechanisms involved in SP-A1 and/or SP-A2 gene expression. Given the important roles of SP-A in surfactant physiology and host defense, such an understanding could give insight into the reasons behind various clinical outcomes or lead to the design of better interventions for respiratory disease.
| |
Footnotes |
|---|
Address correspondence to: Joanna Floros, Ph.D., Department of Cellular and Molecular Physiology, Pennsylvania State University, College of Medicine, 500 University Dr., Hershey, PA 17033. E-mail: jfloros{at}cmp.hmc.psu.edu
(Received in original form May 27, 1997).
Acknowledgments: The writers would like to thank Drs. Anne Karinch and Jeff Murray for their valuable input and Susan DiAngelo for expert technical assistance. The work presented in this report was funded by Grant HL49823 from the National Institutes of Health.
Abbreviations cM, centimorgan; cR, centiray; EST, expressed sequence tag; IGR, intergenic region; MBP, mannose-binding protein; PAC, P1 artificial chromosome; PCR, polymerase chain reaction; SP, surfactant protein; STS, sequence tagged site; 3'UTR, 3' untranslated region; WICGR, Whitehead Institute/MIT Center for Genome Research; YAC, yeast artificial chromosome.
| |
References |
|---|
|
|
|---|
1.
Floros, J.,
D. S. Phelps, and
H. W. Taeusch.
1985.
Biosynthesis and in vitro
translation of the major surfactant-associated protein from human lung.
J.
Biol. Chem.
260:
495-500
2. Phelps, D. S., J. Floros, and H. W. J. Taeusch. 1986. Post-translational modification of the major human surfactant-associated proteins. Biochem. J. 237: 373-377 [Medline].
3. Hawgood, S., H. Efrati, J. Schilling, and B. J. Benson. 1985. Chemical characterization of lung surfactant apoproteins: amino acid composition, N-terminal sequence and enzymatic digestion. Biochem. Soc. Trans. 13: 1092-1096 [Medline].
4.
Haagsman, H. P.,
S. Hawgood,
T. Sargeant,
D. Buckley,
R. T. White,
K. Drickamer, and
B. J. Benson.
1987.
The major lung surfactant protein, SP
28-36, is a calcium-dependent, carbohydrate-binding protein.
J. Biol. Chem.
262:
13877-13880
5.
Drickamer, K.,
M. S. Dordal, and
L. Reynolds.
1986.
Mannose-binding proteins isolated from rat liver contain carbohydrate-recognition domains
linked to collagenous tails: complete primary structures and homology
with pulmonary surfactant apoprotein.
J. Biol. Chem.
261:
6878-6887
6.
Crouch, E.,
K. Rust,
R. Veile,
H. Donis-Keller, and
L. Grosso.
1993.
Genomic organization of human surfactant protein-D (SP-D): SP-D is encoded on chromosome 10q22.2-23.1.
J. Biol. Chem.
268:
2976-2983
7.
Sastry, K.,
G. A. Herman,
L. Day,
E. Deignan,
G. Bruns,
C. C. Morton, and
R. A. B. Ezekowitz.
1989.
The human mannose-binding protein gene:
exon structure reveals its evolutionary relationship to a human pulmonary
surfactant gene and localization to chromosome 10.
J. Exp. Med.
170:
1175-1189
8.
Dobbs, L. G.,
J. R. Wright,
S. Hawgood,
R. Gonzales,
K. Venstrom, and
J. Nellenbogen.
1987.
Pulmonary surfactant and its components inhibit secretion of phosphatidylcholine from cultured alveolar type II cells.
Proc. Natl. Acad. Sci. USA
84:
1010-1014
9. Suzuki, Y., Y. Fujita, and K. Kogishi. 1989. Reconstitution of tubular myelin from synthetic lipids and proteins associated with pig pulmonary surfactant. Am. J. Respir. Cell Mol. Biol. 140: 75-81 .
10. Schurch, S., F. Possmayer, S. Cheng, and A. M. Cockshutt. 1992. Pulmonary SP-A enhances adsorption and appears to induce surface sorting of lipid extract surfactant. Am. J. Physiol. 263(Lung Cell. Mol. Physiol. 7):L210- L218.
11. Phelps, D. S.. 1995. Pulmonary surfactant modulation of host-defense function. Appl. Cardiopulm. Pathophysiol. 5: 221-229 .
12. Reid, K.. 1993. Structure/function relationships in the collectins (mammalian lectins containing collagen-like regions). Biochem. Soc. Trans. 21: 464-470 [Medline].
13. White, R. T., D. Damm, J. Miller, K. Spratt, J. Schilling, S. Hawgood, B. Benson, and B. Cordell. 1985. Isolation and characterization of the human pulmonary surfactant apoprotein gene. Nature 317: 361-363 [Medline].
14. Katyal, S. L., G. Singh, and J. Locker. 1992. Characterization of a second human pulmonary surfactant associated protein SP-A gene. Am. J. Respir. Cell Mol. Biol. 6: 446-452 .
15. Korfhagen, T. R., M. D. Bruno, S. W. Glasser, P. J. Ciraolo, J. A. Whitsett, D. L. Lattier, K. A. Wikenheiser, and J. C. Clark. 1992. Murine pulmonary surfactant SP-A gene: cloning, sequence, and transcriptional activity. Am. J. Physiol. 263(Lung Cell. Mol. Physiol. 7):L546-L554.
16. Fisher, J. A., P. A. Emrie, J. Shannon, K. Sano, B. Hattler, and R. J. Mason. 1988. Rat pulmonary surfactant protein A is expressed as two differently sized mRNA species which arise from differential polyadenylation of one transcript. Biochim. Biophys. Acta 950: 338-345 [Medline].
17.
Boggaram, V.,
K. Qing, and
C. R. Mendelson.
1988.
The major apoprotein of rabbit pulmonary surfactant: elucidation of primary sequence and cyclic
AMP and development regulation.
J. Biol. Chem.
263:
2939-2947
18.
Benson, B.,
S. Hawgood,
J. Schilling,
J. Clements,
D. Damm,
B. Cordell, and
R. T. White.
1985.
Structure of canine pulmonary surfactant apoprotein: cDNA and complete amino acid sequence.
Proc. Natl. Acad. Sci.
USA
82:
6379-6383
19. Voss, T., K. Melchers, G. Scheirle, and K. P. Schafer. 1991. Structural comparison of recombinant pulmonary surfactant protein SP-A derived from two human coding sequences: implications for the chain composition of natural human SP-A. Am. J. Respir. Cell Mol. Biol. 4: 88-94 .
20. Korfhagen, T. R., S. W. Glasser, M. D. Bruno, M. J. McMahan, and J. A. Whitsett. 1991. A portion of the human surfactant protein A (SP-A) gene locus consists of a pseudogene. Am. J. Respir. Cell Mol. Biol. 4: 463-469 .
21. Bruns, G., H. Stroh, G. M. Velman, S. A. Latt, and J. Floros. 1987. The 35kd pulmonary surfactant associated protein is encoded on chromosome 10. Hum. Genetics 76: 58-62 [Medline].
22. Kölble, K., J. Lu, S. E. Mole, S. Kaluz, and K. B. M. Reid. 1993. Assignment of the human pulmonary surfactant protein D gene (SFTP4) to 10q22-q23 close to the surfactant protein A gene cluster. Genomics 17: 294-298 [Medline].
23. Floros, J., S. DiAngelo, M. Koptides, A. M. Karinch, P. Rogan, H. Nielson, R. Spragg, K. Watterberg, and G. Deiter. 1996. Human SP-A locus: allele frequencies and linkage disequilibrium between the two surfactant protein A genes. Am. J. Respir. Cell Mol. Biol. 15: 489-498 [Abstract].
24.
Ballard, P. L.,
S. Hawgood,
H. Liley,
G. A. Wellenstein,
B. Benson, and
R. T. White.
1986.
Regulation of pulmonary surfactant apoprotein SP28-36 gene in fetal human lung.
Proc. Natl. Acad. Sci. USA
83:
9527-9531
25.
Boggaram, V.,
M. E. Smith, and
C. R. Mendelson.
1989.
Regulation of expression of the gene encoding the major surfactant protein (SP-A) in human fetal lung in vitro: disparate effects of glucocorticoids on transcription and on mRNA stability.
J. Biol. Chem.
264:
11421-11427
26.
Liley, H. G.,
R. T. White,
B. J. Benson, and
P. L. Ballard.
1988.
Glucocorticoids both stimulate and inhibit production of pulmonary surfactant protein A in fetal human lung.
Proc. Natl. Acad. Sci. USA
85:
9096-9100
27. Mendelson, C. R., and V. Boggaram. 1991. Hormonal control of surfactant system in fetal lung. Annu. Rev. Physiol. 53: 415-440 [Medline].
28.
Odom, M. J.,
J. M. Snyder, and
C. R. Mendelson.
1987.
Adenosine 3',5'-monophosphate analogs and
-adrenergic agonists induce the synthesis of
the major surfactant apoprotein in human fetal lung in vitro.
Endocrinology
121:
1155-1163
[Abstract].
29. Karinch, A. M., and J. Floros. 1995. 5' Splicing and allelic variants of the human pulmonary surfactant protein A genes. Am. J. Respir. Cell Mol. Biol. 12: 77-88 [Abstract].
30. McCormick, S. M., V. Boggaram, and C. R. Mendelson. 1994. Characterization of mRNA transcripts and organization of human SP-A1 and SP-A2 genes. Am. J. Physiol. 266(Lung Cell. Mol. Physiol. 10):L354-L366.
31. Rishi, A., D. Hatzis, K. McAlmon, and J. Floros. 1992. An allelic variant of the 6A gene for human surfactant protein A. Am. J. Physiol. 262(Lung Cell. Mol. Physiol. 6):L566-L573.
32. Floros, J., D. S. Phelps, D. E. deMello, J. Longmate, H. Harding, B. Benson, and T. White. 1991. The utility of postmortem lung for RNA studies: variability and correlation of the expression of surfactant proteins in human lung. Exp. Lung Res. 17: 91-104 [Medline].
33. Karinch, A. M., and J. Floros. 1997. Effect of genotype on the SP-A2 splice variants and on the levels of surfactant protein-A mRNA in adult humans. Biochem. J. 321: 39-47 .
34. McCormick, S. M., and C. R. Mendelson. 1994. Human SP-A1 and SP-A2 genes are differentially regulated during development and by cAMP and glucocorticoids. Am. J. Physiol. 266(Lung Cell. Mol. Physiol. 10):L367- L374.
35. Kouretas, D., A. M. Karinch, A. Rishi, K. Melchers, and J. Floros. 1993. Conservation analysis of rat and human SP-A gene identifies 5' flanking sequences of rat SP-A that bind rat lung nuclear proteins. Exp. Lung Res. 19: 485-503 [Medline].
36. Ioannou, P. A., C. T. Amemiya, J. Garnes, P. M. Kroisel, H. Shizuya, C. Chen, M. A. Batzer, and P. J. deJong. 1994. A new bacteriophage P1-derived vector for the propagation of large human DNA fragments. Nature Genetics 6: 84-89 [Medline].
37.
Barnes, W. M..
1994.
PCR amplification of up to 35-kb DNA with high fidelity and high yield from
bacteriophage templates.
Proc. Natl. Acad. Sci.
USA
91:
2216-2220
38.
Cox, D. R.,
M. Burmeister,
E. R. Price,
S. Kim, and
R. M. Myers.
1990.
Radiation hybrid mapping: a somatic cell genetic method for constructing
high-resolution maps of mammalian chromosomes.
Science
250:
245-250
39.
Lange, K.,
M. Boehnke,
D. R. Cox, and
K. L. Lunetta.
1995.
Statistical methods for polyploid radiation hybrid mapping.
Genome Res.
5:
136-150
40. Hoover, R. R., and J. Floros. 1996. Characterization of the human SP-A locus. FASEB J. 10: A25 . (Abstr.) .
41. Kala, P., M. Koptides, S. DiAngelo, R. R. Hoover, Z. Lin, V. Veletza, D. Kouretas, and J. Floros. 1998. Characterization of markers flanking the human SP-B locus. Dis. Markers (In press)
42.
Heikkilä, P.,
R. Soininen, and
K. Tryggvason.
1993.
Directional regulatory
activity of cis-acting elements in the bidirectional
1(IV) and
2(IV) collagen gene promoter.
J. Biol. Chem.
268:
24677-24682
43. Pollner, R., C. Schmidt, G. Fischer, K. Kuhn, and E. Pöschl. 1997. Cooperative and competitive interactions of regulatory elements are involved in the control of divergent transcription of human Col4A1 and Col4A2 genes. FEBS Lett. 405: 31-36 [Medline].
44.
Pöschl, E.,
R. Pollner, and
K. Kuhn.
1988.
The genes for the
1(IV) and
2(IV) chains of human basement membrane collagen type IV are arranged head-to-head and separated by a bidirectional promoter of unique
structure.
EMBO J.
7:
2687-2695
[Medline].
45. Floros, J., A. Rishi, S. V. Veletza, and P. K. Rogan. 1993. Concerted and independent genetic events in the 3'UT region of the human surfactant protein A genes. Am. J. Hum. Genet. 53: A683 . (Abstr.) .
46. Gao, E., Y. Wang, S. M. McMormick, J. Li, S. R. Seider, and C. R. Mendelson. 1996. Characterization of two baboon surfactant protein A genes. Am. J. Physiol. 271(Lung Cell. Mol. Physiol. 10):L617-L630.
47. Evans, T., G. Felsenfeld, and M. Reitman. 1990. Control of globin gene transcription. Annu. Rev. Cell. Biol. 6: 95-124 .
This article has been cited by other articles:
![]() |
Y. Sumita, T. Sugiura, Y. Kawaguchi, S. Baba, M. Soejima, Y. Murakawa, M. Hara, and N. Kamatani Genetic polymorphisms in the surfactant proteins in systemic sclerosis in Japanese: T/T genotype at 1580 C/T (Thr131Ile) in the SP-B gene reduces the risk of interstitial lung disease Rheumatology, March 1, 2008; 47(3): 289 - 291. [Abstract] [Full Text] [PDF] |
||||
![]() |
R. E. Oberley, C. L. S. George, and J. M. Snyder A new tool to investigate differences between human SP-A1 and SP-A2 Am J Physiol Lung Cell Mol Physiol, May 1, 2007; 292(5): L1050 - L1051. [Full Text] [PDF] |
||||
![]() |
V. Kalavacharla, K. Hossain, Y. Gu, O. Riera-Lizarazu, M. I. Vales, S. Bhamidimarri, J. L. Gonzalez-Hernandez, S. S. Maan, and S. F. Kianian High-Resolution Radiation Hybrid Map of Wheat Chromosome 1D Genetics, June 1, 2006; 173(2): 1089 - 1099. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. P. Moran, W. Khamri, M. M. Walker, and M. R. Thursz Role of surfactant protein D (SP-D) in innate immunity in the gastric mucosa: evidence of interaction with Helicobacter pylori lipopolysaccharide Innate Immunity, December 1, 2005; 11(6): 357 - 362. [Abstract] [PDF] |
||||
![]() |
G. Wang, X. Guo, and J. Floros Differences in the translation efficiency and mRNA stability mediated by 5'-UTR splice variants of human SP-A1 and SP-A2 genes Am J Physiol Lung Cell Mol Physiol, September 1, 2005; 289(3): L497 - L508. [Abstract] [Full Text] [PDF] |
||||
![]() |
B. Janic, T. M. Umstead, D. S. Phelps, and J. Floros Modulatory effects of ozone on THP-1 cells in response to SP-A stimulation Am J Physiol Lung Cell Mol Physiol, February 1, 2005; 288(2): L317 - L325. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. N. Mikerov, T. M. Umstead, W. Huang, W. Liu, D. S. Phelps, and J. Floros SP-A1 and SP-A2 variants differentially enhance association of Pseudomonas aeruginosa with rat alveolar macrophages Am J Physiol Lung Cell Mol Physiol, January 1, 2005; 288(1): L150 - L158. [Abstract] [Full Text] [PDF] |
||||
![]() |
G. Wang, X. Guo, and J. Floros Human SP-A 3'-UTR variants mediate differential gene expression in basal levels and in response to dexamethasone Am J Physiol Lung Cell Mol Physiol, May 1, 2003; 284(5): L738 - L748. [Abstract] [Full Text] [PDF] |
||||
![]() |
Z. Lin and J. Floros Heterogeneous allele expression of pulmonary SP-D gene in rat large intestine and other tissues Physiol Genomics, December 3, 2002; 11(3): 235 - 243. [Abstract] [Full Text] [PDF] |
||||
![]() |
E. Murray, W. Khamri, M. M. Walker, P. Eggleton, A. P. Moran, J. A. Ferris, S. Knapp, Q. N. Karim, M. Worku, P. Strong, et al. Expression of Surfactant Protein D in the Human Gastric Mucosa and during Helicobacter pylori Infection Infect. Immun., March 1, 2002; 70(3): 1481 - 1487. [Abstract] [Full Text] [PDF] |
||||
![]() |
T. R. Korfhagen Surfactant Protein A (SP-A)-Mediated Bacterial Clearance . SP-A and Cystic Fibrosis Am. J. Respir. Cell Mol. Biol., December 1, 2001; 25(6): 668 - 672. [Full Text] [PDF] |
||||
![]() |
X. Guo, H-M. Lin, Z. Lin, M. Montano, R. Sansores, G. Wang, S. DiAngelo, A. Pardo, M. Selman, and J. Floros Surfactant protein gene A, B, and D marker alleles in chronic obstructive pulmonary disease of a Mexican population Eur. Respir. J., September 1, 2001; 18(3): 482 - 490. [Abstract] [Full Text] [PDF] |
||||
![]() |
G. Wang, D. S. Phelps, T. M. Umstead, and J. Floros Human SP-A protein variants derived from one or both genes stimulate TNF-alpha production in the THP-1 cell line Am J Physiol Lung Cell Mol Physiol, May 1, 2000; 278(5): L946 - L954. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. van Eijk, H. P. Haagsman, T. Skinner, A. Archibold, K. B. M. Reid, and P. R. Lawson Porcine Lung Surfactant Protein D: Complementary DNA Cloning, Chromosomal Localization, and Tissue Distribution J. Immunol., February 1, 2000; 164(3): 1442 - 1450. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. Akiyama, S. V. Volik, I. Plajzer-Frick, A. Prince, H. Sago, H.-U. G. Weier, J. N. Vanderbilt, S. Hawgood, and F. R. Poulain Characterization of the Mouse Collectin Gene Locus Am. J. Respir. Cell Mol. Biol., August 1, 1999; 21(2): 193 - 199. [Abstract] [Full Text] |
||||