Protocadherin Materials And Methods

(12) United States Utility Patent (10) Patent No: US 5,708,143 A
  Suzuki (45) Date of Patent: Jan. 13, 1998

(54) Protocadherin materials and methods  
(75)Inventor: Shintaro Suzuki, Torrance, California (US)
(73) Assignee: Doheny Eye Institute, Los Angeles, California  
Type:U.S.
(*) Notice: Subject to any disclaimer, the term of this patent is extended or adjusted under 35 U.S.C. 154(b) by 0 days.
(21) Appl. No.: 08/453,695
(22) Filed: May 30, 1995

(51) Int. Cl.6 C07K 14/435; C12N 15/12
(52) U.S. Cl. 530/350; 435/69.1; 530/395
(58) Field of Search 530/350; 530/395; 435/69.1

        
(56)References Cited
 
 FOREIGN PATENT DOCUMENTS
  WO91/04745  4/1991 ( WO )
  WO92/08731  5/1992 ( WO )
 OTHER PUBLICATIONS
Sago et al., Genomics, 29, 631-640, 1995.
Amagai et al., "Autoantibodies against a Novel Epithelial Cadherin in Pemphigus Vulgaris, a Disease of Cell Adhesion", Cell, 67: 869-877 (Nov. 29, 1991).
Angerer et al., "Demonstration of Tissue-Specific Gene Expression by in Situ Hybridization", Methods in Enzymology, 152: 649-660, (1987).
Ausubel et al., Eds., Current Protocols in Molecular Biology, Sections 6.1.1 to 6.1.4 and 6.2.1 to 6.2.3, John Wiley & Sons, New York (1987).
Burt, "Morphologic Abnormalities in the Postnatal Differentiation of CA1 Pyramidal Cells and Granule Cells in the Hippocampal Formation of the Ataxic Mouse", Anat. Rec. 196: 61-69 (1980).
Chen et al., "Cell-Cell Contacts Mediated by E-Cadherin (Uvomorulin) Restrict Invasive Behavior of L-Cells", J. Cell. Biol., 114(2): 319-327 (Jul. 1991).
Civitelli et al., "Connexin43 Mediates Direct Intercellular Communication in Human Osteoblastic Cell Networks", J. Clin. Invest., 91: 1888-1896 (1993).
Detrick et al., "The Effect of N-Cadherin Misexpression on Morphogenesis in Xenopus Embryos", Neuron, 4: 493-506 (Apr. 1990).
Donalies et al., "Expression of M-cadherin, a Member of the Cadherin Multigene Family, Correlates with Differentiation of Skeletal Muscle Cells", Proc. Natl. Acad. Sci. USA, 88: 8024-8028 (Sep. 1991).
Frixen et al., "E-Cadherin-Mediated Cell-Cell Adhesion Prevents Invasiveness of Human Carcinoma Cells", J. Cell. Biol., 113(1): 173-185 (Apr. 1991).
Fujimori et al., "Ectopic Expression of N-cadherin Perturbs Histogenesis in Xenopus Embryos", Development, 110: 97-104 (1990).
Gallin et al., "Sequence Analysis of a cDNA Clone Encoding the Liver Cell Adhesion Molecule, L-CAM", Proc. Natl. Acad. Sci. USA, 84: 2808-2812 (May 1987).
Goodwin et al., "Desmoglein Shows Extensive Homology to the Cadherin Family of Cell Adhesion Molecules", Biochem. Biophys. Res. Commun., 173(3): 1224-1230 (Dec. 31, 1990).
Hatta et al., "Cloning and Expression of cDNA Encoding a Neural Calcium-dependent Cell Adhesion Molecule: Its Identity in the Cadherin Gene Family", J. Cell. Biol., 106: 873-881 (Mar. 1988).
Holton et al., "Desmosomal Glycoproteins 2 and 3 (desmocollins) Show N-terminal Similarity to Calcium-Dependent Cell-Cell Adhesion Molecules", J. Cell. Science, 97: 239-246 (1990).
Hynes et al., "Contact and Adhesive Specificities in the Associations, Migrations, and Targeting of Cells and Axons", Cell, 68: 303-322, (Jan. 24, 1992).
Inuzuka et al., "R-Cadherin: A Novel Ca²⁺-Dependent Cell-Cell Adhesion Molecule Expressed in the Retina", Neuron, 7: 69-79 (1991).
Kennett, "Cell Fusion", Methods in Enzymol., 58: 345-359 (1979).
Kikuchi et al., "The Defective Organ of Corti in Shaker-1 Mice", Acta Oto-Laryng., 60: 287-303 (1965).
Kintner, "Regulation of Embryonic Cell Adhesion by the Cadherin Cytoplasmic Domain", Cell, 69: 225-236 (Apr. 17, 1992).
Koch et al., "Identification of Desmoglein, a Constitutive Desmosomal Glycoprotein, as a Member of the Cadherin Family of Cell Adhesion Molecules", Eur. J. Cell Biol., 53: 1-12 (1990).
Liaw et al., "Identification and Cloning of Two Species of Cadherins in Bovine Endothelial Cells", EMBO J., 9(9): 2701-2708 (1990).
Lord et al., "Shaker, A New Mutation of the House Mouse (Mus musculus)", Am. Nat., 63: 453-442 (1929).
Lyon, M., "Twirler: A Mutant Affecting the Inner Ear of the House Mouse", J. Embryol. Exp. Morphol., 6: 105-116 (1958).
Lyon, M., "Ataxia--A New Recessive Mutant of the House Mouse", J. Hered., 46: 77-80 (1955).
Mahoney et al., "The fat Tumor Suppressor Gene in Drosophila Encodes a Novel Member of the Cadherin Gene Superfamily", Cell, 67: 853-868 (Nov. 29, 1991).
Maniatis et al., p. 196 in Molecular Cloning: A Laboratory Manual, Cold Spring Harbor, New York: Cold Spring Harbor Laboratory (1982).
Maruyama et al., "Detection of Calcium Binding Proteins by ⁴⁵Ca Autoradiography on Nitrocellulose Membrane after Sodium Dodecyl Sulfate Gel Electrophoresis", J. Biochem., 95: 511-519 (1984).
Matsunaga et al., "Guidance of Optic Nerve Fibers by N-cadherin Adhesion Molecules", Nature, 334: 62-64 (Jul. 1988).
Miyatani et al., "Neural Cadherin: Role in Selective Cell-Cell Adhesion", Science, 245: 631-635 (Aug. 1989).
Nagafuchi et al., "Transformation of Cell Adhesion Properties by Exogenously Introduced E-cadherin cDNA", Nature, 329: 341-343 (Sep. 1987).
Napolitano et al., "Molecular Cloning and Characterization of B-Cadherin, a Novel Chick Cadherin", J. Cell. Biol., 113(4): 893-905 (May 1991).
Nose et al., "Isolation of Placental Cadherin cDNA: Identification of a Novel gene Family of Cell-Cell Adhesion Molecules", EMBO J., 6(12): 3655-3661 (1987).
Porter et al., "Dystrophin Colocalizes with β-Spectrin in Distinct Subsarcolemmal Domains in Mammalian Skeletal Muscle", J. Cell. Biol., 117(5): 997-1005 (Jun. 1992).
Pytela et al., "Polymerase Chain Reaction Cloning with Degenerate Primers: Homology-Based Identification of Adhesion Molecules", Methods in Enzymology, Erkki Ruoslahti and Eva Engvall, Eds., 245:420-451, Academic Press, (1994).
Ranscht et al., "T-Cadherin, a Novel Cadherin Cell Adhesion Mol. in the Nervous System Lacks the Conserved Cytoplasmic Region", Neuron, 7: 391-402 (Sep. 1991).
Ringwald et al., "The Structure of Cell Adhesion Molecule Uvomorulin. Insights into the Molecular Mechanism of Ca²⁺-Dependent Cell Adhesion", EMBO J., 6(12): 3647-3653 (1987).
Sano et al., "Protocadherins: A Large Family of Cadherin-Related Molecules in Central Nervous System", The EMBO Journal, 12(6): 2249-2256 (1993).
Seldon et al., "Genetic Analysis of Autoimmune gld Mice", J. Exp. Med., 167: 688-693 (1988).
Shimoyama et al., "Molecular Cloning of a Human Ca²⁺-Dependent Cell-Cell Adhesion Molecule Homologous to Mouse Placental Cadherin: Its Low Expression in Human Placental Tissues", J. Cell. Biol., 109: 1787-1794 (Oct. 1989).
Suzuki et al., "Diversity of the Cadherin Family: Evidence for Eight New Cadherins in Nervous Tissue", Cell Regulation, 2: 261-270 (Apr. 1991).
Suzuki et al., "Evidence for Cadherin Superfamily", Cell Struc. Func., 16: 605 (Nov. 23, 1991).
Suzuki et al., "Evidence of Cadherin Superfamily", J. Cell. Biol., 115: 72(a) (Abstract 416) (Dec. 9, 1991).
Takeichi, "Cadherin Cell Adhesion Receptors as a Morphogenetic Regulator.", Science, 251: 1451-1455 (Mar. 1991).
Takeichi, "Cadherins: A Molecular Family Important in Selective Cell-Cell Adhesion", Annu. Rev. Biochem., 59: 237-252 (1990).
Thomas, "Hybridization of Denatured RNA and Small DNA Fragments Transferred to Nitrocellulose", Proc. Natl. Acad. Sci. USA, 77(9): 5201-5205 (Sep. 1980).
Towbin et al., "Electrophoretic Transfer of Proteins from Polyacrylamide Gels to Nitrocellulose Sheets: Procedure and Some Applications", PNAS 76: 4350-4354, (Sep. 1979).
Urushihara et al., "Immunological Detection of Cell Surface Components Related with Aggregation of Chinese Hamster and Chick Embryonic Cells", Dev. Biol., 70: 206-216 (1979).
Vandenbark et al., "Experimental Allergic Encephalomyelitis and Cellular Immunity in the Lewis Rat", Cell. Immunol., 12: 85-93 (1974).
Vleminckx et al., "Genetic Manipulation of E-Cadherin Expression by Epithelial Tumor Cells Reveals an Invasion Suppressor Role", Cell, 66: 107-119 (Jul. 12, 1991).
 
(74)Primary Examiner — Stephen Walsh
 Assistant Examiner — Sally P. Teng
 Attorney, Agent, or Firm — Marshall, O'Toole, Gerstein, Murray & Borun
 Exemplary claim number — 1,2
 Art Unit — 182

(57)

Abstract

[00001]  Polynucleotide sequences encoding novel cadherin-like polypeptides, designated protocadherins, and variants thereof are provided by the invention as well as methods and materials for the recombinant production of the same. Antibody substances specific for protocadherins are also disclosed as useful for modulating the natural binding and/or regulatory activities of the protocadherins.
2 Claims, 3 Drawing Sheets, and 3 Figures
   [00002]  This is a Rule 60 Divisional of U.S. application Ser. No. 08/268,161, filed Jun. 27, 1994, which in turn is a continuation-in-part of International Patent Application No. PCT/US93/12588, filed Dec. 23, 1993, which in turn is a continuation-in-part of U.S. patent application Ser. No. 07/998,003, filed Dec. 29, 1992.

FIELD OF THE INVENTION

   [00003]  The present invention relates, in general, to materials and methods relevant to cell-cell adhesion. More particularly, the invention relates to novel adhesion proteins, designated protocadherins, and to polynucleotide sequences encoding the protocadherins. The invention also relates to methods for inhibiting binding of the protocadherins to their natural ligands/antiligands.

BACKGROUND

   [00004]  In vivo, intercellular adhesion plays an important role in a wide range of events including morphogenesis and organ formation, leukocyte extravasation, tumor metastasis and invasion, and the formation of cell junctions. Additionally, cell-cell adhesion is crucial for the maintenance of tissue integrity.
   [00005]  Intercellular adhesion is mediated by specific cell surface adhesion molecules. Cell adhesion molecules have been classified into at least four families including the immunoglobulin superfamily, the integrin superfamily, the selectin family and the cadherin superfamily. All cell types that form solid tissues express some members of the cadherin superfamily suggesting that cadherins are involved in selective adhesion of most cell types.
   [00006]  Cadherins have been generally described as glycosylated integral membrane proteins that have an N-terminal extracellular domain (the N-terminal 113 amino acids of the domain appear to be directly involved in binding) consisting of five subdomains characterized by sequences unique to cadherins, a hydrophobic membrane-spanning domain and a C-terminal cytoplasmic domain that interacts with the cytoskeleton through catenins and other cytoskeleton-associated proteins. Some cadherins lack a cytoplasmic domain, however, and appear to function in cell-cell adhesion by a different mechanism than cadherins having a cytoplasmic domain. The cytoplasmic domain is required for the adhesive function of the extracellular domain in cadherins that do have a cytoplasmic domain. Binding between members of the cadherin family expressed on different cells is homophilic (i.e., a member of the cadherin family binds to cadherins of its own or a closely related subclass) and Ca²⁺-dependent. For recent reviews on cadherins, see Takeichi, Annu. Rev. Biochem., 59: 237-252 (1990) and Takeichi, Science, 251: 1451-1455 (1991).
   [00007]  The first cadherins to be described (E-cadherin in mouse epithelial cells, L-CAM in avian liver, uvomorulin in the mouse blastocyst, and CAM 120/80 in human epithelial cells) were identified by their involvement in Ca²⁺-dependent cell adhesion and their unique immunological characteristics and tissue localization. With the later immunological identification of N-cadherin, which was found to have a different tissue distribution than E-cadherin, it became apparent that a new family of Ca²⁺-dependent cell-cell adhesion molecules had been discovered.
   [00008]  The molecular cloning of the genes encoding E-cadherin [see Nagafuchi et al., Nature, 329: 341-343 (1987)], N-cadherin [Hatta et al., J. Cell. Biol., 106: 873-881 (1988)], and P-cadherin [Nose et al., EMBO J., 6: 3655-3661 (1987)] provided structural evidence that the cadherins comprised a family of cell adhesion molecules. Cloning of L-CAM [Gallin et al., Proc. Natl. Acad. Sci. USA, 84: 2808-2812 (1987)] and uvomorulin [Ringwald et al., EMBO J., 6: 3647-3653 (1987)] revealed that they were identical to E-cadherin. Comparisons of the amino acid sequences of E-, N-, and P-cadherins showed a level of amino acid similarity of about 45%-58% among the three subclasses. Liaw et al., EMBO J., 9: 2701-2708 (1990) describes the use of PCR with degenerate oligonucleotides based on conserved regions of the E-, N- and P-cadherins to amplify N- and P-cadherin from a bovine microvascular endothelial cell cDNA.
   [00009]  The isolation by PCR of eight additional cadherins was reported in Suzuki et al., Cell Regulation, 2: 261-270 (1991). Subsequently, several other cadherins were described including R-cadherin [Inuzuka et al., Neuron, 7: 69-79 (1991)], M-cadherin [Donalies et al., Proc. Natl. Acad. Sci. USA, 88: 8024-8028 (1991)], B-cadherin [Napolitano et al., J. Cell. Biol., 113: 893-905 (1991)] and T-cadherin [Ranscht et al., Neuron, 7: 391-402 (1991)].
   [00010]  Additionally, proteins distantly related to cadherins such as desmoglein [Goodwin et al., Biochem. Biophys. Res. Commun., 173: 1224-1230 (1990) and Koch et al., Eur. J. Cell Biol., 53: 1-12 (1990)] and the desmocollins [Holton et al., J. Cell Science, 97: 239-246 (1990)] have been described. The extracellular domains of these molecules are structurally related to the extracellular domains of typical cadherins, but each has a unique cytoplasmic domain. Mahoney et al., Cell, 67: 853-868 (1991) describes a tumor suppressor gene of Drosophila, called fat, that also encodes a cadherin-related protein. The fat tumor suppressor comprises 34 cadherin-like subdomains followed by four EGF-like repeats, a transmembrane domain, and a novel cytoplasmic domain. The identification of these cadherin-related proteins is evidence that a large superfamily characterized by a cadherin extracellular domain motif exists.
   [00011]  Studies of the tissue expression of the various cadherin-related proteins reveal that each subclass of molecule has a unique tissue distribution pattern. For example, E-cadherin is found in epithelial cells while N-cadherin is found in neural and muscle cells. Expression of cadherin-related proteins also appears to be spatially and temporally regulated during development because individual proteins appear to be expressed by specific cells and tissues at specific developmental stages [for review see Takeichi (1991), supra]. Both the ectopic expression of cadherin-related proteins and the inhibition of native expression of cadherin-related proteins hinder the formation of normal tissue structure [Detrick et al., Neuron, 4: 493-506 (1990); Fujimori et al., Development, 110: 97-104 (1990); Kintner, Cell, 69: 225-236 (1992)].
   [00012]  The unique temporal and tissue expression pattern of the different cadherins and cadherin-related proteins is particularly significant when the role each subclass of proteins may play in vivo in normal events (e.g., the maintenance of the intestinal epithelial barrier) and in abnormal events (e.g., tumor metastasis or inflammation) is considered. Different subclasses or combinations of subclasses of cadherin-related proteins are likely to be responsible for different cell-cell adhesion events in which therapeutic detection and/or intervention may be desirable. For example, auto-antibodies from patients with pemphigus vulgaris, an autoimmune skin disease characterized by blister formation caused by loss of cell adhesion, react with a cadherin-related protein, offering direct support for adhesion function of cadherins in vivo [Amagai et al., Cell, 67: 869-877 (1991)]. Studies have also suggested that cadherins and cadherin-related proteins may have regulatory functions in addition to adhesive activity. Matsunaga et al., Nature, 334: 62-64 (1988) reports that N-cadherin has neurite outgrowth promoting activity. The Drosophila fat tumor suppressor gene appears to regulate cell growth and suppress tumor invasion as does mammalian E-cadherin [see Mahoney et al., supra; Frixen et al., J. Cell. Biol., 113: 173-185 (1991); Chen et al., J. Cell. Biol., 114: 319-327 (1991); and Vleminckx et al., Cell, 66: 107-119 (1991)]. Thus, therapeutic intervention in the regulatory activities of cadherin-related proteins expressed in specific tissues may be desirable.
   [00013]  There thus continues to exist a need in the art for the identification and characterization of additional cadherin-related proteins which participate in cell-cell adhesion and/or regulatory events. Moreover, to the extent that cadherin-related proteins might form the basis for the development of therapeutic and diagnostic agents, it is essential that the genes encoding the proteins be cloned. Information about the DNA sequences and amino acid sequences encoding the cadherin-related proteins would provide for the large scale production of the proteins by recombinant techniques and for the identification of the tissues/cells naturally producing the proteins. Such sequence information would also permit the preparation of antibody substances or other novel binding molecules specifically reactive with the cadherin-related proteins that may be useful in modulating the natural ligand/antiligand binding reactions in which the proteins are involved.

SUMMARY OF THE INVENTION

   [00014]  The present invention provides cadherin-related materials and methods that are relevant to cell-cell adhesion. In one of its aspects, the present invention provides purified and isolated polynucleotides (e.g., DNA and RNA, both sense and antisense strands) encoding the novel cell adhesion molecules designated herein as protocadherins, including protocadherin-42, protocadherin-43, protocadherin pc3, protocadherin pc4 and protocadherin pc5. Preferred polynucleotide sequences of the invention include genomic and cDNA sequences as well as wholly or partially synthesized DNA sequences, and biological replicas thereof (i.e., copies of the sequences made in vitro). Biologically active vectors comprising the polynucleotide sequences are also contemplated.
   [00015]  Specifically illustrating protocadherin polynucleotide sequences of the present invention are the inserts in the plasmids pRC/RSV-pc42 and pRC/RSV-pc43 which were deposited with the American Type Culture Collection (ATCC), 12301 Parklawn Drive, Rockville, Md. 20852 on Dec. 16, 1992 and were assigned ATCC Accession Nos. 69162 and 69163, respectively.
   [00016]  The scientific value of the information contributed through the disclosures of the DNA and amino acid sequences of the present invention is manifest. For example, knowledge of the sequence of a partial or complete DNA encoding a protocadherin makes possible the isolation by standard DNA/DNA hybridization or PCR techniques of full length cDNA or genomic DNA sequences that encode the protein (or variants thereof) and, in the case of genomic DNA sequences, that specify protocadherin-specific regulatory sequences such as promoters, enhancers and the like. Alternatively, DNA sequences of the present invention may be chemically synthesized by conventional techniques. Hybridization and PCR techniques also allow the isolation of DNAs encoding heterologous species proteins homologous to the protocadherins specifically illustrated herein.
   [00017]  According to another aspect of the invention, host cells, especially eucaryotic and procaryotic cells, are stably transformed or transfected with the polynucleotide sequences of the invention in a manner allowing the expression of protocadherin polypeptides in the cells. Host cells expressing protocadherin polypeptide products, when grown in a suitable culture medium, are particularly useful for the large scale production of protocadherin polypeptides, fragments and variants thereby enabling the isolation of the desired polypeptide products from the cells or from the medium in which the cells are grown.
   [00018]  The novel protocadherin protein products of the invention may be obtained as isolates from natural tissue sources, but are preferably produced by recombinant procedures involving the host cells of the invention. The products may be obtained in fully or partially glycosylated, partially or wholly de-glycosylated, or non-glycosylated forms depending on the host cell selected for recombinant production and/or post-isolation processing.
   [00019]  Protocadherin variants according to the invention may comprise polypeptide analogs wherein one or more of the specified amino acids is deleted or replaced or wherein one or more non-naturally encoded amino acids are added: (1) without loss, and preferably with enhancement, of one or more of the biological activities or immunological characteristics specific for a protocadherin; or (2) with specific disablement of a particular ligand/antiligand binding function. Also contemplated by the present invention are antibody substances (e.g., monoclonal and polyclonal antibodies, chimeric and humanized antibodies, antibody domains including Fab, Fab', F(ab')₂, Fv or single variable domains, and single chain antibodies) which are specific for the protocadherins of the invention. Antibody substances can be developed using isolated natural, recombinant or synthetic protocadherin polypeptide products or host cells expressing such products on their surfaces. The antibody substances may be utilized for purifying protocadherin polypeptides of the invention, for determining tissue expression of polypeptides and as antagonists of the ligand/antiligand binding activities of the protocadherins. Specifically illustrating monoclonal antibodies of the present invention are the protocadherin-43 specific monoclonal antibodies produced by the hybridoma cell line designated 38I2C which was deposited with the ATCC on Dec. 2, 1992 and was assigned ATCC Accession No. HB 11207.
   [00020]  Numerous other aspects and advantages of the present invention will be apparent upon consideration of the following detailed description, reference being made to the drawing wherein FIGS. 1A-C is an alignment of protocadherin amino acid sequences of the invention with the amino acid sequences of N-cadherin and of the Drosophila fat tumor suppressor.

BRIEF DESCRIPTION OF THE DRAWINGS

   [00021]  FIGS. 1A-1C presents an alignment of the deduced amino acid sequences of the extracellular subdomains of pc42 (EC-1 through EC-7), pc43 (EC-1 through EC-6), mouse N-Cadherin (EC-1 through EC-5) and Drosophila fat EC-18. A sequence on a line in FIG. 1A continues on the same line in FIGS. 1B and 1C.

DETAILED DESCRIPTION

   [00022]  The present invention is illustrated by the following examples wherein Examples 1, 2 and 3 describe the isolation by PCR of protocadherin polynucleotide sequences. Example 3 also describes the chromosome localization of several protocadherin genes of the invention. Example 4 describes the isolation by DNA/DNA hybridization of additional protocadherin polynucleotide sequences of the present invention. Example 5 presents the construction of expression plasmids including polynucleotides encoding protocadherin-42 or protocadherin-43 and the transfection of L cells with the plasmids. The generation of antibodies to protocadherin-42 and protocadherin-43 is described in Example 6. Example 7 presents the results of immunoassays of transfected L cells for the expression of protocadherin-42 or protocadherin-43. Example 8 describes the cell aggregation properties of L cells transfected with protocadherin-42, protocadherin-43 or a chimeric protocadherin-43/E-cadherin molecule. The calcium-binding properties of pc43 are described in Example 9. The results of assays of various tissues and cell lines for the expression of protocadherin-42 and protocadherin-43 by Northern blot, Western blot and in situ hybridization are respectively presented in Examples 10, 11 and 12. Example 13 describes immunoprecipitation experiments identifying a 120 kDa protein that coprecipitates with protocadherin-43.

EXAMPLE 1

   [00023]  The polymerase chain reaction (PCR) was used to isolate novel rat cDNA fragments encoding cadherin-related polypeptides.
   [00024]  Design of PCR Primers
   [00025]  Two regions of conserved amino acid sequence, one from the middle of the third cadherin extracellular subdomain (EC-3) and the other from the C-terminus of the fourth extracellular subdomain (EC-4), were identified by comparison of the published amino acid sequences for L-CAM (Gallin et al., supra), E-cadherin (Nagafuchi et al., supra), mouse P-cadherin (Nose et al., supra), uvomorulin (Ringwald et al., supra), chicken N-cadherin (Hatta et al., supra), mouse N-cadherin [Miyatani et al., Science, 245: 631-635 (1989)] and human P-cadherin [Shimoyama et al., J. Cell. Biol., 109: 1787-1794 (1989)], and the corresponding degenerate oligonucleotides respectively set out below in IUPAC-IUB Biochemical nomenclature were designed for use as PCR primers.
[00026]  Primer 1 (SEQ ID NO: 1)
[00027]  5' AARSSNNTNGAYTRYGA 3'
[00028]  Primer 2 (SEQ ID NO: 2)
[00029]  3' TTRCTRTTRCGNGGNNN 5'
   [00030]  The degenerate oligonucleotides were synthesized using an Applied Biosystems model 380B DNA synthesizer (Foster City, Calif.).
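The IUPAC-IUB notation used for Primers 1 and 2 can be unpacked with a short script. The following is a minimal sketch, not part of the disclosed method: it counts how many distinct oligonucleotides each degenerate primer pool represents and rewrites Primer 2, which the text gives 3' to 5', in the 5' to 3' orientation. The names `IUPAC`, `degeneracy`, `PRIMER_1` and `PRIMER_2` are illustrative only.

```python
# IUPAC-IUB nucleotide codes mapped to the bases each one stands for.
IUPAC = {
    "A": "A", "C": "C", "G": "G", "T": "T",
    "R": "AG", "Y": "CT", "S": "CG", "W": "AT",
    "K": "GT", "M": "AC", "B": "CGT", "D": "AGT",
    "H": "ACT", "V": "ACG", "N": "ACGT",
}


def degeneracy(primer):
    """Return the number of distinct sequences in a degenerate primer pool."""
    count = 1
    for code in primer:
        count *= len(IUPAC[code])
    return count


# Primer 1 (SEQ ID NO: 1) is written 5'->3' in the text.
PRIMER_1 = "AARSSNNTNGAYTRYGA"
# Primer 2 (SEQ ID NO: 2) is written 3'->5' in the text; reversing the
# string gives the same oligonucleotide read 5'->3'.
PRIMER_2_3TO5 = "TTRCTRTTRCGNGGNNN"
PRIMER_2 = PRIMER_2_3TO5[::-1]

for name, seq in (("Primer 1", PRIMER_1), ("Primer 2", PRIMER_2)):
    print(name, seq, "length", len(seq), "degeneracy", degeneracy(seq))
```

Under these codes, both primers are 17-mers; Primer 1 represents 4096 sequences and Primer 2 represents 2048.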
   [00031]  Cloning of cDNA Sequences by PCR
   [00032]  PCR was carried out in a manner similar to that described in Suzuki et al., Cell Regulation, 2: 261-270 (1991) on a rat brain cDNA preparation. Total RNA was prepared from rat brain by the guanidinium isothiocyanate/cesium chloride method described in Maniatis et al., p. 196 in Molecular Cloning: A Laboratory Manual, Cold Spring Harbor, N.Y.: Cold Spring Harbor Laboratory (1982). Brain poly(A)⁺ RNAs were then isolated using a FastTrack® kit (Invitrogen, San Diego, Calif.) and cDNA was prepared using a cDNA synthesis kit (Boehringer Mannheim Biochemicals, Indianapolis, Ind.). The PCR reaction was initiated by adding 2.5 units of Taq DNA polymerase (Boehringer Mannheim Biochemicals) to 100 ng template cDNA and 10 μg of each primer, after which 35 reaction cycles of denaturation at 94° C. for 1.5 minutes, annealing at 45° C. for 2 minutes, and polymerization at 72° C. for 3 minutes were carried out. Two major bands of about 450 base pairs (bp) and 130 bp in size were found when the products of the PCR reaction were subjected to agarose gel electrophoresis. The 450 bp band corresponded to the expected length between the two primer sites corresponding to the middle of the third cadherin extracellular subdomain (EC-3) and the carboxyl terminus of the fourth cadherin extracellular subdomain (EC-4), but the 130 bp band could not be predicted from any of the previously identified cadherin sequences. The 450 bp and 130 bp bands were extracted by a freezing and thawing method. The resulting fragments were phosphorylated at the 5' end with T4 polynucleotide kinase and subcloned by blunt-end ligation into the Sma I site of M13mp18 (Boehringer Mannheim Biochemicals) for sequence analysis. Sequencing of the fragments was carried out by the dideoxynucleotide chain termination method using a Sequenase kit (United States Biochemicals, Cleveland, Ohio). DNA and amino acid sequences were analyzed using the Beckman Microgenie program (Fullerton, Calif.).
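As a quick arithmetic check on the cycling conditions stated above, the sketch below sums the programmed hold times for the 35 cycles. It deliberately ignores ramp times between temperatures and any initial or final hold, none of which the text specifies; the variable names are illustrative.

```python
# Cycling parameters as stated in the text: (step, temperature in deg C, minutes).
CYCLE_STEPS = [
    ("denaturation", 94, 1.5),
    ("annealing", 45, 2.0),
    ("polymerization", 72, 3.0),
]
N_CYCLES = 35

# Hold time per cycle and over the whole run (ramp times not included).
minutes_per_cycle = sum(minutes for _, _, minutes in CYCLE_STEPS)
total_minutes = N_CYCLES * minutes_per_cycle

print(f"{minutes_per_cycle} min per cycle, {total_minutes} min total")
```

The programmed holds therefore come to 6.5 minutes per cycle and 227.5 minutes (just under four hours) in total.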
   [00033]  Analysis of cDNA Sequences
   [00034]  Nineteen novel partial cDNA clones were isolated. The DNA and deduced amino acid sequences of the clones (including sequences corresponding to the PCR primers) are set out as follows: RAT-123 (SEQ ID NOs: 3 and 4, respectively), RAT-212 (SEQ ID NOs: 5 and 6), RAT-214 (SEQ ID NOs: 7 and 8), RAT-216 (SEQ ID NOs: 9 and 10), RAT-218 (SEQ ID NOs: 11 and 12), RAT-224 (SEQ ID NOs: 13 and 14), RAT-312 (SEQ ID NOs: 15 and 16), RAT-313 (SEQ ID NOs: 17 and 18), RAT-314 (SEQ ID NOs: 19 and 20), RAT-315 (SEQ ID NOs: 21 and 22), RAT-316 (SEQ ID NOs: 23 and 24), RAT-317 (SEQ ID NOs: 25 and 26), RAT-321 (SEQ ID NOs: 27 and 28), RAT-323 (SEQ ID NOs: 29 and 30), RAT-336 (SEQ ID NOs: 31 and 32), RAT-352 (SEQ ID NOs: 33 and 34), RAT-411 (SEQ ID NOs: 35 and 36), RAT-413 (SEQ ID NOs: 37 and 38), and RAT-551 (SEQ ID NOs: 39 and 40).
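The clone list above follows a regular numbering: each clone takes two consecutive SEQ ID NOs, the first for the DNA sequence and the second for the deduced amino acid sequence, starting at SEQ ID NO: 3. A minimal sketch of that bookkeeping follows; the helper name `seq_id_pairs` is hypothetical.

```python
# Rat clones in the order listed in the text.
RAT_CLONES = [
    "RAT-123", "RAT-212", "RAT-214", "RAT-216", "RAT-218", "RAT-224",
    "RAT-312", "RAT-313", "RAT-314", "RAT-315", "RAT-316", "RAT-317",
    "RAT-321", "RAT-323", "RAT-336", "RAT-352", "RAT-411", "RAT-413",
    "RAT-551",
]
FIRST_RAT_SEQ_ID = 3  # SEQ ID NOs: 1 and 2 are the PCR primers


def seq_id_pairs(clones, first_id):
    """Map each clone to its (DNA SEQ ID NO, amino acid SEQ ID NO) pair."""
    return {
        name: (first_id + 2 * i, first_id + 2 * i + 1)
        for i, name in enumerate(clones)
    }


RAT_SEQ_IDS = seq_id_pairs(RAT_CLONES, FIRST_RAT_SEQ_ID)
print(len(RAT_SEQ_IDS), "clones;", "RAT-218 ->", RAT_SEQ_IDS["RAT-218"])
```

Odd SEQ ID NOs in this range are DNA sequences and even ones are deduced amino acid sequences, so a lookup such as `RAT_SEQ_IDS["RAT-551"]` gives `(39, 40)`.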
   [00035]  The deduced amino acid sequences of the cDNA clones are homologous to, but distinct from the known cadherins. The cadherins described thus far have highly conserved, short amino acid sequences in the third extracellular subdomain (EC-3) including the consensus sequence D-Y-E or D-F-E located at the middle region of the subdomain and the consensus sequence D-X-N-E-X-P-X-F (SEQ ID NO: 41) or D-X-D-E-X-P-X-F (SEQ ID NO: 42) at its end (Hatta et al., supra), while the corresponding sequences of other subdomains, except for the fifth extracellular subdomain (EC-5), are D-R-E and D-X-N-D-N-X-P-X-F (SEQ ID NO: 43), respectively. In contrast, the deduced amino acid sequences of the new clones that correspond to cadherin extracellular subdomains include the sequence D-Y-E or D-F-E at one end, but have the sequence D-X-N-D-N-X-P-X-F instead of D-X-N-E-X-P-X-F or D-X-D-E-X-P-X-F, at the other end. The polypeptides encoded by the partial clones are homologous to previously identified cadherins but did not show significant homology to any other sequences in Genbank. Therefore, the partial cDNAs appear to comprise a new subclass of cadherin-related molecules.
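The motif comparison in the preceding paragraph can be expressed as a pattern search. The sketch below is illustrative only: it converts the hyphenated consensus sequences into regular expressions (treating X as any residue) and flags a fragment that carries D-Y-E or D-F-E together with D-X-N-D-N-X-P-X-F but neither of the typical EC-3 end motifs. The two test peptides are invented for the example and are not sequences from the invention; the function names are hypothetical.

```python
import re


def motif_to_regex(motif):
    """Convert a hyphenated motif such as 'D-X-N-E-X-P-X-F' to a regex.

    X is treated as a wildcard matching any single one-letter residue code.
    """
    return "".join(
        "[A-Z]" if residue == "X" else residue for residue in motif.split("-")
    )


MIDDLE_MOTIFS = ["D-Y-E", "D-F-E"]
TYPICAL_EC3_END = ["D-X-N-E-X-P-X-F", "D-X-D-E-X-P-X-F"]  # SEQ ID NOs: 41, 42
OTHER_SUBDOMAIN_END = "D-X-N-D-N-X-P-X-F"                 # SEQ ID NO: 43


def find_motifs(sequence, motifs):
    """Return (motif, start index) for every occurrence of each motif."""
    hits = []
    for motif in motifs:
        for match in re.finditer(motif_to_regex(motif), sequence):
            hits.append((motif, match.start()))
    return hits


def has_new_subclass_pattern(sequence):
    """True if the fragment shows the motif combination described for the new clones."""
    has_middle = bool(find_motifs(sequence, MIDDLE_MOTIFS))
    has_other_end = bool(find_motifs(sequence, [OTHER_SUBDOMAIN_END]))
    has_typical_end = bool(find_motifs(sequence, TYPICAL_EC3_END))
    return has_middle and has_other_end and not has_typical_end


# Invented toy peptides, for illustration only.
TOY_NEW_LIKE = "GSDYEAKDLNDNAPVFTQ"
TOY_TYPICAL_LIKE = "GSDYEAKDLNEAPVFTQ"

print(has_new_subclass_pattern(TOY_NEW_LIKE))
print(has_new_subclass_pattern(TOY_TYPICAL_LIKE))
```

A motif search of this kind only screens for the short consensus sequences; it does not replace the full sequence comparison described in the text.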

EXAMPLE 2

   [00036]  Various cDNA fragments structurally similar to the rat cDNAs described in Example 1 were isolated from human, mouse, and Xenopus brain cDNA preparations and from Drosophila and C. elegans whole body cDNA preparations by PCR using Primers 1 and 2 as described in Example 1. The DNA and deduced amino acid sequences of the resulting PCR fragments (including sequences corresponding to the PCR primers) are set out as follows: MOUSE-321 (SEQ ID NOs: 44 and 45), MOUSE-322 (SEQ ID NOs: 46 and 47), MOUSE-324 (SEQ ID NOs: 48 and 49), MOUSE-326 (SEQ ID NOs: 50 and 51), HUMAN-11 (SEQ ID NOs: 52 and 53), HUMAN-13 (SEQ ID NOs: 54 and 55), HUMAN-21 (SEQ ID NOs: 56 and 57), HUMAN-24 (SEQ ID NOs: 58 and 59), HUMAN-32 (SEQ ID NOs: 60 and 61), HUMAN-42 (SEQ ID NOs: 62 and 63), HUMAN-43 (SEQ ID NOs: 64 and 65), HUMAN-212 (SEQ ID NOs: 66 and 67), HUMAN-213 (SEQ ID NOs: 68 and 69), HUMAN-215 (SEQ ID NOs: 70 and 71), HUMAN-223 (SEQ ID NOs: 72 and 73), HUMAN-410 (SEQ ID NOs: 74 and 75), HUMAN-443 (SEQ ID NOs: 76 and 77), XENOPUS-21 (SEQ ID NOs: 78 and 79), XENOPUS-23 (SEQ ID NOs: 80 and 81), XENOPUS-25 (SEQ ID NOs: 82 and 83), XENOPUS-31 (SEQ ID NOs: 84 and 85), DROSOPHILA-12 (SEQ ID NOs: 86 and 87), DROSOPHILA-13 (SEQ ID NOs: 88 and 89), DROSOPHILA-14 (SEQ ID NOs: 90 and 91) and C.ELEGANS-41 (SEQ ID NOs: 92 and 93). Comparison of the deduced amino acid sequences indicates significant similarity between sets of these clones. In particular, there are three sets of clones that appear to be cross-species homologues: RAT-218, MOUSE-322 and HUMAN-43; RAT-314, MOUSE-321 and HUMAN-11; and MOUSE-326 and HUMAN-42.

EXAMPLE 3

   [00037]  To ascertain the complete structure of the new proteins defined by the PCR products, two full length human cDNAs corresponding to the partial cDNAs HUMAN-42 and HUMAN-43 were isolated.
   [00038]  Isolation of Full-length Human cDNAs
   [00039]  A human fetal brain cDNA library (Stratagene, La Jolla, Calif.) in the λZapII vector was screened by the plaque hybridization method [described in Ausubel et al., Eds., Current Protocols in Molecular Biology, Sections 6.1.1 to 6.1.4 and 6.2.1 to 6.2.3, John Wiley & Sons, New York (1987)] with ³²P-labelled HUMAN-42 and HUMAN-43 DNA fragments. The positive clones were plaque-purified and, using a helper virus, the inserts were cut out by an in vivo excision method in the form of a Bluescript SK(+) plasmid. The insert sequences were then subcloned into the M13 vector (Boehringer Mannheim Biochemicals) for sequencing. Several overlapping cDNA clones were isolated with each probe including two cDNAs which contained the putative entire coding sequences of two novel proteins designated protocadherin-42 (pc42) and protocadherin-43 (pc43). The DNA and deduced amino acid sequences of pc42 are set out in SEQ ID NOs: 94 and 95, respectively, while the DNA and deduced amino acid sequences of pc43 are set out in SEQ ID NOs: 96 and 97, respectively.
   [00040]  A description of the cloning of protocadherin sequences of the invention was published in Sano et al., The EMBO Journal, 12(6): 2249-2256 (1993) after filing of the priority application hereto. The deduced amino acid sequence of pc43 was previously presented at the Dec. 9, 1991 meeting of the American Society for Cell Biology. An abstract of the presentation is published as Suzuki et al., J. Cell. Biol., 115: 72a (Abstract 416) (Dec. 9, 1991).
   [00041]  Analysis of Full-length Human Clones
   [00042]  Comparison of the full length cDNA sequences of pc42 and pc43 to the sequences of the various DNA fragments originally obtained by PCR reveals that MOUSE-326 and HUMAN-42 correspond to a portion of the fourth extracellular subdomain (EC-4) of pc42; that RAT-314, MOUSE-321, and HUMAN-11 correspond to a portion of the third extracellular subdomain (EC-3) of pc43; and that RAT-218, MOUSE-322 and HUMAN-43 correspond to a portion of the fifth extracellular subdomain (EC-5) of pc43.
   [00043]  The overall structures of pc42 and pc43 are similar to that of typical cadherins but the new molecules also have distinct features. Both protocadherin cDNA sequences contain putative translation initiation sites and translated amino acid sequences start with typical signal sequences, but the clones lack the prosequences that are present in all known cadherin precursors. The cDNAs encode proteins having a large N-terminal extracellular domain and a relatively short C-terminal cytoplasmic domain connected by a transmembrane sequence. The extracellular domains of pc42 and pc43 are different in length and pc42 contains seven subdomains that closely resemble the typical cadherin extracellular subdomain while pc43 has six such subdomains. The sizes of the protocadherin cytoplasmic domains are similar to those of typical cadherins, but the sequences do not show any significant homology with those of known cadherins or cadherin-related proteins.
   [00044]  Amino acid identity determinations between extracellular subdomains of human pc42 and pc43, and of mouse N-cadherin (SEQ ID NO: 98) (presented as an example of a "typical" cadherin) and the eighteenth extracellular subdomain of Drosophila fat tumor suppressor (EC-18, SEQ ID NO: 99) (the eighteenth extracellular subdomain of fat is a prototypical fat subdomain) are presented in Table 1 below, wherein, for example, "N-EC-1 × pc42" indicates that the first extracellular subdomain of N-cadherin was compared to the extracellular subdomain of pc42 indicated on the horizontal axis. Values are percent amino acid identity; pc43 has no EC-7 subdomain.

TABLE 1
____________________________________________________________
                    EC-1  EC-2  EC-3  EC-4  EC-5  EC-6  EC-7
____________________________________________________________
N-EC-1 × pc42         20    27    26    26    31    29    17
N-EC-1 × pc43         31    23    23    26    31    24
N-EC-2 × pc42         28    30    32    30    37    31    19
N-EC-2 × pc43         30    28    30    36    29    30
N-EC-3 × pc42         21    26    30    29    31    30    22
N-EC-3 × pc43         25    18    26    28    28    25
N-EC-4 × pc42         28    28    26    25    29    27    17
N-EC-4 × pc43         21    25    28    28    29    24
N-EC-5 × pc42         24    21    25    24    24    19    12
N-EC-5 × pc43         15    21    20    20    25    16
fat EC-18 × pc42      22    35    32    34    42    35    19
fat EC-18 × pc43      32    30    36    36    33    29
____________________________________________________________
   [00045]  The amino acid identity values between the extracellular subdomains of pc42 and pc43, and N-cadherin EC-1 through EC-5 and Drosophila fat EC-18 are mostly less than 40%. These identity values are comparable to the values between the subdomains of other cadherin subclasses. However, the generally higher identity values obtained with fat EC-18 indicate that pc42 and pc43 are more closely related to fat than to N-cadherin.
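As an illustration of how a percent identity value of the kind tabulated above can be obtained from two aligned subdomain sequences, the following is a minimal Python sketch. It assumes that gapped alignment columns are excluded from the denominator; the source does not state which convention was used, and the function name and the toy sequences are hypothetical rather than taken from the SEQ ID NOs.

```python
def percent_identity(aligned_a: str, aligned_b: str, gap: str = "-") -> float:
    """Percent identity over aligned columns in which neither sequence has a gap."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    compared = 0
    identical = 0
    for a, b in zip(aligned_a.upper(), aligned_b.upper()):
        if a == gap or b == gap:
            continue  # assumption: gapped columns do not count toward the total
        compared += 1
        if a == b:
            identical += 1
    if compared == 0:
        return 0.0
    return 100.0 * identical / compared


# Toy aligned fragments (hypothetical, for illustration only).
toy_a = "DVNDNAPEF-"
toy_b = "DENDNSPVFT"
print(round(percent_identity(toy_a, toy_b)))  # 6 identical of 9 compared columns
```

A different gap convention (for example, counting gaps as mismatches) would give lower values, so identities from different sources should only be compared when the convention is known.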
   [00046]  Amino acid identity determinations between extracellular subdomains of human pc42 and pc43 are presented in Table 2 below. Columns are pc42 subdomains, rows are pc43 subdomains, and values are percent amino acid identity.

TABLE 2
____________________________________________________
                          pc42
pc43        EC-1  EC-2  EC-3  EC-4  EC-5  EC-6  EC-7
____________________________________________________
EC-1          33    27    29    26    25    26    25
EC-2          26    38    29    33    34    28    21
EC-3          26    32    41    30    32    31    22
EC-4          25    34    30    41    39    31    18
EC-5          23    32    29    27    36    34    16
EC-6          25    25    26    25    28    23    26
____________________________________________________
   [00047]  The identity values between the respective EC-1, EC-2, EC-3, EC-4 and EC-5 subdomains, and between the last subdomains, of pc42 and pc43 are generally higher than the values obtained for comparisons of the protocadherins to N-cadherin. These results suggest that pc42 and pc43 are more closely related to one another than they are to classic cadherins.
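The comparison just described can be checked arithmetically from the tabulated values. The sketch below transcribes Table 2 and the same-numbered N-cadherin comparisons from Table 1 into Python and averages them; the variable and function names are illustrative only, and the use of a simple mean is an assumption, since the source does not state how the values were summarized.

```python
# Percent identities transcribed from Table 2.
# Rows: pc43 EC-1..EC-6; columns: pc42 EC-1..EC-7.
TABLE_2 = [
    [33, 27, 29, 26, 25, 26, 25],
    [26, 38, 29, 33, 34, 28, 21],
    [26, 32, 41, 30, 32, 31, 22],
    [25, 34, 30, 41, 39, 31, 18],
    [23, 32, 29, 27, 36, 34, 16],
    [25, 25, 26, 25, 28, 23, 26],
]

# Same-numbered comparisons transcribed from Table 1
# (N-cadherin EC-i against pc42 EC-i and pc43 EC-i, i = 1..5).
N_CAD_VS_PC42 = [20, 30, 30, 25, 24]
N_CAD_VS_PC43 = [31, 28, 26, 28, 25]


def mean(values):
    return sum(values) / len(values)


# Corresponding cells: EC-1..EC-5 by number, plus last subdomain vs last
# subdomain (pc43 EC-6 against pc42 EC-7).
matched_cells = {(i, i) for i in range(5)} | {(5, 6)}
matched = [TABLE_2[r][c] for r, c in sorted(matched_cells)]
others = [
    TABLE_2[r][c]
    for r in range(6)
    for c in range(7)
    if (r, c) not in matched_cells
]

print("pc42 x pc43, corresponding subdomains:", round(mean(matched), 1))
print("pc42 x pc43, all other pairs:", round(mean(others), 1))
print("N-cadherin x pc42, same-numbered:", round(mean(N_CAD_VS_PC42), 1))
print("N-cadherin x pc43, same-numbered:", round(mean(N_CAD_VS_PC43), 1))
```

With these values, the corresponding pc42/pc43 subdomains average higher than either the non-corresponding pairs or the same-numbered N-cadherin comparisons, which is the pattern the paragraph describes.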
   [00048]  FIGS. 1A-C present an alignment of the deduced amino acid sequences of the extracellular subdomains of pc42 (EC-1 through EC-7) (amino acids 42-818 of SEQ ID NO: 95), pc43 (EC-1 through EC-6) (amino acids 29-688 of SEQ ID NO: 97), mouse N-cadherin (EC-1 through EC-5) (amino acids 1-557 of SEQ ID NO: 98) and Drosophila fat EC-18 (SEQ ID NO: 99). A sequence on a line in FIG. 1A continues on the same line in FIGS. 1B and 1C. Gaps were introduced to maximize homology. In FIGS. 1A-1C, the position at which an amino acid appears in a SEQ ID NO is indicated in parentheses. For example, in FIG. 1A the first amino acid of EC-1 of protocadherin-43 is an alanine which appears at position 29 in SEQ ID NO: 97 and the last amino acid of the protocadherin-43 EC-1 appearing in FIG. 1A is an alanine which appears at position 63 in SEQ ID NO: 97. The amino acid residues described by capital letters in the "motif" line are present in more than half of the subdomains of N-cadherin, pc42, pc43 and Drosophila fat. The amino acid residues described by small letters in the motif line are less well conserved in human pc42, pc43, and Drosophila fat. FIGS. 1A-C show that many amino acids characteristic of other cadherin extracellular domain repeats are conserved in the pc42 and pc43 sequences, including the cadherin sequence motifs DXD, DRE and DXNDNXPXF (SEQ ID NO: 43), two glycine residues, and one glutamic acid residue. Additionally, pc42 and pc43 share unique features in comparison to N-cadherin. More amino acids at specific sites are conserved between pc42 and pc43, such as the DXDXGXN (SEQ ID NO: 100) protocadherin sequence motif near the amino terminus of the pc42 and pc43 subdomains and the AXDXGXP (SEQ ID NO: 101) sequence motif near the carboxyl terminus of the subdomains. In addition, both protocadherins share regions that do not show significant homology with the typical cadherin motif (of N-cadherin) near the carboxyl terminus of EC-1, in the middle of EC-2 and EC-4, and at the carboxyl terminus of the last repeat. A cysteine residue is located at a similar position in the middle of EC-4 of pc42 and pc43. In general, the extracellular subdomains of pc42 and pc43 are more similar to EC-18 of fat than to the extracellular subdomains of N-cadherin.
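The sequence motifs named above use X for any amino acid, so they translate directly into regular expressions. The following Python sketch scans a protein sequence for each motif and reports 1-based start positions. The function names and the toy sequence are hypothetical; the toy sequence was constructed for illustration and is not part of any SEQ ID NO.

```python
import re

# Motifs as written in the text; X stands for any amino acid.
MOTIFS = {
    "DXD": "cadherin motif",
    "DRE": "cadherin motif",
    "DXNDNXPXF": "cadherin motif (SEQ ID NO: 43)",
    "DXDXGXN": "protocadherin motif (SEQ ID NO: 100)",
    "AXDXGXP": "protocadherin motif (SEQ ID NO: 101)",
}


def motif_to_regex(motif: str) -> "re.Pattern[str]":
    """Convert an X-wildcard motif to a regex; lookahead allows overlapping hits."""
    body = "".join("[A-Z]" if ch == "X" else ch for ch in motif)
    return re.compile(f"(?=({body}))")


def find_motifs(sequence: str) -> dict:
    """Return {motif: [1-based start positions]} for every motif in MOTIFS."""
    sequence = sequence.upper()
    return {
        motif: [m.start() + 1 for m in motif_to_regex(motif).finditer(sequence)]
        for motif in MOTIFS
    }


# Hypothetical toy sequence built to contain one copy of each longer motif.
toy = "DKDSGVNLLLDREAADTGEPDVNDNAPEF"
for motif, positions in find_motifs(toy).items():
    print(f"{motif:10s} {MOTIFS[motif]:40s} {positions}")
```

Short motifs such as DXD will match frequently by chance in any protein, so hits are only meaningful when they fall at the conserved alignment positions shown in the figures.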
   [00049]  Possible Alternative Splicing
   [00050]  Sequence analysis of various overlapping protocadherin cDNA clones revealed that some clones contained unique sequences at the 3' end, although the 5' end sequences were identical to other clones. The sequences forming the boundaries of the 3' end regions are consistent with the consensus sequence of mRNA splicing, suggesting that these clones may correspond to alternatively spliced mRNAs. The DNA and deduced amino acid sequences of one possible product of alternative splicing of pc42 mRNA are set out in SEQ ID NOs: 102 and 103. The DNA and deduced amino acid sequences of two possible products of alternative splicing of pc43 mRNA are respectively presented in SEQ ID NOs: 104 and 105, and SEQ ID NOs: 106 and 107.
   [00051]  Chromosome Localization
   [00052]  The chromosomal location of the protocadherin 413 gene (SEQ ID NO: 37) and of the pc42 and pc43 genes was determined by conventional methods.
   [00053]  Briefly, C3H/HeJ-gld and Mus spretus (Spain) mice and [(C3H/HeJ-gld × Mus spretus) F₁ × C3H/HeJ-gld] interspecies backcross mice were bred and maintained as previously described in Seldin et al., J. Exp. Med., 167: 688-693 (1988). Mus spretus was chosen as the second parent in the cross because of the relative ease of detection of informative restriction fragment length variants (RFLVs) in comparison with crosses using conventional inbred laboratory strains. Gene linkage was determined by segregation analysis.
   [00054]  Genomic DNA isolated from mouse organs by standard techniques was digested with restriction endonucleases and 10 μg samples were electrophoresed in 0.9% agarose gels. DNA was transferred to Nytran membranes (Schleicher & Schuell, Inc., Keene, N.H.), hybridized with the appropriate probe at 65° C. and washed under stringent conditions, all as previously described (Maniatis et al., supra). To localize the pc42 gene, a mouse sequence probe corresponding to nucleotides 1419 to 1906 of SEQ ID NO: 94 was used and for pc43 a rat sequence probe corresponding to nucleotides 1060 to 1811 of SEQ ID NO: 96 was used. To localize the protocadherin 413 gene, a probe including the sequence set out in SEQ ID NO: 37 was used. Other clones used as probes in the current study and RFLVs used to detect anonymous DNA loci were all previously described [Chromosome 7, DNA segment, Washington 12 (D7Was12); the parathyroid hormone (Pth); calcitonin (Calc); hemoglobin, β chain (Hbb); metallothionein-I (Mt-1); adenine phosphoribosyltransferase (Aprt); growth hormone receptor (Ghr); prostaglandin E receptor EP2 subtype (Ptgerep2); dihydrofolate reductase-2 (Dhfr2); fibroblast growth factor a (Fgfa); and glucocorticoid receptor-1 (Grl-1)].
   [00055]  Comparison of the haplotype distribution of protocadherin genes with those determined for loci throughout the mouse genome allowed each to be mapped to specific regions of mouse chromosomes. The probability for linkage was >99% and indicated assignment of both the pc42 gene and the pc43 gene to chromosome 18. The protocadherin 413 gene was assigned to chromosome 7. The region of chromosome 18 to which the pc42 and pc43 genes were mapped corresponds to the ataxia (ax) loci [Burt, Anat. Rec., 196: 61-69 (1980) and Lyon, J. Hered., 46: 77-80 (1955)] and twirler (Tw) loci [Lyon, J. Embryol. Exp. Morphol., 6: 105-116 (1958)], while the region of chromosome 7 to which the protocadherin 413 gene was mapped corresponds to the shaker (sh-1) locus [Kikuchi et al., Acta Oto-Laryngol., 60: 287-303 (1965) and Lord et al., Am. Nat., 63: 453-442 (1929)]. These loci have been implicated as involved in hereditary neural disease in the mouse. This result is consistent with in situ hybridization results (see Example 12) showing that pc42 and pc43 are strongly expressed in the brain and particularly in the cerebellum.
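To make the idea of segregation analysis in a backcross concrete, the following Python sketch estimates the recombination fraction between two loci and tests the counts against the 1:1 parental-to-recombinant ratio expected for unlinked loci. It is a simplified illustration and not necessarily the statistical procedure used to obtain the linkage probability stated above; the function name and the animal counts are hypothetical placeholders, not data from the study.

```python
import math

# Chi-square critical value for 1 degree of freedom at p = 0.01.
CHI2_CRITICAL_1DF_P01 = 6.635


def recombination_summary(recombinants: int, total: int) -> dict:
    """Summarize two-locus segregation in a backcross panel.

    Each backcross animal is scored as parental or recombinant for the
    pair of loci. Unlinked loci are expected to give about 50% recombinants.
    """
    if total <= 0 or not 0 <= recombinants <= total:
        raise ValueError("need 0 <= recombinants <= total and total > 0")
    parentals = total - recombinants
    fraction = recombinants / total
    std_error = math.sqrt(fraction * (1 - fraction) / total)
    # Goodness of fit to the 1:1 ratio expected under independent assortment.
    chi_square = (parentals - recombinants) ** 2 / total
    return {
        "fraction": fraction,
        "map_distance_cM": 100 * fraction,  # approximation valid for small fractions
        "std_error": std_error,
        "chi_square": chi_square,
        "linked_at_p01": chi_square > CHI2_CRITICAL_1DF_P01,
    }


# Hypothetical counts, for illustration only.
example = recombination_summary(recombinants=3, total=114)
print(example)
```

A small recombination fraction with a chi-square far above the critical value is what one would expect for a probe that co-segregates with markers on a single chromosome.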

EXAMPLE 4

   [00056]  Two additional novel human protocadherin cDNAs and one additional novel rat protocadherin cDNA were isolated using rat protocadherin fragments described in Example 1 as probes.
   [00057]  Initially, the rat clone RAT-214 (SEQ ID NO: 7) was used as a probe to screen a rat brain cDNA library (Stratagene, La Jolla, Calif.). The final washing step was performed twice at 50° C. in 0.1× SSC with 0.1% SDS for 15 minutes. Various clones were identified which contained partial cDNA inserts encoding related protocadherin amino acid sequences. The nucleotide sequence of one novel rat clone designated #6-2 is set out in SEQ ID NO: 108. The first fifteen nucleotides of SEQ ID NO: 108 are the sequence of a linker and are not part of the rat #6-2 clone.
   [00058]  A human fetal brain cDNA library obtained from Stratagene was screened with the 0.7 kbp PstI fragment of clone #6-2. The fragment appears to encode the EC-2 and EC-3 of the rat protocadherin. After screening about 2×10⁶ phages, eleven positive clones were isolated. Sequencing of the clones identified a novel full length human protocadherin cDNA designated human pc3. The nucleotide and deduced amino acid sequences of human pc3 are set out in SEQ ID NOs: 109 and 110.
   [00059]  The 0.7 kbp PstI fragment of rat clone #6-2 was also used to rescreen the Stratagene rat brain cDNA library for full length rat cDNA clones. A clone containing an insert encoding a full length novel protocadherin cDNA was isolated. The DNA and deduced amino acid sequences of the insert are set out in SEQ ID NOs: 111 and 112. The full length rat cDNA was named pc5 because it does not appear to be the homolog of the human pc3 clone based upon a comparison of the sequences.
   [00060]  Concurrently, the 0.8 kbp EcoRI-PstI fragment of partial rat cDNA designated #43 (SEQ ID NO: 113), which was obtained by screening the Stratagene rat brain cDNA library with a probe corresponding to the human pc43 cytoplasmic domain, was used to probe the Stratagene human cDNA library for full length human protocadherin cDNAs. The fragment appears to encode EC-3 through the beginning of EC-6 of clone #43. One partial clone identified encodes a novel human protocadherin named human pc4. The nucleotide and deduced amino acid sequences of the human pc4 clone are set out in SEQ ID NOs: 114 and 115. The amino acid sequence encoded by the pc4 clone appears to begin in the middle of EC-2 of pc4 and continues through the cytoplasmic tail of the protocadherin.

EXAMPLE 5

   [00061]  The full length human cDNAs encoding pc42 and pc43 were expressed in L cells (ATCC CCL 1) using the pRC/RSV expression vector (Invitrogen, San Diego, Calif.). The cDNAs were isolated from the Bluescript SK(+) clones described in Example 2 by digestion with SspI followed by blunt-ending with DNA polymerase and digestion with XbaI (for pc42), or by double digestion with SpeI and EcoRV (for pc43). The pRC/RSV expression vector was digested with HindIII, followed by blunt-ending and re-digestion with XbaI for insertion of pc42 sequences, or by digestion with XbaI followed by blunt-ending and re-digestion with SpeI for insertion of pc43 sequences. The isolated protocadherin DNAs were ligated into the linearized pRC/RSV vector. The resulting pc42 expression plasmid designated pRC/RSV-pc42 (ATCC 69162) and pc43 expression plasmid designated pRC/RSV-pc43 (ATCC 69163) were purified by CsCl gradient centrifugation and transfected into L cells by a Ca-phosphate method.
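The subcloning strategy above relies on pairing one blunt end with one cohesive end so that each insert can ligate in only one orientation. The Python sketch below checks that pairing for the pc42 and pc43 constructs. The enzyme recognition sites are standard published values; the dictionary layout and function names are illustrative assumptions, and the sketch ignores practical details such as internal sites within the inserts.

```python
# Recognition sites (^ marks the top-strand cut) and the end produced.
# "blunt" means no overhang; otherwise the 5' overhang sequence is given.
ENZYMES = {
    "SspI": ("AAT^ATT", "blunt"),
    "EcoRV": ("GAT^ATC", "blunt"),
    "XbaI": ("T^CTAGA", "CTAG"),
    "SpeI": ("A^CTAGT", "CTAG"),
    "HindIII": ("A^AGCTT", "AGCT"),
}


def end_after(enzyme: str, blunted: bool = False) -> str:
    """End left by an enzyme, optionally after polymerase blunt-ending."""
    overhang = ENZYMES[enzyme][1]
    return "blunt" if blunted else overhang


def directional_fit(insert_ends, vector_ends) -> bool:
    """True when the ends pair up one-to-one and the two ends differ.

    Identical palindromic 5' overhangs (such as CTAG) anneal to each other,
    and blunt ends ligate to blunt ends. Two different end types on the
    fragment force a single orientation.
    """
    pairs_match = list(insert_ends) == list(vector_ends)
    ends_differ = insert_ends[0] != insert_ends[1]
    return pairs_match and ends_differ


constructs = {
    # pc42: SspI (blunt-ended) + XbaI insert into HindIII (blunt-ended) + XbaI vector
    "pc42": (
        (end_after("SspI", blunted=True), end_after("XbaI")),
        (end_after("HindIII", blunted=True), end_after("XbaI")),
    ),
    # pc43: SpeI + EcoRV insert into SpeI + XbaI (blunt-ended) vector
    "pc43": (
        (end_after("SpeI"), end_after("EcoRV")),
        (end_after("SpeI"), end_after("XbaI", blunted=True)),
    ),
}

for name, (insert_ends, vector_ends) in constructs.items():
    print(name, insert_ends, vector_ends, directional_fit(insert_ends, vector_ends))
```

XbaI and SpeI leave the same CTAG overhang, which is why blunt-ending the XbaI site before cutting with SpeI is needed to keep the two vector ends distinguishable.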
   [00062]  The pc42 and pc43 transfectants were morphologically similar to the parental cells. Northern blot analysis of L cells transfected with pc42 or pc43 DNA sequences showed that the transfected cells expressed mRNAs of a size expected to encode the particular protocadherin.

EXAMPLE 6

   [00063]  Rabbit polyclonal antibodies specific for pc42 and pc43 were generated as well as a mouse monoclonal antibody specific for pc43.
   [00064]  Preparation of Polyclonal Antibodies Specific for pc42 and pc43
   [00065]  DNA sequences encoding portions of the extracellular domain of pc42 and pc43 were each fused to a maltose binding protein-encoding sequence and expressed in bacteria. Specifically, DNAs corresponding to EC-4 through EC-7 of pc42 and EC-3 through EC-5 of pc43 were prepared by PCR and subcloned in the correct reading frame into the multicloning site of the pMAL expression vector (New England Biolabs, Beverly, Mass.) which contains sequences encoding maltose binding protein immediately upstream of the multicloning site. The resulting plasmids were then introduced into E. coli NM522 cells (Invitrogen, San Diego, Calif.) by a single step transformation method. Expression of the fusion proteins was induced by the addition of IPTG and the fusion proteins were purified from cell extracts by amylose resin affinity chromatography (New England Biolabs) as described by the manufacturer. The fusion proteins were used for the immunization of rabbits without further purification.
   [00066]  Polyclonal antibodies were prepared in rabbits by immunization at four subcutaneous sites with 500 μg of purified fusion protein in Freund's complete adjuvant. Subsequent immunizations with 100 μg of the fusion protein were in Freund's incomplete adjuvant. Immune sera were passed through Sepharose coupled to maltose binding protein (New England Biolabs) and polyclonal antibodies were purified from the immune sera using Sepharose affinity columns prepared by reaction of the purified fusion protein with CNBr Sepharose (Pharmacia). Reactivity of the polyclonal sera with purified pc42 fusion protein and pc42 transfected cell extracts (described in Example 5) was confirmed.
   [00067]  Preparation of Monoclonal Antibodies Specific for pc43
   [00068]  The pc43 fusion protein (containing the EC-3 through EC-5 subdomains of pc43) was used to generate monoclonal antibodies in mice according to the method of Kennett, Methods in Enzymol., 58:345-359 (1978). Briefly, mice were immunized with the pc43 fusion protein (100 μg) at two subcutaneous sites. The spleen from the highest titer mouse was fused to the NS1 myeloma cell line. The resulting hybridoma supernatants were screened in an ELISA for reactivity with the pc43 fusion protein and with maltose binding protein. The fusion wells with the highest reactivity to the pc43 extracellular domains were subcloned. The hybridoma cell line designated 38I2C (ATCC HB 11207) produced an IgG₁ subtype monoclonal antibody specific for pc43. Reactivity of the monoclonal antibody produced by hybridoma cell line 38I2C to pc43 was confirmed by immunoblotting the pc43 L cell transfectants described in Example 5. The 38I2C monoclonal antibody is specific for human pc43.

EXAMPLE 7

   [00069]  L cells transfected with DNA sequences encoding pc42 and pc43 as prepared in Example 5 were assayed for expression of the protocadherins by immunoblot and by immunofluorescence microscopy.
   [00070]  Immunoblot Analysis
   [00071]  Cell extracts of pc42 and pc43 transfectants were subjected to SDS-PAGE and then blotted electrophoretically onto a PVDF membrane (Millipore, Bedford, Mass.). The membranes were incubated with 5% skim milk in Tris-buffered saline (TBS) for two hours and then respectively with either pc42 polyclonal sera or pc43 monoclonal antibody for one hour. The membranes were washed three times (for 5 minutes each wash) with TBS containing 0.05% Tween 20 and respectively incubated with alkaline phosphatase-conjugated anti-rabbit IgG antibody or anti-mouse IgG antibody (Promega, Madison, Wis.) in the same buffer for one hour. After washing the membranes with TBS containing 0.05% Tween 20, reactive bands were visualized by using Western Blue solution (Promega).
   [00072]  Anti-pc42 polyclonal antibodies stained a band of about 170 kDa molecular weight in pc42 transfected cells, but not parental L cells. The pc43-specific monoclonal antibody (38I2C) and polyclonal antibodies stained two adjacent bands of about 150 kDa molecular weight in pc43 transfected cells. The pc43 antibodies did not stain bands in parental L-cells. The molecular weights indicated by the staining of bands by the pc42 and pc43 antibodies are significantly larger than the molecular weights predicted from the deduced amino acid sequences. This discrepancy in molecular weight is common among various cadherin-related proteins and may be attributable to the glycosylation and/or cadherin specific structural properties. The pc42 antibody also stained smaller bands, which may be proteolytic degradation products.
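The comparison between observed band size and the molecular weight predicted from a deduced amino acid sequence can be illustrated with a short calculation. The Python sketch below sums standard average residue masses and adds one water for the free termini. It is a generic illustration: the function name and the toy peptide are hypothetical, no predicted mass for pc42 or pc43 is asserted here, and the calculation ignores signal peptide cleavage and post-translational modifications such as glycosylation.

```python
# Average residue masses in daltons (amino acid minus one water).
RESIDUE_MASS = {
    "G": 57.0519, "A": 71.0788, "S": 87.0782, "P": 97.1167, "V": 99.1326,
    "T": 101.1051, "C": 103.1388, "L": 113.1594, "I": 113.1594, "N": 114.1038,
    "D": 115.0886, "Q": 128.1307, "K": 128.1741, "E": 129.1155, "M": 131.1926,
    "H": 137.1411, "F": 147.1766, "R": 156.1875, "Y": 163.1760, "W": 186.2132,
}
WATER = 18.01528


def molecular_weight(sequence: str) -> float:
    """Average molecular weight (Da) of an unmodified polypeptide chain."""
    sequence = sequence.upper()
    unknown = set(sequence) - set(RESIDUE_MASS)
    if unknown:
        raise ValueError(f"unrecognized residue codes: {sorted(unknown)}")
    return sum(RESIDUE_MASS[aa] for aa in sequence) + WATER


# Hypothetical toy peptide, for illustration only.
toy_peptide = "DVNDNAPEF"
print(f"{molecular_weight(toy_peptide) / 1000:.2f} kDa")
```

Applying such a function to a full deduced sequence gives the polypeptide-only mass, so an apparent SDS-PAGE size well above that figure is consistent with the glycosylation or structural effects suggested in the text.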
   [00073]  When transfected cells were trypsinized and cell extracts were prepared, run on SDS/PAGE and immunoblotted with the appropriate antibody, the pc42 and pc43 polypeptides expressed by the transfected cells were found to be highly sensitive to proteolysis and were easily digested by 0.01% trypsin treatment. In contrast to the classic cadherins, however, these proteins were not protected from the digestion in the presence of 1-5 mM Ca²⁺.
   [00074]  Immunofluorescence Microscopy
   [00075]  Transfected cells were grown on a cover slip precoated with fibronectin and were fixed with 4% paraformaldehyde for 5 minutes at room temperature or with cold methanol on ice for 10 minutes followed by 4% paraformaldehyde fixation. After washing with TBS, the cells were incubated with TBS containing 1% BSA for 30 minutes and then with anti-pc42 polyclonal antibody or anti-pc43 monoclonal antibody in TBS containing 1% BSA for 1 hour at room temperature. Cover slips were then washed with TBS containing 0.01% BSA and respectively incubated with FITC-conjugated anti-rabbit antibody or anti-mouse antibody (Cappel, Durham, N.C.) for 60 minutes at room temperature. The cells were washed again with TBS containing 0.01% BSA and subjected to fluorescence microscopy. Both pc42-specific and pc43-specific polyclonal antibodies stained the cell periphery of transfected cells expressing the protocadherin proteins, mainly at the cell-cell contact sites. The antibodies did not stain the parent L cells, nor did rabbit preimmune sera stain the pc42 and pc43 transfectants.

EXAMPLE 8

   [00076]  The cell aggregation properties of the transfected L cells expressing protocadherin proteins were examined. Transfected L cells were cultured in Dulbecco's Modified Eagle's Medium (DMEM) (Gibco, Grand Island, N.Y.) supplemented with 10% fetal bovine serum at 37° C. in 5% CO₂. Cells grown near confluence were treated with 0.01% trypsin in the presence of 1 mM EGTA for 25 minutes on a rotary shaker at 37° C. and collected by centrifugation. The cells were washed three times with Ca²⁺-free HEPES-buffered saline (HBS) after adding soybean trypsin inhibitor, and were resuspended in HBS containing 1% BSA. The cell aggregation assay [Urushihara et al., Dev. Biol., 70: 206-216 (1979)] was performed by incubating the resuspended cells in a 1:1 mixture of DMEM and HBS containing 1% BSA, 2 mM CaCl₂ and 20 μg/ml of deoxyribonuclease on a rotary shaker at 37° C. for 30 minutes to 6 hours.
   [00077]  The pc42 and pc43 transfectants did not show any significant cell aggregation activity during periods of incubation less than 1 hour. This is in contrast to the cell aggregation that occurs with classic cadherins in similar experiments (Nagafuchi et al., supra, and Hatta et al., supra). However, prolonged incubation of transfected cells (more than 1-2 hours) resulted in gradual re-aggregation of the cells into small aggregates. Similar results were obtained when single cell suspensions of transfected cells were prepared by trypsin treatment in the presence of Ca²⁺. No re-aggregation was observed under the same conditions when untransfected L cells or L cells transfected with pRC/RSV vector alone were tested. When pc43 transfectants labelled with DiO (Molecular Probes, Eugene, Oreg.) were incubated with unlabelled pc42 transfectants in the cell aggregation assay, aggregation of labelled and unlabelled cells was almost mutually exclusive, indicating that protocadherin binding is homophilic.
   [00078]  In view of the fact that the protocadherin cytoplasmic domains exhibit no apparent homology to cadherin domains, experiments were performed to determine if the difference in cytoplasmic domains could account for the difference in cell aggregation activity observed in cadherin and protocadherin transfectants. The cytoplasmic domain of pc43 was replaced with the cytoplasmic domain of E-cadherin and aggregation of cells transfected with the chimeric construct was analyzed.
   [00079]  The Bluescript SK(+) clone described in Example 2 which contained the entire coding sequence for pc43 was digested with EcoRV and then partially digested with XbaI to remove the sequence corresponding to the cytoplasmic domain, and the plasmid DNA was purified by agarose gel electrophoresis. The cDNA corresponding to the cytoplasmic domain of mouse E-cadherin was synthesized by PCR using mouse cDNA made from mouse lung mRNA as a template and specific primers corresponding to a region near the N-terminus of the cytoplasmic domain sequence or the region containing the stop codon of mouse E-cadherin (Nagafuchi et al., supra). An XbaI site was added to the 5' end of the upstream primer. The E-cadherin cytoplasmic domain cDNA was then subcloned into the linearized pc43 Bluescript clone. The DNA containing the entire resulting chimeric sequence was cut out with SpeI and EcoRV and was subcloned into the SpeI-blunted XbaI site of the expression vector pRC/RSV. Finally, L cells were transfected with the resultant construct by a calcium phosphate method. After screening with G418 for about 10 days, the transfectants were stained with FITC-labeled 38I2C anti-pc43 antibody and subjected to FACS analysis. A portion of the highly labeled cells was isolated and cloned. Transfectants showed a morphology similar to that of parental L cells and the expressed protein was localized at the cell periphery using pc43 antibody for immunofluorescence microscopy.
   [00080]  Cell aggregation activity of the chimeric transfectants was analyzed as follows. The chimeric pc43 transfectants were labeled with DiO for 20 minutes at room temperature. The resultant cells were trypsinized in the presence of 1 mM EGTA and a single cell suspension was made. Then, the cells were mixed with unlabeled transfectants of another type and incubated on a rotary shaker for two hours. The results were examined with a fluorescence and phase contrast microscope. Antibody inhibition of cell aggregation was examined by incubation of the transfectants in the presence of polyclonal anti-pc43 antibody (100 ng/ml) in the standard assay medium.
   [00081]  In the cell aggregation assay, the chimeric pc43 transfectants showed clear Ca²⁺-dependent cell aggregation within forty minutes of incubation. Cell aggregation was inhibited by the addition of pc43-specific polyclonal antibody.

EXAMPLE 9

   [00082]  The procedures of Maruyama et al., J. Biochem., 95: 511-519 (1984) were used to determine the calcium binding properties of pc43 by Western blot analysis in the presence or absence of calcium-45. The pc43 fusion protein described in Example 6 containing pc43 subdomains EC-3 through EC-5 was compared to the calcium binding protein calmodulin. Samples of purified pc43 fusion protein were run on SDS/PAGE and electrophoretically transferred to PVDF membrane. Binding of ⁴⁵Ca²⁺ to the pc43 fusion protein was detected by autoradiography and was determined to be nearly as efficient as binding of ⁴⁵Ca²⁺ to calmodulin. In contrast, there was no binding of calcium to purified maltose binding protein lacking the pc43 extracellular domain. The pc43 subdomains EC-3 through EC-5 contain sequences highly homologous to the putative Ca²⁺ binding motifs found in E-cadherin. [See Ringwald et al., EMBO J., 6: 3647-3653 (1987).]

EXAMPLE 10

   [00083]  The expression of mRNA encoding pc42 and pc43 was assayed in various tissues and cell lines by Northern blot.
   [00084]  Total RNAs were prepared by the guanidinium isothiocyanate method and poly(A)+ RNAs were isolated using a FastTrack kit (Invitrogen). RNA preparations were electrophoresed in a 0.8% agarose gel under denaturing conditions and transferred onto a nitrocellulose filter using a capillary method. Northern blot analyses were performed according to the method of Thomas, Proc. Natl. Acad. Sci. USA, 77: 5201-5205 (1980). The final wash was in 0.2× standard saline citrate containing 0.1% sodium dodecyl sulfate at 65° C. for 10 minutes.
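Wash stringency depends in part on the sodium ion concentration of the final wash buffer. The Python sketch below converts an SSC strength into an approximate sodium concentration, assuming the conventional formulation in which 1× SSC is 150 mM NaCl and 15 mM trisodium citrate; the source does not spell out its recipe, so that composition and the function name are assumptions. Sodium contributed by SDS is ignored.

```python
# Conventional 1x SSC composition (assumed; not stated in the source).
NACL_MM_AT_1X = 150.0
TRISODIUM_CITRATE_MM_AT_1X = 15.0


def sodium_mM(ssc_strength: float) -> float:
    """Approximate Na+ concentration (mM) of an SSC wash at the given strength.

    Each trisodium citrate contributes three sodium ions.
    """
    if ssc_strength < 0:
        raise ValueError("SSC strength must be non-negative")
    return ssc_strength * (NACL_MM_AT_1X + 3 * TRISODIUM_CITRATE_MM_AT_1X)


for strength in (0.2, 0.1):
    print(f"{strength}x SSC -> about {sodium_mM(strength):.1f} mM Na+")
```

Lower salt and higher temperature both increase stringency, so the 0.2× wash at 65° C. used here and the 0.1× wash at 50° C. used for library screening in Example 4 are not directly rankable from salt concentration alone.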
   [00085]  Protocadherin mRNA Expression in Adult Rat Tissues
   [00086]  Total mRNA preparations of rat tissues including brain, heart, liver, lung, skin, kidney and muscle were separated electrophoretically under denaturing conditions (10 μg mRNA/lane) and transferred onto nitrocellulose filters. The filters were hybridized with ³²P-labelled cDNA fragments MOUSE-326 (which corresponds to EC-4 of human pc42) and RAT-218 (which corresponds to EC-5 of human pc43). The mRNAs of both protocadherins were highly expressed in brain. The pc42 probe detected a major band of 7 kb and a minor band of 4 kb in size, possibly representing the products of alternative splicing. The pc43 probe hybridized to a major band of 5 kb in size and with minor bands of smaller sizes.
   [00087]  Developmental Expression of Protocadherin mRNA in Rat Brain
   [00088]  To examine the developmental regulation of mRNA expression of the protocadherins, brain mRNA from rats at embryonic days 17 and 20, neonatal days 5 and 11 and from adult rats was prepared and subjected to Northern blot analysis as described above for other rat tissues. β-actin was used as an internal standard. mRNA levels for pc42 and pc43 proteins increased during embryonic development of the brain as compared with β-actin expression.
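Using β-actin as an internal standard amounts to dividing each protocadherin band signal by the β-actin signal from the same lane. The Python sketch below shows that normalization. The function name, lane labels and densitometry numbers are hypothetical placeholders chosen only to make the example run; they are not measurements from the study, and the source does not describe how band intensities were quantified.

```python
def normalize_to_reference(target: dict, reference: dict) -> dict:
    """Divide each lane's target signal by the reference signal of that lane."""
    if target.keys() != reference.keys():
        raise ValueError("target and reference must cover the same lanes")
    normalized = {}
    for lane, signal in target.items():
        if reference[lane] <= 0:
            raise ValueError(f"reference signal for lane {lane!r} must be positive")
        normalized[lane] = signal / reference[lane]
    return normalized


# Hypothetical band intensities (arbitrary units), for illustration only.
protocadherin_signal = {"E17": 120.0, "E20": 200.0, "P5": 310.0, "P11": 400.0, "adult": 380.0}
beta_actin_signal = {"E17": 100.0, "E20": 105.0, "P5": 98.0, "P11": 102.0, "adult": 95.0}

relative = normalize_to_reference(protocadherin_signal, beta_actin_signal)
for lane, value in relative.items():
    print(f"{lane:6s} {value:.2f}")
```

Normalizing in this way corrects for lane-to-lane differences in RNA loading, so a rising ratio across developmental stages reflects a change in the protocadherin message rather than in the amount of RNA applied.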
   [00089]  Protocadherin mRNA Expression in Human Cell Lines
   [00090]  Several neuronal and glial cell lines (including human SK-N-SH neuroblastoma, human U251 glioma, and mouse Neuro-2a neuroblastoma cell lines) were assayed by Northern blot using ³²P-labelled probes for expression of pc42 and pc43 mRNA. Human cell lines were probed with HUMAN-42 (which corresponds to EC-4 of human pc42) and HUMAN-43 (which corresponds to EC-5 of human pc43) cDNA fragments while the mouse cell line was probed with MOUSE-326 (which corresponds to EC-4 of human pc42) and RAT-322 (which corresponds to EC-5 of human pc43) cDNA fragments. SK-N-SH human neuroblastoma cells and U251 human glioma cells were found to express pc43 mRNA and Neuro-2a mouse neuroblastoma cells were found to express pc42 mRNA.

EXAMPLE 11

   [00091]  Expression of pc43 protein in various tissues, extracts and cells was assayed by Western blot and immunofluorescence microscopy.
   [00092]  Expression in Rat Cardiac Muscle Extracts
   [00093]  A rat heart non-ionic detergent extract was prepared by freezing a heart in liquid nitrogen after removal, powdering in a mortar and pestle, grinding briefly in a polytron in 0.5% Nonidet P40 in [10 mM PIPES (pH 6.8), 50 mM NaCl, 250 mM (NH₄)₂SO₄, 300 mM sucrose, 3 mM MgCl₂] and microfuging for 15 minutes. Samples were separated by SDS/PAGE and electrophoretically transferred to nitrocellulose (Towbin et al., PNAS 76:4350-4354, 1979). Two pc43 protein bands with molecular weights of 150 kDa and 140 kDa were detected with rabbit polyclonal antibodies to pc43 by the immunoblot method described in Example 7.
   [00094]  Expression in Tissue Sections and Cells
   [00095]  To determine the localization of the protocadherins in various tissues, human and rat adult tissues were removed, incubated in 30% sucrose in PBS for 30 minutes at 4° C., embedded in OCT compound (Tissue-Tek, Elkhart, Ind.) in cryomolds and quickly frozen. Six micron sections were cut and placed on glass slides. The slides were washed with PBS and fixed in 3% paraformaldehyde for 5 minutes. To permeabilize the tissue sections, the slides were immersed in -20° C. acetone for 10 minutes and air dried. The sections were blocked with 2% goat serum and 1% BSA in PBS for 30 minutes and then incubated with the rabbit anti-pc43 polyclonal antisera for 1 hour at room temperature. The sections were rinsed 3 times in PBS containing 0.1% BSA and incubated with a biotinylated anti-rabbit antibody (Vector Laboratories, Burlingame, Calif.) in 1% BSA in PBS for 30 minutes. After rinsing 3 times, streptavidin conjugated with FITC (Vector Laboratories) was added for 30 minutes and the sections were again washed 3 times. For co-localization studies, an appropriate primary antibody was used with a TRITC-conjugated secondary antibody.
   [00096]  A. Muscle
   [00097]  Immunolocalization of pc43 in rat cardiac muscle shows that pc43 is localized in a repeating pattern which is consistent with pc43 being associated with the sarcomeres. Sarcomeres are repetitive contractile units between the fascia adherens in skeletal and cardiac muscle. Co-localization with cytoskeletal proteins shows that pc43 is present at the ends of the sarcomeres in the Z lines, which are associated with desmin and the actin-binding proteins vinculin and alpha-actinin. The thin microfilaments of F-actin are associated with the thick myosin filaments between the Z lines. In contrast, N-cadherin is localized at the ends of cardiac myocytes at the fascia adherens junctions at sites of myocyte:myocyte contact. The localization of pc43 in cardiac muscle suggests that pc43 may play a role in muscle contraction in the anchoring of the contractile apparatus to the plasma membrane.
   [00098]  Similar localization for pc43 was observed in rat skeletal muscle. Ultrastructural studies have shown that dystrophin, the gene product lacking in Duchenne muscular dystrophy, is a component of the sarcolemma ?Porter et al., J. Cell. Biol., 117:997-1005 (1992)!. The sarcolemma is connected to the contractile apparatus at the M and Z lines where pc43 is localized.
   [00099]  B. Brain
   [00100]  Reactivity of anti-pc43 polyclonal antibody and monoclonal antibody 38I2C on frozen sections of rat and human cerebellum, respectively, shows that the major sites of pc43 expression are located in Purkinje cells and the granule cell layer which contains numerous small neurons.
   [00101]  C. Placenta
   [00102]  Strong reactivity of monoclonal antibody 38I2C with human syncytiotrophoblasts was also observed in development of the placenta at an early stage (5-7 weeks of gestation). Expression appeared to gradually decrease as the stage progressed, indicating that pc43 may be involved in the implantation of fertilized eggs into the placenta.
   [00103]  D. Neuroblastoma and Astrocytoma Cells
   [00104]  Immunocytochemical localization of pc43 in Sk-N-SH neuroblastoma cells and UW28 astrocytoma cells using anti-pc43 antibodies reveals a punctate cell surface distribution of pc43, and in some cells there is a localization at the tips of extensions of neuronal foot processes. At sites of cell-cell contact of UW28 astrocytoma cells, pc43 is organized in a series of parallel lines. The lines start at the contact site and extend approximately 5 microns. F-actin microfilaments were identified with rhodamine-phalloidin (Molecular Probes, Eugene, Oreg., as described by the manufacturer), showing that the microfilaments in the cell appear to end in the pc43 linear structures which extend from the edge of the cell at sites of cell contact.
   [00105]  Immunoblotting studies with pc43 specific antibodies show that a protein with a molecular weight of 140 kDa is recognized in human Sk-N-SH neuroblastoma cells and in UW28 astrocytoma cells.
   [00106]  E. Osteoblasts
   [00107]  Immunocytochemical localization of pc43 using monoclonal antibody 38I2C in two human osteogenic sarcoma cell lines [SaOS (ATCC HTB 85) and MG-63 (ATCC CRL 1427)] and in cultures of normal human trabecular osteoblasts [culture system described in Civitelli et al., J. Clin. Invest., 91: 1888-1896 (1993)] showed that pc43 is expressed in osteoblasts in a pattern similar to that seen in UW28 astrocytoma cells. At sites of cell-cell contact, pc43 is organized in a series of parallel lines that appear to correspond to the actin stress fibers. In addition, in some cells, pc43 appears to localize at the tips of contacting cell processes. Northern blot analysis provides additional evidence that pc43 is expressed in normal human trabecular osteoblasts. A pc43 specific DNA probe hybridized to a major band of 5 kb in samples of poly-A mRNA isolated from normal human trabecular osteoblasts.

EXAMPLE 12

   [00108]  In situ hybridization experiments using protocadherin specific RNA probes were performed on cryosections of rat tissue.
   [00109]  Sense and antisense ³⁵S-riboprobes were made using the standard procedure described by Promega (Madison, Wis.). An approximately 400 bp EcoRI-XbaI fragment of the MOUSE-326 cDNA clone was used as a pc42 specific probe. This fragment encodes the middle of EC-3 to the end of EC-4 of pc42. An approximately 700 bp SmaI fragment of the RAT-218 cDNA clone was used as a pc43 specific probe. The fragment encodes the end of EC-3 to the end of EC-5 of pc43.
   [00110]  Rat adult tissues were harvested and immediately embedded with OCT compound (Tissue-Tek) in cryomolds and quickly frozen in a bath of 95% ethanol/dry ice. The frozen blocks were stored at -80° C. until cut. Six micron tissue sections were cut using a cryostat (Reichert-Jung, Model #2800 Frigocut N, Leica, Inc., Gilroy, Calif.). Cut tissue sections were stored at -80° C.
   [00111]  The in situ protocol used was a variation of that described by Angerer et al., Methods in Enzymology, 152: 649-660, (1987). All solutions were treated with diethylpyrocarbonate (DEPC, Sigma, St. Louis, Mo.) to remove RNase contamination. The tissue sections were first fixed in 4% paraformaldehyde at 4° C. for 20 minutes. To remove excess paraformaldehyde and stop the tissue fixation, the slides were washed in PBS (phosphate buffered saline), denatured in a graded series of alcohols (70, 95, 100%) and then dried. To prevent the tissue from detaching from the glass slide during the in situ procedure, the tissue sections were treated in a poly-L-lysine solution (Sigma) at room temperature for 10 minutes. To denature all RNA in the tissue, the sections were placed in a solution of 70% formamide/2× SSC (0.15M NaCl/0.3M Na citrate, pH 7.0) at 70° C. for 2 minutes after which they were rinsed in chilled 2× SSC, dehydrated in a graded series of alcohols and then dried. Once dried, the sections were prehybridized in hybridization buffer [50% formamide/50 mM DTT (dithiothreitol)/0.3M NaCl/20 mM Tris, pH 8.0/5 mM EDTA/1× Denhardt's (0.02% Ficoll Type 400/0.02% polyvinylpyrrolidone/0.02% BSA)/10% Dextran Sulfate] at the final hybridization temperature for approximately 4 hours. After prehybridization, approximately 1×10⁶ cpm of the appropriate riboprobe was added to each section. The sections were generally hybridized at 45° C. overnight (12-16 hours). To ensure that the hybridization seen was specific, in some experiments the hybridization stringency was increased by raising the hybridization temperature to 50° C. As both the 45° C. and 50° C. experiments gave comparable results, the standard hybridization temperature used was 45° C.
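The hybridization buffer composition given above can be turned into pipetting volumes with the usual dilution relation (final concentration × final volume = stock concentration × stock volume). The following Python sketch is illustrative only: the final concentrations come from the text, but the stock concentrations, the 10 ml batch size, and the names `HYB_BUFFER` and `stock_volumes` are assumptions introduced here and are not part of the described procedure.

```python
# Final concentrations are those listed for the hybridization buffer in the text.
# Stock concentrations are ASSUMED for illustration; the text does not state them.
HYB_BUFFER = {
    # component (unit): (final concentration, assumed stock concentration)
    "formamide (%)": (50.0, 100.0),
    "DTT (mM)": (50.0, 1000.0),
    "NaCl (M)": (0.3, 5.0),
    "Tris pH 8.0 (mM)": (20.0, 1000.0),
    "EDTA (mM)": (5.0, 500.0),
    "Denhardt's (x)": (1.0, 50.0),
    "dextran sulfate (%)": (10.0, 50.0),
}


def stock_volumes(recipe, final_volume_ml):
    """Return the volume (ml) of each stock, plus water, for one batch.

    Each volume follows from C_final * V_final = C_stock * V_stock.
    """
    volumes = {}
    for name, (final_conc, stock_conc) in recipe.items():
        if final_conc > stock_conc:
            raise ValueError(f"stock for {name} is too dilute")
        volumes[name] = final_volume_ml * final_conc / stock_conc
    water = final_volume_ml - sum(volumes.values())
    if water < 0:
        raise ValueError("stocks are too dilute to reach the final volume")
    volumes["water"] = water
    return volumes


if __name__ == "__main__":
    # Hypothetical 10 ml batch.
    for component, ml in stock_volumes(HYB_BUFFER, 10.0).items():
        print(f"{component:22s} {ml:6.2f} ml")
```

With more dilute stocks than those assumed here, the water volume can become negative, which the function reports as an error rather than returning an unusable recipe.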
   [00112]  To remove excess, nonhybridized probe, the sections were put through a series of washes. The sections were first rinsed in 4× SSC to remove the bulk of the hybridization solution and probe. Next a 15 minute wash in 4× SSC/50 mM DTT was carried out at room temperature. Washes at increased stringencies were also utilized. A 40 minute wash in 50% formamide/2× SSC/50 mM DTT was performed at 60° C. Four final room temperature washes were carried out for 10 minutes each: two in 2× SSC and two in 0.1× SSC. The washed slides were dehydrated in a graded series of alcohols and dried.
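The wash series just described can be written down as a small data structure, which makes the order of the steps and the total timed wash duration easy to check. This Python sketch is illustrative only: the solutions, durations and temperatures are taken from the text, while the `Wash` class, the `WASHES` list and the `total_timed_minutes` helper are names assumed here. The initial 4× SSC rinse has no stated duration or temperature, so it is recorded without one rather than guessed.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass(frozen=True)
class Wash:
    solution: str
    minutes: Optional[int]  # None where the text gives no duration
    temperature: str
    repeats: int = 1


# Post-hybridization washes, in the order given in the text.
WASHES: List[Wash] = [
    Wash("4x SSC (rinse)", None, "not stated"),
    Wash("4x SSC / 50 mM DTT", 15, "room temperature"),
    Wash("50% formamide / 2x SSC / 50 mM DTT", 40, "60 C"),
    Wash("2x SSC", 10, "room temperature", repeats=2),
    Wash("0.1x SSC", 10, "room temperature", repeats=2),
]


def total_timed_minutes(washes: List[Wash]) -> int:
    """Sum the durations of all washes that have a stated duration."""
    return sum(w.minutes * w.repeats for w in washes if w.minutes is not None)


if __name__ == "__main__":
    for w in WASHES:
        duration = "untimed" if w.minutes is None else f"{w.minutes} min"
        print(f"{w.repeats} x {w.solution}: {duration}, {w.temperature}")
    print("Total timed wash duration:", total_timed_minutes(WASHES), "min")
```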
   [00113]  To visualize the hybridized probe, the slides were dipped in Kodak NTB2 nuclear emulsion (International Biotechnology, New Haven, Conn.) which had been diluted 1:1 in dH₂O. Once dry, the slides were stored at 4° C. in light-tight boxes for the appropriate exposure time. The in situ slides were independently viewed by two persons and scored positive or negative for hybridization signal.
   [00114]  All in situ hybridization studies were performed on rat tissue. Because results from Northern blot experiments (see Example 9) indicated that both pc42 and pc43 are expressed in adult brain, in situ hybridization studies were carried out to localize the expression of these molecules to specific brain cell types. Hybridization seen in the normal adult rat brain was specific (no background hybridization was seen with the sense probes) and was localized to specific regions in the brain. The overall pattern of expression seen for pc42 and pc43 was very similar, with the major difference being in the level of expression. pc43 appears to be expressed at a lower level than pc42. Both molecules are expressed in the germinal and pyramidal cells of the hippocampus, Purkinje cells of the cerebellum and neurons in gray matter. In addition, pc42 is expressed in glial cells in the white matter but, in contrast to the expression of pc43 in glioma cell lines (as described in Example 9), expression of pc43 in normal glial cells was not observed. In the spinal cord, both protocadherins are expressed in the motor neurons in the gray matter and pc42 is expressed in the glial cells in the white matter.
   [00115]  When expression of both protocadherin molecules was analyzed in brains and spinal cords from rats having EAE (experimental allergic encephalomyelitis) [Vandenbark et al., Cell. Immunol., 12: 85-93 (1974)], the same structures as described above were found to be positive. In addition, expression of pc42 was observed in the leukocytic infiltrates in the EAE tissues. Expression of pc42 in leukocytes was confirmed by in situ hybridization analysis of two leukocytic cell lines, RBL-1 and y3.
   [00116]  Expression of both protocadherin-42 and -43 was observed in the developing brain of rat embryos at all embryological days tested (E15-E19). In addition protocadherin-43 was observed in the developing rat heart at all embryological days tested (E13-E19). This finding is consistent with the immunohistochemistry results showing protocadherin-43 expression in adult heart.
   [00117]  To determine possible roles of protocadherins in the development of the nervous system, expression profiles of protocadherin members in developing rat brain and adult rat brain were also examined by in situ hybridization. A series of coronal, sagittal and horizontal sections of rat brains at postnatal days 0, 6, 14, 30 (P0 through P30) and at 3 months (young adult) were hybridized with labelled cRNA probes corresponding to various protocadherins of the invention including pc42, pc43, RAT-212, RAT-411, and RAT-418. In developing brain, RAT-411 was expressed at high levels in neurons of the olfactory bulb, i.e., mitral cells and periglomerular cells. The expression of RAT-411 mRNA was transient; expression appeared at P0, peaked at P6, diminished by P14, and was undetectable at P30 and in adult brain. In the adult, pc43 mRNA was found to be expressed predominantly in Purkinje cells in the cerebellum. The expression of pc43 mRNA in Purkinje cells was observed from the beginning of Purkinje cell differentiation at around P6. Other protocadherin members were expressed at very low levels in various areas of developing and adult brains. These results indicate that protocadherin members are differentially expressed during the development of the central nervous system, and suggest that RAT-411 and pc43 have specific roles during the development of olfactory bulb neurons and Purkinje cells, respectively.

EXAMPLE 13

   [00118]  Conventional immunoprecipitations using pc43-specific polyclonal antibodies and monoclonal antibody 38I2C were performed to identify proteins that interacted with pc43 in L cell transfectants.
   [00119]  The pc43 and chimeric pc43 transfectants were metabolically labeled by incubating the cells in Dulbecco's modified Eagle's medium containing [³⁵S]methionine (50 µCi/ml) overnight. After washing, the transfectants were lysed with PBS containing Triton X-100 and incubated with anti-pc43 antibody. The immunocomplexes were then collected using protein A-Sepharose beads. The resulting beads were washed five times with a washing buffer (50 mM Tris-HCl, pH 8.0, containing 0.5M NaCl, 0.1% ovalbumin, 0.5% NP-40, 0.5% Triton X-100 and 1 mM EDTA) at room temperature. Protein was separated by SDS-PAGE and subjected to autoradiography.
   [00120]  The chimeric pc43 co-precipitated with 105 kDa and 95 kDa bands that are likely to correspond to α- and β-catenins, respectively, because anti-α-catenin and anti-β-catenin antibodies stained comparable bands. Pc43, on the other hand, co-precipitated with a 120 kDa band.
   [00121]  While the present invention has been described in terms of specific methods and compositions, it is understood that variations and modifications will occur to those skilled in the art. Therefore, only such limitations as appear in the claims should be placed on the invention.

SEQUENCE LISTING

(1) GENERAL INFORMATION: (iii) NUMBER OF SEQUENCES: 115
(2) INFORMATION FOR SEQ ID NO:1: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 17 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE: DNA (xi) SEQUENCE DESCRIPTION: SEQ ID NO:1: AARSSNNTNGAYTRYGA17
(2) INFORMATION FOR SEQ ID NO:2: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 17 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE: DNA (xi) SEQUENCE DESCRIPTION: SEQ ID NO:2: TTRCTRTTRCGNGGNNN17
(2) INFORMATION FOR SEQ ID NO:3: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 131 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE: cDNA (xi) SEQUENCE DESCRIPTION: SEQ ID NO:3: AAGGGAGTGGACTTTGAGGAGCAGCCTGAGCTTAGTCTCATCCTCACGGCTTTGGATGGA60 GGGACTCCATCCAGGTCTGGGACTGCATTGGTTCAAGTGGAAGTCATAGATGCCAATGAC120 AACGCACCGTA131
(2) INFORMATION FOR SEQ ID NO:4: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 43 amino acids (B) TYPE: amino acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE: protein (xi) SEQUENCE DESCRIPTION: SEQ ID NO:4: LysGlyValAspPheGluGluGlnProGluLeuSerLeuIleLeuThr 151015 AlaLeuAspGlyGlyThrProSerArgSerGlyThrAlaLeuValGln 202530 ValGluValIleAspAlaAsnAspAsnAlaPro 3540
(2) INFORMATION FOR SEQ ID NO:5: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 131 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE: cDNA (xi) SEQUENCE DESCRIPTION: SEQ ID NO:5: AAACGCATGGATTTCGAGGAGTCTTCCTCCTACCAGATCTATGTGCAAGCTACTGACCGG60 
GGACCAGTACCCATGGCGGGTCATTGCAAGGTGTTGGTGGACATTATAGATGTGAACGAC120 AACGCACCTAA131 (2) INFORMATION FOR SEQ ID NO:6: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 43 amino acids (B) TYPE: amino acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE: protein (xi) SEQUENCE DESCRIPTION: SEQ ID NO:6: LysAlaMetAspPheGluGluSerSerSerTyrGlnIleTyrValGln 151015 AlaThrAspArgGlyProValProMetAlaGlyHisCysLysValLeu 202530 ValAspIleIleAspValAsnAspAsnAlaPro 3540 (2) INFORMATION FOR SEQ ID NO:7: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 131 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE: cDNA (xi) SEQUENCE DESCRIPTION: SEQ ID NO:7: AAGCGACTGGACTTTGAGACCCTGCAGACCTTCGAGTTCAGCGTGGGTGCCACAGACCAT60 GGCTCCCCCTCGCTCCGCAGTCAGGCTCTGGTGCGCGTGGTGGTGCTGGACCACAATGAC120 AATGCCCCCAA131 (2) INFORMATION FOR SEQ ID NO:8: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 43 amino acids (B) TYPE: amino acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE: protein (xi) SEQUENCE DESCRIPTION: SEQ ID NO:8: LysArgLeuAspPheGluThrLeuGlnThrPheGluPheSerValGly 151015 AlaThrAspHisGlySerProSerLeuArgSerGlnAlaLeuValArg 202530 ValValValLeuAspHisAsnAspAsnAlaPro 3540 (2) INFORMATION FOR SEQ ID NO:9: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 131 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE: cDNA (xi) SEQUENCE DESCRIPTION: SEQ ID NO:9: AAGGGCCTGGATTACGAGGCACTGCAGTCCTTCGAGTTCTACGTGGGCGCTACAGATGGA60 GGCTCACCCGCGCTCAGCAGCCAGACTCTGGTGCGGATGGTGGTGCTGGATGACAACGAC120 AACGCCCCTAA131 (2) INFORMATION FOR SEQ ID NO:10: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 43 amino acids (B) TYPE: amino acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE: protein (xi) SEQUENCE DESCRIPTION: SEQ ID NO:10: LysGlyLeuAspTyrGluAlaLeuGlnSerPheGluPheTyrValGly 151015 AlaThrAspGlyGlySerProAlaLeuSerSerGlnThrLeuValArg 202530 MetValValLeuAspAspAsnAspAsnAlaPro 3540 (2) INFORMATION FOR SEQ ID NO:11: (i) SEQUENCE 
CHARACTERISTICS: (A) LENGTH: 131 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE: cDNA (xi) SEQUENCE DESCRIPTION: SEQ ID NO:11: AAGGCGTTTGATTTTGAGGATCAGAGAGAGTTCCAGCTAACCGCTCATATAAACGACGGA60 GGTACCCCGGTTTTGGCCACCAACATCAGCGTGAACATATTTGTTACTGACCGCAATGAC120 AACGCCCCGCA131 (2) INFORMATION FOR SEQ ID NO:12: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 43 amino acids (B) TYPE: amino acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE: protein (xi) SEQUENCE DESCRIPTION: SEQ ID NO:12: LysAlaPheAspPheGluAspGlnArgGluPheGlnLeuThrAlaHis 151015 IleAsnAspGlyGlyThrProValLeuAlaThrAsnIleSerValAsn 202530 IlePheValThrAspArgAsnAspAsnAlaPro 3540 (2) INFORMATION FOR SEQ ID NO:13: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 131 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE: cDNA (xi) SEQUENCE DESCRIPTION: SEQ ID NO:13: AAGGCGGTGGATTACGAAATCACCAAGTCCTATGAGATAGATGTTCAAGCCCAAGATCTG60 GGTCCCAATTCTATTCCTGCTCATTGCAAAATTATAATTAAGGTCGTGGATGTCAACGAC120 AACGCTCCCAA131 (2) INFORMATION FOR SEQ ID NO:14: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 43 amino acids (B) TYPE: amino acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE: protein (xi) SEQUENCE DESCRIPTION: SEQ ID NO:14: LysAlaValAspTyrGluIleThrLysSerTyrGluIleAspValGln 151015 AlaGlnAspLeuGlyProAsnSerIleProAlaHisCysLysIleIle 202530 IleLysValValAspValAsnAspAsnAlaPro 3540 (2) INFORMATION FOR SEQ ID NO:15: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 135 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE: cDNA (xi) SEQUENCE DESCRIPTION: SEQ ID NO:15: TATGACCATGATTACGAGACAACCAAAGAATATACACTGCGGATCCGGGCCCAGGATGGT60 GGCCGGACTCCACTTTCCAACGTCTCCGGTCTAGTAACCGTGCAGGTCCTAGACATCAAC120 GACAATGCCCCCCCA135 (2) INFORMATION FOR SEQ ID NO:16: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 44 amino acids (B) TYPE: amino acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE: 
protein (xi) SEQUENCE DESCRIPTION: SEQ ID NO:16: TyrAspHisAspTyrGluThrThrLysGluTyrThrLeuArgIleArg 151015 AlaGlnAspGlyGlyArgThrProLeuSerAsnValSerGlyLeuVal 202530 ThrValGlnValLeuAspIleAsnAspAsnAlaPro 3540 (2) INFORMATION FOR SEQ ID NO:17: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 129 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE: cDNA (xi) SEQUENCE DESCRIPTION: SEQ ID NO:17: GGGGGGTCGATTACGAGGAGAACGGCATGTTAGAGATCGACGTGCAGGCCAGAGACCTAG60 GACCTAACCCAATTCCAGCCCATTGCAAGGTCACAGTCAAGCTCATCGACCGCAATGATA120 ACGCCCCCA129 (2) INFORMATION FOR SEQ ID NO:18: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 43 amino acids (B) TYPE: amino acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE: protein (xi) SEQUENCE DESCRIPTION: SEQ ID NO:18: ArgGlyValAspTyrGluGluAsnGlyMetLeuGluIleAspValGln 151015 AlaArgAspLeuGlyProAsnProIleProAlaHisCysLysValThr 202530 ValLysLeuIleAspArgAsnAspAsnAlaPro 3540 (2) INFORMATION FOR SEQ ID NO:19: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 131 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE: cDNA (xi) SEQUENCE DESCRIPTION: SEQ ID NO:19: AAGGGGTTGGACTACGAAGACACCAAACTCCATGAGATTTACATCCAGGCCAAAGACAAA60 GGTGCCAATCCGGAAGGAGCGCATTGCAAAGTACTGGTAGAGGTTGTGGACGTTAACGAC120 AATGCCCCTCA131 (2) INFORMATION FOR SEQ ID NO:20: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 43 amino acids (B) TYPE: amino acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE: protein (xi) SEQUENCE DESCRIPTION: SEQ ID NO:20: LysGlyLeuAspTyrGluAspThrLysLeuHisGluIleTyrIleGln 151015 AlaLysAspLysGlyAlaAsnProGluGlyAlaHisCysLysValLeu 202530 ValGluValValAspValAsnAspAsnAlaPro 3540 (2) INFORMATION FOR SEQ ID NO:21: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 131 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE: cDNA (xi) SEQUENCE DESCRIPTION: SEQ ID NO:21: AAGGGTTTGGACTTTGAGCAAGTAGATGTCTACAAAATCCGCGTTGACGCGACGGACAAA60 
GGACACCCTCCGATGGCAGGCCATTGCACTGTTTTAGTGAGGGTATTGGATGAAAACGAC120 AATGCGCCTCT131 (2) INFORMATION FOR SEQ ID NO:22: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 43 amino acids (B) TYPE: amino acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE: protein (xi) SEQUENCE DESCRIPTION: SEQ ID NO:22: LysGlyLeuAspPheGluGlnValAspValTyrLysIleArgValAsp 151015 AlaThrAspLysGlyHisProProMetAlaGlyHisCysThrValLeu 202530 ValArgValLeuAspGluAsnAspAsnAlaPro 3540 (2) INFORMATION FOR SEQ ID NO:23: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 134 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE: cDNA (xi) SEQUENCE DESCRIPTION: SEQ ID NO:23: AAGGGTATAGACTTCGAGCAGATCAAGGACTTCAGCTTTCAAGTGGAAGCCCGGGACGCC60 GGCAGTCCCCAGGCGCTGTCCGGCAACTGCACTGTCAACATCTTGATAGTGGATCAGAAC120 GACAACGCCCCTAA134 (2) INFORMATION FOR SEQ ID NO:24: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 44 amino acids (B) TYPE: amino acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE: protein (xi) SEQUENCE DESCRIPTION: SEQ ID NO:24: LysGlyIleAspPheGluGlnIleLysAspPheSerPheGlnValGlu 151015 AlaArgAspAlaGlySerProGlnAlaLeuAlaGlyAsnThrThrVal 202530 AsnIleLeuIleValAspGlnAsnAspAsnAlaPro 3540 (2) INFORMATION FOR SEQ ID NO:25: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 134 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE: cDNA (xi) SEQUENCE DESCRIPTION: SEQ ID NO:25: AAGCCGTTCGACTATGAGCAAACCGCCAACACGCTGGCACAGATTGACGCCGTGCTGGAA60 AAACAGGGCAGCAATAAATCGAGCATTCTGGATGCCACCATTTTCCTGGCCGATAAAAAC120 GACAATGCGCCAGA134 (2) INFORMATION FOR SEQ ID NO:26: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 44 amino acids (B) TYPE: amino acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE: protein (xi) SEQUENCE DESCRIPTION: SEQ ID NO:26: LysProPheAspTyrGluGlnThrAlaAsnThrLeuAlaGlnIleAsp 151015 AlaValLeuGluLysGlnGlySerAsnLysSerSerIleLeuAspAla 202530 ThrIlePheLeuAlaAspLysAsnAspAsnAlaPro 3540 (2) INFORMATION FOR SEQ ID NO:27: (i) 
SEQUENCE CHARACTERISTICS: (A) LENGTH: 131 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE: cDNA (xi) SEQUENCE DESCRIPTION: SEQ ID NO:27: AAGCGGCTGGATTTCGAACAGTTCCAGCAGCACAAGCTGCTCGTAAGGGCTGTTGATGGA60 GGAATGCCGCCACTGAGCAGCGATGTGGTCGTCACTGTGGATGTCACCGACCTCAACGAT120 AACGCGCCCTA131 (2) INFORMATION FOR SEQ ID NO:28: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 43 amino acids (B) TYPE: amino acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE: protein (xi) SEQUENCE DESCRIPTION: SEQ ID NO:28: LysArgLeuAspPheGluGlnPheGlnGlnHisLysLeuLeuValArg 151015 AlaValAspGlyGlyMetProProLeuSerSerAspValValValThr 202530 ValAspValThrAspLeuAsnAspAsnAlaPro 3540 (2) INFORMATION FOR SEQ ID NO:29: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 131 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE: cDNA (xi) SEQUENCE DESCRIPTION: SEQ ID NO:29: AAGGGGATAGACTTTGAGAGTGAGAATTACTATGAATTTGATGTGCGGGCTCGCGATGGG60 GGTTCTCCAGCCATGGAGCAACATTGCAGCCTTCGAGTGGATCTGCTGGACGTAAATGAC120 AACGCCCCACT131 (2) INFORMATION FOR SEQ ID NO:30: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 43 amino acids (B) TYPE: amino acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE: protein (xi) SEQUENCE DESCRIPTION: SEQ ID NO:30: LysGlyIleAspPheGluSerGluAsnTyrTyrGluPheAspValArg 151015 AlaArgAspGlyGlySerProAlaMetGluGlnHisCysSerLeuArg 202530 ValAspLeuLeuAspValAsnAspAsnAlaPro 3540 (2) INFORMATION FOR SEQ ID NO:31: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 131 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE: cDNA (xi) SEQUENCE DESCRIPTION: SEQ ID NO:31: AAGGCATTGGACTTTGAGGCCCGGCGACTGTATTCGCTGACAGTTCAGGCCACGGACCGA60 GGCGTGCCCTCGCTCACCGGGCGTGCCGAAGCGCTTATCCAGCTGCTAGATGTCAACGAC120 AACGCACCCAT131 (2) INFORMATION FOR SEQ ID NO:32: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 43 amino acids (B) TYPE: amino acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE: 
protein (xi) SEQUENCE DESCRIPTION: SEQ ID NO:32: LysAlaLeuAspPheGluAlaArgArgLeuTyrSerLeuThrValGln 151015 AlaThrAspArgGlyValProSerLeuThrGlyArgAlaGluAlaLeu 202530 IleGlnLeuLeuAspValAsnAspAsnAlaPro 3540 (2) INFORMATION FOR SEQ ID NO:33: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 125 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE: cDNA (xi) SEQUENCE DESCRIPTION: SEQ ID NO:33: AAGCCAATTGATTACGAGGCAACTCCATACTATAACATGGAAATTGTAGCCACAGACAGC60 GGAGGTCTTTCGGGAAAATGCACTGTGTCTATACAGGTGGTGGATGTGAACGACAACGCC120 CCCAA125 (2) INFORMATION FOR SEQ ID NO:34: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 41 amino acids (B) TYPE: amino acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE: protein (xi) SEQUENCE DESCRIPTION: SEQ ID NO:34: LysProIleAspTyrGluAlaThrProTyrTyrAsnMetGluIleVal 151015 AlaThrAspSerGlyGlyLeuSerGlyLysCysThrValSerIleGln 202530 ValValAspValAsnAspAsnAlaPro 3540 (2) INFORMATION FOR SEQ ID NO:35: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 446 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE: cDNA (xi) SEQUENCE DESCRIPTION: SEQ ID NO:35: AAGCGGGTAGACTTCGAAATGTGCAAAAGATTTTACCTTGTGGTGGAAGCTAAAGACGGA60 GGCACCCCAGCCCTCAGCACGGCAGCCACTGTCAGCATCGACCTCACAGATGTGAATGAT120 AACCCTCCTCGGTTCAGCCAAGATGTCTACAGTGCTGTCATCAGTGAGGATGCCTTAGAG180 GGGGACTCTGTCATTCTGCTGATAGCAGAAGATGTGGATAGCAAGCCTAATGGACAGATT240 CGGTTTTCCATCGTGGGTGGAGATAGGGACAATGAATTTGCTGTCGATCCAATCTTGGGA300 CTTGTGAAAGTTAAGAAGAAACTGGACCGGGAGCGGGTGTCAGGATACTCCCTGCTCATC360 CAGGCAGTAGATAGTGGCATTCCTGCAATGTCCTCAACGACAACTGTCAACATTGATATT420 TCTGATGTGAACGACAACGCCCCCCT446 (2) INFORMATION FOR SEQ ID NO:36: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 148 amino acids (B) TYPE: amino acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE: protein (xi) SEQUENCE DESCRIPTION: SEQ ID NO:36: LysArgValAspPheGluMetCysLysArgPheTyrLeuValValGlu 151015 AlaLysAspGlyGlyThrProAlaLeuSerThrAlaAlaThrValSer 202530 
IleAspLeuThrAspValAsnAspAsnProProArgPheSerGlnAsp 354045 ValTyrAspAlaValIleSerGluAspAlaLeuGluGlyAspSerVal 505560 IleLeuLeuIleAlaGluAspValAspSerLysProAsnGlyGlnIle 65707580 ArgPheSerIleValGlyGlyAspArgAspAsnGluPheAlaValAsp 859095 ProIleLeuGlyLeuValLysValLysLysLysLeuAspArgGluArg 100105110 ValSerGlyTyrSerLeuLeuIleGlnAlaValAspSerGlyIlePro 115120125 AlaMetSerSerThrThrThrValAsnIleAspIleSerAspValAsn 130135140 AspAsnAlaPro 145 (2) INFORMATION FOR SEQ ID NO:37: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 440 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE: cDNA (xi) SEQUENCE DESCRIPTION: SEQ ID NO:37: AAGGGGGTTGATTATGAGACAAACCCACGGCTACGACTGGTGCTACAGGCAGAGAGTGGA60 GGAGCCTTTGCTTTCTCGGTGCTGACCCTGACCCTTCAAGATGCCAATGACAATGCTCCC120 CGTTTCCTGCAGCCTCACTACGTGGCTTTCCTGCCAGAGTCCCGACCCTTGGAAGGGCCC180 CTGCTGCAGGTGGAAGCAGACGACCTGGATCAAGGCTCTGGAGGACAGATCTCCTACAGT240 CTGGCTGCATCCCAGCCAGCACGGGGCTTGTTCCATGTAGACCCAGCCACAGGCACTATC300 ACTACCACAGCCATCCTGGACCGGGAAATCTGGGCTGAAACACGGCTGGTACTGATGGCC360 ACAGACAGAGGAAGCCCAGCATTGGTGGGCTCAGCTACCCTGACAGTGATGGTCATCGAT420 ACCAACGACAATGCTCCCCT440 (2) INFORMATION FOR SEQ ID NO:38: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 146 amino acids (B) TYPE: amino acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE: protein (xi) SEQUENCE DESCRIPTION: SEQ ID NO:38: LysGlyValAspTyrGluThrAsnProArgLeuArgLeuValLeuGln 151015 AlaGluSerGlyGlyAlaPheAlaPheSerValLeuThrLeuThrLeu 202530 GlnAspAlaAsnAspAsnAlaProArgPheLeuGlnProHisTyrVal 354045 AlaPheLeuProGluSerArgProLeuGluGlyProLeuLeuGlnVal 505560 GluAlaAsnAspLeuAspGlnGlySerGlyGlyGlnIleSerTyrSer 65707580 LeuAlaAlaSerGlnProAlaArgGlyLeuPheHisValAspProAla 859095 ThrGlyThrIleThrThrThrAlaIleLeuAspArgGluIleTrpAla 100105110 GluThrArgLeuValLeuMetAlaThrAspArgGlySerProAlaLeu 115120125 ValGlySerAlaThrLeuThrValMetValIleAspThrAsnAspAsn 130135140 AlaPro 145 (2) INFORMATION FOR SEQ ID NO:39: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 124 base pairs (B) TYPE: nucleic acid (C) 
STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE: cDNA (xi) SEQUENCE DESCRIPTION: SEQ ID NO:39: AAGGTCTCGATTATGAGGCAACTCCATATTATAACGTGGAAATTGTAGCCACAGATGGTG60 GGGGCCTTTCAGGAAAATGCACTGTGGCTATAGAAGTGGTGGATGTGAACGACGGCGCTC120 CAAT124 (2) INFORMATION FOR SEQ ID NO:40: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 41 amino acids (B) TYPE: amino acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE: protein (xi) SEQUENCE DESCRIPTION: SEQ ID NO:40: LysGlyLeuAspTyrGluAlaThrProTyrTyrAsnValGluIleVal 151015 AlaThrAspGlyGlyAlaPheAspGluAsnCysThrValAlaIleGlu 202530 ValValAspValAsnAspAsnAlaPro 3540 (2) INFORMATION FOR SEQ ID NO:41: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 8 amino acids (B) TYPE: amino acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE: DNA (xi) SEQUENCE DESCRIPTION: SEQ ID NO:41: AspXaaAsnGluXaaProXaaPhe 15 (2) INFORMATION FOR SEQ ID NO:42: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 8 amino acids (B) TYPE: amino acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE: DNA (xi) SEQUENCE DESCRIPTION: SEQ ID NO:42: AspXaaAspGluXaaProXaaPhe 15 (2) INFORMATION FOR SEQ ID NO:43: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 9 amino acids (B) TYPE: amino acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE: DNA (xi) SEQUENCE DESCRIPTION: SEQ ID NO:43: AspXaaAsnAspAsnXaaProXaaPhe 15 (2) INFORMATION FOR SEQ ID NO:44: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 131 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE: cDNA (xi) SEQUENCE DESCRIPTION: SEQ ID NO:44: AAGCGGATGGATTTTGAAGACACCAAACTCCATGAGATTTACATCCAGGCCAAAGACAAA60 GGTGCCAATCCCGAAGGAGCGCATTGCAAAGTACTTGTAGAGGTTGTAGACGTAAACGAC120 AACGCCCCAGT131 (2) INFORMATION FOR SEQ ID NO:45: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 43 amino acids (B) TYPE: amino acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE: protein (xi) SEQUENCE DESCRIPTION: SEQ ID NO:45: 
LeuArgMetAspPheGluAspThrLysLeuHisGluIleTyrIleGln 151015 AlaLysAspLysGlyAlaAsnProGluGlyAlaHisCysLysValLeu 202530 ValGluValValAspValAsnAspAsnAlaPro 3540 (2) INFORMATION FOR SEQ ID NO:46: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 131 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE: cDNA (xi) SEQUENCE DESCRIPTION: SEQ ID NO:46: AAGGCTTTGGATTACGAGGATCAGAGAGAGTTCCAACTAACAGCTCATATAAACGACGGA60 GGTACCCCAGTCTTAGCCACCAACATCAGCGTGAACGTATTTGTTACTGACCGCAATGAT120 AACGCCCCCTA131 (2) INFORMATION FOR SEQ ID NO:47: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 43 amino acids (B) TYPE: amino acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE: protein (xi) SEQUENCE DESCRIPTION: SEQ ID NO:47: LysAlaLeuAspTyrGluAspGlnArgGluPheGlnLeuThrAlaHis 151015 IleAsnAspGlyGlyThrProValLeuAlaThrAsnIleSerValAsn 202530 ValPheValThrAspArgAsnAspAsnAlaPro 3540 (2) INFORMATION FOR SEQ ID NO:48: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 131 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE: cDNA (xi) SEQUENCE DESCRIPTION: SEQ ID NO:48: AAGCGCTTGGACTACGAGGAGAGTAACAATTATGAAATTCACGTGGATGCTACAGATAAA60 GGATACCCACCTATGGTTGCTCACTGCACCGTACTCGTGGGAATCTTGGATGAAAATGAC120 AACGCACCCAT131 (2) INFORMATION FOR SEQ ID NO:49: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 43 amino acids (B) TYPE: amino acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE: protein (xi) SEQUENCE DESCRIPTION: SEQ ID NO:49: LysArgLeuAspTyrGluGluSerAsnAsnTyrGluIleHisValAsp 151015 AlaThrAspLysGlyTyrProProMetValAlaHisCysThrValLeu 202530 ValGlyIleLeuAspGluAsnAspAsnAlaPro 3540 (2) INFORMATION FOR SEQ ID NO:50: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 131 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE: cDNA (xi) SEQUENCE DESCRIPTION: SEQ ID NO:50: AAACCGGTGGACTACGAGAAAGTCAAAGACTATACCATCGAGATCGTGGCTGTGGATTCC60 GGCAACCCTCCACTCTCTAGCACCAACTCCCTCAAGGTGCAGGTGGTAGACGTCAACGAT120 
AACGCCCCTCT131 (2) INFORMATION FOR SEQ ID NO:51: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 43 amino acids (B) TYPE: amino acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE: protein (xi) SEQUENCE DESCRIPTION: SEQ ID NO:51: LysProValAspTyrGluLysValLysAspTyrThrIleGluIleVal 151015 AlaValAspSerGlyAsnProProLeuSerSerThrAsnSerLeuLys 202530 ValGlnValValAspValAsnAspAsnAlaPro 3540 (2) INFORMATION FOR SEQ ID NO:52: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 131 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE: cDNA (xi) SEQUENCE DESCRIPTION: SEQ ID NO:52: AAGCCTTTTGATTTCGAGGACACCAAACTCCATGAGATTTACATCCAGGCCAAAGACAAG60 GGCGCCAATCCCGAAGGAGCACATTGCAAAGTGTTGGTGGAGGTTGTGGATGTGAACGAC120 AATGCCCCTCA131 (2) INFORMATION FOR SEQ ID NO:53: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 43 amino acids (B) TYPE: amino acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE: protein (xi) SEQUENCE DESCRIPTION: SEQ ID NO:53: LysProPheAspPheGluAspThrLysLeuHisGluIleTyrIleGln 151015 AlaLysAspLysGlyAlaAsnProGluGlyAlaHisCysLysValLeu 202530 ValGluValValAspValAsnAspAsnAlaPro 3540 (2) INFORMATION FOR SEQ ID NO:54: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 122 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE: cDNA (xi) SEQUENCE DESCRIPTION: SEQ ID NO:54: AAAGGTGTCGATTACGAGGTGAGTCCACGGCTGCGACTGGTGCTGCAGGCAGAGAGTCGA60 GGAGCCTTTGCCTTCACTGTGCTGACCCTGACCCTGCAAGATGCCAACGACAACGCCCCG120 AG122 (2) INFORMATION FOR SEQ ID NO:55: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 40 amino acids (B) TYPE: amino acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE: protein (xi) SEQUENCE DESCRIPTION: SEQ ID NO:55: LysGlyValAspTyrGluValSerProArgLeuArgLeuValLeuGln 151015 AlaGluSerArgGlyAlaPheAlaPheThrValLeuThrLeuThrLeu 202530 GlnAspAlaAsnAspAsnAlaPro 3540 (2) INFORMATION FOR SEQ ID NO:56: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 131 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: 
single (D) TOPOLOGY: linear (ii) MOLECULE TYPE: cDNA (xi) SEQUENCE DESCRIPTION: SEQ ID NO:56: AAAGGGATTGATTACGAGCAGTTGAGAGACCTACAGCTGTGGGTGACAGCCAGCGACAGC60 GGGGACCCGCCTCTTAGCAGCAACGTGTCACTGAGCCTGTTTGTGCTGGACCAGAACGAC120 AACGCCCCCCT131 (2) INFORMATION FOR SEQ ID NO:57: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 43 amino acids (B) TYPE: amino acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE: protein (xi) SEQUENCE DESCRIPTION: SEQ ID NO:57: LysGlyIleAspTyrGluGlnLeuArgAspLeuGlnLeuTrpValThr 151015 AlaSerAspSerGlyAspProProLeuSerSerAsnValSerLeuSer 202530 LeuPheValLeuAspGlnAsnAspAsnAlaPro 3540 (2) INFORMATION FOR SEQ ID NO:58: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 125 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE: cDNA (xi) SEQUENCE DESCRIPTION: SEQ ID NO:58: AAGGCGGTCGATTTTGAGCGCACATCCTCTTATCAACTCATCATTCAGGCCACCAATATG60 GCAGGAATGGCTTCCAATGCTACAGTCAATATTCAGATTGTTGATGAAAACGACAACGCC120 CCCCA125 (2) INFORMATION FOR SEQ ID NO:59: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 41 amino acids (B) TYPE: amino acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE: protein (xi) SEQUENCE DESCRIPTION: SEQ ID NO:59: LysAlaValAspPheGluArgThrSerSerTyrGlnLeuIleIleGln 151015 AlaThrAsnMetAlaGlyMetAlaSerAsnAlaThrValAsnIleGln 202530 IleValAspGluAsnAspAsnAlaPro 3540 (2) INFORMATION FOR SEQ ID NO:60: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 131 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE: cDNA (xi) SEQUENCE DESCRIPTION: SEQ ID NO:60: AAACGGCTAGACTTTGAAAAGATACAAAAATATGTTGTATGGATAGAGGCCAGAGATGGT60 GGTTTCCCTCCTTTCTCCTCTTACGAGAAACTTGATATAACAGTATTAGATGTCAACGAT120 AACGCGCCTAA131 (2) INFORMATION FOR SEQ ID NO:61: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 43 amino acids (B) TYPE: amino acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE: protein (xi) SEQUENCE DESCRIPTION: SEQ ID NO:61: LysArgLeuAspPheGluLysIleGlnLysTyrValValTrpIleGlu 151015 
AlaArgAspGlyGlyPheProProPheSerSerTyrGluLysLeuAsp 202530 IleThrValLeuAspValAsnAspAsnAlaPro 3540 (2) INFORMATION FOR SEQ ID NO:62: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 131 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE: cDNA (xi) SEQUENCE DESCRIPTION: SEQ ID NO:62: AAGGGGATCGATTATGAGAAGGTCAAAGACTACACCATTGAGATTGTGGCTGTGGACTCT60 GGCAACCCCCCACTCTCCAGCACTAACTCCCTCAAGGTGCAGGTGGTGGACGTCAATGAC120 AACGCACCGTG131 (2) INFORMATION FOR SEQ ID NO:63: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 43 amino acids (B) TYPE: amino acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE: protein (xi) SEQUENCE DESCRIPTION: SEQ ID NO:63: LysGlyIleAspTyrGluLysValLysAspTyrThrIleGluIleVal 151015 AlaValAspSerGlyAsnProProLeuSerSerThrAsnSerLeuLys 202530 ValGlnValValAspValAsnAspAsnAlaPro 3540 (2) INFORMATION FOR SEQ ID NO:64: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 131 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE: cDNA (xi) SEQUENCE DESCRIPTION: SEQ ID NO:64: AAGGGACTCGACTACGAGGATCGGCGGGAATTTGAATTAACAGCTCATATCAGCGATGGG60 GGCACCCCGGTCCTAGCCACCAACATCAGCGTGAACATATTTGTCACTGATCGCAACGAT120 AATGCCCCCGT131 (2) INFORMATION FOR SEQ ID NO:65: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 43 amino acids (B) TYPE: amino acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE: protein (xi) SEQUENCE DESCRIPTION: SEQ ID NO:65: LysGlyLeuAspTyrGluAspArgArgGluPheGluLeuThrAlaHis 151015 IleSerAspGlyGlyThrProValLeuAlaThrAsnIleSerValAsn 202530 IlePheValThrAspArgAsnAspAsnAlaPro 3540 (2) INFORMATION FOR SEQ ID NO:66: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 470 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE: cDNA (xi) SEQUENCE DESCRIPTION: SEQ ID NO:66: AAGGGTTTGGACTACGAGACCACACAGGCCTACCAGCTCACGGTCAACGCCACAGATCAA60 GACAACACCAGGCCTCTGTCCACCCTGGCCAACTTGGCCATCATCATCACAGATGTCCAG120 
GACATGGACCCCATCTTCATCAACCTGCCTTACAGCACCAACATCTACGAGCATTCTCCT180 CCGGGCACGACGGTGCGCATCATCACCGCCATAGACCAGGATCAAGGACGTCCCCGGGGC240 ATTGGCTACACCATCGTTTCAGGGAATACCAACAGCATCTTTGCCCTGGACTACATCAGC300 GGAGTGCTGACCTTGAATGGCCTGCTGGACCGGGAGAACCCCCTGTACAGCCATGGCTTC360 ATCCTGACTGTGAAGGGCACGGAGCTGAACGATGACCGCACCCCATCTGACGCTACAGTC420 ACCACGACCTTCAATATCCTGGTTATTGACATCAACGACAACGCCCCACT470 (2) INFORMATION FOR SEQ ID NO:67: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 156 amino acids (B) TYPE: amino acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE: protein (xi) SEQUENCE DESCRIPTION: SEQ ID NO:67: LysGlyLeuAspTyrGluThrThrGlnAlaTyrGlnLeuThrValAsn 151015 AlaThrAspGlnAspAsnThrArgProLeuSerThrLeuAlaAsnLeu 202530 AlaIleIleIleThrAspValGlnAspMetAspProIlePheIleAsn 354045 LeuProTyrSerThrAsnIleTyrGluHisSerProProGlyThrThr 505560 ValArgIleIleThrAlaIleAspGlnAspGlnGlyArgProArgGly 65707580 IleGlyTyrThrIleValSerGlyAsnThrAsnSerIlePheAlaLeu 859095 AspTyrIleSerGlyValLeuThrLeuAsnGlyLeuLeuAspArgGlu 100105110 AsnProLeuTyrSerGlyGlyPheIleLeuThrValLysGlyThrGlu 115120125 LeuAsnAspAspArgThrProSerAspAlaThrValThrThrThrPhe 130135140 AsnIleLeuValIleAspIleAsnAspAsnAlaPro 145150155 (2) INFORMATION FOR SEQ ID NO:68: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 131 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE: cDNA (xi) SEQUENCE DESCRIPTION: SEQ ID NO:68: AAGGGGGTCGATTACGAGGTACTACAGGCCTTTGAGTTCCACGTGAGCGCCACAGACCGA60 GGCTCACCGGGGCTCAGCAGCCAGGCTCTGGTGCGCGTGGTGGTGCTGGACGACAATGAC120 AACGCTCCCGT131 (2) INFORMATION FOR SEQ ID NO:69: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 43 amino acids (B) TYPE: amino acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE: protein (xi) SEQUENCE DESCRIPTION: SEQ ID NO:69: LysGlyValAspTyrGluValLeuGlnAlaPheGluPheHisValSer 151015 AlaThrAspArgGlySerProGlyLeuSerSerGlnAlaLeuValArg 202530 ValValValLeuAspAspAsnAspAsnAlaPro 3540 (2) INFORMATION FOR SEQ ID NO:70: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 131 base 
pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE: cDNA (xi) SEQUENCE DESCRIPTION: SEQ ID NO:70: AAGGGGCTGGATTATGAGCAGTTCCAGACCCTACAACTGGGAGTGACCGCTAGTGACAGT60 GGAAACCCACCATTAAGAAGCAATATTTCACTGACCCTTTTCGTGCTGGACCAGAATGAT120 AACGCCCCAAA131 (2) INFORMATION FOR SEQ ID NO:71: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 43 amino acids (B) TYPE: amino acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE: protein (xi) SEQUENCE DESCRIPTION: SEQ ID NO:71: LysGlyLeuAspTyrGluGlnPheGlnThrLeuGlnLeuGlyValThr 151015 AlaSerAspSerGlyAsnProProLeuArgSerAsnIleSerLeuThr 202530 LeuPheValLeuAspGlnAsnAspAsnAlaPro 3540 (2) INFORMATION FOR SEQ ID NO:72: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 131 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE: cDNA (xi) SEQUENCE DESCRIPTION: SEQ ID NO:72: AAGCGGGTTGATTACGAGGATGTCCAGAAATACTCGCTGAGCATTAAGGCCCAGGATGGG60 CGGCCCCCGCTCATCAATTCTTCAGGGGTGGTGTCTGTGCAGGTGCTGGATGTCAACGAC120 AATGCCCCGGA131 (2) INFORMATION FOR SEQ ID NO:73: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 43 amino acids (B) TYPE: amino acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE: peptide (xi) SEQUENCE DESCRIPTION: SEQ ID NO:73: LysArgValAspTyrGluAspValGlnLysTyrSerLeuSerIleLys 151015 AlaGlnAspGlyArgProProLeuIleAsnSerSerGlyValValSer 202530 ValGlnValLeuAspValAsnAspAsnAlaPro 3540 (2) INFORMATION FOR SEQ ID NO:74: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 125 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE: cDNA (xi) SEQUENCE DESCRIPTION: SEQ ID NO:74: AAACCGGTAGACTTTGAGCTACAGCAGTTCTATGAAGTAGCTGTGGTGGCTTGGAACTCT60 GAGGGATTTCATGTCAAAAGGGTCATTAAAGTGCAACTTTTAGATGACAACGACAATGCC120 CCGAT125 (2) INFORMATION FOR SEQ ID NO:75: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 41 amino acids (B) TYPE: amino acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE: protein (xi) SEQUENCE DESCRIPTION: SEQ ID NO:75: 
LysProValAspPheGluLeuGlnGlnPheTyrGluValAlaValVal 151015 AlaTrpAsnSerGluGlyPheHisValLysArgValIleLysValGln 202530 LeuLeuAspAspAsnAspAsnAlaPro 3540 (2) INFORMATION FOR SEQ ID NO:76: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 125 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE: cDNA (xi) SEQUENCE DESCRIPTION: SEQ ID NO:76: AAGGGATTAGATTTTGAAACTTTGCCCATTTACACATTGATAATACAAGGAACTAACATG60 GCTGGTTTGTCCACTAATACAACGGTTCTAGTTCACTTGCAGGATGAGAATGATAACGCC120 CCAAA125 (2) INFORMATION FOR SEQ ID NO:77: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 41 amino acids (B) TYPE: amino acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE: protein (xi) SEQUENCE DESCRIPTION: SEQ ID NO:77: LysGlyLeuAspPheGluThrLeuProIleTyrThrLeuIleIleGln 151015 GlyThrAsnMetAlaGlyLeuSerThrAsnThrThrValLeuValHis 202530 LeuGlnAspGluAsnAspAsnAlaPro 3540 (2) INFORMATION FOR SEQ ID NO:78: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 134 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE: cDNA (xi) SEQUENCE DESCRIPTION: SEQ ID NO:78: AAGCGGGCGGATTTCGAGGCGATCCGGGAGTACAGTCTGAGGATCAAAGCGCAGGACGGG60 GGGCGGCCTCCCCTCAGCAACACCACGGGCATGGTCACAGTGCAGGTCGTGGACGTCAAT120 GACAACGCACCCCT134 (2) INFORMATION FOR SEQ ID NO:79: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 44 amino acids (B) TYPE: amino acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE: protein (xi) SEQUENCE DESCRIPTION: SEQ ID NO:79: LysArgAlaAspPheGluAlaIleArgGluTyrSerLeuArgIleLys 151015 AlaGlnAspGlyGlyArgProProLeuSerAsnThrThrGlyMetVal 202530 ThrValGlnValValAspValAsnAspAsnAlaPro 3540 (2) INFORMATION FOR SEQ ID NO:80: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 131 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE: cDNA (xi) SEQUENCE DESCRIPTION: SEQ ID NO:80: AAGCGGTTGGATTACGAAAAGGCATCGGAATATGAAATCTATGTTCAAGCCGCTGACAAA60 GGCGCTGTCCCTATGGCTGGCCATTGCAAAGTGTTGCTGGAGATCGTGGATGTCAACGAC120 AACGCCCCCTT131 
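As a consistency check on the listing, the cDNA fragment of SEQ ID NO:80 can be translated in its first reading frame and compared with the 43-residue protein listed as SEQ ID NO:81. The following is a minimal Python sketch, assuming the standard genetic code. The names `CODON_TABLE`, `THREE_LETTER`, `translate`, and `SEQ_ID_80` are illustrative and are not part of the patent.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G base order
# (first base varies slowest, third base fastest).
BASES = "TCAG"
AMINO_1 = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO_1)}

# One-letter to three-letter residue codes, as used in the listing.
THREE_LETTER = {
    "A": "Ala", "R": "Arg", "N": "Asn", "D": "Asp", "C": "Cys",
    "Q": "Gln", "E": "Glu", "G": "Gly", "H": "His", "I": "Ile",
    "L": "Leu", "K": "Lys", "M": "Met", "F": "Phe", "P": "Pro",
    "S": "Ser", "T": "Thr", "W": "Trp", "Y": "Tyr", "V": "Val",
}


def translate(cdna: str) -> str:
    """Translate reading frame 1 into concatenated three-letter codes.

    Translation stops at the first stop codon; a trailing partial
    codon (one or two bases) is ignored.
    """
    cdna = cdna.upper()
    residues = []
    for i in range(0, len(cdna) - len(cdna) % 3, 3):
        aa = CODON_TABLE[cdna[i:i + 3]]
        if aa == "*":
            break
        residues.append(THREE_LETTER[aa])
    return "".join(residues)


# SEQ ID NO:80 as given in the listing (131 bases), without position counts.
SEQ_ID_80 = (
    "AAGCGGTTGGATTACGAAAAGGCATCGGAATATGAAATCTATGTTCAAGCCGCTGACAAA"
    "GGCGCTGTCCCTATGGCTGGCCATTGCAAAGTGTTGCTGGAGATCGTGGATGTCAACGAC"
    "AACGCCCCCTT"
)

if __name__ == "__main__":
    print(translate(SEQ_ID_80))
```

The 131-base fragment holds 43 complete codons plus two trailing bases, which is why the corresponding protein entry is 43 residues long. The same function can be applied to the other cDNA and protein pairs in the listing.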
(2) INFORMATION FOR SEQ ID NO:81:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 43 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:81:
LysArgLeuAspTyrGluLysAlaSerGluTyrGluIleTyrValGln
1 5 10 15
AlaAlaAspLysGlyAlaValProMetAlaGlyHisCysLysValLeu
20 25 30
LeuGluIleValAspValAsnAspAsnAlaPro
35 40
(2) INFORMATION FOR SEQ ID NO:82:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 131 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: cDNA
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:82:
AAGGGGATCGATTATGAGGATCAGGTCTCTTACACATTAGCAGTAACAGCACATGACTAT 60
GGCATCCCTCAAAAATCAGACACTACCTATTTGGAAATCTTAGTAATTGATGTTAACGAC 120
AACGCGCCCCA 131
(2) INFORMATION FOR SEQ ID NO:83:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 43 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:83:
LysGlyIleAspTyrGluAspGlnValSerTyrThrLeuAlaValThr
1 5 10 15
AlaHisAspTyrGlyIleProGlnLysSerAspThrThrTyrLeuGlu
20 25 30
IleLeuValIleAspValAsnAspAsnAlaPro
35 40
(2) INFORMATION FOR SEQ ID NO:84:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 131 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: cDNA
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:84:
AAAGGGTTAGATTTCGAGGGCACTAAAGATTCAGCGTTTAAAATAGTGGCAGCTGACACA 60
GGGAAGCCCAGCCTCAACCAGACAGCCCTGGTGAGAGTAGAGCTGGAGGATGAGAACGAC 120
AACGCCCCAAT 131
(2) INFORMATION FOR SEQ ID NO:85:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 43 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:85:
LysGlyLeuAspPheGluGlyThrLysAspSerAlaPheLysIleVal
1 5 10 15
AlaAlaAspThrGlyLysProSerLeuAsnGlnThrAlaLeuValArg
20 25 30
ValGluLeuGluAspGluAsnAspAsnAlaPro
35 40
(2) INFORMATION FOR SEQ ID NO:86:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 130 base pairs
(B) TYPE: nucleic acid
(C)
STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE: cDNA (xi) SEQUENCE DESCRIPTION: SEQ ID NO:86: AAGGGTGTGGATTTTGAAAGTGTGCGTAGCTACAGGCTGGTTATTCGTGCTCAAGATGGA60 GGCAGCCCCTCCAGAAGTAACACCACCCAGCTCTTGGTCAACGTCATCGATCGAATGACA120 ATGCGCCGCT130 (2) INFORMATION FOR SEQ ID NO:87: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 43 amino acids (B) TYPE: amino acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE: protein (xi) SEQUENCE DESCRIPTION: SEQ ID NO:87: LysGlyValAspPheGluSerValArgSerTyrArgLeuValIleArg 151015 AlaGlnAspGlyGlySerProSerArgSerAsnThrThrGlnLeuLeu 202530 ValAsnValIleAspValAsnAspAsnAlaPro 3540 (2) INFORMATION FOR SEQ ID NO:88: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 131 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE: cDNA (xi) SEQUENCE DESCRIPTION: SEQ ID NO:88: AAGGGTGTGGACTTCGAGCTGACACATCTGTATGAGATTTGGATTGAGGCTGCCGATGGA60 GACACGCCAAGTCTGCGTAGTGTAACTCTTATAACGCTCAACGTAACGGATGCCAATGAC120 AATGCTCCCAA131 (2) INFORMATION FOR SEQ ID NO:89: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 43 amino acids (B) TYPE: amino acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE: protein (xi) SEQUENCE DESCRIPTION: SEQ ID NO:89: LysGlyValAspPheGluLeuThrHisLeuTyrGluIleTrpIleGlu 151015 AlaAlaAspGlyAspThrProSerLeuArgSerValThrLeuIleThr 202530 LeuAsnValThrAspAlaAsnAspAsnAlaPro 3540 (2) INFORMATION FOR SEQ ID NO:90: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 441 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE: cDNA (xi) SEQUENCE DESCRIPTION: SEQ ID NO:90: CAAGGCGTTTGATTTTGAAGAGACAAGTAGATATGTGTTGAGTGTGGAAGCTAAGGATGG60 AGGAGTACACACAGCTCACTGTAATGTTCAAATAGAAATTGTTGACGAGAATGACAATGC120 CCCAGAGGTGACATTCATGTCCTTCTCTAACCAGATTCCAGAGGATTCAGACCTTGGAAC180 TGTAATAGCCCTCATAAAAGTGCGAGACAAGGATTCTGGGCAAAATGGCATGGTGACATG240 CTATACTCAGGAAGAAGTTCCTTTCAAATTAGAATCCACCTCGAAGAATTATTACAAGCT300 GGTGATTGCTGGAGCCCTAAACCGGGAGCAGACAGCAGACTACAACGTCACAATCATAGC360 
CACCGACAAGGGCAAACCAGCCCTTTCCTCCAGGACAAGCATCACCCTGCACATCTCCGA420 CATCAACGATAATGCCCCCGT441 (2) INFORMATION FOR SEQ ID NO:91: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 146 amino acids (B) TYPE: amino acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE: protein (xi) SEQUENCE DESCRIPTION: SEQ ID NO:91: LysAlaPheAspPheGluGluThrSerArgTyrValLeuSerValGlu 151015 AlaLysAspGlyGlyValHisThrAlaHisCysAsnValGlnIleGlu 202530 IleValAspGluAsnAspAsnAlaProGluValThrPheMetSerPhe 354045 SerAsnGlnIleProGluAspSerAspLeuGlyThrValIleAlaLeu 505560 IleLysValArgAspLysAspSerGlyGlnAsnGlyMetValThrCys 65707580 TyrThrGlnGluGluValProPheLysLeuGluSerThrSerLysAsn 859095 TyrTyrLysLeuValIleAlaGlyAlaLeuAsnArgGluGlnThrAla 100105110 AspTyrAsnValThrIleIleAlaThrAspLysGlyLysProAlaLeu 115120125 SerSerArgThrSerIleThrLeuHisIleSerAspIleAsnAspAsn 130135140 AlaPro 145 (2) INFORMATION FOR SEQ ID NO:92: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 131 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE: cDNA (xi) SEQUENCE DESCRIPTION: SEQ ID NO:92: AAGCGAGTGGATTACGAGGCCACTCGGAATTATAAGCTGAGAGTTAAGGCTACTGATCTT60 GGGATTCCACCGAGATCTTCTAACATGACACTGTTCATTCATGTCCTTGATGTTAACGAC120 AACGCTCCCTT131 (2) INFORMATION FOR SEQ ID NO:93: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 43 amino acids (B) TYPE: amino acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE: protein (xi) SEQUENCE DESCRIPTION: SEQ ID NO:93: LysArgValAspTyrGluAlaThrArgAsnTyrLysLeuArgValLys 151015 AlaThrAspLeuGlyIleProProArgSerSerAsnMetThrLeuPhe 202530 IleHisValLeuAspValAsnAspAsnAlaPro 3540 (2) INFORMATION FOR SEQ ID NO:94: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 4104 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE: cDNA (ix) FEATURE: (A) NAME/KEY: CDS (B) LOCATION: 495..3572 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:94: CCTCTATTCGACATTCTCTTTGGATTGTTTTGCTATAACTTGAAATTTGGGATGTCACAA60 ACGAAACTGTCATCTGTTTCCGCCAAACTGTGGTTCTGCTAATCTCCCAGGCTGGCAGCA120 
TTGGAGACTTGCTGACTTCTTTCATCCCCCACTCTTTTCACCTGAAATTCCTTTCCTTGG180 TTTTGCTCTAAGTCCTATGCTTCAGTCAGGGGCCAACCAAATCTCACTGCCTCCTTTTTA240 TCATGAAGCCTTTGATCACTGATAGTTCTTTTTATATCTTGAAAAATCACCCTTCCCAGT300 ACAGTTAATATTTAGTATCTCTACTCATCTTGGCACTTACTCACAGCTCCATAATTCAGT360 CGTTTTCGTACCTCTTCATGGTGATGGGGAGCCCTTTGGAGGTGGTGACTGTGCTTTATA420 CTCCTCATGATGCTTCACATGTGGCAGGCGTGGAGTGCCCGGAGGCGGCCCTCCTGATTC480 TGGGGCCTCCCAGGATGGAGCCCCTGAGGCACAGCCCAGGCCCTGGGGGG530 MetGluProLeuArgHisSerProGlyProGlyGly 1510 CAACGGCTACTGCTGCCCTCCATGCTGCTAGCACTGCTGCTCCTGCTG578 GlnArgLeuLeuLeuProSerMetLeuLeuAlaLeuLeuLeuLeuLeu 152025 GCTCCATCCCCAGGCCACGCCACTCGGGTAGTGTACAAGGTGCCGGAG626 AlaProSerProGlyHisAlaThrArgValValTyrLysValProGlu 303540 GAACAGCCACCCAACACCCTCATTGGGAGCCTCGCAGCCGACTATGGT674 GluGlnProProAsnThrLeuIleGlySerLeuAlaAlaAspTyrGly 45505560 TTTCCAGATGTGGGGCACCTGTACAAGCTAGAGGTGGGTGCCCCGTAC722 PheProAspValGlyHisLeuTyrLysLeuGluValGlyAlaProTyr 657075 CTTCGCGTGGATGGCAAGACAGGTGACATTTTCACCACCGAGACCTCC770 LeuArgValAspGlyLysThrGlyAspIlePheThrThrGluThrSer 808590 ATCGACCGTGAGGGGCTCCGTGAATGCCAGAACCAGCTCCCTGGTGAT818 IleAspArgGluGlyLeuArgGluCysGlnAsnGlnLeuProGlyAsp 95100105 CCCTGCATCCTGGAGTTTGAGGTATCTATCACAGACCTCGTGCAGAAT866 ProCysIleLeuGluPheGluValSerIleThrAspLeuValGlnAsn 110115120 GCGAGCCCCCGGCTGCTAGAGGGCCAGATAGAAGTACAAGACATCAAT914 AlaSerProArgLeuLeuGluGlyGlnIleGluValGlnAspIleAsn 125130135140 GACAACACACCCAACTTCGCCTCACCAGTCATCACTCTGGCCATCCCT962 AspAsnThrProAsnPheAlaSerProValIleThrLeuAlaIlePro 145150155 GAGAACACCAACATCGGCTCACTCTTCCCCATCCCGCTGGCTTCAGAC1010 GluAsnThrAsnIleGlySerLeuPheProIleProLeuAlaSerAsp 160165170 CGTGATGCTGGTCCCAACGGTGTGGCATCCTATGAGCTGCAGGTGGCA1058 ArgAspAlaGlyProAsnGlyValAlaSerTyrGluLeuGlnValAla 175180185 GAGGACCAGGAGGAGAAGCAACCACAGCTCATTGTGATGGGCAACCTG1106 GluAspGlnGluGluLysGlnProGlnLeuIleValMetGlyAsnLeu 190195200 GACCGTGAGCGCTGGGACTCCTATGACCTCACCATCAAGGTGCAGGAT1154 AspArgGluArgTrpAspSerTyrAspLeuThrIleLysValGlnAsp 205210215220 GGCGGCAGCCCCCCACGCGCCACGAGTGCCCTGCTGCGTGTCACCGTG1202 
GlyGlySerProProArgAlaThrSerAlaLeuLeuArgValThrVal 225230235 CTTGACACCAATGACAACGCCCCCAAGTTTGAGCGGCCCTCCTATGAG1250 LeuAspThrAsnAspAsnAlaProLysPheGluArgProSerTyrGlu 240245250 GCCGAACTATCTGAGAATAGCCCCATAGGCCACTCGGTCATCCAGGTG1298 AlaGluLeuSerGluAsnSerProIleGlyHisSerValIleGlnVal 255260265 AAGGCCAATGACTCAGACCAAGGTGCCAATGCAGAAATCGAATACACA1346 LysAlaAsnAspSerAspGlnGlyAlaAsnAlaGluIleGluTyrThr 270275280 TTCCACCAGGCGCCCGAAGTTGTGAGGCGTCTTCTTCGACTGGACAGG1394 PheHisGlnAlaProGluValValArgArgLeuLeuArgLeuAspArg 285290295300 AACACTGGACTTATCACTGTTCAGGGCCCGGTGGACCGTGAGGACCTA1442 AsnThrGlyLeuIleThrValGlnGlyProValAspArgGluAspLeu 305310315 AGCACCCTGCGCTTCTCAGTGCTTGCTAAGGACCGAGGCACCAACCCC1490 SerThrLeuArgPheSerValLeuAlaLysAspArgGlyThrAsnPro 320325330 AAGAGTGCCCGTGCCCAGGTGGTTGTGACCGTGAAGGACATGAATGAC1538 LysSerAlaArgAlaGlnValValValThrValLysAspMetAsnAsp 335340345 AATGCCCCCACCATTGAGATCCGGGGCATAGGGCTAGTGACTCATCAA1586 AsnAlaProThrIleGluIleArgGlyIleGlyLeuValThrHisGln 350355360 GATGGGATGGCTAACATCTCAGAGGATGTGGCAGAGGAGACAGCTGTG1634 AspGlyMetAlaAsnIleSerGluAspValAlaGluGluThrAlaVal 365370375380 GCCCTGGTGCAGGTGTCTGACCGAGATGAGGGAGAGAATGCAGCTGTC1682 AlaLeuValGlnValSerAspArgAspGluGlyGluAsnAlaAlaVal 385390395 ACCTGTGTGGTGGCAGGTGATGTGCCCTTCCAGCTGCGCCAGGCCAGT1730 ThrCysValValAlaGlyAspValProPheGlnLeuArgGlnAlaSer 400405410 GAGACAGGCAGTGACAGCAAGAAGAAGTATTTCCTGCAGACTACCACC1778 GluThrGlySerAspSerLysLysLysTyrPheLeuGlnThrThrThr 415420425 CCGCTAGACTACGAGAAGGTCAAAGACTACACCATTGAGATTGTGGCT1826 ProLeuAspTyrGluLysValLysAspTyrThrIleGluIleValAla 430435440 GTGGACTCTGGCAACCCCCCACTCTCCAGCACTAACTCCCTCAAGGTG1874 ValAspSerGlyAsnProProLeuSerSerThrAsnSerLeuLysVal 445450455460 CAGGTGGTGGACGTCAATGACAACGCACCTGTCTTCACTCAGAGTGTC1922 GlnValValAspValAsnAspAsnAlaProValPheThrGlnSerVal 465470475 ACTGAGGTCGCCTTCCCGGAAAACAACAAGCCTGGTGAAGTGATTGCT1970 ThrGluValAlaPheProGluAsnAsnLysProGlyGluValIleAla 480485490 GAGATCACTGCCAGTGATGCTGACTCTGGCTCTAATGCTGAGCTGGTT2018 GluIleThrAlaSerAspAlaAspSerGlySerAsnAlaGluLeuVal 495500505 
TACTCTCTGGAGCCTGAGCCGGCTGCTAAGGGCCTCTTCACCATCTCA2066 TyrSerLeuGluProGluProAlaAlaLysGlyLeuPheThrIleSer 510515520 CCCGAGACTGGAGAGATCCAGGTGAAGACATCTCTGGATCGGGAACAG2114 ProGluThrGlyGluIleGlnValLysThrSerLeuAspArgGluGln 525530535540 CGGGAGAGCTATGAGTTGAAGGTGGTGGCAGCTGACCGGGGCAGTCCT2162 ArgGluSerTyrGluLeuLysValValAlaAlaAspArgGlySerPro 545550555 AGCCTCCAGGGCACAGCCACTGTCCTTGTCAATGTGCTGGACTGCAAT2210 SerLeuGlnGlyThrAlaThrValLeuValAsnValLeuAspCysAsn 560565570 GACAATGACCCCAAATTTATGCTGAGTGGCTACAACTTCTCAGTGATG2258 AspAsnAspProLysPheMetLeuSerGlyTyrAsnPheSerValMet 575580585 GAGAACATGCCAGCACTGAGTCCAGTGGGCATGGTGACTGTCATTGAT2306 GluAsnMetProAlaLeuSerProValGlyMetValThrValIleAsp 590595600 GGAGACAAGGGGGAGAATGCCCAGGTGCAGCTCTCAGTGGAGCAGGAC2354 GlyAspLysGlyGluAsnAlaGlnValGlnLeuSerValGluGlnAsp 605610615620 AACGGTGACTTTGTTATCCAGAATGGCACAGGCACCATCCTATCCAGC2402 AsnGlyAspPheValIleGlnAsnGlyThrGlyThrIleLeuSerSer 625630635 CTGAGCTTTGATCGAGAGCAACAAAGCACCTACACCTTCCAGCTGAAG2450 LeuSerPheAspArgGluGlnGlnSerThrTyrThrPheGlnLeuLys 640645650 GCAGTGGATGGTGGCGTCCCACCTCGCTCAGCTTACGTTGGTGTCACC2498 AlaValAspGlyGlyValProProArgSerAlaTyrValGlyValThr 655660665 ATCAATGTGCTGGACGAGAATGACAACGCACCCTATATCACTGCCCCT2546 IleAsnValLeuAspGluAsnAspAsnAlaProTyrIleThrAlaPro 670675680 TCTAACACCTCTCACAAGCTGCTGACCCCCCAGACACGTCTTGGTGAG2594 SerAsnThrSerHisLysLeuLeuThrProGlnThrArgLeuGlyGlu 685690695700 ACGGTCAGCCAGGTGGCAGCCGAGGACTTTGACTCTGGTGTCAATGCC2642 ThrValSerGlnValAlaAlaGluAspPheAspSerGlyValAsnAla 705710715 GAGCTGATCTACAGCATTGCAGGTGGCAACCCTTATGGACTCTTCCAG2690 GluLeuIleTyrSerIleAlaGlyGlyAsnProTyrGlyLeuPheGln 720725730 ATTGGGTCACATTCAGGTGCCATCACCCTGGAGAAGGAGATTGAGCGG2738 IleGlySerHisSerGlyAlaIleThrLeuGluLysGluIleGluArg 735740745 CGCCACCATGGGCTACACCGCCTGGTGGTGAAGGTCAGTGACCGCGGC2786 ArgHisHisGlyLeuHisArgLeuValValLysValSerAspArgGly 750755760 AAGCCCCCACGCTATGGCACAGCCTTGGTCCATCTTTATGTCAATGAG2834 LysProProArgTyrGlyThrAlaLeuValHisLeuTyrValAsnGlu 765770775780 ACTCTGGCCAACCGCACGCTGCTGGAGACCCTCCTGGGCCACAGCCTG2882 
ThrLeuAlaAsnArgThrLeuLeuGluThrLeuLeuGlyHisSerLeu 785790795 GACACGCCGCTGGATATTGACATTGCTGGGGATCCAGAATATGAGCGC2930 AspThrProLeuAspIleAspIleAlaGlyAspProGluTyrGluArg 800805810 TCCAAGCAGCGTGGCAACATTCTCTTTGGTGTGGTGGCTGGTGTGGTG2978 SerLysGlnArgGlyAsnIleLeuPheGlyValValAlaGlyValVal 815820825 GCCGTGGCCTTGCTCATCGCCCTGGCGGTTCTTGTGCGCTACTGCAGA3026 AlaValAlaLeuLeuIleAlaLeuAlaValLeuValArgTyrCysArg 830835840 CAGCGGGAGGCCAAAAGTGGTTACCAGGCTGGTAAGAAGGAGACCAAG3074 GlnArgGluAlaLysSerGlyTyrGlnAlaGlyLysLysGluThrLys 845850855860 GACCTGTATGCCCCCAAGCCCAGTGGCAAGGCCTCCAAGGGAAACAAA3122 AspLeuTyrAlaProLysProSerGlyLysAlaSerLysGlyAsnLys 865870875 AGCAAAGGCAAGAAGAGCAAGTCCCCAAAGCCCGTGAAGCCAGTGGAG3170 SerLysGlyLysLysSerLysSerProLysProValLysProValGlu 880885890 GACGAGGATGAGGCCGGGCTGCAGAAGTCCCTCAAGTTCAACCTGATG3218 AspGluAspGluAlaGlyLeuGlnLysSerLeuLysPheAsnLeuMet 895900905 AGCGATGCCCCTGGGGACAGTCCCCGCATCCACCTGCCCCTCAACTAC3266 SerAspAlaProGlyAspSerProArgIleHisLeuProLeuAsnTyr 910915920 CCACCAGGCAGCCCTGACCTGGGCCGCCACTATCGCTCTAACTCCCCA3314 ProProGlySerProAspLeuGlyArgHisTyrArgSerAsnSerPro 925930935940 CTGCCTTCCATCCAGCTGCAGCCCCAGTCACCCTCAGCCTCCAAGAAG3362 LeuProSerIleGlnLeuGlnProGlnSerProSerAlaSerLysLys 945950955 CACCAGGTGGTACAGGACCTGCCACCTGCAAACACATTCGTGGGCACC3410 HisGlnValValGlnAspLeuProProAlaAsnThrPheValGlyThr 960965970 GGGGACACCACGTCCACGGGCTCTGAGCAGTACTCCGACTACAGCTAC3458 GlyAspThrThrSerThrGlySerGluGlnTyrSerAspTyrSerTyr 975980985 CGCACCAACCCCCCCAAATACCCCAGCAAGCAGGTAGGCCAGCCCTTT3506 ArgThrAsnProProLysTyrProSerLysGlnValGlyGlnProPhe 9909951000 CAGCTCAGCACACCCCAGCCCCTACCCCACCCCTACCACGGAGCCATC3554 GlnLeuSerThrProGlnProLeuProHisProTyrHisGlyAlaIle 1005101010151020 TGGACCGAGGTGTGGGAGTGATGGAGCAGGTTTACTGTGCCTGCCCGT3602 TrpThrGluValTrpGlu 1025 GTTGGGGGCCAGCCTGAGCCAGCAGTGGGAGGTGGGGCCTTAGTGCCTCACCGGGCACAC3662 GGATTAGGCTGAGTGAAGATTAAGGGAGGGTGTGCTCTGTGGTCTCCTCCCTGCCCTCTC3722 CCCACTGGGGAGAGACCTGTGATTTGCCAAGTCCCTGGACCCTGGACCAGCTACTGGGCC3782 TTATGGGTTGGGGGTGGTAGGCAGGTGAGCGTAAGTGGGGAGGGAAATGGGTAAGAAGTC3842 
TACTCCAAACCTAGGTCTCTATGTCAGACCAGACCTAGGTGCTTCTCTAGGAGGGAAACA3902 GGGAGACCTGGGGTCCTGTGGATAACTGAGTGGGGAGTCTGCCAGGGGAGGGCACCTTCC3962 CATTGTGCCTTCTGTGTGTATTGTGCATTAACCTCTTCCTCACCACTAGGCTTCTGGGGC4022 TGGGTCCCACATGCCCTTGACCCTGACAATAAAGTTCTCTATTTTTGGAAAAAAAAAAAA4082 AAAAAAAAAAAAAAAAAAAAAA4104 (2) INFORMATION FOR SEQ ID NO:95: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 1026 amino acids (B) TYPE: amino acid (D) TOPOLOGY: linear (ii) MOLECULE TYPE: protein (xi) SEQUENCE DESCRIPTION: SEQ ID NO:95: MetGluProLeuArgHisSerProGlyProGlyGlyGlnArgLeuLeu 151015 LeuProSerMetLeuLeuAlaLeuLeuLeuLeuLeuAlaProSerPro 202530 GlyHisAlaThrArgValValTyrLysValProGluGluGlnProPro 354045 AsnThrLeuIleGlySerLeuAlaAlaAspTyrGlyPheProAspVal 505560 GlyHisLeuTyrLysLeuGluValGlyAlaProTyrLeuArgValAsp 65707580 GlyLysThrGlyAspIlePheThrThrGluThrSerIleAspArgGlu 859095 GlyLeuArgGluCysGlnAsnGlnLeuProGlyAspProCysIleLeu 100105110 GluPheGluValSerIleThrAspLeuValGlnAsnAlaSerProArg 115120125 LeuLeuGluGlyGlnIleGluValGlnAspIleAsnAspAsnThrPro 130135140 AsnPheAlaSerProValIleThrLeuAlaIleProGluAsnThrAsn 145150155160 IleGlySerLeuPheProIleProLeuAlaSerAspArgAspAlaGly 165170175 ProAsnGlyValAlaSerTyrGluLeuGlnValAlaGluAspGlnGlu 180185190 GluLysGlnProGlnLeuIleValMetGlyAsnLeuAspArgGluArg 195200205 TrpAspSerTyrAspLeuThrIleLysValGlnAspGlyGlySerPro 210215220 ProArgAlaThrSerAlaLeuLeuArgValThrValLeuAspThrAsn 225230235240 AspAsnAlaProLysPheGluArgProSerTyrGluAlaGluLeuSer 245250255 GluAsnSerProIleGlyHisSerValIleGlnValLysAlaAsnAsp 260265270 SerAspGlnGlyAlaAsnAlaGluIleGluTyrThrPheHisGlnAla 275280285 ProGluValValArgArgLeuLeuArgLeuAspArgAsnThrGlyLeu 290295300 IleThrValGlnGlyProValAspArgGluAspLeuSerThrLeuArg 305310315320 PheSerValLeuAlaLysAspArgGlyThrAsnProLysSerAlaArg 325330335 AlaGlnValValValThrValLysAspMetAsnAspAsnAlaProThr 340345350 IleGluIleArgGlyIleGlyLeuValThrHisGlnAspGlyMetAla 355360365 AsnIleSerGluAspValAlaGluGluThrAlaValAlaLeuValGln 370375380 ValSerAspArgAspGluGlyGluAsnAlaAlaValThrCysValVal 385390395400 
AlaGlyAspValProPheGlnLeuArgGlnAlaSerGluThrGlySer 405410415 AspSerLysLysLysTyrPheLeuGlnThrThrThrProLeuAspTyr 420425430 GluLysValLysAspTyrThrIleGluIleValAlaValAspSerGly 435440445 AsnProProLeuSerSerThrAsnSerLeuLysValGlnValValAsp 450455460 ValAsnAspAsnAlaProValPheThrGlnSerValThrGluValAla 465470475480 PheProGluAsnAsnLysProGlyGluValIleAlaGluIleThrAla 485490495 SerAspAlaAspSerGlySerAsnAlaGluLeuValTyrSerLeuGlu 500505510 ProGluProAlaAlaLysGlyLeuPheThrIleSerProGluThrGly 515520525 GluIleGlnValLysThrSerLeuAspArgGluGlnArgGluSerTyr 530535540 GluLeuLysValValAlaAlaAspArgGlySerProSerLeuGlnGly 545550555560 ThrAlaThrValLeuValAsnValLeuAspCysAsnAspAsnAspPro 565570575 LysPheMetLeuSerGlyTyrAsnPheSerValMetGluAsnMetPro 580585590 AlaLeuSerProValGlyMetValThrValIleAspGlyAspLysGly 595600605 GluAsnAlaGlnValGlnLeuSerValGluGlnAspAsnGlyAspPhe 610615620 ValIleGlnAsnGlyThrGlyThrIleLeuSerSerLeuSerPheAsp 625630635640 ArgGluGlnGlnSerThrTyrThrPheGlnLeuLysAlaValAspGly 645650655 GlyValProProArgSerAlaTyrValGlyValThrIleAsnValLeu 660665670 AspGluAsnAspAsnAlaProTyrIleThrAlaProSerAsnThrSer 675680685 HisLysLeuLeuThrProGlnThrArgLeuGlyGluThrValSerGln 690695700 ValAlaAlaGluAspPheAspSerGlyValAsnAlaGluLeuIleTyr 705710715720 SerIleAlaGlyGlyAsnProTyrGlyLeuPheGlnIleGlySerHis 725730735 SerGlyAlaIleThrLeuGluLysGluIleGluArgArgHisHisGly 740745750 LeuHisArgLeuValValLysValSerAspArgGlyLysProProArg 755760765 TyrGlyThrAlaLeuValHisLeuTyrValAsnGluThrLeuAlaAsn 770775780 ArgThrLeuLeuGluThrLeuLeuGlyHisSerLeuAspThrProLeu 785790795800 AspIleAspIleAlaGlyAspProGluTyrGluArgSerLysGlnArg 805810815 GlyAsnIleLeuPheGlyValValAlaGlyValValAlaValAlaLeu 820825830 LeuIleAlaLeuAlaValLeuValArgTyrCysArgGlnArgGluAla 835840845 LysSerGlyTyrGlnAlaGlyLysLysGluThrLysAspLeuTyrAla 850855860 ProLysProSerGlyLysAlaSerLysGlyAsnLysSerLysGlyLys 865870875880 LysSerLysSerProLysProValLysProValGluAspGluAspGlu 885890895 AlaGlyLeuGlnLysSerLeuLysPheAsnLeuMetSerAspAlaPro 900905910 GlyAspSerProArgIleHisLeuProLeuAsnTyrProProGlySer 915920925 
ProAspLeuGlyArgHisTyrArgSerAsnSerProLeuProSerIle 930935940 GlnLeuGlnProGlnSerProSerAlaSerLysLysHisGlnValVal 945950955960 GlnAspLeuProProAlaAsnThrPheValGlyThrGlyAspThrThr 965970975 SerThrGlySerGluGlnTyrSerAspTyrSerTyrArgThrAsnPro 980985990 ProLysTyrProSerLysGlnValGlyGlnProPheGlnLeuSerThr 99510001005 ProGlnProLeuProHisProTyrHisGlyAlaIleTrpThrGluVal 101010151020 TrpGlu 1025 (2) INFORMATION FOR SEQ ID NO:96: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 4705 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE: cDNA (ix) FEATURE: (A) NAME/KEY: CDS (B) LOCATION: 115..2827 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:96: CGAAAGCCATGTCGGACTCGTCGCCCAGCGCCCAAGCGCTAACCCGCTGAAAGTTTCTCA60 GCGAAATCTCAGGGACGATCTGGACCCCGCTGAGAGGAACTGCTTTTGAGTGAGATG117 Met GTCCCAGAGGCCTGGAGGAGCGGACTGGTAAGCACCGGGAGGGTAGTG165 ValProGluAlaTrpArgSerGlyLeuValSerThrGlyArgValVal 51015 GGAGTTTTGCTTCTGCTTGGTGCCTTGAACAAGGCTTCCACGGTCATT213 GlyValLeuLeuLeuLeuGlyAlaLeuAsnLysAlaSerThrValIle 202530 CACTATGAGATCCCGGAGGAAAGAGAGAAGGGTTTCGCTGTGGGCAAC261 HisTyrGluIleProGluGluArgGluLysGlyPheAlaValGlyAsn 354045 GTGGTCGCGAACCTTGGTTTGGATCTCGGTAGCCTCTCAGCCCGCAGG309 ValValAlaAsnLeuGlyLeuAspLeuGlySerLeuSerAlaArgArg 50556065 TTCCCGGTGGTGTCTGGAGCTAGCCGAAGATTCTTTGAGGTGAACCGG357 PheProValValSerGlyAlaSerArgArgPhePheGluValAsnArg 707580 GAGACCGGAGAGATGTTTGTGAACGACCGTCTGGATCGAGAGGAGCTG405 GluThrGlyGluMetPheValAsnAspArgLeuAspArgGluGluLeu 859095 TGTGGGACACTGCCCTCTTGCACTGTAACTCTGGAGTTGGTAGTGGAG453 CysGlyThrLeuProSerCysThrValThrLeuGluLeuValValGlu 100105110 AACCCGCTGGAGCTGTTCAGCGTGGAAGTGGTGATCCAGGACATCAAC501 AsnProLeuGluLeuPheSerValGluValValIleGlnAspIleAsn 115120125 GACAACAATCCTGCTTTCCCTACCCAGGAAATGAAATTGGAGATTAGC549 AspAsnAsnProAlaPheProThrGlnGluMetLysLeuGluIleSer 130135140145 GAGGCCGTGGCTCCGGGGACGCGCTTTCCGCTCGAGAGCGCGCACGAT597 GluAlaValAlaProGlyThrArgPheProLeuGluSerAlaHisAsp 150155160 CCCGATCTGGGAAGCAACTCTTTACAAACCTATGAGCTGAGCCGAAAT645 ProAspLeuGlySerAsnSerLeuGlnThrTyrGluLeuSerArgAsn 165170175 
GAATACTTTGCGCTTCGCGTGCAGACGCGGGAGGACAGCACCAAGTAC693 GluTyrPheAlaLeuArgValGlnThrArgGluAspSerThrLysTyr 180185190 GCGGAGCTGGTGTTGGAGCGCGCCCTGGACCGAGAACGGGAGCCTAGT741 AlaGluLeuValLeuGluArgAlaLeuAspArgGluArgGluProSer 195200205 CTCCAGTTAGTGCTGACGGCGTTGGACGGAGGGACCCCAGCTCTCTCC789 LeuGlnLeuValLeuThrAlaLeuAspGlyGlyThrProAlaLeuSer 210215220225 GCCAGCCTGCCTATTCACATCAAGGTGCTGGACGCGAATGACAATGCG837 AlaSerLeuProIleHisIleLysValLeuAspAlaAsnAspAsnAla 230235240 CCTGTCTTCAACCAGTCCTTGTACCGGGCGCGCGTTCCTGGAGGATGC885 ProValPheAsnGlnSerLeuTyrArgAlaArgValProGlyGlyCys 245250255 ACCTCCGGCACGCGCGTGGTACAAGTCCTTGCAACGGATCTGGATGAA933 ThrSerGlyThrArgValValGlnValLeuAlaThrAspLeuAspGlu 260265270 GGCCCCAACGGTGAAATTATTTACTCCTTCGGCAGCCACAACCGCGCC981 GlyProAsnGlyGluIleIleTyrSerPheGlySerHisAsnArgAla 275280285 GGCGTGCGGCAACTATTCGCCTTAGACCTTGTAACCGGGATGCTGACA1029 GlyValArgGlnLeuPheAlaLeuAspLeuValThrGlyMetLeuThr 290295300305 ATCAAGGGTCGGCTGGACTTCGAGGACACCAAACTCCATGAGATTTAC1077 IleLysGlyArgLeuAspPheGluAspThrLysLeuHisGluIleTyr 310315320 ATCCAGGCCAAAGACAAGGGCGCCAATCCCGAAGGAGCACATTGCAAA1125 IleGlnAlaLysAspLysGlyAlaAsnProGluGlyAlaHisCysLys 325330335 GTGTTGGTGGAGGTTGTGGATGTGAATGACAACGCCCCGGAGATCACA1173 ValLeuValGluValValAspValAsnAspAsnAlaProGluIleThr 340345350 GTCACCTCCGTGTACAGCCCAGTACCCGAGGATGCCTCTGGGACTGTC1221 ValThrSerValTyrSerProValProGluAspAlaSerGlyThrVal 355360365 ATCGCTTTGCTCAGTGTGACTGACCTGGATGCTGGCGAGAACGGGCTG1269 IleAlaLeuLeuSerValThrAspLeuAspAlaGlyGluAsnGlyLeu 370375380385 GTGACCTGCGAAGTTCCACCGGGTCTCCCTTTCAGCCTTACTTCTTCC1317 ValThrCysGluValProProGlyLeuProPheSerLeuThrSerSer 390395400 CTCAAGAATTACTTCACTTTGAAAACCAGTGCAGACCTGGATCGGGAG1365 LeuLysAsnTyrPheThrLeuLysThrSerAlaAspLeuAspArgGlu 405410415 ACTGTGCCAGAATACAACCTCAGCATCACCGCCCGAGACGCCGGAACC1413 ThrValProGluTyrAsnLeuSerIleThrAlaArgAspAlaGlyThr 420425430 CCTTCCCTCTCAGCCCTTACAATAGTGCGTGTTCAAGTGTCCGACATC1461 ProSerLeuSerAlaLeuThrIleValArgValGlnValSerAspIle 435440445 AATGACAACCCTCCACAATCTTCTCAATCTTCCTACGACGTTTACATT1509 
AsnAspAsnProProGlnSerSerGlnSerSerTyrAspValTyrIle 450455460465 GAAGAAAACAACCTCCCCGGGGCTCCAATACTAAACCTAAGTGTCTGG1557 GluGluAsnAsnLeuProGlyAlaProIleLeuAsnLeuSerValTrp 470475480 GACCCCGACGCCCCGCAGAATGCTCGGCTTTCTTTCTTTCTCTTGGAG1605 AspProAspAlaProGlnAsnAlaArgLeuSerPhePheLeuLeuGlu 485490495 CAAGGAGCTGAAACCGGGCTAGTGGGTCGCTATTTCACAATAAATCGT1653 GlnGlyAlaGluThrGlyLeuValGlyArgTyrPheThrIleAsnArg 500505510 GACAATGGCATAGTGTCATCCTTAGTGCCCCTAGACTATGAGGATCGG1701 AspAsnGlyIleValSerSerLeuValProLeuAspTyrGluAspArg 515520525 CGGGAATTTGAATTAACAGCTCATATCAGCGATGGGGGCACCCCGGTC1749 ArgGluPheGluLeuThrAlaHisIleSerAspGlyGlyThrProVal 530535540545 CTAGCCACCAACATCAGCGTGAACATATTTGTCACTGATCGCAATGAC1797 LeuAlaThrAsnIleSerValAsnIlePheValThrAspArgAsnAsp 550555560 AATGCCCCCCAGGTCCTATATCCTCGGCCAGGTGGGAGCTCGGTGGAG1845 AsnAlaProGlnValLeuTyrProArgProGlyGlySerSerValGlu 565570575 ATGCTGCCTCGAGGTACCTCAGCTGGCCACCTAGTGTCACGGGTGGTA1893 MetLeuProArgGlyThrSerAlaGlyHisLeuValSerArgValVal 580585590 GGCTGGGACGCGGATGCAGGGCACAATGCCTGGCTCTCCTACAGTCTC1941 GlyTrpAspAlaAspAlaGlyHisAsnAlaTrpLeuSerTyrSerLeu 595600605 TTTGGATCCCCTAACCAGAGCCTTTTTGCCATAGGGCTGCACACTGGT1989 PheGlySerProAsnGlnSerLeuPheAlaIleGlyLeuHisThrGly 610615620625 CAAATCAGTACTGCCCGTCCAGTCCAAGACACAGATTCACCCAGGCAG2037 GlnIleSerThrAlaArgProValGlnAspThrAspSerProArgGln 630635640 ACTCTCACTGTCTTGATCAAAGACAATGGGGAGCCTTCGCTCTCCACC2085 ThrLeuThrValLeuIleLysAspAsnGlyGluProSerLeuSerThr 645650655 ACTGCTACCCTCACTGTGTCAGTAACCGAGGACTCTCCTGAAGCCCGA2133 ThrAlaThrLeuThrValSerValThrGluAspSerProGluAlaArg 660665670 GCCGAGTTCCCCTCTGGCTCTGCCCCCCGGGAGCAGAAAAAAAATCTC2181 AlaGluPheProSerGlySerAlaProArgGluGlnLysLysAsnLeu 675680685 ACCTTTTATCTACTTCTTTCTCTAATCCTGGTTTCTGTGGGCTTCGTG2229 ThrPheTyrLeuLeuLeuSerLeuIleLeuValSerValGlyPheVal 690695700705 GTCACAGTGTTCGGAGTAATCATATTCAAAGTTTACAAGTGGAAGCAG2277 ValThrValPheGlyValIleIlePheLysValTyrLysTrpLysGln 710715720 TCTAGAGACCTATACCGAGCCCCGGTGAGCTCACTGTACCGAACACCA2325 SerArgAspLeuTyrArgAlaProValSerSerLeuTyrArgThrPro 725730735 
GGGCCCTCCTTGCACGCGGACGCCGTGCGGGGAGGCCTGATGTCGCCG2373 GlyProSerLeuHisAlaAspAlaValArgGlyGlyLeuMetSerPro 740745750 CACCTTTACCATCAGGTGTATCTCACCACGGACTCCCGCCGCAGCGAC2421 HisLeuTyrHisGlnValTyrLeuThrThrAspSerArgArgSerAsp 755760765 CCGCTGCTGAAGAAACCTGGTGCAGCCAGTCCACTGGCCAGCCGCCAG2469 ProLeuLeuLysLysProGlyAlaAlaSerProLeuAlaSerArgGln 770775780785 AACACGCTGCGGAGCTGTGATCCGGTGTTCTATAGGCAGGTGTTGGGT2517 AsnThrLeuArgSerCysAspProValPheTyrArgGlnValLeuGly 790795800 GCAGAGAGCGCCCCTCCCGGACAGCAAGCCCCGCCCAACACGGACTGG2565 AlaGluSerAlaProProGlyGlnGlnAlaProProAsnThrAspTrp 805810815 CGTTTCTCTCAGGCCCAGAGACCCGGCACCAGCGGCTCCCAAAATGGC2613 ArgPheSerGlnAlaGlnArgProGlyThrSerGlySerGlnAsnGly 820825830 GATGACACCGGCACCTGGCCCAACAACCAGTTTGACACAGAGATGCTG2661 AspAspThrGlyThrTrpProAsnAsnGlnPheAspThrGluMetLeu 835840845 CAAGCCATGATCTTGGCGTCCGCCAGTGAAGCTGCTGATGGGAGCTCC2709 GlnAlaMetIleLeuAlaSerAlaSerGluAlaAlaAspGlySerSer 850855860865 ACCCTGGGAGGGGGTGCCGGCACCATGGGATTGAGCGCCCGCTACGGA2757 ThrLeuGlyGlyGlyAlaGlyThrMetGlyLeuSerAlaArgTyrGly 870875880 CCCCAGTTCACCCTGCAGCACGTGCCCGACTACCGCCAGAATGTCTAC2805 ProGlnPheThrLeuGlnHisValProAspTyrArgGlnAsnValTyr 885890895 ATCCCAGGCAGCAATGCACACTGACCAACGCAGCTGGCAAGCGGATGGCAAG2857 IleProGlySerAsnAlaHis 900 GCCCAGCAGGTGGCAATGGCAACAAGAAGAAGTCGGCAAGAAGGAGAAGAAGTAACATGG2917 AGGCCAGGCCAAGAGCCACAGGGCAGCCTCTCCCCGAACCAGCCCAGCTTCTCCTTACCT2977 GCACCCAGGCCTCAGAGTTTCAGGGCTAACCCCCAGAATACTGGTAGGGGCCAAGGCATC3037 TCCCTTGGAAACAGAAACAAGTGCCATCACACCATCCCTTCCCCAGGTGTAATATCCAAA3097 GCAGTTCCGCTGGGAACCCCATCCAATCAGTGGCTGTACCCATTTGGGTAGTGGGGTTCA3157 TGTAGACACCAAGAACCATTTGCCACACCCCGTTTAGTTACAGCTGAACCCTCCATCTTC3217 CAAATCAATCAGGCCCATCCATCCCATGCCTCCCTCCTCCCCACCCCACTCCAACAGTTC3277 CTCTTTCCCGAGTAAGGTGGTTGGGGTGTTGAAGTACCAAGTAACCTACAAGCCTCCTAG3337 TTCTGAAAAGTTGGAAGGGCATCATGACCTCTTGGCCTCTCCTTTGATTCTCAATCTTCC3397 CCCAAAGCATGGTTTGGTGCCAGCCCCTTCACCTCCTTCCAGAGCCCAAGATCAATGCTC3457 AAGTTTTGGAGGACATGATCACCATCCCCATGGTACTGATGCTTGCTGGATTTAGGGAGG3517 GCATTTTGCTACCAAGCCTCTTCCCAACGCCCTGGGACCAGTCTTCTGTTTTGTTTTTCA3577 
TTGTTTGAGCTTTCCACTGCATGCCTTGACTTCCCCCACCTCCTCCTCAAACAAGAGACT3637 CCACTGCATGTTCCAAGACAGTATGGGGTGGTAAGATAAGGAAGGGAAGTGTGTGGATGT3697 GGATGGTGGGGGCATGGACAAAGCTTGACACATCAAGTTATCAAGGCCTTGGAGGAGGCT3757 CTGTATGTCCTCAGGGGACTGACAACATCCTCCAGATTCCAGCCATAAACCAATAACTAG3817 GCTGGACCCTTCCCACTACATAATAGGGCTCAGCCAGGCAGCCAGCTTTGGGCTGAGCTA3877 ACAGGACCAATGGATTAACTGGCATTTCAGTCCAAGGAAGCTCGAAGCAGGTTTAGGACC3937 AGGTCCCCTTGAGAGGTCAGAGGGGCCTCTGTGGGTGCTGGGTACTCCAGAGGTGCCACT3997 GGTGGAAGGGTCAGCGGAGCCCCAGCAGGAAGGGTGGGCCAGCCAGGCCATTCTTAGTCC4057 CTGGGTTGGGGAGGCAGGGAGCTAGGGCAGGGACCAAATGAACAGAAAGTCTCAGCCCAG4117 GATGGGGCTTCTTCAACAGGCCCCTGCCCTCCTGAAGCCTCAGTCCTTCACCTTGCCAGG4177 TGCCGTTTCTCTTCCGTGAAGGCCACTGCCCAGGTCCCCAGTGCGCCCCCTAGTGGCCAT4237 AGCCTGGTTAAAGTTCCCCAGTGCCTCCTTGTGATAGACCTTCTTCTCCCACCCCCTTCT4297 GCCCCTGGGTCCCCGGCCATCCAGCGGGGCTGCCAGAGAACCCCAGACCTGCCCTTACAG4357 TAGTGTAGCGCCCCCTCCCTCTTTCGGCTGGTGTAGAATAGCCAGTAGTGTAGTGCGGTG4417 TGCTTTTACGTGATGGCGGGTGGGCAGCGGGCGGCGGCGTCCGCGCAGCCGTCTGTCCTT4477 GATCTGCCCGCGGCGGCCCGTGTTGTGTTTTGTGCTGTGTCCAGCGCTAAGGCGACCCCC4537 TCCCCCGTACTGACTTCTCCTATAAGCGCTTCTCTTCGCATAGTCACGTAGCTCCCACCC4597 CACCCTCTTCCTGTGTCTCACGCAAGTTTTATACTCTAATATTTATATGGCTTTTTTTCT4657 TCGACAAAAAAATAATAAAACGTTTCTTCTGAAAAAAAAAAAAAAAAA4705 (2) INFORMATION FOR SEQ ID NO:97: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 904 amino acids (B) TYPE: amino acid (D) TOPOLOGY: linear (ii) MOLECULE TYPE: protein (xi) SEQUENCE DESCRIPTION: SEQ ID NO:97: MetValProGluAlaTrpArgSerGlyLeuValSerThrGlyArgVal 151015 ValGlyValLeuLeuLeuLeuGlyAlaLeuAsnLysAlaSerThrVal 202530 IleHisTyrGluIleProGluGluArgGluLysGlyPheAlaValGly 354045 AsnValValAlaAsnLeuGlyLeuAspLeuGlySerLeuSerAlaArg 505560 ArgPheProValValSerGlyAlaSerArgArgPhePheGluValAsn 65707580 ArgGluThrGlyGluMetPheValAsnAspArgLeuAspArgGluGlu 859095 LeuCysGlyThrLeuProSerCysThrValThrLeuGluLeuValVal 100105110 GluAsnProLeuGluLeuPheSerValGluValValIleGlnAspIle 115120125 AsnAspAsnAsnProAlaPheProThrGlnGluMetLysLeuGluIle 130135140 SerGluAlaValAlaProGlyThrArgPheProLeuGluSerAlaHis 
145150155160 AspProAspLeuGlySerAsnSerLeuGlnThrTyrGluLeuSerArg 165170175 AsnGluTyrPheAlaLeuArgValGlnThrArgGluAspSerThrLys 180185190 TyrAlaGluLeuValLeuGluArgAlaLeuAspArgGluArgGluPro 195200205 SerLeuGlnLeuValLeuThrAlaLeuAspGlyGlyThrProAlaLeu 210215220 SerAlaSerLeuProIleHisIleLysValLeuAspAlaAsnAspAsn 225230235240 AlaProValPheAsnGlnSerLeuTyrArgAlaArgValProGlyGly 245250255 CysThrSerGlyThrArgValValGlnValLeuAlaThrAspLeuAsp 260265270 GluGlyProAsnGlyGluIleIleTyrSerPheGlySerHisAsnArg 275280285 AlaGlyValArgGlnLeuPheAlaLeuAspLeuValThrGlyMetLeu 290295300 ThrIleLysGlyArgLeuAspPheGluAspThrLysLeuHisGluIle 305310315320 TyrIleGlnAlaLysAspLysGlyAlaAsnProGluGlyAlaHisCys 325330335 LysValLeuValGluValValAspValAsnAspAsnAlaProGluIle 340345350 ThrValThrSerValTyrSerProValProGluAspAlaSerGlyThr 355360365 ValIleAlaLeuLeuSerValThrAspLeuAspAlaGlyGluAsnGly 370375380 LeuValThrCysGluValProProGlyLeuProPheSerLeuThrSer 385390395400 SerLeuLysAsnTyrPheThrLeuLysThrSerAlaAspLeuAspArg 405410415 GluThrValProGluTyrAsnLeuSerIleThrAlaArgAspAlaGly 420425430 ThrProSerLeuSerAlaLeuThrIleValArgValGlnValSerAsp 435440445 IleAsnAspAsnProProGlnSerSerGlnSerSerTyrAspValTyr 450455460 IleGluGluAsnAsnLeuProGlyAlaProIleLeuAsnLeuSerVal 465470475480 TrpAspProAspAlaProGlnAsnAlaArgLeuSerPhePheLeuLeu 485490495 GluGlnGlyAlaGluThrGlyLeuValGlyArgTyrPheThrIleAsn 500505510 ArgAspAsnGlyIleValSerSerLeuValProLeuAspTyrGluAsp 515520525 ArgArgGluPheGluLeuThrAlaHisIleSerAspGlyGlyThrPro 530535540 ValLeuAlaThrAsnIleSerValAsnIlePheValThrAspArgAsn 545550555560 AspAsnAlaProGlnValLeuTyrProArgProGlyGlySerSerVal 565570575 GluMetLeuProArgGlyThrSerAlaGlyHisLeuValSerArgVal 580585590 ValGlyTrpAspAlaAspAlaGlyHisAsnAlaTrpLeuSerTyrSer 595600605 LeuPheGlySerProAsnGlnSerLeuPheAlaIleGlyLeuHisThr 610615620 GlyGlnIleSerThrAlaArgProValGlnAspThrAspSerProArg 625630635640 GlnThrLeuThrValLeuIleLysAspAsnGlyGluProSerLeuSer 645650655 ThrThrAlaThrLeuThrValSerValThrGluAspSerProGluAla 660665670 ArgAlaGluPheProSerGlySerAlaProArgGluGlnLysLysAsn 675680685 
LeuThrPheTyrLeuLeuLeuSerLeuIleLeuValSerValGlyPhe 690695700 ValValThrValPheGlyValIleIlePheLysValTyrLysTrpLys 705710715720 GlnSerArgAspLeuTyrArgAlaProValSerSerLeuTyrArgThr 725730735 ProGlyProSerLeuHisAlaAspAlaValArgGlyGlyLeuMetSer 740745750 ProHisLeuTyrHisGlnValTyrLeuThrThrAspSerArgArgSer 755760765 AspProLeuLeuLysLysProGlyAlaAlaSerProLeuAlaSerArg 770775780 GlnAsnThrLeuArgSerCysAspProValPheTyrArgGlnValLeu 785790795800 GlyAlaGluSerAlaProProGlyGlnGlnAlaProProAsnThrAsp 805810815 TrpArgPheSerGlnAlaGlnArgProGlyThrSerGlySerGlnAsn 820825830 GlyAspAspThrGlyThrTrpProAsnAsnGlnPheAspThrGluMet 835840845 LeuGlnAlaMetIleLeuAlaSerAlaSerGluAlaAlaAspGlySer 850855860 SerThrLeuGlyGlyGlyAlaGlyThrMetGlyLeuSerAlaArgTyr 865870875880 GlyProGlnPheThrLeuGlnHisValProAspTyrArgGlnAsnVal 885890895 TyrIleProGlySerAsnAlaHis 900 (2) INFORMATION FOR SEQ ID NO:98: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 556 amino acids (B) TYPE: amino acid (D) TOPOLOGY: linear (ii) MOLECULE TYPE: protein (xi) SEQUENCE DESCRIPTION: SEQ ID NO:98: AspTrpValIleProProIleAsnLeuProGluAsnSerArgGlyPro 151015 PheProGlnGluLeuValArgIleArgSerAspArgAspLysAsnLeu 202530 SerLeuArgTyrThrValThrGlyProGlyAlaAspGlnProProThr 354045 GlyIlePheIleIleAsnProIleSerGlyGlnLeuSerValThrLys 505560 ProLeuAspArgGluGlnIleAlaArgPheHisLeuArgAlaHisAla 65707580 ValAspIleAsnGlyAsnGlnValGluAsnProIleAspIleValIle 859095 AsnValIleAspMetAsnAspAsnArgProGluPheLeuHisGlnVal 100105110 TrpAsnGlySerValProGluGlySerLysProGlyThrTyrValMet 115120125 ThrValThrAlaIleAspAlaAspAspProAsnAlaLeuAsnGlyMet 130135140 LeuArgTyrArgIleLeuSerGlnAlaProSerThrProSerProAsn 145150155160 MetPheThrIleAsnAsnGluThrGlyAspIleIleThrValAlaAla 165170175 GlyLeuAspArgGluLysValGlnGlnTyrThrLeuIleIleGlnAla 180185190 ThrAspMetGluGlyAsnProThrTyrGlyLeuSerAsnThrAlaThr 195200205 AlaValIleThrValThrAspValAsnAspAsnProProGluPheThr 210215220 AlaMetThrPheTyrGlyGluValProGluAsnArgValAspIleIle 225230235240 ValAlaAsnLeuThrValThrAspLysAspGlnProHisThrProAla 245250255 TrpAsnAlaValThrArgIleSerGlyGlyAspProThrGlyArgPhe 
260265270 AlaIleGlnThrAspProAsnSerAsnAspGlyLeuValThrValVal 275280285 LysProIleAspPheGluThrAsnArgMetPheValLeuThrValAla 290295300 AlaGluAsnGlnValProLeuAlaLysGlyIleGlnHisProProGln 305310315320 SerThrAlaThrValSerValThrValIleAspValAsnGluAsnPro 325330335 TyrPheAlaProAsnProLysIleIleArgGlnGluGluGlyLeuHis 340345350 AlaGlyThrMetLeuThrThrPheThrAlaGlyAspProAspArgTyr 355360365 MetGlnGlnAsnIleArgTyrThrLysLeuSerAspProAlaAsnTrp 370375380 LeuLysIleAspProValAsnGlyGlnIleThrThrIleAlaValLeu 385390395400 AspArgGluSerProAsnValLysAsnAsnIleTyrAsnAlaThrPhe 405410415 LeuAlaSerAspAsnGlyIleProProMetSerGlyThrGlyThrLeu 420425430 GlnIleTyrLeuLeuAspIleAsnAspAsnAlaProGlnValLeuPro 435440445 GlnGluAlaGluThrCysGluThrProAspProAsnSerIleAsnIle 450455460 ThrThrAlaLeuAspTyrAspIleAspProAsnAlaGlyProPheAla 465470475480 TyrAspLeuProLeuSerProValThrIleLysArgAsnTrpThrIle 485490495 ThrArgLeuAsnGlyAspPheAlaGlnLeuAsnLeuLysIleLysPhe 500505510 LeuGluAlaGlyIleTyrGluValProIleIleIleThrAspSerGly 515520525 AsnProProLysSerAsnLysSerIleLeuArgValArgValCysGln 530535540 CysAspPheAsnGlyAspCysThrAspValAspArg 545550555
(2) INFORMATION FOR SEQ ID NO:99: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 105 amino acids (B) TYPE: amino acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE: protein (xi) SEQUENCE DESCRIPTION: SEQ ID NO:99:
GluAspThrValTyrSerPheAspIleProGluAsnAlaGlnArgGly 151015 TyrGlnValGlyGlnIleValAlaArgAspAlaAspLeuGlyGlnAsn 202530 AlaGlnLeuSerTyrGlyValValSerAspTrpAlaAsnAspValPhe 354045 SerLeuAsnProGlnThrGlyMetLeuThrLeuThrAlaArgLeuAsp 505560 TyrGluGluValGlnHisTyrIleLeuIleValGlnAlaGlnAspAsn 65707580 GlyGlnProSerLeuSerThrThrIleThrValTyrCysAsnValLeu 859095 AspLeuAsnAspAsnAlaProIlePhe 100105
(2) INFORMATION FOR SEQ ID NO:100: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 7 amino acids (B) TYPE: amino acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE: protein (xi) SEQUENCE DESCRIPTION: SEQ ID NO:100:
AspXaaAspXaaGlyXaaAsn 15
(2) INFORMATION FOR SEQ ID NO:101: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 7 amino acids (B) TYPE: amino acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE: protein (xi) SEQUENCE DESCRIPTION: SEQ ID NO:101:
AlaXaaAspXaaGlyXaaPro 15
(2) INFORMATION FOR SEQ ID NO:102: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 4650 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE: cDNA (ix) FEATURE: (A) NAME/KEY: CDS (B) LOCATION: 495..4103 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:102:
CCTCTATTCGACATTCTCTTTGGATTGTTTTGCTATAACTTGAAATTTGGGATGTCACAA60 ACGAAACTGTCATCTGTTTCCGCCAAACTGTGGTTCTGCTAATCTCCCAGGCTGGCAGCA120 TTGGAGACTTGCTGACTTCTTTCATCCCCCACTCTTTTCACCTGAAATTCCTTTCCTTGG180 TTTTGCTCTAAGTCCTATGCTTCAGTCAGGGGCCAACCAAATCTCACTGCCTCCTTTTTA240 TCATGAAGCCTTTGATCACTGATAGTTCTTTTTATATCTTGAAAAATCACCCTTCCCAGT300 ACAGTTAATATTTAGTATCTCTACTCATCTTGGCACTTACTCACAGCTCCATAATTCAGT360 CGTTTTCGTACCTCTTCATGGTGATGGGGAGCCCTTTGGAGGTGGTGACTGTGCTTTATA420 CTCCTCATGATGCTTCACATGTGGCAGGCGTGGAGTGCCCGGAGGCGGCCCTCCTGATTC480 TGGGGCCTCCCAGGATGGAGCCCCTGAGGCACAGCCCAGGCCCTGGGGGG530 MetGluProLeuArgHisSerProGlyProGlyGly 1510 CAACGGCTACTGCTGCCCTCCATGCTGCTAGCACTGCTGCTCCTGCTG578 GlnArgLeuLeuLeuProSerMetLeuLeuAlaLeuLeuLeuLeuLeu 152025 GCTCCATCCCCAGGCCACGCCACTCGGGTAGTGTACAAGGTGCCGGAG626 AlaProSerProGlyHisAlaThrArgValValTyrLysValProGlu 303540 GAACAGCCACCCAACACCCTCATTGGGAGCCTCGCAGCCGACTATGGT674 GluGlnProProAsnThrLeuIleGlySerLeuAlaAlaAspTyrGly 45505560 TTTCCAGATGTGGGGCACCTGTACAAGCTAGAGGTGGGTGCCCCGTAC722 PheProAspValGlyHisLeuTyrLysLeuGluValGlyAlaProTyr 657075 CTTCGCGTGGATGGCAAGACAGGTGACATTTTCACCACCGAGACCTCC770 LeuArgValAspGlyLysThrGlyAspIlePheThrThrGluThrSer 808590 ATCGACCGTGAGGGGCTCCGTGAATGCCAGAACCAGCTCCCTGGTGAT818 IleAspArgGluGlyLeuArgGluCysGlnAsnGlnLeuProGlyAsp 95100105 CCCTGCATCCTGGAGTTTGAGGTATCTATCACAGACCTCGTGCAGAAT866 ProCysIleLeuGluPheGluValSerIleThrAspLeuValGlnAsn 110115120 GCGAGCCCCCGGCTGCTAGAGGGCCAGATAGAAGTACAAGACATCAAT914 AlaSerProArgLeuLeuGluGlyGlnIleGluValGlnAspIleAsn 125130135140 GACAACACACCCAACTTCGCCTCACCAGTCATCACTCTGGCCATCCCT962
AspAsnThrProAsnPheAlaSerProValIleThrLeuAlaIlePro 145150155 GAGAACACCAACATCGGCTCACTCTTCCCCATCCCGCTGGCTTCAGAC1010 GluAsnThrAsnIleGlySerLeuPheProIleProLeuAlaSerAsp 160165170 CGTGATGCTGGTCCCAACGGTGTGGCATCCTATGAGCTGCAGGTGGCA1058 ArgAspAlaGlyProAsnGlyValAlaSerTyrGluLeuGlnValAla 175180185 GAGGACCAGGAGGAGAAGCAACCACAGCTCATTGTGATGGGCAACCTG1106 GluAspGlnGluGluLysGlnProGlnLeuIleValMetGlyAsnLeu 190195200 GACCGTGAGCGCTGGGACTCCTATGACCTCACCATCAAGGTGCAGGAT1154 AspArgGluArgTrpAspSerTyrAspLeuThrIleLysValGlnAsp 205210215220 GGCGGCAGCCCCCCACGCGCCACGAGTGCCCTGCTGCGTGTCACCGTG1202 GlyGlySerProProArgAlaThrSerAlaLeuLeuArgValThrVal 225230235 CTTGACACCAATGACAACGCCCCCAAGTTTGAGCGGCCCTCCTATGAG1250 LeuAspThrAsnAspAsnAlaProLysPheGluArgProSerTyrGlu 240245250 GCCGAACTATCTGAGAATAGCCCCATAGGCCACTCGGTCATCCAGGTG1298 AlaGluLeuSerGluAsnSerProIleGlyHisSerValIleGlnVal 255260265 AAGGCCAATGACTCAGACCAAGGTGCCAATGCAGAAATCGAATACACA1346 LysAlaAsnAspSerAspGlnGlyAlaAsnAlaGluIleGluTyrThr 270275280 TTCCACCAGGCGCCCGAAGTTGTGAGGCGTCTTCTTCGACTGGACAGG1394 PheHisGlnAlaProGluValValArgArgLeuLeuArgLeuAspArg 285290295300 AACACTGGACTTATCACTGTTCAGGGCCCGGTGGACCGTGAGGACCTA1442 AsnThrGlyLeuIleThrValGlnGlyProValAspArgGluAspLeu 305310315 AGCACCCTGCGCTTCTCAGTGCTTGCTAAGGACCGAGGCACCAACCCC1490 SerThrLeuArgPheSerValLeuAlaLysAspArgGlyThrAsnPro 320325330 AAGAGTGCCCGTGCCCAGGTGGTTGTGACCGTGAAGGACATGAATGAC1538 LysSerAlaArgAlaGlnValValValThrValLysAspMetAsnAsp 335340345 AATGCCCCCACCATTGAGATCCGGGGCATAGGGCTAGTGACTCATCAA1586 AsnAlaProThrIleGluIleArgGlyIleGlyLeuValThrHisGln 350355360 GATGGGATGGCTAACATCTCAGAGGATGTGGCAGAGGAGACAGCTGTG1634 AspGlyMetAlaAsnIleSerGluAspValAlaGluGluThrAlaVal 365370375380 GCCCTGGTGCAGGTGTCTGACCGAGATGAGGGAGAGAATGCAGCTGTC1682 AlaLeuValGlnValSerAspArgAspGluGlyGluAsnAlaAlaVal 385390395 ACCTGTGTGGTGGCAGGTGATGTGCCCTTCCAGCTGCGCCAGGCCAGT1730 ThrCysValValAlaGlyAspValProPheGlnLeuArgGlnAlaSer 400405410 GAGACAGGCAGTGACAGCAAGAAGAAGTATTTCCTGCAGACTACCACC1778 GluThrGlySerAspSerLysLysLysTyrPheLeuGlnThrThrThr 415420425 
CCGCTAGACTACGAGAAGGTCAAAGACTACACCATTGAGATTGTGGCT1826 ProLeuAspTyrGluLysValLysAspTyrThrIleGluIleValAla 430435440 GTGGACTCTGGCAACCCCCCACTCTCCAGCACTAACTCCCTCAAGGTG1874 ValAspSerGlyAsnProProLeuSerSerThrAsnSerLeuLysVal 445450455460 CAGGTGGTGGACGTCAATGACAACGCACCTGTCTTCACTCAGAGTGTC1922 GlnValValAspValAsnAspAsnAlaProValPheThrGlnSerVal 465470475 ACTGAGGTCGCCTTCCCGGAAAACAACAAGCCTGGTGAAGTGATTGCT1970 ThrGluValAlaPheProGluAsnAsnLysProGlyGluValIleAla 480485490 GAGATCACTGCCAGTGATGCTGACTCTGGCTCTAATGCTGAGCTGGTT2018 GluIleThrAlaSerAspAlaAspSerGlySerAsnAlaGluLeuVal 495500505 TACTCTCTGGAGCCTGAGCCGGCTGCTAAGGGCCTCTTCACCATCTCA2066 TyrSerLeuGluProGluProAlaAlaLysGlyLeuPheThrIleSer 510515520 CCCGAGACTGGAGAGATCCAGGTGAAGACATCTCTGGATCGGGAACAG2114 ProGluThrGlyGluIleGlnValLysThrSerLeuAspArgGluGln 525530535540 CGGGAGAGCTATGAGTTGAAGGTGGTGGCAGCTGACCGGGGCAGTCCT2162 ArgGluSerTyrGluLeuLysValValAlaAlaAspArgGlySerPro 545550555 AGCCTCCAGGGCACAGCCACTGTCCTTGTCAATGTGCTGGACTGCAAT2210 SerLeuGlnGlyThrAlaThrValLeuValAsnValLeuAspCysAsn 560565570 GACAATGACCCCAAATTTATGCTGAGTGGCTACAACTTCTCAGTGATG2258 AspAsnAspProLysPheMetLeuSerGlyTyrAsnPheSerValMet 575580585 GAGAACATGCCAGCACTGAGTCCAGTGGGCATGGTGACTGTCATTGAT2306 GluAsnMetProAlaLeuSerProValGlyMetValThrValIleAsp 590595600 GGAGACAAGGGGGAGAATGCCCAGGTGCAGCTCTCAGTGGAGCAGGAC2354 GlyAspLysGlyGluAsnAlaGlnValGlnLeuSerValGluGlnAsp 605610615620 AACGGTGACTTTGTTATCCAGAATGGCACAGGCACCATCCTATCCAGC2402 AsnGlyAspPheValIleGlnAsnGlyThrGlyThrIleLeuSerSer 625630635 CTGAGCTTTGATCGAGAGCAACAAAGCACCTACACCTTCCAGCTGAAG2450 LeuSerPheAspArgGluGlnGlnSerThrTyrThrPheGlnLeuLys 640645650 GCAGTGGATGGTGGCGTCCCACCTCGCTCAGCTTACGTTGGTGTCACC2498 AlaValAspGlyGlyValProProArgSerAlaTyrValGlyValThr 655660665 ATCAATGTGCTGGACGAGAATGACAACGCACCCTATATCACTGCCCCT2546 IleAsnValLeuAspGluAsnAspAsnAlaProTyrIleThrAlaPro 670675680 TCTAACACCTCTCACAAGCTGCTGACCCCCCAGACACGTCTTGGTGAG2594 SerAsnThrSerHisLysLeuLeuThrProGlnThrArgLeuGlyGlu 685690695700 ACGGTCAGCCAGGTGGCAGCCGAGGACTTTGACTCTGGTGTCAATGCC2642 
ThrValSerGlnValAlaAlaGluAspPheAspSerGlyValAsnAla 705710715 GAGCTGATCTACAGCATTGCAGGTGGCAACCCTTATGGACTCTTCCAG2690 GluLeuIleTyrSerIleAlaGlyGlyAsnProTyrGlyLeuPheGln 720725730 ATTGGGTCACATTCAGGTGCCATCACCCTGGAGAAGGAGATTGAGCGG2738 IleGlySerHisSerGlyAlaIleThrLeuGluLysGluIleGluArg 735740745 CGCCACCATGGGCTACACCGCCTGGTGGTGAAGGTCAGTGACCGCGGC2786 ArgHisHisGlyLeuHisArgLeuValValLysValSerAspArgGly 750755760 AAGCCCCCACGCTATGGCACAGCCTTGGTCCATCTTTATGTCAATGAG2834 LysProProArgTyrGlyThrAlaLeuValHisLeuTyrValAsnGlu 765770775780 ACTCTGGCCAACCGCACGCTGCTGGAGACCCTCCTGGGCCACAGCCTG2882 ThrLeuAlaAsnArgThrLeuLeuGluThrLeuLeuGlyHisSerLeu 785790795 GACACGCCGCTGGATATTGACATTGCTGGGGATCCAGAATATGAGCGC2930 AspThrProLeuAspIleAspIleAlaGlyAspProGluTyrGluArg 800805810 TCCAAGCAGCGTGGCAACATTCTCTTTGGTGTGGTGGCTGGTGTGGTG2978 SerLysGlnArgGlyAsnIleLeuPheGlyValValAlaGlyValVal 815820825 GCCGTGGCCTTGCTCATCGCCCTGGCGGTTCTTGTGCGCTACTGCAGA3026 AlaValAlaLeuLeuIleAlaLeuAlaValLeuValArgTyrCysArg 830835840 CAGCGGGAGGCCAAAAGTGGTTACCAGGCTGGTAAGAAGGAGACCAAG3074 GlnArgGluAlaLysSerGlyTyrGlnAlaGlyLysLysGluThrLys 845850855860 GACCTGTATGCCCCCAAGCCCAGTGGCAAGGCCTCCAAGGGAAACAAA3122 AspLeuTyrAlaProLysProSerGlyLysAlaSerLysGlyAsnLys 865870875 AGCAAAGGCAAGAAGAGCAAGTCCCCAAAGCCCGTGAAGCCAGTGGAG3170 SerLysGlyLysLysSerLysSerProLysProValLysProValGlu 880885890 GACGAGGATGAGGCCGGGCTGCAGAAGTCCCTCAAGTTCAACCTGATG3218 AspGluAspGluAlaGlyLeuGlnLysSerLeuLysPheAsnLeuMet 895900905 AGCGATGCCCCTGGGGACAGTCCCCGCATCCACCTGCCCCTCAACTAC3266 SerAspAlaProGlyAspSerProArgIleHisLeuProLeuAsnTyr 910915920 CCACCAGGCAGCCCTGACCTGGGCCGCCACTATCGCTCTAACTCCCCA3314 ProProGlySerProAspLeuGlyArgHisTyrArgSerAsnSerPro 925930935940 CTGCCTTCCATCCAGCTGCAGCCCCAGTCACCCTCAGCCTCCAAGAAG3362 LeuProSerIleGlnLeuGlnProGlnSerProSerAlaSerLysLys 945950955 CACCAGGTGGTACAGGACCTGCCACCTGCAAACACATTCGTGGGCACC3410 HisGlnValValGlnAspLeuProProAlaAsnThrPheValGlyThr 960965970 GGGGACACCACGTCCACGGGCTCTGAGCAGTACTCCGACTACAGCTAC3458 GlyAspThrThrSerThrGlySerGluGlnTyrSerAspTyrSerTyr 975980985 
CGCACCAACCCCCCCAAATACCCCAGCAAGCAGTTACCTCACCGCCGC3506 ArgThrAsnProProLysTyrProSerLysGlnLeuProHisArgArg 9909951000 GTCACCTTCTCGGCCACCAGCCAGGCCCAGGAGCTGCAGGACCCATCC3554 ValThrPheSerAlaThrSerGlnAlaGlnGluLeuGlnAspProSer 1005101010151020 CAGCACAGTTACTATGACAGTGGCCTGGAGGAGTCTGAGACGCCGTCC3602 GlnHisSerTyrTyrAspSerGlyLeuGluGluSerGluThrProSer 102510301035 AGCAAGTCATCCTCAGGGCCTCGACTCGGTCCCCTGGCCCTGCCTGAG3650 SerLysSerSerSerGlyProArgLeuGlyProLeuAlaLeuProGlu 104010451050 GATCACTATGAGCGCACCACCCCTGATGGCAGCATAGGAGAGATGGAG3698 AspHisTyrGluArgThrThrProAspGlySerIleGlyGluMetGlu 105510601065 CACCCCGAGAATGACCTTCGCCCTTTGCCTGATGTCGCCATGACAGGC3746 HisProGluAsnAspLeuArgProLeuProAspValAlaMetThrGly 107010751080 ACATGTACCCGGGAGTGCAGTGAGTTTGGCCACTCTGACACATGCTGG3794 ThrCysThrArgGluCysSerGluPheGlyHisSerAspThrCysTrp 1085109010951100 ATGCCTGGCCAGTCATCTCCCAGCCGCCGGACCAAGAGCAGCGCCCTC3842 MetProGlyGlnSerSerProSerArgArgThrLysSerSerAlaLeu 110511101115 AAACTCTCCACCTTCATGCCTTACCAGGACCGAGGAGGGCAGGAGCCT3890 LysLeuSerThrPheMetProTyrGlnAspArgGlyGlyGlnGluPro 112011251130 GCGGGCGCCGGCAGCCCCAGCCCCCCGGAAGACCGGAACACCAAAACG3938 AlaGlyAlaGlySerProSerProProGluAspArgAsnThrLysThr 113511401145 GCCCCCGTGCGCCTCCTGCCCTCCTACAGTGCCTTCTCCCACAGTAGC3986 AlaProValArgLeuLeuProSerTyrSerAlaPheSerHisSerSer 115011551160 CATGATTCCTGCAAGGACTCGGCCACCTTGGAGGAAATCCCCCTGACC4034 HisAspSerCysLysAspSerAlaThrLeuGluGluIleProLeuThr 1165117011751180 CAGACCTCGGACTTCCCACCCGCAGCCACACCGGCATCTGCCCAGACG4082 GlnThrSerAspPheProProAlaAlaThrProAlaSerAlaGlnThr 118511901195 GCCAAGCGCGAGATCTACCTGTGAGCCCCCTACTGGCCGGCCCCCCTCCCC4133 AlaLysArgGluIleTyrLeu 1200 CAGCGCCGGCCAGCTCCCAAATGCCCATTCCAGGGCCTCACTCTCCACCCCTTCAGCGTG4193 GACTTCCTGCCAGGGCCCAAGTGGGGGTATCACTGACCTCATGACCACGCTGGCCCTTCT4253 CCCATGCAGGGTCCAGGTCCTCTCCCCTCATTTCCATCTCCCAGCCCAGGGGCCCCTTCC4313 CCTTTATGGGGCTTCCCCCAGCTGATGCCCAAGAGGGCTCCTCTGCAATGACTGGGCTCC4373 TTCCCTTGACTTCCAGGGAGCACCCCCTCGATTTGGGCAGATGGTGGAGTCAAGGGTGGG4433 CAGCGTACTTCTAACTCATTGTTTCCCTCATGGCCGACCAGGGCGGGGATAGCATGCCCA4493 
ATTTTAGCCCTGAAGCAGGGCTGAACTGGGGAGCCCCTTTCCCTGGGAGCTCCCAGAGGA4553 AACTCTTGACCACCAGTGGCTCCCTGAAGGGCTTTTGTTACCAAAGGTGGGGTAGGGACG4613 GGGGTGGGAGTGGAGCGGAGGCCTTGTTTTCCCGTGG4650 (2) INFORMATION FOR SEQ ID NO:103: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 1203 amino acids (B) TYPE: amino acid (D) TOPOLOGY: linear (ii) MOLECULE TYPE: protein (xi) SEQUENCE DESCRIPTION: SEQ ID NO:103: MetGluProLeuArgHisSerProGlyProGlyGlyGlnArgLeuLeu 151015 LeuProSerMetLeuLeuAlaLeuLeuLeuLeuLeuAlaProSerPro 202530 GlyHisAlaThrArgValValTyrLysValProGluGluGlnProPro 354045 AsnThrLeuIleGlySerLeuAlaAlaAspTyrGlyPheProAspVal 505560 GlyHisLeuTyrLysLeuGluValGlyAlaProTyrLeuArgValAsp 65707580 GlyLysThrGlyAspIlePheThrThrGluThrSerIleAspArgGlu 859095 GlyLeuArgGluCysGlnAsnGlnLeuProGlyAspProCysIleLeu 100105110 GluPheGluValSerIleThrAspLeuValGlnAsnAlaSerProArg 115120125 LeuLeuGluGlyGlnIleGluValGlnAspIleAsnAspAsnThrPro 130135140 AsnPheAlaSerProValIleThrLeuAlaIleProGluAsnThrAsn 145150155160 IleGlySerLeuPheProIleProLeuAlaSerAspArgAspAlaGly 165170175 ProAsnGlyValAlaSerTyrGluLeuGlnValAlaGluAspGlnGlu 180185190 GluLysGlnProGlnLeuIleValMetGlyAsnLeuAspArgGluArg 195200205 TrpAspSerTyrAspLeuThrIleLysValGlnAspGlyGlySerPro 210215220 ProArgAlaThrSerAlaLeuLeuArgValThrValLeuAspThrAsn 225230235240 AspAsnAlaProLysPheGluArgProSerTyrGluAlaGluLeuSer 245250255 GluAsnSerProIleGlyHisSerValIleGlnValLysAlaAsnAsp 260265270 SerAspGlnGlyAlaAsnAlaGluIleGluTyrThrPheHisGlnAla 275280285 ProGluValValArgArgLeuLeuArgLeuAspArgAsnThrGlyLeu 290295300 IleThrValGlnGlyProValAspArgGluAspLeuSerThrLeuArg 305310315320 PheSerValLeuAlaLysAspArgGlyThrAsnProLysSerAlaArg 325330335 AlaGlnValValValThrValLysAspMetAsnAspAsnAlaProThr 340345350 IleGluIleArgGlyIleGlyLeuValThrHisGlnAspGlyMetAla 355360365 AsnIleSerGluAspValAlaGluGluThrAlaValAlaLeuValGln 370375380 ValSerAspArgAspGluGlyGluAsnAlaAlaValThrCysValVal 385390395400 AlaGlyAspValProPheGlnLeuArgGlnAlaSerGluThrGlySer 405410415 AspSerLysLysLysTyrPheLeuGlnThrThrThrProLeuAspTyr 420425430 
GluLysValLysAspTyrThrIleGluIleValAlaValAspSerGly 435440445 AsnProProLeuSerSerThrAsnSerLeuLysValGlnValValAsp 450455460 ValAsnAspAsnAlaProValPheThrGlnSerValThrGluValAla 465470475480 PheProGluAsnAsnLysProGlyGluValIleAlaGluIleThrAla 485490495 SerAspAlaAspSerGlySerAsnAlaGluLeuValTyrSerLeuGlu 500505510 ProGluProAlaAlaLysGlyLeuPheThrIleSerProGluThrGly 515520525 GluIleGlnValLysThrSerLeuAspArgGluGlnArgGluSerTyr 530535540 GluLeuLysValValAlaAlaAspArgGlySerProSerLeuGlnGly 545550555560 ThrAlaThrValLeuValAsnValLeuAspCysAsnAspAsnAspPro 565570575 LysPheMetLeuSerGlyTyrAsnPheSerValMetGluAsnMetPro 580585590 AlaLeuSerProValGlyMetValThrValIleAspGlyAspLysGly 595600605 GluAsnAlaGlnValGlnLeuSerValGluGlnAspAsnGlyAspPhe 610615620 ValIleGlnAsnGlyThrGlyThrIleLeuSerSerLeuSerPheAsp 625630635640 ArgGluGlnGlnSerThrTyrThrPheGlnLeuLysAlaValAspGly 645650655 GlyValProProArgSerAlaTyrValGlyValThrIleAsnValLeu 660665670 AspGluAsnAspAsnAlaProTyrIleThrAlaProSerAsnThrSer 675680685 HisLysLeuLeuThrProGlnThrArgLeuGlyGluThrValSerGln 690695700 ValAlaAlaGluAspPheAspSerGlyValAsnAlaGluLeuIleTyr 705710715720 SerIleAlaGlyGlyAsnProTyrGlyLeuPheGlnIleGlySerHis 725730735 SerGlyAlaIleThrLeuGluLysGluIleGluArgArgHisHisGly 740745750 LeuHisArgLeuValValLysValSerAspArgGlyLysProProArg 755760765 TyrGlyThrAlaLeuValHisLeuTyrValAsnGluThrLeuAlaAsn 770775780 ArgThrLeuLeuGluThrLeuLeuGlyHisSerLeuAspThrProLeu 785790795800 AspIleAspIleAlaGlyAspProGluTyrGluArgSerLysGlnArg 805810815 GlyAsnIleLeuPheGlyValValAlaGlyValValAlaValAlaLeu 820825830 LeuIleAlaLeuAlaValLeuValArgTyrCysArgGlnArgGluAla 835840845 LysSerGlyTyrGlnAlaGlyLysLysGluThrLysAspLeuTyrAla 850855860 ProLysProSerGlyLysAlaSerLysGlyAsnLysSerLysGlyLys 865870875880 LysSerLysSerProLysProValLysProValGluAspGluAspGlu 885890895 AlaGlyLeuGlnLysSerLeuLysPheAsnLeuMetSerAspAlaPro 900905910 GlyAspSerProArgIleHisLeuProLeuAsnTyrProProGlySer 915920925 ProAspLeuGlyArgHisTyrArgSerAsnSerProLeuProSerIle 930935940 GlnLeuGlnProGlnSerProSerAlaSerLysLysHisGlnValVal 945950955960 
GlnAspLeuProProAlaAsnThrPheValGlyThrGlyAspThrThr 965970975 SerThrGlySerGluGlnTyrSerAspTyrSerTyrArgThrAsnPro 980985990 ProLysTyrProSerLysGlnLeuProHisArgArgValThrPheSer 99510001005 AlaThrSerGlnAlaGlnGluLeuGlnAspProSerGlnHisSerTyr 101010151020 TyrAspSerGlyLeuGluGluSerGluThrProSerSerLysSerSer 1025103010351040 SerGlyProArgLeuGlyProLeuAlaLeuProGluAspHisTyrGlu 104510501055 ArgThrThrProAspGlySerIleGlyGluMetGluHisProGluAsn 106010651070 AspLeuArgProLeuProAspValAlaMetThrGlyThrCysThrArg 107510801085 GluCysSerGluPheGlyHisSerAspThrCysTrpMetProGlyGln 109010951100 SerSerProSerArgArgThrLysSerSerAlaLeuLysLeuSerThr 1105111011151120 PheMetProTyrGlnAspArgGlyGlyGlnGluProAlaGlyAlaGly 112511301135 SerProSerProProGluAspArgAsnThrLysThrAlaProValArg 114011451150 LeuLeuProSerTyrSerAlaPheSerHisSerSerHisAspSerCys 115511601165 LysAspSerAlaThrLeuGluGluIleProLeuThrGlnThrSerAsp 117011751180 PheProProAlaAlaThrProAlaSerAlaGlnThrAlaLysArgGlu 1185119011951200 IleTyrLeu (2) INFORMATION FOR SEQ ID NO:104: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 2789 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE: cDNA (ix) FEATURE: (A) NAME/KEY: CDS (B) LOCATION: 115..2622 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:104: CGAAAGCCATGTCGGACTCGTCGCCCAGCGCCCAAGCGCTAACCCGCTGAAAGTTTCTCA60 GCGAAATCTCAGGGACGATCTGGACCCCGCTGAGAGGAACTGCTTTTGAGTGAGATG117 Met 1 GTCCCAGAGGCCTGGAGGAGCGGACTGGTAAGCACCGGGAGGGTAGTG165 ValProGluAlaTrpArgSerGlyLeuValSerThrGlyArgValVal 51015 GGAGTTTTGCTTCTGCTTGGTGCCTTGAACAAGGCTTCCACGGTCATT213 GlyValLeuLeuLeuLeuGlyAlaLeuAsnLysAlaSerThrValIle 202530 CACTATGAGATCCCGGAGGAAAGAGAGAAGGGTTTCGCTGTGGGCAAC261 HisTyrGluIleProGluGluArgGluLysGlyPheAlaValGlyAsn 354045 GTGGTCGCGAACCTTGGTTTGGATCTCGGTAGCCTCTCAGCCCGCAGG309 ValValAlaAsnLeuGlyLeuAspLeuGlySerLeuSerAlaArgArg 50556065 TTCCCGGTGGTGTCTGGAGCTAGCCGAAGATTCTTTGAGGTGAACCGG357 PheProValValSerGlyAlaSerArgArgPhePheGluValAsnArg 707580 GAGACCGGAGAGATGTTTGTGAACGACCGTCTGGATCGAGAGGAGCTG405 
GluThrGlyGluMetPheValAsnAspArgLeuAspArgGluGluLeu 859095 TGTGGGACACTGCCCTCTTGCACTGTAACTCTGGAGTTGGTAGTGGAG453 CysGlyThrLeuProSerCysThrValThrLeuGluLeuValValGlu 100105110 AACCCGCTGGAGCTGTTCAGCGTGGAAGTGGTGATCCAGGACATCAAC501 AsnProLeuGluLeuPheSerValGluValValIleGlnAspIleAsn 115120125 GACAACAATCCTGCTTTCCCTACCCAGGAAATGAAATTGGAGATTAGC549 AspAsnAsnProAlaPheProThrGlnGluMetLysLeuGluIleSer 130135140145 GAGGCCGTGGCTCCGGGGACGCGCTTTCCGCTCGAGAGCGCGCACGAT597 GluAlaValAlaProGlyThrArgPheProLeuGluSerAlaHisAsp 150155160 CCCGATCTGGGAAGCAACTCTTTACAAACCTATGAGCTGAGCCGAAAT645 ProAspLeuGlySerAsnSerLeuGlnThrTyrGluLeuSerArgAsn 165170175 GAATACTTTGCGCTTCGCGTGCAGACGCGGGAGGACAGCACCAAGTAC693 GluTyrPheAlaLeuArgValGlnThrArgGluAspSerThrLysTyr 180185190 GCGGAGCTGGTGTTGGAGCGCGCCCTGGACCGAGAACGGGAGCCTAGT741 AlaGluLeuValLeuGluArgAlaLeuAspArgGluArgGluProSer 195200205 CTCCAGTTAGTGCTGACGGCGTTGGACGGAGGGACCCCAGCTCTCTCC789 LeuGlnLeuValLeuThrAlaLeuAspGlyGlyThrProAlaLeuSer 210215220225 GCCAGCCTGCCTATTCACATCAAGGTGCTGGACGCGAATGACAATGCG837 AlaSerLeuProIleHisIleLysValLeuAspAlaAsnAspAsnAla 230235240 CCTGTCTTCAACCAGTCCTTGTACCGGGCGCGCGTTCCTGGAGGATGC885 ProValPheAsnGlnSerLeuTyrArgAlaArgValProGlyGlyCys 245250255 ACCTCCGGCACGCGCGTGGTACAAGTCCTTGCAACGGATCTGGATGAA933 ThrSerGlyThrArgValValGlnValLeuAlaThrAspLeuAspGlu 260265270 GGCCCCAACGGTGAAATTATTTACTCCTTCGGCAGCCACAACCGCGCC981 GlyProAsnGlyGluIleIleTyrSerPheGlySerHisAsnArgAla 275280285 GGCGTGCGGCAACTATTCGCCTTAGACCTTGTAACCGGGATGCTGACA1029 GlyValArgGlnLeuPheAlaLeuAspLeuValThrGlyMetLeuThr 290295300305 ATCAAGGGTCGGCTGGACTTCGAGGACACCAAACTCCATGAGATTTAC1077 IleLysGlyArgLeuAspPheGluAspThrLysLeuHisGluIleTyr 310315320 ATCCAGGCCAAAGACAAGGGCGCCAATCCCGAAGGAGCACATTGCAAA1125 IleGlnAlaLysAspLysGlyAlaAsnProGluGlyAlaHisCysLys 325330335 GTGTTGGTGGAGGTTGTGGATGTGAATGACAACGCCCCGGAGATCACA1173 ValLeuValGluValValAspValAsnAspAsnAlaProGluIleThr 340345350 GTCACCTCCGTGTACAGCCCAGTACCCGAGGATGCCTCTGGGACTGTC1221 ValThrSerValTyrSerProValProGluAspAlaSerGlyThrVal 355360365 
ATCGCTTTGCTCAGTGTGACTGACCTGGATGCTGGCGAGAACGGGCTG1269 IleAlaLeuLeuSerValThrAspLeuAspAlaGlyGluAsnGlyLeu 370375380385 GTGACCTGCGAAGTTCCACCGGGTCTCCCTTTCAGCCTTACTTCTTCC1317 ValThrCysGluValProProGlyLeuProPheSerLeuThrSerSer 390395400 CTCAAGAATTACTTCACTTTGAAAACCAGTGCAGACCTGGATCGGGAG1365 LeuLysAsnTyrPheThrLeuLysThrSerAlaAspLeuAspArgGlu 405410415 ACTGTGCCAGAATACAACCTCAGCATCACCGCCCGAGACGCCGGAACC1413 ThrValProGluTyrAsnLeuSerIleThrAlaArgAspAlaGlyThr 420425430 CCTTCCCTCTCAGCCCTTACAATAGTGCGTGTTCAAGTGTCCGACATC1461 ProSerLeuSerAlaLeuThrIleValArgValGlnValSerAspIle 435440445 AATGACAACCCTCCACAATCTTCTCAATCTTCCTACGACGTTTACATT1509 AsnAspAsnProProGlnSerSerGlnSerSerTyrAspValTyrIle 450455460465 GAAGAAAACAACCTCCCCGGGGCTCCAATACTAAACCTAAGTGTCTGG1557 GluGluAsnAsnLeuProGlyAlaProIleLeuAsnLeuSerValTrp 470475480 GACCCCGACGCCCCGCAGAATGCTCGGCTTTCTTTCTTTCTCTTGGAG1605 AspProAspAlaProGlnAsnAlaArgLeuSerPhePheLeuLeuGlu 485490495 CAAGGAGCTGAAACCGGGCTAGTGGGTCGCTATTTCACAATAAATCGT1653 GlnGlyAlaGluThrGlyLeuValGlyArgTyrPheThrIleAsnArg 500505510 GACAATGGCATAGTGTCATCCTTAGTGCCCCTAGACTATGAGGATCGG1701 AspAsnGlyIleValSerSerLeuValProLeuAspTyrGluAspArg 515520525 CGGGAATTTGAATTAACAGCTCATATCAGCGATGGGGGCACCCCGGTC1749 ArgGluPheGluLeuThrAlaHisIleSerAspGlyGlyThrProVal 530535540545 CTAGCCACCAACATCAGCGTGAACATATTTGTCACTGATCGCAATGAC1797 LeuAlaThrAsnIleSerValAsnIlePheValThrAspArgAsnAsp 550555560 AATGCCCCCCAGGTCCTATATCCTCGGCCAGGTGGGAGCTCGGTGGAG1845 AsnAlaProGlnValLeuTyrProArgProGlyGlySerSerValGlu 565570575 ATGCTGCCTCGAGGTACCTCAGCTGGCCACCTAGTGTCACGGGTGGTA1893 MetLeuProArgGlyThrSerAlaGlyHisLeuValSerArgValVal 580585590 GGCTGGGACGCGGATGCAGGGCACAATGCCTGGCTCTCCTACAGTCTC1941 GlyTrpAspAlaAspAlaGlyHisAsnAlaTrpLeuSerTyrSerLeu 595600605 TTTGGATCCCCTAACCAGAGCCTTTTTGCCATAGGGCTGCACACTGGT1989 PheGlySerProAsnGlnSerLeuPheAlaIleGlyLeuHisThrGly 610615620625 CAAATCAGTACTGCCCGTCCAGTCCAAGACACAGATTCACCCAGGCAG2037 GlnIleSerThrAlaArgProValGlnAspThrAspSerProArgGln 630635640 ACTCTCACTGTCTTGATCAAAGACAATGGGGAGCCTTCGCTCTCCACC2085 
ThrLeuThrValLeuIleLysAspAsnGlyGluProSerLeuSerThr 645650655 ACTGCTACCCTCACTGTGTCAGTAACCGAGGACTCTCCTGAAGCCCGA2133 ThrAlaThrLeuThrValSerValThrGluAspSerProGluAlaArg 660665670 GCCGAGTTCCCCTCTGGCTCTGCCCCCCGGGAGCAGAAAAAAAATCTC2181 AlaGluPheProSerGlySerAlaProArgGluGlnLysLysAsnLeu 675680685 ACCTTTTATCTACTTCTTTCTCTAATCCTGGTTTCTGTGGGCTTCGTG2229 ThrPheTyrLeuLeuLeuSerLeuIleLeuValSerValGlyPheVal 690695700705 GTCACAGTGTTCGGAGTAATCATATTCAAAGTTTACAAGTGGAAGCAG2277 ValThrValPheGlyValIleIlePheLysValTyrLysTrpLysGln 710715720 TCTAGAGACCTATACCGAGCCCCGGTGAGCTCACTGTACCGAACACCA2325 SerArgAspLeuTyrArgAlaProValSerSerLeuTyrArgThrPro 725730735 GGGCCCTCCTTGCACGCGGACGCCGTGCGGGGAGGCCTGATGTCGCCG2373 GlyProSerLeuHisAlaAspAlaValArgGlyGlyLeuMetSerPro 740745750 CACCTTTACCATCAGGTGTATCTCACCACGGACTCCCGCCGCAGCGAC2421 HisLeuTyrHisGlnValTyrLeuThrThrAspSerArgArgSerAsp 755760765 CCGCTGCTGAAGAAACCTGGTGCAGCCAGTCCACTGGCCAGCCGCCAG2469 ProLeuLeuLysLysProGlyAlaAlaSerProLeuAlaSerArgGln 770775780785 AACACGCTGCGGAGCTGTGATCCGGTGTTCTATAGGCAGGTGTTGGGT2517 AsnThrLeuArgSerCysAspProValPheTyrArgGlnValLeuGly 790795800 GCAGAGAGCGCCCCTCCCGGACAGGTAAGGTTTAGCAAGTCATGCTTG2565 AlaGluSerAlaProProGlyGlnValArgPheSerLysSerCysLeu 805810815 ACCCTGTTAGTGCCTTTTTATTCCTACATCATATTGAGAAGGCTGGAG2613 ThrLeuLeuValProPheTyrSerTyrIleIleLeuArgArgLeuGlu 820825830 CTGTTTTTTTAGTGATGAAGATGTTTTCCTGGTGATGCATTCACACTTT2662 LeuPhePhe 835 CAACTGGCTCTTCCTAGATCAAAGTTAGTGCCTTTGTGAGATGGTGGCCTGCCAGAGTGT2722 GGTTTGTGGTCCCATTTCAGGGGGAAGATACTTGACTCATCTGTGGACCTAATTCACATC2782 CTCAGCG2789 (2) INFORMATION FOR SEQ ID NO:105: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 836 amino acids (B) TYPE: amino acid (D) TOPOLOGY: linear (ii) MOLECULE TYPE: protein (xi) SEQUENCE DESCRIPTION: SEQ ID NO:105: MetValProGluAlaTrpArgSerGlyLeuValSerThrGlyArgVal 151015 ValGlyValLeuLeuLeuLeuGlyAlaLeuAsnLysAlaSerThrVal 202530 IleHisTyrGluIleProGluGluArgGluLysGlyPheAlaValGly 354045 AsnValValAlaAsnLeuGlyLeuAspLeuGlySerLeuSerAlaArg 505560 ArgPheProValValSerGlyAlaSerArgArgPhePheGluValAsn 65707580 
ArgGluThrGlyGluMetPheValAsnAspArgLeuAspArgGluGlu 859095 LeuCysGlyThrLeuProSerCysThrValThrLeuGluLeuValVal 100105110 GluAsnProLeuGluLeuPheSerValGluValValIleGlnAspIle 115120125 AsnAspAsnAsnProAlaPheProThrGlnGluMetLysLeuGluIle 130135140 SerGluAlaValAlaProGlyThrArgPheProLeuGluSerAlaHis 145150155160 AspProAspLeuGlySerAsnSerLeuGlnThrTyrGluLeuSerArg 165170175 AsnGluTyrPheAlaLeuArgValGlnThrArgGluAspSerThrLys 180185190 TyrAlaGluLeuValLeuGluArgAlaLeuAspArgGluArgGluPro 195200205 SerLeuGlnLeuValLeuThrAlaLeuAspGlyGlyThrProAlaLeu 210215220 SerAlaSerLeuProIleHisIleLysValLeuAspAlaAsnAspAsn 225230235240 AlaProValPheAsnGlnSerLeuTyrArgAlaArgValProGlyGly 245250255 CysThrSerGlyThrArgValValGlnValLeuAlaThrAspLeuAsp 260265270 GluGlyProAsnGlyGluIleIleTyrSerPheGlySerHisAsnArg 275280285 AlaGlyValArgGlnLeuPheAlaLeuAspLeuValThrGlyMetLeu 290295300 ThrIleLysGlyArgLeuAspPheGluAspThrLysLeuHisGluIle 305310315320 TyrIleGlnAlaLysAspLysGlyAlaAsnProGluGlyAlaHisCys 325330335 LysValLeuValGluValValAspValAsnAspAsnAlaProGluIle 340345350 ThrValThrSerValTyrSerProValProGluAspAlaSerGlyThr 355360365 ValIleAlaLeuLeuSerValThrAspLeuAspAlaGlyGluAsnGly 370375380 LeuValThrCysGluValProProGlyLeuProPheSerLeuThrSer 385390395400 SerLeuLysAsnTyrPheThrLeuLysThrSerAlaAspLeuAspArg 405410415 GluThrValProGluTyrAsnLeuSerIleThrAlaArgAspAlaGly 420425430 ThrProSerLeuSerAlaLeuThrIleValArgValGlnValSerAsp 435440445 IleAsnAspAsnProProGlnSerSerGlnSerSerTyrAspValTyr 450455460 IleGluGluAsnAsnLeuProGlyAlaProIleLeuAsnLeuSerVal 465470475480 TrpAspProAspAlaProGlnAsnAlaArgLeuSerPhePheLeuLeu 485490495 GluGlnGlyAlaGluThrGlyLeuValGlyArgTyrPheThrIleAsn 500505510 ArgAspAsnGlyIleValSerSerLeuValProLeuAspTyrGluAsp 515520525 ArgArgGluPheGluLeuThrAlaHisIleSerAspGlyGlyThrPro 530535540 ValLeuAlaThrAsnIleSerValAsnIlePheValThrAspArgAsn 545550555560 AspAsnAlaProGlnValLeuTyrProArgProGlyGlySerSerVal 565570575 GluMetLeuProArgGlyThrSerAlaGlyHisLeuValSerArgVal 580585590 ValGlyTrpAspAlaAspAlaGlyHisAsnAlaTrpLeuSerTyrSer 595600605 
LeuPheGlySerProAsnGlnSerLeuPheAlaIleGlyLeuHisThr 610615620 GlyGlnIleSerThrAlaArgProValGlnAspThrAspSerProArg 625630635640 GlnThrLeuThrValLeuIleLysAspAsnGlyGluProSerLeuSer 645650655 ThrThrAlaThrLeuThrValSerValThrGluAspSerProGluAla 660665670 ArgAlaGluPheProSerGlySerAlaProArgGluGlnLysLysAsn 675680685 LeuThrPheTyrLeuLeuLeuSerLeuIleLeuValSerValGlyPhe 690695700 ValValThrValPheGlyValIleIlePheLysValTyrLysTrpLys 705710715720 GlnSerArgAspLeuTyrArgAlaProValSerSerLeuTyrArgThr 725730735 ProGlyProSerLeuHisAlaAspAlaValArgGlyGlyLeuMetSer 740745750 ProHisLeuTyrHisGlnValTyrLeuThrThrAspSerArgArgSer 755760765 AspProLeuLeuLysLysProGlyAlaAlaSerProLeuAlaSerArg 770775780 GlnAsnThrLeuArgSerCysAspProValPheTyrArgGlnValLeu 785790795800 GlyAlaGluSerAlaProProGlyGlnValArgPheSerLysSerCys 805810815 LeuThrLeuLeuValProPheTyrSerTyrIleIleLeuArgArgLeu 820825830 GluLeuPhePhe 835 (2) INFORMATION FOR SEQ ID NO:106: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 2751 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE: cDNA (ix) FEATURE: (A) NAME/KEY: CDS (B) LOCATION: 115..2160 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:106: CGAAAGCCATGTCGGACTCGTCGCCCAGCGCCCAAGCGCTAACCCGCTGAAAGTTTCTCA60 GCGAAATCTCAGGGACGATCTGGACCCCGCTGAGAGGAACTGCTTTTGAGTGAGATG117 Met 1 GTCCCAGAGGCCTGGAGGAGCGGACTGGTAAGCACCGGGAGGGTAGTG165 ValProGluAlaTrpArgSerGlyLeuValSerThrGlyArgValVal 51015 GGAGTTTTGCTTCTGCTTGGTGCCTTGAACAAGGCTTCCACGGTCATT213 GlyValLeuLeuLeuLeuGlyAlaLeuAsnLysAlaSerThrValIle 202530 CACTATGAGATCCCGGAGGAAAGAGAGAAGGGTTTCGCTGTGGGCAAC261 HisTyrGluIleProGluGluArgGluLysGlyPheAlaValGlyAsn 354045 GTGGTCGCGAACCTTGGTTTGGATCTCGGTAGCCTCTCAGCCCGCAGG309 ValValAlaAsnLeuGlyLeuAspLeuGlySerLeuSerAlaArgArg 50556065 TTCCCGGTGGTGTCTGGAGCTAGCCGAAGATTCTTTGAGGTGAACCGG357 PheProValValSerGlyAlaSerArgArgPhePheGluValAsnArg 707580 GAGACCGGAGAGATGTTTGTGAACGACCGTCTGGATCGAGAGGAGCTG405 GluThrGlyGluMetPheValAsnAspArgLeuAspArgGluGluLeu 859095 TGTGGGACACTGCCCTCTTGCACTGTAACTCTGGAGTTGGTAGTGGAG453 
CysGlyThrLeuProSerCysThrValThrLeuGluLeuValValGlu 100105110 AACCCGCTGGAGCTGTTCAGCGTGGAAGTGGTGATCCAGGACATCAAC501 AsnProLeuGluLeuPheSerValGluValValIleGlnAspIleAsn 115120125 GACAACAATCCTGCTTTCCCTACCCAGGAAATGAAATTGGAGATTAGC549 AspAsnAsnProAlaPheProThrGlnGluMetLysLeuGluIleSer 130135140145 GAGGCCGTGGCTCCGGGGACGCGCTTTCCGCTCGAGAGCGCGCACGAT597 GluAlaValAlaProGlyThrArgPheProLeuGluSerAlaHisAsp 150155160 CCCGATCTGGGAAGCAACTCTTTACAAACCTATGAGCTGAGCCGAAAT645 ProAspLeuGlySerAsnSerLeuGlnThrTyrGluLeuSerArgAsn 165170175 GAATACTTTGCGCTTCGCGTGCAGACGCGGGAGGACAGCACCAAGTAC693 GluTyrPheAlaLeuArgValGlnThrArgGluAspSerThrLysTyr 180185190 GCGGAGCTGGTGTTGGAGCGCGCCCTGGACCGAGAACGGGAGCCTAGT741 AlaGluLeuValLeuGluArgAlaLeuAspArgGluArgGluProSer 195200205 CTCCAGTTAGTGCTGACGGCGTTGGACGGAGGGACCCCAGCTCTCTCC789 LeuGlnLeuValLeuThrAlaLeuAspGlyGlyThrProAlaLeuSer 210215220225 GCCAGCCTGCCTATTCACATCAAGGTGCTGGACGCGAATGACAATGCG837 AlaSerLeuProIleHisIleLysValLeuAspAlaAsnAspAsnAla 230235240 CCTGTCTTCAACCAGTCCTTGTACCGGGCGCGCGTTCCTGGAGGATGC885 ProValPheAsnGlnSerLeuTyrArgAlaArgValProGlyGlyCys 245250255 ACCTCCGGCACGCGCGTGGTACAAGTCCTTGCAACGGATCTGGATGAA933 ThrSerGlyThrArgValValGlnValLeuAlaThrAspLeuAspGlu 260265270 GGCCCCAACGGTGAAATTATTTACTCCTTCGGCAGCCACAACCGCGCC981 GlyProAsnGlyGluIleIleTyrSerPheGlySerHisAsnArgAla 275280285 GGCGTGCGGCAACTATTCGCCTTAGACCTTGTAACCGGGATGCTGACA1029 GlyValArgGlnLeuPheAlaLeuAspLeuValThrGlyMetLeuThr 290295300305 ATCAAGGGTCGGCTGGACTTCGAGGACACCAAACTCCATGAGATTTAC1077 IleLysGlyArgLeuAspPheGluAspThrLysLeuHisGluIleTyr 310315320 ATCCAGGCCAAAGACAAGGGCGCCAATCCCGAAGGAGCACATTGCAAA1125 IleGlnAlaLysAspLysGlyAlaAsnProGluGlyAlaHisCysLys 325330335 GTGTTGGTGGAGGTTGTGGATGTGAATGACAACGCCCCGGAGATCACA1173 ValLeuValGluValValAspValAsnAspAsnAlaProGluIleThr 340345350 GTCACCTCCGTGTACAGCCCAGTACCCGAGGATGCCTCTGGGACTGTC1221 ValThrSerValTyrSerProValProGluAspAlaSerGlyThrVal 355360365 ATCGCTTTGCTCAGTGTGACTGACCTGGATGCTGGCGAGAACGGGCTG1269 IleAlaLeuLeuSerValThrAspLeuAspAlaGlyGluAsnGlyLeu 370375380385 
GTGACCTGCGAAGTTCCACCGGGTCTCCCTTTCAGCCTTACTTCTTCC1317 ValThrCysGluValProProGlyLeuProPheSerLeuThrSerSer 390395400 CTCAAGAATTACTTCACTTTGAAAACCAGTGCAGACCTGGATCGGGAG1365 LeuLysAsnTyrPheThrLeuLysThrSerAlaAspLeuAspArgGlu 405410415 ACTGTGCCAGAATACAACCTCAGCATCACCGCCCGAGACGCCGGAACC1413 ThrValProGluTyrAsnLeuSerIleThrAlaArgAspAlaGlyThr 420425430 CCTTCCCTCTCAGCCCTTACAATAGTGCGTGTTCAAGTGTCCGACATC1461 ProSerLeuSerAlaLeuThrIleValArgValGlnValSerAspIle 435440445 AATGACAACCCTCCACAATCTTCTCAATCTTCCTACGACGTTTACATT1509 AsnAspAsnProProGlnSerSerGlnSerSerTyrAspValTyrIle 450455460465 GAAGAAAACAACCTCCCCGGGGCTCCAATACTAAACCTAAGTGTCTGG1557 GluGluAsnAsnLeuProGlyAlaProIleLeuAsnLeuSerValTrp 470475480 GACCCCGACGCCCCGCAGAATGCTCGGCTTTCTTTCTTTCTCTTGGAG1605 AspProAspAlaProGlnAsnAlaArgLeuSerPhePheLeuLeuGlu 485490495 CAAGGAGCTGAAACCGGGCTAGTGGGTCGCTATTTCACAATAAATCGT1653 GlnGlyAlaGluThrGlyLeuValGlyArgTyrPheThrIleAsnArg 500505510 GACAATGGCATAGTGTCATCCTTAGTGCCCCTAGACTATGAGGATCGG1701 AspAsnGlyIleValSerSerLeuValProLeuAspTyrGluAspArg 515520525 CGGGAATTTGAATTAACAGCTCATATCAGCGATGGGGGCACCCCGGTC1749 ArgGluPheGluLeuThrAlaHisIleSerAspGlyGlyThrProVal 530535540545 CTAGCCACCAACATCAGCGTGAACATATTTGTCACTGATCGCAATGAC1797 LeuAlaThrAsnIleSerValAsnIlePheValThrAspArgAsnAsp 550555560 AATGCCCCCCAGGTCCTATATCCTCGGCCAGGTGGGAGCTCGGTGGAG1845 AsnAlaProGlnValLeuTyrProArgProGlyGlySerSerValGlu 565570575 ATGCTGCCTCGAGGTACCTCAGCTGGCCACCTAGTGTCACGGGTGGTA1893 MetLeuProArgGlyThrSerAlaGlyHisLeuValSerArgValVal 580585590 GGCTGGGACGCGGATGCAGGGCACAATGCCTGGCTCTCCTACAGTCTC1941 GlyTrpAspAlaAspAlaGlyHisAsnAlaTrpLeuSerTyrSerLeu 595600605 TTTGGATCCCCTAACCAGAGCCTTTTTGCCATAGGGCTGCACACTGGT1989 PheGlySerProAsnGlnSerLeuPheAlaIleGlyLeuHisThrGly 610615620625 CAAATCAGTACTGCCCGTCCAGTCCAAGACACAGATTCACCCAGGCAG2037 GlnIleSerThrAlaArgProValGlnAspThrAspSerProArgGln 630635640 ACTCTCACTGTCTTGATCAAAGACAATGGGGAGCCTTCGCTCTCCACC2085 ThrLeuThrValLeuIleLysAspAsnGlyGluProSerLeuSerThr 645650655 ACTGCTACCCTCACTGTGTCAGTAACCGAGGACTCTCCTGAAGCCCGA2133 
ThrAlaThrLeuThrValSerValThrGluAspSerProGluAlaArg 660665670 GCCGAGTTCCCCTCTGGCTCTGCCAGTTAAACCTTCTTTAATTATGG2180 AlaGluPheProSerGlySerAlaSer 675680 ATTAGCCATTAACATTTTTGAAACGTGGACCATTTAACCTCGGCCTACCCCCTCCAACTG2240 TCCTGGTGATGAGTTCATTAGCTAAGTTAAATTAATTGAACTTTGATCTAAACCAAAACA2300 AATCAGGAAAATAAAGCTGTAAAGGAACTTATCAAGCATTCCAAAACCAACTAGAAATTA2360 CTTGAAGTTTCGAGTGAGCATTGCCTGTGCCAGTATTCTTCATTATAGGATTATAAACTC2420 GTTTTTTTCCCAAAGCGCATGTCTACGCCAGGCAGAGGAGTAATTATTCAGCCAATTTCA2480 TGGATGTAACGATGGATATAAATAATTGATAGCACCTAGAGGCTTCCAGTTTGGGTGGAA2540 GGCTAAAAGTAGAGGGGAACTCACTCACTTGAGAAATGATATTTAAGTGAATAAATAGTT2600 CTCTTCTATGAAACTATTACTATTTAGTTCTCTGGAAAACTTAAGTGTATTAATGATTAG2660 AACATCAAATCCTAAGTAAAGAAATGACATTTTAAATATAAAAAGCCAAACTTTAAATAA2720 ATCATAGAGACCTCAGACATAATATAGGAAA2751 (2) INFORMATION FOR SEQ ID NO:107: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 682 amino acids (B) TYPE: amino acid (D) TOPOLOGY: linear (ii) MOLECULE TYPE: protein (xi) SEQUENCE DESCRIPTION: SEQ ID NO:107: MetValProGluAlaTrpArgSerGlyLeuValSerThrGlyArgVal 151015 ValGlyValLeuLeuLeuLeuGlyAlaLeuAsnLysAlaSerThrVal 202530 IleHisTyrGluIleProGluGluArgGluLysGlyPheAlaValGly 354045 AsnValValAlaAsnLeuGlyLeuAspLeuGlySerLeuSerAlaArg 505560 ArgPheProValValSerGlyAlaSerArgArgPhePheGluValAsn 65707580 ArgGluThrGlyGluMetPheValAsnAspArgLeuAspArgGluGlu 859095 LeuCysGlyThrLeuProSerCysThrValThrLeuGluLeuValVal 100105110 GluAsnProLeuGluLeuPheSerValGluValValIleGlnAspIle 115120125 AsnAspAsnAsnProAlaPheProThrGlnGluMetLysLeuGluIle 130135140 SerGluAlaValAlaProGlyThrArgPheProLeuGluSerAlaHis 145150155160 AspProAspLeuGlySerAsnSerLeuGlnThrTyrGluLeuSerArg 165170175 AsnGluTyrPheAlaLeuArgValGlnThrArgGluAspSerThrLys 180185190 TyrAlaGluLeuValLeuGluArgAlaLeuAspArgGluArgGluPro 195200205 SerLeuGlnLeuValLeuThrAlaLeuAspGlyGlyThrProAlaLeu 210215220 SerAlaSerLeuProIleHisIleLysValLeuAspAlaAsnAspAsn 225230235240 AlaProValPheAsnGlnSerLeuTyrArgAlaArgValProGlyGly 245250255 CysThrSerGlyThrArgValValGlnValLeuAlaThrAspLeuAsp 260265270 
GluGlyProAsnGlyGluIleIleTyrSerPheGlySerHisAsnArg 275280285 AlaGlyValArgGlnLeuPheAlaLeuAspLeuValThrGlyMetLeu 290295300 ThrIleLysGlyArgLeuAspPheGluAspThrLysLeuHisGluIle 305310315320 TyrIleGlnAlaLysAspLysGlyAlaAsnProGluGlyAlaHisCys 325330335 LysValLeuValGluValValAspValAsnAspAsnAlaProGluIle 340345350 ThrValThrSerValTyrSerProValProGluAspAlaSerGlyThr 355360365 ValIleAlaLeuLeuSerValThrAspLeuAspAlaGlyGluAsnGly 370375380 LeuValThrCysGluValProProGlyLeuProPheSerLeuThrSer 385390395400 SerLeuLysAsnTyrPheThrLeuLysThrSerAlaAspLeuAspArg 405410415 GluThrValProGluTyrAsnLeuSerIleThrAlaArgAspAlaGly 420425430 ThrProSerLeuSerAlaLeuThrIleValArgValGlnValSerAsp 435440445 IleAsnAspAsnProProGlnSerSerGlnSerSerTyrAspValTyr 450455460 IleGluGluAsnAsnLeuProGlyAlaProIleLeuAsnLeuSerVal 465470475480 TrpAspProAspAlaProGlnAsnAlaArgLeuSerPhePheLeuLeu 485490495 GluGlnGlyAlaGluThrGlyLeuValGlyArgTyrPheThrIleAsn 500505510 ArgAspAsnGlyIleValSerSerLeuValProLeuAspTyrGluAsp 515520525 ArgArgGluPheGluLeuThrAlaHisIleSerAspGlyGlyThrPro 530535540 ValLeuAlaThrAsnIleSerValAsnIlePheValThrAspArgAsn 545550555560 AspAsnAlaProGlnValLeuTyrProArgProGlyGlySerSerVal 565570575 GluMetLeuProArgGlyThrSerAlaGlyHisLeuValSerArgVal 580585590 ValGlyTrpAspAlaAspAlaGlyHisAsnAlaTrpLeuSerTyrSer 595600605 LeuPheGlySerProAsnGlnSerLeuPheAlaIleGlyLeuHisThr 610615620 GlyGlnIleSerThrAlaArgProValGlnAspThrAspSerProArg 625630635640 GlnThrLeuThrValLeuIleLysAspAsnGlyGluProSerLeuSer 645650655 ThrThrAlaThrLeuThrValSerValThrGluAspSerProGluAla 660665670 ArgAlaGluPheProSerGlySerAlaSer 675680 (2) INFORMATION FOR SEQ ID NO:108: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 2831 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE: cDNA (xi) SEQUENCE DESCRIPTION: SEQ ID NO:108: GAATTCGGCACGAGGCTGAACTGAGGGTGACGGACATAAACGACTATTCTCCAGTGTTCA60 GTGAAAGAGAAATGATACTGAGGATACCAGAAAACAGTGCTCGGGGAAATACATTCCCTT120 TAAACAATGCTCTGGACTCAGACGTAGATATCAACAATATCCAGACCTATAGGCTCAGCT180 
CAAACTCTCATTTCCTGGTTGTAACCCGCAACCGCAGTGATGGCAGGAAGTACCCAGAGC240 TGGTGCTGGAGAAAGAACTGGATCGAGAGGAGGAACCTGAGCTGAGGTTAACGCTGACAG300 CTTTGGATGGTGGCTCTCCTCCCCGGTCTGGGACGACACAGGTCCTCATTGAAGTAGTGG360 ACACCAACGATAATGCACCCGAGTTTCAGCAGCCAACATACCAAGTGCAAACTCCCGAGA420 ACAGTCCCACCGGCTCTCTGGTACTCACAGTCTCAGCCAATGACTTAGACAGTGGAGACT480 ATGGGAAAGTCTTGTACGCACTTTCGCAACCCTCAGAAGATATTAGCAAAACATTCGAGG540 TAAACCCTGTAACCGGGGAAATTCGCCTACGAAAAGAGGTGAATTTTGAAACTATTCCTT600 CGTATGAAGTGGTTATCAAGGGGACGGACGGGGGAGGTCTCTCAGGAAAATGCACTCTGT660 TACTGCAGGTGGTGGACGTGAATGACAATGCCCCAGAAGTGATGCTATCTGCGCTAACCA720 ACCCAGTCCCAGAAAATTCCCCCGATGAGGTAGTGGCTGTTTTCAGTGTTAGAGATCCTG780 ACTCTGGGAACAACGGAAAAGTGATTGCATCCATCGAGGAAGACCTGCCCTTTCTTCTAA840 AATCTTCAGGAAAGAACTTTTACACTTTAGTAACCAAGGGAGCACTTGACAGGGAAGAAA900 GAGAGCAATTGAACATCACCATCACAGTCACTGACCTGGGCATACCCAGGCTCACCACCC960 AACACACCATAACAGTGCAGGTGGCAGACATCAACGACAATGCCCCCTCCTTCACCCAAA1020 CCTCCTACACCATGTTTGTCCGCGAGAACAACAGCCCCGCCCTGCACATAGGCACCATCA1080 GCGCCACAGACTCAGACTCAGGATCCAATGCCCACATCACCTACTCGCTGCTACCGCCCC1140 AAGACCCACAGCTGGCCCTCGACTCGCTCATCTCCATCAATGTAGACAACGGGCAGCTGT1200 TCGCGCTCAGGGCGCTAGACTATGAGGCTCTGCAGGGCTTCGAGTTCCATGTGGGCGCCA1260 CAGACCAAGGCTCGCCCGCGCTCAGCAGCCAGGCTCTGGTGCACGTGGTGGTGTTGGACG1320 ACAATGACAATGCGCCCTTCGTGCTCTACCCGCTGCAAAACGCCTCTGCACCCTTCACTG1380 AGCTGCTGCCCAGGGCGGCAGAGCCTGGATACCTGGTTACCAAGGTGGTAGCTGTGGACC1440 GCGACTCTGGCCAGAATGCCTGGCTGTCATTCCAGCTGCTCAAGGCCACGGAGCCCGGGC1500 TGTTCAACGTATGGGCGCACAATGGCGAGGTACGCACCTCCAGGCTGCTGAGCGAGCGCG1560 ACGCACCCAAGCACAAGCTGCTGCTGTTGGTCAAGGACAATGGAGATCCTCCACGCTCTG1620 CCAGTGTTACTCTGCACGTGCTAGTGGTGGATGCCTTCTCTCAGCCCTACCTGCCTCTGC1680 CAGAGGTGGCGCACGACCCTGCACAAGAAGAAGATGCGCTAACACTCTACCTGGTCATAG1740 CTTTGGCATCTGTGTCTTCTCTCTTCCTCTTGTCTGTGCTGCTGTTCGTGGGGGTGAGGC1800 TCTGCAGGAGGGCCAGGGCAGCCTCTCTGAGTGCCTATTCTGTGCCTGAAGGCCACTTTC1860 CTGGCCAGCTGGTGGATGTCAGAGGTATGGGGACCCTGTCCCAGAGCTACCAGTATGATG1920 TATGTCTGATGGGGGATTCTTCTGGGACCAGCGAATTTAACTTCTTAAAGCCAGTTCTGC1980 
CTAGCTCTCTGCACCAGTGCTCTGGGAAAGAAATAGAGGAAAATTCCACACTCCAGAATA2040 GTTTTGGGTTTCATCATTAATAGAAAACTACTTTACAGATATTTAATTCCAAATATCATC2100 TTGTTGATTAACTAAAGTCTGTTCACATGTAGCTAGCTAGCAACGATTTTAATGTTCACT2160 TTACCCATCTTTTTTCAGGGTCATGTCTAAAGCTACAAGTTTGNCTTTACTTATACTTGT2220 CGCACAGAATNNNNNNNNNNTGGTGTATAAGTCACAGTCATGGGATACTGGCACAAGATG2280 GCAGCTTGATTGCTCAGTTATGGCTGCAAAGGGGNGCTTGAGTTTAGGGAATGTGTTAGA2340 GCTGGAATAAGTTTTCTGAGAAATGTGTAAGACAAATTTCTTTTGCACATTCCCTGTGTT2400 CCTGTACCCCTGTTTCCAGAACTACGAAATGTGTCATCAGAAGGCATGCTCACATTTTCC2460 CCTTTGTTTGCGTGACCCGGGTGCCAGAAATTAAATAAAATTAGCATGGAGTTCAATGCA2520 GCATTAAAACAAAGTTACTTCTACAAACCTTTTATTCGACGGTTAAAATTGTAACTTCCC2580 CACCCATGAGGCTGGCTGTAAGAACCAGTATGAATGGGTGTCTATCGCAACCTTATTTTC2640 AAAAATCAAACAAAAGGAGAAATGAGAGACCAAACAACACGCTACAGGAAAGATTTCATA2700 AGGATGTATGTATGGACACAAAAACTGGGATACAGACATTTTAAATCTGTTGGTACCACA2760 TGGTGGCGCTGCAGGCTAAAGAAATGCAAGGGAAATTAAAAAGAGGCTGAGCTAGAAGTC2820 AAAAAAAAAAA2831 (2) INFORMATION FOR SEQ ID NO:109: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 3353 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE: cDNA (ix) FEATURE: (A) NAME/KEY: CDS (B) LOCATION: 763..3123 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:109: GTATTTTTCCACAGTTTAAAATTTTCATAAAATCATAACTCTCTGACTTTATGTAGAAAG60 GATACCACACTGGAATTAACGTGTAGCTTTTTCTTGATGTAATCCAACCAATGGGAGCAC120 AATTCTGGTACATAGGCTGTCTAGAATTTGAAAGAAATTAAAGAATTCATTTTGTTTTGC180 TGATAAATTTTTAAGAAATCACGTGGCTTTATGTTATTATTATTACAAGATGACTGATCA240 CTATTATGTCTTCTTTCACTTCTCAATTTCCCTCAGAACACTACACCCAGACTACAGGCT300 CTGGAGGGTGGGGACCATGTCTGGGTTGTTTACTGATGTATTTCATAATTTGGCACATAG360 AGACCAATAATACTCCTTTAAATGAAGAAATTAATAATTACCATTGCGTGATATTGTGAT420 TACATCATTTCCTCCCAATTTCCAAACTCCTAATAGAATAGAGAATAGATCAATTGTAGC480 AATTCGTTTCGAAGCAAAGACAACGCATGGTGGCGCTGCAGGCTAAGGCTTCAAAAAAAG540 GAAAAGGAAAAAGCCCATGAAATGCTACTAGCTACTTCAGACCTCTTTCAGCCTAAGAGG600 AAAGCCTGTTAGCAGAGCACGGACCAGTGTCTCCGGAGAATGCTATTCTCCTACATTTCC660 GAACAGGTTATCAACGCACAGATCGATCACTGCCTCTGTCCCATCGCTCCCTGAAGTAGC720 
TCTGACTCCGGTTCCTTGAAAGGGGCGTGTACAGAAGTAAAGATGGAGCCTGCA774 MetGluProAla 1 GGGGAGCGCTTTCCCGAACAAAGGCAAGTCCTGATTCTCCTTCTTTTA822 GlyGluArgPheProGluGlnArgGlnValLeuIleLeuLeuLeuLeu 5101520 CTGGAAGTGACTCTGGCAGGCTGGGAACCCCGTCGCTATTCTGTGATG870 LeuGluValThrLeuAlaGlyTrpGluProArgArgTyrSerValMet 253035 GAGGAAACAGAGAGAGGTTCTTTTGTAGCCAACCTGGCCAATGACCTA918 GluGluThrGluArgGlySerPheValAlaAsnLeuAlaAsnAspLeu 404550 GGGCTGGGAGTGGGGGAGCTAGCCGAGCGGGGAGCCCGGGTAGTTTCT966 GlyLeuGlyValGlyGluLeuAlaGluArgGlyAlaArgValValSer 556065 GAGGATAACGAACAAGGCTTGCAGCTTGATCTGCAGACCGGGCAGTTG1014 GluAspAsnGluGlnGlyLeuGlnLeuAspLeuGlnThrGlyGlnLeu 707580 ATATTAAATGAGAAGCTGGACCGGGAGAAGCTGTGTGGCCCTACTGAG1062 IleLeuAsnGluLysLeuAspArgGluLysLeuCysGlyProThrGlu 859095100 CCCTGTATAATGCATTTCCAAGTGTTACTGAAAAAACCTTTGGAAGTA1110 ProCysIleMetHisPheGlnValLeuLeuLysLysProLeuGluVal 105110115 TTTCGAGCTGAACTACTAGTGACAGACATAAACGATCATTCTCCTGAG1158 PheArgAlaGluLeuLeuValThrAspIleAsnAspHisSerProGlu 120125130 TTTCCTGAAAGAGAAATGACCCTGAAAATCCCAGAAACTAGCTCCCTT1206 PheProGluArgGluMetThrLeuLysIleProGluThrSerSerLeu 135140145 GGGACTGTGTTTCCTCTGAAAAAAGCTCGGGACTTGGACGTGGGCAGC1254 GlyThrValPheProLeuLysLysAlaArgAspLeuAspValGlySer 150155160 AATAATGTTCAAAACTACAATATTTCTCCCAATTCTCATTTCCATGTT1302 AsnAsnValGlnAsnTyrAsnIleSerProAsnSerHisPheHisVal 165170175180 TCCACTCGCACCCGAGGGGATGGCAGGAAATACCCAGAGCTGGTGCTG1350 SerThrArgThrArgGlyAspGlyArgLysTyrProGluLeuValLeu 185190195 GACACAGAACTGGATCGCGAGGAGCAGGCCGAGCTCAGATTAACCTTG1398 AspThrGluLeuAspArgGluGluGlnAlaGluLeuArgLeuThrLeu 200205210 ACAGCGGTGGACGGTGGCTCTCCACCCCGATCTGGCACCGTCCAGATC1446 ThrAlaValAspGlyGlySerProProArgSerGlyThrValGlnIle 215220225 CTCATCTTGGTCTTGGACGCCAATGACAATGCCCCGGAGTTTGTGCAG1494 LeuIleLeuValLeuAspAlaAsnAspAsnAlaProGluPheValGln 230235240 GCGCTCTACGAGGTGCAGGTCCCAGAGAACAGCCCAGTAGGCTCCCTA1542 AlaLeuTyrGluValGlnValProGluAsnSerProValGlySerLeu 245250255260 GTTGTCAAGGTCTCTGCTAGGGATTTAGACACTGGGACAAATGGAGAG1590 ValValLysValSerAlaArgAspLeuAspThrGlyThrAsnGlyGlu 265270275 
ATATCATACTCCCTTTATTACAGCTCTCAGGAGATAGACAAACCTTTT1638 IleSerTyrSerLeuTyrTyrSerSerGlnGluIleAspLysProPhe 280285290 GAGCTAAGCAGCCTTTCAGGAGAAATTCGACTAATTAAAAAACTAGAT1686 GluLeuSerSerLeuSerGlyGluIleArgLeuIleLysLysLeuAsp 295300305 TTTGAGACAATGTCTTCATATGATCTAGATATAGAGGCATCTGATGGC1734 PheGluThrMetSerSerTyrAspLeuAspIleGluAlaSerAspGly 310315320 GGGGGACTTTCTGGAAAATGCTCTGTCTCTGTTAAGGTGCTGGATGTT1782 GlyGlyLeuSerGlyLysCysSerValSerValLysValLeuAspVal 325330335340 AACGATAACTTCCCGGAACTAAGTATTTCATCACTTACCAGCCCTATT1830 AsnAspAsnPheProGluLeuSerIleSerSerLeuThrSerProIle 345350355 CCCGAGAATTCTCCAGAGACAGAAGTGGCCCTGTTTAGGATTAGAGAC1878 ProGluAsnSerProGluThrGluValAlaLeuPheArgIleArgAsp 360365370 CGAGACTCTGGAGAAAATGGAAAAATGATTTGCTCAATTCAGGATGAT1926 ArgAspSerGlyGluAsnGlyLysMetIleCysSerIleGlnAspAsp 375380385 GTTCCTTTTAAGCTAAAACCTTCTGTTGAGAATTTCTACAGGCTGGTA1974 ValProPheLysLeuLysProSerValGluAsnPheTyrArgLeuVal 390395400 ACAGAAGGGGCGCTGGACAGAGAGACCAGAGCCGAGTACAACATCACC2022 ThrGluGlyAlaLeuAspArgGluThrArgAlaGluTyrAsnIleThr 405410415420 ATCACCATCACAGACTTGGGGACTCCAAGGCTGAAAACCGAGCAGAGC2070 IleThrIleThrAspLeuGlyThrProArgLeuLysThrGluGlnSer 425430435 ATAACCGTGCTGGTGTCGGACGTCAATGACAACGCCCCCGCCTTCACC2118 IleThrValLeuValSerAspValAsnAspAsnAlaProAlaPheThr 440445450 CAAACCTCCTACACCCTGTTCGTCCGCGAGAACAACAGCCCCGCCCTG2166 GlnThrSerTyrThrLeuPheValArgGluAsnAsnSerProAlaLeu 455460465 CACATCGGCAGTGTCAGCGCCACAGACAGAGACTCGGGCACCAACGCC2214 HisIleGlySerValSerAlaThrAspArgAspSerGlyThrAsnAla 470475480 CAGGTCACCTACTCGCTGCTGCCGCCCCAGGACCCGCACCTGCCCCTA2262 GlnValThrTyrSerLeuLeuProProGlnAspProHisLeuProLeu 485490495500 ACCTCCCTGGTCTCCATTAACACGGACAACGGCCACCTGTTCGCTCTC2310 ThrSerLeuValSerIleAsnThrAspAsnGlyHisLeuPheAlaLeu 505510515 CAGTCGCTGGACTACGAGGCCCTGCAGGCTTTCGAGTTCCGCGTGGGC2358 GlnSerLeuAspTyrGluAlaLeuGlnAlaPheGluPheArgValGly 520525530 GCCACAGACCGCGGCTTCCCGGCGCTGAGCAGCGAGGCGCTGGTGCGA2406 AlaThrAspArgGlyPheProAlaLeuSerSerGluAlaLeuValArg 535540545 GTGCTGGTGCTGGACGCCAACGACAACTCGCCCTTCGTGCTGTACCCG2454 
ValLeuValLeuAspAlaAsnAspAsnSerProPheValLeuTyrPro 550555560 CTGCAGAACGGCTCCGCGCCCTGCACCGAGCTGGTGCCCCGGGCGGCC2502 LeuGlnAsnGlySerAlaProCysThrGluLeuValProArgAlaAla 565570575580 GAGCCGGGCTACCTGGTGACCAAGGTGGTGGCGGTGGACGGCGACTCG2550 GluProGlyTyrLeuValThrLysValValAlaValAspGlyAspSer 585590595 GGCCAGAACGCCTGGCTGTCGTACCAGCTGCTCAAGGCCACGGAGCCC2598 GlyGlnAsnAlaTrpLeuSerTyrGlnLeuLeuLysAlaThrGluPro 600605610 GGGCTGTTCGGCGTGTGGGCGCACAATGGCGAGGTGCGCACCGCCAGG2646 GlyLeuPheGlyValTrpAlaHisAsnGlyGluValArgThrAlaArg 615620625 CTGCTGAGCGAGCGCGACGTGGCCAAGCACAGGCTAGTGGTGCTGGTC2694 LeuLeuSerGluArgAspValAlaLysHisArgLeuValValLeuVal 630635640 AAGGACAATGGCGAGCCTCCGCGCTCGGCCACAGCCACGCTGCAAGTG2742 LysAspAsnGlyGluProProArgSerAlaThrAlaThrLeuGlnVal 645650655660 CTCCTGGTGGACGGCTTCTCTCAGCCCTACCTGCCGCTCCCAGAGGCG2790 LeuLeuValAspGlyPheSerGlnProTyrLeuProLeuProGluAla 665670675 GCCCCGGCCCAAGCCCAGGCCGACTCGCTTACCGTCTACCTGGTGGTG2838 AlaProAlaGlnAlaGlnAlaAspSerLeuThrValTyrLeuValVal 680685690 GCATTGGCCTCGGTGTCTTCGCTCTTCCTCTTCTCGGTGTTCCTGTTC2886 AlaLeuAlaSerValSerSerLeuPheLeuPheSerValPheLeuPhe 695700705 GTGGCAGTGCGGCTGTGCAGGAGGAGCAGGGCGGCCTCAGTGGGTCGC2934 ValAlaValArgLeuCysArgArgSerArgAlaAlaSerValGlyArg 710715720 TGCTCGGTGCCCGAGGGCCCCTTTCCAGGGCATCTGGTGGACGTGAGC2982 CysSerValProGluGlyProPheProGlyHisLeuValAspValSer 725730735740 GGCACCGGGACCCTTTCCCAGAGCTACCAGTACGAGGTGTGTCTGACG3030 GlyThrGlyThrLeuSerGlnSerTyrGlnTyrGluValCysLeuThr 745750755 GGAGGCTCTGAAAGTAATGATTTCAAGTTCTTGAAGCCTATATTCCCA3078 GlyGlySerGluSerAsnAspPheLysPheLeuLysProIlePhePro 760765770 AATATTGTAAGCCAGGACTCTAGGAGGAAATCAGAATTTCTAGAA3123 AsnIleValSerGlnAspSerArgArgLysSerGluPheLeuGlu 775780785 TAATGTAGGTATCTGTAGCTTTCCGACCGTCTGTTAATTTTGTCTTCCTCACTTTTCACC3183 TTAGTTTTTTTTAACCCTTTAGTAATCTTGAATTCTACTTTTTTTTAAATTTCTACTGTT3243 GTCTTTAGTAATGTTACTCATTTCCTTTGTCTGATTGTTAGTTTTCAAATTATTGTATTA3303 TTATAAATATTTTATATCAGGAAAGTTCATATTTCTGAATAAATTAATAG3353 (2) INFORMATION FOR SEQ ID NO:110: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 787 amino acids (B) TYPE: amino acid (D) 
TOPOLOGY: linear (ii) MOLECULE TYPE: protein (xi) SEQUENCE DESCRIPTION: SEQ ID NO:110: MetGluProAlaGlyGluArgPheProGluGlnArgGlnValLeuIle 151015 LeuLeuLeuLeuLeuGluValThrLeuAlaGlyTrpGluProArgArg 202530 TyrSerValMetGluGluThrGluArgGlySerPheValAlaAsnLeu 354045 AlaAsnAspLeuGlyLeuGlyValGlyGluLeuAlaGluArgGlyAla 505560 ArgValValSerGluAspAsnGluGlnGlyLeuGlnLeuAspLeuGln 65707580 ThrGlyGlnLeuIleLeuAsnGluLysLeuAspArgGluLysLeuCys 859095 GlyProThrGluProCysIleMetHisPheGlnValLeuLeuLysLys 100105110 ProLeuGluValPheArgAlaGluLeuLeuValThrAspIleAsnAsp 115120125 HisSerProGluPheProGluArgGluMetThrLeuLysIleProGlu 130135140 ThrSerSerLeuGlyThrValPheProLeuLysLysAlaArgAspLeu 145150155160 AspValGlySerAsnAsnValGlnAsnTyrAsnIleSerProAsnSer 165170175 HisPheHisValSerThrArgThrArgGlyAspGlyArgLysTyrPro 180185190 GluLeuValLeuAspThrGluLeuAspArgGluGluGlnAlaGluLeu 195200205 ArgLeuThrLeuThrAlaValAspGlyGlySerProProArgSerGly 210215220 ThrValGlnIleLeuIleLeuValLeuAspAlaAsnAspAsnAlaPro 225230235240 GluPheValGlnAlaLeuTyrGluValGlnValProGluAsnSerPro 245250255 ValGlySerLeuValValLysValSerAlaArgAspLeuAspThrGly 260265270 ThrAsnGlyGluIleSerTyrSerLeuTyrTyrSerSerGlnGluIle 275280285 AspLysProPheGluLeuSerSerLeuSerGlyGluIleArgLeuIle 290295300 LysLysLeuAspPheGluThrMetSerSerTyrAspLeuAspIleGlu 305310315320 AlaSerAspGlyGlyGlyLeuSerGlyLysCysSerValSerValLys 325330335 ValLeuAspValAsnAspAsnPheProGluLeuSerIleSerSerLeu 340345350 ThrSerProIleProGluAsnSerProGluThrGluValAlaLeuPhe 355360365 ArgIleArgAspArgAspSerGlyGluAsnGlyLysMetIleCysSer 370375380 IleGlnAspAspValProPheLysLeuLysProSerValGluAsnPhe 385390395400 TyrArgLeuValThrGluGlyAlaLeuAspArgGluThrArgAlaGlu 405410415 TyrAsnIleThrIleThrIleThrAspLeuGlyThrProArgLeuLys 420425430 ThrGluGlnSerIleThrValLeuValSerAspValAsnAspAsnAla 435440445 ProAlaPheThrGlnThrSerTyrThrLeuPheValArgGluAsnAsn 450455460 SerProAlaLeuHisIleGlySerValSerAlaThrAspArgAspSer 465470475480 GlyThrAsnAlaGlnValThrTyrSerLeuLeuProProGlnAspPro 485490495 HisLeuProLeuThrSerLeuValSerIleAsnThrAspAsnGlyHis 500505510 
LeuPheAlaLeuGlnSerLeuAspTyrGluAlaLeuGlnAlaPheGlu 515520525 PheArgValGlyAlaThrAspArgGlyPheProAlaLeuSerSerGlu 530535540 AlaLeuValArgValLeuValLeuAspAlaAsnAspAsnSerProPhe 545550555560 ValLeuTyrProLeuGlnAsnGlySerAlaProCysThrGluLeuVal 565570575 ProArgAlaAlaGluProGlyTyrLeuValThrLysValValAlaVal 580585590 AspGlyAspSerGlyGlnAsnAlaTrpLeuSerTyrGlnLeuLeuLys 595600605 AlaThrGluProGlyLeuPheGlyValTrpAlaHisAsnGlyGluVal 610615620 ArgThrAlaArgLeuLeuSerGluArgAspValAlaLysHisArgLeu 625630635640 ValValLeuValLysAspAsnGlyGluProProArgSerAlaThrAla 645650655 ThrLeuGlnValLeuLeuValAspGlyPheSerGlnProTyrLeuPro 660665670 LeuProGluAlaAlaProAlaGlnAlaGlnAlaAspSerLeuThrVal 675680685 TyrLeuValValAlaLeuAlaSerValSerSerLeuPheLeuPheSer 690695700 ValPheLeuPheValAlaValArgLeuCysArgArgSerArgAlaAla 705710715720 SerValGlyArgCysSerValProGluGlyProPheProGlyHisLeu 725730735 ValAspValSerGlyThrGlyThrLeuSerGlnSerTyrGlnTyrGlu 740745750 ValCysLeuThrGlyGlySerGluSerAsnAspPheLysPheLeuLys 755760765 ProIlePheProAsnIleValSerGlnAspSerArgArgLysSerGlu 770775780 PheLeuGlu 785 (2) INFORMATION FOR SEQ ID NO:111: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 3033 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE: cDNA (ix) FEATURE: (A) NAME/KEY: CDS (B) LOCATION: 138..2528 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:111: GTGATTGGACGTGTTTTTGTGACTATTTGGGAAGAAGACACCTTCCTAATCAGATTTACT60 CCAATATCTTCCCGGACCCTCATGAGTGGATTGCAATTGACTTGAAGAAGCAGCACCCTC120 AGGACTGAATCTGAACAATGGAGACAGCACTAGCAAAAATACCACAGCAA170 MetGluThrAlaLeuAlaLysIleProGlnGln 1510 AGGCAAGTCTTTTTTCTTACTATATTGTCGTTATTGTGGAAGTCTAGC218 ArgGlnValPhePheLeuThrIleLeuSerLeuLeuTrpLysSerSer 152025 TCTGAGGCCATTAGATATTCCATGCCAGAAGAAACAGAGAGTGGCTAT266 SerGluAlaIleArgTyrSerMetProGluGluThrGluSerGlyTyr 303540 ATGGTGGCTAACCTGGCGAAAGATCTGGGGATCAGGGTTGGAGAACTG314 MetValAlaAsnLeuAlaLysAspLeuGlyIleArgValGlyGluLeu 455055 TCCTCTAGAGGAGCTCAAATCCATTACAAAGGAAACAAAGAACTTTTG362 SerSerArgGlyAlaGlnIleHisTyrLysGlyAsnLysGluLeuLeu 60657075 
CAGCTGGATGCAGAGACTGGGAATTTGTTCTTAAAGGAAAAACTAGAC410 GlnLeuAspAlaGluThrGlyAsnLeuPheLeuLysGluLysLeuAsp 808590 AGAGAACTGCTGTGTGGAGAGACAGAACCCTGTGTGCTGAACTTCCAG458 ArgGluLeuLeuCysGlyGluThrGluProCysValLeuAsnPheGln 95100105 ATCATACTGGAAAACCCTATGCAGTTCTTCCAAACTGAACTGCAGCTC506 IleIleLeuGluAsnProMetGlnPhePheGlnThrGluLeuGlnLeu 110115120 ACAGATATAAACGACCATTCTCCAGAGTTCCCCAACAAGAAAATGCTT554 ThrAspIleAsnAspHisSerProGluPheProAsnLysLysMetLeu 125130135 CTAACAATTCCTGAGAGTGCCCATCCAGGGACTGTGTTTCCTCTGAAG602 LeuThrIleProGluSerAlaHisProGlyThrValPheProLeuLys 140145150155 GCAGCTCGGGACTCTGACATAGGGAGCAACGCTGTTCAGAACTACACA650 AlaAlaArgAspSerAspIleGlySerAsnAlaValGlnAsnTyrThr 160165170 GTCAATCCCAACCTCCATTTCCACGTCGTTACTCACAGTCGCACAGAT698 ValAsnProAsnLeuHisPheHisValValThrHisSerArgThrAsp 175180185 GGCAGGAAATACCCAGAGCTGGTGCTGGACAGAGCCCTGGATAGGGAG746 GlyArgLysTyrProGluLeuValLeuAspArgAlaLeuAspArgGlu 190195200 GAGCAGCCTGAGCTCACTTTAATCCTCACTGCTCTGGATGGTGGAGCT794 GluGlnProGluLeuThrLeuIleLeuThrAlaLeuAspGlyGlyAla 205210215 CCTTCCAGGTCAGGAACCACCACAGTTCACATAGAAGTTGTGGACATC842 ProSerArgSerGlyThrThrThrValHisIleGluValValAspIle 220225230235 AATGATAACTCCCCCCAGTTTGTACAGTCACTCTATAAGGTGCAAGTT890 AsnAspAsnSerProGlnPheValGlnSerLeuTyrLysValGlnVal 240245250 CCTGAGAATAATCCCCTCAATGCCTTTGTTGTCACGGTCTCTGCCACG938 ProGluAsnAsnProLeuAsnAlaPheValValThrValSerAlaThr 255260265 GATTTAGATGCTGGGGTATATGGCAATGTGACCTATTCTCTGTTTCAA986 AspLeuAspAlaGlyValTyrGlyAsnValThrTyrSerLeuPheGln 270275280 GGGTATGGGGTATTTCAACCATTTGTAATAGACGAAATCACTGGAGAA1034 GlyTyrGlyValPheGlnProPheValIleAspGluIleThrGlyGlu 285290295 ATCCATCTGAGCAAAGAGCTGGATTTTGAGGAAATTAGCAATCATAAC1082 IleHisLeuSerLysGluLeuAspPheGluGluIleSerAsnHisAsn 300305310315 ATAGAAATCGCAGCCACAGATGGAGGAGGCCTTTCAGGAAAATGCACT1130 IleGluIleAlaAlaThrAspGlyGlyGlyLeuSerGlyLysCysThr 320325330 GTGGCTGTACAGGTGTTGGATGTGAATGACAACGCCCCAGAGTTGACA1178 ValAlaValGlnValLeuAspValAsnAspAsnAlaProGluLeuThr 335340345 ATTAGGAAGCTCACAGTCCTGGTCCCAGAAAATTCCGCAGAGACTGTA1226 IleArgLysLeuThrValLeuValProGluAsnSerAlaGluThrVal 
350355360 GTTGCTGTTTTTAGTGTTTCTGATTCTGATTCGGGGGACAATGGAAGG1274 ValAlaValPheSerValSerAspSerAspSerGlyAspAsnGlyArg 365370375 ATGGTGTGTTCTATTCCGAACAATATCCCATTTCTCCTGAAACCCACA1322 MetValCysSerIleProAsnAsnIleProPheLeuLeuLysProThr 380385390395 TTTGAGAATTATTACACGTTAGTGACTGAGGGGCCACTTGATAGAGAG1370 PheGluAsnTyrTyrThrLeuValThrGluGlyProLeuAspArgGlu 400405410 AACAGAGCTGAGTACAACATCACCATCACGGTCTCAGATCTGGGCACA1418 AsnArgAlaGluTyrAsnIleThrIleThrValSerAspLeuGlyThr 415420425 CCCAGGCTCACAACCCAGCACACCATAACAGTGCAAGTGTCCGACATC1466 ProArgLeuThrThrGlnHisThrIleThrValGlnValSerAspIle 430435440 AACGACAACGCCCCTGCCTTCACCCAAACCTCCTACACCATGTTTGTC1514 AsnAspAsnAlaProAlaPheThrGlnThrSerTyrThrMetPheVal 445450455 CACGAGAACAACAGCCCCGCCCTGCACATAGGCACCATCAGTGCCACA1562 HisGluAsnAsnSerProAlaLeuHisIleGlyThrIleSerAlaThr 460465470475 GACTCAGACTCAGGCTCCAATGCCCACATCACCTACTCGCTGCTGCCG1610 AspSerAspSerGlySerAsnAlaHisIleThrTyrSerLeuLeuPro 480485490 CCTGATGACCCGCAGCTGGCCCTCGACTCACTCATCTCCATCAATGTT1658 ProAspAspProGlnLeuAlaLeuAspSerLeuIleSerIleAsnVal 495500505 GACAATGGGCAGCTGTTCGCGCTCAGAGCTCTAGACTATGAGGCACTG1706 AspAsnGlyGlnLeuPheAlaLeuArgAlaLeuAspTyrGluAlaLeu 510515520 CAGTCCTTCGAGTTCTACGTGGGCGCTACAGATGGAGGCTCACCCGCG1754 GlnSerPheGluPheTyrValGlyAlaThrAspGlyGlySerProAla 525530535 CTCAGCAGCCAGACTCTGGTGCGGATGGTGGTGCTGGATGACAATGAC1802 LeuSerSerGlnThrLeuValArgMetValValLeuAspAspAsnAsp 540545550555 AATGCCCCCTTCGTGCTCTACCCACTGCAGAATGCCTCAGCACCCTGT1850 AsnAlaProPheValLeuTyrProLeuGlnAsnAlaSerAlaProCys 560565570 ACTGAGCTACTGCCTAGGGCAGCAGAGCCCGGCTACCTGATCACCAAA1898 ThrGluLeuLeuProArgAlaAlaGluProGlyTyrLeuIleThrLys 575580585 GTGGTGGCTGTGGATCGCGACTCTGGACAGAATGCTTGGCTGTCGTTC1946 ValValAlaValAspArgAspSerGlyGlnAsnAlaTrpLeuSerPhe 590595600 CAGCTACTTAAAGCTACAGAGCCAGGGCTGTTCAGTGTATGGGCACAC1994 GlnLeuLeuLysAlaThrGluProGlyLeuPheSerValTrpAlaHis 605610615 AATGGTGAAGTGCGCACCACTAGGCTGCTGAGTGAGCGAGATGCTCAG2042 AsnGlyGluValArgThrThrArgLeuLeuSerGluArgAspAlaGln 620625630635 AAGCACAAGCTACTGCTGCTGGTCAAGGACAATGGCGATCCTCTGCGC2090 
LysHisLysLeuLeuLeuLeuValLysAspAsnGlyAspProLeuArg 640645650 TCTGCCAATGTCACTCTTCACGTGCTAGTGGTGGATGGCTTCTCGCAG2138 SerAlaAsnValThrLeuHisValLeuValValAspGlyPheSerGln 655660665 CCTTACCTACCATTGGCTGAGGTGGCACAGGATTCCATGCAAGATAAT2186 ProTyrLeuProLeuAlaGluValAlaGlnAspSerMetGlnAspAsn 670675680 TACGACGTTCTCACACTGTACCTAGTCATTGCCTTGGCATCTGTATCT2234 TyrAspValLeuThrLeuTyrLeuValIleAlaLeuAlaSerValSer 685690695 TCTCTCTTCCTCTTGTCTGTAGTGCTGTTTGTGGGGGTGAGGCTGTGC2282 SerLeuPheLeuLeuSerValValLeuPheValGlyValArgLeuCys 700705710715 AGGAGGGCCAGGGAGGCCTCCTTGGGTGACTACTCTGTGCCTGAGGGA2330 ArgArgAlaArgGluAlaSerLeuGlyAspTyrSerValProGluGly 720725730 CACTTTCCTAGCCACTTGGTGGATGTCAGCGGTGCCGGGACCCTGTCC2378 HisPheProSerHisLeuValAspValSerGlyAlaGlyThrLeuSer 735740745 CAGAGTTATCAATATGAGGTGTGTCTTAATGGAGGTACTAGAACAAAT2426 GlnSerTyrGlnTyrGluValCysLeuAsnGlyGlyThrArgThrAsn 750755760 GAGTTTAACTTTCTTAAACCATTGTTTCCTATCCTTCCGACCCAGGCT2474 GluPheAsnPheLeuLysProLeuPheProIleLeuProThrGlnAla 765770775 GCTGCTGCTGAAGAAAGAGAAAACGCTGTTGTGCACAATAGCGTTGGA2522 AlaAlaAlaGluGluArgGluAsnAlaValValHisAsnSerValGly 780785790795 TTCTATTAGAGCACTGATTTTGAAGTGGTGGTTACCTCATTTTTCCTTAACTATCC2578 PheTyr CTGATGTAGAATGGTGTAGTGCCGTGAATCAACTCCTGAGATATATGTTCATTTTATCCT2638 TTGTTTTGAATCAAACTATTCAGATGTGATCCTACTCTAGAGAATTTGGTTCTACTCCAT2698 TGTGTTTGTTTAGATTTCTACGCCATACCAGTGCATGCTGGGTTGTTTTTTTTTTTACAA2758 TTATTATAACTTTGCTTTGGAGGGGAACTCATATTCGCTGTAACGAATTGGAACCACTTT2818 CATTGTTAGAGATGCCTTGCTTTGTTGTGTTATTTCAGACAGGGTCTTAAATTGTAGCCC2878 TGGGTGACCTGAAATGACTATGTACAGACTGACTTTGAATTTGTGGCAGTCCATCTGCCT2938 CTGTTGTCCTATGTTGGGATTGTGAGCATGCATGAGTAGGCTCAGCTGTGGTGAGCGACC2998 TTAATAAAAATCAAATACTAAAAAAAAAAAAAAAA3033 (2) INFORMATION FOR SEQ ID NO:112: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 797 amino acids (B) TYPE: amino acid (D) TOPOLOGY: linear (ii) MOLECULE TYPE: protein (xi) SEQUENCE DESCRIPTION: SEQ ID NO:112: MetGluThrAlaLeuAlaLysIleProGlnGlnArgGlnValPhePhe 151015 LeuThrIleLeuSerLeuLeuTrpLysSerSerSerGluAlaIleArg 202530 
TyrSerMetProGluGluThrGluSerGlyTyrMetValAlaAsnLeu 354045 AlaLysAspLeuGlyIleArgValGlyGluLeuSerSerArgGlyAla 505560 GlnIleHisTyrLysGlyAsnLysGluLeuLeuGlnLeuAspAlaGlu 65707580 ThrGlyAsnLeuPheLeuLysGluLysLeuAspArgGluLeuLeuCys 859095 GlyGluThrGluProCysValLeuAsnPheGlnIleIleLeuGluAsn 100105110 ProMetGlnPhePheGlnThrGluLeuGlnLeuThrAspIleAsnAsp 115120125 HisSerProGluPheProAsnLysLysMetLeuLeuThrIleProGlu 130135140 SerAlaHisProGlyThrValPheProLeuLysAlaAlaArgAspSer 145150155160 AspIleGlySerAsnAlaValGlnAsnTyrThrValAsnProAsnLeu 165170175 HisPheHisValValThrHisSerArgThrAspGlyArgLysTyrPro 180185190 GluLeuValLeuAspArgAlaLeuAspArgGluGluGlnProGluLeu 195200205 ThrLeuIleLeuThrAlaLeuAspGlyGlyAlaProSerArgSerGly 210215220 ThrThrThrValHisIleGluValValAspIleAsnAspAsnSerPro 225230235240 GlnPheValGlnSerLeuTyrLysValGlnValProGluAsnAsnPro 245250255 LeuAsnAlaPheValValThrValSerAlaThrAspLeuAspAlaGly 260265270 ValTyrGlyAsnValThrTyrSerLeuPheGlnGlyTyrGlyValPhe 275280285 GlnProPheValIleAspGluIleThrGlyGluIleHisLeuSerLys 290295300 GluLeuAspPheGluGluIleSerAsnHisAsnIleGluIleAlaAla 305310315320 ThrAspGlyGlyGlyLeuSerGlyLysCysThrValAlaValGlnVal 325330335 LeuAspValAsnAspAsnAlaProGluLeuThrIleArgLysLeuThr 340345350 ValLeuValProGluAsnSerAlaGluThrValValAlaValPheSer 355360365 ValSerAspSerAspSerGlyAspAsnGlyArgMetValCysSerIle 370375380 ProAsnAsnIleProPheLeuLeuLysProThrPheGluAsnTyrTyr 385390395400 ThrLeuValThrGluGlyProLeuAspArgGluAsnArgAlaGluTyr 405410415 AsnIleThrIleThrValSerAspLeuGlyThrProArgLeuThrThr 420425430 GlnHisThrIleThrValGlnValSerAspIleAsnAspAsnAlaPro 435440445 AlaPheThrGlnThrSerTyrThrMetPheValHisGluAsnAsnSer 450455460 ProAlaLeuHisIleGlyThrIleSerAlaThrAspSerAspSerGly 465470475480 SerAsnAlaHisIleThrTyrSerLeuLeuProProAspAspProGln 485490495 LeuAlaLeuAspSerLeuIleSerIleAsnValAspAsnGlyGlnLeu 500505510 PheAlaLeuArgAlaLeuAspTyrGluAlaLeuGlnSerPheGluPhe 515520525 TyrValGlyAlaThrAspGlyGlySerProAlaLeuSerSerGlnThr 530535540 LeuValArgMetValValLeuAspAspAsnAspAsnAlaProPheVal 545550555560 
LeuTyrProLeuGlnAsnAlaSerAlaProCysThrGluLeuLeuPro 565570575 ArgAlaAlaGluProGlyTyrLeuIleThrLysValValAlaValAsp 580585590 ArgAspSerGlyGlnAsnAlaTrpLeuSerPheGlnLeuLeuLysAla 595600605 ThrGluProGlyLeuPheSerValTrpAlaHisAsnGlyGluValArg 610615620 ThrThrArgLeuLeuSerGluArgAspAlaGlnLysHisLysLeuLeu 625630635640 LeuLeuValLysAspAsnGlyAspProLeuArgSerAlaAsnValThr 645650655 LeuHisValLeuValValAspGlyPheSerGlnProTyrLeuProLeu 660665670 AlaGluValAlaGlnAspSerMetGlnAspAsnTyrAspValLeuThr 675680685 LeuTyrLeuValIleAlaLeuAlaSerValSerSerLeuPheLeuLeu 690695700 SerValValLeuPheValGlyValArgLeuCysArgArgAlaArgGlu 705710715720 AlaSerLeuGlyAspTyrSerValProGluGlyHisPheProSerHis 725730735 LeuValAspValSerGlyAlaGlyThrLeuSerGlnSerTyrGlnTyr 740745750 GluValCysLeuAsnGlyGlyThrArgThrAsnGluPheAsnPheLeu 755760765 LysProLeuPheProIleLeuProThrGlnAlaAlaAlaAlaGluGlu 770775780 ArgGluAsnAlaValValHisAsnSerValGlyPheTyr 785790795 (2) INFORMATION FOR SEQ ID NO:113: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 2347 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE: cDNA (xi) SEQUENCE DESCRIPTION: SEQ ID NO:113: AAAACACGGGGGAAATGACAGTAGCAAAGAATCTGGACTATGAAGAATGCTCATTGTATG60 AAATGGAAATACAGGCTGAAGATGTGGGGGCGCTTCTGGGGAGGAGCAAAGTGGTAATTA120 TGGTAGAAGATGTAAATGACAATCGGCCAGAAGTGACCATTACATCCTTGTTTAACCCGG180 TATTGGAAAATTCTCTTCCCGGGACAGTAATTGCCTTCTTGAATGTGCATGACCGAGACT240 CTGGAAAGAACGGCCAAGTTGTCTGTTACACGCATGATAACTTACCTTTTAAATTAGAAA300 AGTCAATAGATAATTATTATAGATTGGTGACATGGAAATATTTGGACCGAGAAAAAGTCT360 CCATCTACAATATCACAGTGATAGCCTCAGATCTAGGAGCCCACTCTGTCACTGAAACTT420 ACATTGCCCTGATTGTGGCAGACACTAATGACAACCCTCCTCGTTTTCCTCACACCTCCT480 ACACAGCCTATATTCCAGAGAACAACCTGAGGGGCGCCTCCATCTTCTCACTGACTGCAC540 ATGATCCTGACAGTCAGGAAAATGCACAGGTCACTTACTCTGTGTCTGAGGACACCATAC600 AGGGAGTGCCTTTGTCCTCTTATATCTCCATCAACTCAGATACTGGTGTCCTGTATGCAC660 TGCACTCTTTTGACTTCGAGAAGATACAAGACTTGCAGCTACTGGTTGTTGCCACTGACA720 GTGGAAGCCCACCTCTCAGCAGCAATGTGTCATTGAGCTTGTTTGTGTTGGACCAGAACG780 
ACAACGCACCTGAGATTCTATATCCTAGCTTCCCCACAGATGGCTCCACTGGTGTGGAAC840 TAGCACCCCGCTCTGCAGAGCCTGGATACCTAGTGACCAAAGTGGTGGCAGTGGACAAAG900 ACTCAGGACAGAATGCTTGGCTGTCCTACCGTCTGCTGAAGGCCAGCGAACCTGGGCTCT960 TCTCTGTAGGACTTCACACGGGTGAGGTGCGTACAGCGAGGGCCCTGCTGGACAGAGATG1020 CTCTCAAACAGAATCTGGTGATGGCCGTGCAGGACCATGGCCAACCCCCTCTCTCGGCCA1080 CTGTAACTCTCACTGTGGCAGTGGCTAACAGCATCCCTGAGGTGTTGGCTGACTTGAGCA1140 GCATTAGGACCCCTGGGGTACCAGAGGATTCTGATATCACGCTCCACCTGGTGGTGGCAG1200 TGGCTGTGGTCTCCTGTGTCTTCCTTGTCTTTGTCATTGTCCTCCTAGCTCTCAGGCTTC1260 AGCGCTGGCAGAAGTCTCGCCAGCTCCAGGGCTCCAAAGGTGGATTGGCTCCTGCACCTC1320 CATCACATTTTGTGGGCATCGACGGGGTACAGGCTTTTCTACAAACCTATTCTCATGAAG1380 TCTCGCTCACTTCAGGCTCCCAGACAAGCCACATTATCTTTCCTCAGCCCAACTATGCAG1440 ACATGCTCATTAACCAAGAAGGCTGTGAGAAAAATGATTCCTTATTAACATCCATAGATT1500 TTCATGAGAGTAACCGTGAAGATGCTTGCGCCCCGCAAGCCCCGCCCAACACTGACTGGC1560 GTTTCTCTCAAGCCCAGAGACCCGGCACGAGCGGATCCCAAAATGGGGATGAAACCGGCA1620 CCTGGCCCAACAACCAGTTCGATACAGAGATGCTGCAAGCCATGATCTTGGCCTCTGCCA1680 GTGAAGCCGCTGATGGGAGCTCCACTCTGGGAGGGGGCACTGGCACTATGGGTTTGAGCG1740 CTCGATATGGACCCCAGTTTACCCTGCAGCACGTGCCTGACTACCGCCAGAACGTGTACA1800 TCCCTGGCAGCAATGCCACACTGACCAACGCAGCTGGCAAACGAGATGGCAAGGCTCCGG1860 CAGGCGGCAATGGCAACAACAACAAGTCGGGCAAGAAAGAGAAGAAGTAATATGGAGGCC1920 AGGCCTTGAGCCACAGGGCAGCCTCCCTCCCCAGCCAGTCCAGCTTGTCCTTACTTGTAC1980 CCAGGCCTCAGAATTTCAGGGCTCACCCCAGGATTCTGGTAGGAGCCACAGCCAGGCCAT2040 GCTCCCCGTTGGGAAACAGAAACAAGTGCCCAAGCCAACACCCCCTCTTTGTACCCTAGG2100 GGGGTTGAATATGCAAAGAGAGTTCTGCTGGGACCCCCTATCCAATCAGTGATTGTACCC2160 ACATAGGTAGCAGGGTTAGTGTGGATACACACACACACACACACACACACACACACACAA2220 CCCTTGTCCTCCGCAGTGCCTGCCACTTTCTGGGACTTTCTCATCCCCCTACGCCCTTCC2280 TTTATCCTCTCCCACCCAGACACAGCTGCTGGAGAATAAATTTGGGGATGCTGATGCTAA2340 AAAAAAA2347 (2) INFORMATION FOR SEQ ID NO:114: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 2972 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE: cDNA (ix) FEATURE: (A) NAME/KEY: CDS (B) LOCATION: 2..1849 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:114: 
AGAGGCTGCTCACCACCTGGTCCTCACGGCCTCGGATGGCGGCAAG46 GluAlaAlaHisHisLeuValLeuThrAlaSerAspGlyGlyLys 151015 CCGCCTCGCTCTAGCACAGTGCGCATCCACGTGACAGTGTTGGATACA94 ProProArgSerSerThrValArgIleHisValThrValLeuAspThr 202530 AATGACAATGCCCCGGTTTTTCCTCACCCGATTTACCGAGTGAAAGTC142 AsnAspAsnAlaProValPheProHisProIleTyrArgValLysVal 354045 CTTGAGAACATGCCCCCAGGCACGCGGCTGCTTACTGTAACAGCCAGC190 LeuGluAsnMetProProGlyThrArgLeuLeuThrValThrAlaSer 505560 GACCCGGATGAGGGAATCAACGGAAAAGTGGCATACAAATTCCGGAAA238 AspProAspGluGlyIleAsnGlyLysValAlaTyrLysPheArgLys 657075 ATTAATGAAAAACAAACTCCGTTATTCCAGCTTAATGAAAATACTGGG286 IleAsnGluLysGlnThrProLeuPheGlnLeuAsnGluAsnThrGly 80859095 GAAATATCAATAGCAAAAAGTCTAGATTATGAAGAATGTTCATTTTAT334 GluIleSerIleAlaLysSerLeuAspTyrGluGluCysSerPheTyr 100105110 GAAATGGAAATACAAGCCGAAGATGTGGGGGCACTTCTGGGGAGGACC382 GluMetGluIleGlnAlaGluAspValGlyAlaLeuLeuGlyArgThr 115120125 AAATTGCTCATTTCTGTGGAAGATGTAAATGACAATAGACCAGAAGTG430 LysLeuLeuIleSerValGluAspValAsnAspAsnArgProGluVal 130135140 ATCATTACGTCTTTGTTTAGCCCAGTGTTAGAAAATTCTCTTCCCGGG478 IleIleThrSerLeuPheSerProValLeuGluAsnSerLeuProGly 145150155 ACAGTAATTGCCTTCTTGAGTGTGCATGACCAAGACTCTGGAAAGAAT526 ThrValIleAlaPheLeuSerValHisAspGlnAspSerGlyLysAsn 160165170175 GGTCAAGTTGTCTGTTACACACGTGATAATTTACCTTTTAAATTAGAA574 GlyGlnValValCysTyrThrArgAspAsnLeuProPheLysLeuGlu 180185190 AAGTCAATAGGTAATTATTATAGATTAGTGACAAGGAAATATTTGGAC622 LysSerIleGlyAsnTyrTyrArgLeuValThrArgLysTyrLeuAsp 195200205 CGAGAAAATGTCTCTATCTACAATATCACAGTGATGGCCTCAGATCTA670 ArgGluAsnValSerIleTyrAsnIleThrValMetAlaSerAspLeu 210215220 GGAACACCACCTCTGTCCACTGAAACTCAAATCGCTCTGCACGTGGCA718 GlyThrProProLeuSerThrGluThrGlnIleAlaLeuHisValAla 225230235 GACATTAACGACAACCCTCCTACTTTCCCTCATGCCTCCTACTCAGCG766 AspIleAsnAspAsnProProThrPheProHisAlaSerTyrSerAla 240245250255 TATATCCTAGAGAACAACCTGAGAGGAGCCTCCATCTTTTCCTTGACT814 TyrIleLeuGluAsnAsnLeuArgGlyAlaSerIlePheSerLeuThr 260265270 GCACACGACCCCGACAGCCAGGAGAATGCCCAGGTCACTTACTCTGTG862 AlaHisAspProAspSerGlnGluAsnAlaGlnValThrTyrSerVal 275280285 
ACCGAGGACACGCTGCAGGGGGCGCCCCTGTCCTCGTATATCTCCATC910 ThrGluAspThrLeuGlnGlyAlaProLeuSerSerTyrIleSerIle 290295300 AACTCTGACACCGGTGTCCTGTATGCGCTGCAATCTTTCGACTATGAG958 AsnSerAspThrGlyValLeuTyrAlaLeuGlnSerPheAspTyrGlu 305310315 CAGATCCGAGACCTGCAGCTACTGGTAACAGCCAGCGACAGCGGGGAC1006 GlnIleArgAspLeuGlnLeuLeuValThrAlaSerAspSerGlyAsp 320325330335 CCGCCCCTCAGCAGCAACATGTCACTGAGCCTGTTCGTGCTGGACCAG1054 ProProLeuSerSerAsnMetSerLeuSerLeuPheValLeuAspGln 340345350 AATGACAACGCGCCCGAGATCCTGTACCCCGCCCTCCCCACAGACGGT1102 AsnAspAsnAlaProGluIleLeuTyrProAlaLeuProThrAspGly 355360365 TCCACTGGCGTGGAGCTGGCGCCCCGCTCCGCAGAGCGTGGCTACCTG1150 SerThrGlyValGluLeuAlaProArgSerAlaGluArgGlyTyrLeu 370375380 GTGACCAAGGTGGTGGCGGTGGACAGAGACTCGGGCCAGAACGCCTGG1198 ValThrLysValValAlaValAspArgAspSerGlyGlnAsnAlaTrp 385390395 CTGTCCTACCGCCTGCTCAAGGCCAGCGAGCCGGGACTCTTCTCGGTG1246 LeuSerTyrArgLeuLeuLysAlaSerGluProGlyLeuPheSerVal 400405410415 GGTCTGCACACGGGCGAGGTGCGCACGGCGCGAGCCCTGCTGGACAGA1294 GlyLeuHisThrGlyGluValArgThrAlaArgAlaLeuLeuAspArg 420425430 GACGCGCTCAAGCAGAGCCTCGTGGTGGCCGTCCAGGACCATGGCCAG1342 AspAlaLeuLysGlnSerLeuValValAlaValGlnAspHisGlyGln 435440445 CCCCCTCTCTCCGCCACTGTCACGCTCACCGTAGCCGTGGCTGACAGC1390 ProProLeuSerAlaThrValThrLeuThrValAlaValAlaAspSer 450455460 ATCCCCGAAGTCCTGACCGAGTTGGGCAGTCTGAAGCCTTCGGTCGAC1438 IleProGluValLeuThrGluLeuGlySerLeuLysProSerValAsp 465470475 CCGAACGATTCGAGCCTTACACTCTATCTCGTGGTGGCAGTGGCTGCC1486 ProAsnAspSerSerLeuThrLeuTyrLeuValValAlaValAlaAla 480485490495 ATCTCCTGTGTCTTCCTCGCCTTTGTCGCTGTGCTTCTGGGGCTCAGG1534 IleSerCysValPheLeuAlaPheValAlaValLeuLeuGlyLeuArg 500505510 CTGAGGCGCTGGCACAAGTCACGCCTGCTCCAGGATTCCGGTGGCAGA1582 LeuArgArgTrpHisLysSerArgLeuLeuGlnAspSerGlyGlyArg 515520525 TTGGTAGGCGTGCCTGCCTCACATTTTGTGGGTGTTGAGGAGGTACAG1630 LeuValGlyValProAlaSerHisPheValGlyValGluGluValGln 530535540 GCTTTCCTGCAGACCTATTCCCAGGAAGTCTCCCTCACCGCCGACTCG1678 AlaPheLeuGlnThrTyrSerGlnGluValSerLeuThrAlaAspSer 545550555 CGGAAGAGTCACCTGATCTTTCCCCAGCCCAACTACGCAGACATGCTC1726 
ArgLysSerHisLeuIlePheProGlnProAsnTyrAlaAspMetLeu 560565570575 ATCAGTCAGGAGGGCTGTGAGAAAAATGATTCTTTGTTAACATCCGTA1774 IleSerGlnGluGlyCysGluLysAsnAspSerLeuLeuThrSerVal 580585590 GATTTTCATGAATATAAGAATGAAGCTGATCATGGTCAGGTGAGTTTA1822 AspPheHisGluTyrLysAsnGluAlaAspHisGlyGlnValSerLeu 595600605 GTTCTTTGCTTGCTTTTAATTTCCAGATGAATTTTATTTGGCATAAA1869 ValLeuCysLeuLeuLeuIleSerArg 610615 TTATGTTTTGAAAAACATTGTGAAGATAGTTGAAAATAATTTTTAAGGTGTATCACAGAG1929 TTTTGGGTTTATTTTGGTGGTGTTACCAAAAAATTGAACTCTAATAGTCATAGGTTATTG1989 TTTCATTTGCTTTTAAACGACTTGGAAAAGATTGTTCCACCATTTTAAACCTTCCAGTAT2049 TTTATTCCTATTATCACTCATTCACTTAAGAAGTAGCTACCCGTCCATACTGGTAATTTT2109 GCTATTGTTTGTTTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTATCCCAAACTAG2169 AACTTCAGAAAATTATCAAGAAGTCTAAAGCCTTGTTATTAGCTTAGCAAAAGTAAAATA2229 TATCTCAGAATTTTTAGGGTTATGTTTAGCATTTGAACCTGTAACTAGGCTCTTGTATAT2289 TTCTTCACTTTAAACCTCTTTTCTGAGCCCTGTTTCTGTACCAGTGCCCTTCAAAACTTT2349 AATACTTCTTACCATCCTTCAAAACATGAACAAACTTTAAAGATGGATCTTGGTGGGAGA2409 TGAGACTGGTTACTAAATATTAAGTATGTGAGTCAGTGGTCACCTGGGCTCCATCCCCAT2469 GGAGACATGAAATCTAAAGCCTAGAATGTCCATTGCTCCCCCAAACAAAAAACAAAAGCA2529 AAAACATTAGATCTGAATTAAAATGTAATTTTAAACTGTTGAAAGTGACTTTTGTAAAAT2589 ATGTAAGAACATATTTCAATACAATTCCAATTAGCTGTTTCGGTTGTGCATTGATGTGAA2649 GTGGTGAGAATGTTGATATTAAGAACCAATGTTTCAGGTACACAAGTTCTAAATAAGCTG2709 ATCAATTCAATTAAAGTTATTCAGTCTTGGCTGGACACAGTGCCTCATGTCTGAAATCCC2769 AGCACTTTGGGAGGCTGGGGCAGGAGGACCGCTTGAGCCCCGGGGGTTTGAAACTGCAGT2829 GAGCTATGATCATGCCACTGCACTCCAGCCTAGGTGGCAGAACTAGACCCTGTCTCTAAA2889 AAAACTATTATTAGGCCGCGTGCGGTGGCTCACGCCTGTAATCCCAGCACTTTGGGAGAC2949 TGAGGTGGGTGGATCACCTGAGC2972 (2) INFORMATION FOR SEQ ID NO:115: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 616 amino acids (B) TYPE: amino acid (D) TOPOLOGY: linear (ii) MOLECULE TYPE: protein (xi) SEQUENCE DESCRIPTION: SEQ ID NO:115: GluAlaAlaHisHisLeuValLeuThrAlaSerAspGlyGlyLysPro 151015 ProArgSerSerThrValArgIleHisValThrValLeuAspThrAsn 202530 AspAsnAlaProValPheProHisProIleTyrArgValLysValLeu 354045 GluAsnMetProProGlyThrArgLeuLeuThrValThrAlaSerAsp 505560 
ProAspGluGlyIleAsnGlyLysValAlaTyrLysPheArgLysIle 65707580 AsnGluLysGlnThrProLeuPheGlnLeuAsnGluAsnThrGlyGlu 859095 IleSerIleAlaLysSerLeuAspTyrGluGluCysSerPheTyrGlu 100105110 MetGluIleGlnAlaGluAspValGlyAlaLeuLeuGlyArgThrLys 115120125 LeuLeuIleSerValGluAspValAsnAspAsnArgProGluValIle 130135140 IleThrSerLeuPheSerProValLeuGluAsnSerLeuProGlyThr 145150155160 ValIleAlaPheLeuSerValHisAspGlnAspSerGlyLysAsnGly 165170175 GlnValValCysTyrThrArgAspAsnLeuProPheLysLeuGluLys 180185190 SerIleGlyAsnTyrTyrArgLeuValThrArgLysTyrLeuAspArg 195200205 GluAsnValSerIleTyrAsnIleThrValMetAlaSerAspLeuGly 210215220 ThrProProLeuSerThrGluThrGlnIleAlaLeuHisValAlaAsp 225230235240 IleAsnAspAsnProProThrPheProHisAlaSerTyrSerAlaTyr 245250255 IleLeuGluAsnAsnLeuArgGlyAlaSerIlePheSerLeuThrAla 260265270 HisAspProAspSerGlnGluAsnAlaGlnValThrTyrSerValThr 275280285 GluAspThrLeuGlnGlyAlaProLeuSerSerTyrIleSerIleAsn 290295300 SerAspThrGlyValLeuTyrAlaLeuGlnSerPheAspTyrGluGln 305310315320 IleArgAspLeuGlnLeuLeuValThrAlaSerAspSerGlyAspPro 325330335 ProLeuSerSerAsnMetSerLeuSerLeuPheValLeuAspGlnAsn 340345350 AspAsnAlaProGluIleLeuTyrProAlaLeuProThrAspGlySer 355360365 ThrGlyValGluLeuAlaProArgSerAlaGluArgGlyTyrLeuVal 370375380 ThrLysValValAlaValAspArgAspSerGlyGlnAsnAlaTrpLeu 385390395400 SerTyrArgLeuLeuLysAlaSerGluProGlyLeuPheSerValGly 405410415 LeuHisThrGlyGluValArgThrAlaArgAlaLeuLeuAspArgAsp 420425430 AlaLeuLysGlnSerLeuValValAlaValGlnAspHisGlyGlnPro 435440445 ProLeuSerAlaThrValThrLeuThrValAlaValAlaAspSerIle 450455460 ProGluValLeuThrGluLeuGlySerLeuLysProSerValAspPro 465470475480 AsnAspSerSerLeuThrLeuTyrLeuValValAlaValAlaAlaIle 485490495 SerCysValPheLeuAlaPheValAlaValLeuLeuGlyLeuArgLeu 500505510 ArgArgTrpHisLysSerArgLeuLeuGlnAspSerGlyGlyArgLeu 515520525 ValGlyValProAlaSerHisPheValGlyValGluGluValGlnAla 530535540 PheLeuGlnThrTyrSerGlnGluValSerLeuThrAlaAspSerArg 545550555560 LysSerHisLeuIlePheProGlnProAsnTyrAlaAspMetLeuIle 565570575 SerGlnGluGlyCysGluLysAsnAspSerLeuLeuThrSerValAsp 580585590 
PheHisGluTyrLysAsnGluAlaAspHisGlyGlnValSerLeuVal 595600605 LeuCysLeuLeuLeuIleSerArg 610615 __________________________________________________________________________
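The CDS feature given for SEQ ID NO:114 (LOCATION: 2..1849) can be checked against the 616-residue length stated for SEQ ID NO:115 with a short script. The following is a minimal sketch, not part of the patent: the function names `cds_codon_count` and `translate_three_letter` are illustrative, and it assumes the standard genetic code and 1-based inclusive coordinates. It counts the codons in the stated CDS range and translates the first 46 nucleotides of SEQ ID NO:114 as printed above.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order (assumed here;
# the listing itself does not name a translation table).
_AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product("TCAG", repeat=3), _AMINO)}

# One-letter to three-letter residue codes, matching the listing's notation.
ONE_TO_THREE = {
    "A": "Ala", "R": "Arg", "N": "Asn", "D": "Asp", "C": "Cys",
    "Q": "Gln", "E": "Glu", "G": "Gly", "H": "His", "I": "Ile",
    "L": "Leu", "K": "Lys", "M": "Met", "F": "Phe", "P": "Pro",
    "S": "Ser", "T": "Thr", "W": "Trp", "Y": "Tyr", "V": "Val",
}


def cds_codon_count(start, end):
    """Return the number of codons in a CDS with 1-based inclusive bounds."""
    length = end - start + 1
    if length % 3:
        raise ValueError("CDS length is not a multiple of three")
    return length // 3


def translate_three_letter(dna, start=1):
    """Translate complete codons from 1-based `start`; stop at a stop codon."""
    residues = []
    for i in range(start - 1, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3].upper()]
        if aa == "*":
            break
        residues.append(ONE_TO_THREE[aa])
    return "".join(residues)


# First 46 nucleotides of SEQ ID NO:114, copied from the listing above.
SEQ114_PREFIX = "AGAGGCTGCTCACCACCTGGTCCTCACGGCCTCGGATGGCGGCAAG"

if __name__ == "__main__":
    print(cds_codon_count(2, 1849))
    print(translate_three_letter(SEQ114_PREFIX, start=2))
```

Run as a script, this prints 616, which agrees with the length given for SEQ ID NO:115, followed by GluAlaAlaHisHisLeuValLeuThrAlaSerAspGlyGlyLys, which agrees with residues 1 to 15 shown beneath the first line of SEQ ID NO:114.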
(57)

What is claimed is:

1. Purified and isolated human protocadherin pc3 polypeptide comprising the amino acid sequence of SEQ ID NO: 110.
2. Purified and isolated rat protocadherin pc5 polypeptide comprising the amino acid sequence of SEQ ID NO: 112.
*****
