CLC91_CAEEL
ID CLC91_CAEEL Reviewed; 225 AA.
AC Q94417;
DT 05-SEP-2006, integrated into UniProtKB/Swiss-Prot.
DT 01-FEB-1997, sequence version 1.
DT 03-AUG-2022, entry version 131.
DE RecName: Full=C-type lectin domain-containing protein 91;
DE Flags: Precursor;
GN Name=clec-91; ORFNames=ZK858.3;
OS Caenorhabditis elegans.
OC Eukaryota; Metazoa; Ecdysozoa; Nematoda; Chromadorea; Rhabditida;
OC Rhabditina; Rhabditomorpha; Rhabditoidea; Rhabditidae; Peloderinae;
OC Caenorhabditis.
OX NCBI_TaxID=6239;
RN [1]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Bristol N2;
RX PubMed=9851916; DOI=10.1126/science.282.5396.2012;
RG The C. elegans sequencing consortium;
RT "Genome sequence of the nematode C. elegans: a platform for investigating
RT biology.";
RL Science 282:2012-2018(1998).
RN [2]
RP GLYCOSYLATION [LARGE SCALE ANALYSIS] AT ASN-217, AND IDENTIFICATION BY MASS
RP SPECTROMETRY.
RC STRAIN=Bristol N2;
RX PubMed=12754521; DOI=10.1038/nbt829;
RA Kaji H., Saito H., Yamauchi Y., Shinkawa T., Taoka M., Hirabayashi J.,
RA Kasai K., Takahashi N., Isobe T.;
RT "Lectin affinity capture, isotope-coded tagging and mass spectrometry to
RT identify N-linked glycoproteins.";
RL Nat. Biotechnol. 21:667-672(2003).
RN [3]
RP GLYCOSYLATION [LARGE SCALE ANALYSIS] AT ASN-217, AND IDENTIFICATION BY MASS
RP SPECTROMETRY.
RC STRAIN=Bristol N2;
RX PubMed=17761667; DOI=10.1074/mcp.m600392-mcp200;
RA Kaji H., Kamiie J., Kawakami H., Kido K., Yamauchi Y., Shinkawa T.,
RA Taoka M., Takahashi N., Isobe T.;
RT "Proteomics reveals N-linked glycoprotein diversity in Caenorhabditis
RT elegans and suggests an atypical translocation mechanism for integral
RT membrane proteins.";
RL Mol. Cell. Proteomics 6:2100-2109(2007).
CC -!- SUBCELLULAR LOCATION: Secreted {ECO:0000305}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; Z79759; CAB02136.1; -; Genomic_DNA.
DR PIR; T28053; T28053.
DR RefSeq; NP_492448.1; NM_060047.4.
DR AlphaFoldDB; Q94417; -.
DR SMR; Q94417; -.
DR BioGRID; 38167; 7.
DR STRING; 6239.ZK858.3; -.
DR iPTMnet; Q94417; -.
DR EPD; Q94417; -.
DR PaxDb; Q94417; -.
DR PeptideAtlas; Q94417; -.
DR EnsemblMetazoa; ZK858.3.1; ZK858.3.1; WBGene00014117.
DR GeneID; 172736; -.
DR KEGG; cel:CELE_ZK858.3; -.
DR UCSC; ZK858.3; c. elegans.
DR CTD; 172736; -.
DR WormBase; ZK858.3; CE15425; WBGene00014117; clec-91.
DR eggNOG; KOG4297; Eukaryota.
DR GeneTree; ENSGT00970000196041; -.
DR HOGENOM; CLU_093598_0_0_1; -.
DR InParanoid; Q94417; -.
DR OMA; NGWSSIA; -.
DR OrthoDB; 1118305at2759; -.
DR PhylomeDB; Q94417; -.
DR PRO; PR:Q94417; -.
DR Proteomes; UP000001940; Chromosome I.
DR Bgee; WBGene00014117; Expressed in germ line (C elegans) and 4 other tissues.
DR GO; GO:0009897; C:external side of plasma membrane; IBA:GO_Central.
DR GO; GO:0005576; C:extracellular region; IEA:UniProtKB-SubCell.
DR GO; GO:0030246; F:carbohydrate binding; IBA:GO_Central.
DR Gene3D; 3.10.100.10; -; 1.
DR InterPro; IPR001304; C-type_lectin-like.
DR InterPro; IPR016186; C-type_lectin-like/link_sf.
DR InterPro; IPR016187; CTDL_fold.
DR Pfam; PF00059; Lectin_C; 1.
DR SMART; SM00034; CLECT; 1.
DR SUPFAM; SSF56436; SSF56436; 1.
DR PROSITE; PS50041; C_TYPE_LECTIN_2; 1.
PE 1: Evidence at protein level;
KW Disulfide bond; Glycoprotein; Lectin; Reference proteome; Secreted; Signal.
FT SIGNAL 1..21
FT /evidence="ECO:0000255"
FT CHAIN 22..225
FT /note="C-type lectin domain-containing protein 91"
FT /id="PRO_0000248424"
FT DOMAIN 85..215
FT /note="C-type lectin"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00040"
FT CARBOHYD 217
FT /note="N-linked (GlcNAc...) asparagine"
FT /evidence="ECO:0000269|PubMed:12754521,
FT ECO:0000269|PubMed:17761667"
FT DISULFID 106..214
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00040"
FT DISULFID 185..206
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00040"
SQ SEQUENCE 225 AA; 25647 MW; 04DD8C66C3A60D5C CRC64;
MRSTYILIIV PLIIIGGGVV ADNTNETPVL AHSSDEQPHQ RLTYYNWDHK DLGTSAFEDL
PPLQDQPTPL PIDQSDRCPD GWLRYSDSCY FIETESLGFA KAERKCHDKQ ATLFVANSME
EWDAVRDHAE KSVLSWIGLV RFSHYERLEQ LPRWQTTGSI NPSKINWLIK PFKPVVNGWS
SYANCAASFQ SPTEVESASY TFFYPCTMAF KSICERNSTI LNARN