CO1A2_SCESW
ID   CO1A2_SCESW             Reviewed;         977 AA.
AC   C0HLI8;
DT   13-NOV-2019, integrated into UniProtKB/Swiss-Prot.
DT   13-NOV-2019, sequence version 1.
DT   25-MAY-2022, entry version 5.
DE   RecName: Full=Collagen alpha-2(I) chain {ECO:0000303|PubMed:31171860};
DE   AltName: Full=Alpha-2 type I collagen {ECO:0000250|UniProtKB:P08123};
DE   Flags: Fragments;
OS   Scelidodon sp. (strain SLP-2019) (South American ground sloth).
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC   Eutheria; Xenarthra; Pilosa; Folivora; Mylodontidae; Scelidodon;
OC   unclassified Scelidodon.
OX   NCBI_TaxID=2546666 {ECO:0000303|PubMed:31171860};
RN   [1] {ECO:0000305}
RP   PROTEIN SEQUENCE, TISSUE SPECIFICITY, AND IDENTIFICATION BY MASS
RP   SPECTROMETRY.
RC   TISSUE=Bone {ECO:0000303|PubMed:31171860};
RX   PubMed=31171860; DOI=10.1038/s41559-019-0909-z;
RA   Presslee S., Slater G.J., Pujos F., Forasiepi A.M., Fischer R., Molloy K.,
RA   Mackie M., Olsen J.V., Kramarz A., Taglioretti M., Scaglia F., Lezcano M.,
RA   Lanata J.L., Southon J., Feranec R., Bloch J., Hajduk A., Martin F.M.,
RA   Salas Gismondi R., Reguero M., de Muizon C., Greenwood A., Chait B.T.,
RA   Penkman K., Collins M., MacPhee R.D.E.;
RT   "Palaeoproteomics resolves sloth relationships.";
RL   Nat. Ecol. Evol. 3:1121-1130(2019).
CC   -!- FUNCTION: Type I collagen is a member of group I collagen (fibrillar
CC       forming collagen). {ECO:0000305}.
CC   -!- SUBUNIT: Trimers of one alpha 2(I) and two alpha 1(I) chains.
CC       {ECO:0000305}.
CC   -!- SUBCELLULAR LOCATION: Secreted. Secreted, extracellular space.
CC       Secreted, extracellular space, extracellular matrix {ECO:0000305}.
CC   -!- TISSUE SPECIFICITY: Expressed in bones. {ECO:0000269|PubMed:31171860}.
CC   -!- PTM: Prolines at the third position of the tripeptide repeating unit
CC       (G-X-Y) are hydroxylated in some or all of the chains.
CC       {ECO:0000250|UniProtKB:P08123}.
CC   -!- MISCELLANEOUS: These protein fragments were extracted from an ancient
CC       femur bone collected at Cueva Rosello in Peru.
CC       {ECO:0000269|PubMed:31171860}.
CC   -!- SIMILARITY: Belongs to the fibrillar collagen family. {ECO:0000305}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   AlphaFoldDB; C0HLI8; -.
DR   GO; GO:0005615; C:extracellular space; IEA:UniProtKB-SubCell.
DR   InterPro; IPR008160; Collagen.
DR   Pfam; PF01391; Collagen; 4.
PE   1: Evidence at protein level;
KW   Direct protein sequencing; Extinct organism protein; Extracellular matrix;
KW   Glycoprotein; Hydroxylation; Secreted.
FT   CHAIN           1..977
FT                   /note="Collagen alpha-2(I) chain"
FT                   /id="PRO_0000448459"
FT   REGION          1..977
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        153..167
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   MOD_RES         10
FT                   /note="4-hydroxyproline"
FT                   /evidence="ECO:0000250|UniProtKB:P08123"
FT   MOD_RES         13
FT                   /note="4-hydroxyproline"
FT                   /evidence="ECO:0000250|UniProtKB:P08123"
FT   MOD_RES         28
FT                   /note="4-hydroxyproline"
FT                   /evidence="ECO:0000250|UniProtKB:P08123"
FT   MOD_RES         34
FT                   /note="4-hydroxyproline"
FT                   /evidence="ECO:0000250|UniProtKB:P08123"
FT   MOD_RES         91
FT                   /note="5-hydroxylysine; alternate"
FT                   /evidence="ECO:0000250|UniProtKB:P08123"
FT   MOD_RES         344
FT                   /note="4-hydroxyproline"
FT                   /evidence="ECO:0000250|UniProtKB:P08123"
FT   MOD_RES         347
FT                   /note="4-hydroxyproline"
FT                   /evidence="ECO:0000250|UniProtKB:P08123"
FT   CARBOHYD        91
FT                   /note="O-linked (Gal...) hydroxylysine; alternate"
FT                   /evidence="ECO:0000250|UniProtKB:P08123"
FT   UNSURE          9
FT                   /note="L or I"
FT                   /evidence="ECO:0000303|PubMed:31171860"
FT   UNSURE          21
FT                   /note="L or I"
FT                   /evidence="ECO:0000303|PubMed:31171860"
FT   UNSURE          87
FT                   /note="L or I"
FT                   /evidence="ECO:0000303|PubMed:31171860"
FT   UNSURE          99
FT                   /note="L or I"
FT                   /evidence="ECO:0000303|PubMed:31171860"
FT   UNSURE          102
FT                   /note="L or I"
FT                   /evidence="ECO:0000303|PubMed:31171860"
FT   UNSURE          123
FT                   /note="L or I"
FT                   /evidence="ECO:0000303|PubMed:31171860"
FT   UNSURE          172
FT                   /note="L or I"
FT                   /evidence="ECO:0000303|PubMed:31171860"
FT   UNSURE          192
FT                   /note="L or I"
FT                   /evidence="ECO:0000303|PubMed:31171860"
FT   UNSURE          210
FT                   /note="L or I"
FT                   /evidence="ECO:0000303|PubMed:31171860"
FT   UNSURE          219
FT                   /note="L or I"
FT                   /evidence="ECO:0000303|PubMed:31171860"
FT   UNSURE          228
FT                   /note="L or I"
FT                   /evidence="ECO:0000303|PubMed:31171860"
FT   UNSURE          247
FT                   /note="L or I"
FT                   /evidence="ECO:0000303|PubMed:31171860"
FT   UNSURE          301
FT                   /note="L or I"
FT                   /evidence="ECO:0000303|PubMed:31171860"
FT   UNSURE          310
FT                   /note="L or I"
FT                   /evidence="ECO:0000303|PubMed:31171860"
FT   UNSURE          349
FT                   /note="L or I"
FT                   /evidence="ECO:0000303|PubMed:31171860"
FT   UNSURE          355
FT                   /note="L or I"
FT                   /evidence="ECO:0000303|PubMed:31171860"
FT   UNSURE          373
FT                   /note="L or I"
FT                   /evidence="ECO:0000303|PubMed:31171860"
FT   UNSURE          418
FT                   /note="L or I"
FT                   /evidence="ECO:0000303|PubMed:31171860"
FT   UNSURE          439
FT                   /note="L or I"
FT                   /evidence="ECO:0000303|PubMed:31171860"
FT   UNSURE          460
FT                   /note="L or I"
FT                   /evidence="ECO:0000303|PubMed:31171860"
FT   UNSURE          484
FT                   /note="L or I"
FT                   /evidence="ECO:0000303|PubMed:31171860"
FT   UNSURE          544
FT                   /note="L or I"
FT                   /evidence="ECO:0000303|PubMed:31171860"
FT   UNSURE          565
FT                   /note="L or I"
FT                   /evidence="ECO:0000303|PubMed:31171860"
FT   UNSURE          653
FT                   /note="L or I"
FT                   /evidence="ECO:0000303|PubMed:31171860"
FT   UNSURE          688
FT                   /note="L or I"
FT                   /evidence="ECO:0000303|PubMed:31171860"
FT   UNSURE          721
FT                   /note="L or I"
FT                   /evidence="ECO:0000303|PubMed:31171860"
FT   UNSURE          769
FT                   /note="L or I"
FT                   /evidence="ECO:0000303|PubMed:31171860"
FT   UNSURE          770
FT                   /note="L or I"
FT                   /evidence="ECO:0000303|PubMed:31171860"
FT   UNSURE          776
FT                   /note="L or I"
FT                   /evidence="ECO:0000303|PubMed:31171860"
FT   UNSURE          778
FT                   /note="L or I"
FT                   /evidence="ECO:0000303|PubMed:31171860"
FT   UNSURE          787
FT                   /note="L or I"
FT                   /evidence="ECO:0000303|PubMed:31171860"
FT   UNSURE          800
FT                   /note="L or I"
FT                   /evidence="ECO:0000303|PubMed:31171860"
FT   UNSURE          839
FT                   /note="L or I"
FT                   /evidence="ECO:0000303|PubMed:31171860"
FT   UNSURE          916
FT                   /note="L or I"
FT                   /evidence="ECO:0000303|PubMed:31171860"
FT   UNSURE          919
FT                   /note="L or I"
FT                   /evidence="ECO:0000303|PubMed:31171860"
FT   UNSURE          922
FT                   /note="L or I"
FT                   /evidence="ECO:0000303|PubMed:31171860"
FT   NON_CONS        16..17
FT                   /evidence="ECO:0000303|PubMed:31171860"
FT   NON_CONS        68..69
FT                   /evidence="ECO:0000303|PubMed:31171860"
FT   NON_CONS        103..104
FT                   /evidence="ECO:0000303|PubMed:31171860"
FT   NON_CONS        238..239
FT                   /evidence="ECO:0000303|PubMed:31171860"
FT   NON_CONS        802..803
FT                   /evidence="ECO:0000303|PubMed:31171860"
FT   NON_CONS        813..814
FT                   /evidence="ECO:0000303|PubMed:31171860"
FT   NON_CONS        847..848
FT                   /evidence="ECO:0000303|PubMed:31171860"
FT   NON_CONS        914..915
FT                   /evidence="ECO:0000303|PubMed:31171860"
FT   NON_TER         1
FT                   /evidence="ECO:0000303|PubMed:31171860"
FT   NON_TER         977
FT                   /evidence="ECO:0000303|PubMed:31171860"
SQ   SEQUENCE   977 AA;  87343 MW;  DEB38B7E75A29BE7 CRC64;
     SGGFDFSFLP QPPQEKGPMG LMGPRGPPGA SGAPGPQGFQ GPAGEPGEPG QTGPAGARGP
     AGPPGKAGER GVVGPQGARG FPGTPGLPGF KGIRGHNGLD GLKGEPGAPG ENGTPGQTGA
     RGLPGERGRV GAPGPAGSRG SDGSVGPVGP AGPIGSAGPP GFPGAPGPKG ELGPVGNTGP
     SGPAGPRGEQ GLPGVSGPVG PPGNPGANGL TGAKGAAGLP GVAGAPGLPG PRGIPGPVSG
     ATGARGLVGE PGPAGSKGES GGKGEPGSAG PQGPPGSSGE EGKRGPSGES GSTGPTGPPG
     LRGGPGSRGL PGADGRAGVI GPAGARGASG PAGVRGPSGD TGRPGEPGLM GARGLPGSPG
     NVGPAGKEGP AGLPGIDGRP GPIGPAGARG EAGNIGFPGP KGPAGDPGKA GEKGHAGLAG
     NRGAPGPDGN NGAQGPPGLQ GVQGGKGEQG PAGPPGFQGL PGPAGTTGEA GKPGERGIPG
     EFGLPGPAGP RGERGPSGES GAVGPSGAIG SRGPSGPPGP DGNKGEPGVV GAPGTAGPAG
     SGGLPGERGA AGIPGGKGEK GETGLRGEVG TTGRDGARGA PGAVGAPGPA GATGDRGEAG
     AAGPAGPAGP RGGPGERGEV GPAGPNGFAG PAGAAGQPGA KGERGTKGPK GELGIVGPTG
     PVGSAGPAGP NGPAGPAGSR GDGGPPGLTG FPGAAGRTGP PGPSGITGPP GPPGAAGKEG
     LRGPRGDQGP VGRTGETGAG GPPGFTGEKG PSGEPGTAGP PGTAGPQGLL GAPGILGLPG
     SRGERGLPGV AGAVGEPGPL GIGPPGARGP SGAGVNGAPG EAGRDGNPGS DGPPGRDGLP
     GHKGERGAGN PGPVGAAGAP GPHGAVGPAG KHGNRGEPGP AGSVGPVGAV GPRGPSGPQG
     IRGDKGEAGD KGPRGLQGLP GLAGQHGDQG APGPVGPAGP RGPAGPSGPP GKDGRTGHPG
     AVGPAGIRGS QGSQGPS
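
The SQ line above declares a length of 977 AA, and the NON_CONS features mark points where adjacent residues in the record are not consecutive in the native protein (consistent with "Flags: Fragments"). A minimal Python sketch of how one might check the declared length and split a record into its sequenced fragments follows. The function names (parse_sq_block, non_cons_breaks, split_fragments) are hypothetical helpers, not part of any UniProt tooling, and the SAMPLE record is a toy input with placeholder MW and CRC64 values, not data from this entry.

```python
import re

def parse_sq_block(flat_text):
    """Return (declared_length, sequence) from a UniProt-style flat-file record."""
    declared = None
    residues = []
    in_sequence = False
    for line in flat_text.splitlines():
        if line.startswith("SQ "):
            match = re.search(r"SEQUENCE\s+(\d+)\s+AA;", line)
            if match is None:
                raise ValueError("SQ line lacks a declared length")
            declared = int(match.group(1))
            in_sequence = True
        elif in_sequence:
            # Sequence lines are indented; anything else ends the block.
            if not line.startswith(" "):
                break
            residues.append("".join(line.split()))
    if declared is None:
        raise ValueError("no SQ line found")
    return declared, "".join(residues)

def non_cons_breaks(flat_text):
    """Return the 1-based position of the residue just before each NON_CONS gap."""
    pattern = re.compile(r"^FT\s+NON_CONS\s+(\d+)\.\.(\d+)", re.MULTILINE)
    return [int(m.group(1)) for m in pattern.finditer(flat_text)]

def split_fragments(sequence, breaks):
    """Split a sequence after each 1-based break position."""
    pieces = []
    start = 0
    for last in sorted(breaks):
        pieces.append(sequence[start:last])
        start = last
    pieces.append(sequence[start:])
    return pieces

# Toy record for illustration only (placeholder MW and CRC64 values).
SAMPLE = """\
FT   NON_CONS        4..5
SQ   SEQUENCE   12 AA;  0 MW;  0000000000000000 CRC64;
     GPPGAPGLPG AR
//
"""

if __name__ == "__main__":
    declared, seq = parse_sq_block(SAMPLE)
    print("declared:", declared, "observed:", len(seq))
    print("fragments:", split_fragments(seq, non_cons_breaks(SAMPLE)))
```

Applied to the full record text of this entry, the same helpers would be expected to report a declared length of 977 and nine fragments, one more than the eight NON_CONS features listed.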
 
 