CO1A2_SCESW
ID CO1A2_SCESW Reviewed; 977 AA.
AC C0HLI8;
DT 13-NOV-2019, integrated into UniProtKB/Swiss-Prot.
DT 13-NOV-2019, sequence version 1.
DT 25-MAY-2022, entry version 5.
DE RecName: Full=Collagen alpha-2(I) chain {ECO:0000303|PubMed:31171860};
DE AltName: Full=Alpha-2 type I collagen {ECO:0000250|UniProtKB:P08123};
DE Flags: Fragments;
OS Scelidodon sp. (strain SLP-2019) (South American ground sloth).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Xenarthra; Pilosa; Folivora; Mylodontidae; Scelidodon;
OC unclassified Scelidodon.
OX NCBI_TaxID=2546666 {ECO:0000303|PubMed:31171860};
RN [1] {ECO:0000305}
RP PROTEIN SEQUENCE, TISSUE SPECIFICITY, AND IDENTIFICATION BY MASS
RP SPECTROMETRY.
RC TISSUE=Bone {ECO:0000303|PubMed:31171860};
RX PubMed=31171860; DOI=10.1038/s41559-019-0909-z;
RA Presslee S., Slater G.J., Pujos F., Forasiepi A.M., Fischer R., Molloy K.,
RA Mackie M., Olsen J.V., Kramarz A., Taglioretti M., Scaglia F., Lezcano M.,
RA Lanata J.L., Southon J., Feranec R., Bloch J., Hajduk A., Martin F.M.,
RA Salas Gismondi R., Reguero M., de Muizon C., Greenwood A., Chait B.T.,
RA Penkman K., Collins M., MacPhee R.D.E.;
RT "Palaeoproteomics resolves sloth relationships.";
RL Nat. Ecol. Evol. 3:1121-1130(2019).
CC -!- FUNCTION: Type I collagen is a member of group I collagen (fibrillar
CC forming collagen). {ECO:0000305}.
CC -!- SUBUNIT: Trimers of one alpha 2(I) and two alpha 1(I) chains.
CC {ECO:0000305}.
CC -!- SUBCELLULAR LOCATION: Secreted. Secreted, extracellular space.
CC Secreted, extracellular space, extracellular matrix {ECO:0000305}.
CC -!- TISSUE SPECIFICITY: Expressed in bones. {ECO:0000269|PubMed:31171860}.
CC -!- PTM: Prolines at the third position of the tripeptide repeating unit
CC (G-X-Y) are hydroxylated in some or all of the chains.
CC {ECO:0000250|UniProtKB:P08123}.
CC -!- MISCELLANEOUS: These protein fragments were extracted from an ancient
CC femur bone collected at Cueva Rosello in Peru.
CC {ECO:0000269|PubMed:31171860}.
CC -!- SIMILARITY: Belongs to the fibrillar collagen family. {ECO:0000305}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR AlphaFoldDB; C0HLI8; -.
DR GO; GO:0005615; C:extracellular space; IEA:UniProtKB-SubCell.
DR InterPro; IPR008160; Collagen.
DR Pfam; PF01391; Collagen; 4.
PE 1: Evidence at protein level;
KW Direct protein sequencing; Extinct organism protein; Extracellular matrix;
KW Glycoprotein; Hydroxylation; Secreted.
FT CHAIN 1..977
FT /note="Collagen alpha-2(I) chain"
FT /id="PRO_0000448459"
FT REGION 1..977
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 153..167
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT MOD_RES 10
FT /note="4-hydroxyproline"
FT /evidence="ECO:0000250|UniProtKB:P08123"
FT MOD_RES 13
FT /note="4-hydroxyproline"
FT /evidence="ECO:0000250|UniProtKB:P08123"
FT MOD_RES 28
FT /note="4-hydroxyproline"
FT /evidence="ECO:0000250|UniProtKB:P08123"
FT MOD_RES 34
FT /note="4-hydroxyproline"
FT /evidence="ECO:0000250|UniProtKB:P08123"
FT MOD_RES 91
FT /note="5-hydroxylysine; alternate"
FT /evidence="ECO:0000250|UniProtKB:P08123"
FT MOD_RES 344
FT /note="4-hydroxyproline"
FT /evidence="ECO:0000250|UniProtKB:P08123"
FT MOD_RES 347
FT /note="4-hydroxyproline"
FT /evidence="ECO:0000250|UniProtKB:P08123"
FT CARBOHYD 91
FT /note="O-linked (Gal...) hydroxylysine; alternate"
FT /evidence="ECO:0000250|UniProtKB:P08123"
FT UNSURE 9
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 21
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 87
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 99
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 102
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 123
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 172
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 192
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 210
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 219
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 228
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 247
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 301
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 310
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 349
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 355
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 373
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 418
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 439
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 460
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 484
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 544
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 565
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 653
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 688
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 721
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 769
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 770
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 776
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 778
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 787
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 800
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 839
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 916
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 919
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT UNSURE 922
FT /note="L or I"
FT /evidence="ECO:0000303|PubMed:31171860"
FT NON_CONS 16..17
FT /evidence="ECO:0000303|PubMed:31171860"
FT NON_CONS 68..69
FT /evidence="ECO:0000303|PubMed:31171860"
FT NON_CONS 103..104
FT /evidence="ECO:0000303|PubMed:31171860"
FT NON_CONS 238..239
FT /evidence="ECO:0000303|PubMed:31171860"
FT NON_CONS 802..803
FT /evidence="ECO:0000303|PubMed:31171860"
FT NON_CONS 813..814
FT /evidence="ECO:0000303|PubMed:31171860"
FT NON_CONS 847..848
FT /evidence="ECO:0000303|PubMed:31171860"
FT NON_CONS 914..915
FT /evidence="ECO:0000303|PubMed:31171860"
FT NON_TER 1
FT /evidence="ECO:0000303|PubMed:31171860"
FT NON_TER 977
FT /evidence="ECO:0000303|PubMed:31171860"
SQ SEQUENCE 977 AA; 87343 MW; DEB38B7E75A29BE7 CRC64;
SGGFDFSFLP QPPQEKGPMG LMGPRGPPGA SGAPGPQGFQ GPAGEPGEPG QTGPAGARGP
AGPPGKAGER GVVGPQGARG FPGTPGLPGF KGIRGHNGLD GLKGEPGAPG ENGTPGQTGA
RGLPGERGRV GAPGPAGSRG SDGSVGPVGP AGPIGSAGPP GFPGAPGPKG ELGPVGNTGP
SGPAGPRGEQ GLPGVSGPVG PPGNPGANGL TGAKGAAGLP GVAGAPGLPG PRGIPGPVSG
ATGARGLVGE PGPAGSKGES GGKGEPGSAG PQGPPGSSGE EGKRGPSGES GSTGPTGPPG
LRGGPGSRGL PGADGRAGVI GPAGARGASG PAGVRGPSGD TGRPGEPGLM GARGLPGSPG
NVGPAGKEGP AGLPGIDGRP GPIGPAGARG EAGNIGFPGP KGPAGDPGKA GEKGHAGLAG
NRGAPGPDGN NGAQGPPGLQ GVQGGKGEQG PAGPPGFQGL PGPAGTTGEA GKPGERGIPG
EFGLPGPAGP RGERGPSGES GAVGPSGAIG SRGPSGPPGP DGNKGEPGVV GAPGTAGPAG
SGGLPGERGA AGIPGGKGEK GETGLRGEVG TTGRDGARGA PGAVGAPGPA GATGDRGEAG
AAGPAGPAGP RGGPGERGEV GPAGPNGFAG PAGAAGQPGA KGERGTKGPK GELGIVGPTG
PVGSAGPAGP NGPAGPAGSR GDGGPPGLTG FPGAAGRTGP PGPSGITGPP GPPGAAGKEG
LRGPRGDQGP VGRTGETGAG GPPGFTGEKG PSGEPGTAGP PGTAGPQGLL GAPGILGLPG
SRGERGLPGV AGAVGEPGPL GIGPPGARGP SGAGVNGAPG EAGRDGNPGS DGPPGRDGLP
GHKGERGAGN PGPVGAAGAP GPHGAVGPAG KHGNRGEPGP AGSVGPVGAV GPRGPSGPQG
IRGDKGEAGD KGPRGLQGLP GLAGQHGDQG APGPVGPAGP RGPAGPSGPP GKDGRTGHPG
AVGPAGIRGS QGSQGPS