CO1A2_EQUSP
ID CO1A2_EQUSP Reviewed; 912 AA.
AC C0HJP0;
DT 22-JUL-2015, integrated into UniProtKB/Swiss-Prot.
DT 22-JUL-2015, sequence version 1.
DT 03-AUG-2022, entry version 13.
DE RecName: Full=Collagen alpha-2(I) chain {ECO:0000303|PubMed:25799987};
DE AltName: Full=Alpha-2 type I collagen {ECO:0000250|UniProtKB:P08123};
DE Flags: Fragments;
GN Name=COL1A2 {ECO:0000250|UniProtKB:P08123};
OS Equus sp.
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Laurasiatheria; Perissodactyla; Equidae; Equus.
OX NCBI_TaxID=46122 {ECO:0000303|PubMed:25799987};
RN [1] {ECO:0000305}
RP PROTEIN SEQUENCE, AND IDENTIFICATION BY MASS SPECTROMETRY.
RC TISSUE=Bone {ECO:0000303|PubMed:25799987};
RX PubMed=25799987; DOI=10.1038/nature14249;
RA Welker F., Collins M.J., Thomas J.A., Wadsley M., Brace S., Cappellini E.,
RA Turvey S.T., Reguero M., Gelfo J.N., Kramarz A., Burger J.,
RA Thomas-Oates J., Ashford D.A., Ashton P.D., Rowsell K., Porter D.M.,
RA Kessler B., Fischer R., Baessmann C., Kaspar S., Olsen J.V., Kiley P.,
RA Elliott J.A., Kelstrup C.D., Mullin V., Hofreiter M., Willerslev E.,
RA Hublin J.J., Orlando L., Barnes I., MacPhee R.D.;
RT "Ancient proteins resolve the evolutionary history of Darwin's South
RT American ungulates.";
RL Nature 522:81-84(2015).
CC -!- FUNCTION: Type I collagen is a member of group I collagen (fibrillar
CC forming collagen). {ECO:0000305}.
CC -!- SUBUNIT: Trimers of one alpha 2(I) and two alpha 1(I) chains.
CC {ECO:0000305}.
CC -!- SUBCELLULAR LOCATION: Secreted. Secreted, extracellular space.
CC Secreted, extracellular space, extracellular matrix {ECO:0000305}.
CC -!- TISSUE SPECIFICITY: Forms the fibrils of tendon, ligaments and bones.
CC In bones, the fibrils are mineralized with calcium hydroxyapatite.
CC {ECO:0000305}.
CC -!- PTM: Prolines at the third position of the tripeptide repeating unit
CC (G-X-Y) are hydroxylated in some or all of the chains. {ECO:0000305}.
CC -!- MISCELLANEOUS: These protein fragments were extracted from fossils. The
CC tryptic peptides required multiple purification steps in order to
CC eliminate contaminants and to increase the concentration of peptidic
CC material. {ECO:0000305|PubMed:25799987}.
CC -!- SIMILARITY: Belongs to the fibrillar collagen family. {ECO:0000305}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR AlphaFoldDB; C0HJP0; -.
DR PRIDE; C0HJP0; -.
DR GO; GO:0005581; C:collagen trimer; IEA:UniProtKB-KW.
DR GO; GO:0005615; C:extracellular space; IEA:UniProtKB-SubCell.
DR InterPro; IPR008160; Collagen.
DR Pfam; PF01391; Collagen; 5.
PE 1: Evidence at protein level;
KW Calcium; Collagen; Direct protein sequencing; Extracellular matrix;
KW Hydroxylation; Repeat; Secreted.
FT CHAIN 1..912
FT /note="Collagen alpha-2(I) chain"
FT /evidence="ECO:0000269|PubMed:25799987"
FT /id="PRO_0000433501"
FT REGION 1..206
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 222..739
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 763..912
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 147..161
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT UNSURE 5
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 75
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 84
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 87
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 117
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 148
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 166
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 186
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 189
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 204
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 213
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 222
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 228
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 243
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 297
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 300
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 339
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 352
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 363
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 366
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 373
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 385
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 408
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 450
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 468
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 474
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 496
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 517
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 531
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 540
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 547
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 675
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 731
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 732
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 737
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 738
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 740
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 746
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 753
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 759
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 761
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 859
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 862
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT UNSURE 865
FT /note="I or L"
FT /evidence="ECO:0000269|PubMed:25799987"
FT NON_CONS 42..43
FT /evidence="ECO:0000303|PubMed:25799987"
FT NON_CONS 79..80
FT /evidence="ECO:0000303|PubMed:25799987"
FT NON_CONS 298..299
FT /evidence="ECO:0000303|PubMed:25799987"
FT NON_CONS 481..482
FT /evidence="ECO:0000303|PubMed:25799987"
FT NON_CONS 544..545
FT /evidence="ECO:0000303|PubMed:25799987"
FT NON_CONS 589..590
FT /evidence="ECO:0000303|PubMed:25799987"
FT NON_CONS 616..617
FT /evidence="ECO:0000303|PubMed:25799987"
FT NON_CONS 687..688
FT /evidence="ECO:0000303|PubMed:25799987"
FT NON_CONS 744..745
FT /evidence="ECO:0000303|PubMed:25799987"
FT NON_CONS 800..801
FT /evidence="ECO:0000303|PubMed:25799987"
FT NON_CONS 826..827
FT /evidence="ECO:0000303|PubMed:25799987"
FT NON_CONS 854..855
FT /evidence="ECO:0000303|PubMed:25799987"
FT NON_CONS 896..897
FT /evidence="ECO:0000303|PubMed:25799987"
SQ SEQUENCE 912 AA; 80840 MW; 359348F28BE32C2C CRC64;
GPMGIMGPRG PPGASGAPGP QGFQGPAGEP GEPGQTGPAG ARAGEDGHPG KPGRPGERGV
VGPQGARGFP GTPGIPGFKG HNGIDGIKGQ PGAPGVKGEP GAPGENGTPG QAGARGIPGE
RGRVGAPGPA GARGSDGSVG PVGPAGPIGS AGPPGFPGAP GPKGEIGPVG NPGPAGPAGP
RGEVGIPGIS GPVGPPGNPG ANGITGAKGA AGIPGVAGAP GIPGPRGIPG PAGAAGATGA
RGIVGEPGPA GSKGESGNKG EPGAAGPQGP PGPSGEEGKR GPNGEPGSTG PAGPPGIRGI
PGADGRAGVM GPAGSRGASG PAGVRGPNGD SGRPGEPGIM GPRGFPGSPG NIGPAGKEGP
VGIPGIDGRP GPIGPAGARG EPGNIGFPGP KGPSGEPGKP GDKGDAGIAG ARGAPGPDGN
NGAQGPPGPQ GVQGGKGEQG PAGPPGFQGI PGPAGTAGEV GKPGERGIPG EFGIPGPAGA
RGPPGESGAA GPAGPIGSRG PSGPPGPDGN KGEPGVIGAP GTAGPSGPSG IPGERGAAGI
PGGKGEIGNP GRDGARGAPG AVGAPGPAGA NGDRGEAGAA GPAGPAGPRG EVGPAGPNGF
AGPAGAAGQP GAKGERGPKG ENGPVGPTGP VGAAGPSGPN GPPGPAGSRG DGGPPGVTGF
PGAAGRTGPP GPSGISGPPG PPGAAGKGDQ GPVGRAGETG ASGPPGFAGE KGPSGEPGTA
GPPGTPGPQG IIGAPGIIGI PGSRGIPGVA GSIGEPGPIG IAGPPGARGP PGAVGAPGVN
GAPGEAGRDG NPGSDGPPGR GYPGNAGPVG AVGAPGPHGP VGPTGKRGEP GPVGSVGPVG
AVGPRGPSGP QGVRGHNGIQ GIPGIAGQHG DQGAPGSVGP AGPRGPAGPT GPVGKDSGQP
GTVGPAGVRG SQ