CSP_PLAVS
ID CSP_PLAVS Reviewed; 377 AA.
AC P13826; A5KB63;
DT 01-JAN-1990, integrated into UniProtKB/Swiss-Prot.
DT 25-MAY-2022, sequence version 2.
DT 03-AUG-2022, entry version 65.
DE RecName: Full=Circumsporozoite protein {ECO:0000303|PubMed:2416057};
DE Short=CS {ECO:0000303|PubMed:2416057};
DE Contains:
DE RecName: Full=Circumsporozoite protein C-terminus {ECO:0000305};
DE Flags: Precursor;
GN Name=CSP {ECO:0000250|UniProtKB:P23093};
OS Plasmodium vivax (strain Salvador I).
OC Eukaryota; Sar; Alveolata; Apicomplexa; Aconoidasida; Haemosporida;
OC Plasmodiidae; Plasmodium; Plasmodium (Plasmodium).
OX NCBI_TaxID=126793;
RN [1] {ECO:0000312|Proteomes:UP000008333}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Salvador I {ECO:0000312|Proteomes:UP000008333};
RX PubMed=18843361; DOI=10.1038/nature07327;
RA Carlton J.M., Adams J.H., Silva J.C., Bidwell S.L., Lorenzi H., Caler E.,
RA Crabtree J., Angiuoli S.V., Merino E.F., Amedeo P., Cheng Q.,
RA Coulson R.M.R., Crabb B.S., del Portillo H.A., Essien K., Feldblyum T.V.,
RA Fernandez-Becerra C., Gilson P.R., Gueye A.H., Guo X., Kang'a S.,
RA Kooij T.W.A., Korsinczky M., Meyer E.V.-S., Nene V., Paulsen I., White O.,
RA Ralph S.A., Ren Q., Sargeant T.J., Salzberg S.L., Stoeckert C.J.,
RA Sullivan S.A., Yamamoto M.M., Hoffman S.L., Wortman J.R., Gardner M.J.,
RA Galinski M.R., Barnwell J.W., Fraser-Liggett C.M.;
RT "Comparative genomics of the neglected human malaria parasite Plasmodium
RT vivax.";
RL Nature 455:757-763(2008).
RN [2]
RP NUCLEOTIDE SEQUENCE OF 35-377, AND REPEATS.
RX PubMed=2416057; DOI=10.1126/science.2416057;
RA McCutchan T.F., Lal A.A., de la Cruz V.F., Miller L.H., Maloy W.L.,
RA Charoenvit Y., Beaudoin R.L., Guerry P., Wistar R. Jr., Hoffman S.L.,
RA Hockmeyer W.T., Collins W.E., Wirth D.;
RT "Sequence of the immunodominant epitope for the surface protein on
RT sporozoites of Plasmodium vivax.";
RL Science 230:1381-1383(1985).
RN [3]
RP NUCLEOTIDE SEQUENCE OF 35-377, AND POLYMORPHISM.
RX PubMed=2437120; DOI=10.1016/s0021-9258(18)48264-6;
RA de la Cruz V.F., Lal A.A., Welsh J.A., McCutchan T.F.;
RT "Evolution of the immunodominant domain of the circumsporozoite protein
RT gene from Plasmodium vivax. Implications for vaccines.";
RL J. Biol. Chem. 262:6464-6467(1987).
CC -!- FUNCTION: Essential sporozoite protein (By similarity). In the mosquito
CC vector, required for sporozoite development in the oocyst, migration
CC through the vector hemolymph and entry into the vector salivary glands
CC (By similarity). In the vertebrate host, required for sporozoite
CC migration through the host dermis and infection of host hepatocytes (By
CC similarity). Binds to highly sulfated heparan sulfate proteoglycans
CC (HSPGs) on the surface of host hepatocytes (By similarity).
CC {ECO:0000250|UniProtKB:P02893, ECO:0000250|UniProtKB:P23093}.
CC -!- FUNCTION: [Circumsporozoite protein C-terminus]: In the vertebrate
CC host, binds to highly sulfated heparan sulfate proteoglycans (HSPGs) on
CC the surface of host hepatocytes and is required for sporozoite invasion
CC of the host hepatocytes. {ECO:0000250|UniProtKB:P23093}.
CC -!- SUBCELLULAR LOCATION: Cell membrane {ECO:0000250|UniProtKB:P19597};
CC Lipid-anchor, GPI-anchor {ECO:0000255}. Cytoplasm
CC {ECO:0000250|UniProtKB:P23093}. Note=Localizes to the cytoplasm and the
CC cell membrane in oocysts at day 6 post infection and then gradually
CC distributes over the entire cell surface of the sporoblast and the
CC budding sporozoites. {ECO:0000250|UniProtKB:P23093}.
CC -!- DOMAIN: The N-terminus is involved in the initial binding to heparan
CC sulfate proteoglycans (HSPGs) on the surface of host hepatocytes (By
CC similarity). The N-terminus masks the TSP type-1 (TSR) domain which
CC maintains the sporozoites in a migratory state, enabling them to
CC complete their journey to the salivary gland in the mosquito vector and
CC then to the host liver. The unmasking of the TSP type-1 (TSR) domain
CC when the sporozoite interacts with the host hepatocyte also protects
CC sporozoites from host antibodies (By similarity).
CC {ECO:0000250|UniProtKB:P23093, ECO:0000250|UniProtKB:Q7K740}.
CC -!- DOMAIN: The TSP type-1 (TSR) domain is required for sporozoite
CC development and invasion. CSP has two conformational states, an
CC adhesive conformation in which the TSP type-1 (TSR) domain is exposed
CC and a nonadhesive conformation in which the TSR is masked by the N-
CC terminus. TSR-exposed conformation occurs during sporozoite development
CC in the oocyst in the mosquito vector and during host hepatocyte
CC invasion. TSR-masked conformation occurs during sporozoite migration
CC through the hemolymph to salivary glands in the mosquito vector and in
CC the host dermis. {ECO:0000250|UniProtKB:P23093}.
CC -!- DOMAIN: The GPI-anchor is essential for cell membrane localization and
CC for sporozoite formation inside the oocyst.
CC {ECO:0000250|UniProtKB:P23093}.
CC -!- PTM: During host cell invasion, proteolytically cleaved at the cell
CC membrane in the region I by a papain-like cysteine protease of parasite
CC origin (By similarity). Cleavage is triggered by the sporozoite contact
CC with highly sulfated heparan sulfate proteoglycans (HSPGs) present on
CC the host hepatocyte cell surface (By similarity). Cleavage exposes the
CC TSP type-1 (TSR) domain and is required for productive invasion of host
CC hepatocytes but not for adhesion to the host cell membrane (By
CC similarity). Cleavage is dispensable for sporozoite development in the
CC oocyst, motility and for traversal of host and vector cells (By
CC similarity). {ECO:0000250|UniProtKB:P02893,
CC ECO:0000250|UniProtKB:P23093}.
CC -!- PTM: O-glycosylated; maybe by POFUT2. {ECO:0000250|UniProtKB:P19597}.
CC -!- POLYMORPHISM: The sequence of the repeats varies across Plasmodium
CC species and strains. {ECO:0000269|PubMed:2437120}.
CC -!- SIMILARITY: Belongs to the plasmodium circumsporozoite protein family.
CC {ECO:0000305}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AAKM01000017; EDL43341.1; -; Genomic_DNA.
DR RefSeq; XP_001613068.1; XM_001613018.1.
DR AlphaFoldDB; P13826; -.
DR SMR; P13826; -.
DR STRING; 5855.PVX_119355; -.
DR EnsemblProtists; EDL43341; EDL43341; PVX_119355.
DR GeneID; 5472322; -.
DR KEGG; pvx:PVX_119355; -.
DR VEuPathDB; PlasmoDB:PVX_119355; -.
DR InParanoid; P13826; -.
DR OMA; MMRKLAI; -.
DR PhylomeDB; P13826; -.
DR Proteomes; UP000008333; Chromosome 8.
DR Proteomes; UP000008333; Unassembled WGS sequence.
DR GO; GO:0031225; C:anchored component of membrane; IEA:UniProtKB-KW.
DR GO; GO:0009986; C:cell surface; IEA:InterPro.
DR GO; GO:0005737; C:cytoplasm; IEA:UniProtKB-SubCell.
DR GO; GO:0005886; C:plasma membrane; IEA:UniProtKB-SubCell.
DR Gene3D; 2.20.100.10; -; 1.
DR InterPro; IPR003067; Crcmsprzoite.
DR InterPro; IPR000884; TSP1_rpt.
DR InterPro; IPR036383; TSP1_rpt_sf.
DR Pfam; PF00090; TSP_1; 1.
DR PRINTS; PR01303; CRCMSPRZOITE.
DR SMART; SM00209; TSP1; 1.
DR SUPFAM; SSF82895; SSF82895; 1.
DR PROSITE; PS50092; TSP1; 1.
PE 3: Inferred from homology;
KW Cell membrane; Cytoplasm; Disulfide bond; Glycoprotein; GPI-anchor;
KW Lipoprotein; Malaria; Membrane; Reference proteome; Repeat; Signal;
KW Sporozoite.
FT SIGNAL 1..22
FT /evidence="ECO:0000255"
FT CHAIN 23..354
FT /note="Circumsporozoite protein"
FT /evidence="ECO:0000255"
FT /id="PRO_0000217182"
FT CHAIN ?..354
FT /note="Circumsporozoite protein C-terminus"
FT /evidence="ECO:0000250|UniProtKB:P23093"
FT /id="PRO_0000455507"
FT PROPEP 355..377
FT /note="Removed in mature form"
FT /evidence="ECO:0000255"
FT /id="PRO_0000455508"
FT REPEAT 95..103
FT /note="1"
FT /evidence="ECO:0000305|PubMed:2416057"
FT REPEAT 104..112
FT /note="2"
FT /evidence="ECO:0000305|PubMed:2416057"
FT REPEAT 113..121
FT /note="3"
FT /evidence="ECO:0000305|PubMed:2416057"
FT REPEAT 122..130
FT /note="4"
FT /evidence="ECO:0000305|PubMed:2416057"
FT REPEAT 131..139
FT /note="5"
FT /evidence="ECO:0000305|PubMed:2416057"
FT REPEAT 140..148
FT /note="6"
FT /evidence="ECO:0000305|PubMed:2416057"
FT REPEAT 149..157
FT /note="7"
FT /evidence="ECO:0000305|PubMed:2416057"
FT REPEAT 158..166
FT /note="8"
FT /evidence="ECO:0000305|PubMed:2416057"
FT REPEAT 167..175
FT /note="9"
FT /evidence="ECO:0000305|PubMed:2416057"
FT REPEAT 176..184
FT /note="10"
FT /evidence="ECO:0000305|PubMed:2416057"
FT REPEAT 185..193
FT /note="11"
FT /evidence="ECO:0000305|PubMed:2416057"
FT REPEAT 194..202
FT /note="12"
FT /evidence="ECO:0000305|PubMed:2416057"
FT REPEAT 203..211
FT /note="13"
FT /evidence="ECO:0000305|PubMed:2416057"
FT REPEAT 212..220
FT /note="14"
FT /evidence="ECO:0000305|PubMed:2416057"
FT REPEAT 221..229
FT /note="15"
FT /evidence="ECO:0000305|PubMed:2416057"
FT REPEAT 230..238
FT /note="16"
FT /evidence="ECO:0000305|PubMed:2416057"
FT REPEAT 239..247
FT /note="17"
FT /evidence="ECO:0000305|PubMed:2416057"
FT REPEAT 248..256
FT /note="18"
FT /evidence="ECO:0000305|PubMed:2416057"
FT REPEAT 257..265
FT /note="19"
FT /evidence="ECO:0000305|PubMed:2416057"
FT REPEAT 266..274
FT /note="20"
FT /evidence="ECO:0000305|PubMed:2416057"
FT DOMAIN 303..355
FT /note="TSP type-1"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00210"
FT REGION 51..294
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 80..88
FT /note="Required for the binding to heparan sulfate
FT proteoglycans (HSPGs) on the surface of host hepatocytes"
FT /evidence="ECO:0000250|UniProtKB:Q7K740"
FT REGION 91..95
FT /note="Region I; contains the proteolytic cleavage site"
FT /evidence="ECO:0000250|UniProtKB:P23093"
FT REGION 95..274
FT /note="20 X 9 AA tandem repeats of [PA]-G-D-R-A-[DA]-G-Q-
FT [PA]"
FT /evidence="ECO:0000305|PubMed:2416057"
FT COMPBIAS 62..104
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 278..292
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT LIPID 354
FT /note="GPI-anchor amidated cysteine"
FT /evidence="ECO:0000255"
FT CARBOHYD 318
FT /note="O-linked (Fuc) threonine"
FT /evidence="ECO:0000250|UniProtKB:P19597"
FT DISULFID 315..349
FT /evidence="ECO:0000250|UniProtKB:Q7K740"
FT DISULFID 319..354
FT /evidence="ECO:0000250|UniProtKB:Q7K740"
SQ SEQUENCE 377 AA; 37875 MW; BD7D54C3947869FD CRC64;
MKNFILLAVS SILLVDLFPT HCGHNVDLSK AINLNGVNFN NVDASSLGAA HVGQSASRGR
GLGENPDDEE GDAKKKKDGK KAEPKNPREN KLKQPGDRAD GQPAGDRADG QPAGDRADGQ
PAGDRADGQP AGDRAAGQPA GDRADGQPAG DRADGQPAGD RADGQPAGDR ADGQPAGDRA
AGQPAGDRAA GQPAGDRADG QPAGDRAAGQ PAGDRADGQP AGDRAAGQPA GDRADGQPAG
DRAAGQPAGD RAAGQPAGDR AAGQAAGDRA AGQAAGGNAG GQGQNNEGAN APNEKSVKEY
LDKVRATVGT EWTPCSVTCG VGVRVRRRVN AANKKPEDLT LNDLETDVCT MDKCAGIFNV
VSNSLGLVIL LVLALFN