SHU7_ECOLX
ID SHU7_ECOLX Reviewed; 456 AA.
AC P09751; Q9JMU2;
DT 01-JUL-1989, integrated into UniProtKB/Swiss-Prot.
DT 01-JUL-1989, sequence version 1.
DT 25-MAY-2022, entry version 49.
DE RecName: Full=Shufflon protein D';
OS Escherichia coli.
OG Plasmid IncI1 R64.
OC Bacteria; Proteobacteria; Gammaproteobacteria; Enterobacterales;
OC Enterobacteriaceae; Escherichia.
OX NCBI_TaxID=562;
RN [1]
RP NUCLEOTIDE SEQUENCE [GENOMIC DNA].
RX PubMed=3029698; DOI=10.1093/nar/15.3.1165;
RA Komano T., Kubo A., Nisioka T.;
RT "Shufflon: multi-inversion of four contiguous DNA segments of plasmid R64
RT creates seven different open reading frames.";
RL Nucleic Acids Res. 15:1165-1172(1987).
CC -!- MISCELLANEOUS: This protein is expressed by a shufflon (= clustered
CC inversion region that works as a biological switch). The ORFs of this
CC region share a constant N-terminus, while the C-terminus is variable.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AB027308; BAA78902.1; -; Genomic_DNA.
DR PIR; G26421; G26421.
DR RefSeq; WP_001393542.1; NZ_VUEG01000022.1.
DR AlphaFoldDB; P09751; -.
DR GeneID; 60904393; -.
DR InterPro; IPR029017; Enolase-like_N.
DR InterPro; IPR007001; Shufflon_N.
DR Pfam; PF04917; Shufflon_N; 1.
DR SUPFAM; SSF54826; SSF54826; 1.
PE 4: Predicted;
KW Plasmid.
FT CHAIN 1..456
FT /note="Shufflon protein D'"
FT /id="PRO_0000097748"
FT REGION 1..361
FT /note="Constant region"
FT REGION 362..456
FT /note="Variable region"
SQ SEQUENCE 456 AA; 48620 MW; 24A8E52D0EEC5FBE CRC64;
MKKYDRGWAS LETGAALLIV MLLIAWGAGI WQDYIQTKGW QTEARLVSNW TSAARSYIGK
NYTTLQGSST TTTPAVITTT MLKNTGFLSS GFTETNSEGQ RLQAYVVRNA QNPELLQAMV
VSSGGTPYPV KALIQMAKDI TTGLGGYIQD GKTATGALRS WSVALSNYGA KSGNGHIAVL
LSTDELSGAA EDTDRLYRFQ VNGRPDLNKM HTAIDMGSNN LNNVGAVNAQ TGNFSGNVNG
VNGTFSGQVK GNSGNFDVNV TAGGDIRSNN GWLITRNSKG WLNETHGGGF YMSDGSWVRS
VNNKGIYTGG QVKGGTVRAD GRLYTGEYLQ LERTAVAGAS CSPNGLVGRD NTGAILSCQS
GTWRKSNSGS TVITGRIANG QQIPLPTGFS ASQCSWSVSN AENPQGWKPN YFAGSVATYD
ANRIVKCGFY DEYNFHKGTF RADLTGKCSY VVACQN
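
The constant/variable split recorded in the FT REGION lines of this entry can be recovered programmatically. The following is a minimal sketch, not an official UniProt parser: it assumes the single-space line layout shown in this record, and the names FT_LINES, parse_regions, region_length and slice_region are hypothetical helpers introduced only for illustration. Coordinates are treated as 1-based and inclusive, as in the feature table.

```python
import re

# Excerpt of the feature table, copied from the FT REGION lines of this entry.
FT_LINES = """\
FT REGION 1..361
FT /note="Constant region"
FT REGION 362..456
FT /note="Variable region"
"""


def parse_regions(text):
    """Return (note, start, end) tuples for each FT REGION feature.

    Assumes every REGION line is followed by its /note qualifier line.
    """
    regions = []
    current = None
    for line in text.splitlines():
        m = re.match(r'FT\s+REGION\s+(\d+)\.\.(\d+)', line)
        if m:
            current = (int(m.group(1)), int(m.group(2)))
            continue
        m = re.match(r'FT\s+/note="(.*)"', line)
        if m and current is not None:
            regions.append((m.group(1), current[0], current[1]))
            current = None
    return regions


def region_length(start, end):
    """Length of a 1-based, inclusive interval."""
    return end - start + 1


def slice_region(sequence, start, end):
    """Extract a 1-based, inclusive region from a sequence string
    (whitespace from the SQ block layout is removed first)."""
    residues = "".join(sequence.split())
    return residues[start - 1:end]


regions = parse_regions(FT_LINES)
for note, start, end in regions:
    print(f"{note}: {start}..{end} ({region_length(start, end)} aa)")
```

With the two regions above, the lengths are 361 and 95 residues, which together account for the full 456 AA chain. To obtain the variable C-terminal segment, slice_region can be applied to the residues of the SQ block with start=362 and end=456.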