ID INSG_ECOLI Reviewed; 442 AA.
AC P03835; Q2M634;
DT 21-JUL-1986, integrated into UniProtKB/Swiss-Prot.
DT 21-JUL-1986, sequence version 1.
DT 03-AUG-2022, entry version 125.
DE RecName: Full=Transposase InsG for insertion sequence element IS4;
GN Name=insG; OrderedLocusNames=b4278, JW5767;
OS Escherichia coli (strain K12).
OC Bacteria; Proteobacteria; Gammaproteobacteria; Enterobacterales;
OC Enterobacteriaceae; Escherichia.
OX NCBI_TaxID=83333;
RN [1]
RP NUCLEOTIDE SEQUENCE [GENOMIC DNA].
RC STRAIN=K12;
RX PubMed=6268937; DOI=10.1007/bf00268423;
RA Klaer R., Kuhn S., Tillmann E., Fritz H.-J., Starlinger P.;
RT "The sequence of IS4.";
RL Mol. Gen. Genet. 181:169-175(1981).
RN [2]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=K12 / MG1655 / ATCC 47076;
RX PubMed=7610040; DOI=10.1093/nar/23.12.2105;
RA Burland V.D., Plunkett G. III, Sofia H.J., Daniels D.L., Blattner F.R.;
RT "Analysis of the Escherichia coli genome VI: DNA sequence of the region
RT from 92.8 through 100 minutes.";
RL Nucleic Acids Res. 23:2105-2119(1995).
RN [3]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=K12 / MG1655 / ATCC 47076;
RX PubMed=9278503; DOI=10.1126/science.277.5331.1453;
RA Blattner F.R., Plunkett G. III, Bloch C.A., Perna N.T., Burland V.,
RA Riley M., Collado-Vides J., Glasner J.D., Rode C.K., Mayhew G.F.,
RA Gregor J., Davis N.W., Kirkpatrick H.A., Goeden M.A., Rose D.J., Mau B.,
RA Shao Y.;
RT "The complete genome sequence of Escherichia coli K-12.";
RL Science 277:1453-1462(1997).
RN [4]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=K12 / W3110 / ATCC 27325 / DSM 5911;
RX PubMed=16738553; DOI=10.1038/msb4100049;
RA Hayashi K., Morooka N., Yamamoto Y., Fujita K., Isono K., Choi S.,
RA Ohtsubo E., Baba T., Wanner B.L., Mori H., Horiuchi T.;
RT "Highly accurate genome sequences of Escherichia coli K-12 strains MG1655
RT and W3110.";
RL Mol. Syst. Biol. 2:E1-E5(2006).
CC -!- FUNCTION: Involved in the transposition of the insertion sequence IS4.
CC -!- SIMILARITY: Belongs to the transposase 11 family. {ECO:0000305}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; J01733; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; U14003; AAA97174.1; -; Genomic_DNA.
DR EMBL; U00096; AAC77234.1; -; Genomic_DNA.
DR EMBL; AP009048; BAE78272.1; -; Genomic_DNA.
DR PIR; A04463; IEEC41.
DR RefSeq; NP_418698.1; NC_000913.3.
DR RefSeq; WP_000547191.1; NZ_CP047127.1.
DR AlphaFoldDB; P03835; -.
DR STRING; 511145.b4278; -.
DR PaxDb; P03835; -.
DR PRIDE; P03835; -.
DR EnsemblBacteria; AAC77234; AAC77234; b4278.
DR EnsemblBacteria; BAE78272; BAE78272; BAE78272.
DR GeneID; 948805; -.
DR KEGG; ecj:JW5767; -.
DR KEGG; eco:b4278; -.
DR PATRIC; fig|1411691.4.peg.2425; -.
DR EchoBASE; EB4735; -.
DR eggNOG; COG3385; Bacteria.
DR HOGENOM; CLU_028400_1_1_6; -.
DR InParanoid; P03835; -.
DR OMA; PAEQVVW; -.
DR PhylomeDB; P03835; -.
DR BioCyc; EcoCyc:G7900-MON; -.
DR PRO; PR:P03835; -.
DR Proteomes; UP000000318; Chromosome.
DR Proteomes; UP000000625; Chromosome.
DR GO; GO:0003677; F:DNA binding; IEA:UniProtKB-KW.
DR GO; GO:0004803; F:transposase activity; IEA:InterPro.
DR GO; GO:0006313; P:transposition, DNA-mediated; IEA:InterPro.
DR InterPro; IPR012337; RNaseH-like_sf.
DR InterPro; IPR002559; Transposase_11.
DR InterPro; IPR024473; Transposases_IS4_N.
DR Pfam; PF01609; DDE_Tnp_1; 1.
DR Pfam; PF13006; Nterm_IS4; 1.
DR SUPFAM; SSF53098; SSF53098; 1.
PE 3: Inferred from homology;
KW DNA recombination; DNA-binding; Reference proteome; Transposable element;
KW Transposition.
FT CHAIN 1..442
FT /note="Transposase InsG for insertion sequence element IS4"
FT /id="PRO_0000173291"
SQ SEQUENCE 442 AA; 50386 MW; 0B9CAAA0FFAC724E CRC64;
MHIGQALDLV SRYDSLRNPL TSLGDYLDPE LISRCLAESG TVTLRKRRLP LEMMVWCIVG
MALERKEPLH QIVNRLDIML PGNRPFVAPS AVIQARQRLG SEAVRRVFTK TAQLWHNATP
HPHWCGLTLL AIDGVFWRTP DTPENDAAFP RQTHAGNPAL YPQVKMVCQM ELTSHLLTAA
AFGTMKNSEN ELAEQLIEQT GDNTLTLMDK GYYSLGLLNA WSLAGEHRHW MIPLRKGAQY
EEIRKLGKGD HLVKLKTSPQ ARKKWPGLGN EVTARLLTVT RKGKVCHLLT SMTDAMRFPG
GEMGDLYSHR WEIELGYREI KQTMQRSRLT LRSKKPELVE QELWGVLLAY NLVRYQMIKM
AEHLKGYWPN QLSFSESCGM VMRMLMTLQG ASPGRIPELM RDLASMGQLV KLPTRRERAF
PRVVKERPWK YPTAPKKSQS VA
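
A record laid out like the one above can be checked mechanically: the SQ line declares a residue count (442 AA here), and the sequence lines after it should add up to that number. The following is a minimal sketch, not an official parser. It assumes a single entry per input, a two-letter line code followed by whitespace on every annotation line, and sequence lines that follow the SQ line until an optional `//` terminator. The names `parse_flat_entry` and `declared_length`, and the toy entry `DEMO_ECOLI` with accession `P00000`, are hypothetical and used for illustration only.

```python
from collections import defaultdict


def parse_flat_entry(text):
    """Group the lines of one flat-file entry by line code.

    Returns (fields, sequence): fields maps a code such as "ID" or "DR"
    to the list of line contents; sequence is the residue string with
    the spaces between blocks removed.
    """
    fields = defaultdict(list)
    residues = []
    in_sequence = False
    for raw in text.splitlines():
        line = raw.strip()
        if not line:
            continue
        if line == "//":  # end-of-entry marker, if present
            break
        if in_sequence:
            # Sequence lines carry no line code, only residue blocks.
            residues.append(line.replace(" ", ""))
            continue
        code, _, rest = line.partition(" ")
        fields[code].append(rest.strip())
        if code == "SQ":
            in_sequence = True
    return dict(fields), "".join(residues)


def declared_length(fields):
    """Read the residue count from the SQ line ("SEQUENCE 442 AA; ...")."""
    return int(fields["SQ"][0].split()[1])


# Toy entry (made up for this example, not a real record).
sample = """ID DEMO_ECOLI Reviewed; 12 AA.
AC P00000;
SQ SEQUENCE 12 AA; 0 MW; 0000000000000000 CRC64;
MHIGQALDLV SR
//"""

fields, seq = parse_flat_entry(sample)
assert len(seq) == declared_length(fields)
```

Applied to the full entry above, the same comparison should hold for the declared 442 residues. The sketch does not verify the molecular weight or the CRC64 checksum; an established parser is the better choice for anything beyond a quick consistency check.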