HM37_CAEEL
ID HM37_CAEEL Reviewed; 278 AA.
AC Q93356; P90773;
DT 01-NOV-1997, integrated into UniProtKB/Swiss-Prot.
DT 01-NOV-1997, sequence version 2.
DT 03-AUG-2022, entry version 151.
DE RecName: Full=Homeobox protein ceh-37;
GN Name=ceh-37; ORFNames=C37E2.5;
OS Caenorhabditis elegans.
OC Eukaryota; Metazoa; Ecdysozoa; Nematoda; Chromadorea; Rhabditida;
OC Rhabditina; Rhabditomorpha; Rhabditoidea; Rhabditidae; Peloderinae;
OC Caenorhabditis.
OX NCBI_TaxID=6239;
RN [1]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Bristol N2;
RX PubMed=9851916; DOI=10.1126/science.282.5396.2012;
RG The C. elegans sequencing consortium;
RT "Genome sequence of the nematode C. elegans: a platform for investigating
RT biology.";
RL Science 282:2012-2018(1998).
RN [2]
RP FUNCTION, TISSUE SPECIFICITY, AND DEVELOPMENTAL STAGE.
RX PubMed=14536063; DOI=10.1016/s1534-5807(03)00293-4;
RA Lanjuin A., VanHoven M.K., Bargmann C.I., Thompson J.K., Sengupta P.;
RT "Otx/otd homeobox genes specify distinct sensory neuron identities in C.
RT elegans.";
RL Dev. Cell 5:621-633(2003).
RN [3]
RP FUNCTION, AND SUBCELLULAR LOCATION.
RX PubMed=12711598; DOI=10.1074/jbc.m302192200;
RA Kim S.H., Hwang S.B., Chung I.K., Lee J.;
RT "Sequence-specific binding to telomeric DNA by CEH-37, a homeodomain
RT protein in the nematode Caenorhabditis elegans.";
RL J. Biol. Chem. 278:28038-28044(2003).
CC -!- FUNCTION: Binds to the telomeric DNA sequence 5'-TTAGGC-3', requiring
CC at least 1.5 repeats for binding. Bends telomeric DNA and may provide
CC chromosome stability. Also required for development of AWB olfactory
CC neurons. {ECO:0000269|PubMed:12711598, ECO:0000269|PubMed:14536063}.
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000255|PROSITE-ProRule:PRU00108,
CC ECO:0000269|PubMed:12711598}. Chromosome, telomere
CC {ECO:0000269|PubMed:12711598}. Note=Binds to telomeres.
CC -!- TISSUE SPECIFICITY: Broadly expressed during early embryonic
CC development. During later embryonic stages, expression is detected in
CC AWB olfactory sensory neurons but is absent from there by early L1
CC larval stages. Expression in non-neuronal cells including the excretory
CC cell and intestine is maintained through adult stages.
CC {ECO:0000269|PubMed:14536063}.
CC -!- DEVELOPMENTAL STAGE: Expression starts during embryogenesis and
CC continues into adulthood. {ECO:0000269|PubMed:14536063}.
CC -!- DOMAIN: The N-terminal and homeobox regions are together sufficient to
CC bind telomeric DNA.
CC -!- DOMAIN: The C-terminal region is required for bending of telomeric DNA.
CC -!- SIMILARITY: Belongs to the paired homeobox family. {ECO:0000305}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; Z81046; CAB02825.1; -; Genomic_DNA.
DR PIR; T19813; T19813.
DR RefSeq; NP_510366.1; NM_077965.3.
DR PDB; 2MGQ; NMR; -; A=38-104.
DR PDBsum; 2MGQ; -.
DR AlphaFoldDB; Q93356; -.
DR BMRB; Q93356; -.
DR SMR; Q93356; -.
DR BioGRID; 46429; 1.
DR IntAct; Q93356; 1.
DR STRING; 6239.C37E2.5; -.
DR PaxDb; Q93356; -.
DR EnsemblMetazoa; C37E2.5a.1; C37E2.5a.1; WBGene00000458.
DR GeneID; 181530; -.
DR UCSC; C37E2.5; c. elegans.
DR CTD; 181530; -.
DR WormBase; C37E2.5a; CE08624; WBGene00000458; ceh-37.
DR eggNOG; KOG2251; Eukaryota.
DR HOGENOM; CLU_984296_0_0_1; -.
DR InParanoid; Q93356; -.
DR OMA; PYNAAMI; -.
DR OrthoDB; 1014909at2759; -.
DR PhylomeDB; Q93356; -.
DR PRO; PR:Q93356; -.
DR Proteomes; UP000001940; Chromosome X.
DR Bgee; WBGene00000458; Expressed in pharyngeal muscle cell (C elegans) and 3 other tissues.
DR ExpressionAtlas; Q93356; baseline and differential.
DR GO; GO:0000781; C:chromosome, telomeric region; IDA:UniProtKB.
DR GO; GO:0005634; C:nucleus; IBA:GO_Central.
DR GO; GO:0008301; F:DNA binding, bending; IDA:UniProtKB.
DR GO; GO:0000981; F:DNA-binding transcription factor activity, RNA polymerase II-specific; IBA:GO_Central.
DR GO; GO:0003691; F:double-stranded telomeric DNA binding; IDA:WormBase.
DR GO; GO:0000978; F:RNA polymerase II cis-regulatory region sequence-specific DNA binding; IBA:GO_Central.
DR GO; GO:0042162; F:telomeric DNA binding; IDA:UniProtKB.
DR GO; GO:0048665; P:neuron fate specification; IMP:WormBase.
DR GO; GO:0042048; P:olfactory behavior; IMP:UniProtKB.
DR GO; GO:0006357; P:regulation of transcription by RNA polymerase II; IMP:WormBase.
DR GO; GO:0006355; P:regulation of transcription, DNA-templated; IMP:UniProtKB.
DR GO; GO:0016233; P:telomere capping; NAS:UniProtKB.
DR GO; GO:0000723; P:telomere maintenance; TAS:WormBase.
DR GO; GO:0032200; P:telomere organization; NAS:UniProtKB.
DR CDD; cd00086; homeodomain; 1.
DR InterPro; IPR009057; Homeobox-like_sf.
DR InterPro; IPR017970; Homeobox_CS.
DR InterPro; IPR001356; Homeobox_dom.
DR Pfam; PF00046; Homeodomain; 1.
DR SMART; SM00389; HOX; 1.
DR SUPFAM; SSF46689; SSF46689; 1.
DR PROSITE; PS00027; HOMEOBOX_1; 1.
DR PROSITE; PS50071; HOMEOBOX_2; 1.
PE 1: Evidence at protein level;
KW 3D-structure; Chromosome; Developmental protein; DNA-binding; Homeobox;
KW Nucleus; Reference proteome; Telomere.
FT CHAIN 1..278
FT /note="Homeobox protein ceh-37"
FT /id="PRO_0000049002"
FT DNA_BIND 41..100
FT /note="Homeobox"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00108"
FT REGION 98..182
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 200..228
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 98..124
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 161..182
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT STRAND 40..42
FT /evidence="ECO:0007829|PDB:2MGQ"
FT STRAND 45..48
FT /evidence="ECO:0007829|PDB:2MGQ"
FT HELIX 51..62
FT /evidence="ECO:0007829|PDB:2MGQ"
FT HELIX 68..78
FT /evidence="ECO:0007829|PDB:2MGQ"
FT TURN 82..84
FT /evidence="ECO:0007829|PDB:2MGQ"
FT HELIX 85..88
FT /evidence="ECO:0007829|PDB:2MGQ"
FT HELIX 90..102
FT /evidence="ECO:0007829|PDB:2MGQ"
SQ SEQUENCE 278 AA; 30902 MW; 67DDA86B69FBE5FF CRC64;
MTSYSYFTIP STATTGFNYP VQPMTMFSGA PYNAAMIPRK NRRERTTYSR QQLEILETLF
NETQYPDVFA RERVADQIRL QESRIQVWFK NRRAKYRLQE KQKPKRSNEK SQEHKSEDQQ
QTDVLDGEPL KGGSPGYQPQ IKSELESCDG AVASGKLGTP KSISPVETTA STTSSNTSAA
ELQWNGDHKI LGFGKNETTT SAAVSPTADN ASTPSSSSSI TATSSLPTTS SSLSVYNYPA
IYPQWGLDYS TYANPQYAQF SHNPYAGTPF WYSNGNNL