CROC_DROME
ID CROC_DROME Reviewed; 508 AA.
AC P32027; Q9VP32;
DT 01-JUL-1993, integrated into UniProtKB/Swiss-Prot.
DT 01-NOV-1997, sequence version 2.
DT 03-AUG-2022, entry version 171.
DE RecName: Full=Fork head domain-containing protein crocodile;
DE AltName: Full=FKH protein FD1;
GN Name=croc; Synonyms=FD1, FD78E; ORFNames=CG5069;
OS Drosophila melanogaster (Fruit fly).
OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; Pterygota;
OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; Ephydroidea;
OC Drosophilidae; Drosophila; Sophophora.
OX NCBI_TaxID=7227;
RN [1]
RP NUCLEOTIDE SEQUENCE.
RC STRAIN=Canton-S;
RX PubMed=7489720; DOI=10.1002/j.1460-2075.1995.tb00215.x;
RA Haecker U., Kaufmann E., Hartmann C., Juergens G., Knoechel W., Jaeckle H.;
RT "The Drosophila fork head domain protein crocodile is required for the
RT establishment of head structures.";
RL EMBO J. 14:5306-5317(1995).
RN [2]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Berkeley;
RX PubMed=10731132; DOI=10.1126/science.287.5461.2185;
RA Adams M.D., Celniker S.E., Holt R.A., Evans C.A., Gocayne J.D.,
RA Amanatides P.G., Scherer S.E., Li P.W., Hoskins R.A., Galle R.F.,
RA George R.A., Lewis S.E., Richards S., Ashburner M., Henderson S.N.,
RA Sutton G.G., Wortman J.R., Yandell M.D., Zhang Q., Chen L.X., Brandon R.C.,
RA Rogers Y.-H.C., Blazej R.G., Champe M., Pfeiffer B.D., Wan K.H., Doyle C.,
RA Baxter E.G., Helt G., Nelson C.R., Miklos G.L.G., Abril J.F., Agbayani A.,
RA An H.-J., Andrews-Pfannkoch C., Baldwin D., Ballew R.M., Basu A.,
RA Baxendale J., Bayraktaroglu L., Beasley E.M., Beeson K.Y., Benos P.V.,
RA Berman B.P., Bhandari D., Bolshakov S., Borkova D., Botchan M.R., Bouck J.,
RA Brokstein P., Brottier P., Burtis K.C., Busam D.A., Butler H., Cadieu E.,
RA Center A., Chandra I., Cherry J.M., Cawley S., Dahlke C., Davenport L.B.,
RA Davies P., de Pablos B., Delcher A., Deng Z., Mays A.D., Dew I.,
RA Dietz S.M., Dodson K., Doup L.E., Downes M., Dugan-Rocha S., Dunkov B.C.,
RA Dunn P., Durbin K.J., Evangelista C.C., Ferraz C., Ferriera S.,
RA Fleischmann W., Fosler C., Gabrielian A.E., Garg N.S., Gelbart W.M.,
RA Glasser K., Glodek A., Gong F., Gorrell J.H., Gu Z., Guan P., Harris M.,
RA Harris N.L., Harvey D.A., Heiman T.J., Hernandez J.R., Houck J., Hostin D.,
RA Houston K.A., Howland T.J., Wei M.-H., Ibegwam C., Jalali M., Kalush F.,
RA Karpen G.H., Ke Z., Kennison J.A., Ketchum K.A., Kimmel B.E., Kodira C.D.,
RA Kraft C.L., Kravitz S., Kulp D., Lai Z., Lasko P., Lei Y., Levitsky A.A.,
RA Li J.H., Li Z., Liang Y., Lin X., Liu X., Mattei B., McIntosh T.C.,
RA McLeod M.P., McPherson D., Merkulov G., Milshina N.V., Mobarry C.,
RA Morris J., Moshrefi A., Mount S.M., Moy M., Murphy B., Murphy L.,
RA Muzny D.M., Nelson D.L., Nelson D.R., Nelson K.A., Nixon K., Nusskern D.R.,
RA Pacleb J.M., Palazzolo M., Pittman G.S., Pan S., Pollard J., Puri V.,
RA Reese M.G., Reinert K., Remington K., Saunders R.D.C., Scheeler F.,
RA Shen H., Shue B.C., Siden-Kiamos I., Simpson M., Skupski M.P., Smith T.J.,
RA Spier E., Spradling A.C., Stapleton M., Strong R., Sun E., Svirskas R.,
RA Tector C., Turner R., Venter E., Wang A.H., Wang X., Wang Z.-Y.,
RA Wassarman D.A., Weinstock G.M., Weissenbach J., Williams S.M., Woodage T.,
RA Worley K.C., Wu D., Yang S., Yao Q.A., Ye J., Yeh R.-F., Zaveri J.S.,
RA Zhan M., Zhang G., Zhao Q., Zheng L., Zheng X.H., Zhong F.N., Zhong W.,
RA Zhou X., Zhu S.C., Zhu X., Smith H.O., Gibbs R.A., Myers E.W., Rubin G.M.,
RA Venter J.C.;
RT "The genome sequence of Drosophila melanogaster.";
RL Science 287:2185-2195(2000).
RN [3]
RP GENOME REANNOTATION.
RC STRAIN=Berkeley;
RX PubMed=12537572; DOI=10.1186/gb-2002-3-12-research0083;
RA Misra S., Crosby M.A., Mungall C.J., Matthews B.B., Campbell K.S.,
RA Hradecky P., Huang Y., Kaminker J.S., Millburn G.H., Prochnik S.E.,
RA Smith C.D., Tupy J.L., Whitfield E.J., Bayraktaroglu L., Berman B.P.,
RA Bettencourt B.R., Celniker S.E., de Grey A.D.N.J., Drysdale R.A.,
RA Harris N.L., Richter J., Russo S., Schroeder A.J., Shu S.Q., Stapleton M.,
RA Yamada C., Ashburner M., Gelbart W.M., Rubin G.M., Lewis S.E.;
RT "Annotation of the Drosophila melanogaster euchromatic genome: a systematic
RT review.";
RL Genome Biol. 3:RESEARCH0083.1-RESEARCH0083.22(2002).
RN [4]
RP NUCLEOTIDE SEQUENCE [GENOMIC DNA] OF 55-182, FUNCTION, TISSUE SPECIFICITY,
RP AND DEVELOPMENTAL STAGE.
RX PubMed=1356269; DOI=10.1073/pnas.89.18.8754;
RA Haecker U., Grossniklaus U., Gehring W.J., Jaeckle H.;
RT "Developmentally regulated Drosophila gene family encoding the fork head
RT domain.";
RL Proc. Natl. Acad. Sci. U.S.A. 89:8754-8758(1992).
CC -!- FUNCTION: Required for the establishment of head structures. Required
CC to function as an early patterning gene in the anterior-most blastoderm
CC head segment anlage and for the establishment of a specific head
CC skeletal structure that derives from the non-adjacent intercalary
CC segment at a later stage of embryogenesis. Binds the consensus DNA
CC sequence 5'-[AG]TAAA[TC]A-3'. {ECO:0000269|PubMed:1356269}.
CC -!- SUBCELLULAR LOCATION: Nucleus.
CC -!- TISSUE SPECIFICITY: Expressed in early blastoderm embryos in anterior
CC and posterior gut precursors, and, later in a subset of cells in
CC central nervous system. {ECO:0000269|PubMed:1356269}.
CC -!- DEVELOPMENTAL STAGE: Expressed throughout embryogenesis, maximally
CC during the 5-12 hours period. {ECO:0000269|PubMed:1356269}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; S80254; AAB35643.1; -; Unassigned_DNA.
DR EMBL; AE014296; AAF51727.1; -; Genomic_DNA.
DR EMBL; M96440; AAF02177.1; -; Genomic_DNA.
DR PIR; S59870; S59870.
DR RefSeq; NP_524202.1; NM_079478.4.
DR AlphaFoldDB; P32027; -.
DR SMR; P32027; -.
DR BioGRID; 65621; 23.
DR IntAct; P32027; 22.
DR STRING; 7227.FBpp0078049; -.
DR PaxDb; P32027; -.
DR DNASU; 40374; -.
DR EnsemblMetazoa; FBtr0078395; FBpp0078049; FBgn0014143.
DR GeneID; 40374; -.
DR KEGG; dme:Dmel_CG5069; -.
DR CTD; 40374; -.
DR FlyBase; FBgn0014143; croc.
DR VEuPathDB; VectorBase:FBgn0014143; -.
DR eggNOG; KOG2294; Eukaryota.
DR HOGENOM; CLU_035722_0_1_1; -.
DR InParanoid; P32027; -.
DR OMA; FTRHYAQ; -.
DR OrthoDB; 1270467at2759; -.
DR PhylomeDB; P32027; -.
DR BioGRID-ORCS; 40374; 0 hits in 3 CRISPR screens.
DR GenomeRNAi; 40374; -.
DR PRO; PR:P32027; -.
DR Proteomes; UP000000803; Chromosome 3L.
DR Bgee; FBgn0014143; Expressed in embryonic/larval foregut (Drosophila) and 43 other tissues.
DR ExpressionAtlas; P32027; baseline and differential.
DR Genevisible; P32027; DM.
DR GO; GO:0005634; C:nucleus; IDA:FlyBase.
DR GO; GO:0003677; F:DNA binding; IDA:FlyBase.
DR GO; GO:0000981; F:DNA-binding transcription factor activity, RNA polymerase II-specific; IBA:GO_Central.
DR GO; GO:0000978; F:RNA polymerase II cis-regulatory region sequence-specific DNA binding; IBA:GO_Central.
DR GO; GO:0009653; P:anatomical structure morphogenesis; IBA:GO_Central.
DR GO; GO:0030154; P:cell differentiation; IBA:GO_Central.
DR GO; GO:0006357; P:regulation of transcription by RNA polymerase II; IBA:GO_Central.
DR GO; GO:0007380; P:specification of segmental identity, head; IMP:FlyBase.
DR CDD; cd00059; FH; 1.
DR Gene3D; 1.10.10.10; -; 1.
DR InterPro; IPR001766; Fork_head_dom.
DR InterPro; IPR018122; TF_fork_head_CS_1.
DR InterPro; IPR030456; TF_fork_head_CS_2.
DR InterPro; IPR036388; WH-like_DNA-bd_sf.
DR InterPro; IPR036390; WH_DNA-bd_sf.
DR Pfam; PF00250; Forkhead; 1.
DR PRINTS; PR00053; FORKHEAD.
DR SMART; SM00339; FH; 1.
DR SUPFAM; SSF46785; SSF46785; 1.
DR PROSITE; PS00657; FORK_HEAD_1; 1.
DR PROSITE; PS00658; FORK_HEAD_2; 1.
DR PROSITE; PS50039; FORK_HEAD_3; 1.
PE 2: Evidence at transcript level;
KW Developmental protein; DNA-binding; Nucleus; Reference proteome;
KW Transcription; Transcription regulation.
FT CHAIN 1..508
FT /note="Fork head domain-containing protein crocodile"
FT /id="PRO_0000091909"
FT DNA_BIND 69..160
FT /note="Fork-head"
FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00089"
FT REGION 319..412
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 326..343
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 362..389
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT VARIANT 122
FT /note="L -> F (in allele CROC-75-3)"
FT VARIANT 453
FT /note="A -> V (in allele CROC-75-3)"
SQ SEQUENCE 508 AA; 54517 MW; 2EFED1D8F63016D6 CRC64;
MHTLFSDQNS FTRHYAQTAA GYGSASAVAA ASSASAAAAA HYAYDQYSRY PYSASAYGLG
APHQNKEIVK PPYSYIALIA MAIQNAADKK VTLNGIYQYI MERFPYYRDN KQGWQNSIRH
NLSLNECFVK VARDDKKPGK GSYWTLDPDS YNMFDNGSFL RRRRRFKKKD VMREKEEAIK
RQAMMNEKLA EMKPLKLMTN GILEAKHMAA HAAHFKKEPL MDLGCLSGKE VSHAAMLNSC
HDSLAQMNHL AGGGVEHPGF TVDSLMNVYN PRIHHSAYPY HLNEDNLATV ASSQMHHVHH
AAAAHHAQQL QRHVAHVAHP LTPGGQGAGG QSSGHSPTTI STPHGPAHGG WYTPETPPSE
PVPHNGQQGT PTHPGHNNNN SSSVLNHNGV GNGGGGGGGG GGGSSSVLTS SPTSALGFRD
MIFEQNQSCQ LDTGSPTGSL QSASPPASAS VAAASAAAAA AVISSHHHHH HHHAALSGNL
GQLGQLSNLS HYRPHVGHYQ EYGIKYGV