Structure and sequence of the human homeobox gene HOX7

Genomics. 1991 Nov;11(3):670-8. doi: 10.1016/0888-7543(91)90074-o.

Abstract

A cosmid containing the human sequence HOX7, homologous to the murine Hox-7 gene, was isolated from a genomic library, and the positions of the coding sequences were determined by hybridization. DNA sequence analysis demonstrated two exons that code for a homeodomain-containing protein of 297 amino acids. The open reading frame is interrupted by a single intron of approximately 1.6 kb, the splice donor and acceptor sites of which conform to known consensus sequences. The human HOX7 coding sequence has a very high degree of identity with the murine Hox-7 cDNA. Within the homeobox, the two sequences share 94% identity at the DNA level, all substitutions being silent. This high level of sequence similarity is not confined to the homeodomain; overall, the human and murine HOX7 gene products show 80% identity at the amino acid level. Both the 5' and 3' untranslated regions also show significant similarity to the murine gene, with 79% and 70% sequence identity, respectively. The sequence upstream of the coding sequence of exon 1 contains a GC-rich putative promoter region. There is no TATA box, but a CCAAT box and numerous GC boxes are present. The region encompassing the promoter, exon 1, and the 5' region of exon 2 has a higher than expected frequency of CpG dinucleotides; numerous sites for rare-cutter restriction enzymes are present, a characteristic of HTF islands.
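The abstract does not state how the "expected" CpG frequency was calculated. As an illustration only, the sketch below uses a commonly applied observed/expected ratio, (number of CpG × sequence length) / (number of C × number of G), together with GC content. The function names and the example sequences are hypothetical and are not taken from the paper or from the deposited GenBank records.

```python
def gc_content(seq: str) -> float:
    """Fraction of bases in the sequence that are G or C."""
    seq = seq.upper()
    if not seq:
        return 0.0
    return (seq.count("G") + seq.count("C")) / len(seq)


def cpg_observed_expected(seq: str) -> float:
    """Observed/expected CpG ratio: (CpG count * length) / (C count * G count).

    This is an assumed metric for illustration; the paper's own method
    is not described in the abstract.
    """
    seq = seq.upper()
    n = len(seq)
    c = seq.count("C")
    g = seq.count("G")
    if n == 0 or c == 0 or g == 0:
        return 0.0
    # "CG" cannot overlap with itself, so str.count gives the exact count.
    cpg = seq.count("CG")
    return cpg * n / (c * g)


if __name__ == "__main__":
    # Made-up toy sequences, not real HOX7 data.
    for name, s in [("cpg_rich", "CGCGCGCG"), ("balanced", "CCGG"), ("no_gc", "AAAA")]:
        print(name, round(gc_content(s), 2), round(cpg_observed_expected(s), 2))
```

A ratio near or above 1 across a window means CpG is not depleted relative to base composition, which is the kind of signal associated with HTF (CpG) islands; any threshold applied to real sequence would be a separate analysis choice.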

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Amino Acid Sequence
  • Animals
  • Base Sequence
  • Cloning, Molecular
  • Consensus Sequence
  • Cosmids / genetics
  • DNA-Binding Proteins / genetics*
  • Exons / genetics
  • Genes, Homeobox / genetics*
  • Homeodomain Proteins*
  • Humans
  • Introns / genetics
  • MSX1 Transcription Factor
  • Mice
  • Molecular Sequence Data
  • Multigene Family / genetics*
  • Open Reading Frames / genetics
  • Repetitive Sequences, Nucleic Acid
  • Restriction Mapping
  • Sequence Homology, Nucleic Acid

Substances

  • DNA-Binding Proteins
  • Homeodomain Proteins
  • MSX1 Transcription Factor
  • MSX1 protein, human

Associated data

  • GENBANK/M60842
  • GENBANK/M60843
  • GENBANK/M60844
  • GENBANK/M60845
  • GENBANK/M60846
  • GENBANK/M60847
  • GENBANK/M76731
  • GENBANK/M76732
  • GENBANK/S78711
  • GENBANK/S78713