X core promoter element gene transcriptions

Jump to navigation Jump to search

Editor-In-Chief: Henry A. Hoff

File:2013 E3 Snail USA Photo Op.jpg
The image shows a group of people gathered to promote a computer game. Credit: - EMR -.

The core promoter element X core promoter element 1 (XCPE1) directs activator-, mediator-, protein-dependent but TFIID-independent RNA polymerase II transcription from TATA box-less promoters.[1]

This promoter element appears to be exclusively human such as the group in the image at the right.

Consensus sequences

"[T]he X gene core promoter element 1 ... is located between nucleotides -8 and +2 relative to the transcriptional start site (+1) and has a consensus sequence of G/A/T-G/C-G-T/C-G-G-G/A-A-G/C+1-A/C."[1]

Gene expressions

Although it is harder to regulate the transcription of genes with multiple transcription start sites, "variations in the expression of a constitutive gene would be minimized by the use of multiple start sites."[2]

Human genes

"XCPE1 is ... found in the core promoter regions of about 1% of human genes, particularly in poorly characterized TATA-less genes."[1]

Gene transcriptions

"From a teleological standpoint, this arrangement [of focused promoters] is consistent with the notion that it would be easier to regulate the transcription of a gene with a single transcription start site than one with multiple start sites."[2]

Focused promoters

"In focused transcription, there is either a single major transcription start site or several start sites within a narrow region of several nucleotides. Focused transcription is the predominant mode of transcription in simpler organisms."[2]

"Focused transcription initiation occurs in all organisms, and appears to be the predominant or exclusive mode of transcription in simpler organisms."[2]

"In vertebrates, focused transcription tends to be associated with regulated promoters".[2]

"The analysis of focused core promoters has led to the discovery of sequence motifs such as the TATA box, BREu (upstream TFIIBrecognition element), Inr (initiator), MTE (motif ten element), DPE (downstream promoter element), DCE (downstream core element), and XCPE1 (Xcore promoter element 1) [...]."[2]

Dispersed promoters

"In dispersed transcription, there are several weak transcription start sites over a broad region of about 50 to 100 nucleotides. Dispersed transcription is the most common mode of transcription in vertebrates. For instance, dispersed transcription is observed in about two-thirds of human genes."[2]

In vertebrates, "dispersed transcription is typically observed in constitutive promoters in CpG islands."[2]

Core promoters

"Focused transcription typically initiates within the Inr, and the A nucleotide in the Inr consensus is usually designed as the “+ 1” position, whether or not transcription actually initiates at that particular nucleotide. This convention is useful because other core promoter motifs, such as the MTE and DPE, function with the Inr in a manner that exhibits a strict spacing dependence with the Inr consensus sequence (and hence, the A + 1 nucleotide) rather than the actual transcription start site (Burke and Kadonaga, 1997, Kutach and Kadonaga, 2000 and Lim et al., 2004)."[2]

"NC2 (negative cofactor 2; also known as Dr1-Drap1) [...] was identified as repressor of TATA-dependent transcription [...]."[2]

"Several core promoter elements have been previously identified in eukaryotes, but those cannot account for transcription from most RNA polymerase II-transcribed genes."[1]

XCPE1 samplings

For the Basic programs (starting with SuccessablesXCPE1.bas) written to compare nucleotide sequences with the sequences on either the template strand (-), or coding strand (+), of the DNA, in the negative direction (-), or the positive direction (+), the programs are, are looking for, and found:

  1. negative strand in the negative direction (from ZSCAN22 to A1BG) is SuccessablesXCPE1--.bas, looking for 3'-G/A/T-G/C-G-T/C-G-G-G/A-A-G/C-A/C-5', 1, 3'-TGGTGGGACC-5', 3744,
  2. negative strand in the positive direction (from ZNF497 to A1BG) is SuccessablesXCPE1-+.bas, looking for 3'-G/A/T-G/C-G-T/C-G-G-G/A-A-G/C-A/C-5', 0,
  3. positive strand in the negative direction is SuccessablesXCPE1+-.bas, looking for 3'-G/A/T-G/C-G-T/C-G-G-G/A-A-G/C-A/C-5', 0,
  4. positive strand in the positive direction is SuccessablesXCPE1++.bas, looking for 5'-(A/G/T)(C/G)G(C/T)GG(A/G)A(C/G)(A/C)-3', 1, 5'-GGGTGGAAGC-3' at 1406,
  5. complement, negative strand, negative direction is SuccessablesXCPE1c--.bas, looking for 3'-C/A/T-G/C-C-A/G-C-C-C/T-T-G/C-G/T-5', 0,
  6. complement, negative strand, positive direction is SuccessablesXCPE1c-+.bas, looking for 3'-C/A/T-G/C-C-A/G-C-C-C/T-T-G/C-G/T-5', 1, 5'-CCCACCTTCG-3' at 1406,
  7. complement, positive strand, negative direction is SuccessablesXCPE1c+-.bas, looking for 3'-C/A/T-G/C-C-A/G-C-C-C/T-T-G/C-G/T-5', 1, 3'-ACCACCCTGG-5', 3744,
  8. complement, positive strand, positive direction is SuccessablesXCPE1c++.bas, looking for 3'-C/A/T-G/C-C-A/G-C-C-C/T-T-G/C-G/T-5', 0,
  9. inverse complement, negative strand, negative direction is SuccessablesXCPE1ci--.bas, looking for 3'-G/T-G/C-T-C/T-C-C-A/G-C-G/C-C/A/T-5', 0,
  10. inverse complement, negative strand, positive direction is SuccessablesXCPE1ci-+.bas, looking for 3'-G/T-G/C-T-C/T-C-C-A/G-C-G/C-C/A/T-5', 0,
  11. inverse complement, positive strand, negative direction is SuccessablesXCPE1ci+-.bas, looking for 3'-G/T-G/C-T-C/T-C-C-A/G-C-G/C-C/A/T-5', 1, 5'-GCTCCCACCT-3' at 392, and complement.
  12. inverse complement, positive strand, positive direction is SuccessablesXCPE1ci++.bas, looking for 3'-G/T-G/C-T-C/T-C-C-A/G-C-G/C-C/A/T-5', 2, 5'-GGTCCCACCC-3' at 3541, 5'-TCTCCCACCT-3' at 185, and complements.
  13. inverse, negative strand, negative direction, is SuccessablesXCPE1i--.bas, looking for 3'-A/C-G/C-A-G/A-G-G-T/C-G-G/C-G/A/T-5', 1, 3'-CGAGGGTGGA-5', 392,
  14. inverse, negative strand, positive direction, is SuccessablesXCPE1i-+.bas, looking for 3'-A/C-G/C-A-G/A-G-G-T/C-G-G/C-G/A/T-5', 2, 5'-CCAGGGTGGG-3' at 3541, 5'-AGAGGGTGGA-3' at 185,
  15. inverse, positive strand, negative direction, is SuccessablesXCPE1i+-.bas, looking for 3'-A/C-G/C-A-G/A-G-G-T/C-G-G/C-G/A/T-5', 0,
  16. inverse, positive strand, positive direction, is SuccessablesXCPE1i++.bas, looking for 3'-A/C-G/C-A-G/A-G-G-T/C-G-G/C-G/A/T-5', 0.

XCPE1 (4560-2846) UTRs

  1. Negative strand, negative direction: TGGTGGGACC at 3744.

XCPE1 negative direction (2596-1) distal promoters

  1. Positive strand, negative direction: GCTCCCACCT at 392.

XCPE1 positive direction (4050-1) distal promoters

  1. Positive strand, positive direction: GGGTGGAAGC at 1406.
  2. Positive strand, positive direction: GGTCCCACCC at 3541, TCTCCCACCT at 185.

XCPE1 random dataset samplings

  1. XCPE1r0: 0.
  2. XCPE1r1: 1, GCGCGGGAGA at 2795.
  3. XCPE1r2: 1, GGGTGGAAGC at 4013.
  4. XCPE1r3: 1, TGGCGGGAGA at 2915.
  5. XCPE1r4: 1, GGGTGGGACC at 4133.
  6. XCPE1r5: 0.
  7. XCPE1r6: 1, TGGCGGAAGC at 2580.
  8. XCPE1r7: 1, AGGTGGGAGC at 4156.
  9. XCPE1r8: 3, GGGCGGGAGC at 4125, TGGCGGGACA at 4094, GGGTGGGAGA at 2645.
  10. XCPE1r9: 1, GGGTGGAAGA at 3486.
  11. XCPE1r0ci: 0.
  12. XCPE1r1ci: 2, TGTTCCGCCT at 1646, GCTCCCACGA at 1228.
  13. XCPE1r2ci: 1, GGTTCCGCCC at 640.
  14. XCPE1r3ci: 0.
  15. XCPE1r4ci: 0.
  16. XCPE1r5ci: 0.
  17. XCPE1r6ci: 1, GCTTCCACCC at 2309.
  18. XCPE1r7ci: 0.
  19. XCPE1r8ci: 0.
  20. XCPE1r9ci: 0.

XCPE1r arbitrary (evens) (4560-2846) UTRs

  1. XCPE1r2: GGGTGGAAGC at 4013.
  2. XCPE1r4: GGGTGGGACC at 4133.
  3. XCPE1r8: GGGCGGGAGC at 4125, TGGCGGGACA at 4094.

XCPE1r alternate (odds) (4560-2846) UTRs

  1. XCPE1r3: TGGCGGGAGA at 2915.
  2. XCPE1r9: GGGTGGAAGA at 3486.

XCPE1r arbitrary negative direction (evens) (2811-2596) proximal promoters

  1. XCPE1r8: GGGTGGGAGA at 2645.

XCPE1r alternate negative direction (odds) (2811-2596) proximal promoters

  1. XCPE1r1: GCGCGGGAGA at 2795.

XCPE1r alternate positive direction (evens) (4265-4050) proximal promoters

  1. XCPE1r4: GGGTGGGACC at 4133.
  2. XCPE1r8: GGGCGGGAGC at 4125, TGGCGGGACA at 4094.

XCPE1r arbitrary negative direction (evens) (2596-1) distal promoters

  1. XCPE1r6: TGGCGGAAGC at 2580.
  2. XCPE1r2ci: GGTTCCGCCC at 640.
  3. XCPE1r6ci: GCTTCCACCC at 2309.

XCPE1r alternate negative direction (odds) (2596-1) distal promoters

  1. XCPE1r1ci: TGTTCCGCCT at 1646, GCTCCCACGA at 1228.

XCPE1r arbitrary positive direction (odds) (4050-1) distal promoters

  1. XCPE1r1: GCGCGGGAGA at 2795.
  2. XCPE1r3: TGGCGGGAGA at 2915.
  3. XCPE1r9: GGGTGGAAGA at 3486.
  4. XCPE1r1ci: TGTTCCGCCT at 1646, GCTCCCACGA at 1228.

XCPE1r alternate positive direction (evens) (4050-1) distal promoters

  1. XCPE1r2: GGGTGGAAGC at 4013.
  2. XCPE1r6: TGGCGGAAGC at 2580.
  3. XCPE1r8: GGGTGGGAGA at 2645.
  4. XCPE1r2ci: GGTTCCGCCC at 640.
  5. XCPE1r6ci: GCTTCCACCC at 2309.

XCPE1 analysis and results

"[T]he X gene core promoter element 1 ... has a consensus sequence of G/A/T-G/C-G-T/C-G-G-G/A-A-G/C+1-A/C."[1]

Reals or randoms Promoters direction Numbers Strands Occurrences Averages (± 0.1)
Reals UTR negative 1 2 0.5 0.5 ± 0.5 (--1,+-0)
Randoms UTR arbitrary negative 4 10 0.4 0.3
Randoms UTR alternate negative 2 10 0.2 0.3
Reals Core negative 0 2 0 0
Randoms Core arbitrary negative 0 10 0 0
Randoms Core alternate negative 0 10 0 0
Reals Core positive 0 2 0 0
Randoms Core arbitrary positive 0 10 0 0
Randoms Core alternate positive 0 10 0 0
Reals Proximal negative 0 2 0 0
Randoms Proximal arbitrary negative 1 10 0.1 0.1
Randoms Proximal alternate negative 1 10 0.1 0.1
Reals Proximal positive 0 2 0 0
Randoms Proximal arbitrary positive 0 10 0 0.15
Randoms Proximal alternate positive 3 10 0.3 0.15
Reals Distal negative 1 2 0.5 0.5 ± 0.5 (--0,+-1)
Randoms Distal arbitrary negative 3 10 0.3 0.25
Randoms Distal alternate negative 2 10 0.2 0.25
Reals Distal positive 3 2 1.5 1.5 ± 1.5 (-+0,++3)
Randoms Distal arbitrary positive 5 10 0.5 0.5
Randoms Distal alternate positive 5 10 0.5 0.5

Comparison:

The occurrences of real XCPE1 UTRs and distals are greater than the randoms. This suggests that the real XCPE1s are likely active or activable.

Hypotheses

  1. XCPE1 does not participate in the transcription of A1BG.

See also

References

  1. 1.0 1.1 1.2 1.3 1.4 Yumiko Tokusumi, Ying Ma, Xianzhou Song, Raymond H. Jacobson, and Shinako Takada (March 2007). "The New Core Promoter Element XCPE1 (X Core Promoter Element 1) Directs Activator-, Mediator-, and TATA-Binding Protein-Dependent but TFIID-Independent RNA Polymerase II Transcription from TATA-Less Promoters". Molecular and Cellular Biology. 27 (5): 1844–58. doi:10.1128/MCB.01363-06. PMID 17210644. Retrieved 2013-02-09.
  2. 2.0 2.1 2.2 2.3 2.4 2.5 2.6 2.7 2.8 2.9 Tamar Juven-Gershon and James T. Kadonaga (15 March 2010). "Regulation of gene expression via the core promoter and the basal transcriptional machinery". Developmental Biology. 339 (2): 225–9. doi:10.1016/j.ydbio.2009.08.009. Retrieved 2016-01-16.

Further reading

External links