Androgen response element gene transcriptions: Difference between revisions

Jump to navigation Jump to search
Line 119: Line 119:
# AnRE1r5ci: 1, TGTACC at 4395.
# AnRE1r5ci: 1, TGTACC at 4395.
# AnRE1r6ci: 1, TGTACC at 4193.
# AnRE1r6ci: 1, TGTACC at 4193.
# RDr7ci: 0.
# AnRE1r7ci: 0.
# RDr8ci: 0.
# AnRE1r8ci: 2, TGTACC at 2780, TGTACC at 455.
# RDr9ci: 0.
# AnRE1r9ci: 0.


===AnRE1r UTRs===
===AnRE1r UTRs===
Line 135: Line 135:
===AnRE1r proximal promoters===
===AnRE1r proximal promoters===
{{main|Proximal promoter gene transcriptions}}
{{main|Proximal promoter gene transcriptions}}
# AnRE1r8ci: TGTACC at 2780.
# AnRE1r9: GGTACA at 4067.
# AnRE1r9: GGTACA at 4067.


Line 142: Line 145:
# AnRE1r6: GGTACA at 1805.
# AnRE1r6: GGTACA at 1805.
# AnRE1r4ci: TGTACC at 1217.
# AnRE1r4ci: TGTACC at 1217.
# AnRE1r8ci: TGTACC at 455.





Revision as of 15:50, 12 November 2021

Associate Editor(s)-in-Chief: Henry A. Hoff

"Androgen receptors (ARs) (NR3C4; nuclear receptor subfamily 3, group C, member 4) have a crucial role in the development, function and homeostasis of PCa cells. ARs can be activated as a transcription factor via binding to androgen hormones (either testosterone or dihydrotestosterone). This receptor has two mainfunctional domains: ligand-binding and DNA-binding (DBD).8"[1]

"Androgen response elements structurally consist of a short DNA motif with base sequence specificity within the promoter upstream of the androgen-responsive genes. The HRE contains apair of conserved sequences, which are separated by a three-nucleotide spacer. This sequence is determined as 5'-GGTACAnnnTGTTCT-3'10, 11 with 5'-CGG-3' as the spacer in the androgen response element."[1]

"The androgen response element sequence, 5'-GGTACACGGTGTTCT-3', was obtained from the National Center of Biotechnology Information (NCBI)."[1]

"ARE half sites, benefit from neighboring motifs or cooperating transcription factors in regulating gene expression."[2]

"AREs are well studied but poorly defined and have been shown to contain two hexamers with a three base-pair spacer with an inverted repeat in the second hexamer22."[2]

A "strict ideal 15mer motif search pattern of two hexamers with a 3mer spacer [is] AGAACANNNTGTTCT [...]."[2]

"The ideal 15mer model of AGAACANNNTGTTCT had 71 AREs."[2]

"Using the Jaspar motif database, we grouped significant transcription factor logos within a window of ±160 bp from the ARE with a p-value of less than 0.05. Top hits included forkhead box (FOX), Krüppel-like factors (KLF), basic helix-loop-helix (BHLH), sterol regulatory element binding factor (SREBF), and v-myc avian myelocytomatosis viral oncogene homolog (MYC) families of transcription factors."[2]

"Taken together, the transcription factor network analysis (top hits in motif enrichment with p-values below 0.05) suggests that KLF, MYC, FOX, and SREBF families of transcription factors have the ability to utilize motifs in the cistrome of AREs and to cooperate with the AR."[2]

"Despite a larger number of weaker ARE half sites found in the proximity of KLF motifs, stronger AREs next to KLF motifs resulted in a larger transcriptional response."[2]

"The KLF family recognizing GC/GT boxes has been implicated in regulation of oncogenic expression signatures in LNCaP and PC3 prostate cancer cell lines85,86."[2]

Human genes

Gene ID: 367 is AR androgen receptor aka NR3C4 on Xq12: "The androgen receptor gene is more than 90 kb long and codes for a protein that has 3 major functional domains: the N-terminal domain, DNA-binding domain, and androgen-binding domain. The protein functions as a steroid-hormone activated transcription factor. Upon binding the hormone ligand, the receptor dissociates from accessory proteins, translocates into the nucleus, dimerizes, and then stimulates transcription of androgen responsive genes. This gene contains 2 polymorphic trinucleotide repeat segments that encode polyglutamine and polyglycine tracts in the N-terminal transactivation domain of its protein. Expansion of the polyglutamine tract from the normal 9-34 repeats to the pathogenic 38-62 repeats causes spinal bulbar muscular atrophy (SBMA, also known as Kennedy's disease). Mutations in this gene are also associated with complete androgen insensitivity (CAIS). Alternative splicing results in multiple transcript variants encoding different isoforms."[3]

Gene expressions

"Biased expression in liver (RPKM 12.9), endometrium (RPKM 10.8) and 13 other tissues."[3]

Consensus sequences

5′-GGTACACGGTGTTCT-3′[1]

5'-TGGAGAACAGCCTGTTCTCCA-3' or 5'-AGAACAGCCTGTTCT-3'[2] "Using the identified AREs within our experiment a refined extended canonical ARE model is proposed and deposited in transcription factor databases [...]."[2]

Hypotheses

  1. A1BG has no Androgen response elements in either promoter.
  2. A1BG is not transcribed by an Androgen response element.
  3. Androgen response element does not participate in the transcription of A1BG.

Androgen response element1 (Kouhpayeh) samplings

Copying a responsive elements consensus sequence GGTACA and putting the sequence in "⌘F" finds none between ZNF497 and A1BG or none between ZSCAN22 and A1BG as can be found by the computer programs.

For the Basic programs testing consensus sequence GGTACA (starting with SuccessablesAnRE1.bas) written to compare nucleotide sequences with the sequences on either the template strand (-), or coding strand (+), of the DNA, in the negative direction (-), or the positive direction (+), the programs are, are looking for, and found:

  1. negative strand, negative direction: 0.
  2. negative strand, positive direction: 0.
  3. positive strand, negative direction: 0.
  4. positive strand, positive direction: 3, GGTACA at 3901, GGTACA at 3336, GGTACA at 2474.
  5. inverse complement, negative strand, negative direction: 3, TGTACC at 2614, TGTACC at 2152, TGTACC at 1207.
  6. inverse complement, negative strand, positive direction: 0.
  7. inverse complement, positive strand, negative direction: 1, TGTACC at 798.
  8. inverse complement, positive strand, positive direction: 1, TGTACC at 2032.

AnRE1 proximal promoters

  1. Negative strand, negative direction: TGTACC at 2614.

AnRE1 distal promoters

  1. Negative strand, negative direction: TGTACC at 2152, TGTACC at 1207.
  2. Positive strand, negative direction: TGTACC at 798.


  1. Positive strand, positive direction: 3, GGTACA at 3901, GGTACA at 3336, GGTACA at 2474, TGTACC at 2032.

AnRE1 random dataset samplings

  1. AnRE1r0: 1, GGTACA at 1283.
  2. AnRE1r1: 2, GGTACA at 3352, GGTACA at 1127.
  3. AnRE1r2: 0.
  4. AnRE1r3: 0.
  5. AnRE1r4: 1, GGTACA at 3774.
  6. AnRE1r5: 2, GGTACA at 3253, GGTACA at 2244.
  7. AnRE1r6: 1, GGTACA at 1805.
  8. AnRE1r7: 0.
  9. AnRE1r8: 0.
  10. AnRE1r9: 1, GGTACA at 4067.
  11. AnRE1r0ci: 0.
  12. AnRE1r1ci: 0.
  13. AnRE1r2ci: 0.
  14. AnRE1r3ci: 0.
  15. AnRE1r4ci: 2, TGTACC at 4543, TGTACC at 1217.
  16. AnRE1r5ci: 1, TGTACC at 4395.
  17. AnRE1r6ci: 1, TGTACC at 4193.
  18. AnRE1r7ci: 0.
  19. AnRE1r8ci: 2, TGTACC at 2780, TGTACC at 455.
  20. AnRE1r9ci: 0.

AnRE1r UTRs

  1. AnRE1r4: GGTACA at 3774.
  2. AnRE1r4ci: TGTACC at 4543.
  3. AnRE1r6ci: TGTACC at 4193.

RDr core promoters

  1. AnRE1r5ci: TGTACC at 4395.

AnRE1r proximal promoters

  1. AnRE1r8ci: TGTACC at 2780.


  1. AnRE1r9: GGTACA at 4067.

AnRE1r distal promoters

  1. AnRE1r0: GGTACA at 1283.
  2. AnRE1r6: GGTACA at 1805.
  3. AnRE1r4ci: TGTACC at 1217.
  4. AnRE1r8ci: TGTACC at 455.


  1. AnRE1r1: GGTACA at 3352, GGTACA at 1127.
  2. AnRE1r5: GGTACA at 3253, GGTACA at 2244.

Androgen response element2 (Kouhpayeh) samplings

For the Basic programs testing consensus sequence TGTTCT (starting with SuccessablesAnRE2.bas) written to compare nucleotide sequences with the sequences on either the template strand (-), or coding strand (+), of the DNA, in the negative direction (-), or the positive direction (+), the programs are, are looking for, and found:

  1. negative strand, negative direction: 4, TGTTCT at 3759, TGTTCT at 3635, TGTTCT at 3340, TGTTCT at 3307.
  2. negative strand, positive direction: 1, TGTTCT at 108.
  3. positive strand, negative direction: 1, TGTTCT at 45.
  4. positive strand, positive direction: TGTTCT, 0.
  5. inverse complement, negative strand, negative direction: 1, AGAACA at 281.
  6. inverse complement, negative strand, positive direction: 2, AGAACA at 4068, AGAACA at 3094.
  7. inverse complement, positive strand, negative direction: 2, AGAACA at 3668, AGAACA at 287.
  8. inverse complement, positive strand, positive direction: 0.

AnRE2 UTRs

  1. Negative strand, negative direction: TGTTCT at 3759, TGTTCT at 3635, TGTTCT at 3340, TGTTCT at 3307.
  2. Positive strand, negative direction: AGAACA at 3668.

AnRE2 (Kouhpayeh) proximal promoters

Negative strand, positive direction: AGAACA at 4068.

AnRE2 (Kouhpayeh) distal promoters

  1. Negative strand, negative direction: AGAACA at 281.
  2. Positive strand, negative direction: :AGAACA at 287, TGTTCT at 45.


  1. Negative strand, positive direction: AGAACA at 3094, TGTTCT at 108.

AnRE2 random dataset samplings

  1. RDr0: 0.
  2. RDr1: 0.
  3. RDr2: 0.
  4. RDr3: 0.
  5. RDr4: 0.
  6. RDr5: 0.
  7. RDr6: 0.
  8. RDr7: 0.
  9. RDr8: 0.
  10. RDr9: 0.
  11. RDr0ci: 0.
  12. RDr1ci: 0.
  13. RDr2ci: 0.
  14. RDr3ci: 0.
  15. RDr4ci: 0.
  16. RDr5ci: 0.
  17. RDr6ci: 0.
  18. RDr7ci: 0.
  19. RDr8ci: 0.
  20. RDr9ci: 0.

RDr UTRs

RDr core promoters

RDr proximal promoters

RDr distal promoters

Full AnRE (Kouhpayeh)

  1. positive strand, positive direction, looking for 5'-GGTACANNNTGTTCT-3', 0.
  2. inverse complement, negative strand, negative direction, looking for 5'-AGAACANNNTGTACC-3', 0.
  3. inverse negative strand, negative direction, looking for 5'-TCTTGTNNNACATGG-3', 0.
  4. inverse negative strand, positive direction, looking for 5'-TCTTGTNNNACATGG-3', 0.

Androgen response element (Wilson) samplings

Copying a responsive elements consensus sequence TGATTCGTGAG and putting the sequence in "⌘F" finds none between ZNF497 and A1BG or none between ZSCAN22 and A1BG as can be found by the computer programs. Ideal consensus sequence is AGAACANNNTGTTCT.[2]

For the Basic programs testing consensus sequence AGAACANNNTGTTCT (starting with SuccessablesAnREW.bas) written to compare nucleotide sequences with the sequences on either the template strand (-), or coding strand (+), of the DNA, in the negative direction (-), or the positive direction (+), the programs are, are looking for, and found:

  1. negative strand, negative direction, looking for AGAACANNNTGTTCT, 0.
  2. negative strand, positive direction, looking for AGAACANNNTGTTCT, 0.
  3. positive strand, negative direction, looking for AGAACANNNTGTTCT-3', 0.
  4. positive strand, positive direction, looking for AGAACANNNTGTTCT-3', 0.
  5. complement, negative strand, negative direction, looking for TCTTGTNNNACAAGA, 0.
  6. complement, negative strand, positive direction, looking for TCTTGTNNNACAAGA, 0.
  7. complement, positive strand, negative direction, looking for TCTTGTNNNACAAGA, 0.
  8. complement, positive strand, positive direction, looking for TCTTGTNNNACAAGA, 0.

The inverse complement is the same as the initial direct.

The inverse is the same as the initial direct complement.

AnRE (Wilson) UTRs

  1. Negative strand, negative direction: TGTTCT at 3759, TGTTCT at 3635, TGTTCT at 3340, TGTTCT at 3307.
  2. Positive strand, negative direction: AGAACA at 3668.

AnRE (Wilson) proximal promoters

Negative strand, positive direction: AGAACA at 4068.

AnRE (Wilson) distal promoters

  1. Negative strand, negative direction: AGAACA at 281.
  2. Positive strand, negative direction: AGAACA at 287, TGTTCT at 45.


  1. Negative strand, positive direction: AGAACA at 3094, TGTTCT at 108.
  2. Positive strand, positive direction: ACAAGA at 108.

Acknowledgements

The content on this page was first contributed by: Henry A. Hoff.

See also

References

  1. 1.0 1.1 1.2 1.3 S Kouhpayeh, AR Einizadeh, Z Hejazi, M Boshtam, L Shariati, M Mirian, L Darzi, M Sojoudi, H Khanahmad and A Rezaei (1 July 2016). "Antiproliferative effect of a synthetic aptamer mimicking androgen response elements in the LNCaP cell line" (PDF). Cancer Gene Therapy. 23: 254–257. doi:10.1038/cgt.2016.26. Retrieved 3 October 2020.
  2. 2.00 2.01 2.02 2.03 2.04 2.05 2.06 2.07 2.08 2.09 2.10 Stephen Wilson, Jianfei Qi & Fabian V. Filipp (14 September 2016). "Refinement of the androgen response element based on ChIP-Seq in androgen-insensitive and androgen-responsive prostate cancer cell lines". Scientific Reports. 6: 32611. doi:10.1038/srep32611. Retrieved 3 October 2020.
  3. 3.0 3.1 RefSeq (January 2017). "AR androgen receptor [ Homo sapiens (human) ]". 8600 Rockville Pike, Bethesda MD, 20894 USA: National Center for Biotechnology Information, U.S. National Library of Medicine. Retrieved 1 November 2020.

External links