M box gene transcriptions: Difference between revisions

Jump to navigation Jump to search
 
(26 intermediate revisions by the same user not shown)
Line 1: Line 1:
'''Editor-In-Chief:''' Henry A. Hoff
'''Editor-In-Chief:''' Henry A. Hoff
[[Image:RhodeusSericeusBitterlingMaleSpawingColors.JPG|thumb|right|300px|The dark appearance of the dorsal side of the male bitterling ''Rhodeus amarus'' is caused by a dispersal of melanosomes in simulate the dark bottom of the fish tank. Credit: [[commons:User:Viridiflavus|Viridiflavus]].]]
[[Image:RhodeusSericeusBitterlingMaleSpawingColors.JPG|thumb|right|300px|The dark appearance of the dorsal side of the male bitterling ''Rhodeus amarus'' is caused by a dispersal of melanosomes in simulate the dark bottom of the fish tank. Credit: [[c:User:Viridiflavus|Viridiflavus]].{tlx|free media}}]]
"In melanocytes and in melanoma cells, cyclic AMP (cAMP)-elevating agents stimulate melanogenesis and increase the transcription of tyrosinase, the rate-limiting enzyme in melanin synthesis. However, two other enzymes, tyrosinase-related protein 1 (TRP1) and TRP2, are required for a normal melanization process leading to eumelanin synthesis. In B16 melanoma cells, we demonstrated that stimulation of melanogenesis by cAMP-elevating agents results in an increase in tyrosinase, TRP1, and TRP2 expression. cAMP, through a cAMP-dependent protein kinase pathway, stimulates TRP1 and TRP2 promoter activities in both B16 mouse melanoma cells and normal human melanocytes. Regulation of the TRP1 and TRP2 promoters by cAMP involves a M box and an E box."<ref name=Bertolotto>{{ cite journal
"In melanocytes and in melanoma cells, cyclic AMP (cAMP)-elevating agents stimulate melanogenesis and increase the transcription of tyrosinase, the rate-limiting enzyme in melanin synthesis. However, two other enzymes, tyrosinase-related protein 1 (TRP1) and TRP2, are required for a normal melanization process leading to eumelanin synthesis. In B16 melanoma cells, we demonstrated that stimulation of melanogenesis by cAMP-elevating agents results in an increase in tyrosinase, TRP1, and TRP2 expression. cAMP, through a cAMP-dependent protein kinase pathway, stimulates TRP1 and TRP2 promoter activities in both B16 mouse melanoma cells and normal human melanocytes. Regulation of the TRP1 and TRP2 promoters by cAMP involves a M box and an E box."<ref name=Bertolotto>{{ cite journal
|author=Corine Bertolotto, Roser Buscà, Patricia Abbe, Karine Bille, Edith Aberdam, Jean-Paul Ortonne, and Robert Ballotti
|author=Corine Bertolotto, Roser Buscà, Patricia Abbe, Karine Bille, Edith Aberdam, Jean-Paul Ortonne, and Robert Ballotti
Line 27: Line 27:
==Human genes==
==Human genes==
{{main|Human genes}}
{{main|Human genes}}
Gene ID: 1638 is DCT [[dopachrome tautomerase]], aka TRP-2; TYRP2.<ref name=HGNC>{{ cite web
Gene ID: 1638 is DCT [[dopachrome tautomerase]], aka TRP-2; TYRP2.<ref name=HGNC1638>{{ cite web
|author=HGNC
|author=HGNC
|title=DCT dopachrome tautomerase [ Homo sapiens (human) ]
|title=DCT dopachrome tautomerase [ Homo sapiens (human) ]
Line 35: Line 35:
|url=https://www.ncbi.nlm.nih.gov/gene/1638
|url=https://www.ncbi.nlm.nih.gov/gene/1638
|accessdate=29 January 2020 }}</ref>
|accessdate=29 January 2020 }}</ref>
# NP_001123361.1 L-dopachrome tautomerase isoform 2 precursor: "Transcript Variant: This variant (2) includes two alternate in-frame exons and is predicted to encode a slightly longer protein isoform (2) compared to isoform 1."<ref name=HGNC/>
# NP_001123361.1 L-dopachrome tautomerase isoform 2 precursor: "Transcript Variant: This variant (2) includes two alternate in-frame exons and is predicted to encode a slightly longer protein isoform (2) compared to isoform 1."<ref name=HGNC1638/>
# NP_001309111.1 L-dopachrome tautomerase isoform 3.<ref name=HGNC/>
# NP_001309111.1 L-dopachrome tautomerase isoform 3.<ref name=HGNC1638/>
# NP_001309112.1 L-dopachrome tautomerase isoform 3.<ref name=HGNC/>
# NP_001309112.1 L-dopachrome tautomerase isoform 3.<ref name=HGNC1638/>
# NP_001309113.1 L-dopachrome tautomerase isoform 3.<ref name=HGNC/>
# NP_001309113.1 L-dopachrome tautomerase isoform 3.<ref name=HGNC1638/>
# NP_001309114.1 L-dopachrome tautomerase isoform 3.<ref name=HGNC/>
# NP_001309114.1 L-dopachrome tautomerase isoform 3.<ref name=HGNC1638/>
# NP_001309115.1 L-dopachrome tautomerase isoform 4.<ref name=HGNC/>
# NP_001309115.1 L-dopachrome tautomerase isoform 4.<ref name=HGNC1638/>
# NP_001913.2 L-dopachrome tautomerase isoform 1 precursor: "Transcript Variant: This variant (1) represents the more abundant transcript."<ref name=HGNC/>
# NP_001913.2 L-dopachrome tautomerase isoform 1 precursor: "Transcript Variant: This variant (1) represents the more abundant transcript."<ref name=HGNC1638/>


Gene ID: 7306 is [[TYRP1]] tyrosinase related protein 1: "This gene encodes a melanosomal enzyme that belongs to the tyrosinase family and plays an important role in the melanin biosynthetic pathway. Defects in this gene are the cause of rufous oculocutaneous albinism and oculocutaneous albinism type III."<ref name=RefSeq2009>{{ cite web
Gene ID: 7306 is [[TYRP1]] tyrosinase related protein 1: "This gene encodes a melanosomal enzyme that belongs to the tyrosinase family and plays an important role in the melanin biosynthetic pathway. Defects in this gene are the cause of rufous oculocutaneous albinism and oculocutaneous albinism type III."<ref name=RefSeq7306>{{ cite web
|author=RefSeq
|author=RefSeq
|title=TYRP1 tyrosinase related protein 1 [ Homo sapiens (human) ]
|title=TYRP1 tyrosinase related protein 1 [ Homo sapiens (human) ]
Line 55: Line 55:
{{main|Consensus sequence gene transcriptions}}
{{main|Consensus sequence gene transcriptions}}
"Tyrosinase and TRP1 promoters share an 11-bp motif (AGTCATGTGCT) termed the M box located upstream of the TATA box. This motif binds microphthalmia, a basic helix-loop-helix transcription factor that increases tyrosinase and TRP1 promoter activities, thereby playing a key role in the tissue-specific expression of these genes (11, 29, 40). In the TRP2 promoter, a homologous sequence (GTCATGTGCT) is also found upstream of the TATA box (41)."<ref name=Bertolotto/>
"Tyrosinase and TRP1 promoters share an 11-bp motif (AGTCATGTGCT) termed the M box located upstream of the TATA box. This motif binds microphthalmia, a basic helix-loop-helix transcription factor that increases tyrosinase and TRP1 promoter activities, thereby playing a key role in the tissue-specific expression of these genes (11, 29, 40). In the TRP2 promoter, a homologous sequence (GTCATGTGCT) is also found upstream of the TATA box (41)."<ref name=Bertolotto/>
The M box consensus sequence GTCATGTGCT<ref name=Bertolotto/> does not occur on either side of A1BG. The random datasets had only one occurrence GTCATGTGCT at 1977 in the distal promoter.
M-box consensus sequence is GGTCATGTGCT.<ref name=Zhao>{{ cite journal
|author=Yuanyuan Zhao, Jinzhu Meng, Guoqing Cao, Pengfei Gao & Changsheng Dong
|title=Screening the optimal activity region of the dopachrome tautomerase gene promoter in sheep skin melanocytes
|journal=Journal of Applied Animal Research
|date=28 August 2018
|volume=46
|issue=1
|pages=1382-1388
|url=https://www.tandfonline.com/doi/pdf/10.1080/09712119.2018.1512497
|arxiv=
|bibcode=
|doi=10.1080/09712119.2018.1512497
|pmid=
|accessdate=6 August 2021 }}</ref> This contains the core consensus sequence GTCATGTGCT.<ref name=Bertolotto/>


"The conserved region contains a consensus M-box element (TCACATGA) for binding of MITF. This MITF binding site is aligned and conserved between at least 11 different species [...]. The clear conservation of these elements suggests that ''gpnmb'' has similar regulation in all mammals."<ref name=Ripoll>{{ cite journal
"The conserved region contains a consensus M-box element (TCACATGA) for binding of MITF. This MITF binding site is aligned and conserved between at least 11 different species [...]. The clear conservation of these elements suggests that ''gpnmb'' has similar regulation in all mammals."<ref name=Ripoll>{{ cite journal
Line 85: Line 102:
# inverse complement, negative strand, positive direction, looking for AGCACATGAC, 0.
# inverse complement, negative strand, positive direction, looking for AGCACATGAC, 0.


==M-box (Hoek) samplings==
==Bertolotto random dataset samplings==
{{main|Model samplings}}
Copying a responsive elements consensus sequence (T/N)(C/T)(A/G)TG(A/N) and putting the sequence in "⌘F" finds none between ZNF497 and A1BG or none between ZSCAN22 and A1BG as can be found by the computer programs.


For the Basic programs testing consensus sequence (T/N)(C/T)(A/G)TG(A/N) (starting with SuccessablesMHbox.bas) written to compare nucleotide sequences with the sequences on either the template strand (-), or coding strand (+), of the DNA, in the negative direction (-), or the positive direction (+), the programs are, are looking for, and found:
# Mboxr0: 0.
# negative strand, negative direction, looking for (T/N)(C/T)(A/G)TG(A/N), 67, CTGTGC at 4470, ACGTGA at 4340, TTGTGA at 4335, TTGTGT at 4196, TTGTGA at 3982, GTGTGG at 3967, TCGTGT at 3915, GTATGG at 3858, TCGTGA at 3734, TTGTGT at 3670, CCGTGT at 3632, CTATGC at 3546, TTATGA at 3540, TTGTGT at 3513, CTGTGT at 3391, ACGTGA at 3289, CCGTGC at 3230, TCGTGA at 3073, GTATGG at 2993, TCGTGA at 2918, ACGTGG at 2761, ATGTGT at 2687, CCGTGT at 2665, ATATGT at 2641, TTGTGA at 2550, CCGTGC at 2523, ACGTGA at 2426, GTGTGG at 2418, TCATGG at 2343, TTATGT at 2304, TCATGA at 2214, CTATGT at 2179, TTATGA at 2161, TCGTGA at 2097, ACGTGA at 2000, TTATGT at 1877, TCGTGA at 1788, ACGTGT at 1719, CTATGG at 1667, TTATGT at 1565, GTGTGA at 1543, TTGTGT at 1541, ACGTGA at 1347, ACGTGG at 1339, TCGTGA at 1143, GTGTGG at 1128, CCGTGT at 1116, GTGTGC at 962, CCGTGT at 960, GTGTGG at 882, ACATGG at 798, TTATGT at 768, CTGTGG at 748, TCGTGA at 679, CCGTGC at 652, TTATGT at 634, TCGTGA at 543, GTGTGC at 530, CCGTGT at 518, TTATGC at 491, TCGTGA at 406, TTGTGC at 341, ACATGA at 325, CCGTGT at 266, ATATGT at 112, ATGTGG at 61, CTATGT at 59.
# Mboxr1: 0.
# positive strand, negative direction, looking for (T/N)(C/T)(A/G)TG(A/N), 51, CTGTGC at 4401, GTGTGA at 4361, GCATGC at 4248, ACGTGG at 4238, TCATGC at 4117, GCATGG at 4107, ATGTGA at 4092, CTGTGG at 3959, GTATGT at 3831, ATGTGG at 3810, CTGTGG at 3711, GTGTGC at 3560, CTGTGT at 3558, CTGTGC at 3428, CTGTGA at 3267, GTGTGG at 3186, GCGTGG at 3047, ATATGT at 2984, GTGTGC at 2862, GCATGA at 2784, TCATGC at 2752, ACATGC at 2668, GTGTGG at 2658, ACATGG at 2614, GTGTGG at 2605, TCATGC at 2534, GCATGC at 2323, GCATGG at 2277, ACATGG at 2152, ATGTGG at 2065, GCATGA at 1953, GCGTGG at 1897, ACATGT at 1850, ATATGG at 1742, GTGTGA at 1479, GCGTGG at 1244, CCGTGT at 1220, ACATGG at 1207, GCATGG at 953, GTGTGT at 795, ATGTGG at 787, GCGTGG at 741, GTGTGT at 610, ATATGG at 605, TTATGA at 351, ATATGA at 274, CTATGT at 212, TCATGG at 201, GCATGA at 124, ATATGG at 77, ATATGT at 42.
# Mboxr2: 0.
# positive strand, positive direction, looking for (T/N)(C/T)(A/G)TG(A/N), 76, ACGTGG at 4343, CTGTGA at 4334, TTGTGC at 4258, GTATGT at 4151, CCGTGA at 4006, ACGTGG at 3885, GTGTGG at 3824, TCGTGT at 3740, TCGTGG at 3601, CTGTGG at 3532, GTGTGA at 3507, GTGTGT at 3411, CCGTGT at 3409, ACGTGA at 3343, ACGTGG at 3322, CCATGA at 3115, GTGTGT at 3098, TTGTGT at 3096, TCGTGG at 3044, GTGTGG at 2964, ACGTGT at 2962, TTGTGT at 2835, CTATGA at 2739, GTGTGA at 2636, GTGTGG at 2602, TCATGG at 2597, ATATGG at 2590, GCGTGG at 2566, ATGTGG at 2430, TCGTGA at 2201, TCGTGA at 2105, CCGTGA at 1996, ACGTGT at 1822, GTGTGC at 1697, CTGTGT at 1695, CCGTGC at 1640, TCGTGG at 1628, GTGTGC at 1558, GCGTGT at 1556, GCGTGC at 1552, ACGTGA at 1472, TCGTGG at 1460, ACGTGA at 1372, TCGTGG at 1360, GCGTGC at 1300, GTGTGC at 1222, ACGTGT at 1220, TCGTGG at 1208, GTGTGA at 1138, GCGTGT at 1136, GCGTGC at 1132, TCGTGG at 1124, GCGTGT at 1052, GCGTGC at 1048, TCGTGG at 1040, GTGTGC at 986, GCGTGC at 978, TCGTGG at 956, GTGTGC at 886, GCGTGC at 878, TCGTGG at 856, GTGTGC at 802, GCGTGT at 800, GCGTGC at 796, TCGTGG at 788, TCGTGG at 772, GTGTGT at 716, CTGTGT at 714, GCGTGA at 686, TCGTGG at 620, ATGTGC at 568, GCATGT at 566, GTGTGT at 550, ACGTGT at 548, GTGTGA at 345, CCATGA at 127.
# Mboxr3: 0.
# negative strand, positive direction, looking for (T/N)(C/T)(A/G)TG(A/N), 56, CTGTGG at 4394, TCGTGG at 4377, CCATGC at 4371, TCATGT at 4365, CCATGG at 4222, ACATGA at 4154, GTGTGA at 3971, GTGTGG at 3966, ATGTGC at 3959, ACATGT at 3957, ATGTGA at 3903, CCATGT at 3901, CCATGC at 3828, CCGTGG at 3813, ACATGT at 3743, ACATGA at 3708, CTGTGG at 3642, GTGTGA at 3594, CCATGG at 3581, CTGTGG at 3436, CCATGT at 3336, CTATGA at 3260, ATATGG at 3162, GCATGC at 3141, TTATGA at 3027, CTGTGC at 2959, TTGTGC at 2680, GCGTGA at 2555, CCATGT at 2474, TCATGG at 2267, CTGTGT at 2252, CTATGG at 2159, GCATGC at 2154, ACATGA at 2141, CTGTGT at 2076, ACATGG at 2032, GTGTGG at 1971, CCATGC at 1934, ACATGT at 1872, GTGTGG at 1805, GCGTGA at 1720, TCGTGC at 1520, GCGTGC at 1243, TCATGC at 1239, GCGTGC at 1217, TCATGG at 1187, GTGTGG at 1022, GCGTGT at 1020, TCGTGC at 1016, CTGTGG at 918, CTGTGG at 818, ACGTGC at 571, GCGTGC at 545, CTGTGT at 266, ATGTGA at 230, TCGTGT at 80.
# Mboxr4: 0.
# inverse complement, negative strand, negative direction, looking for (N/T)CA(C/T)(A/G)(N/A), 37, ACACGA at 4402, TCACAC at 4360, CCACGA at 3955, TCACAA at 3940, CCACGG at 3881, TCATAC at 3830, ACACGG at 3561, ACACAC at 3559, ACACGT at 3429, CCACGG at 3197, CCACAC at 3185, ACACGT at 2863, TCACAC at 2861, TCACAC at 2657, TCACAC at 2604, TCATGG at 2343, CCACGG at 2334, TCATGA at 2214, CCACGC at 2196, TCACAT at 2086, CCACGT at 2081, TCACGT at 1535, CCACAC at 1478, TCACGT at 1470, CCACGG at 1255, GCACAG at 1221, ACATGG at 798, ACACAT at 796, CCACAC at 794, ACACAT at 611, CCACAC at 609, TCACGC at 447, CCACGC at 379, TCACGA at 335, ACATGA at 325, TCACAT at 323, CCATAT at 180.
# Mboxr5: 0.
# inverse complement, positive strand, negative direction, looking for (N/T)CA(C/T)(A/G)(N/A), 67, TCACAT at 4532, ACACGA at 4471, GCATGC at 4248, ACACAG at 4197, TCATGC at 4117, GCATGG at 4107, ACATAG at 4046, TCACAC at 3966, GCACAG at 3916, CCATAC at 3857, CCACAA at 3765, CCACAG at 3693, ACACAG at 3671, GCACAA at 3633, ACACAA at 3514, GCATAT at 3452, GCATAG at 3446, TCATAG at 3421, ACACAT at 3392, TCACGC at 3280, GCACGA at 3231, ACATAA at 3169, ACATAC at 2992, GCATGA at 2784, TCATGC at 2752, ACACAG at 2688, ACATGC at 2668, GCACAT at 2666, ACATGG at 2614, TCATGC at 2534, GCACGG at 2524, TCACAC at 2417, GCATGC at 2323, GCATGG at 2277, TCACAA at 2243, TCACGC at 2207, ACATGG at 2152, TCACGC at 1991, GCATGA at 1953, ACATGT at 1850, TCACGT at 1772, GCACAA at 1720, ACACAC at 1542, ACATGG at 1207, TCACGA at 1181, TCACAC at 1127, GCACAG at 1117, ACACGC at 963, GCACAC at 961, GCATGG at 953, TCACAC at 881, TCACGA at 707, TCACGC at 663, GCACGG at 653, TCACGA at 571, ACACGT at 531, TCACAC at 529, GCACAG at 519, ACATAG at 468, TCACGA at 434, ACACGT at 342, TCACAG at 296, ACATAA at 269, GCACAT at 267, TCATGG at 201, GCATGA at 124, CCATAT at 40.
# Mboxr6: 0.
# inverse complement, positive strand, positive direction, looking for (N/T)CA(C/T)(A/G)(N/A), 54, TCACGG at 4274, CCACAC at 3970, TCACAC at 3965, ACACGT at 3960, CCACAG at 3951, CCACGT at 3883, TCACAC at 3593, TCACGT at 3464, TCACGG at 3235, CCACGA at 3151, CCATGA at 3115, TCACGG at 3011, ACACGT at 2960, CCACAA at 2814, CCACGT at 2800, ACACGT at 2681, CCATAA at 2643, TCATGG at 2597, TCACAG at 2465, CCACGT at 2334, TCACGT at 2326, ACACAT at 2253, TCACAG at 2171, CCACGA at 2089, ACACAG at 2077, TCACGT at 2063, CCACAC at 1970, CCACAC at 1804, TCACGT at 1786, CCACGC at 1763, TCACGC at 1725, TCACGC at 1589, GCACGC at 1521, TCACGC at 1253, GCACGC at 1244, GCACGT at 1218, TCACGC at 1169, TCACGC at 1160, TCACGC at 1085, GCACAC at 1021, GCACGC at 1017, CCACGT at 783, TCACGC at 665, TCACGC at 581, GCACGG at 572, GCATGT at 566, GCACGT at 546, TCACGC at 497, CCACGC at 488, ACACAG at 267, CCATGA at 127, ACATAA at 115, CCACAA at 106, GCACAG at 81.
# Mboxr7: 0.
# inverse complement, negative strand, positive direction, looking for TTTTTTTT, 0.
# Mboxr8: 0.
# Mboxr9: 1, GTCATGTGCT at 1977.
# Mboxr0ci: 0.
# Mboxr1ci: 0.
# Mboxr2ci: 0.
# Mboxr3ci: 0.
# Mboxr4ci: 0.
# Mboxr5ci: 0.
# Mboxr6ci: 0.
# Mboxr7ci: 0.
# Mboxr8ci: 0.
# Mboxr9ci: 0.


===MHbox UTRs===
===Mboxr distal promoters===
{{main|UTR promoter gene transcriptions}}
{{main|Distal promoter gene transcriptions}}
# Mboxr9: GTCATGTGCT at 1977.


===MHbox core promoters===
==M-box (Hoek) samplings==
{{main|Core promoter gene transcriptions}}
{{main|Model samplings}}
Copying a responsive elements consensus sequence (T/N)CA(C/T)(A/G)TG(A/N) and putting the sequence in "⌘F" finds none between ZNF497 and A1BG or none between ZSCAN22 and A1BG as can be found by the computer programs.


===MHbox proximal promoters===
For the Basic programs testing consensus sequence (T/N)CA(C/T)(A/G)TG(A/N) (starting with SuccessablesMHbox.bas) written to compare nucleotide sequences with the sequences on either the template strand (-), or coding strand (+), of the DNA, in the negative direction (-), or the positive direction (+), the programs are, are looking for, and found:
{{main|Proximal promoter gene transcriptions}}
# negative strand, negative direction, looking for (T/N)CA(C/T)(A/G)TG(A/N), 2, ACACATGG at 798, TCACATGA at 325.
# positive strand, negative direction, looking for (T/N)CA(C/T)(A/G)TG(A/N), 2, GCACATGC at 2668, CCATATGT at 42.
# positive strand, positive direction, looking for (T/N)CA(C/T)(A/G)TG(A/N), 5, CCACGTGG at 3885, ACACGTGT at 2962, GCACGTGT at 1220, GCATGTGC at 568, GCACGTGT at 548.
# negative strand, positive direction, looking for (T/N)CA(C/T)(A/G)TG(A/N), 7, ACATGTGC at 3959, TCACATGT at 3957, CCATGTGA at 3903, GCACATGT at 3743, CCACATGA at 3708, CCACATGG at 2032, ACACGTGC at 571.
Inverse complement is the same as the first four.


===MHbox distal promoters===
===MHbox distal promoters===
{{main|Distal promoter gene transcriptions}}
{{main|Distal promoter gene transcriptions}}
# Negative strand, negative direction: TCACATGA at 325.
# Negative strand, positive direction: TCACATGT at 3957, CCATGTGA at 3903, CCACATGA at 3708.


===MHbox random dataset samplings===
==MHbox random dataset samplings==


# MHboxr0: 0.
# MHboxr0: 1, TCATATGG at 2152.
# MHboxr1: 0.
# MHboxr1: 3, GCATGTGC at 3892, ACATATGG at 2102, GCATATGC at 1606.
# MHboxr2: 0.
# MHboxr2: 3, GCACATGC at 3735, ACATATGA at 1758, CCATGTGA at 167.
# MHboxr3: 0.
# MHboxr3: 3, GCACGTGT at 3770, TCATGTGT at 1135, CCATGTGA at 94.
# MHboxr4: 0.
# MHboxr4: 5, ACACGTGG at 2288, GCATGTGA at 2244, CCATGTGA at 2225, TCATATGA at 1686, CCACATGC at 910.
# MHboxr5: 0.
# MHboxr5: 2, CCATATGA at 1780, ACACGTGA at 60.
# MHboxr6: 0.
# MHboxr6: 6, CCACGTGT at 2906, CCACATGA at 2602, GCATGTGA at 2331, TCATGTGA at 1343, GCACGTGC at 655, CCATATGA at 246.
# MHboxr7: 0.
# MHboxr7: 1, GCACGTGA at 1857.
# MHboxr8: 0.
# MHboxr8: 0.
# MHboxr9: 0.
# MHboxr9: 3, ACACATGC at 4072, TCATGTGC at 1976, GCACGTGC at 1188.
# MHboxr0ci: 0.
 
# MHboxr1ci: 0.
===RDr arbitrary (evens) (4560-2846) UTRs===
# MHboxr2ci: 0.
 
# MHboxr3ci: 0.
# MHboxr2: GCACATGC at 3735.
# MHboxr4ci: 0.
 
# MHboxr5ci: 0.
===MHboxr alternate (odds) (4560-2846) UTRs===
# MHboxr6ci: 0.
 
# MHboxr7ci: 0.
# MHboxr1: GCATGTGC at 3892.
# MHboxr8ci: 0.
# MHboxr3: GCACGTGT at 3770.
# MHboxr9ci: 0.
 
===RDr arbitrary negative direction (evens) (2846-2811) core promoters===
 
===RDr alternate negative direction (odds) (2846-2811) core promoters===
 
===RDr arbitrary positive direction (odds) (4445-4265) core promoters===
 
===RDr alternate positive direction (evens) (4445-4265) core promoters===
 
===RDr arbitrary negative direction (evens) (2811-2596) proximal promoters===
 
===RDr alternate negative direction (odds) (2811-2596) proximal promoters===
 
===RDr arbitrary positive direction (odds) (4265-4050) proximal promoters===
 
===RDr alternate positive direction (evens) (4265-4050) proximal promoters===
 
===MHboxr arbitrary negative direction (evens) (2596-1) distal promoters===
 
# MHboxr0: TCATATGG at 2152.
# MHboxr2: ACATATGA at 1758, CCATGTGA at 167.
 
===MHboxr alternate negative direction (odds) (2596-1) distal promoters===
 
# MHboxr1: ACATATGG at 2102, GCATATGC at 1606.
# MHboxr3: TCATGTGT at 1135, CCATGTGA at 94.
 
===MHboxr arbitrary positive direction (odds) (4050-1) distal promoters===
 
# MHboxr1: GCATGTGC at 3892, ACATATGG at 2102, GCATATGC at 1606.
# MHboxr3: GCACGTGT at 3770, TCATGTGT at 1135, CCATGTGA at 94.


===MHboxr UTRs===
===MHboxr alternate positive direction (evens) (4050-1) distal promoters===
{{main|UTR promoter gene transcriptions}}


===MHboxr core promoters===
# MHboxr0: TCATATGG at 2152.
{{main|Core promoter gene transcriptions}}
# MHboxr2: GCACATGC at 3735, ACATATGA at 1758, CCATGTGA at 167.


===MHboxr proximal promoters===
===MHboxr proximal promoters===
{{main|Proximal promoter gene transcriptions}}
{{main|Proximal promoter gene transcriptions}}
# MHboxr6: CCACATGA at 2602.


===MHboxr distal promoters===
===MHboxr distal promoters===
{{main|Distal promoter gene transcriptions}}
 
# MHboxr0: TCATATGG at 2152.
# MHboxr2: ACATATGA at 1758, CCATGTGA at 167.
# MHboxr4: GCATGTGA at 2244, CCATGTGA at 2225, TCATATGA at 1686.
# MHboxr6: GCATGTGA at 2331, TCATGTGA at 1343, CCATATGA at 246.
 
 
# MHboxr3: TCATGTGT at 1135, CCATGTGA at 94.
# MHboxr5: CCATATGA at 1780, ACACGTGA at 60.
# MHboxr7: GCACGTGA at 1857.
# MHboxr9: TCATGTGC at 1976.
 
==M-box (Hoek) analysis and results==
{{main|Complex locus A1BG and ZNF497#M-box (Hoek)}}
Consensus sequences: TCAYRTG or CAYRTGA, TCA(C/T)(A/G)TG or CA(C/T)(A/G)TGA<ref name=Hoek>{{ cite journal
|author=Keith S. Hoek, Natalie C. Schlegel, Ossia M. Eichhoff, Daniel S. Widmer, Christian Praetorius, Steingrimur O. Einarsson, Sigridur Valgeirsdottir, Kristin Bergsteinsdottir, Alexander Schepsky, Reinhard Dummer, Eirikur Steingrimsson
|title=Novel MITF targets identified using a two-step DNA microarray strategy
|journal=Pigment Cell & Melanoma Research
|date=11 November 2008
|volume=21
|issue=6
|pages=665-676
|url=https://onlinelibrary.wiley.com/doi/10.1111/j.1755-148X.2008.00505.x
|arxiv=
|bibcode=
|doi=10.1111/j.1755-148X.2008.00505.x
|pmid=
|accessdate=15 December 2022 }}</ref> ~ (T/N)CA(C/T)(A/G)TG(A/N).
 
{|class="wikitable"
|-
! Reals or randoms !! Promoters !! direction !! Numbers !! Strands !! Occurrences !! Averages (± 0.1)
|-
| Reals || UTR || negative || 0 || 2 || 0 || 0
|-
| Randoms || UTR || arbitrary negative || 0 || 10 || 0 || 0
|-
| Randoms || UTR || alternate negative || 0 || 10 || 0 || 0
|-
| Reals || Core || negative || 0 || 2 || 0 || 0
|-
| Randoms || Core || arbitrary negative || 0 || 10 || 0 || 0
|-
| Randoms || Core || alternate negative || 0 || 10 || 0 || 0
|-
| Reals || Core || positive || 0 || 2 || 0 || 0
|-
| Randoms || Core || arbitrary positive || 0 || 10 || 0 || 0
|-
| Randoms || Core || alternate positive || 0 || 10 || 0 || 0
|-
| Reals || Proximal || negative || 0 || 2 || 0 || 0
|-
| Randoms || Proximal || arbitrary negative || 1 || 10 || 0.1 || 0.05
|-
| Randoms || Proximal || alternate negative || 0 || 10 || 0 || 0
|-
| Reals || Proximal || positive || 0 || 2 || 0 || 0
|-
| Randoms || Proximal || arbitrary positive || 0 || 10 || 0 || 0
|-
| Randoms || Proximal || alternate positive || 0 || 10 || 0 || 0
|-
| Reals || Distal || negative || 1 || 2 || 0.5 || 0.5 ± 0.5 (--1,+-0)
|-
| Randoms || Distal || arbitrary negative || 11 || 10 || 1.1 || 0.85 ± 0.25
|-
| Randoms || Distal || alternate negative || 6 || 10 || 0.6 || 0.85 ± 0.25
|-
| Reals || Distal || positive || 3 || 2 || 1.5 || 1.5 ± 0.5 (-+3,++0)
|-
| Randoms || Distal || arbitrary positive || 0 || 10 || 0 || 0
|-
| Randoms || Distal || alternate positive || 0 || 10 || 0 || 0
|}
 
Comparison:
 
The occurrences of real responsive element consensus sequences are greater than the randoms. This suggests that the real responsive element consensus sequences are likely active or activable.
 
Using a more general M-box consensus of (T/N)CA(C/T)(A/G)TG(A/N) yielded four sequences in the negative direction and twelve in the positive direction. Of these only TCACATGA at 325 in the negative direction and TCACATGT at 3957, CCATGTGA at 3903, and CCACATGA at 3708 in the positive direction conform to TCAYRTG or CAYRTGA<ref name=Hoek/>. The random datasets had 25 occurrences of the general consensus but only fifteen fit TCAYRTG or CAYRTGA<ref name=Hoek/>, nine in the arbitrary negative direction and six in the positive direction. The disparity between real occurrences and random occurrences suggests that the real occurrences are likely active or can be activated.


==M-box (Ripoll) samplings==
==M-box (Ripoll) samplings==
Line 155: Line 300:
# positive strand, positive direction, looking for TCACATGA, 0.
# positive strand, positive direction, looking for TCACATGA, 0.
# negative strand, positive direction, looking for TCACATGA, 0.
# negative strand, positive direction, looking for TCACATGA, 0.
# complement, negative strand, negative direction, looking for AGTGTACT, 0.
# complement, positive strand, negative direction, looking for AGTGTACT, 1, AGTGTACT at 325.
# complement, positive strand, positive direction, looking for AGTGTACT, 0.
# complement, negative strand, positive direction, looking for AGTGTACT, 0.
# inverse complement, negative strand, negative direction, looking for TCATGTGA, 0.
# inverse complement, negative strand, negative direction, looking for TCATGTGA, 0.
# inverse complement, positive strand, negative direction, looking for TCATGTGA, 0.
# inverse complement, positive strand, negative direction, looking for TCATGTGA, 0.
# inverse complement, positive strand, positive direction, looking for TCATGTGA, 0.
# inverse complement, positive strand, positive direction, looking for TCATGTGA, 0.
# inverse complement, negative strand, positive direction, looking for TCATGTGA, 0.
# inverse complement, negative strand, positive direction, looking for TCATGTGA, 0.
# inverse negative strand, negative direction, looking for AGTACACT, 0.
# inverse positive strand, negative direction, looking for AGTACACT, 0.
# inverse positive strand, positive direction, looking for AGTACACT, 0.
# inverse negative strand, positive direction, looking for AGTACACT, 0.


===M-box distal promoters===
===M-box distal promoters===
{{main|Distal promoter gene transcriptions}}
{{main|Distal promoter gene transcriptions}}
Negative strand, negative direction: TCACATGA at 325.
Negative strand, negative direction: TCACATGA at 325.
==M-box random dataset samplings==
# M-boxr0: 0.
# M-boxr1: 0.
# M-boxr2: 0.
# M-boxr3: 0.
# M-boxr4: 0.
# M-boxr5: 0.
# M-boxr6: 0.
# M-boxr7: 0.
# M-boxr8: 0.
# M-boxr9: 0.
# M-boxr0ci: 0.
# M-boxr1ci: 0.
# M-boxr2ci: 0.
# M-boxr3ci: 0.
# M-boxr4ci: 0.
# M-boxr5ci: 0.
# M-boxr6ci: 0.
# M-boxr7ci: 0.
# M-boxr8ci: 0.
# M-boxr9ci: 0.
==M-box (Ripoll) analysis and results==
{{main|Complex locus A1BG and ZNF497#M-box (Ripoll)}}
The M-box with the consensus sequence TCACATGA<ref name=Ripoll/> occurred only once.
{|class="wikitable"
|-
! Reals or randoms !! Promoters !! direction !! Numbers !! Strands !! Occurrences !! Averages (± 0.1)
|-
| Reals || UTR || negative || 0 || 2 || 0 || 0
|-
| Randoms || UTR || arbitrary negative || 0 || 10 || 0 || 0
|-
| Randoms || UTR || alternate negative || 0 || 10 || 0 || 0
|-
| Reals || Core || negative || 0 || 2 || 0 || 0
|-
| Randoms || Core || arbitrary negative || 0 || 10 || 0 || 0
|-
| Randoms || Core || alternate negative || 0 || 10 || 0 || 0
|-
| Reals || Core || positive || 0 || 2 || 0 || 0
|-
| Randoms || Core || arbitrary positive || 0 || 10 || 0 || 0
|-
| Randoms || Core || alternate positive || 0 || 10 || 0 || 0
|-
| Reals || Proximal || negative || 0 || 2 || 0 || 0
|-
| Randoms || Proximal || arbitrary negative || 0 || 10 || 0 || 0
|-
| Randoms || Proximal || alternate negative || 0 || 10 || 0 || 0
|-
| Reals || Proximal || positive || 0 || 2 || 0 || 0
|-
| Randoms || Proximal || arbitrary positive || 0 || 10 || 0 || 0
|-
| Randoms || Proximal || alternate positive || 0 || 10 || 0 || 0
|-
| Reals || Distal || negative || 0 || 2 || 0 || 0
|-
| Randoms || Distal || arbitrary negative || 0 || 10 || 0 || 0
|-
| Randoms || Distal || alternate negative || 0 || 10 || 0 || 0
|-
| Reals || Distal || positive || 0 || 2 || 0 || 0
|-
| Randoms || Distal || arbitrary positive || 0 || 10 || 0 || 0
|-
| Randoms || Distal || alternate positive || 0 || 10 || 0 || 0
|}
Comparison:
The occurrences of real responsive element consensus sequences are greater than the randoms. This suggests that the real responsive element consensus sequences are likely active or activable.


==Acknowledgements==
==Acknowledgements==
Line 204: Line 419:


<!-- footer categories -->
<!-- footer categories -->
[[Category:Resources last modified in February 2021]]

Latest revision as of 04:32, 4 September 2023

Editor-In-Chief: Henry A. Hoff

File:RhodeusSericeusBitterlingMaleSpawingColors.JPG
free media}}

"In melanocytes and in melanoma cells, cyclic AMP (cAMP)-elevating agents stimulate melanogenesis and increase the transcription of tyrosinase, the rate-limiting enzyme in melanin synthesis. However, two other enzymes, tyrosinase-related protein 1 (TRP1) and TRP2, are required for a normal melanization process leading to eumelanin synthesis. In B16 melanoma cells, we demonstrated that stimulation of melanogenesis by cAMP-elevating agents results in an increase in tyrosinase, TRP1, and TRP2 expression. cAMP, through a cAMP-dependent protein kinase pathway, stimulates TRP1 and TRP2 promoter activities in both B16 mouse melanoma cells and normal human melanocytes. Regulation of the TRP1 and TRP2 promoters by cAMP involves a M box and an E box."[1]

"[M]icrophthalmia, a basic helix-loop-helix transcription factor, strongly stimulates the transcriptional activities of the TRP1 and TRP2 promoters, mainly through binding to the M boxes."[1]

"In mammals, pigmentation results from the synthesis and distribution of melanin in the skin, hair bulbs, and eyes. Melanin synthesis (melanogenesis) takes place in the melanocyte after differentiation of the nonpigmented precursor, the melanoblast (27). Three melanocyte-specific enzymes, tyrosinase, tyrosinase-related protein 1 (TRP1), and TRP2, are involved in this enzymatic process that converts tyrosine to melanin pigments. Although these proteins have similar structures and features, they are expressed by different genes and possess distinct enzymatic activities. Tyrosinase, encoded by the albino locus of the mouse, catalyzes the conversion of tyrosine to 3,4-dihydroxyphenylalanine (DOPA) and of DOPA to DOPA quinone (14, 25, 31). TRP2, encoded by the mouse slaty locus, possesses a Dopachrome tautomerase activity, converting the Dopachrome to 5,6-dihydroxyindole-2-carboxylic acid (DHICA) (3, 19, 42). TRP1, which has been mapped in mouse to the brown locus, catalyzes the oxidation of DHICA to indole-5,6-quinone-2-carboxylic acid (21, 24)."[1]

"In the TRP1 promoter, the M box (GTCATGTGCT) [is] located between bp −44 and −33 upstream from the initiation start site [...] and the E box (CAAGTG) [is] located between bp −238 and −233 [...] In the TRP2 promoter, the M box (GTCATGTGCT) [is] located between bp −135 and −129 upstream from the initiation start site [...] the E box (CACATG) [is] between bp −346 and −340 [and] the cAMP response element (CRE; TGAGGTCA) [is] located between bp −239 and −232 [...]."[1]

The "regulation of TRP1 gene expression by PKA in B16 melanoma cells involves the M box just upstream of the TATA box."[1]

Human genes

Gene ID: 1638 is DCT dopachrome tautomerase, aka TRP-2; TYRP2.[2]

  1. NP_001123361.1 L-dopachrome tautomerase isoform 2 precursor: "Transcript Variant: This variant (2) includes two alternate in-frame exons and is predicted to encode a slightly longer protein isoform (2) compared to isoform 1."[2]
  2. NP_001309111.1 L-dopachrome tautomerase isoform 3.[2]
  3. NP_001309112.1 L-dopachrome tautomerase isoform 3.[2]
  4. NP_001309113.1 L-dopachrome tautomerase isoform 3.[2]
  5. NP_001309114.1 L-dopachrome tautomerase isoform 3.[2]
  6. NP_001309115.1 L-dopachrome tautomerase isoform 4.[2]
  7. NP_001913.2 L-dopachrome tautomerase isoform 1 precursor: "Transcript Variant: This variant (1) represents the more abundant transcript."[2]

Gene ID: 7306 is TYRP1 tyrosinase related protein 1: "This gene encodes a melanosomal enzyme that belongs to the tyrosinase family and plays an important role in the melanin biosynthetic pathway. Defects in this gene are the cause of rufous oculocutaneous albinism and oculocutaneous albinism type III."[3]

Consensus sequences

"Tyrosinase and TRP1 promoters share an 11-bp motif (AGTCATGTGCT) termed the M box located upstream of the TATA box. This motif binds microphthalmia, a basic helix-loop-helix transcription factor that increases tyrosinase and TRP1 promoter activities, thereby playing a key role in the tissue-specific expression of these genes (11, 29, 40). In the TRP2 promoter, a homologous sequence (GTCATGTGCT) is also found upstream of the TATA box (41)."[1]

The M box consensus sequence GTCATGTGCT[1] does not occur on either side of A1BG. The random datasets had only one occurrence GTCATGTGCT at 1977 in the distal promoter.

M-box consensus sequence is GGTCATGTGCT.[4] This contains the core consensus sequence GTCATGTGCT.[1]

"The conserved region contains a consensus M-box element (TCACATGA) for binding of MITF. This MITF binding site is aligned and conserved between at least 11 different species [...]. The clear conservation of these elements suggests that gpnmb has similar regulation in all mammals."[5]

M box (Bertolotto) samplings

Copying a responsive elements consensus sequence GTCATGTGCT and putting the sequence in "⌘F" finds none between ZNF497 and A1BG or none between ZSCAN22 and A1BG as can be found by the computer programs.

For the Basic programs testing consensus sequence GTCATGTGCT (starting with SuccessablesMbox.bas) written to compare nucleotide sequences with the sequences on either the template strand (-), or coding strand (+), of the DNA, in the negative direction (-), or the positive direction (+), the programs are, are looking for, and found:

  1. negative strand, negative direction, looking for GTCATGTGCT, 0.
  2. positive strand, negative direction, looking for GTCATGTGCT, 0.
  3. positive strand, positive direction, looking for GTCATGTGCT, 0.
  4. negative strand, positive direction, looking for GTCATGTGCT, 0.
  5. inverse complement, negative strand, negative direction, looking for AGCACATGAC, 0.
  6. inverse complement, positive strand, negative direction, looking for AGCACATGAC, 0.
  7. inverse complement, positive strand, positive direction, looking for AGCACATGAC, 0.
  8. inverse complement, negative strand, positive direction, looking for AGCACATGAC, 0.

Bertolotto random dataset samplings

  1. Mboxr0: 0.
  2. Mboxr1: 0.
  3. Mboxr2: 0.
  4. Mboxr3: 0.
  5. Mboxr4: 0.
  6. Mboxr5: 0.
  7. Mboxr6: 0.
  8. Mboxr7: 0.
  9. Mboxr8: 0.
  10. Mboxr9: 1, GTCATGTGCT at 1977.
  11. Mboxr0ci: 0.
  12. Mboxr1ci: 0.
  13. Mboxr2ci: 0.
  14. Mboxr3ci: 0.
  15. Mboxr4ci: 0.
  16. Mboxr5ci: 0.
  17. Mboxr6ci: 0.
  18. Mboxr7ci: 0.
  19. Mboxr8ci: 0.
  20. Mboxr9ci: 0.

Mboxr distal promoters

  1. Mboxr9: GTCATGTGCT at 1977.

M-box (Hoek) samplings

Copying a responsive elements consensus sequence (T/N)CA(C/T)(A/G)TG(A/N) and putting the sequence in "⌘F" finds none between ZNF497 and A1BG or none between ZSCAN22 and A1BG as can be found by the computer programs.

For the Basic programs testing consensus sequence (T/N)CA(C/T)(A/G)TG(A/N) (starting with SuccessablesMHbox.bas) written to compare nucleotide sequences with the sequences on either the template strand (-), or coding strand (+), of the DNA, in the negative direction (-), or the positive direction (+), the programs are, are looking for, and found:

  1. negative strand, negative direction, looking for (T/N)CA(C/T)(A/G)TG(A/N), 2, ACACATGG at 798, TCACATGA at 325.
  2. positive strand, negative direction, looking for (T/N)CA(C/T)(A/G)TG(A/N), 2, GCACATGC at 2668, CCATATGT at 42.
  3. positive strand, positive direction, looking for (T/N)CA(C/T)(A/G)TG(A/N), 5, CCACGTGG at 3885, ACACGTGT at 2962, GCACGTGT at 1220, GCATGTGC at 568, GCACGTGT at 548.
  4. negative strand, positive direction, looking for (T/N)CA(C/T)(A/G)TG(A/N), 7, ACATGTGC at 3959, TCACATGT at 3957, CCATGTGA at 3903, GCACATGT at 3743, CCACATGA at 3708, CCACATGG at 2032, ACACGTGC at 571.

Inverse complement is the same as the first four.

MHbox distal promoters

  1. Negative strand, negative direction: TCACATGA at 325.
  1. Negative strand, positive direction: TCACATGT at 3957, CCATGTGA at 3903, CCACATGA at 3708.

MHbox random dataset samplings

  1. MHboxr0: 1, TCATATGG at 2152.
  2. MHboxr1: 3, GCATGTGC at 3892, ACATATGG at 2102, GCATATGC at 1606.
  3. MHboxr2: 3, GCACATGC at 3735, ACATATGA at 1758, CCATGTGA at 167.
  4. MHboxr3: 3, GCACGTGT at 3770, TCATGTGT at 1135, CCATGTGA at 94.
  5. MHboxr4: 5, ACACGTGG at 2288, GCATGTGA at 2244, CCATGTGA at 2225, TCATATGA at 1686, CCACATGC at 910.
  6. MHboxr5: 2, CCATATGA at 1780, ACACGTGA at 60.
  7. MHboxr6: 6, CCACGTGT at 2906, CCACATGA at 2602, GCATGTGA at 2331, TCATGTGA at 1343, GCACGTGC at 655, CCATATGA at 246.
  8. MHboxr7: 1, GCACGTGA at 1857.
  9. MHboxr8: 0.
  10. MHboxr9: 3, ACACATGC at 4072, TCATGTGC at 1976, GCACGTGC at 1188.

RDr arbitrary (evens) (4560-2846) UTRs

  1. MHboxr2: GCACATGC at 3735.

MHboxr alternate (odds) (4560-2846) UTRs

  1. MHboxr1: GCATGTGC at 3892.
  2. MHboxr3: GCACGTGT at 3770.

RDr arbitrary negative direction (evens) (2846-2811) core promoters

RDr alternate negative direction (odds) (2846-2811) core promoters

RDr arbitrary positive direction (odds) (4445-4265) core promoters

RDr alternate positive direction (evens) (4445-4265) core promoters

RDr arbitrary negative direction (evens) (2811-2596) proximal promoters

RDr alternate negative direction (odds) (2811-2596) proximal promoters

RDr arbitrary positive direction (odds) (4265-4050) proximal promoters

RDr alternate positive direction (evens) (4265-4050) proximal promoters

MHboxr arbitrary negative direction (evens) (2596-1) distal promoters

  1. MHboxr0: TCATATGG at 2152.
  2. MHboxr2: ACATATGA at 1758, CCATGTGA at 167.

MHboxr alternate negative direction (odds) (2596-1) distal promoters

  1. MHboxr1: ACATATGG at 2102, GCATATGC at 1606.
  2. MHboxr3: TCATGTGT at 1135, CCATGTGA at 94.

MHboxr arbitrary positive direction (odds) (4050-1) distal promoters

  1. MHboxr1: GCATGTGC at 3892, ACATATGG at 2102, GCATATGC at 1606.
  2. MHboxr3: GCACGTGT at 3770, TCATGTGT at 1135, CCATGTGA at 94.

MHboxr alternate positive direction (evens) (4050-1) distal promoters

  1. MHboxr0: TCATATGG at 2152.
  2. MHboxr2: GCACATGC at 3735, ACATATGA at 1758, CCATGTGA at 167.

MHboxr proximal promoters

  1. MHboxr6: CCACATGA at 2602.

MHboxr distal promoters

  1. MHboxr0: TCATATGG at 2152.
  2. MHboxr2: ACATATGA at 1758, CCATGTGA at 167.
  3. MHboxr4: GCATGTGA at 2244, CCATGTGA at 2225, TCATATGA at 1686.
  4. MHboxr6: GCATGTGA at 2331, TCATGTGA at 1343, CCATATGA at 246.


  1. MHboxr3: TCATGTGT at 1135, CCATGTGA at 94.
  2. MHboxr5: CCATATGA at 1780, ACACGTGA at 60.
  3. MHboxr7: GCACGTGA at 1857.
  4. MHboxr9: TCATGTGC at 1976.

M-box (Hoek) analysis and results

Consensus sequences: TCAYRTG or CAYRTGA, TCA(C/T)(A/G)TG or CA(C/T)(A/G)TGA[6] ~ (T/N)CA(C/T)(A/G)TG(A/N).

Reals or randoms Promoters direction Numbers Strands Occurrences Averages (± 0.1)
Reals UTR negative 0 2 0 0
Randoms UTR arbitrary negative 0 10 0 0
Randoms UTR alternate negative 0 10 0 0
Reals Core negative 0 2 0 0
Randoms Core arbitrary negative 0 10 0 0
Randoms Core alternate negative 0 10 0 0
Reals Core positive 0 2 0 0
Randoms Core arbitrary positive 0 10 0 0
Randoms Core alternate positive 0 10 0 0
Reals Proximal negative 0 2 0 0
Randoms Proximal arbitrary negative 1 10 0.1 0.05
Randoms Proximal alternate negative 0 10 0 0
Reals Proximal positive 0 2 0 0
Randoms Proximal arbitrary positive 0 10 0 0
Randoms Proximal alternate positive 0 10 0 0
Reals Distal negative 1 2 0.5 0.5 ± 0.5 (--1,+-0)
Randoms Distal arbitrary negative 11 10 1.1 0.85 ± 0.25
Randoms Distal alternate negative 6 10 0.6 0.85 ± 0.25
Reals Distal positive 3 2 1.5 1.5 ± 0.5 (-+3,++0)
Randoms Distal arbitrary positive 0 10 0 0
Randoms Distal alternate positive 0 10 0 0

Comparison:

The occurrences of real responsive element consensus sequences are greater than the randoms. This suggests that the real responsive element consensus sequences are likely active or activable.

Using a more general M-box consensus of (T/N)CA(C/T)(A/G)TG(A/N) yielded four sequences in the negative direction and twelve in the positive direction. Of these only TCACATGA at 325 in the negative direction and TCACATGT at 3957, CCATGTGA at 3903, and CCACATGA at 3708 in the positive direction conform to TCAYRTG or CAYRTGA[6]. The random datasets had 25 occurrences of the general consensus but only fifteen fit TCAYRTG or CAYRTGA[6], nine in the arbitrary negative direction and six in the positive direction. The disparity between real occurrences and random occurrences suggests that the real occurrences are likely active or can be activated.

M-box (Ripoll) samplings

Copying a responsive elements consensus sequence TCACATGA and putting the sequence in "⌘F" finds none between ZNF497 and A1BG or none between ZSCAN22 and A1BG as can be found by the computer programs.

For the Basic programs testing consensus sequence TCACATGA (starting with SuccessablesM-box.bas) written to compare nucleotide sequences with the sequences on either the template strand (-), or coding strand (+), of the DNA, in the negative direction (-), or the positive direction (+), the programs are, are looking for, and found:

  1. negative strand, negative direction, looking for TCACATGA, 1, TCACATGA at 325.
  2. positive strand, negative direction, looking for TCACATGA, 0.
  3. positive strand, positive direction, looking for TCACATGA, 0.
  4. negative strand, positive direction, looking for TCACATGA, 0.
  5. inverse complement, negative strand, negative direction, looking for TCATGTGA, 0.
  6. inverse complement, positive strand, negative direction, looking for TCATGTGA, 0.
  7. inverse complement, positive strand, positive direction, looking for TCATGTGA, 0.
  8. inverse complement, negative strand, positive direction, looking for TCATGTGA, 0.

M-box distal promoters

Negative strand, negative direction: TCACATGA at 325.

M-box random dataset samplings

  1. M-boxr0: 0.
  2. M-boxr1: 0.
  3. M-boxr2: 0.
  4. M-boxr3: 0.
  5. M-boxr4: 0.
  6. M-boxr5: 0.
  7. M-boxr6: 0.
  8. M-boxr7: 0.
  9. M-boxr8: 0.
  10. M-boxr9: 0.
  11. M-boxr0ci: 0.
  12. M-boxr1ci: 0.
  13. M-boxr2ci: 0.
  14. M-boxr3ci: 0.
  15. M-boxr4ci: 0.
  16. M-boxr5ci: 0.
  17. M-boxr6ci: 0.
  18. M-boxr7ci: 0.
  19. M-boxr8ci: 0.
  20. M-boxr9ci: 0.

M-box (Ripoll) analysis and results

The M-box with the consensus sequence TCACATGA[5] occurred only once.

Reals or randoms Promoters direction Numbers Strands Occurrences Averages (± 0.1)
Reals UTR negative 0 2 0 0
Randoms UTR arbitrary negative 0 10 0 0
Randoms UTR alternate negative 0 10 0 0
Reals Core negative 0 2 0 0
Randoms Core arbitrary negative 0 10 0 0
Randoms Core alternate negative 0 10 0 0
Reals Core positive 0 2 0 0
Randoms Core arbitrary positive 0 10 0 0
Randoms Core alternate positive 0 10 0 0
Reals Proximal negative 0 2 0 0
Randoms Proximal arbitrary negative 0 10 0 0
Randoms Proximal alternate negative 0 10 0 0
Reals Proximal positive 0 2 0 0
Randoms Proximal arbitrary positive 0 10 0 0
Randoms Proximal alternate positive 0 10 0 0
Reals Distal negative 0 2 0 0
Randoms Distal arbitrary negative 0 10 0 0
Randoms Distal alternate negative 0 10 0 0
Reals Distal positive 0 2 0 0
Randoms Distal arbitrary positive 0 10 0 0
Randoms Distal alternate positive 0 10 0 0

Comparison:

The occurrences of real responsive element consensus sequences are greater than the randoms. This suggests that the real responsive element consensus sequences are likely active or activable.

Acknowledgements

The content on this page was first contributed by: Henry A. Hoff.

Initial content for this page in some instances came from Wikiversity.

See also

References

  1. 1.0 1.1 1.2 1.3 1.4 1.5 1.6 1.7 Corine Bertolotto, Roser Buscà, Patricia Abbe, Karine Bille, Edith Aberdam, Jean-Paul Ortonne, and Robert Ballotti (February 1998). "Different cis-Acting Elements Are Involved in the Regulation of TRP1 and TRP2 Promoter Activities by Cyclic AMP: Pivotal Role of M Boxes (GTCATGTGCT) and of Microphthalmia". Molecular and Cellular Biology. 18 (2): 694–702. PMID 9447965. Retrieved 8 December 2018.
  2. 2.0 2.1 2.2 2.3 2.4 2.5 2.6 2.7 HGNC (21 December 2019). "DCT dopachrome tautomerase [ Homo sapiens (human) ]". 8600 Rockville Pike, Bethesda MD, 20894 USA: National Center for Biotechnology Information, U.S. National Library of Medicine. Retrieved 29 January 2020.
  3. RefSeq (March 2009). "TYRP1 tyrosinase related protein 1 [ Homo sapiens (human) ]". 8600 Rockville Pike, Bethesda MD, 20894 USA: National Center for Biotechnology Information, U.S. National Library of Medicine. Retrieved 29 January 2020.
  4. Yuanyuan Zhao, Jinzhu Meng, Guoqing Cao, Pengfei Gao & Changsheng Dong (28 August 2018). "Screening the optimal activity region of the dopachrome tautomerase gene promoter in sheep skin melanocytes". Journal of Applied Animal Research. 46 (1): 1382–1388. doi:10.1080/09712119.2018.1512497. Retrieved 6 August 2021.
  5. 5.0 5.1 Vera M. Ripoll, Nicholas A. Meadows, Liza-Jane Raggatt, Ming K. Chang, Allison R. Pettit, Alan I. Cassady and David A. Hume (30 April 2005). "Microphthalmia transcription factor regulates the expression of the novel osteoclast factor GPNMB" (PDF). Gene. 413 (1–2): 32–41. doi:10.1016/j.gene.2008.01.014. Retrieved 18 March 2021.
  6. 6.0 6.1 6.2 Keith S. Hoek, Natalie C. Schlegel, Ossia M. Eichhoff, Daniel S. Widmer, Christian Praetorius, Steingrimur O. Einarsson, Sigridur Valgeirsdottir, Kristin Bergsteinsdottir, Alexander Schepsky, Reinhard Dummer, Eirikur Steingrimsson (11 November 2008). "Novel MITF targets identified using a two-step DNA microarray strategy". Pigment Cell & Melanoma Research. 21 (6): 665–676. doi:10.1111/j.1755-148X.2008.00505.x. Retrieved 15 December 2022.

External links