C box gene transcriptions: Difference between revisions

Revision as of 17:19, 27 June 2021

Associate Editor(s)-in-Chief: Henry A. Hoff

GAGGCCATCT is a C-box, [...].^[1]

"Members of the box C/D snoRNA family, which are the subject of the present report, possess characteristic sequence elements known as box C (UGAUGA) and box D (GUCUGA)."^[2]

The human ribosomal protein L11 gene (HRPL11) has [...] two potential snRNA-coding sequences in intron 4: the C box beginning at +4131 (GGTGATG), [...] a D box beginning at +4237 (TCCTG), [...].^[3]

Analysis "of the recombinant (soybean [Glycine max] TGACG-motif binding factor 1) STF1 protein revealed the C-box (nGACGTCn) to be a high-affinity binding site (Cheong et al., 1998)."^[4]

Hypotheses

The C boxes are not involved in the transcription of A1BG.

Johnson C-box samplings

For the Basic programs SuccessablesCJbox.bas written to compare nucleotide sequences with the sequences on either the template strand (-), or coding strand (+), of the DNA, in the negative direction (-), or the positive direction (+), the programs are, are looking for, and found:

negative strand, negative direction, looking for 5'-GAGGCCATCT-3'^[1], 0.
negative strand, positive direction, looking for 5'-GAGGCCATCT-3', 0.
positive strand, negative direction, looking for 5'-GAGGCCATCT-3', 0.
positive strand, positive direction, looking for 5'-GAGGCCATCT-3', 0.
complement, negative strand, negative direction, looking for 5'-CTCCGGTAGA-3', 0.
complement, negative strand, positive direction, looking for 5'-CTCCGGTAGA-3', 0.
complement, positive strand, negative direction, looking for 5'-CTCCGGTAGA-3', 0.
complement, positive strand, positive direction, looking for 5'-CTCCGGTAGA-3', 0.
inverse complement, negative strand, negative direction, looking for 5'-AGATGGCCTC-3', 0.
inverse complement, negative strand, positive direction, looking for 5'-AGATGGCCTC-3', 0.
inverse complement, positive strand, negative direction, looking for 5'-AGATGGCCTC-3', 0.
inverse complement, positive strand, positive direction, looking for 5'-AGATGGCCTC-3', 0.
inverse, negative strand, negative direction, looking for 5'-TCTACCGGAG-3', 0.
inverse, negative strand, positive direction, looking for 5'-TCTACCGGAG-3', 0.
inverse, positive strand, negative direction, looking for 5'-TCTACCGGAG-3', 0.
inverse, positive strand, positive direction, looking for 5'-TCTACCGGAG-3', 0.

CJbox random dataset samplings

CJboxr0: 0.
CJboxr1: 0.
CJboxr2: 0.
CJboxr3: 0.
CJboxr4: 0.
CJboxr5: 0.
CJboxr6: 0.
CJboxr7: 0.
CJboxr8: 0.
CJboxr9: 0.
CJboxr0ci: 0.
CJboxr1ci: 0.
CJboxr2ci: 0.
CJboxr3ci: 0.
CJboxr4ci: 0.
CJboxr5ci: 0.
CJboxr6ci: 0.
CJboxr7ci: 0.
CJboxr8ci: 0.
CJboxr9ci: 0.

CJboxr UTRs

CJboxr core promoters

CJboxr proximal promoters

CJboxr distal promoters

snoRNA C box

File:RF00071.jpg

This example of a C/D box is a small nucleolar RNA 73 (snoRNA U73). Credit: Rfam database (RF00071).{{free media}}

File:U14 snoRNA.png

This U14 snoRNA from Saccharomyces cerevisiae shows structure and genomic organization. Credit: Dmitry A.Samarsky, Maurille J.Fournier, Robert H.Singer and Edouard Bertrand.{{fairuse}}

For "box C/D snoRNAs, boxes C and D and an adjoining stem form a vital structure, known as the box C/D motif."^[2]

"The [C and D] box elements are essential for snoRNA production [transcription] and for snoRNA-directed modification of rRNA nucleotides."^[2]

The "motif is necessary and sufficient for nucleolar targeting, both in yeast and mammals. Moreover, in mammalian cells, RNA is targeted to coiled bodies as well. Thus, the box C/D motif is the first intranuclear RNA trafficking signal identified for an RNA family. Remarkably, it also couples snoRNA localization with synthesis and, most likely, function. The distribution of snoRNA precursors in mammalian cells suggests that this coupling is provided by a specific protein(s) which binds the box C/D motif during or rapidly after snoRNA transcription."^[2]

In snoRNA U73 on the right, the C box starting from the left side of the stem consists of nucleotides: ARUGAUGA, and from the right side the D box is AGUCY. In 5' to 3' direction, the D box is YCUGA.

Shown in the second image on the right are the C box (3'-AGUAGU-5'). Substituting T for U yields C box = 3'-AGTAGT-5' in the transcription direction on the template strand.

Samarsky C box samplings

For the Basic programs (starting with SuccessablesCbox.bas or SuccessablesDbox.bas) written to compare nucleotide sequences with the sequences on either the template strand (-), or coding strand (+), of the DNA, in the negative direction (-), or the positive direction (+), the programs are, are looking for, and found:

negative strand in the negative direction (from ZSCAN22 to A1BG) is SuccessablesCbox--.bas, looking for 5'-AGTAGT-3'^[2], 4, 5'-AGTAGT-3', 2888, 5'-AGTAGT-3', 2944, 5'-AGTAGT-3', 3418, 5'-AGTAGT-3', 3521,
negative strand in the positive direction (from ZNF497 to A1BG) is SuccessablesCbox-+.bas, looking for 5'-AGTAGT-3', 0,
positive strand in the negative direction is SuccessablesCbox+-.bas, looking for 5'-AGTAGT-3', 0,
positive strand in the positive direction is SuccessablesCbox++.bas, looking for 5'-AGTAGT-3', 1, 5'-AGTAGT-3', 3251,
complement, negative strand, negative direction is SuccessablesCboxc--.bas, looking for 5'-TCATCA-3', 0,
complement, negative strand, positive direction is SuccessablesCboxc-+.bas, looking for 5'-TCATCA-3', 1, 5'-TCATCA-3', 3251,
complement, positive strand, negative direction is SuccessablesCboxc+-.bas, looking for 5'-TCATCA-3', 4, 5'-TCATCA-3', 2888, 5'-TCATCA-3', 2944, 5'-TCATCA-3', 3418, 5'-TCATCA-3', 3521,
complement, positive strand, positive direction is SuccessablesCboxc++.bas, looking for 5'-TCATCA-3', 0,
inverse complement, negative strand, negative direction is SuccessablesCboxci--.bas, looking for 5'-ACTACT-3', 0,
inverse complement, negative strand, positive direction is SuccessablesCboxci-+.bas, looking for 5'-ACTACT-3', 0,
inverse complement, positive strand, negative direction is SuccessablesCboxci+-.bas, looking for 5'-ACTACT-3', 0,
inverse complement, positive strand, positive direction is SuccessablesCboxci++.bas, looking for 5'-ACTACT-3', 1, 5'-ACTACT-3' at 2144.
inverse, negative strand, negative direction, is SuccessablesCboxi--.bas, looking for 5'-TGATGA-3', 0,
inverse, negative strand, positive direction, is SuccessablesCboxi-+.bas, looking for 5'-TGATGA-3', 1, 5'-TGATGA-3', 2144,
inverse, positive strand, negative direction, is SuccessablesCboxi+-.bas, looking for 5'-TGATGA-3', 0,
inverse, positive strand, positive direction, is SuccessablesCboxi++.bas, looking for 5'-TGATGA-3', 0.

C box S UTRs

Negative strand, negative direction: AGTAGT at 3521, AGTAGT at 3418, AGTAGT at 2944, AGTAGT at 2888.

C box S distal promoters

Negative strand, positive direction: TGATGA at 2144.

Positive strand, positive direction: AGTAGT at 3251.

Random dataset samplings

RDr0: 0.
RDr1: 0.
RDr2: 0.
RDr3: 0.
RDr4: 0.
RDr5: 0.
RDr6: 0.
RDr7: 0.
RDr8: 0.
RDr9: 0.
RDr0ci: 0.
RDr1ci: 0.
RDr2ci: 0.
RDr3ci: 0.
RDr4ci: 0.
RDr5ci: 0.
RDr6ci: 0.
RDr7ci: 0.
RDr8ci: 0.
RDr9ci: 0.

RDr UTRs

RDr core promoters

RDr proximal promoters

RDr distal promoters

Voronina C box samplings

For the Basic programs starting with SuccessablesCVbox.bas written to compare nucleotide sequences with the sequences on either the template strand (-), or coding strand (+), of the DNA, in the negative direction (-), or the positive direction (+), the programs are, are looking for, and found:

negative strand, negative direction, looking for 5'-GGTGATG-3'^[3], 0.
negative strand, positive direction, looking for 5'-GGTGATG-3', 0.
positive strand, negative direction, looking for 5'-GGTGATG-3', 1, 5'-GGTGATG-3' at 3798.
positive strand, positive direction, looking for 5'-GGTGATG-3', 0.
complement, negative strand, negative direction, looking for 5'-CCACTAC-3', 1, 5'-CCACTAC-3' at 3798.
complement, negative strand, positive direction, looking for 5'-CCACTAC-3', 0.
complement, positive strand, negative direction, looking for 5'-CCACTAC-3', 0.
complement, positive strand, positive direction, looking for 5'-CCACTAC-3', 0.
inverse complement, negative strand, negative direction, looking for 5'-CATCACC-3', 0.
inverse complement, negative strand, positive direction, looking for 5'-CATCACC-3', 0.
inverse complement, positive strand, negative direction, looking for 5'-CATCACC-3', 0.
inverse complement, positive strand, positive direction, looking for 5'-CATCACC-3', 0.
inverse, negative strand, negative direction, looking for 5'-GTAGTGG-3', 0.
inverse, negative strand, positive direction, looking for 5'-GTAGTGG-3', 0.
inverse, positive strand, negative direction, looking for 5'-GTAGTGG-3', 0.
inverse, positive strand, positive direction, looking for 5'-GTAGTGG-3', 0.

Voronina C box UTRs

Positive strand, negative direction: GGTGATG at 3798.

Random dataset samplings

RDr0: 0.
RDr1: 0.
RDr2: 0.
RDr3: 0.
RDr4: 0.
RDr5: 0.
RDr6: 0.
RDr7: 0.
RDr8: 0.
RDr9: 0.
RDr0ci: 0.
RDr1ci: 0.
RDr2ci: 0.
RDr3ci: 0.
RDr4ci: 0.
RDr5ci: 0.
RDr6ci: 0.
RDr7ci: 0.
RDr8ci: 0.
RDr9ci: 0.

RDr UTRs

RDr core promoters

RDr proximal promoters

RDr distal promoters

Song C-boxes

Analysis "of the recombinant (soybean [Glycine max] TGACG-motif binding factor 1) STF1 protein revealed the C-box (nGACGTCn) to be a high-affinity binding site (Cheong et al., 1998). The HY5 protein interacts with both the G- (CACGTG) and Z- (ATACGTGT) boxes of the light-regulated promoter of RbcS1A (ribulose bisphosphate carboxylase small subunit) and the CHS (chalcone synthase) genes (Ang et al., 1998; Chattopadhyay et al., 1998; Yadav et al., 2002). To test whether STF1 and HY5 have similar DNA-binding properties, the binding properties of each were compared with eight different DNA sequences that represent G-, C-, and C/G-box motifs [TGACGTGT]. C-box sequences carrying the mammalian cAMP responsive element (CRE; TGACGTCA) motif and the Hex sequence (TGACGTGGC), a hybrid C/G-box (Cheong et al., 1998), were high-affinity binding sites for both proteins [...]. No binding or limited binding was observed to as-1 (Lam et al., 1989), nos-1 (Lam et al., 1990), or the AP-1 site (TGACTCA; Kim et al., 1993). Binding to the palindromic G-box (PA G-box, GCCACGTGGC) was moderate. However, binding activity to the G-box of the light-responsive unit 1 (U1) region of the parsley (Petroselinum crispum) CHS promoter (CHS-U1: TCCACGTGGC; Schulze-Lefert et al., 1989) or the G-box of GmAux28 (TCCACGTGTC) was much weaker than to the PA G-box [...]."^[4]

The "binding affinities of both bZIP proteins were similar to CRE^A/T (ATGACGTCAT), a CRE sequence with flanking adenine and thymine (A/T) at positions -4 and +4. [The] bZIP domains of both STF1 and HY5 have similar binding properties for recognizing ACGT-containing elements (ACEs). [Although] the G-box is a known target site for the HY5 protein, the C-box sequences are the preferred binding sites for both STF1 and HY5."^[4]

"When analyzed by type of ACE, these sequences can be grouped into four subclasses [...]: C-box, where the C residue comes at the 12 position; a hybrid C/G- box (C/G-box), with G at the 12 position; C/A-box [TGACGTAT], with A at the 12 position; and C/T-box, with T at the 12 position. The C-box subclass contains the largest number of selected binding sites for STF1 (38% at 50 mM KCl and 48% at 150 mM), followed by the C/G- (25.3%) and the C/A-boxes (26%). Only a small number of C/T-boxes [TGACGTTA] (4/100) and non-TGACGT sequences (4/100) were selected."^[4]

C-boxes are TCTTACGTCATC, AATGACGTCGAA, TCTCACGTGTGG, TTTGACGTGTGA, GATGACGTCATC, and AGAGACGTCAAC for an apparent consensus sequence of (A/G/T)(A/C/G/T)(A/T)(C/G/T)ACGT(C/G)(A/G/T)(A/G/T)(A/C/G).^[4]

Song C-box samplings

For the Basic programs starting with SuccessablesC-box.bas written to compare nucleotide sequences with the sequences on either the template strand (-), or coding strand (+), of the DNA, in the negative direction (-), or the positive direction (+), the programs are, are looking for, and found:

negative strand, negative direction, looking for GACGTC^[4], 1, GACGTC at 4316.
negative strand, positive direction, looking for GACGTC, 0.
positive strand, negative direction, looking for GACGTC, 0,
positive strand, positive direction, looking for GACGTC, 9, GACGTC at 4316, GACGTC at 3280, GACGTC at 3231, GACGTC at 2858, GACGTC at 1506, GACGTC at 1120, GACGTC at 532, GACGTC at 437, GACGTC at 193.
inverse complement is the same as the direct consensus sequence.
complement, negative strand, negative direction, looking for 5'-CTGCAG-3', 0,
complement, negative strand, positive direction, looking for 5'-CTGCAG-3', 9, 5'-CTGCAG-3' at 193, 5'-CTGCAG-3' at 437, 5'-CTGCAG-3' at 532, 5'-CTGCAG-3' at 1120, 5'-CTGCAG-3' at 1506, 5'-CTGCAG-3' at 2858, 5'-CTGCAG-3' at 3231, 5'-CTGCAG-3' at 3280, 5'-CTGCAG-3' at 4316.
complement, positive strand, negative direction, looking for 5'-CTGCAG-3', 1, 5'-CTGCAG-3' at 4316.
complement, positive strand, positive direction, looking for 5'-CTGCAG-3', 0.

Song C-box UTRs

Negative strand, negative direction: GACGTC at 4316.

Song C-box core promoters

Positive strand, positive direction: GACGTC at 4316.

Song C-box distal promoters

Positive strand, positive direction: GACGTC at 3280, GACGTC at 3231, GACGTC at 2858, GACGTC at 1506, GACGTC at 1120, GACGTC at 532, GACGTC at 437, GACGTC at 193.

Random dataset samplings

RDr0: 0.
RDr1: 0.
RDr2: 0.
RDr3: 0.
RDr4: 0.
RDr5: 0.
RDr6: 0.
RDr7: 0.
RDr8: 0.
RDr9: 0.
RDr0ci: 0.
RDr1ci: 0.
RDr2ci: 0.
RDr3ci: 0.
RDr4ci: 0.
RDr5ci: 0.
RDr6ci: 0.
RDr7ci: 0.
RDr8ci: 0.
RDr9ci: 0.

RDr UTRs

RDr core promoters

RDr proximal promoters

RDr distal promoters

Song C box hybrids

Hybrid C, A boxes

"When analyzed by type of ACE, these sequences can be grouped into four subclasses [...]: C-box, where the C residue comes at the 12 position; a hybrid C/G- box (C/G-box), with G at the 12 position; C/A-box [TGACGTAT], with A at the 12 position; and C/T-box, with T at the 12 position."^[4]

Hybrid C, G boxes

"To test whether STF1 and HY5 have similar DNA-binding properties, the binding properties of each were compared with eight different DNA sequences that represent G-, C-, and C/G-box motifs [TGACGTGT]. C-box sequences carrying the mammalian cAMP responsive element (CRE; TGACGTCA) motif and the Hex sequence (TGACGTGGC), a hybrid C/G-box (Cheong et al., 1998), were high-affinity binding sites for both proteins [...]."^[4]

Hybrid C, T boxes

"Only a small number of C/T-boxes [TGACGTTA] (4/100) and non-TGACGT sequences (4/100) were selected."^[4]

Song hybrid C box samplings

Hybrid C, A box samplings

Copying a portion of the consensus sequence for the hybrid C, A box of TGACGTAT and putting it in "⌘F" finds none located between ZSCAN22 and A1BG and none between ZNF497 and A1BG as can be found by the computer programs.

For the Basic programs SuccessablesCAbox.bas written to compare nucleotide sequences with the sequences on either the template strand (-), or coding strand (+), of the DNA, in the negative direction (-), or the positive direction (+), the programs are, are looking for, and found:

negative strand, negative direction, looking for 5'-TGACGTAT-3'^[4], 0.
negative strand, positive direction, looking for 5'-TGACGTAT-3', 0.
positive strand, negative direction, looking for 5'-TGACGTAT-3', 0.
positive strand, positive direction, looking for 5'-TGACGTAT-3', 0.
complement, negative strand, negative direction, looking for 5'-ACTGCATA-3', 0.
complement, negative strand, positive direction, looking for 5'-ACTGCATA-3', 0.
complement, positive strand, negative direction, looking for 5'-ACTGCATA-3', 0.
complement, positive strand, positive direction, looking for 5'-ACTGCATA-3', 0.
inverse complement, negative strand, negative direction, looking for 5'-ATACGTCA-3', 0.
inverse complement, negative strand, positive direction, looking for 5'-ATACGTCA-3', 0.
inverse complement, positive strand, negative direction, looking for 5'-ATACGTCA-3', 0.
inverse complement, positive strand, positive direction, looking for 5'-ATACGTCA-3', 0.
inverse, negative strand, negative direction, looking for 5'-TATGCAGT-3', 0.
inverse, negative strand, positive direction, looking for 5'-TATGCAGT-3', 0.
inverse, positive strand, negative direction, looking for 5'-TATGCAGT-3', 0.
inverse, positive strand, positive direction, looking for 5'-TATGCAGT-3', 0.

Random dataset samplings

RDr0: 0.
RDr1: 0.
RDr2: 0.
RDr3: 0.
RDr4: 0.
RDr5: 0.
RDr6: 0.
RDr7: 0.
RDr8: 0.
RDr9: 0.
RDr0ci: 0.
RDr1ci: 0.
RDr2ci: 0.
RDr3ci: 0.
RDr4ci: 0.
RDr5ci: 0.
RDr6ci: 0.
RDr7ci: 0.
RDr8ci: 0.
RDr9ci: 0.

RDr UTRs

RDr core promoters

RDr proximal promoters

RDr distal promoters

Hybrid C, G box samplings

Copying a portion of the consensus sequence for the hybrid C, G box of TGACGTGT and putting it in "⌘F" finds none located between ZSCAN22 and A1BG and none between ZNF497 and A1BG as can be found by the computer programs.

For the Basic programs SuccessablesCGbox.bas written to compare nucleotide sequences with the sequences on either the template strand (-), or coding strand (+), of the DNA, in the negative direction (-), or the positive direction (+), the programs are, are looking for, and found:

negative strand, negative direction, looking for 5'-TGACGTGT-3'^[4], 0.
negative strand, positive direction, looking for 5'-TGACGTGT-3', 0.
positive strand, negative direction, looking for 5'-TGACGTGT-3', 0.
positive strand, positive direction, looking for 5'-TGACGTGT-3', 0.
complement, negative strand, negative direction, looking for 5'-ACTGCACA-3', 0.
complement, negative strand, positive direction, looking for 5'-ACTGCACA-3', 0.
complement, positive strand, negative direction, looking for 5'-ACTGCACA-3', 0.
complement, positive strand, positive direction, looking for 5'-ACTGCACA-3', 0.
inverse complement, negative strand, negative direction, looking for 5'-ACACGTCA-3', 0.
inverse complement, negative strand, positive direction, looking for 5'-ACACGTCA-3', 0.
inverse complement, positive strand, negative direction, looking for 5'-ACACGTCA-3', 0.
inverse complement, positive strand, positive direction, looking for 5'-ACACGTCA-3', 1, 5'-ACACGTCA-3' at 3962.
inverse, negative strand, negative direction, looking for 5'-TGTGCAGT-3', 0.
inverse, negative strand, positive direction, looking for 5'-TGTGCAGT-3', 1, 5'-TGTGCAGT-3' at 3962.
inverse, positive strand, negative direction, looking for 5'-TGTGCAGT-3', 0.
inverse, positive strand, positive direction, looking for 5'-TGTGCAGT-3', 0.

Random dataset samplings

RDr0: 0.
RDr1: 0.
RDr2: 0.
RDr3: 0.
RDr4: 0.
RDr5: 0.
RDr6: 0.
RDr7: 0.
RDr8: 0.
RDr9: 0.
RDr0ci: 0.
RDr1ci: 0.
RDr2ci: 0.
RDr3ci: 0.
RDr4ci: 0.
RDr5ci: 0.
RDr6ci: 0.
RDr7ci: 0.
RDr8ci: 0.
RDr9ci: 0.

RDr UTRs

RDr core promoters

RDr proximal promoters

RDr distal promoters

Hybrid C, T box samplings

Copying a portion of the consensus sequence for the hybrid C, T box of TGACGTTA and putting it in "⌘F" finds none located between ZSCAN22 and A1BG and none between ZNF497 and A1BG as can be found by the computer programs.

For the Basic programs SuccessablesCTbox.bas written to compare nucleotide sequences with the sequences on either the template strand (-), or coding strand (+), of the DNA, in the negative direction (-), or the positive direction (+), the programs are, are looking for, and found:

negative strand, negative direction, looking for 5'-TGACGTTA-3'^[4], 0.
negative strand, positive direction, looking for 5'-TGACGTTA-3', 0.
positive strand, negative direction, looking for 5'-TGACGTTA-3', 0.
positive strand, positive direction, looking for 5'-TGACGTTA-3', 0.
complement, negative strand, negative direction, looking for 5'-ACTGCAAT-3', 0.
complement, negative strand, positive direction, looking for 5'-ACTGCAAT-3', 0.
complement, positive strand, negative direction, looking for 5'-ACTGCAAT-3', 0.
complement, positive strand, positive direction, looking for 5'-ACTGCAAT-3', 0.
inverse complement, negative strand, negative direction, looking for 5'-TAACGTCA-3', 0.
inverse complement, negative strand, positive direction, looking for 5'-TAACGTCA-3', 0.
inverse complement, positive strand, negative direction, looking for 5'-TAACGTCA-3', 0.
inverse complement, positive strand, positive direction, looking for 5'-TAACGTCA-3', 0.
inverse, negative strand, negative direction, looking for 5'-ATTGCAGT-3', 0.
inverse, negative strand, positive direction, looking for 5'-ATTGCAGT-3', 0.
inverse, positive strand, negative direction, looking for 5'-ATTGCAGT-3', 0.
inverse, positive strand, positive direction, looking for 5'-ATTGCAGT-3', 0.

Random dataset samplings

RDr0: 0.
RDr1: 0.
RDr2: 0.
RDr3: 0.
RDr4: 0.
RDr5: 0.
RDr6: 0.
RDr7: 0.
RDr8: 0.
RDr9: 0.
RDr0ci: 0.
RDr1ci: 0.
RDr2ci: 0.
RDr3ci: 0.
RDr4ci: 0.
RDr5ci: 0.
RDr6ci: 0.
RDr7ci: 0.
RDr8ci: 0.
RDr9ci: 0.

RDr UTRs

RDr core promoters

RDr proximal promoters

RDr distal promoters

Acknowledgements

The content on this page was first contributed by: Henry A. Hoff.

Initial content for this page in some instances came from Wikiversity.

References

↑ ^1.0 ^1.1 PA Johnson, D Bunick, NB Hecht (1991). "Protein Binding Regions in the Mouse and Rat Protamine-2 Genes" (PDF). Biology of Reproduction. 44 (1): 127–134. Retrieved 6 April 2019.
↑ ^2.0 ^2.1 ^2.2 ^2.3 ^2.4 Dmitry A. Samarsky, Maurille J.Fournier, Robert H.Singer and Edouard Bertrand (1 July 1998). "The snoRNA box C/D motif directs nucleolar targeting and also couples snoRNA synthesis and localization" (PDF). The European Molecular Biology Organization (EMBO) Journal. 17 (13): 3747–3757. doi:10.1093/emboj/17.13.3747. PMID 9649444. Retrieved 2017-02-04.
↑ ^3.0 ^3.1 E. N. Voronina, T. D. Kolokol’tsova, E. A. Nechaeva, and M. L. Filipenko (2003). "Structural–Functional Analysis of the Human Gene for Ribosomal Protein L11" (PDF). Molecular Biology. 37 (3): 362–371. Retrieved 11 April 2019.
↑ ^4.00 ^4.01 ^4.02 ^4.03 ^4.04 ^4.05 ^4.06 ^4.07 ^4.08 ^4.09 ^4.10 ^4.11 Young Hun Song, Cheol Min Yoo, An Pio Hong, Seong Hee Kim, Hee Jeong Jeong, Su Young Shin, Hye Jin Kim, Dae-Jin Yun, Chae Oh Lim, Jeong Dong Bahk, Sang Yeol Lee, Ron T. Nagao, Joe L. Key, and Jong Chan Hong (April 2008). "DNA-Binding Study Identifies C-Box and Hybrid C/G-Box or C/A-Box Motifs as High-Affinity Binding Sites for STF1 and LONG HYPOCOTYL5 Proteins" (PDF). Plant Physiology. 146 (4): 1862–1877. doi:10.1104/pp.107.113217. Retrieved 26 March 2019.

External links

[Johnson-1] 1.0 ^1.1 PA Johnson, D Bunick, NB Hecht (1991). "Protein Binding Regions in the Mouse and Rat Protamine-2 Genes" (PDF). Biology of Reproduction. 44 (1): 127–134. Retrieved 6 April 2019.

[Samarsky-2] 2.0 ^2.1 ^2.2 ^2.3 ^2.4 Dmitry A. Samarsky, Maurille J.Fournier, Robert H.Singer and Edouard Bertrand (1 July 1998). "The snoRNA box C/D motif directs nucleolar targeting and also couples snoRNA synthesis and localization" (PDF). The European Molecular Biology Organization (EMBO) Journal. 17 (13): 3747–3757. doi:10.1093/emboj/17.13.3747. PMID 9649444. Retrieved 2017-02-04.

[Voronina-3] 3.0 ^3.1 E. N. Voronina, T. D. Kolokol’tsova, E. A. Nechaeva, and M. L. Filipenko (2003). "Structural–Functional Analysis of the Human Gene for Ribosomal Protein L11" (PDF). Molecular Biology. 37 (3): 362–371. Retrieved 11 April 2019.

[Song-4] 4.00 ^4.01 ^4.02 ^4.03 ^4.04 ^4.05 ^4.06 ^4.07 ^4.08 ^4.09 ^4.10 ^4.11 Young Hun Song, Cheol Min Yoo, An Pio Hong, Seong Hee Kim, Hee Jeong Jeong, Su Young Shin, Hye Jin Kim, Dae-Jin Yun, Chae Oh Lim, Jeong Dong Bahk, Sang Yeol Lee, Ron T. Nagao, Joe L. Key, and Jong Chan Hong (April 2008). "DNA-Binding Study Identifies C-Box and Hybrid C/G-Box or C/A-Box Motifs as High-Affinity Binding Sites for STF1 and LONG HYPOCOTYL5 Proteins" (PDF). Plant Physiology. 146 (4): 1862–1877. doi:10.1104/pp.107.113217. Retrieved 26 March 2019.

[1]

[2]

[3]

[4]

@@ Line 59: / Line 59: @@
 # inverse, positive strand, positive direction, looking for 5'-TCTACCGGAG-3', 0.
-===Random dataset samplings===
+===CJbox random dataset samplings===
-# RDr0: 0.
+# CJboxr0: 0.
-# RDr1: 0.
+# CJboxr1: 0.
-# RDr2: 0.
+# CJboxr2: 0.
-# RDr3: 0.
+# CJboxr3: 0.
-# RDr4: 0.
+# CJboxr4: 0.
-# RDr5: 0.
+# CJboxr5: 0.
-# RDr6: 0.
+# CJboxr6: 0.
-# RDr7: 0.
+# CJboxr7: 0.
-# RDr8: 0.
+# CJboxr8: 0.
-# RDr9: 0.
+# CJboxr9: 0.
-# RDr0ci: 0.
+# CJboxr0ci: 0.
-# RDr1ci: 0.
+# CJboxr1ci: 0.
-# RDr2ci: 0.
+# CJboxr2ci: 0.
-# RDr3ci: 0.
+# CJboxr3ci: 0.
-# RDr4ci: 0.
+# CJboxr4ci: 0.
-# RDr5ci: 0.
+# CJboxr5ci: 0.
-# RDr6ci: 0.
+# CJboxr6ci: 0.
-# RDr7ci: 0.
+# CJboxr7ci: 0.
-# RDr8ci: 0.
+# CJboxr8ci: 0.
-# RDr9ci: 0.
+# CJboxr9ci: 0.
-===RDr UTRs===
+===CJboxr UTRs===
 {{main|UTR promoter gene transcriptions}}
-===RDr core promoters===
+===CJboxr core promoters===
 {{main|Core promoter gene transcriptions}}
-===RDr proximal promoters===
+===CJboxr proximal promoters===
 {{main|Proximal promoter gene transcriptions}}
-===RDr distal promoters===
+===CJboxr distal promoters===
 {{main|Distal promoter gene transcriptions}}

v t e Gene project
Articles	Complex locus A1BG and ZNF497 Grainyhead-like Genes in Regulating Development and Genetic Defects Lysenin Lysine: biosynthesis, catabolism and roles RIG-I like receptors ShK toxin: history, structure and therapeutic applications for autoimmune diseases
Categories	Biochemistry Biology Genetics Medicine
Laboratories	AGC box gene transcription laboratory ATA box gene transcription laboratory C and D boxes gene transcription laboratory CArG box gene transcription laboratory CGCG box gene transcription laboratory CRE box gene transcription laboratory E2 box gene transcription laboratory Enhancer box gene transcription laboratory Factor II B recognition element gene transcription laboratory GA responsive complex gene transcription laboratory GC box gene transcription laboratory H box gene transcription laboratory HNF6 gene transcription laboratory HY box gene transcription laboratory Initiator element gene transcription laboratory Metal responsive element gene transcription laboratory STAT5 gene transcription laboratory TATA box gene transcription laboratory
Lessons	A1BG gene transcription programming Amino Acids Enzymes Enzyme catalysis Enzyme structure and function Eukaryotic transcription Gene regulation in prokaryotes
Lists	Biomolecules
Modules	Module:Infobox gene Module:InfoboxImage
Original research	Gene project
Projects	Biochemistry Gene project History of biology Molecular Biology Molecular evolution Topobiology
Proposals	Gene expressions/Cost sharing and research products Gene expressions in human exploration beyond low earth orbits Gene expressions/Project narrative
Resources	5' cap Acid-base homeostasis Actins Adenines Allergies Alpha-1-B glycoprotein Ammonoids Original research/Amino acids Amphiphiles Anabolism Animal physiology Anomeric carbons Autocatalytic reactions Autonomously replicating sequences Base pairs Biology Biodegradation Biosynthesis Biosynthesis of a human protein Biosynthesis of amino acids Blood Bodily fluids Botany Brain box Calcium signaling Capping enzymes Carbohydrates Carcinoembryonic antigen gene family Catabolism Catalysis Cells Cell signaling Centrosomes Chromatins Chromoboxes Coactivators Corepressors Cofactors Consensus sequences Cytogenetics Cytokinesis Cytosines Deoxyribonucleic acids Digestion Disaccharides Dispersed promoters Dominant group metagenomes Downregulations Endochondral ossification Enzyme inhibitors Enzymology Epigenetics Epigenomes Esters Esterification Eukaryotes Eukaryotic initiation factors Evolution Exaptation Excision repair cross-complementing Factors Fatty acids Ferredoxin Foldings Foods Forkhead boxes Functional groups Genome surveillance complexes Genealogy Genes Genetics Gene transcriptions Genomes Genomics Glycoproteins Glycosides Glycosidic bond Guanines Hair color gene expressions Helicases Heredity History of agriculture Greek and Roman histories of biology Homeostasis Human amino acid synthesis Human DNAs Human genes Human RNA Human teeth Human temperatures Immunoglobulin domain cl11960 Immunoglobulin domain genes Immunoglobulin like domain cd05751 Immunoglobulin like domain pfam13895 Immunoglobulin like domain smart00410 Immunoglobulin receptor superfamily genes Immunoglobulin supergene family Inhibitory peptides Insulators Intranuclear localizations Introduction to Cell Biology Introduction to polymer chemistry Lamarckism Leucine zipper Localization Major histocompatibility complex class I gene family Major histocompatibility complex class II gene family Major histocompatibility complex class III gene family Mammalogy Mathematical molecular biology Mediator complexes Medicine Melanocytes Membranes Metagenomes Molecular biology Molecular genetics Nitrogen metabolism Nucleotide Synthesis Origin of life Orthomolecular medicine Osteoarthritis Paleanthropology Paleontology Phosphate biochemistry Phosphate budgets Phosphate reactions Post translational modifications Principles of biosynthesis Protein isoform Proteins Proteomics Regulations Ribonucleotides Ribosomes RNA polymerases RNA polymerase II holoenzymes RNA polymerase II holoenzyme complexes RNA translations Salinity Stroke management Teeth TFIIA Transports Vascular endothelial growth factor A What is a human? Upregulations Upstream and downstream ZSCAN22 Zoology
Transcription resources	A1BG gene transcription core promoters A1BG gene transcriptions A1BG regulatory elements and regions A1BG response element gene transcriptions A1BG response element negative results A1BG response element positive results ABA-response element gene transcriptions Abf1 regulatory factor gene transcriptions A box gene transcriptions ACGT-containing element gene transcriptions Activating protein gene transcriptions Activating transcription factor gene transcriptions Adenylate–uridylate rich element gene transcriptions Adr1p gene transcriptions Aft1p gene transcriptions AGC box gene transcriptions AGCE gene transcriptions Alpha-amylase conserved element gene transcriptions Amino acid response element gene transcriptions AARE-like Androgen response element gene transcriptions Angiotensinogen core promoter element gene transcriptions Antioxidant-electrophile responsive element gene transcriptions ATA box gene transcriptions Auxin response factor gene transcriptions B box gene transcriptions Bioinformatics tool gene transcriptions Box gene transcriptions Bridge gene transcriptions CAAT box gene transcriptions CadC binding domain gene transcriptions Calcineurin-responsive transcription factor gene transcriptions Calcium-response element gene transcriptions cAMP response element gene transcriptions C and D boxes gene transcriptions Carbohydrate response element gene transcriptions Carbon source-responsive element gene transcriptions Carcinoembryonic antigen gene family CARE gene transcriptions CArG box gene transcriptions CAT box gene transcriptions Cat8p gene transcriptions Cbf1 regulatory factor gene transcriptions C box gene transcriptions CCCTC-binding factor gene transcriptions C-EBP box gene transcriptions Cell-cycle box gene transcriptions Cell cycle regulation gene transcriptions CENP-B box gene transcriptions CGCG box gene transcriptions Circadian control element gene transcriptions Cold-responsive element gene transcriptions Complement copy gene transcriptions Complement-inverse copy gene transcriptions Consensus sequence gene transcriptions Copper response element gene transcriptions Core promoter gene transcriptions Coupling element gene transcriptions CRE box gene transcriptions Cytokinin response regulator gene transcriptions Cytoplasmic polyadenylation element gene transcriptions DAF-16-associated element gene transcriptions DAF-16 binding element gene transcriptions D box gene transcriptions Defense and stress-responsive element gene transcriptions Degenerate nucleotide gene transcriptions Dispersed promoter gene transcriptions Distal promoter gene transcriptions DNA melting gene transcriptions DNA damage response element gene transcriptions DNA replication-related element gene transcriptions Downstream core element gene transcriptions Downstream promoter element gene transcriptions Downstream TFIIB recognition element gene transcriptions DREB box gene transcriptions E2 box gene transcriptions EIF4E basal element gene transcriptions EIN3 binding site gene transcriptions Enhancer activity copy gene transcriptions E box gene transcriptions Element gene transcriptions Endoplasmic reticulum stress response element gene transcriptions Endosperm expression gene transcriptions Enhancer box gene transcriptions Estrogen response element gene transcriptions Ethylene responsive element gene transcriptions Factor II B recognition element gene transcriptions F box gene transcriptions Focused promoter gene transcriptions Forkhead box gene transcriptions Fur box gene transcriptions GAAC element gene transcriptions Gal4p gene transcriptions Γ-interferon activated sequence gene transcriptions GARE gene transcriptions GA responsive complex gene transcriptions GATA gene transcriptions G box gene transcriptions GC box gene transcriptions GCC box gene transcriptions Gcn4p gene transcriptions Gcr1p gene transcriptions Gene expressions General factor II D gene transcriptions General regulatory factors General transcription factor II A gene transcriptions General transcription factor II B gene transcriptions General transcription factor II D gene transcriptions General transcription factor II F gene transcriptions General transcription factor II H gene transcriptions General transcription factor gene transcriptions Gene transcriptions GGC triplet gene transcriptions Gibberellin responsive element gene transcriptions GLM box gene transcriptions Glucocorticoid response element gene transcriptions Grainy head gene transcriptions Grainy head transcription factor gene transcriptions Growth hormone response element gene transcriptions GT boxes Hac1p gene transcriptions Hair color gene expressions H and ACA box gene transcriptions H box gene transcriptions Heat-responsive element gene transcriptions Hex sequence gene transcriptions HMG box gene transcriptions HNF gene transcriptions Homeobox gene transcriptions Hsf1p gene transcriptions HY box gene transcriptions Hybrid C, A boxes Hybrid C, G boxes Hybrid C, T boxes Hypoxia-inducible factor gene transcriptions Hypoxia response elements I box gene transcriptions Initiator element gene transcriptions Initiator-like element gene transcriptions Inositol/choline-responsive elements Interaction gene transcriptions Interferon regulatory factors Inverse copy gene transcriptions Jasmonic acid-responsive element gene transcriptions K-boxes Kozak sequence gene transcriptions Kruppel-associated box gene transcriptions Krüppel-like factor gene transcriptions L box gene transcriptions Leu3 gene transcriptions M35 box gene transcriptions MADS box gene transcriptions Maf recognition element gene transcriptions M box gene transcriptions Mcm1 regulatory factor gene transcriptions Met31p box gene transcriptions Metal responsive element gene transcriptions Middle sporulation element gene transcriptions Mig1p gene transcriptions Model samplings Motif ten element gene transcriptions Msn2,4p gene transcriptions Musashi binding element gene transcriptions MYB recognition element gene transcriptions Myelocytomatosis transcription factor gene transcriptions Myocyte enhancer factor gene transcriptions N-boxes Ndt80p gene transcriptions Nuclear factor 1 Nuclear factor 𝜿B Nuclear factor gene transcriptions Nuclear factor of activated T cell gene transcriptions (NFAT) Nuclear factor Y gene transcriptions Nutrient-sensing response element gene transcriptions Oaf1p gene transcriptions ORE1 binding site gene transcriptions p53 response element gene transcriptions P63 DNA-binding site gene transcriptions P box gene transcriptions Pdr1,3p gene transcriptions Peroxisome proliferator hormone response element gene transcriptions Phosphate starvation-response transcription factor gene transcriptions Pollen1 element gene transcriptions Polycomb response element gene transcriptions Preinitiation complex Preinitiation complex gene transcriptions Pribnow box gene transcriptions Prolamin box gene transcriptions Promoter gene transcriptions Proximal promoter gene transcriptions Promoter occurrence gene transcriptions Pyrimidine box gene transcriptions Q element gene transcriptions Rap1 regulatory factor gene transcriptions Reb1 general regulatory factor gene transcriptions Retinoblastoma control element gene transcriptions Retinoic acid response element gene transcriptions Rgt1p gene transcriptions Rlm1p gene transcriptions RNA polymerase II gene transcriptions RNA polymerase II holoenzyme complex Root specific element gene transcriptions ROR-response element gene transcriptions Rox1p gene transcriptions Rpn4p gene transcriptions R response element gene transcriptions SARE gene transcriptions Seed-specific element gene transcriptions Serum response element gene transcriptions Servenius sequence gene transcriptions Shoot specific element gene transcriptions Sip4p gene transcriptions Smp1p gene transcriptions Sp1 gene transcriptions Spaceflight gene expressions Specificity protein gene transcriptions STAT gene transcriptions Ste12p gene transcriptions Sterol response element gene transcriptions Sucrose box gene transcriptions Synaptic Activity-Responsive Elements TACTAAC box gene transcriptions TAGteam gene transcriptions Tapetum box gene transcriptions TATA binding protein associated factor gene transcriptions TATA binding protein gene transcriptions TATA box gene transcriptions TAT box gene transcriptions TATC box gene transcriptions Tbf1 regulatory factor gene transcriptions T box gene transcriptions TCCACCATA element gene transcriptions TC element gene transcriptions TCT gene transcriptions TEA consensus sequence gene transcriptions Tec1p gene transcriptions Telomeric repeat DNA-binding factor gene transcriptions Tetradecanoylphorbol-13-acetate response element gene transcriptions TGF-β control elements (TCEs) TGF-β inhibitory elements (TIEs) Thyroid hormone response element gene transcriptions Transcriptional regulation Transcription bubble gene transcriptions Transcription factor gene transcriptions Transcription factor 3 gene transcriptions Transcription factory gene transcriptions Transcription start site gene transcriptions Translational control sequence gene transcriptions U box gene transcriptions Unfolded protein response element gene transcriptions Upstream response element gene transcriptions Upstream stimulatory factor gene transcriptions UTR promoter gene transcriptions V and P box gene transcriptions V box gene transcriptions Vhr1p gene transcriptions Vitamin D response element gene transcriptions W box gene transcriptions X box gene transcriptions Xbp1p gene transcriptions X core promoter element gene transcriptions Xenobiotic response element gene transcriptions Xenobiotic responsive element gene transcriptions Yap1p,2p gene transcriptions Y box gene transcriptions YY1 gene transcriptions Zap1p gene transcriptions Z box gene transcriptions Zinc responsive element gene transcriptions

C box gene transcriptions: Difference between revisions

Revision as of 17:19, 27 June 2021

Hypotheses

Johnson C-box samplings

CJbox random dataset samplings

CJboxr UTRs

CJboxr core promoters

CJboxr proximal promoters

CJboxr distal promoters

snoRNA C box

Samarsky C box samplings

C box S UTRs

C box S distal promoters

Random dataset samplings

RDr UTRs

RDr core promoters

RDr proximal promoters

RDr distal promoters

Voronina C box samplings

Voronina C box UTRs

Random dataset samplings

RDr UTRs

RDr core promoters

RDr proximal promoters

RDr distal promoters

Song C-boxes

Song C-box samplings

Song C-box UTRs

Song C-box core promoters

Song C-box distal promoters

Random dataset samplings

RDr UTRs

RDr core promoters

RDr proximal promoters

RDr distal promoters

Song C box hybrids

Hybrid C, A boxes

Hybrid C, G boxes

Hybrid C, T boxes

Song hybrid C box samplings

Hybrid C, A box samplings

Random dataset samplings

RDr UTRs

RDr core promoters

RDr proximal promoters

RDr distal promoters

Hybrid C, G box samplings

Random dataset samplings

RDr UTRs

RDr core promoters

RDr proximal promoters

RDr distal promoters

Hybrid C, T box samplings

Random dataset samplings

RDr UTRs

RDr core promoters

RDr proximal promoters

RDr distal promoters

Acknowledgements

See also

References

External links

Navigation menu