CCCTC-binding factor gene transcriptions: Difference between revisions

Jump to navigation Jump to search
Line 109: Line 109:
# CTCFr5ci: 8, GAGGG at 3901, GAGGG at 3839, GAGGG at 3301, GAGGG at 2752, GAGGG at 2381, GAGGG at 2252, GAGGG at 1392, GAGGG at 517.
# CTCFr5ci: 8, GAGGG at 3901, GAGGG at 3839, GAGGG at 3301, GAGGG at 2752, GAGGG at 2381, GAGGG at 2252, GAGGG at 1392, GAGGG at 517.
# CTCFr6ci: 7, GAGGG at 4103, GAGGG at 3830, GAGGG at 2832, GAGGG at 2694, GAGGG at 1507, GAGGG at 1159, GAGGG at 109.
# CTCFr6ci: 7, GAGGG at 4103, GAGGG at 3830, GAGGG at 2832, GAGGG at 2694, GAGGG at 1507, GAGGG at 1159, GAGGG at 109.
# RDr7ci: 0.
# CTCFr7ci: 6, GAGGG at 4318, GAGGG at 1782, GAGGG at 1728, GAGGG at 1243, GAGGG at 1189, GAGGG at 631.
# RDr8ci: 0.
# CTCFr8ci: 7, GAGGG at 4205, GAGGG at 3551, GAGGG at 3273, GAGGG at 2864, GAGGG at 2632, GAGGG at 2018, GAGGG at 686.
# RDr9ci: 0.
# CTCFr9ci: 4, GAGGG at 1578, GAGGG at 1332, GAGGG at 789, GAGGG at 227.


===CTCFr arbitrary (evens) (4560-2846) UTRs===
===CTCFr arbitrary (evens) (4560-2846) UTRs===
Line 123: Line 123:
# CTCFr2ci: GAGGG at 4295.
# CTCFr2ci: GAGGG at 4295.
# CTCFr6ci: GAGGG at 4103, GAGGG at 3830.
# CTCFr6ci: GAGGG at 4103, GAGGG at 3830.
# CTCFr8ci: GAGGG at 4205, GAGGG at 3551, GAGGG at 3273, GAGGG at 2864.


===CTCFr alternate (odds) (4560-2846) UTRs===
===CTCFr alternate (odds) (4560-2846) UTRs===
Line 133: Line 134:
# CTCFr3ci: GAGGG at 2864.
# CTCFr3ci: GAGGG at 2864.
# CTCFr5ci: GAGGG at 3901, GAGGG at 3839, GAGGG at 3301.
# CTCFr5ci: GAGGG at 3901, GAGGG at 3839, GAGGG at 3301.
# CTCFr7ci: GAGGG at 4318.


===CTCFr arbitrary negative direction (evens) (2846-2811) core promoters===
===CTCFr arbitrary negative direction (evens) (2846-2811) core promoters===
Line 146: Line 148:


# CTCFr5: CCCTC at 4416, CCCTC at 4356.
# CTCFr5: CCCTC at 4416, CCCTC at 4356.
# CTCFr7ci: GAGGG at 4318.


===CTCFr alternate positive direction (evens) (4445-4265) core promoters===
===CTCFr alternate positive direction (evens) (4445-4265) core promoters===
Line 157: Line 160:


# CTCFr6ci: GAGGG at 2694.
# CTCFr6ci: GAGGG at 2694.
# CTCFr8ci: GAGGG at 2632.


===CTCFr alternate negative direction (odds) (2811-2596) proximal promoters===
===CTCFr alternate negative direction (odds) (2811-2596) proximal promoters===
Line 164: Line 168:
# CTCFr9: CCCTC at 2746.
# CTCFr9: CCCTC at 2746.
# CTCFr5ci: GAGGG at 2752.
# CTCFr5ci: GAGGG at 2752.
===RDr arbitrary positive direction (odds) (4265-4050) proximal promoters===


===CTCFr alternate positive direction (evens) (4265-4050) proximal promoters===
===CTCFr alternate positive direction (evens) (4265-4050) proximal promoters===


# CTCFr6ci: GAGGG at 4103.
# CTCFr6ci: GAGGG at 4103.
# CTCFr8ci: GAGGG at 4205.


===CTCFr arbitrary negative direction (evens) (2596-1) distal promoters===
===CTCFr arbitrary negative direction (evens) (2596-1) distal promoters===
Line 181: Line 183:
# CTCFr4ci: GAGGG at 2309, GAGGG at 1017.
# CTCFr4ci: GAGGG at 2309, GAGGG at 1017.
# CTCFr6ci: GAGGG at 1507, GAGGG at 1159, GAGGG at 109.
# CTCFr6ci: GAGGG at 1507, GAGGG at 1159, GAGGG at 109.
# CTCFr8ci: GAGGG at 2018, GAGGG at 686.


===CTCFr alternate negative direction (odds) (2596-1) distal promoters===
===CTCFr alternate negative direction (odds) (2596-1) distal promoters===
Line 192: Line 195:
# CTCFr3ci: GAGGG at 2319, GAGGG at 1920, GAGGG at 1628, GAGGG at 1623, GAGGG at 1368, GAGGG at 962.
# CTCFr3ci: GAGGG at 2319, GAGGG at 1920, GAGGG at 1628, GAGGG at 1623, GAGGG at 1368, GAGGG at 962.
# CTCFr5ci: GAGGG at 2381, GAGGG at 2252, GAGGG at 1392, GAGGG at 517.
# CTCFr5ci: GAGGG at 2381, GAGGG at 2252, GAGGG at 1392, GAGGG at 517.
# CTCFr7ci: GAGGG at 1782, GAGGG at 1728, GAGGG at 1243, GAGGG at 1189, GAGGG at 631.
# CTCFr9ci: GAGGG at 1578, GAGGG at 1332, GAGGG at 789, GAGGG at 227.


===CTCFr arbitrary positive direction (odds) (4050-1) distal promoters===
===CTCFr arbitrary positive direction (odds) (4050-1) distal promoters===
Line 203: Line 208:
# CTCFr3ci: GAGGG at 2864, GAGGG at 2319, GAGGG at 1920, GAGGG at 1628, GAGGG at 1623, GAGGG at 1368, GAGGG at 962.
# CTCFr3ci: GAGGG at 2864, GAGGG at 2319, GAGGG at 1920, GAGGG at 1628, GAGGG at 1623, GAGGG at 1368, GAGGG at 962.
# CTCFr5ci: GAGGG at 3901, GAGGG at 3839, GAGGG at 3301, GAGGG at 2752, GAGGG at 2381, GAGGG at 2252, GAGGG at 1392, GAGGG at 517.
# CTCFr5ci: GAGGG at 3901, GAGGG at 3839, GAGGG at 3301, GAGGG at 2752, GAGGG at 2381, GAGGG at 2252, GAGGG at 1392, GAGGG at 517.
# CTCFr7ci: GAGGG at 1782, GAGGG at 1728, GAGGG at 1243, GAGGG at 1189, GAGGG at 631.
# CTCFr9ci: GAGGG at 1578, GAGGG at 1332, GAGGG at 789, GAGGG at 227.


===CTCFr alternate positive direction (evens) (4050-1) distal promoters===
===CTCFr alternate positive direction (evens) (4050-1) distal promoters===
Line 215: Line 222:
# CTCFr4ci: GAGGG at 2309, GAGGG at 1017.
# CTCFr4ci: GAGGG at 2309, GAGGG at 1017.
# CTCFr6ci: GAGGG at 3830, GAGGG at 2832, GAGGG at 2694, GAGGG at 1507, GAGGG at 1159, GAGGG at 109.
# CTCFr6ci: GAGGG at 3830, GAGGG at 2832, GAGGG at 2694, GAGGG at 1507, GAGGG at 1159, GAGGG at 109.
# CTCFr8ci: GAGGG at 3551, GAGGG at 3273, GAGGG at 2864, GAGGG at 2632, GAGGG at 2018, GAGGG at 686.


==CTCF analysis and results==
==CTCF analysis and results==

Revision as of 23:30, 12 May 2023

Associate Editor(s)-in-Chief: Henry A. Hoff

Consensus sequences

"Experiments using chromatin immunoprecipitation exonuclease (ChIP-exo) uncovered a broad CTCF-binding motif that contains a 12–15 bp consensus sequence, 5′-NCA-NNA-G(G/A)N-GGC-(G/A)(C/G)(T/C)-3′ (Nakahashi et al., 2013, Rhee and Pugh, 2011) [...]."[1]

Hashimoto samplings

Copying the consensus of the CTCF: 5'-CACCAGG-3' and putting the sequence in "⌘F" finds no locations between ZSCAN22 and A1BG and CTCF: 5'-CACCAGGAGG-3' finds no locations between ZNF497 and A1BG as can be found by the computer programs.

For the Basic programs SuccessablesCFbox.bas written to compare nucleotide sequences with the sequences on either the template strand (-), or coding strand (+), of the DNA, in the negative direction (-), or the positive direction (+), the programs are, are looking for, and found:

  1. negative strand, negative direction, looking for 5'-NCA-NNA-G(A/G)N-GGC-(A/G)(C/G)(C/T)-3'[1], 0.
  2. negative strand, positive direction, looking for 5'-NCA-NNA-G(A/G)N-GGC-(A/G)(C/G)(C/T)-3', 0.
  3. positive strand, negative direction, looking for 5'-NCA-NNA-G(A/G)N-GGC-(A/G)(C/G)(C/T)-3', 0.
  4. positive strand, positive direction, looking for 5'-NCA-NNA-G(A/G)N-GGC-(A/G)(C/G)(C/T)-3', 0.
  5. complement, negative strand, negative direction, looking for 5'-NGT-NNT-C(C/T)N-CCG-(C/T)(C/G)(A/G)-3', 0.
  6. complement, negative strand, positive direction, looking for 5'-NGT-NNT-C(C/T)N-CCG-(C/T)(C/G)(A/G)-3', 0.
  7. complement, positive strand, negative direction, looking for 5'-NGT-NNT-C(C/T)N-CCG-(C/T)(C/G)(A/G)-3', 0.
  8. complement, positive strand, positive direction, looking for 5'-NGT-NNT-C(C/T)N-CCG-(C/T)(C/G)(A/G)-3', 0.
  9. inverse complement, negative strand, negative direction, looking for 5'-(A/G)(C/G)(C/T)-GCC-N(C/T)C-TNN-TGN-3', 0.
  10. inverse complement, negative strand, positive direction, looking for 5'-(A/G)(C/G)(C/T)-GCC-N(C/T)C-TNN-TGN-3', 0.
  11. inverse complement, positive strand, negative direction, looking for 5'-(A/G)(C/G)(C/T)-GCC-N(C/T)C-TNN-TGN-3', 0.
  12. inverse complement, positive strand, positive direction, looking for 5'-(A/G)(C/G)(C/T)-GCC-N(C/T)C-TNN-TGN-3', 0.
  13. inverse, negative strand, negative direction, looking for 5'-(C/T)(C/G)(A/G)-CGG-N(A/G)G-ANN-ACN-3', 0.
  14. inverse, negative strand, positive direction, looking for 5'-(C/T)(C/G)(A/G)-CGG-N(A/G)G-ANN-ACN-3', 0.
  15. inverse, positive strand, negative direction, looking for 5'-(C/T)(C/G)(A/G)-CGG-N(A/G)G-ANN-ACN-3', 0.
  16. inverse, positive strand, positive direction, looking for 5'-(C/T)(C/G)(A/G)-CGG-N(A/G)G-ANN-ACN-3', 0.

CTCF (Lobanenkov) samplings

CCCTC-Binding factor or CTCF was initially discovered as a negative regulator of the chicken c-myc gene. This protein was found to be binding to three regularly spaced repeats of the core sequence CCCTC and thus was named CCCTC binding factor.[2]

For the Basic programs testing consensus sequence CCCTC (starting with SuccessablesCTCF.bas) written to compare nucleotide sequences with the sequences on either the template strand (-), or coding strand (+), of the DNA, in the negative direction (-), or the positive direction (+), the programs are, are looking for, and found:

  1. Negative strand, negative direction: 22, CCCTC at 4560, CCCTC at 4549, CCCTC at 4497, CCCTC at 4303, CCCTC at 4271, CCCTC at 4153, CCCTC at 4002, CCCTC at 3989, CCCTC at 3888, CCCTC at 3752, CCCTC at 3714, CCCTC at 3080, CCCTC at 2701, CCCTC at 2221, CCCTC at 2104, CCCTC at 1962, CCCTC at 1930, CCCTC at 1795, CCCTC at 1018, CCCTC at 686, CCCTC at 550, CCCTC at 413.
  2. Positive strand, negative direction: 1, CCCTC at 2626.
  3. Negative strand, positive direction: 7, CCCTC at 4294, CCCTC at 3502, CCCTC at 3355, CCCTC at 3206, CCCTC at 1772, CCCTC at 95, CCCTC at 87.
  4. Positive strand, positive direction: 16, CCCTC at 4432, CCCTC at 4044, CCCTC at 3978, CCCTC at 3673, CCCTC at 3657, CCCTC at 3453, CCCTC at 3184, CCCTC at 2288, CCCTC at 1896, CCCTC at 1782, CCCTC at 1683, CCCTC at 661, CCCTC at 493, CCCTC at 382, CCCTC at 368, CCCTC at 313.
  5. inverse complement, negative strand, negative direction: 3, GAGGG at 4258, GAGGG at 1507, GAGGG at 388.
  6. inverse complement, positive strand, negative direction: 5, GAGGG at 4558, GAGGG at 3652, GAGGG at 2699, GAGGG at 1673, GAGGG at 88.
  7. inverse complement, negative strand, positive direction: 16, GAGGG at 4434, GAGGG at 3980, GAGGG at 3906, GAGGG at 3879, GAGGG at 3653, GAGGG at 3479, GAGGG at 3182, GAGGG at 2796, GAGGG at 2655, GAGGG at 2290, GAGGG at 1898, GAGGG at 311, GAGGG at 258, GAGGG at 245, GAGGG at 181, GAGGG at 18.
  8. inverse complement, positive strand, positive direction: 9, GAGGG at 4296, GAGGG at 3554, GAGGG at 3332, GAGGG at 3198, GAGGG at 3077, GAGGG at 2531, GAGGG at 2395, GAGGG at 2382, GAGGG at 465.

CTCF (4560-2846) UTRs

  1. Negative strand, negative direction: CCCTC at 4560, CCCTC at 4549, CCCTC at 4497, CCCTC at 4303, CCCTC at 4271, CCCTC at 4153, CCCTC at 4002, CCCTC at 3989, CCCTC at 3888, CCCTC at 3752, CCCTC at 3714, CCCTC at 3080.
  2. Negative strand, negative direction: GAGGG at 4258.
  3. Positive strand, negative direction: GAGGG at 4558, GAGGG at 3652.

CTCF positive direction (4445-4265) core promoters

  1. Negative strand, positive direction: CCCTC at 4294.
  2. Negative strand, positive direction: GAGGG at 4434.
  3. Positive strand, positive direction: CCCTC at 4432.
  4. Positive strand, positive direction: GAGGG at 4296.

CTCF negative direction (2811-2596) proximal promoters

  1. Negative strand, negative direction: CCCTC at 2701.
  2. Positive strand, negative direction: CCCTC at 2626.
  3. Positive strand, negative direction: GAGGG at 2699.

CTCF positive direction (4265-4050) proximal promoters

  1. Negative strand, positive direction: CCCTC at 4294.

CTCF negative direction (2596-1) distal promoters

  1. Negative strand, negative direction: CCCTC at 2221, CCCTC at 2104, CCCTC at 1962, CCCTC at 1930, CCCTC at 1795, CCCTC at 1018, CCCTC at 686, CCCTC at 550, CCCTC at 413.
  2. Negative strand, negative direction: GAGGG at 1507, GAGGG at 388.
  3. Positive strand, negative direction: GAGGG at 1673, GAGGG at 88.

CTCF positive direction (4050-1) distal promoters

  1. Negative strand, positive direction: CCCTC at 3502, CCCTC at 3355, CCCTC at 3206, CCCTC at 1772, CCCTC at 95, CCCTC at 87.
  2. Negative strand, positive direction: GAGGG at 4434, GAGGG at 3980, GAGGG at 3906, GAGGG at 3879, GAGGG at 3653, GAGGG at 3479, GAGGG at 3182, GAGGG at 2796, GAGGG at 2655, GAGGG at 2290, GAGGG at 1898, GAGGG at 311, GAGGG at 258, GAGGG at 245, GAGGG at 181, GAGGG at 18.
  3. Positive strand, positive direction: CCCTC at 4044, CCCTC at 3978, CCCTC at 3673, CCCTC at 3657, CCCTC at 3453, CCCTC at 3184, CCCTC at 2288, CCCTC at 1896, CCCTC at 1782, CCCTC at 1683, CCCTC at 661, CCCTC at 493, CCCTC at 382, CCCTC at 368, CCCTC at 313.
  4. Positive strand, positive direction: GAGGG at 3554, GAGGG at 3332, GAGGG at 3198, GAGGG at 3077, GAGGG at 2531, GAGGG at 2395, GAGGG at 2382, GAGGG at 465.

CTCF (Lobanenkov) random dataset samplings

  1. CTCFr0: 5, CCCTC at 2873, CCCTC at 2302, CCCTC at 1863, CCCTC at 587, CCCTC at 26.
  2. CTCFr1: 8, CCCTC at 2735, CCCTC at 2580, CCCTC at 2300, CCCTC at 1369, CCCTC at 859, CCCTC at 424, CCCTC at 217, CCCTC at 127.
  3. CTCFr2: 8, CCCTC at 4478, CCCTC at 3917, CCCTC at 3842, CCCTC at 3643, CCCTC at 956, CCCTC at 894, CCCTC at 692, CCCTC at 670.
  4. CTCFr3: 5, CCCTC at 3829, CCCTC at 3519, CCCTC at 2497, CCCTC at 1521, CCCTC at 1429.
  5. CTCFr4: 8, CCCTC at 4306, CCCTC at 2619, CCCTC at 2356, CCCTC at 1908, CCCTC at 1525, CCCTC at 1521, CCCTC at 1177, CCCTC at 1058.
  6. CTCFr5: 7, CCCTC at 4416, CCCTC at 4356, CCCTC at 3244, CCCTC at 2134, CCCTC at 1860, CCCTC at 1853, CCCTC at 1405.
  7. CTCFr6: 10, CCCTC at 4408, CCCTC at 3983, CCCTC at 3805, CCCTC at 3560, CCCTC at 2969, CCCTC at 2361, CCCTC at 2272, CCCTC at 2095, CCCTC at 1736, CCCTC at 886.
  8. CTCFr7: 7, CCCTC at 3262, CCCTC at 3246, CCCTC at 2909, CCCTC at 2643, CCCTC at 1747, CCCTC at 978, CCCTC at 543.
  9. CTCFr8: 3, CCCTC at 4032, CCCTC at 3783, CCCTC at 3756.
  10. CTCFr9: 14, CCCTC at 3586, CCCTC at 3516, CCCTC at 3402, CCCTC at 3023, CCCTC at 2828, CCCTC at 2746, CCCTC at 2574, CCCTC at 2157, CCCTC at 1787, CCCTC at 1778, CCCTC at 1343, CCCTC at 856, CCCTC at 191, CCCTC at 101.
  11. CTCFr0ci: 6, GAGGG at 4403, GAGGG at 2464, GAGGG at 1396, GAGGG at 1072, GAGGG at 920, GAGGG at 733.
  12. CTCFr1ci: 5, GAGGG at 3317, GAGGG at 3133, GAGGG at 1179, GAGGG at 1149, GAGGG at 838.
  13. CTCFr2ci: 5, GAGGG at 4295, GAGGG at 2839, GAGGG at 2396, GAGGG at 1179, GAGGG at 170.
  14. CTCFr3ci: 7, GAGGG at 2864, GAGGG at 2319, GAGGG at 1920, GAGGG at 1628, GAGGG at 1623, GAGGG at 1368, GAGGG at 962.
  15. CTCFr4ci: 2, GAGGG at 2309, GAGGG at 1017.
  16. CTCFr5ci: 8, GAGGG at 3901, GAGGG at 3839, GAGGG at 3301, GAGGG at 2752, GAGGG at 2381, GAGGG at 2252, GAGGG at 1392, GAGGG at 517.
  17. CTCFr6ci: 7, GAGGG at 4103, GAGGG at 3830, GAGGG at 2832, GAGGG at 2694, GAGGG at 1507, GAGGG at 1159, GAGGG at 109.
  18. CTCFr7ci: 6, GAGGG at 4318, GAGGG at 1782, GAGGG at 1728, GAGGG at 1243, GAGGG at 1189, GAGGG at 631.
  19. CTCFr8ci: 7, GAGGG at 4205, GAGGG at 3551, GAGGG at 3273, GAGGG at 2864, GAGGG at 2632, GAGGG at 2018, GAGGG at 686.
  20. CTCFr9ci: 4, GAGGG at 1578, GAGGG at 1332, GAGGG at 789, GAGGG at 227.

CTCFr arbitrary (evens) (4560-2846) UTRs

  1. CTCFr0: CCCTC at 2873.
  2. CTCFr2: CCCTC at 4478, CCCTC at 3917, CCCTC at 3842, CCCTC at 3643.
  3. CTCFr4: CCCTC at 4306.
  4. CTCFr6: CCCTC at 4408, CCCTC at 3983, CCCTC at 3805, CCCTC at 3560, CCCTC at 2969.
  5. CTCFr8: CCCTC at 4032, CCCTC at 3783, CCCTC at 3756.
  6. CTCFr0ci: GAGGG at 4403.
  7. CTCFr2ci: GAGGG at 4295.
  8. CTCFr6ci: GAGGG at 4103, GAGGG at 3830.
  9. CTCFr8ci: GAGGG at 4205, GAGGG at 3551, GAGGG at 3273, GAGGG at 2864.

CTCFr alternate (odds) (4560-2846) UTRs

  1. CTCFr3: CCCTC at 3829, CCCTC at 3519.
  2. CTCFr5: CCCTC at 4416, CCCTC at 4356, CCCTC at 3244.
  3. CTCFr7: CCCTC at 3262, CCCTC at 3246, CCCTC at 2909.
  4. CTCFr9: CCCTC at 3586, CCCTC at 3516, CCCTC at 3402, CCCTC at 3023.
  5. CTCFr1ci: GAGGG at 3317, GAGGG at 3133.
  6. CTCFr3ci: GAGGG at 2864.
  7. CTCFr5ci: GAGGG at 3901, GAGGG at 3839, GAGGG at 3301.
  8. CTCFr7ci: GAGGG at 4318.

CTCFr arbitrary negative direction (evens) (2846-2811) core promoters

  1. CTCFr2ci: GAGGG at 2839.
  2. CTCFr6ci: GAGGG at 2832.

CTCFr alternate negative direction (odds) (2846-2811) core promoters

  1. CTCFr9: CCCTC at 2828.

CTCFr arbitrary positive direction (odds) (4445-4265) core promoters

  1. CTCFr5: CCCTC at 4416, CCCTC at 4356.
  2. CTCFr7ci: GAGGG at 4318.

CTCFr alternate positive direction (evens) (4445-4265) core promoters

  1. CTCFr4: CCCTC at 4306.
  2. CTCFr6: CCCTC at 4408.
  3. CTCFr0ci: GAGGG at 4403.
  4. CTCFr2ci: GAGGG at 4295.

CTCFr arbitrary negative direction (evens) (2811-2596) proximal promoters

  1. CTCFr6ci: GAGGG at 2694.
  2. CTCFr8ci: GAGGG at 2632.

CTCFr alternate negative direction (odds) (2811-2596) proximal promoters

  1. CTCFr1: CCCTC at 2735.
  2. CTCFr7: CCCTC at 2643.
  3. CTCFr9: CCCTC at 2746.
  4. CTCFr5ci: GAGGG at 2752.

CTCFr alternate positive direction (evens) (4265-4050) proximal promoters

  1. CTCFr6ci: GAGGG at 4103.
  2. CTCFr8ci: GAGGG at 4205.

CTCFr arbitrary negative direction (evens) (2596-1) distal promoters

  1. CTCFr0: CCCTC at 2302, CCCTC at 1863, CCCTC at 587, CCCTC at 26.
  2. CTCFr2: CCCTC at 956, CCCTC at 894, CCCTC at 692, CCCTC at 670.
  3. CTCFr4: CCCTC at 2356, CCCTC at 1908, CCCTC at 1525, CCCTC at 1521, CCCTC at 1177, CCCTC at 1058.
  4. CTCFr6: CCCTC at 2361, CCCTC at 2272, CCCTC at 2095, CCCTC at 1736, CCCTC at 886.
  5. CTCFr0ci: GAGGG at 2464, GAGGG at 1396, GAGGG at 1072, GAGGG at 920, GAGGG at 733.
  6. CTCFr4ci: GAGGG at 2309, GAGGG at 1017.
  7. CTCFr6ci: GAGGG at 1507, GAGGG at 1159, GAGGG at 109.
  8. CTCFr8ci: GAGGG at 2018, GAGGG at 686.

CTCFr alternate negative direction (odds) (2596-1) distal promoters

  1. CTCFr1: CCCTC at 2580, CCCTC at 2300, CCCTC at 1369, CCCTC at 859, CCCTC at 424, CCCTC at 217, CCCTC at 127.
  2. CTCFr3: CCCTC at 2497, CCCTC at 1521, CCCTC at 1429.
  3. CTCFr5: CCCTC at 2134, CCCTC at 1860, CCCTC at 1853, CCCTC at 1405.
  4. CTCFr7: CCCTC at 1747, CCCTC at 978, CCCTC at 543.
  5. CTCFr9: CCCTC at 2574, CCCTC at 2157, CCCTC at 1787, CCCTC at 1778, CCCTC at 1343, CCCTC at 856, CCCTC at 191, CCCTC at 101.
  6. CTCFr1ci: GAGGG at 1179, GAGGG at 1149, GAGGG at 838.
  7. CTCFr3ci: GAGGG at 2319, GAGGG at 1920, GAGGG at 1628, GAGGG at 1623, GAGGG at 1368, GAGGG at 962.
  8. CTCFr5ci: GAGGG at 2381, GAGGG at 2252, GAGGG at 1392, GAGGG at 517.
  9. CTCFr7ci: GAGGG at 1782, GAGGG at 1728, GAGGG at 1243, GAGGG at 1189, GAGGG at 631.
  10. CTCFr9ci: GAGGG at 1578, GAGGG at 1332, GAGGG at 789, GAGGG at 227.

CTCFr arbitrary positive direction (odds) (4050-1) distal promoters

  1. CTCFr1: CCCTC at 2735, CCCTC at 2580, CCCTC at 2300, CCCTC at 1369, CCCTC at 859, CCCTC at 424, CCCTC at 217, CCCTC at 127.
  2. CTCFr3: CCCTC at 3829, CCCTC at 3519, CCCTC at 2497, CCCTC at 1521, CCCTC at 1429.
  3. CTCFr5: CCCTC at 3244, CCCTC at 2134, CCCTC at 1860, CCCTC at 1853, CCCTC at 1405.
  4. CTCFr7: CCCTC at 3262, CCCTC at 3246, CCCTC at 2909, CCCTC at 2643, CCCTC at 1747, CCCTC at 978, CCCTC at 543.
  5. CTCFr9: CCCTC at 3586, CCCTC at 3516, CCCTC at 3402, CCCTC at 3023, CCCTC at 2828, CCCTC at 2746, CCCTC at 2574, CCCTC at 2157, CCCTC at 1787, CCCTC at 1778, CCCTC at 1343, CCCTC at 856, CCCTC at 191, CCCTC at 101.
  6. CTCFr1ci: GAGGG at 3317, GAGGG at 3133, GAGGG at 1179, GAGGG at 1149, GAGGG at 838.
  7. CTCFr3ci: GAGGG at 2864, GAGGG at 2319, GAGGG at 1920, GAGGG at 1628, GAGGG at 1623, GAGGG at 1368, GAGGG at 962.
  8. CTCFr5ci: GAGGG at 3901, GAGGG at 3839, GAGGG at 3301, GAGGG at 2752, GAGGG at 2381, GAGGG at 2252, GAGGG at 1392, GAGGG at 517.
  9. CTCFr7ci: GAGGG at 1782, GAGGG at 1728, GAGGG at 1243, GAGGG at 1189, GAGGG at 631.
  10. CTCFr9ci: GAGGG at 1578, GAGGG at 1332, GAGGG at 789, GAGGG at 227.

CTCFr alternate positive direction (evens) (4050-1) distal promoters

  1. CTCFr0: CCCTC at 2873, CCCTC at 2302, CCCTC at 1863, CCCTC at 587, CCCTC at 26.
  2. CTCFr2: CCCTC at 3917, CCCTC at 3842, CCCTC at 3643, CCCTC at 956, CCCTC at 894, CCCTC at 692, CCCTC at 670.
  3. CTCFr4: CCCTC at 2619, CCCTC at 2356, CCCTC at 1908, CCCTC at 1525, CCCTC at 1521, CCCTC at 1177, CCCTC at 1058.
  4. CTCFr6: CCCTC at 3983, CCCTC at 3805, CCCTC at 3560, CCCTC at 2969, CCCTC at 2361, CCCTC at 2272, CCCTC at 2095, CCCTC at 1736, CCCTC at 886.
  5. CTCFr8: CCCTC at 4032, CCCTC at 3783, CCCTC at 3756.
  6. CTCFr0ci: GAGGG at 2464, GAGGG at 1396, GAGGG at 1072, GAGGG at 920, GAGGG at 733.
  7. CTCFr2ci: GAGGG at 2839, GAGGG at 2396, GAGGG at 1179, GAGGG at 170.
  8. CTCFr4ci: GAGGG at 2309, GAGGG at 1017.
  9. CTCFr6ci: GAGGG at 3830, GAGGG at 2832, GAGGG at 2694, GAGGG at 1507, GAGGG at 1159, GAGGG at 109.
  10. CTCFr8ci: GAGGG at 3551, GAGGG at 3273, GAGGG at 2864, GAGGG at 2632, GAGGG at 2018, GAGGG at 686.

CTCF analysis and results

This protein was found to be binding to three regularly spaced repeats of the core sequence CCCTC and thus was named CCCTC binding factor.[2]

Reals or randoms Promoters direction Numbers Strands Occurrences Averages (± 0.1)
Reals UTR negative 15 2 7.5 7.5 ± 5.5 (--13,+-2)
Randoms UTR arbitrary negative 0 10 0 0
Randoms UTR alternate negative 0 10 0 0
Reals Core negative 0 2 0 0
Randoms Core arbitrary negative 0 10 0 0
Randoms Core alternate negative 0 10 0 0
Reals Core positive 4 2 2 2 ± 0 (--+2,++2)
Randoms Core arbitrary positive 0 10 0 0
Randoms Core alternate positive 0 10 0 0
Reals Proximal negative 3 2 1.5 1.5 ± 0.5 (--1,+-2)
Randoms Proximal arbitrary negative 0 10 0 0
Randoms Proximal alternate negative 0 10 0 0
Reals Proximal positive 1 2 0.5 0.5 ± 0.5 (-+1,++0)
Randoms Proximal arbitrary positive 0 10 0 0
Randoms Proximal alternate positive 0 10 0 0
Reals Distal negative 13 2 6.5 6.5 ± 4.5 (--11,+-2)
Randoms Distal arbitrary negative 0 10 0 0
Randoms Distal alternate negative 0 10 0 0
Reals Distal positive 45 2 22.5 22.5 ± 0.5 (-+22,++23)
Randoms Distal arbitrary positive 0 10 0 0
Randoms Distal alternate positive 0 10 0 0

Comparison:

The occurrences of real CTCFs are greater than the randoms. This suggests that the real CTCFs are likely active or activable.

Acknowledgements

The content on this page was first contributed by: Henry A. Hoff.

See also

References

  1. 1.0 1.1 Hideharu Hashimoto, Dongxue Wang, John R. Horton, Xing Zhang, Victor G. Corces and Xiaodong Cheng (1 June 2017). "Structural Basis for the Versatile and Methylation-Dependent Binding of CTCF to DNA". Molecular Cell. 66 (5): 711–720.e3. doi:10.1016/j.molcel.2017.05.004. PMID 28529057. Retrieved 28 August 2020.
  2. 2.0 2.1 Lobanenkov VV, Nicolas RH, Adler VV, Paterson H, Klenova EM, Polotskaja AV, Goodwin GH (December 1990). "A novel sequence-specific DNA binding protein which interacts with three regularly spaced direct repeats of the CCCTC-motif in the 5'-flanking sequence of the chicken c-myc gene". Oncogene. 5 (12): 1743–53. PMID 2284094.

External links