Initiator element gene transcriptions

Jump to navigation Jump to search

Editor-In-Chief: Henry A. Hoff

In the biosynthesis of any human protein, the gene that contains the nucleotide sequence which is translated into that protein must be transcribed. For RNA polymerase II holoenzyme to transcribe the gene, the gene's promoter must be located. After the promoter is located, the transcription start site (TSS) is pinpointed by using nucleotide sequences that include the TSS. Within the promoter, most human genes lack a TATA box and have an initiator element (Inr) or downstream promoter element instead.

On the basis of descriptions available, various Inrs are located to test whether the known TSS is located.

Notations

Notation: let the symbol Inr denote an initiator element.

Notation: let the symbol +1 designate the nucleotide that is the transcription start site (TSS).

Genetics

Inr in humans was first explained and sequenced in 1989.[1]

The Inr element for core promoters was found to be more prevalent than the TATA box in eukaryotic promoter domains.[2] In a study of 1800+ distinct human promoter sequences it was found that 49% contain the Inr element while 21.8% contain the TATA box.[2]

Gene transcriptions

Two subunits, TAF1 and TAF2, of the TFIID recognize the Inr sequence and bring the complex together.[3]

The interaction between TFIID and Inr is believed to be most imperative in initiating transcription due to the Inr sequence overlapping the start site.[4]

The Inr element is also believed to interact with the activator Sp1 transcription factor (Sp1), specificity protein 1 transcription factor, which is then able to regulate the activation and initiation of transcription[5]

Promoters with a functional Inr are more likely to lack a TATA box or to possess a degenerate TATA sequence because a gene with an active Inr is less dependent on a functional TATA box or additional promoters.[6] Although Inr element varies between promoters, the sequence is highly conserved between humans and yeast.[6] An analysis of 7670 transcription start sites showed that roughly 40% had an exact match to the BBCA+1BW Inr sequence, while 16% contained only one mismatch [7] TFIID and subunits are very sensitive to the Inr sequence and nucleotide changes have been shown to drastically change the binding affinity, where the +1 and -3 positions have been identified as the most critical for transcription efficiency and Inr function.[6] A replacement of the Adenosine (A) nucleotide at the +1 to G or T changes transcription activity by 10% and a replacement of Thymine (T) at the +3 position changes transcription activity levels by 22%.[8]

Theoretical initiator elements

Here's a theoretical definition:

Def. a series of nucleotides including a transcription start site on one DNA strand whose presence in a gene promoter eventually leads to a chain reaction or polymerization such as transcription is called an initiator element.

RNA polymerase IIs

"RNA pol II itself recognizes features of the Inr which might assist the correct positioning of the polymerase on the promoter (Carcamo et al., 1991; Weis and Reinberg, 1997)."[9][10][11]

RNA polymerase II may form a stable complex on TATA-less promoters that contain Inr elements and possess a weak, intrinsic preference for Inr-like sequences.[10]

RNA polymerase II holoenzyme complexes

Gene ID: 672 is BRCA1 BRCA1, DNA repair associated. "This gene encodes a nuclear phosphoprotein that plays a role in maintaining genomic stability, and it also acts as a tumor suppressor. The encoded protein combines with other tumor suppressors, DNA damage sensors, and signal transducers to form a large multi-subunit protein complex known as the BRCA1-associated genome surveillance complex (BASC). This gene product associates with RNA polymerase II, and through the C-terminal domain, also interacts with histone deacetylase complexes. This protein thus plays a role in transcription, DNA repair of double-stranded breaks, and recombination. Mutations in this gene are responsible for approximately 40% of inherited breast cancers and more than 80% of inherited breast and ovarian cancers. Alternative splicing plays a role in modulating the subcellular localization and physiological function of this gene. Many alternatively spliced transcript variants, some of which are disease-associated mutations, have been described for this gene, but the full-length natures of only some of these variants has been described. A related pseudogene, which is also located on chromosome 17, has been identified."[12]

Gene ID: 1660 is DHX9 DExH-box helicase 9 (aka LKP; RHA; DDX9; NDH2; NDHII). "This gene encodes a member of the DEAH-containing family of RNA helicases. The encoded protein is an enzyme that catalyzes the ATP-dependent unwinding of double-stranded RNA and DNA-RNA complexes. This protein localizes to both the nucleus and the cytoplasm and functions as a transcriptional regulator. This protein may also be involved in the expression and nuclear export of retroviral RNAs. Alternate splicing results in multiple transcript variants. Pseudogenes of this gene are found on chromosomes 11 and 13."[13]

BRCA1 has been shown to interact with DHX9; i.e., overexpression of a protein fragment of RNA helicase A causes inhibition of endogenous BRCA1 function and defects in ploidy and cytokinesis in mammary epithelial cells[14] and the BRCA1 protein is linked to the RNA polymerase II holoenzyme complex via RNA helicase A.[15]

ATP-dependent RNA helicase A (RHA; also known as DHX9, LKP, and NDHI) is an enzyme that in humans is encoded by the DHX9 gene.[16][17][13]

RNA polymerase II subunit A C-terminal domain phosphatase is an enzyme that in humans is encoded by the CTDP1 gene.[18][19][20]

Gene ID: 9150 is CTDP1 CTD phosphatase subunit 1. "This gene encodes a protein which interacts with the carboxy-terminus of the RAP74 subunit of transcription initiation factor TFIIF, and functions as a phosphatase that processively dephosphorylates the C-terminus of POLR2A (a subunit of RNA polymerase II), making it available for initiation of gene expression. Mutations in this gene are associated with congenital cataracts, facial dysmorphism and neuropathy syndrome (CCFDN). Alternatively spliced transcript variants encoding different isoforms have been described for this gene."[20]

"This gene encodes a protein which interacts with the carboxy-terminus of transcription initiation factor TFIIF, a transcription factor which regulates elongation as well as initiation by RNA polymerase II. The protein may also represent a component of an RNA polymerase II holoenzyme complex. Alternative splicing of this gene results in two transcript variants encoding 2 different isoforms."[21]

CTDP1 has been shown to interact with WD repeat-containing protein 77,[22] GTF2F1[19] and POLR2A.[23]

Gene ID: 168400 is DDX53 DEAD-box helicase 53. "This intronless gene encodes a protein which contains several domains found in members of the DEAD-box helicase protein family. Other members of this protein family participate in ATP-dependent RNA unwinding."[24]

"DEAD/DEAH box helicases are proteins, and are putative RNA helicases. They are implicated in a number of cellular processes involving alteration of RNA secondary structure such as translation initiation, nuclear and mitochondrial splicing, and ribosome and spliceosome assembly. Based on their distribution patterns, some members of this family are believed to be involved in embryogenesis, spermatogenesis, and cellular growth and division. This gene encodes a DEAD box protein with RNA helicase activity. It may participate in melting of DNA:RNA hybrids, such as those that occur during transcription, and may play a role in X-linked gene expression. It contains 2 copies of a double-stranded RNA-binding domain, a DEXH core domain and an RGG box. The RNA-binding domains and RGG box influence and regulate RNA helicase activity."[24]

Consensus sequences

As in other metazoans, for genes lacking a TATA box, the Inr is functionally analogous, with a base pair (bp) consensus 5'-YYA+1NWYY-3', to direct transcription initiation.[25] Using the degenerate nucleotide code, the consensus sequence is 5'-C/T-C/T-A-A/C/G/T-A/T-C/T-C/T-3', or in the direction of transcription on the template strand: 3'-C/T-C/T-A-A/C/G/T-A/T-C/T-C/T-5'.

"TATA-less core promoters that lack AT-rich sequences in the -30 region and do not stably bind TBP are likely to assemble PICs via alternative pathways and to be regulated by distinct mechanisms (Smale and Kadonaga, 2003). However, the number of such bona fide TATA-less genes remains unclear in eukaryotic genomes."[6]

In Entamoeba histolytica, the consensus sequence is AAAAATTCA.[26] Tested and none found either as direct or complement inverse.

The Inr has the consensus sequence YYANWYY.[27] Similarly to the TATA box, the Inr element facilitates the binding of transcription Factor II D (TATA binding protein TAF).[27]

Enhancers

An Inr for mammalian RNA polymerase II can be defined as a DNA sequence element that overlaps a TSS and is sufficient for

  1. determining the start site location in a promoter that lacks a TATA box and
  2. enhancing the strength of a promoter that contains a TATA box.[28]

TATA binding protein associated factors

"Although any isolated TAF may not exhibit sequence-specific interactions at the Inr element in the absence of a TATA-box, a combination of TAFs may bind sequence specifically to the Inr element regardless of the TATA-box and/or DPE (Chalkley and Verrijzer, 1999)."[29] Bold added.

TAF1 "binds to core promoter sequences encompassing the transcription start site. It also binds to activators and other transcriptional regulators, and these interactions affect the rate of transcription initiation."[30]

Prior to transcription, stable binding to an Inr occurs by a complex consisting of TAF1 and TAF2.[9]

TATA box-likes

The Inr is the only element in metazoan protein-encoding genes known to be a functional analog of the TATA box, in that it is sufficient for directing accurate transcription initiation in genes that lack TATA boxes.[31]

General transcription factor II As

General transcription factor II A is critical for the cooperative binding of TFIID to the Inr.[32]

General transcription factor II Ds

The general transcription factor II D (TFIID) is one of several general transcription factors that make up the RNA polymerase II preinitiation complex.[33] Before the start of transcription, the transcription factor II D (TFIID) complex, binds to the core promoter of the gene.[33]

TFIID is the first protein to bind to DNA during the formation of the pre-initiation transcription complex of RNA polymerase II (RNA Pol II).[33]

General transcription factor II Is

General transcription factor II I, or TFII-I, is a factor capable of binding the Inr element.[34][35]

Transcription start sites

Usually the Inr contains the TSS.

"[T]he initiator (INR) element located at, or immediately adjacent to, the TSS, ... is recognized by the TBP-associated factors TAF1 and TAF2 of the TFIID complex".[6]

"[T]ranscription does not need to begin at the +1 nucleotide for the Inr to function. RNA polymerase II has been redirected to alternative start sites by reducing ATP concentrations within a nuclear extract, by altering the spacing between the TATA and Inr in a promoter containing both elements, and by dinucleotide initiation strategies".[36]

Hypotheses

  1. A1BG is not transcribed by an initiator element.
  2. A1BG is not transcribed by a TATA box.

YYRNWYY (Juven-Gershon June 2008) samplings

The wider consensus sequence of 3'-YYRNWYY-5' allows a G at the TSS but at most only allows two Gs in a row.[37]

For the Basic programs (starting with SuccessablesInr.bas) written to compare nucleotide sequences with the sequences on either the template strand (-), or coding strand (+), of the DNA, in the negative direction (-), or the positive direction (+), the programs are, are looking for, and found:

  1. Negative strand, negative direction: 121, TTACTCC at 4557, TCACACT at 4361, TCGGACC at 4349, CCAGTTT at 4309, TCGGACC at 4300, CTGCACC at 4238, TCGGTCT at 4233, TCACTCT at 4202, TCGAACC at 4188, CCGGTCC at 4170, CCGTACC at 4107, CCGGTCC at 4102, TTACACT at 4092, TCACTCT at 4051, TTGTATC at 4046, TCGGACC at 4037, CCGGTCC at 3951, CTACTTT at 3922, TCATTCT at 3893, CTGGTCC at 3871, CTACACC at 3810, CTGTTCT at 3759, TTGGTCT at 3486, TTGATCT at 3463, CCGTATC at 3446, CCGAACT at 3401, TCGTTCT at 3374, TTGTTCT at 3340, TCGTTTT at 3313, TTGTTCT at 3307, TCGGACC at 3298, TCGGTTC at 3273, CCACACC at 3186, TTGTATT at 3169, CCACTTT at 3146, TTGTTCC at 3141, TCGGACC at 3128, CCGCACC at 3047, TTGATTC at 3031, CCGATTT at 3009, TTGATTC at 2914, TCGTACT at 2784, TCGGACC at 2770, TTGGACC at 2720, TCACACC at 2658, CCACTTT at 2619, TTGTACC at 2614, TCACACC at 2605, CCAGTCC at 2587, CCGGTCC at 2519, TCATTCT at 2503, TTGTTTT at 2490, TCGTTTT at 2476, TCACTCT at 2449, TCGGACC at 2435, TTGGACC at 2385, CCACTTT at 2282, TCGTACC at 2277, TCGGACC at 2268, TCAAACT at 2257, CCAGTCC at 2250, CCGCTTT at 2157, TTGTACC at 2152, TCAAACT at 2141, TCACATT at 2087, CCGGTCC at 2077, TTACACC at 2065, TCGTTCT at 2023, TCGGACC at 2009, TTGGACC at 1959, CCGTACT at 1953, CCGCACC at 1897, TTATACC at 1742, TTAATTT at 1697, TTGGATT at 1591, TTACTTT at 1582, CCGTTTT at 1561, TTGCTTC at 1555, CCACACT at 1479, TTGTTTT at 1394, TCGTTTT at 1371, TTATTCT at 1365, TCAGACC at 1356, TTGGATC at 1306, CCGCACC at 1244, CCACTTT at 1212, TTGTACC at 1207, TCGGACC at 1198, TCACTCT at 1079, TTGGACC at 1015, TTAGTCC at 984, CCGTACC at 953, TCGGTCC at 948, TCGCTCT at 913, TCGGACC at 899, TCGGTTC at 874, CTACACC at 787, TCGCACC at 741, TCGGACT at 732, CCAGTCC at 714, CCGGTTC at 692, CCGGTCC at 648, TTATACC at 605, CCAGTCC at 578, CCGGTTC at 556, TCGGACC at 508, TCACTTT at 473, TTGTATC at 468, TCGGACC at 459, CCAGTCC at 441, CCGGTTC at 419, CTGCTTT at 312, TCACTCT at 301, TTATACT at 274, TTGGTCC at 262, CTACATT at 247, CCATATT at 181, CCGTACT at 124, CCGTTTC at 93, CTATACC at 77, TTGTTCC at 71.
  2. Positive strand, negative direction: 40, TTAATTC at 4542, TCACATT at 4533, CCACTTT at 4461, CCACTCC at 4425, CCAGTTC at 4417, CTGCACT at 4340, CCGGACT at 4327, TCACACC at 3967, CCATACC at 3858, CTGAACC at 3784, CTGGACT at 3747, CCATTTC at 3688, CTGCTCC at 3582, CCAGATC at 3488, TTGCACT at 3289, TTGAACC at 3245, CTGCACC at 2761, TTGAACC at 2717, TTGAATC at 2708, CTGCACT at 2426, TCACACC at 2418, TTGAACC at 2382, CTACTCC at 2352, CTGCACT at 2000, TTATTTT at 1727, CTATATC at 1528, TCGCTCT at 1450, CCATTTC at 1380, CCAGTCT at 1354, TTGCACT at 1347, TTGCACC at 1339, TTGAACC at 1303, TCACACC at 1128, TCACTCC at 1058, TTGAACC at 1012, TCACACC at 882, TTGAACC at 846, CTGCATT at 152, TTGGACC at 32, CTGAATT at 20.
  3. Negative strand, positive direction: 45, CTGCACC at 4343, TTAGTTT at 4139, TTGATTT at 4134, TCACTCT at 4128, TCATTTT at 4120, TCACACC at 3824, CTGTTCC at 3625, CCAGACC at 3550, TCACACT at 3507, TTGCATC at 3402, CTGTTCC at 3352, TTGCACT at 3343, CCGCATC at 3328, CTGCACC at 3322, CTGCTCC at 3309, CTGGTCT at 3299, TCGCTCT at 3276, CTGGTCT at 3245, CCAGTCC at 3084, CCAGTCC at 2998, CTGCTCC at 2978, TCAGATT at 2868, CCACACT at 2636, CCACACC at 2602, TTATACC at 2590, CCGCACC at 2566, CTAATTT at 2440, CTACACC at 2430, TCACTCT at 2306, CTGTTTC at 2263, TCAATCT at 2235, CCAGATC at 2230, CTGCATT at 2206, TCATATT at 2178, TCGCTTC at 2095, CCAGTCC at 2026, CTATTTC at 1978, CCACTTC at 1914, CCAGACT at 1744, CTGCACT at 1472, CTGCACT at 1372, CCGGACT at 746, CCACACT at 345, CTGTTTT at 147, TTGTATT at 115.
  4. Positive strand, positive direction: 75, CCAGACC at 4416, CCACTCC at 4401, CTAAATC at 4136, CTACTCC at 4102, TTACTCC at 4096, CCACACT at 3971, TCACACC at 3966, TCAGACT at 3924, TCACTCC at 3878, CTGGACC at 3787, CCGGACC at 3758, CCGGACC at 3679, CCACTCC at 3647, TCACACT at 3594, CTGGTCT at 3548, TCGATCC at 3522, CCGATCC at 3484, CTACTCC at 3478, TCGGTCT at 3221, CTGGTTT at 3175, TTATACC at 3162, CCAGACC at 3021, CCGGACC at 2988, CCAGACT at 2943, CTGGTCC at 2876, CTAAACT at 2871, TTGCTCC at 2806, TCGATTC at 2789, TCGTTTT at 2707, TCAATCC at 2668, CTATATT at 2662, TCAGTCC at 2620, TCAGTTC at 2615, TCAGTCT at 2609, CCGGTCC at 2574, CCGCACT at 2555, TTGGTCT at 2228, CCAGTCT at 2222, CCGTTCT at 2190, CTACTTT at 2146, TTGTACT at 2141, TCAATTT at 2136, CCACACC at 1971, CCGTTCT at 1948, CCGCTCT at 1921, CCACACC at 1805, CCGCACT at 1720, CCGCTCT at 1565, TCGTTCC at 1511, CCGCTCT at 1481, CCGTTCC at 1427, CCGCTCT at 1381, CCGTTCC at 1327, CCGTTCC at 1259, CCGCTCT at 1229, CCGGTCC at 1175, TCGCTCT at 1061, CCGTTCC at 1007, TTGGACC at 947, TCGGTCT at 935, CCGTTCC at 923, TTGGACC at 847, TCGGTCT at 835, CCGTTCC at 823, CCGGACT at 725, CCGTTCC at 671, CCGCTCT at 641, CCGTTCC at 587, CCGCTCT at 557, TCGGTCC at 515, CCGTTCC at 503, CCGGACC at 286, TTACACT at 230, CCGGTCC at 215, CTGGACC at 40.
  5. inverse complement, negative strand, negative direction: 32, AGTGTAA at 4533, GGTCCGA at 4255, AGTACGG at 4118, AGTGTGG at 3967, GGTCCGG at 3873, GGTATGG at 3858, GGTCTAG at 3488, AGTCCGA at 3398, AGTGCGG at 3281, GGACCGG at 3130, GATTCGA at 3033, AAAGTAG at 2887, AGTACGG at 2753, AGTACGG at 2535, AGTGTGG at 2418, AGTGCGG at 2208, AGTGCGG at 1992, GGACCGA at 1843, AGTGCAG at 1773, AAAATAG at 1730, AGAACGG at 1608, GATATAG at 1528, GGTCCGA at 1462, AGAGCGA at 1448, GGACCGG at 1200, AGTGTGG at 1128, GAAGTGA at 1056, AGTGTGG at 882, GGACTGG at 734, AGTGCGG at 664, GGACCGA at 598, GATACAA at 213.
  6. inverse complement, positive strand, negative direction: 100, GGAATGA at 4555, AGTCCAA at 4502, AGTGTGA at 4361, AAAATAA at 4221, AGTTCAA at 4177, AATGTGA at 4092, AAAATAA at 4071, AGACCAG at 4032, AGTTCAA at 4026, GGAGTAA at 3891, GGACCAG at 3870, GATGTGG at 3810, AATGCAG at 3772, GGACTGG at 3749, GGAACAG at 3725, AATCCAG at 3681, AAACCAG at 3485, GAACTAG at 3462, GAAGTGA at 3410, AAATTGA at 3358, AAAACAA at 3330, AGAGCAA at 3311, GGTGTGG at 3186, AAATTAG at 3176, AGACCAG at 3123, AAACTAA at 3030, AAAATAA at 3013, AGAATGG at 3004, AAAACAA at 2842, AGTGTGG at 2658, AAATCAG at 2649, AGTGTGG at 2605, AGACCAG at 2600, AAAACAA at 2509, AAAGCAA at 2480, AAAGCAA at 2474, GATTCGG at 2454, AGAGTGA at 2447, AAACTAG at 2313, AATACAA at 2305, AGACCAG at 2263, AGTTTGA at 2257, GGTGCGG at 2197, AAAATGA at 2187, GATACAA at 2180, AGACCAA at 2147, AGTTTGA at 2141, AGTGTAA at 2087, GGTGCAG at 2082, AATGTGG at 2065, AGAGCAA at 2021, AGAATGG at 1948, AGACTGA at 1935, AAATTAG at 1887, AATACAA at 1878, AATATGG at 1742, GAATTAA at 1696, AAAGCGG at 1680, GAAATGA at 1663, GAAACAA at 1585, AATACAG at 1566, AGAACGA at 1553, AGTGCAA at 1536, GGTGTGA at 1479, AGTGCAG at 1471, AAAACAA at 1388, AGAGCAA at 1369, AGTCTGG at 1356, AAATTAG at 1234, AGAGTGA at 1077, AGATTGG at 1045, GATCCAG at 975, AGAGCGA at 911, GATGTGG at 787, AAATTAG at 777, AATACAA at 769, AGACCAG at 727, AGTTCGA at 721, AAATTGG at 643, AATACAA at 635, AATATGG at 605, AGATTGA at 585, AAATTAG at 499, AATACGA at 492, AGTGCGA at 448, GGTGCGG at 380, AAACTGA at 307, AGAACAG at 288, AATATGA at 274, AAACCAG at 261, AGTTCAA at 255, GATGTAA at 247, GAAACAA at 229, GGTATAA at 181, AAAACAG at 167, AAACTGA at 130, GATATGG at 77, AAAACAA at 69, GGACCAG at 34, AGACTGA at 17.
  7. inverse complement, negative strand, positive direction: 61, GGAACAG at 4445, GGTCTGG at 4416, GGAGTGA at 4350, GATTTAG at 4136, GAAATGA at 4094, AGAACAG at 4069, AGAGTGG at 4040, GGTGTGA at 3971, AGTGTGG at 3966, AGTCTGA at 3924, AGAGTGA at 3876, GAACCAG at 3840, AGAATGA at 3835, AATCCGA at 3799, GAAGCGG at 3670, AGTGTGA at 3594, GGAATGA at 3567, GGACCAG at 3547, AGTGCAG at 3465, GATGCAG at 3460, GGAATGA at 3441, GGACCAA at 3174, GAAATGG at 3168, AATATGG at 3162, GGTCTGG at 3021, GGTCTGA at 2943, GATTTGA at 2871, AGAATGA at 2841, GGTGCAA at 2801, AAAGTGG at 2711, AGAGCAA at 2705, GGACTGA at 2674, GATATAA at 2662, GAAATAG at 2626, GGTGCAA at 2335, AGTGCAG at 2327, AGATCAA at 2232, GAACCAG at 2227, AGTGCAG at 2064, AAAGCAG at 2007, GGTGTGG at 1971, GAACTGG at 1953, GGTGTGG at 1805, AGTGCAG at 1787, GGTGCGG at 1764, GAAGCGG at 1636, AGTGCGG at 1590, AATGCGG at 1422, AATGCGG at 1322, AGTGCGG at 1254, AGTGCGG at 1170, AGTGCGG at 1086, GGTGCAG at 784, AGTGCGG at 666, AGTGCGG at 582, AGTGCGG at 498, GGTGCGG at 489, AGACCGG at 442, GGAGCGA at 429, AATGTGA at 230, AGAGTGG at 53.
  8. inverse complement, positive strand, positive direction: 75, AGAACGA at 4390, GGTACGA at 4372, AGTACAG at 4366, GGAGTAA at 4309, GGACTGG at 4216, GAAACGG at 4210, AAATCAA at 4138, GAACTAA at 4133, AAAATAG at 4123, GAACTGG at 4018, AGTGTGG at 3824, GGACCGG at 3681, AGAGTGG at 3612, GGTCTGG at 3550, GATCCGA at 3524, AGTGTGA at 3507, GGAACGG at 3375, GGTACAA at 3337, AGAGTGA at 3317, GGACCAG at 3298, AGTGCAG at 3255, GAAGTAG at 3250, GGACCAA at 3049, AGTCCGG at 3036, AGACCAA at 3023, GGTCCAG at 3018, GGAACAG at 3003, GGACCGG at 2990, AGACCGG at 2985, AGACTGA at 2945, GGAGTAA at 2902, AGACCGA at 2885, GGTCCGG at 2878, AAACTGG at 2873, AGTCTAA at 2868, GGTGTGA at 2636, AGTTCAG at 2617, GGTGTGG at 2602, AATATGG at 2590, GGACCGG at 2571, GGTACAA at 2475, AGAGTGG at 2470, GGACCGA at 2435, GATGTGG at 2430, AATCCGA at 2368, GGTCCGA at 2318, AAAGTGA at 2304, AGAGTGG at 2247, GGTCTAG at 2230, GGACTGG at 2213, AGTATAA at 2178, GAAGTAG at 2110, AGAATGG at 1888, GGTCCGG at 1857, GGACCGA at 1817, GGTCTGA at 1744, GGACTGG at 1662, GATGCGA at 1576, AATTCGG at 1541, GAAGCGG at 1408, GAAGCGG at 1308, AAAGCAG at 1183, GGTCCGA at 1177, GGACCGG at 949, GGACCGG at 849, GGTGCGA at 777, GATGCGA at 652, GAAGCGG at 595, AGAATGA at 524, GAAGCGG at 459, GGTGTGA at 345, GGTCCAG at 217, AATCCAG at 152, AGTCCGG at 92, GGTCCGA at 10.

YYR (4560-2846) UTRs

YYR UTR negative strand

  1. Negative strand, negative direction: TTACTCC at 4557, TCACACT at 4361, TCGGACC at 4349, CCAGTTT at 4309, TCGGACC at 4300, CTGCACC at 4238, TCGGTCT at 4233, TCACTCT at 4202, TCGAACC at 4188, CCGGTCC at 4170, CCGTACC at 4107, CCGGTCC at 4102, TTACACT at 4092, TCACTCT at 4051, TTGTATC at 4046, TCGGACC at 4037, CCGGTCC at 3951, CTACTTT at 3922, TCATTCT at 3893, CTGGTCC at 3871, CTACACC at 3810, CTGTTCT at 3759, TTGGTCT at 3486, TTGATCT at 3463, CCGTATC at 3446, CCGAACT at 3401, TCGTTCT at 3374, TTGTTCT at 3340, TCGTTTT at 3313, TTGTTCT at 3307, TCGGACC at 3298, TCGGTTC at 3273, CCACACC at 3186, TTGTATT at 3169, CCACTTT at 3146, TTGTTCC at 3141, TCGGACC at 3128, CCGCACC at 3047, TTGATTC at 3031, CCGATTT at 3009, TTGATTC at 2914.
  2. Negative strand, negative direction: AGTGTAA at 4533, GGTCCGA at 4255, AGTACGG at 4118, AGTGTGG at 3967, GGTCCGG at 3873, GGTATGG at 3858, GGTCTAG at 3488, AGTCCGA at 3398, AGTGCGG at 3281, GGACCGG at 3130, GATTCGA at 3033, AAAGTAG at 2887.

YYR UTR positive strand

  1. Positive strand, negative direction: TTAATTC at 4542, TCACATT at 4533, CCACTTT at 4461, CCACTCC at 4425, CCAGTTC at 4417, CTGCACT at 4340, CCGGACT at 4327, TCACACC at 3967, CCATACC at 3858, CTGAACC at 3784, CTGGACT at 3747, CCATTTC at 3688, CTGCTCC at 3582, CCAGATC at 3488, TTGCACT at 3289, TTGAACC at 3245.
  2. Positive strand, negative direction: GGAATGA at 4555, AGTCCAA at 4502, AGTGTGA at 4361, AAAATAA at 4221, AGTTCAA at 4177, AATGTGA at 4092, AAAATAA at 4071, AGACCAG at 4032, AGTTCAA at 4026, GGAGTAA at 3891, GGACCAG at 3870, GATGTGG at 3810, AATGCAG at 3772, GGACTGG at 3749, GGAACAG at 3725, AATCCAG at 3681, AAACCAG at 3485, GAACTAG at 3462, GAAGTGA at 3410, AAATTGA at 3358, AAAACAA at 3330, AGAGCAA at 3311, GGTGTGG at 3186, AAATTAG at 3176, AGACCAG at 3123, AAACTAA at 3030, AAAATAA at 3013, AGAATGG at 3004.

YYR negative direction (2846-2811) core promoters

  1. Positive strand, negative direction: AAAACAA at 2842.

YYR positive direction (4445-4265) core promoters

  1. Negative strand, positive direction: CTGCACC at 4343.
  2. Negative strand, positive direction: GGAACAG at 4445, GGTCTGG at 4416, GGAGTGA at 4350.
  3. Positive strand, positive direction: CCAGACC at 4416, CCACTCC at 4401.
  4. Positive strand, positive direction: AGAACGA at 4390, GGTACGA at 4372, AGTACAG at 4366, GGAGTAA at 4309.

YYR negative direction (2811-2596) proximal promoters

  1. Negative strand, negative direction: TCGTACT at 2784, TCGGACC at 2770, TTGGACC at 2720, TCACACC at 2658, CCACTTT at 2619, TTGTACC at 2614, TCACACC at 2605.
  2. Negative strand, negative direction: AGTACGG at 2753.
  3. Positive strand, negative direction: CTGCACC at 2761, TTGAACC at 2717, TTGAATC at 2708.
  4. Positive strand, negative direction: AAATCAG at 2649, AGACCAG at 2600.

YYR positive direction (4265-4050) proximal promoters

  1. Negative strand, positive direction: TTAGTTT at 4139, TTGATTT at 4134, TCACTCT at 4128, TCATTTT at 4120.
  2. Negative strand, positive direction: GATTTAG at 4136, GAAATGA at 4094, AGAACAG at 4069.
  3. Positive strand, positive direction: CTACTCC at 4102, TTACTCC at 4096.
  4. Positive strand, positive direction: GGACTGG at 4216, GAAACGG at 4210, AAATCAA at 4138, GAACTAA at 4133, AAAATAG at 4123.

YYR negative direction (2596-1) distal promoters

YYR distal negative strand

  1. Negative strand, negative direction: CCAGTCC at 2587, CCGGTCC at 2519, TCATTCT at 2503, TTGTTTT at 2490, TCGTTTT at 2476, TCACTCT at 2449, TCGGACC at 2435, TTGGACC at 2385, CCACTTT at 2282, TCGTACC at 2277, TCGGACC at 2268, TCAAACT at 2257, CCAGTCC at 2250, CCGCTTT at 2157, TTGTACC at 2152, TCAAACT at 2141, TCACATT at 2087, CCGGTCC at 2077, TTACACC at 2065, TCGTTCT at 2023, TCGGACC at 2009, TTGGACC at 1959, CCGTACT at 1953, CCGCACC at 1897, TTATACC at 1742, TTAATTT at 1697, TTGGATT at 1591, TTACTTT at 1582, CCGTTTT at 1561, TTGCTTC at 1555, CCACACT at 1479, TTGTTTT at 1394, TCGTTTT at 1371, TTATTCT at 1365, TCAGACC at 1356, TTGGATC at 1306, CCGCACC at 1244, CCACTTT at 1212, TTGTACC at 1207, TCGGACC at 1198, TCACTCT at 1079, TTGGACC at 1015, TTAGTCC at 984, CCGTACC at 953, TCGGTCC at 948, TCGCTCT at 913, TCGGACC at 899, TCGGTTC at 874, CTACACC at 787, TCGCACC at 741, TCGGACT at 732, CCAGTCC at 714, CCGGTTC at 692, CCGGTCC at 648, TTATACC at 605, CCAGTCC at 578, CCGGTTC at 556, TCGGACC at 508, TCACTTT at 473, TTGTATC at 468, TCGGACC at 459, CCAGTCC at 441, CCGGTTC at 419, CTGCTTT at 312, TCACTCT at 301, TTATACT at 274, TTGGTCC at 262, CTACATT at 247, CCATATT at 181, CCGTACT at 124, CCGTTTC at 93, CTATACC at 77, TTGTTCC at 71.
  2. Negative strand, negative direction: AGTACGG at 2535, AGTGTGG at 2418, AGTGCGG at 2208, AGTGCGG at 1992, GGACCGA at 1843, AGTGCAG at 1773, AAAATAG at 1730, AGAACGG at 1608, GATATAG at 1528, GGTCCGA at 1462, AGAGCGA at 1448, GGACCGG at 1200, AGTGTGG at 1128, GAAGTGA at 1056, AGTGTGG at 882, GGACTGG at 734, AGTGCGG at 664, GGACCGA at 598, GATACAA at 213.

YYR distal positive strand

  1. Positive strand, negative direction: AAAACAA at 2509, AAAGCAA at 2480, AAAGCAA at 2474, GATTCGG at 2454, AGAGTGA at 2447, AAACTAG at 2313, AATACAA at 2305, AGACCAG at 2263, GGTGCGG at 2197, AAAATGA at 2187, GATACAA at 2180, AGACCAA at 2147, AGTTTGA at 2141, AGTGTAA at 2087, GGTGCAG at 2082, AATGTGG at 2065, AGAGCAA at 2021, AGAATGG at 1948, AGACTGA at 1935, AAATTAG at 1887, AATACAA at 1878, AATATGG at 1742, GAATTAA at 1696, AAAGCGG at 1680, GAAATGA at 1663, GAAACAA at 1585, AATACAG at 1566, AGAACGA at 1553, AGTGCAA at 1536, GGTGTGA at 1479, AGTGCAG at 1471, AAAACAA at 1388, AGAGCAA at 1369, AGTCTGG at 1356, AAATTAG at 1234, AGAGTGA at 1077, AGATTGG at 1045, GATCCAG at 975, AGAGCGA at 911, GATGTGG at 787, AAATTAG at 777, AATACAA at 769, AGACCAG at 727, AGTTCGA at 721, AAATTGG at 643, AATACAA at 635, AATATGG at 605, AGATTGA at 585, AAATTAG at 499, AATACGA at 492, AGTGCGA at 448, GGTGCGG at 380, AAACTGA at 307, AGAACAG at 288, AATATGA at 274, AAACCAG at 261, AGTTCAA at 255, GATGTAA at 247, GAAACAA at 229, GGTATAA at 181, AAAACAG at 167, AAACTGA at 130, GATATGG at 77, AAAACAA at 69, GGACCAG at 34, AGACTGA at 17.
  2. Positive strand, negative direction: CTGCACT at 2426, TTGAACC at 2382, CTACTCC at 2352, CTGCACT at 2000, TTATTTT at 1727, TCGCTCT at 1450, CCATTTC at 1380, CCAGTCT at 1354, TTGCACT at 1347, TTGCACC at 1339, TTGAACC at 1303, TCACTCC at 1058, TTGAACC at 1012, TTGAACC at 846, CTGCATT at 152, TTGGACC at 32, CTGAATT at 20.

YYR positive direction (4050-1) distal promoters

YYR negative strand distal

  1. Negative strand, positive direction: TCACACC at 3824, CTGTTCC at 3625, CCAGACC at 3550, TCACACT at 3507, TTGCATC at 3402, CTGTTCC at 3352, TTGCACT at 3343, CCGCATC at 3328, CTGCACC at 3322, CTGCTCC at 3309, CTGGTCT at 3299, TCGCTCT at 3276, CTGGTCT at 3245, CCAGTCC at 3084, CCAGTCC at 2998, CTGCTCC at 2978, TCAGATT at 2868, CCACACT at 2636, CCACACC at 2602, TTATACC at 2590, CCGCACC at 2566, CTAATTT at 2440, CTACACC at 2430, TCACTCT at 2306, CTGTTTC at 2263, TCAATCT at 2235, CCAGATC at 2230, CTGCATT at 2206, TCATATT at 2178, TCGCTTC at 2095, CCAGTCC at 2026, CTATTTC at 1978, CCACTTC at 1914, CCAGACT at 1744, CTGCACT at 1472, CTGCACT at 1372, CCGGACT at 746, CCACACT at 345, CTGTTTT at 147, TTGTATT at 115.
  2. Negative strand, positive direction: AGAGTGG at 4040, GGTGTGA at 3971, AGTGTGG at 3966, AGTCTGA at 3924, AGAGTGA at 3876, GAACCAG at 3840, AGAATGA at 3835, AATCCGA at 3799, GAAGCGG at 3670, AGTGTGA at 3594, GGAATGA at 3567, GGACCAG at 3547, AGTGCAG at 3465, GATGCAG at 3460, GGAATGA at 3441, GGACCAA at 3174, GAAATGG at 3168, AATATGG at 3162, GGTCTGG at 3021, GGTCTGA at 2943, GATTTGA at 2871, AGAATGA at 2841, GGTGCAA at 2801, AAAGTGG at 2711, AGAGCAA at 2705, GGACTGA at 2674, GATATAA at 2662, GAAATAG at 2626, GGTGCAA at 2335, AGTGCAG at 2327, AGATCAA at 2232, GAACCAG at 2227, AGTGCAG at 2064, AAAGCAG at 2007, GGTGTGG at 1971, GAACTGG at 1953, GGTGTGG at 1805, AGTGCAG at 1787, GGTGCGG at 1764, GAAGCGG at 1636, AGTGCGG at 1590, AATGCGG at 1422, AATGCGG at 1322, AGTGCGG at 1254, AGTGCGG at 1170, AGTGCGG at 1086, GGTGCAG at 784, AGTGCGG at 666, AGTGCGG at 582, AGTGCGG at 498, GGTGCGG at 489, AGACCGG at 442, GGAGCGA at 429, AATGTGA at 230, AGAGTGG at 53.

YYR positive strand distal

  1. Positive strand, positive direction: CCACACT at 3971, TCACACC at 3966, TCAGACT at 3924, TCACTCC at 3878, CTGGACC at 3787, CCGGACC at 3758, CCGGACC at 3679, CCACTCC at 3647, TCACACT at 3594, CTGGTCT at 3548, TCGATCC at 3522, CCGATCC at 3484, CTACTCC at 3478, TCGGTCT at 3221, CTGGTTT at 3175, TTATACC at 3162, CCAGACC at 3021, CCGGACC at 2988, CCAGACT at 2943, CTGGTCC at 2876, CTAAACT at 2871, TTGCTCC at 2806, TCGATTC at 2789, TCGTTTT at 2707, TCAATCC at 2668, CTATATT at 2662, TCAGTCC at 2620, TCAGTTC at 2615, TCAGTCT at 2609, CCGGTCC at 2574, CCGCACT at 2555, TTGGTCT at 2228, CCAGTCT at 2222, CCGTTCT at 2190, CTACTTT at 2146, TTGTACT at 2141, TCAATTT at 2136, CCACACC at 1971, CCGTTCT at 1948, CCGCTCT at 1921, CCACACC at 1805, CCGCACT at 1720, CCGCTCT at 1565, TCGTTCC at 1511, CCGCTCT at 1481, CCGTTCC at 1427, CCGCTCT at 1381, CCGTTCC at 1327, CCGTTCC at 1259, CCGCTCT at 1229, CCGGTCC at 1175, TCGCTCT at 1061, CCGTTCC at 1007, TTGGACC at 947, TCGGTCT at 935, CCGTTCC at 923, TTGGACC at 847, TCGGTCT at 835, CCGTTCC at 823, CCGGACT at 725, CCGTTCC at 671, CCGCTCT at 641, CCGTTCC at 587, CCGCTCT at 557, TCGGTCC at 515, CCGTTCC at 503, CCGGACC at 286, TTACACT at 230, CCGGTCC at 215, CTGGACC at 40.
  2. Positive strand, positive direction: GAACTGG at 4018, AGTGTGG at 3824, GGACCGG at 3681, AGAGTGG at 3612, GGTCTGG at 3550, GATCCGA at 3524, AGTGTGA at 3507, GGAACGG at 3375, GGTACAA at 3337, AGAGTGA at 3317, GGACCAG at 3298, AGTGCAG at 3255, GAAGTAG at 3250, GGACCAA at 3049, AGTCCGG at 3036, AGACCAA at 3023, GGTCCAG at 3018, GGAACAG at 3003, GGACCGG at 2990, AGACCGG at 2985, AGACTGA at 2945, GGAGTAA at 2902, AGACCGA at 2885, GGTCCGG at 2878, AAACTGG at 2873, AGTCTAA at 2868, GGTGTGA at 2636, AGTTCAG at 2617, GGTGTGG at 2602, AATATGG at 2590, GGACCGG at 2571, GGTACAA at 2475, AGAGTGG at 2470, GGACCGA at 2435, GATGTGG at 2430, AATCCGA at 2368, GGTCCGA at 2318, AAAGTGA at 2304, AGAGTGG at 2247, GGTCTAG at 2230, GGACTGG at 2213, AGTATAA at 2178, GAAGTAG at 2110, AGAATGG at 1888, GGTCCGG at 1857, GGACCGA at 1817, GGTCTGA at 1744, GGACTGG at 1662, GATGCGA at 1576, AATTCGG at 1541, GAAGCGG at 1408, GAAGCGG at 1308, AAAGCAG at 1183, GGTCCGA at 1177, GGACCGG at 949, GGACCGG at 849, GGTGCGA at 777, GATGCGA at 652, GAAGCGG at 595, AGAATGA at 524, GAAGCGG at 459, GGTGTGA at 345, GGTCCAG at 217, AATCCAG at 152, AGTCCGG at 92, GGTCCGA at 10.

YYRNWYY (Juven-Gershon June 2008) random dataset samplings

  1. Inrr0: 91, CTACTCC at 4542, CCGCTTT at 4421, CTGTTTT at 4376, TTGCACT at 4356, CCGGATC at 4322, TCGTACC at 4284, CTGGATC at 4271, TTATTTT at 4202, TCATACC at 4058, CCATTTT at 4003, TCATATC at 3963, CTAAATT at 3926, CCGTATC at 3859, TCGCACT at 3751, TTAATTT at 3694, CTGGTCC at 3612, CCGCTCT at 3527, TTGCACC at 3477, CCACTTT at 3412, TCATTCC at 3390, CCAATTT at 3353, CTGCTTC at 3334, TTGTTTT at 3182, TTAGTCC at 3134, CCGTACC at 3058, TTAGTTT at 3030, TCAGTTT at 2966, TTAGATT at 2856, TCATATT at 2851, CTAAACC at 2659, CCAATTT at 2616, CCGTTTT at 2566, CTAAATC at 2559, TTAATCC at 2507, CCAATTT at 2494, TCGGTTT at 2479, TTGTTCC at 2431, TTATATT at 2373, CTAGTTT at 2343, CTGCTCC at 2299, CTAGACT at 2294, CCACACC at 2164, TTATATT at 2143, CTACTTT at 2109, TCAAATT at 1988, CTGTTCT at 1979, CCAGATC at 1949, CTATTTC at 1911, TTACTCT at 1906, TTGATTT at 1819, TCAGATT at 1774, TCGCTTT at 1768, TTGAACC at 1692, CCGGTTT at 1687, TTGTTTT at 1671, CTGGTTT at 1638, CTATTTT at 1569, TCGTATC at 1544, TCGCTCT at 1513, TTGATTT at 1459, CCAGTCT at 1373, TCACTTT at 1363, CCGCATT at 1265, TCGAACC at 1260, TCGTTCT at 1229, TCGTACC at 1176, CCACATT at 1165, TCAATCC at 1160, CCAAACC at 1087, CTACTTT at 1020, CCGGATT at 952, CCACTCC at 947, TCGAACT at 897, TCACATT at 889, CTGGTTC at 884, CCAAATT at 862, CCGAATT at 811, TCGTTCT at 651, TTGTTCC at 619, CTATTTT at 494, TCGCATT at 481, CTAATTT at 474, TTGCATC at 364, CCAAACC at 283, CCGTATT at 267, TTGTATC at 156, CCAGATT at 128, TTGGTTC at 84, TTGAATT at 79, CCATACT at 41, TCGAACC at 23.
  2. Inrr1: 75, TTGATCT at 4399, TTAGTTT at 4270, TTGCTTT at 4225, CCGAATT at 4218, CCGCACT at 4117, CTACATC at 4096, CCGATCC at 4080, CCGTACT at 4072, CTGCTCT at 4064, CTGGTCT at 3958, TCAGTCT at 3949, CCGAATT at 3780, CTGTATC at 3523, CCATTTC at 3409, TCGAATC at 3228, CCGAACC at 3188, CCGAATT at 2977, CCGTACT at 2932, CCGAATT at 2897, CTAGACC at 2765, CCGTATC at 2689, CCGGTCT at 2670, TCGAACC at 2642, CTATACC at 2576, CCAATTT at 2565, TTAAACC at 2537, TCAGATT at 2448, TTGGTCC at 2430, CTAATCC at 2377, CCATTCT at 2295, CCAGATC at 2250, TCACTTT at 2202, TTGTTTC at 2163, TCATATT at 1995, TCGGTTT at 1976, CTAAACT at 1833, TCAGATC at 1798, CCGGATC at 1789, TCAGACT at 1754, TTAAATT at 1670, CCGTTTT at 1663, TCGTTTC at 1656, TTGTTCC at 1642, TCGGTTC at 1499, CTAAATT at 1492, CTGGTTT at 1392, TCGATCT at 1336, CCATTCT at 1318, CTAATTT at 1243, CTGCTCC at 1223, CTGTTCT at 1218, TTGCATT at 1142, CTACTCC at 1094, TTGCATT at 1025, TTAAACT at 976, CTGGTTT at 922, CTAGTCT at 904, CTAATTT at 827, TTATTCC at 815, CCAATTT at 788, TCATTCC at 774, CTATTTT at 767, CCGTATC at 739, TTGATTC at 707, TTACACT at 528, TTGAACC at 392, TTACACT at 371, TTATATT at 366, TCAGATT at 361, CCGCTTT at 329, TTAAATT at 312, CCGCACC at 235, CTGCTCT at 133, TCGATTC at 110, CCATATC at 58.
  3. Inrr2: 78, TCAATTT at 4483, TCAAACT at 4426, TTGGACT at 4418, CCAATCC at 4399, TCAAATC at 4310, TCGGACC at 4268, TCGTATC at 4246, CCGTTCT at 4121, CCACATT at 4068, CTGTTCC at 4063, CCGAACT at 4058, CCGTACT at 4001, CCAATCT at 3962, CTGATCT at 3762, CTACATT at 3722, CTGGTCC at 3599, TCGATTT at 3557, CTGGTTT at 3492, TCGCATT at 3483, CTGTATC at 3478, TCGGATC at 3436, CTATTTC at 3405, CCGGTCT at 3400, CCATTTT at 3381, CTAAACT at 3364, TCGCTTT at 3341, TTAATCC at 3231, CCGAATC at 2991, TTGGTCT at 2962, TCGGACC at 2888, CTGTATT at 2882, TCGTTCC at 2876, TCGCATT at 2752, CCGTTTT at 2528, CTAAATT at 2518, TCACACC at 2423, CTATATC at 2414, CCGTTTT at 2369, TCGTATC at 2356, TTGTTTC at 2349, CCGTATC at 2322, CCGAACT at 2237, CCACTTC at 2100, TCACTCC at 2090, TCACTTT at 2054, CTAGTCT at 1899, TTATACC at 1783, TTAAATT at 1769, CCAATTC at 1614, CCGGATT at 1580, CCGAACC at 1531, CTGTTCC at 1516, TCGGTCC at 1477, TTGGATC at 1418, TCGCATT at 1413, TTGGTCC at 1400, CTGTATT at 1372, TCAATCC at 1360, TTGGTCT at 1338, TTGTATT at 1273, CCGGTCT at 1214, TTGATCT at 1193, TTACATT at 1188, CCGTTTC at 1145, CCGCACT at 1105, CTGTTTC at 1099, TCAATTC at 1074, CTACACT at 900, CCACTTT at 888, CCAAACT at 745, TCGTTTC at 739, TCGTTTC at 718, CCGGTTC at 635, CCAGACC at 607, TCAATCC at 568, TCGTTTT at 446, CCAGACT at 355, CTATTTT at 34.
  4. Inrr3: 96, CCGATCC at 4552, CCAGTTC at 4541, TTGATCT at 4373, TTAATTT at 4349, CCACATT at 4285, CCGTACC at 4247, CCAAACC at 4239, CTAAATC at 4232, TTAGTTC at 4218, TTAAATT at 4193, TCATACC at 4183, TTAGTTC at 4158, TTGTTTT at 4101, CTAATTT at 4093, TTGCTTT at 4069, TTAATTT at 4050, CCAAATC at 4021, TTGGTCT at 4012, TTGTTTC at 4005, TTGTATT at 4000, CCAGTTC at 3987, TCACTCT at 3847, TTAGATC at 3824, CCGCATC at 3814, TCGATTT at 3688, CCAGTCC at 3570, CTACTTC at 3564, TCAATCC at 3516, TCATTCC at 3472, TCGGTCT at 3466, CTGAATC at 3461, CCAGTCT at 3216, CTGCTCC at 3182, TTGAACC at 3172, TCGGTCC at 3127, CCATTCC at 3088, CTGCACC at 3076, TTAGTCC at 3027, TTAGTTT at 3021, CCACTCT at 2998, TTACTCC at 2977, CCGGTCC at 2887, TCATACT at 2858, TCAGTCT at 2821, TCGCACT at 2803, TTGTTTC at 2784, TCAAATC at 2765, TCAATTC at 2760, CCGCTTC at 2524, TTGAATC at 2517, CCATTTC at 2505, CTAGTCC at 2336, CCGTTCT at 2159, CCAAACT at 2103, CCGTTTC at 2092, CTGCTTT at 2025, TTGATTT at 2018, CTAGACC at 1961, TTAAACT at 1852, CCGCATT at 1827, CCGATCT at 1756, CTACACT at 1742, TCGCATC at 1734, TTGCACC at 1721, TCAGTCT at 1659, CCAAATT at 1645, CTATTCC at 1640, CCGGTCC at 1585, CTGAATC at 1554, TCGGATT at 1488, CCATTCC at 1461, TTACTCC at 1383, TTAATTC at 1251, TTGGTCC at 1234, CCACTTC at 1180, TTGCTCT at 1152, CCATACC at 1079, TTGTTTC at 1030, TCATTTT at 925, TTGATCC at 847, CTACACT at 838, CCAATCC at 832, CTACATT at 804, CCAAATT at 795, TCATTCC at 481, CCAAATT at 445, CCACACC at 436, CCGTACC at 417, TTAATTT at 408, CCATTTT at 403, CCGGTCC at 347, TTACATT at 321, TTGCTTT at 316, TTGCTTT at 287, TTATTCC at 261, TTGGTCT at 26.
  5. Inrr4: 64, CCGTTTT at 4536, TTACTTT at 4523, TTGGTTT at 4423, CCAAATT at 4372, TCGGACT at 4219, TCAAACC at 4190, TTAAACC at 4151, TTAATCT at 3984, CTGGATC at 3922, CTACTCT at 3852, CCACTCT at 3839, CCGAACT at 3815, TTATATT at 3721, CTACATT at 3698, CTAATTT at 3672, TTGTTTT at 3533, TTATTTT at 3515, TTAAACC at 3374, TCGAATC at 3346, TCGGTCC at 3299, CTGTACT at 3290, TTAGATC at 3132, TTATTTC at 3018, CCAAACT at 2831, CTAAATC at 2816, CCGAATC at 2738, TCGGTCC at 2666, CCGTTTT at 2633, CCATTTC at 2547, TCGGTTC at 2372, TTATATT at 2296, TTATTTT at 2255, CTACTTT at 2127, TCATTCC at 2113, CCACTTC at 2081, TTACTTT at 2018, CTGGTTC at 1927, CTGCTCC at 1917, TTGTATT at 1795, CCAATTC at 1786, CCAATCT at 1763, CCACTTC at 1718, CCATTCC at 1552, CCGAATC at 1273, TTGTACC at 1217, TCGTTTC at 1187, CTGCTCT at 1066, CTACACT at 1053, TTGGATT at 840, CCGCTTT at 782, TTACACT at 624, TTAGTCT at 609, CCGCTTC at 589, TCAAATT at 478, TCAATTC at 440, TCGTTCT at 371, TCGTTTT at 361, CCATTCT at 310, CTAAACC at 303, CCAAACC at 231, CTGGTCC at 206, TCAATTT at 120, CCGCACC at 106, TCGATCT at 66.
  6. Inrr5: 77, TTGGTTT at 4536, TCGCACC at 4511, TCGATTT at 4468, CCGCTTT at 4441, TTGATTT at 4195, TTAATCT at 4184, CCAATCC at 4158, TCAGATC at 4030, CCGTTCT at 3992, TTGAACC at 3941, CTAATCC at 3853, CTGAACT at 3798, CTATTTT at 3704, CCGTTCT at 3692, TTATTTC at 3656, CCAAACT at 3513, TCGAATC at 3449, CTACTTC at 3444, TCGAATT at 3279, TTGAATC at 3274, TTGGTTT at 3269, TCAGACC at 3239, TCACTTC at 3219, TCGCTTT at 3095, CCGGACC at 3025, TCAATTC at 2989, CCGGTTC at 2984, TCGTACC at 2943, CTGCTTC at 2914, CCGAACC at 2891, CTGAACC at 2869, TCGAACC at 2863, CCGTTCC at 2821, TTAATCC at 2648, CCATATT at 2628, CCATTTT at 2601, TCGTTTC at 2544, CCGCTTC at 2524, TTGTTCT at 2442, TCGCACC at 2185, TTATATC at 2180, CCGCATC at 2023, CCGCTTT at 1975, CCGATTC at 1930, TCAAATT at 1886, CCACTCC at 1673, CCATTTC at 1609, TCGAATT at 1558, CTGGACC at 1527, CTAGACC at 1457, TTGATTT at 1447, CTAGTTT at 1413, TCAGTTT at 1358, TTAATCT at 1317, TCAAATT at 1271, TTGCATC at 1155, TTATTCC at 1114, CTAATTT at 1109, TCGTTTT at 1094, TTGGTCT at 1014, CCAGACC at 1002, TTGTTCT at 958, CCAGATC at 943, TCATACC at 938, CCAATCC at 913, CCAGATT at 884, TTATTTT at 860, CTGTTTT at 831, TCGAACT at 826, CTAGACC at 764, TCAATTC at 627, CCGTATC at 510, TTAATTC at 431, CCATTTC at 294, TTATTTT at 273, CTGTACT at 167, CCAAATC at 134.
  7. Inrr6: 83, TTGGTTT at 4530, TTATTTC at 4266, TTACTTC at 4241, CCGCTTT at 4214, CCACTTT at 4181, TTGTTCC at 4164, CCATTCT at 3999, CTGCACC at 3878, TCGAATT at 3810, TTATTCC at 3765, TTAGACC at 3731, TCAAATT at 3715, CCATTCC at 3690, CCACACC at 3630, CCATACT at 3597, CCGCTTC at 3591, CCAAATT at 3527, TCACTCC at 3495, TTGCTTC at 3479, TTATTCT at 3472, CCGGACC at 3415, CCGGATT at 3391, CCATACT at 3307, TTGATTC at 3177, CTGAACT at 3096, CTGTTCC at 3060, TCGTTTT at 3014, TCAATTT at 2974, TTGAATT at 2888, TCAAATT at 2740, TTGCATC at 2667, TCAATTT at 2649, CCGAATC at 2621, CCGGACC at 2615, TCGTATT at 2514, TCGGTTT at 2414, CCATATC at 2351, TTAGTTT at 2344, CCGGACC at 2269, CCGCTTT at 2252, CCAATTT at 2230, CTAGTTT at 2077, TTGGACC at 2015, CTACTTT at 1993, TCACACT at 1840, TCAATCT at 1741, CCGTATC at 1720, TCAGTTT at 1699, TTGCATT at 1660, TTAATTT at 1654, CCAATTT at 1648, CTAGACC at 1599, CTACATC at 1587, TTATTCC at 1490, CCGTTCT at 1383, CTGAATT at 1324, TTGAATT at 1304, CCACATT at 1299, CCACTTT at 1250, CTAAACC at 1208, TTAATCC at 1187, TTAGACC at 1177, CTATATT at 1138, CTGTTTC at 1000, TTGTATT at 914, TCAGTCC at 894, TTGATCT at 880, CTAGTTT at 875, CCAGACC at 854, CTGTTCT at 827, CTAATTT at 815, CCAAATC at 727, CCGATCT at 702, CCAATCT at 694, TTAATTC at 666, CCATATC at 626, TCGGACC at 518, TCGATCT at 509, CTAGTCC at 490, TTGATTC at 259, CTAAACT at 234, CTGGACT at 205, TCGAACC at 199.
  8. Inrr7: 74, TCGGTCT at 4410, CTGGTCT at 4356, CCATATT at 4222, TTGGACT at 4183, TTGCTTC at 4125, CTACTTT at 4120, CTACTTT at 3927, TTGGTTT at 3727, CCGGACC at 3661, CCACACT at 3643, CTATATC at 3588, CCGATCT at 3551, CTAATTT at 3452, TCAGATT at 3374, TTGGATC at 3367, TCGTTCT at 3281, TTGGTCT at 3122, TCGTTCC at 3048, TTGCTCC at 2876, TTGCTTT at 2837, CTGTTTT at 2821, TTGGATT at 2764, CCACTCT at 2649, TCACTTT at 2589, CTGGACT at 2564, TTGAATT at 2557, TCAATCT at 2475, TTGATTC at 2467, TTGGATC at 2460, TCGAACT at 2384, CCATTCT at 2272, TTAATTT at 2253, TTACTTC at 2202, CCGTTTT at 2180, CCAAATT at 2150, TCGGTCC at 2051, TTACACT at 2036, TTGCACC at 1985, TTGAACC at 1973, TTACTTC at 1899, TCATTTT at 1894, TTGTTCT at 1886, TTGAACC at 1867, TCAGTTC at 1752, CTAATCC at 1738, TTGAATT at 1637, CTGGACT at 1582, CCAATCT at 1563, TCGGTCC at 1476, CCATTTT at 1342, CCAAATT at 1211, CCACACC at 1206, CTGATCC at 1163, TCGTTTC at 1089, TCGATTC at 1084, CTATATT at 1075, TCATTCT at 1018, CTATTCC at 890, CCAAACT at 885, CTGTTCC at 879, CCGTTCC at 828, CTAGTTT at 766, CTAAATC at 565, CCATATC at 549, CTAAATC at 508, CCATTTT at 489, TTGGACT at 325, TTAATCC at 211, TCATATT at 206, TTGATTT at 110, CTGCACT at 75, CCGTTCC at 37, TTAATCC at 31, CTGTTTC at 24.
  9. Inrr8: 76, CTAGACT at 4510, TTGAACC at 4455, TCGCATC at 4418, TTAGTTT at 4397, CTATTTC at 4355, CCAGACC at 4292, TCAATCT at 4175, CCACTCC at 4074, TCAAACC at 4001, CCAGTTT at 3956, CTATACT at 3945, CCAATCT at 3940, TTGGTTT at 3923, TTGAACC at 3797, CCGGTTC at 3652, CCGGACC at 3616, TTGCTTC at 3521, TTGCTTT at 3380, TTAATTC at 3354, TCATTTT at 3335, CTGGTCC at 3143, TTGGTTT at 3120, TTGCTTC at 3065, TTGGATC at 3016, TCGGACC at 3007, TCGCATT at 2959, CTGGTCC at 2876, CCAGTTC at 2793, CCAAATT at 2496, CCGGACC at 2491, CCGCACC at 2446, TTAATTC at 2437, TTGGATT at 2432, TTGTTTC at 2383, TCGAACC at 2314, TTGAACC at 2258, CCAATTT at 2221, CCGCTCC at 2216, CCACACC at 1948, TTAGTTT at 1924, CTGTTTC at 1889, TTAGTTT at 1850, TCGTTTT at 1833, TTACTTC at 1823, CCGATTT at 1811, CCATTTC at 1802, CTACTCT at 1772, CTGAACC at 1583, CCGTTTC at 1531, TCACATT at 1505, CCGGACT at 1464, CCAATTC at 1422, CTACACC at 1402, CTAGTTC at 1371, CTGAACT at 1342, TTGGTCC at 1258, CTGCTTT at 1164, CCGGATC at 1057, CCACACC at 1051, TCATATT at 877, CCGCACT at 763, TTATTTT at 740, CTAATTT at 728, CCGCTCT at 630, CCACTCC at 562, CCGTTCC at 426, CCGGTTT at 367, TTACACT at 199, TCAATTT at 194, CTATTTT at 166, CTAAATC at 140, CCGGTTC at 122, TCACTCC at 83, CCAAACT at 58, CCGGTTC at 52, CTGTATT at 32.
  10. Inrr9: 84, CCAAATC at 4402, TTAAATC at 4301, TTGGATT at 4276, CCGGATT at 4234, TTATTCT at 4179, CCACTCC at 4101, TTAGACT at 4059, TCAAACC at 4041, TTATATT at 3990, CCGGTTT at 3985, CCGATTC at 3972, CCGGACC at 3921, TCGCTTT at 3867, TTGCACC at 3807, CTACACT at 3757, CCGTATC at 3665, CCATTCC at 3660, TCGAACT at 3653, CTAATTC at 3613, TTGATCC at 3583, TCAAATT at 3577, TTAATCC at 3558, CTGAATC at 3544, CTGCTTC at 3528, CTAGATT at 3313, CTATACT at 3246, CCAAACT at 3141, TCGCTTC at 3133, TCGCATT at 3101, TTGTTCC at 3094, TCGTTTC at 3028, CTGTTCC at 3009, TCGTTTC at 2899, TCGCTCC at 2874, TCGTTCT at 2751, TTGCACC at 2743, TCGTTCC at 2720, TCGGATT at 2603, TCAATTT at 2562, CTGAATT at 2509, CTGATCC at 2485, CTAATCC at 2394, TCATTTT at 2342, TCAGTTT at 2287, CCGTACC at 2275, TTGAACC at 2220, TTATATT at 2124, CCGAACT at 2102, TCAGATT at 2060, TCGTATC at 1984, CTGTATT at 1902, TCGCTTC at 1854, CCACACT at 1816, CTGATTT at 1741, TTGTTTC at 1735, CCGGACT at 1710, CCGCATT at 1651, CTGCACT at 1601, CCACATC at 1522, CCGAATT at 1349, TCAGTTT at 1246, CCGTTTC at 1238, TCACTTC at 1199, CTGGACC at 1057, CCACTTC at 1013, CTGGTCT at 926, TTACACC at 920, CCGTTCT at 822, CTGTTTT at 717, TTGCTTC at 711, TCGTTTT at 706, CCGCACT at 693, TTGTATT at 621, CTGTATC at 554, CCGTACC at 530, TTATTTT at 467, TCGTTTT at 348, TCAGATT at 315, TTGAACT at 279, CTATTTT at 159, CTGCTTC at 125, TCAGTCT at 110, TTGTTTC at 97, CTGGTTT at 92.
  11. Inrr0ci: 75, AGAACGG at 4549, AGATTAG at 4464, GGTGTAG at 4399, GAACCAG at 4310, GGATCAA at 4273, GGATTAA at 3975, AAAGTAG at 3759, AGTACAA at 3727, GAATTAG at 3722, AAATTGA at 3569, GGAACAA at 3501, GAACTGG at 3496, AAAGTAG at 3485, GAAACGG at 3419, AAAGTAG at 3341, GGTTTAA at 3297, GAACTAG at 3232, GATACGG at 3221, AAAATGG at 3174, AGAATAG at 3151, AGTCCAG at 3136, GGAATAA at 3124, AGATTGA at 2858, GATCCGG at 2764, GGAACGA at 2754, GGTCCGG at 2749, AAAACGG at 2740, GGAGCAG at 2533, AATCCGA at 2509, AATTTGG at 2496, GGTTTGA at 2481, GATTTGA at 2417, GGTGCGG at 2324, AAAATAA at 2261, GAAACGG at 2202, GGAATAA at 2016, AAAACGG at 2011, AGATCGA at 1951, GGAATGG at 1898, AGAGTAG at 1873, AGATCAA at 1854, AATTTGA at 1816, AGATTGG at 1776, AATACAG at 1709, GGTTTGA at 1689, GGAACAG at 1534, AAAACGA at 1502, GGAATAA at 1490, GAAGCAG at 1481, GGTGCAG at 1401, AGTGCAG at 1339, AAACCAG at 1325, AGACCGA at 1317, GAACTAA at 1304, GGTACAG at 1284, AAACCAG at 1245, GGAATAA at 1103, AGAATGG at 935, GAACTGA at 899, AGTCCGA at 877, AAACTAA at 834, AAAATGG at 799, AATTCAA at 794, GATGTGG at 765, GATCTGA at 739, AAACCGA at 668, AAACTGG at 638, AGTTTGG at 565, GGTATAG at 385, AAAATAA at 197, GAAGTAA at 182, GAAGTGG at 109, GGATCAA at 96, GGTTCGA at 86, GAATTGG at 81.
  12. Inrr1ci: 70, AAAACGA at 4456, AAAGTGG at 4359, GGAGCAG at 4313, AGTTTGG at 4272, GGTGTAA at 4171, GATATGG at 4165, AAAATAG at 3885, GAACCAA at 3880, GAAACGA at 3875, AATTTGA at 3783, GAAGTGG at 3672, AATACAA at 3571, GAAGCGA at 3565, AAACCAA at 3537, GGAGTAG at 3508, AAATTGA at 3492, AGTGCGA at 3468, GATACAG at 3385, GGTACAG at 3353, GGTTTAA at 3344, GATGTAG at 3288, AAAACAG at 3273, GATTCGA at 3225, AAAGCGA at 3220, GAACCGA at 3185, AGAATAA at 3082, AATGTAA at 3076, AAAGTGA at 3068, AGATCAG at 2997, AATTCAA at 2991, GGTGCGG at 2958, GAATTAA at 2899, GATCCAG at 2837, GATCTAG at 2629, AGAACAG at 2608, GGTCCAA at 2562, GGTATAG at 2509, GGTGTAA at 2490, GGACCGG at 2463, AATTTGG at 2427, AGAATGA at 2270, AAACCAG at 2247, AGTACAA at 2069, AGTTCGG at 1973, AATACAA at 1945, AAAATGG at 1935, AATTTAG at 1873, AATACGA at 1826, AAAACAA at 1821, GAAGCGG at 1811, AGATCAG at 1781, AGAGTAG at 1553, AAAGCGA at 1520, AAAGTGG at 1445, GATCTGG at 1338, GATATGG at 1109, AAAACAG at 1054, AATGTAA at 1049, GAACTGA at 1037, AATTTAA at 1005, GGTTTGG at 924, AAAGCAG at 885, AAACCAA at 785, GGATTGA at 704, GGTGCGG at 600, AGTTCAA at 562, GGTTTGG at 413, GAACCGG at 394, AAATTGG at 314, AATACAA at 300.
  13. Inrr2ci: 79, GGAGTAG at 4454, AAACTGG at 4428, AGTTTAA at 4363, GGAATGA at 4357, AAAACAA at 4325, AAATCAA at 4312, GGACCAG at 4270, AAACTAG at 4162, AAAGTAA at 4157, GATTTAA at 4145, AATATAA at 3855, AAAATGA at 3800, AAATTAG at 3789, GATTCGA at 3773, GATCTGG at 3764, AAAGCGA at 3658, GGTCCAA at 3601, GGATCGA at 3438, AGAGTGA at 3334, AAACCAA at 3173, AGTACGA at 3157, AAAGCAG at 3152, GGTATGA at 3141, GAATTAA at 3031, GAACTAG at 3025, GATTTGG at 3004, GGTCCAA at 2984, AAAGCAA at 2916, AAAACGG at 2783, AATGTGG at 2765, GGTTCAA at 2662, GATTCGA at 2655, GGTCCGG at 2615, AATTTAG at 2521, GGAATAA at 2509, GGTATGA at 2475, AAACCAA at 2444, GGTACGG at 2401, AAATCAA at 2385, GGACTAG at 2335, AGACCGA at 2234, AGAGCAA at 2209, GGTGTGG at 2201, AAAACAA at 2191, AATTTAA at 2132, AAACTGG at 2072, GGTGCAA at 2006, GGTATGA at 1918, AAACTAA at 1853, AATTTAG at 1772, AAACCGA at 1747, GATTCAA at 1693, GGTTTAA at 1671, GATGTAG at 1650, AAACCAA at 1632, GGTCCGG at 1577, AAATTAA at 1554, GGTCTGA at 1340, AAATTGG at 1335, AATCTAG at 1170, AAAGTGG at 1023, GAAACAG at 948, AAAACAG at 942, AATCCGG at 797, GAAACGG at 581, GATGTGG at 528, AGTCCGA at 523, GATTCAG at 490, GGAGTAA at 385, AAAATGA at 371, AATCCAG at 352, AGATTAG at 338, AGTATAA at 239, AGTGTAA at 195, AAAATGG at 187, AAAGTGA at 147, GATTCGG at 137, GATTTAG at 130, GGTTCAA at 106.
  14. Inrr3ci: 76, GAACTGG at 4533, AATTCGA at 4383, AAATTAA at 4346, GAACCAA at 4307, AAACTAA at 4229, GGTGCAG at 4129, GAAATAA at 3942, GGACTGA at 3898, AGTGTGG at 3854, AGACCAA at 3739, GGAATAA at 3672, AGACTGG at 3648, AAAGCAG at 3617, GGAGTAA at 3601, GGTTTAA at 3533, AATGTAG at 3491, GAATCGG at 3463, AGTGTGA at 3438, AGACCAA at 3389, AATCCAA at 3366, GGTATAA at 3337, AAAACAA at 3265, AGTGTAA at 3059, AATGCAA at 3047, AGTGTGA at 2969, GGATTGG at 2942, GAAATGG at 2937, AAAACAA at 2923, AAAACGG at 2903, AATTCGG at 2851, AATTCAA at 2762, AGAGTGA at 2635, AATTCAA at 2595, GGTTTAG at 2558, GATCTAG at 2437, AAATTGG at 2389, GGTATAG at 2371, AGTCCAG at 2338, AGTGTGA at 2206, AGTTTAA at 2152, GGACTGA at 2118, AAACCGG at 2068, AGTCTAA at 2061, GGTGTGG at 2011, AGTCTGG at 1985, AAATCAA at 1838, AAACTAA at 1801, GATGTAA at 1793, AATTTGG at 1786, GATCTGA at 1758, GATGCGG at 1691, AGTCTAG at 1661, AAATTAA at 1647, GGTCCAG at 1587, GATGCAG at 1411, GAATTAA at 1349, GAAGCGA at 1322, GGACCAA at 1276, AATTCGA at 1253, GGATTGG at 1231, GGTGTGG at 1107, GATTTAA at 1089, GGATCAG at 970, GGAACGA at 959, AGTCCAG at 951, GGTTCAG at 859, GGAGTAA at 823, GAACTGA at 718, GATGCGA at 706, AATGCAA at 545, AAAATGA at 539, GAAGTAG at 492, GGACCGG at 344, GAACTAG at 223, AGTTTAA at 202, GGAACAG at 197.
  15. Inrr4ci: 80, GAAATAG at 4387, AAATTAA at 4374, AAAGCAA at 4327, GGACCGG at 4237, AAATCAA at 4187, GGACCAG at 4135, AGAGTGA at 4003, GAACCAA at 3966, AAACCAA at 3908, AGTGCGG at 3828, GGTACAA at 3775, GGAACAA at 3710, AGACTGG at 3620, AGTCCAG at 3600, GGTTTAG at 3471, AGTTCAA at 3440, AAACTAG at 3418, GAATCAG at 3348, AGTGCAG at 3336, GGTATGG at 3310, AAAGTGA at 3177, GAACCAA at 3172, GGTTCAA at 3150, GATCTAA at 3135, GGAGCAG at 3115, AGTATAG at 3009, GATGCAG at 2944, AAAGCAG at 2877, AATCCGG at 2819, GGAGTGA at 2777, GGTGTGG at 2761, GGAATGG at 2702, AAAGCGG at 2613, AAAATGG at 2517, GAATCGG at 2506, AGTGTAA at 2487, GGTGCGG at 2466, AGTTTAA at 2400, GATACGA at 2230, GGACTAA at 2056, GAAGCAA at 1853, GGTATGG at 1830, GATGTGG at 1824, GGTCCGG at 1815, AGAGCAA at 1806, GGACCAA at 1760, AGAATAA at 1695, GGACTAG at 1655, AAAGTAA at 1644, AGACCAG at 1575, GGTGCGG at 1509, GGTTTGG at 1339, AGTGCAA at 1332, AATGCAG at 1327, AATGTGA at 1317, GAATCGG at 1275, AATGTAG at 1103, AAAATGG at 1075, GGTCTAG at 982, AGATTAA at 947, AATCCGA at 894, GGAGCGG at 883, AAAGTGA at 737, GAAGTGG at 683, GGTCCGA at 678, GGTTCGG at 572, GAAACAA at 531, GATACGG at 513, GAAGCAA at 490, AAATTAA at 480, AATTCAA at 442, AATCCGA at 382, AATGTGG at 287, AGTCCAA at 228, AAAGCGG at 217, AGAATAG at 152, GGTGCAG at 147, GGTCCAA at 132, AATTTAG at 122, AAACCAG at 57.
  16. Inrr5ci: 82, AGAATGG at 4341, AGAATAA at 4307, GATCCGA at 4291, GAAATGA at 4286, GAAGTAG at 4217, GAACCAA at 4155, GAACCGG at 4047, GGTGTGA at 4042, AATGTAG at 4006, AGAGCAG at 3979, AAAATGA at 3965, AAATTAA at 3931, GGAACGA at 3864, AGTTCGG at 3770, GATGTAA at 3713, GAAGCGA at 3593, AGAATGG at 3531, GGTCTAG at 3486, AAAGCAG at 3479, AATCCGG at 3452, GGAGTAG at 3408, GGTGTGG at 3374, GGTTCGG at 3288, AATTTGG at 3282, GAATCGA at 3276, GGTTTGA at 3271, GGTACAA at 3254, AGATTGG at 3172, GGAGCAG at 3156, AGTATGG at 3052, AATCCAG at 3035, GGTTCAA at 2986, GAACCAA at 2893, GATGTAG at 2784, AAAACGG at 2776, GAAATGA at 2768, GGAACGG at 2744, AGTGTGG at 2671, AATCCAG at 2650, GGTTCGG at 2589, GGTCTAG at 2574, GATGCGA at 2505, GGTCCGA at 2471, GAAGTGG at 2466, GGAACAA at 2414, GGAATGG at 2341, AGTCTGG at 2335, GGAGTAA at 2303, GGTTTGG at 2298, GGTACAA at 2245, GAAGCGG at 2240, GGTGTAG at 2220, GATGCAG at 2210, GAATCAG at 2159, AATCTGA at 2108, GGTGTAA at 1960, GAACCAA at 1950, AAATTGG at 1888, AGAGCGG at 1753, AGATCAG at 1748, GGTCCAG at 1593, GAACCGG at 1432, AGTTTAA at 1360, AGTCCAG at 1339, AAATTGA at 1291, AAACTAG at 1212, AGAACAG at 1197, AATGTGG at 1130, GGTCTGG at 1045, AGTGCAA at 900, AGTGCAA at 819, AGACCAG at 766, AATCCAG at 757, GATATAA at 671, GAACCAG at 615, GAAGCGA at 544, AGAATGA at 503, AGTACGG at 411, AGAGCAG at 381, GGAATAG at 376, AAATCAA at 136, AATCCAA at 131.
  17. Inrr6ci: 77, AATTTGG at 4527, AATACAA at 4520, GATACAA at 4491, AATACAG at 4400, AATCCAA at 4380, GAAACGG at 4251, AATTCGG at 4202, GGAGCGA at 4172, GGACTAA at 4078, GGTTTAA at 4069, GATGCGA at 4036, GAAGTGG at 3973, GGTTTGA at 3934, AAAATGA at 3699, GGATTAA at 3611, AATTCAG at 3530, AAAGCGG at 3425, GGATTAA at 3393, AATCCGG at 3339, GATACGA at 3219, GAAATGA at 3190, AATTTGG at 2976, GGTGTGA at 2948, GATCTGA at 2939, GAATTAA at 2890, AAAGCAA at 2865, GGAGCGA at 2823, AATTTAG at 2764, AGTCCAG at 2635, AATATAG at 2608, GGTATGG at 2557, GATTCAA at 2542, AGAGCAA at 2507, GAATTGG at 2495, GAACCGG at 2427, GGTTCGG at 2411, AATATAA at 2239, GGATCGG at 2215, AAAGTAA at 2184, AGTTCGA at 2164, AAAGTGA at 2135, GGTTCGG at 1949, GATACAA at 1915, GGTATGG at 1853, GGTTTAG at 1822, GGTACAA at 1806, AGTTTAA at 1633, GGTGCGG at 1554, AGAACGA at 1524, GGTTCGA at 1430, GGAATAG at 1364, GAATTGG at 1326, AGACTAA at 1205, GAAGTGG at 1195, AATCCGG at 1189, GAAACAG at 1100, AGTTTGA at 1095, GGTATAA at 1063, GGTCTGG at 934, AGTTTGA at 877, AGTCTGA at 805, AGAGTAG at 800, AAAATGA at 749, AATCTGA at 730, AGTACAA at 714, AGAACAG at 647, AGTCCGG at 492, GGTTTAG at 449, GGTGTAA at 413, GATTCAA at 284, GGTTCGA at 215, GATACGG at 153, GGTTTAA at 139, GGTGTGA at 101, GATTTGG at 96, AAAATGG at 73, GAAACGG at 10.
  18. Inrr7ci: 73, GGAACGA at 4543, GGTGTAA at 4487, AATGTAA at 4447, GGTCTAA at 4358, AGACTGG at 4353, GGAGCAA at 4324, AAACCAA at 4205, AGATTGA at 4199, AGATTGG at 4180, GGAGCGG at 4158, AGATTAG at 4148, AATTCGG at 4110, GATGTAA at 4102, AATCCAG at 4053, AAAATGA at 3964, AAAGCGG at 3936, AATTTAA at 3915, GGAACGA at 3868, AATACAA at 3783, GGTGCAG at 3769, AGACCAA at 3761, AGTTCAG at 3749, GGTTTGA at 3729, GAACCGG at 3658, AAACTGA at 3528, GGAGCGG at 3468, GAATTAG at 3412, GGAACGG at 3341, AAATCGA at 3310, GATATAA at 3056, AAATTAA at 2958, GAATCAA at 2947, GAACCAA at 2923, AGAACGG at 2916, AAAGCAA at 2893, GATTTAA at 2767, AGACTGA at 2696, GGACCAA at 2690, GGTGCAA at 2581, GGACTAA at 2566, GGATCGA at 2532, AATTTGA at 2397, GGTTCGA at 2381, AGAATAA at 2239, AGTCCAG at 2225, GGAGTGG at 2212, GGACTGA at 2072, AAAATAG at 2066, GAACCAG at 1975, AGTTTAG at 1960, GGATCAA at 1923, GGTTTGG at 1848, AAAATGA at 1835, AGTTCAG at 1754, AGAATGG at 1644, GAATTAG at 1639, AATGTGG at 1510, AGAATAA at 1445, AATATAA at 1413, AAAATGA at 1186, AAAATGA at 1114, AGAGTGA at 1000, AGAATAG at 995, AATACAG at 909, GAAGTGG at 652, AAATCGA at 510, AGAATAA at 418, AAAGCAA at 363, GATGTAA at 317, AAACTAA at 283, GGATTAA at 243, AAAATAA at 144, GATTTGG at 112.
  19. Inrr8ci: 75, GAAGCGA at 4488, GAATCGA at 4427, GAAGCAA at 4388, AGACCAA at 4294, AAAACGA at 4100, AATTCAA at 3998, GATATAA at 3982, AGTTTGG at 3958, AATTCGG at 3880, GGACCGG at 3728, GATCTGA at 3705, AGTACGG at 3688, GGACCGG at 3649, GGACCAG at 3618, GGAACGG at 3529, AAATTGA at 3500, AATTTGG at 3490, GGATTAG at 3483, AGTTTGA at 3398, GGTCTAA at 3364, AATTCGG at 3356, GATGCGG at 3285, GGTACGG at 3278, GAATCGA at 3205, GGAACAA at 3168, AAAATGG at 3106, AAAGTAA at 3099, GAAGTAA at 2924, AAAACGG at 2702, AGAATGA at 2649, AATTCGA at 2623, GGAGTGG at 2586, GGAATGA at 2537, GGACCAA at 2493, GGATTAA at 2434, GAACCGA at 2260, GATTTAG at 2245, GAAACAA at 2130, AAAACAA at 2103, GGTCCGG at 2077, GAATTGG at 1977, AATCCGA at 1941, GGTTTAG at 1934, AGTTTAA at 1926, GGAATAA at 1840, GGTCTAA at 1519, AATGTAA at 1430, AATTCGA at 1424, GGTCCAA at 1419, AGTTCAA at 1373, GAACTAA at 1344, GGAGCGG at 1330, AGAATGA at 1296, GAAGCGG at 1190, GAACCGA at 1071, AAAACGA at 1037, GATCCGA at 969, GAACTGA at 887, AGATCAG at 866, GAACCGA at 844, GGTACGG at 793, AATTTAG at 730, AAACTAA at 725, AAAGCGG at 712, GGACCAG at 691, GGTGTGA at 537, GATGTGG at 524, AGTGTGA at 440, GGTTTAA at 414, AATTCGG at 380, AGTATAG at 333, GGTGCGG at 295, AATTCAG at 278, GATGCAA at 174, GGAATGG at 158.
  20. Inrr9ci: 89, GATGCGG at 4546, AGAATGA at 4413, GATGTGG at 4390, GGATTAA at 4278, GGTATAA at 4218, GAATTGA at 4203, AAACCGA at 4191, AAATTAG at 4056, AAACCGG at 4043, GATTCGA at 3974, AATGCGG at 3961, GAAACGA at 3847, GAACTGA at 3842, AAAATGG at 3787, AGATTGG at 3722, GGTATGA at 3702, GGAATGA at 3678, GGACTAG at 3635, AATTTGA at 3580, GGTGCGG at 3535, AATTTGA at 3494, GAAATAG at 3467, AAAACGG at 3421, AAACTAG at 3310, GGTTTAA at 3284, AAATCGG at 3237, AGAGCGG at 3230, AAACTAA at 3070, GGAACAA at 3065, AATACAG at 2914, GAATTGG at 2889, AGAATAA at 2847, AAAATGG at 2625, GATTCAG at 2606, AATTTGG at 2564, GGTTTAG at 2542, AGAATGA at 2463, GGAGCGA at 2443, AAACTAA at 2391, AGAGTAA at 2381, AGATTGG at 2329, GATACAG at 2260, GGAATGA at 2231, GGTTTGA at 2217, GGTGTAA at 2205, AAAACGG at 2187, GAAGCGG at 2180, AAAACAA at 2091, AAAGCGG at 2053, GAAACGG at 1933, GAACTGG at 1872, GAAACGA at 1867, GGTACGG at 1809, AAACTGA at 1754, GAAACAA at 1748, GATTTGA at 1743, GGACTGG at 1712, AGTCCGA at 1567, AGAGCAA at 1513, GATGTGG at 1470, GGAGTGG at 1454, GATGCGG at 1426, GGACTGA at 1387, GAAATAA at 1379, GGAACAA at 1297, GGTACGG at 1217, AGTCTGG at 1208, GGTGTAG at 1175, AATTTAG at 1102, GGACCAA at 1059, AAAACGG at 1021, AATATAG at 941, AAAACGG at 895, AAAGCGG at 886, AAAACGG at 741, GGTCTAA at 663, AAACCGG at 596, GGTCCAA at 577, AAAATAG at 408, AAAATGG at 332, GAAACGG at 298, GAACTAA at 281, AAATTGA at 276, GGATTAA at 241, AAATTGA at 224, AAACCAG at 183, GGAGCAG at 43, AGTCTGG at 37, AGTACAG at 12.

Inrr arbitrary UTRs

  1. Inrr0: CTACTCC at 4542, CCGCTTT at 4421, CTGTTTT at 4376, TTGCACT at 4356, CCGGATC at 4322, TCGTACC at 4284, CTGGATC at 4271, TTATTTT at 4202, TCATACC at 4058, CCATTTT at 4003, TCATATC at 3963, CTAAATT at 3926, CCGTATC at 3859, TCGCACT at 3751, TTAATTT at 3694, CTGGTCC at 3612, CCGCTCT at 3527, TTGCACC at 3477, CCACTTT at 3412, TCATTCC at 3390, CCAATTT at 3353, CTGCTTC at 3334, TTGTTTT at 3182, TTAGTCC at 3134, CCGTACC at 3058, TTAGTTT at 3030, TCAGTTT at 2966, TTAGATT at 2856, TCATATT at 2851.
  2. Inrr2: TCAATTT at 4483, TCAAACT at 4426, TTGGACT at 4418, CCAATCC at 4399, TCAAATC at 4310, TCGGACC at 4268, TCGTATC at 4246, CCGTTCT at 4121, CCACATT at 4068, CTGTTCC at 4063, CCGAACT at 4058, CCGTACT at 4001, CCAATCT at 3962, CTGATCT at 3762, CTACATT at 3722, CTGGTCC at 3599, TCGATTT at 3557, CTGGTTT at 3492, TCGCATT at 3483, CTGTATC at 3478, TCGGATC at 3436, CTATTTC at 3405, CCGGTCT at 3400, CCATTTT at 3381, CTAAACT at 3364, TCGCTTT at 3341, TTAATCC at 3231, CCGAATC at 2991, TTGGTCT at 2962, TCGGACC at 2888, CTGTATT at 2882, TCGTTCC at 2876.
  3. Inrr4: CCGTTTT at 4536, TTACTTT at 4523, TTGGTTT at 4423, CCAAATT at 4372, TCGGACT at 4219, TCAAACC at 4190, TTAAACC at 4151, TTAATCT at 3984, CTGGATC at 3922, CTACTCT at 3852, CCACTCT at 3839, CCGAACT at 3815, TTATATT at 3721, CTACATT at 3698, CTAATTT at 3672, TTGTTTT at 3533, TTATTTT at 3515, TTAAACC at 3374, TCGAATC at 3346, TCGGTCC at 3299, CTGTACT at 3290, TTAGATC at 3132, TTATTTC at 3018.
  4. Inrr6: TTGGTTT at 4530, TTATTTC at 4266, TTACTTC at 4241, CCGCTTT at 4214, CCACTTT at 4181, TTGTTCC at 4164, CCATTCT at 3999, CTGCACC at 3878, TCGAATT at 3810, TTATTCC at 3765, TTAGACC at 3731, TCAAATT at 3715, CCATTCC at 3690, CCACACC at 3630, CCATACT at 3597, CCGCTTC at 3591, CCAAATT at 3527, TCACTCC at 3495, TTGCTTC at 3479, TTATTCT at 3472, CCGGACC at 3415, CCGGATT at 3391, CCATACT at 3307, TTGATTC at 3177, CTGAACT at 3096, CTGTTCC at 3060, TCGTTTT at 3014, TCAATTT at 2974, TTGAATT at 2888.
  5. Inrr8: CTAGACT at 4510, TTGAACC at 4455, TCGCATC at 4418, TTAGTTT at 4397, CTATTTC at 4355, CCAGACC at 4292, TCAATCT at 4175, CCACTCC at 4074, TCAAACC at 4001, CCAGTTT at 3956, CTATACT at 3945, CCAATCT at 3940, TTGGTTT at 3923, TTGAACC at 3797, CCGGTTC at 3652, CCGGACC at 3616, TTGCTTC at 3521, TTGCTTT at 3380, TTAATTC at 3354, TCATTTT at 3335, CTGGTCC at 3143, TTGGTTT at 3120, TTGCTTC at 3065, TTGGATC at 3016, TCGGACC at 3007, TCGCATT at 2959, CTGGTCC at 2876.
  6. Inrr0ci: AGAACGG at 4549, AGATTAG at 4464, GGTGTAG at 4399, GAACCAG at 4310, GGATCAA at 4273, GGATTAA at 3975, AAAGTAG at 3759, AGTACAA at 3727, GAATTAG at 3722, AAATTGA at 3569, GGAACAA at 3501, GAACTGG at 3496, AAAGTAG at 3485, GAAACGG at 3419, AAAGTAG at 3341, GGTTTAA at 3297, GAACTAG at 3232, GATACGG at 3221, AAAATGG at 3174, AGAATAG at 3151, AGTCCAG at 3136, GGAATAA at 3124, AGATTGA at 2858.
  7. Inrr2ci: GGAGTAG at 4454, AAACTGG at 4428, AGTTTAA at 4363, GGAATGA at 4357, AAAACAA at 4325, AAATCAA at 4312, GGACCAG at 4270, AAACTAG at 4162, AAAGTAA at 4157, GATTTAA at 4145, AATATAA at 3855, AAAATGA at 3800, AAATTAG at 3789, GATTCGA at 3773, GATCTGG at 3764, AAAGCGA at 3658, GGTCCAA at 3601, GGATCGA at 3438, AGAGTGA at 3334, AAACCAA at 3173, AGTACGA at 3157, AAAGCAG at 3152, GGTATGA at 3141, GAATTAA at 3031, GAACTAG at 3025, GATTTGG at 3004, GGTCCAA at 2984, AAAGCAA at 2916.
  8. Inrr4ci: GAAATAG at 4387, AAATTAA at 4374, AAAGCAA at 4327, GGACCGG at 4237, AAATCAA at 4187, GGACCAG at 4135, AGAGTGA at 4003, GAACCAA at 3966, AAACCAA at 3908, AGTGCGG at 3828, GGTACAA at 3775, GGAACAA at 3710, AGACTGG at 3620, AGTCCAG at 3600, GGTTTAG at 3471, AGTTCAA at 3440, AAACTAG at 3418, GAATCAG at 3348, AGTGCAG at 3336, GGTATGG at 3310, AAAGTGA at 3177, GAACCAA at 3172, GGTTCAA at 3150, GATCTAA at 3135, GGAGCAG at 3115, AGTATAG at 3009, GATGCAG at 2944, AAAGCAG at 2877.
  9. Inrr6ci: AATTTGG at 4527, AATACAA at 4520, GATACAA at 4491, AATACAG at 4400, AATCCAA at 4380, GAAACGG at 4251, AATTCGG at 4202, GGAGCGA at 4172, GGACTAA at 4078, GGTTTAA at 4069, GATGCGA at 4036, GAAGTGG at 3973, GGTTTGA at 3934, AAAATGA at 3699, GGATTAA at 3611, AATTCAG at 3530, AAAGCGG at 3425, GGATTAA at 3393, AATCCGG at 3339, GATACGA at 3219, GAAATGA at 3190, AATTTGG at 2976, GGTGTGA at 2948, GATCTGA at 2939, GAATTAA at 2890, AAAGCAA at 2865.
  10. Inrr8ci: GAAGCGA at 4488, GAATCGA at 4427, GAAGCAA at 4388, AGACCAA at 4294, AAAACGA at 4100, AATTCAA at 3998, GATATAA at 3982, AGTTTGG at 3958, AATTCGG at 3880, GGACCGG at 3728, GATCTGA at 3705, AGTACGG at 3688, GGACCGG at 3649, GGACCAG at 3618, GGAACGG at 3529, AAATTGA at 3500, AATTTGG at 3490, GGATTAG at 3483, AGTTTGA at 3398, GGTCTAA at 3364, AATTCGG at 3356, GATGCGG at 3285, GGTACGG at 3278, GAATCGA at 3205, GGAACAA at 3168, AAAATGG at 3106, AAAGTAA at 3099, GAAGTAA at 2924.

Inrr alternate UTRs

  1. Inrr1: TTGATCT at 4399, TTAGTTT at 4270, TTGCTTT at 4225, CCGAATT at 4218, CCGCACT at 4117, CTACATC at 4096, CCGATCC at 4080, CCGTACT at 4072, CTGCTCT at 4064, CTGGTCT at 3958, TCAGTCT at 3949, CCGAATT at 3780, CTGTATC at 3523, CCATTTC at 3409, TCGAATC at 3228, CCGAACC at 3188, CCGAATT at 2977, CCGTACT at 2932, CCGAATT at 2897.
  2. Inrr3: CCGATCC at 4552, CCAGTTC at 4541, TTGATCT at 4373, TTAATTT at 4349, CCACATT at 4285, CCGTACC at 4247, CCAAACC at 4239, CTAAATC at 4232, TTAGTTC at 4218, TTAAATT at 4193, TCATACC at 4183, TTAGTTC at 4158, TTGTTTT at 4101, CTAATTT at 4093, TTGCTTT at 4069, TTAATTT at 4050, CCAAATC at 4021, TTGGTCT at 4012, TTGTTTC at 4005, TTGTATT at 4000, CCAGTTC at 3987, TCACTCT at 3847, TTAGATC at 3824, CCGCATC at 3814, TCGATTT at 3688, CCAGTCC at 3570, CTACTTC at 3564, TCAATCC at 3516, TCATTCC at 3472, TCGGTCT at 3466, CTGAATC at 3461, CCAGTCT at 3216, CTGCTCC at 3182, TTGAACC at 3172, TCGGTCC at 3127, CCATTCC at 3088, CTGCACC at 3076, TTAGTCC at 3027, TTAGTTT at 3021, CCACTCT at 2998, TTACTCC at 2977, CCGGTCC at 2887, TCATACT at 2858.
  3. Inrr5: TTGGTTT at 4536, TCGCACC at 4511, TCGATTT at 4468, CCGCTTT at 4441, TTGATTT at 4195, TTAATCT at 4184, CCAATCC at 4158, TCAGATC at 4030, CCGTTCT at 3992, TTGAACC at 3941, CTAATCC at 3853, CTGAACT at 3798, CTATTTT at 3704, CCGTTCT at 3692, TTATTTC at 3656, CCAAACT at 3513, TCGAATC at 3449, CTACTTC at 3444, TCGAATT at 3279, TTGAATC at 3274, TTGGTTT at 3269, TCAGACC at 3239, TCACTTC at 3219, TCGCTTT at 3095, CCGGACC at 3025, TCAATTC at 2989, CCGGTTC at 2984, TCGTACC at 2943, CTGCTTC at 2914, CCGAACC at 2891, CTGAACC at 2869, TCGAACC at 2863.
  4. Inrr7: TCGGTCT at 4410, CTGGTCT at 4356, CCATATT at 4222, TTGGACT at 4183, TTGCTTC at 4125, CTACTTT at 4120, CTACTTT at 3927, TTGGTTT at 3727, CCGGACC at 3661, CCACACT at 3643, CTATATC at 3588, CCGATCT at 3551, CTAATTT at 3452, TCAGATT at 3374, TTGGATC at 3367, TCGTTCT at 3281, TTGGTCT at 3122, TCGTTCC at 3048, TTGCTCC at 2876.
  5. Inrr9: CCAAATC at 4402, TTAAATC at 4301, TTGGATT at 4276, CCGGATT at 4234, TTATTCT at 4179, CCACTCC at 4101, TTAGACT at 4059, TCAAACC at 4041, TTATATT at 3990, CCGGTTT at 3985, CCGATTC at 3972, CCGGACC at 3921, TCGCTTT at 3867, TTGCACC at 3807, CTACACT at 3757, CCGTATC at 3665, CCATTCC at 3660, TCGAACT at 3653, CTAATTC at 3613, TTGATCC at 3583, TCAAATT at 3577, TTAATCC at 3558, CTGAATC at 3544, CTGCTTC at 3528, CTAGATT at 3313, CTATACT at 3246, CCAAACT at 3141, TCGCTTC at 3133, TCGCATT at 3101, TTGTTCC at 3094, TCGTTTC at 3028, CTGTTCC at 3009, TCGTTTC at 2899, TCGCTCC at 2874.
  6. Inrr1ci: AAAACGA at 4456, AAAGTGG at 4359, GGAGCAG at 4313, AGTTTGG at 4272, GGTGTAA at 4171, GATATGG at 4165, AAAATAG at 3885, GAACCAA at 3880, GAAACGA at 3875, AATTTGA at 3783, GAAGTGG at 3672, AATACAA at 3571, GAAGCGA at 3565, AAACCAA at 3537, GGAGTAG at 3508, AAATTGA at 3492, AGTGCGA at 3468, GATACAG at 3385, GGTACAG at 3353, GGTTTAA at 3344, GATGTAG at 3288, AAAACAG at 3273, GATTCGA at 3225, AAAGCGA at 3220, GAACCGA at 3185, AGAATAA at 3082, AATGTAA at 3076, AAAGTGA at 3068, AGATCAG at 2997, AATTCAA at 2991, GGTGCGG at 2958, GAATTAA at 2899.
  7. Inrr3ci: GAACTGG at 4533, AATTCGA at 4383, AAATTAA at 4346, GAACCAA at 4307, AAACTAA at 4229, GGTGCAG at 4129, GAAATAA at 3942, GGACTGA at 3898, AGTGTGG at 3854, AGACCAA at 3739, GGAATAA at 3672, AGACTGG at 3648, AAAGCAG at 3617, GGAGTAA at 3601, GGTTTAA at 3533, AATGTAG at 3491, GAATCGG at 3463, AGTGTGA at 3438, AGACCAA at 3389, AATCCAA at 3366, GGTATAA at 3337, AAAACAA at 3265, AGTGTAA at 3059, AATGCAA at 3047, AGTGTGA at 2969, GGATTGG at 2942, GAAATGG at 2937, AAAACAA at 2923, AAAACGG at 2903, AATTCGG at 2851.
  8. Inrr5ci: AGAATGG at 4341, AGAATAA at 4307, GATCCGA at 4291, GAAATGA at 4286, GAAGTAG at 4217, GAACCAA at 4155, GAACCGG at 4047, GGTGTGA at 4042, AATGTAG at 4006, AGAGCAG at 3979, AAAATGA at 3965, AAATTAA at 3931, GGAACGA at 3864, AGTTCGG at 3770, GATGTAA at 3713, GAAGCGA at 3593, AGAATGG at 3531, GGTCTAG at 3486, AAAGCAG at 3479, AATCCGG at 3452, GGAGTAG at 3408, GGTGTGG at 3374, GGTTCGG at 3288, AATTTGG at 3282, GAATCGA at 3276, GGTTTGA at 3271, GGTACAA at 3254, AGATTGG at 3172, GGAGCAG at 3156, AGTATGG at 3052, AATCCAG at 3035, GGTTCAA at 2986, GAACCAA at 2893.
  9. Inrr7ci: GGAACGA at 4543, GGTGTAA at 4487, AATGTAA at 4447, GGTCTAA at 4358, AGACTGG at 4353, GGAGCAA at 4324, AAACCAA at 4205, AGATTGA at 4199, AGATTGG at 4180, GGAGCGG at 4158, AGATTAG at 4148, AATTCGG at 4110, GATGTAA at 4102, AATCCAG at 4053, AAAATGA at 3964, AAAGCGG at 3936, AATTTAA at 3915, GGAACGA at 3868, AATACAA at 3783, GGTGCAG at 3769, AGACCAA at 3761, AGTTCAG at 3749, GGTTTGA at 3729, GAACCGG at 3658, AAACTGA at 3528, GGAGCGG at 3468, GAATTAG at 3412, GGAACGG at 3341, AAATCGA at 3310, GATATAA at 3056, AAATTAA at 2958, GAATCAA at 2947, GAACCAA at 2923, AGAACGG at 2916, AAAGCAA at 2893.
  10. Inrr9ci: 89, GATGCGG at 4546, AGAATGA at 4413, GATGTGG at 4390, GGATTAA at 4278, GGTATAA at 4218, GAATTGA at 4203, AAACCGA at 4191, AAATTAG at 4056, AAACCGG at 4043, GATTCGA at 3974, AATGCGG at 3961, GAAACGA at 3847, GAACTGA at 3842, AAAATGG at 3787, AGATTGG at 3722, GGTATGA at 3702, GGAATGA at 3678, GGACTAG at 3635, AATTTGA at 3580, GGTGCGG at 3535, AATTTGA at 3494, GAAATAG at 3467, AAAACGG at 3421, AAACTAG at 3310, GGTTTAA at 3284, AAATCGG at 3237, AGAGCGG at 3230, AAACTAA at 3070, GGAACAA at 3065, AATACAG at 2914, GAATTGG at 2889, AGAATAA at 2847.

Inrr arbitrary negative direction core promoters

  1. Inrr4: CCAAACT at 2831, CTAAATC at 2816.
  2. Inrr4ci: AATCCGG at 2819.
  3. Inrr6ci: GGAGCGA at 2823.

Inrr alternate negative direction core promoters

  1. Inrr3: TCAGTCT at 2821.
  2. Inrr5: CCGTTCC at 2821.
  3. Inrr7: TTGCTTT at 2837, CTGTTTT at 2821.
  4. Inrr1ci: GATCCAG at 2837.

Inrr arbitrary positive direction core promoters

  1. Inrr1: TTGATCT at 4399, TTAGTTT at 4270.
  2. Inrr3: TTGATCT at 4373, TTAATTT at 4349, CCACATT at 4285.
  3. Inrr5: CCGCTTT at 4441.
  4. Inrr7: TCGGTCT at 4410, CTGGTCT at 4356.
  5. Inrr9: CCAAATC at 4402, TTAAATC at 4301, TTGGATT at 4276.
  6. Inrr1ci: AAAGTGG at 4359, GGAGCAG at 4313, AGTTTGG at 4272.
  7. Inrr3ci: AATTCGA at 4383, AAATTAA at 4346, GAACCAA at 4307.
  8. Inrr5ci: AGAATGG at 4341, AGAATAA at 4307, GATCCGA at 4291, GAAATGA at 4286.
  9. Inrr7ci: GGTCTAA at 4358, AGACTGG at 4353, GGAGCAA at 4324.
  10. Inrr9ci: AGAATGA at 4413, GATGTGG at 4390, GGATTAA at 4278.

Inrr alternate positive direction core promoters

  1. Inrr0: CCGCTTT at 4421, CTGTTTT at 4376, TTGCACT at 4356, CCGGATC at 4322, TCGTACC at 4284, CTGGATC at 4271.
  2. Inrr2: TCAAACT at 4426, TTGGACT at 4418, CCAATCC at 4399, TCAAATC at 4310, TCGGACC at 4268.
  3. Inrr4: TTGGTTT at 4423, CCAAATT at 4372.
  4. Inrr6: TTATTTC at 4266.
  5. Inrr8: TCGCATC at 4418, TTAGTTT at 4397, CTATTTC at 4355, CCAGACC at 4292.
  6. Inrr0ci: GGTGTAG at 4399, GAACCAG at 4310, GGATCAA at 4273.
  7. Inrr2ci: AAACTGG at 4428, AGTTTAA at 4363, GGAATGA at 4357, AAAACAA at 4325, AAATCAA at 4312, GGACCAG at 4270.
  8. Inrr4ci: GAAATAG at 4387, AAATTAA at 4374, AAAGCAA at 4327.
  9. Inrr6ci: AATACAG at 4400, AATCCAA at 4380.
  10. Inrr8ci: GAATCGA at 4427, GAAGCAA at 4388, AGACCAA at 4294.

Inrr arbitrary negative direction proximal promoters

  1. Inrr0: CTAAACC at 2659, CCAATTT at 2616.
  2. Inrr2: TCGCATT at 2752.
  3. Inrr4: CCGAATC at 2738, TCGGTCC at 2666, CCGTTTT at 2633.
  4. Inrr6: TCAAATT at 2740, TTGCATC at 2667, TCAATTT at 2649, CCGAATC at 2621, CCGGACC at 2615.
  5. Inrr8: CCAGTTC at 2793.
  6. Inrr0ci: GATCCGG at 2764, GGAACGA at 2754, GGTCCGG at 2749, AAAACGG at 2740.
  7. Inrr2ci: AAAACGG at 2783, AATGTGG at 2765, GGTTCAA at 2662, GATTCGA at 2655, GGTCCGG at 2615.
  8. Inrr4ci: GGAGTGA at 2777, GGTGTGG at 2761, GGAATGG at 2702, AAAGCGG at 2613.
  9. Inrr6ci: AATTTAG at 2764, AGTCCAG at 2635, AATATAG at 2608.
  10. Inrr8ci: AAAACGG at 2702, AGAATGA at 2649, AATTCGA at 2623.

Inrr alternate negative direction proximal promoters

  1. Inrr1: CTAGACC at 2765, CCGTATC at 2689, CCGGTCT at 2670, TCGAACC at 2642.
  2. Inrr3: TCGCACT at 2803, TTGTTTC at 2784, TCAAATC at 2765, TCAATTC at 2760.
  3. Inrr5: TTAATCC at 2648, CCATATT at 2628, CCATTTT at 2601.
  4. Inrr7: TTGGATT at 2764, CCACTCT at 2649.
  5. Inrr9: TCGTTCT at 2751, TTGCACC at 2743, TCGTTCC at 2720, TCGGATT at 2603.
  6. Inrr1ci: GATCTAG at 2629, AGAACAG at 2608.
  7. Inrr3ci: AATTCAA at 2762, AGAGTGA at 2635.
  8. Inrr5ci: GATGTAG at 2784, AAAACGG at 2776, GAAATGA at 2768, GGAACGG at 2744, AGTGTGG at 2671, AATCCAG at 2650.
  9. Inrr7ci: GATTTAA at 2767, AGACTGA at 2696, GGACCAA at 2690.
  10. Inrr9ci: AAAATGG at 2625, GATTCAG at 2606.

Inrr arbitrary positive direction proximal promoters

  1. Inrr1: TTGCTTT at 4225, CCGAATT at 4218, CCGCACT at 4117, CTACATC at 4096, CCGATCC at 4080, CCGTACT at 4072, CTGCTCT at 4064.
  2. Inrr3: CCGTACC at 4247, CCAAACC at 4239, CTAAATC at 4232, TTAGTTC at 4218, TTAAATT at 4193, TCATACC at 4183, TTAGTTC at 4158, TTGTTTT at 4101, CTAATTT at 4093, TTGCTTT at 4069, TTAATTT at 4050.
  3. Inrr5: TTGATTT at 4195, TTAATCT at 4184, CCAATCC at 4158.
  4. Inrr7: CCATATT at 4222, TTGGACT at 4183, TTGCTTC at 4125, CTACTTT at 4120.
  5. Inrr9: CCGGATT at 4234, TTATTCT at 4179, CCACTCC at 4101, TTAGACT at 4059.
  6. Inrr1ci: GGTGTAA at 4171, GATATGG at 4165.
  7. Inrr3ci: AAACTAA at 4229, GGTGCAG at 4129.
  8. Inrr5ci: GAAGTAG at 4217, GAACCAA at 4155.
  9. Inrr7ci: AAACCAA at 4205, AGATTGA at 4199, AGATTGG at 4180, GGAGCGG at 4158, AGATTAG at 4148, AATTCGG at 4110, GATGTAA at 4102, AATCCAG at 4053.
  10. Inrr9ci: GGTATAA at 4218, GAATTGA at 4203, AAACCGA at 4191, AAATTAG at 4056.

Inrr alternate positive direction proximal promoters

  1. Inrr0: TTATTTT at 4202, TCATACC at 4058.
  2. Inrr2: TCGTATC at 4246, CCGTTCT at 4121, CCACATT at 4068, CTGTTCC at 4063, CCGAACT at 4058.
  3. Inrr4: TCGGACT at 4219, TCAAACC at 4190, TTAAACC at 4151.
  4. Inrr6: TTACTTC at 4241, CCGCTTT at 4214, CCACTTT at 4181, TTGTTCC at 4164.
  5. Inrr8: TCAATCT at 4175, CCACTCC at 4074.
  6. Inrr2ci: AAACTAG at 4162, AAAGTAA at 4157, GATTTAA at 4145.
  7. Inrr4ci: GGACCGG at 4237, AAATCAA at 4187, GGACCAG at 4135.
  8. Inrr6ci: GAAACGG at 4251, AATTCGG at 4202, GGAGCGA at 4172, GGACTAA at 4078, GGTTTAA at 4069.
  9. Inrr8ci: AAAACGA at 4100.

Inrr arbitrary negative direction distal promoters

  1. Inrr0: CCGTTTT at 2566, CTAAATC at 2559, TTAATCC at 2507, CCAATTT at 2494, TCGGTTT at 2479, TTGTTCC at 2431, TTATATT at 2373, CTAGTTT at 2343, CTGCTCC at 2299, CTAGACT at 2294, CCACACC at 2164, TTATATT at 2143, CTACTTT at 2109, TCAAATT at 1988, CTGTTCT at 1979, CCAGATC at 1949, CTATTTC at 1911, TTACTCT at 1906, TTGATTT at 1819, TCAGATT at 1774, TCGCTTT at 1768, TTGAACC at 1692, CCGGTTT at 1687, TTGTTTT at 1671, CTGGTTT at 1638, CTATTTT at 1569, TCGTATC at 1544, TCGCTCT at 1513, TTGATTT at 1459, CCAGTCT at 1373, TCACTTT at 1363, CCGCATT at 1265, TCGAACC at 1260, TCGTTCT at 1229, TCGTACC at 1176, CCACATT at 1165, TCAATCC at 1160, CCAAACC at 1087, CTACTTT at 1020, CCGGATT at 952, CCACTCC at 947, TCGAACT at 897, TCACATT at 889, CTGGTTC at 884, CCAAATT at 862, CCGAATT at 811, TCGTTCT at 651, TTGTTCC at 619, CTATTTT at 494, TCGCATT at 481, CTAATTT at 474, TTGCATC at 364, CCAAACC at 283, CCGTATT at 267, TTGTATC at 156, CCAGATT at 128, TTGGTTC at 84, TTGAATT at 79, CCATACT at 41, TCGAACC at 23.
  2. Inrr2: CCGTTTT at 2528, CTAAATT at 2518, TCACACC at 2423, CTATATC at 2414, CCGTTTT at 2369, TCGTATC at 2356, TTGTTTC at 2349, CCGTATC at 2322, CCGAACT at 2237, CCACTTC at 2100, TCACTCC at 2090, TCACTTT at 2054, CTAGTCT at 1899, TTATACC at 1783, TTAAATT at 1769, CCAATTC at 1614, CCGGATT at 1580, CCGAACC at 1531, CTGTTCC at 1516, TCGGTCC at 1477, TTGGATC at 1418, TCGCATT at 1413, TTGGTCC at 1400, CTGTATT at 1372, TCAATCC at 1360, TTGGTCT at 1338, TTGTATT at 1273, CCGGTCT at 1214, TTGATCT at 1193, TTACATT at 1188, CCGTTTC at 1145, CCGCACT at 1105, CTGTTTC at 1099, TCAATTC at 1074, CTACACT at 900, CCACTTT at 888, CCAAACT at 745, TCGTTTC at 739, TCGTTTC at 718, CCGGTTC at 635, CCAGACC at 607, TCAATCC at 568, TCGTTTT at 446, CCAGACT at 355, CTATTTT at 34.
  3. Inrr4: CCATTTC at 2547, TCGGTTC at 2372, TTATATT at 2296, TTATTTT at 2255, CTACTTT at 2127, TCATTCC at 2113, CCACTTC at 2081, TTACTTT at 2018, CTGGTTC at 1927, CTGCTCC at 1917, TTGTATT at 1795, CCAATTC at 1786, CCAATCT at 1763, CCACTTC at 1718, CCATTCC at 1552, CCGAATC at 1273, TTGTACC at 1217, TCGTTTC at 1187, CTGCTCT at 1066, CTACACT at 1053, TTGGATT at 840, CCGCTTT at 782, TTACACT at 624, TTAGTCT at 609, CCGCTTC at 589, TCAAATT at 478, TCAATTC at 440, TCGTTCT at 371, TCGTTTT at 361, CCATTCT at 310, CTAAACC at 303, CCAAACC at 231, CTGGTCC at 206, TCAATTT at 120, CCGCACC at 106, TCGATCT at 66.
  4. Inrr6: TCGTATT at 2514, TCGGTTT at 2414, CCATATC at 2351, TTAGTTT at 2344, CCGGACC at 2269, CCGCTTT at 2252, CCAATTT at 2230, CTAGTTT at 2077, TTGGACC at 2015, CTACTTT at 1993, TCACACT at 1840, TCAATCT at 1741, CCGTATC at 1720, TCAGTTT at 1699, TTGCATT at 1660, TTAATTT at 1654, CCAATTT at 1648, CTAGACC at 1599, CTACATC at 1587, TTATTCC at 1490, CCGTTCT at 1383, CTGAATT at 1324, TTGAATT at 1304, CCACATT at 1299, CCACTTT at 1250, CTAAACC at 1208, TTAATCC at 1187, TTAGACC at 1177, CTATATT at 1138, CTGTTTC at 1000, TTGTATT at 914, TCAGTCC at 894, TTGATCT at 880, CTAGTTT at 875, CCAGACC at 854, CTGTTCT at 827, CTAATTT at 815, CCAAATC at 727, CCGATCT at 702, CCAATCT at 694, TTAATTC at 666, CCATATC at 626, TCGGACC at 518, TCGATCT at 509, CTAGTCC at 490, TTGATTC at 259, CTAAACT at 234, CTGGACT at 205, TCGAACC at 199.
  5. Inrr8: CCAAATT at 2496, CCGGACC at 2491, CCGCACC at 2446, TTAATTC at 2437, TTGGATT at 2432, TTGTTTC at 2383, TCGAACC at 2314, TTGAACC at 2258, CCAATTT at 2221, CCGCTCC at 2216, CCACACC at 1948, TTAGTTT at 1924, CTGTTTC at 1889, TTAGTTT at 1850, TCGTTTT at 1833, TTACTTC at 1823, CCGATTT at 1811, CCATTTC at 1802, CTACTCT at 1772, CTGAACC at 1583, CCGTTTC at 1531, TCACATT at 1505, CCGGACT at 1464, CCAATTC at 1422, CTACACC at 1402, CTAGTTC at 1371, CTGAACT at 1342, TTGGTCC at 1258, CTGCTTT at 1164, CCGGATC at 1057, CCACACC at 1051, TCATATT at 877, CCGCACT at 763, TTATTTT at 740, CTAATTT at 728, CCGCTCT at 630, CCACTCC at 562, CCGTTCC at 426, CCGGTTT at 367, TTACACT at 199, TCAATTT at 194, CTATTTT at 166, CTAAATC at 140, CCGGTTC at 122, TCACTCC at 83, CCAAACT at 58, CCGGTTC at 52, CTGTATT at 32.
  6. Inrr0ci: GGAGCAG at 2533, AATCCGA at 2509, AATTTGG at 2496, GGTTTGA at 2481, GATTTGA at 2417, GGTGCGG at 2324, AAAATAA at 2261, GAAACGG at 2202, GGAATAA at 2016, AAAACGG at 2011, AGATCGA at 1951, GGAATGG at 1898, AGAGTAG at 1873, AGATCAA at 1854, AATTTGA at 1816, AGATTGG at 1776, AATACAG at 1709, GGTTTGA at 1689, GGAACAG at 1534, AAAACGA at 1502, GGAATAA at 1490, GAAGCAG at 1481, GGTGCAG at 1401, AGTGCAG at 1339, AAACCAG at 1325, AGACCGA at 1317, GAACTAA at 1304, GGTACAG at 1284, AAACCAG at 1245, GGAATAA at 1103, AGAATGG at 935, GAACTGA at 899, AGTCCGA at 877, AAACTAA at 834, AAAATGG at 799, AATTCAA at 794, GATGTGG at 765, GATCTGA at 739, AAACCGA at 668, AAACTGG at 638, AGTTTGG at 565, GGTATAG at 385, AAAATAA at 197, GAAGTAA at 182, GAAGTGG at 109, GGATCAA at 96, GGTTCGA at 86, GAATTGG at 81.
  7. Inrr2ci: AATTTAG at 2521, GGAATAA at 2509, GGTATGA at 2475, AAACCAA at 2444, GGTACGG at 2401, AAATCAA at 2385, GGACTAG at 2335, AGACCGA at 2234, AGAGCAA at 2209, GGTGTGG at 2201, AAAACAA at 2191, AATTTAA at 2132, AAACTGG at 2072, GGTGCAA at 2006, GGTATGA at 1918, AAACTAA at 1853, AATTTAG at 1772, AAACCGA at 1747, GATTCAA at 1693, GGTTTAA at 1671, GATGTAG at 1650, AAACCAA at 1632, GGTCCGG at 1577, AAATTAA at 1554, GGTCTGA at 1340, AAATTGG at 1335, AATCTAG at 1170, AAAGTGG at 1023, GAAACAG at 948, AAAACAG at 942, AATCCGG at 797, GAAACGG at 581, GATGTGG at 528, AGTCCGA at 523, GATTCAG at 490, GGAGTAA at 385, AAAATGA at 371, AATCCAG at 352, AGATTAG at 338, AGTATAA at 239, AGTGTAA at 195, AAAATGG at 187, AAAGTGA at 147, GATTCGG at 137, GATTTAG at 130, GGTTCAA at 106.
  8. Inrr4ci: AAAATGG at 2517, GAATCGG at 2506, AGTGTAA at 2487, GGTGCGG at 2466, AGTTTAA at 2400, GATACGA at 2230, GGACTAA at 2056, GAAGCAA at 1853, GGTATGG at 1830, GATGTGG at 1824, GGTCCGG at 1815, AGAGCAA at 1806, GGACCAA at 1760, AGAATAA at 1695, GGACTAG at 1655, AAAGTAA at 1644, AGACCAG at 1575, GGTGCGG at 1509, GGTTTGG at 1339, AGTGCAA at 1332, AATGCAG at 1327, AATGTGA at 1317, GAATCGG at 1275, AATGTAG at 1103, AAAATGG at 1075, GGTCTAG at 982, AGATTAA at 947, AATCCGA at 894, GGAGCGG at 883, AAAGTGA at 737, GAAGTGG at 683, GGTCCGA at 678, GGTTCGG at 572, GAAACAA at 531, GATACGG at 513, GAAGCAA at 490, AAATTAA at 480, AATTCAA at 442, AATCCGA at 382, AATGTGG at 287, AGTCCAA at 228, AAAGCGG at 217, AGAATAG at 152, GGTGCAG at 147, GGTCCAA at 132, AATTTAG at 122, AAACCAG at 57.
  9. Inrr6ci: GGTATGG at 2557, GATTCAA at 2542, AGAGCAA at 2507, GAATTGG at 2495, GAACCGG at 2427, GGTTCGG at 2411, AATATAA at 2239, GGATCGG at 2215, AAAGTAA at 2184, AGTTCGA at 2164, AAAGTGA at 2135, GGTTCGG at 1949, GATACAA at 1915, GGTATGG at 1853, GGTTTAG at 1822, GGTACAA at 1806, AGTTTAA at 1633, GGTGCGG at 1554, AGAACGA at 1524, GGTTCGA at 1430, GGAATAG at 1364, GAATTGG at 1326, AGACTAA at 1205, GAAGTGG at 1195, AATCCGG at 1189, GAAACAG at 1100, AGTTTGA at 1095, GGTATAA at 1063, GGTCTGG at 934, AGTTTGA at 877, AGTCTGA at 805, AGAGTAG at 800, AAAATGA at 749, AATCTGA at 730, AGTACAA at 714, AGAACAG at 647, AGTCCGG at 492, GGTTTAG at 449, GGTGTAA at 413, GATTCAA at 284, GGTTCGA at 215, GATACGG at 153, GGTTTAA at 139, GGTGTGA at 101, GATTTGG at 96, AAAATGG at 73, GAAACGG at 10.
  10. Inrr8ci: GGAGTGG at 2586, GGAATGA at 2537, GGACCAA at 2493, GGATTAA at 2434, GAACCGA at 2260, GATTTAG at 2245, GAAACAA at 2130, AAAACAA at 2103, GGTCCGG at 2077, GAATTGG at 1977, AATCCGA at 1941, GGTTTAG at 1934, AGTTTAA at 1926, GGAATAA at 1840, GGTCTAA at 1519, AATGTAA at 1430, AATTCGA at 1424, GGTCCAA at 1419, AGTTCAA at 1373, GAACTAA at 1344, GGAGCGG at 1330, AGAATGA at 1296, GAAGCGG at 1190, GAACCGA at 1071, AAAACGA at 1037, GATCCGA at 969, GAACTGA at 887, AGATCAG at 866, GAACCGA at 844, GGTACGG at 793, AATTTAG at 730, AAACTAA at 725, AAAGCGG at 712, GGACCAG at 691, GGTGTGA at 537, GATGTGG at 524, AGTGTGA at 440, GGTTTAA at 414, AATTCGG at 380, AGTATAG at 333, GGTGCGG at 295, AATTCAG at 278, GATGCAA at 174, GGAATGG at 158.

Inrr alternate negative direction distal promoters

  1. Inrr1: CTATACC at 2576, CCAATTT at 2565, TTAAACC at 2537, TCAGATT at 2448, TTGGTCC at 2430, CTAATCC at 2377, CCATTCT at 2295, CCAGATC at 2250, TCACTTT at 2202, TTGTTTC at 2163, TCATATT at 1995, TCGGTTT at 1976, CTAAACT at 1833, TCAGATC at 1798, CCGGATC at 1789, TCAGACT at 1754, TTAAATT at 1670, CCGTTTT at 1663, TCGTTTC at 1656, TTGTTCC at 1642, TCGGTTC at 1499, CTAAATT at 1492, CTGGTTT at 1392, TCGATCT at 1336, CCATTCT at 1318, CTAATTT at 1243, CTGCTCC at 1223, CTGTTCT at 1218, TTGCATT at 1142, CTACTCC at 1094, TTGCATT at 1025, TTAAACT at 976, CTGGTTT at 922, CTAGTCT at 904, CTAATTT at 827, TTATTCC at 815, CCAATTT at 788, TCATTCC at 774, CTATTTT at 767, CCGTATC at 739, TTGATTC at 707, TTACACT at 528, TTGAACC at 392, TTACACT at 371, TTATATT at 366, TCAGATT at 361, CCGCTTT at 329, TTAAATT at 312, CCGCACC at 235, CTGCTCT at 133, TCGATTC at 110, CCATATC at 58.
  2. Inrr3: CCGCTTC at 2524, TTGAATC at 2517, CCATTTC at 2505, CTAGTCC at 2336, CCGTTCT at 2159, CCAAACT at 2103, CCGTTTC at 2092, CTGCTTT at 2025, TTGATTT at 2018, CTAGACC at 1961, TTAAACT at 1852, CCGCATT at 1827, CCGATCT at 1756, CTACACT at 1742, TCGCATC at 1734, TTGCACC at 1721, TCAGTCT at 1659, CCAAATT at 1645, CTATTCC at 1640, CCGGTCC at 1585, CTGAATC at 1554, TCGGATT at 1488, CCATTCC at 1461, TTACTCC at 1383, TTAATTC at 1251, TTGGTCC at 1234, CCACTTC at 1180, TTGCTCT at 1152, CCATACC at 1079, TTGTTTC at 1030, TCATTTT at 925, TTGATCC at 847, CTACACT at 838, CCAATCC at 832, CTACATT at 804, CCAAATT at 795, TCATTCC at 481, CCAAATT at 445, CCACACC at 436, CCGTACC at 417, TTAATTT at 408, CCATTTT at 403, CCGGTCC at 347, TTACATT at 321, TTGCTTT at 316, TTGCTTT at 287, TTATTCC at 261, TTGGTCT at 26.
  3. Inrr5: TCGTTTC at 2544, CCGCTTC at 2524, TTGTTCT at 2442, TCGCACC at 2185, TTATATC at 2180, CCGCATC at 2023, CCGCTTT at 1975, CCGATTC at 1930, TCAAATT at 1886, CCACTCC at 1673, CCATTTC at 1609, TCGAATT at 1558, CTGGACC at 1527, CTAGACC at 1457, TTGATTT at 1447, CTAGTTT at 1413, TCAGTTT at 1358, TTAATCT at 1317, TCAAATT at 1271, TTGCATC at 1155, TTATTCC at 1114, CTAATTT at 1109, TCGTTTT at 1094, TTGGTCT at 1014, CCAGACC at 1002, TTGTTCT at 958, CCAGATC at 943, TCATACC at 938, CCAATCC at 913, CCAGATT at 884, TTATTTT at 860, CTGTTTT at 831, TCGAACT at 826, CTAGACC at 764, TCAATTC at 627, CCGTATC at 510, TTAATTC at 431, CCATTTC at 294, TTATTTT at 273, CTGTACT at 167, CCAAATC at 134.
  4. Inrr7: TCACTTT at 2589, CTGGACT at 2564, TTGAATT at 2557, TCAATCT at 2475, TTGATTC at 2467, TTGGATC at 2460, TCGAACT at 2384, CCATTCT at 2272, TTAATTT at 2253, TTACTTC at 2202, CCGTTTT at 2180, CCAAATT at 2150, TCGGTCC at 2051, TTACACT at 2036, TTGCACC at 1985, TTGAACC at 1973, TTACTTC at 1899, TCATTTT at 1894, TTGTTCT at 1886, TTGAACC at 1867, TCAGTTC at 1752, CTAATCC at 1738, TTGAATT at 1637, CTGGACT at 1582, CCAATCT at 1563, TCGGTCC at 1476, CCATTTT at 1342, CCAAATT at 1211, CCACACC at 1206, CTGATCC at 1163, TCGTTTC at 1089, TCGATTC at 1084, CTATATT at 1075, TCATTCT at 1018, CTATTCC at 890, CCAAACT at 885, CTGTTCC at 879, CCGTTCC at 828, CTAGTTT at 766, CTAAATC at 565, CCATATC at 549, CTAAATC at 508, CCATTTT at 489, TTGGACT at 325, TTAATCC at 211, TCATATT at 206, TTGATTT at 110, CTGCACT at 75, CCGTTCC at 37, TTAATCC at 31, CTGTTTC at 24.
  5. Inrr9: TCAATTT at 2562, CTGAATT at 2509, CTGATCC at 2485, CTAATCC at 2394, TCATTTT at 2342, TCAGTTT at 2287, CCGTACC at 2275, TTGAACC at 2220, TTATATT at 2124, CCGAACT at 2102, TCAGATT at 2060, TCGTATC at 1984, CTGTATT at 1902, TCGCTTC at 1854, CCACACT at 1816, CTGATTT at 1741, TTGTTTC at 1735, CCGGACT at 1710, CCGCATT at 1651, CTGCACT at 1601, CCACATC at 1522, CCGAATT at 1349, TCAGTTT at 1246, CCGTTTC at 1238, TCACTTC at 1199, CTGGACC at 1057, CCACTTC at 1013, CTGGTCT at 926, TTACACC at 920, CCGTTCT at 822, CTGTTTT at 717, TTGCTTC at 711, TCGTTTT at 706, CCGCACT at 693, TTGTATT at 621, CTGTATC at 554, CCGTACC at 530, TTATTTT at 467, TCGTTTT at 348, TCAGATT at 315, TTGAACT at 279, CTATTTT at 159, CTGCTTC at 125, TCAGTCT at 110, TTGTTTC at 97, CTGGTTT at 92.
  6. Inrr1ci: GGTCCAA at 2562, GGTATAG at 2509, GGTGTAA at 2490, GGACCGG at 2463, AATTTGG at 2427, AGAATGA at 2270, AAACCAG at 2247, AGTACAA at 2069, AGTTCGG at 1973, AATACAA at 1945, AAAATGG at 1935, AATTTAG at 1873, AATACGA at 1826, AAAACAA at 1821, GAAGCGG at 1811, AGATCAG at 1781, AGAGTAG at 1553, AAAGCGA at 1520, AAAGTGG at 1445, GATCTGG at 1338, GATATGG at 1109, AAAACAG at 1054, AATGTAA at 1049, GAACTGA at 1037, AATTTAA at 1005, GGTTTGG at 924, AAAGCAG at 885, AAACCAA at 785, GGATTGA at 704, GGTGCGG at 600, AGTTCAA at 562, GGTTTGG at 413, GAACCGG at 394, AAATTGG at 314, AATACAA at 300.
  7. Inrr3ci: AATTCAA at 2595, GGTTTAG at 2558, GATCTAG at 2437, AAATTGG at 2389, GGTATAG at 2371, AGTCCAG at 2338, AGTGTGA at 2206, AGTTTAA at 2152, GGACTGA at 2118, AAACCGG at 2068, AGTCTAA at 2061, GGTGTGG at 2011, AGTCTGG at 1985, AAATCAA at 1838, AAACTAA at 1801, GATGTAA at 1793, AATTTGG at 1786, GATCTGA at 1758, GATGCGG at 1691, AGTCTAG at 1661, AAATTAA at 1647, GGTCCAG at 1587, GATGCAG at 1411, GAATTAA at 1349, GAAGCGA at 1322, GGACCAA at 1276, AATTCGA at 1253, GGATTGG at 1231, GGTGTGG at 1107, GATTTAA at 1089, GGATCAG at 970, GGAACGA at 959, AGTCCAG at 951, GGTTCAG at 859, GGAGTAA at 823, GAACTGA at 718, GATGCGA at 706, AATGCAA at 545, AAAATGA at 539, GAAGTAG at 492, GGACCGG at 344, GAACTAG at 223, AGTTTAA at 202, GGAACAG at 197.
  8. Inrr5ci: GGTTCGG at 2589, GGTCTAG at 2574, GATGCGA at 2505, GGTCCGA at 2471, GAAGTGG at 2466, GGAACAA at 2414, GGAATGG at 2341, AGTCTGG at 2335, GGAGTAA at 2303, GGTTTGG at 2298, GGTACAA at 2245, GAAGCGG at 2240, GGTGTAG at 2220, GATGCAG at 2210, GAATCAG at 2159, AATCTGA at 2108, GGTGTAA at 1960, GAACCAA at 1950, AAATTGG at 1888, AGAGCGG at 1753, AGATCAG at 1748, GGTCCAG at 1593, GAACCGG at 1432, AGTTTAA at 1360, AGTCCAG at 1339, AAATTGA at 1291, AAACTAG at 1212, AGAACAG at 1197, AATGTGG at 1130, GGTCTGG at 1045, AGTGCAA at 900, AGTGCAA at 819, AGACCAG at 766, AATCCAG at 757, GATATAA at 671, GAACCAG at 615, GAAGCGA at 544, AGAATGA at 503, AGTACGG at 411, AGAGCAG at 381, GGAATAG at 376, AAATCAA at 136, AATCCAA at 131.
  9. Inrr7ci: GGTGCAA at 2581, GGACTAA at 2566, GGATCGA at 2532, AATTTGA at 2397, GGTTCGA at 2381, AGAATAA at 2239, AGTCCAG at 2225, GGAGTGG at 2212, GGACTGA at 2072, AAAATAG at 2066, GAACCAG at 1975, AGTTTAG at 1960, GGATCAA at 1923, GGTTTGG at 1848, AAAATGA at 1835, AGTTCAG at 1754, AGAATGG at 1644, GAATTAG at 1639, AATGTGG at 1510, AGAATAA at 1445, AATATAA at 1413, AAAATGA at 1186, AAAATGA at 1114, AGAGTGA at 1000, AGAATAG at 995, AATACAG at 909, GAAGTGG at 652, AAATCGA at 510, AGAATAA at 418, AAAGCAA at 363, GATGTAA at 317, AAACTAA at 283, GGATTAA at 243, AAAATAA at 144, GATTTGG at 112.
  10. Inrr9ci: AATTTGG at 2564, GGTTTAG at 2542, AGAATGA at 2463, GGAGCGA at 2443, AAACTAA at 2391, AGAGTAA at 2381, AGATTGG at 2329, GATACAG at 2260, GGAATGA at 2231, GGTTTGA at 2217, GGTGTAA at 2205, AAAACGG at 2187, GAAGCGG at 2180, AAAACAA at 2091, AAAGCGG at 2053, GAAACGG at 1933, GAACTGG at 1872, GAAACGA at 1867, GGTACGG at 1809, AAACTGA at 1754, GAAACAA at 1748, GATTTGA at 1743, GGACTGG at 1712, AGTCCGA at 1567, AGAGCAA at 1513, GATGTGG at 1470, GGAGTGG at 1454, GATGCGG at 1426, GGACTGA at 1387, GAAATAA at 1379, GGAACAA at 1297, GGTACGG at 1217, AGTCTGG at 1208, GGTGTAG at 1175, AATTTAG at 1102, GGACCAA at 1059, AAAACGG at 1021, AATATAG at 941, AAAACGG at 895, AAAGCGG at 886, AAAACGG at 741, GGTCTAA at 663, AAACCGG at 596, GGTCCAA at 577, AAAATAG at 408, AAAATGG at 332, GAAACGG at 298, GAACTAA at 281, AAATTGA at 276, GGATTAA at 241, AAATTGA at 224, AAACCAG at 183, GGAGCAG at 43, AGTCTGG at 37, AGTACAG at 12.

Inrr arbitrary positive direction distal promoters

  1. Inrr1: CTGGTCT at 3958, TCAGTCT at 3949, CCGAATT at 3780, CTGTATC at 3523, CCATTTC at 3409, TCGAATC at 3228, CCGAACC at 3188, CCGAATT at 2977, CCGTACT at 2932, CCGAATT at 2897, CTAGACC at 2765, CCGTATC at 2689, CCGGTCT at 2670, TCGAACC at 2642, CTATACC at 2576, CCAATTT at 2565, TTAAACC at 2537, TCAGATT at 2448, TTGGTCC at 2430, CTAATCC at 2377, CCATTCT at 2295, CCAGATC at 2250, TCACTTT at 2202, TTGTTTC at 2163, TCATATT at 1995, TCGGTTT at 1976, CTAAACT at 1833, TCAGATC at 1798, CCGGATC at 1789, TCAGACT at 1754, TTAAATT at 1670, CCGTTTT at 1663, TCGTTTC at 1656, TTGTTCC at 1642, TCGGTTC at 1499, CTAAATT at 1492, CTGGTTT at 1392, TCGATCT at 1336, CCATTCT at 1318, CTAATTT at 1243, CTGCTCC at 1223, CTGTTCT at 1218, TTGCATT at 1142, CTACTCC at 1094, TTGCATT at 1025, TTAAACT at 976, CTGGTTT at 922, CTAGTCT at 904, CTAATTT at 827, TTATTCC at 815, CCAATTT at 788, TCATTCC at 774, CTATTTT at 767, CCGTATC at 739, TTGATTC at 707, TTACACT at 528, TTGAACC at 392, TTACACT at 371, TTATATT at 366, TCAGATT at 361, CCGCTTT at 329, TTAAATT at 312, CCGCACC at 235, CTGCTCT at 133, TCGATTC at 110, CCATATC at 58.
  2. Inrr3: TTAATTT at 4050, CCAAATC at 4021, TTGGTCT at 4012, TTGTTTC at 4005, TTGTATT at 4000, CCAGTTC at 3987, TCACTCT at 3847, TTAGATC at 3824, CCGCATC at 3814, TCGATTT at 3688, CCAGTCC at 3570, CTACTTC at 3564, TCAATCC at 3516, TCATTCC at 3472, TCGGTCT at 3466, CTGAATC at 3461, CCAGTCT at 3216, CTGCTCC at 3182, TTGAACC at 3172, TCGGTCC at 3127, CCATTCC at 3088, CTGCACC at 3076, TTAGTCC at 3027, TTAGTTT at 3021, CCACTCT at 2998, TTACTCC at 2977, CCGGTCC at 2887, TCATACT at 2858, TCAGTCT at 2821, TCGCACT at 2803, TTGTTTC at 2784, TCAAATC at 2765, TCAATTC at 2760, CCGCTTC at 2524, TTGAATC at 2517, CCATTTC at 2505, CTAGTCC at 2336, CCGTTCT at 2159, CCAAACT at 2103, CCGTTTC at 2092, CTGCTTT at 2025, TTGATTT at 2018, CTAGACC at 1961, TTAAACT at 1852, CCGCATT at 1827, CCGATCT at 1756, CTACACT at 1742, TCGCATC at 1734, TTGCACC at 1721, TCAGTCT at 1659, CCAAATT at 1645, CTATTCC at 1640, CCGGTCC at 1585, CTGAATC at 1554, TCGGATT at 1488, CCATTCC at 1461, TTACTCC at 1383, TTAATTC at 1251, TTGGTCC at 1234, CCACTTC at 1180, TTGCTCT at 1152, CCATACC at 1079, TTGTTTC at 1030, TCATTTT at 925, TTGATCC at 847, CTACACT at 838, CCAATCC at 832, CTACATT at 804, CCAAATT at 795, TCATTCC at 481, CCAAATT at 445, CCACACC at 436, CCGTACC at 417, TTAATTT at 408, CCATTTT at 403, CCGGTCC at 347, TTACATT at 321, TTGCTTT at 316, TTGCTTT at 287, TTATTCC at 261, TTGGTCT at 26.
  3. Inrr5: TCAGATC at 4030, CCGTTCT at 3992, TTGAACC at 3941, CTAATCC at 3853, CTGAACT at 3798, CTATTTT at 3704, CCGTTCT at 3692, TTATTTC at 3656, CCAAACT at 3513, TCGAATC at 3449, CTACTTC at 3444, TCGAATT at 3279, TTGAATC at 3274, TTGGTTT at 3269, TCAGACC at 3239, TCACTTC at 3219, TCGCTTT at 3095, CCGGACC at 3025, TCAATTC at 2989, CCGGTTC at 2984, TCGTACC at 2943, CTGCTTC at 2914, CCGAACC at 2891, CTGAACC at 2869, TCGAACC at 2863, CCGTTCC at 2821, TTAATCC at 2648, CCATATT at 2628, CCATTTT at 2601, TCGTTTC at 2544, CCGCTTC at 2524, TTGTTCT at 2442, TCGCACC at 2185, TTATATC at 2180, CCGCATC at 2023, CCGCTTT at 1975, CCGATTC at 1930, TCAAATT at 1886, CCACTCC at 1673, CCATTTC at 1609, TCGAATT at 1558, CTGGACC at 1527, CTAGACC at 1457, TTGATTT at 1447, CTAGTTT at 1413, TCAGTTT at 1358, TTAATCT at 1317, TCAAATT at 1271, TTGCATC at 1155, TTATTCC at 1114, CTAATTT at 1109, TCGTTTT at 1094, TTGGTCT at 1014, CCAGACC at 1002, TTGTTCT at 958, CCAGATC at 943, TCATACC at 938, CCAATCC at 913, CCAGATT at 884, TTATTTT at 860, CTGTTTT at 831, TCGAACT at 826, CTAGACC at 764, TCAATTC at 627, CCGTATC at 510, TTAATTC at 431, CCATTTC at 294, TTATTTT at 273, CTGTACT at 167, CCAAATC at 134.
  4. Inrr7: CTACTTT at 3927, TTGGTTT at 3727, CCGGACC at 3661, CCACACT at 3643, CTATATC at 3588, CCGATCT at 3551, CTAATTT at 3452, TCAGATT at 3374, TTGGATC at 3367, TCGTTCT at 3281, TTGGTCT at 3122, TCGTTCC at 3048, TTGCTCC at 2876, TTGCTTT at 2837, CTGTTTT at 2821, TTGGATT at 2764, CCACTCT at 2649, TCACTTT at 2589, CTGGACT at 2564, TTGAATT at 2557, TCAATCT at 2475, TTGATTC at 2467, TTGGATC at 2460, TCGAACT at 2384, CCATTCT at 2272, TTAATTT at 2253, TTACTTC at 2202, CCGTTTT at 2180, CCAAATT at 2150, TCGGTCC at 2051, TTACACT at 2036, TTGCACC at 1985, TTGAACC at 1973, TTACTTC at 1899, TCATTTT at 1894, TTGTTCT at 1886, TTGAACC at 1867, TCAGTTC at 1752, CTAATCC at 1738, TTGAATT at 1637, CTGGACT at 1582, CCAATCT at 1563, TCGGTCC at 1476, CCATTTT at 1342, CCAAATT at 1211, CCACACC at 1206, CTGATCC at 1163, TCGTTTC at 1089, TCGATTC at 1084, CTATATT at 1075, TCATTCT at 1018, CTATTCC at 890, CCAAACT at 885, CTGTTCC at 879, CCGTTCC at 828, CTAGTTT at 766, CTAAATC at 565, CCATATC at 549, CTAAATC at 508, CCATTTT at 489, TTGGACT at 325, TTAATCC at 211, TCATATT at 206, TTGATTT at 110, CTGCACT at 75, CCGTTCC at 37, TTAATCC at 31, CTGTTTC at 24.
  5. Inrr9: TCAAACC at 4041, TTATATT at 3990, CCGGTTT at 3985, CCGATTC at 3972, CCGGACC at 3921, TCGCTTT at 3867, TTGCACC at 3807, CTACACT at 3757, CCGTATC at 3665, CCATTCC at 3660, TCGAACT at 3653, CTAATTC at 3613, TTGATCC at 3583, TCAAATT at 3577, TTAATCC at 3558, CTGAATC at 3544, CTGCTTC at 3528, CTAGATT at 3313, CTATACT at 3246, CCAAACT at 3141, TCGCTTC at 3133, TCGCATT at 3101, TTGTTCC at 3094, TCGTTTC at 3028, CTGTTCC at 3009, TCGTTTC at 2899, TCGCTCC at 2874, TCGTTCT at 2751, TTGCACC at 2743, TCGTTCC at 2720, TCGGATT at 2603, TCAATTT at 2562, CTGAATT at 2509, CTGATCC at 2485, CTAATCC at 2394, TCATTTT at 2342, TCAGTTT at 2287, CCGTACC at 2275, TTGAACC at 2220, TTATATT at 2124, CCGAACT at 2102, TCAGATT at 2060, TCGTATC at 1984, CTGTATT at 1902, TCGCTTC at 1854, CCACACT at 1816, CTGATTT at 1741, TTGTTTC at 1735, CCGGACT at 1710, CCGCATT at 1651, CTGCACT at 1601, CCACATC at 1522, CCGAATT at 1349, TCAGTTT at 1246, CCGTTTC at 1238, TCACTTC at 1199, CTGGACC at 1057, CCACTTC at 1013, CTGGTCT at 926, TTACACC at 920, CCGTTCT at 822, CTGTTTT at 717, TTGCTTC at 711, TCGTTTT at 706, CCGCACT at 693, TTGTATT at 621, CTGTATC at 554, CCGTACC at 530, TTATTTT at 467, TCGTTTT at 348, TCAGATT at 315, TTGAACT at 279, CTATTTT at 159, CTGCTTC at 125, TCAGTCT at 110, TTGTTTC at 97, CTGGTTT at 92.
  6. Inrr1ci: AAAATAG at 3885, GAACCAA at 3880, GAAACGA at 3875, AATTTGA at 3783, GAAGTGG at 3672, AATACAA at 3571, GAAGCGA at 3565, AAACCAA at 3537, GGAGTAG at 3508, AAATTGA at 3492, AGTGCGA at 3468, GATACAG at 3385, GGTACAG at 3353, GGTTTAA at 3344, GATGTAG at 3288, AAAACAG at 3273, GATTCGA at 3225, AAAGCGA at 3220, GAACCGA at 3185, AGAATAA at 3082, AATGTAA at 3076, AAAGTGA at 3068, AGATCAG at 2997, AATTCAA at 2991, GGTGCGG at 2958, GAATTAA at 2899, GATCCAG at 2837, GATCTAG at 2629, AGAACAG at 2608, GGTCCAA at 2562, GGTATAG at 2509, GGTGTAA at 2490, GGACCGG at 2463, AATTTGG at 2427, AGAATGA at 2270, AAACCAG at 2247, AGTACAA at 2069, AGTTCGG at 1973, AATACAA at 1945, AAAATGG at 1935, AATTTAG at 1873, AATACGA at 1826, AAAACAA at 1821, GAAGCGG at 1811, AGATCAG at 1781, AGAGTAG at 1553, AAAGCGA at 1520, AAAGTGG at 1445, GATCTGG at 1338, GATATGG at 1109, AAAACAG at 1054, AATGTAA at 1049, GAACTGA at 1037, AATTTAA at 1005, GGTTTGG at 924, AAAGCAG at 885, AAACCAA at 785, GGATTGA at 704, GGTGCGG at 600, AGTTCAA at 562, GGTTTGG at 413, GAACCGG at 394, AAATTGG at 314, AATACAA at 300.
  7. Inrr3ci: GAAATAA at 3942, GGACTGA at 3898, AGTGTGG at 3854, AGACCAA at 3739, GGAATAA at 3672, AGACTGG at 3648, AAAGCAG at 3617, GGAGTAA at 3601, GGTTTAA at 3533, AATGTAG at 3491, GAATCGG at 3463, AGTGTGA at 3438, AGACCAA at 3389, AATCCAA at 3366, GGTATAA at 3337, AAAACAA at 3265, AGTGTAA at 3059, AATGCAA at 3047, AGTGTGA at 2969, GGATTGG at 2942, GAAATGG at 2937, AAAACAA at 2923, AAAACGG at 2903, AATTCGG at 2851, AATTCAA at 2762, AGAGTGA at 2635, AATTCAA at 2595, GGTTTAG at 2558, GATCTAG at 2437, AAATTGG at 2389, GGTATAG at 2371, AGTCCAG at 2338, AGTGTGA at 2206, AGTTTAA at 2152, GGACTGA at 2118, AAACCGG at 2068, AGTCTAA at 2061, GGTGTGG at 2011, AGTCTGG at 1985, AAATCAA at 1838, AAACTAA at 1801, GATGTAA at 1793, AATTTGG at 1786, GATCTGA at 1758, GATGCGG at 1691, AGTCTAG at 1661, AAATTAA at 1647, GGTCCAG at 1587, GATGCAG at 1411, GAATTAA at 1349, GAAGCGA at 1322, GGACCAA at 1276, AATTCGA at 1253, GGATTGG at 1231, GGTGTGG at 1107, GATTTAA at 1089, GGATCAG at 970, GGAACGA at 959, AGTCCAG at 951, GGTTCAG at 859, GGAGTAA at 823, GAACTGA at 718, GATGCGA at 706, AATGCAA at 545, AAAATGA at 539, GAAGTAG at 492, GGACCGG at 344, GAACTAG at 223, AGTTTAA at 202, GGAACAG at 197.
  8. Inrr5ci: GAACCGG at 4047, GGTGTGA at 4042, AATGTAG at 4006, AGAGCAG at 3979, AAAATGA at 3965, AAATTAA at 3931, GGAACGA at 3864, AGTTCGG at 3770, GATGTAA at 3713, GAAGCGA at 3593, AGAATGG at 3531, GGTCTAG at 3486, AAAGCAG at 3479, AATCCGG at 3452, GGAGTAG at 3408, GGTGTGG at 3374, GGTTCGG at 3288, AATTTGG at 3282, GAATCGA at 3276, GGTTTGA at 3271, GGTACAA at 3254, AGATTGG at 3172, GGAGCAG at 3156, AGTATGG at 3052, AATCCAG at 3035, GGTTCAA at 2986, GAACCAA at 2893, GATGTAG at 2784, AAAACGG at 2776, GAAATGA at 2768, GGAACGG at 2744, AGTGTGG at 2671, AATCCAG at 2650, GGTTCGG at 2589, GGTCTAG at 2574, GATGCGA at 2505, GGTCCGA at 2471, GAAGTGG at 2466, GGAACAA at 2414, GGAATGG at 2341, AGTCTGG at 2335, GGAGTAA at 2303, GGTTTGG at 2298, GGTACAA at 2245, GAAGCGG at 2240, GGTGTAG at 2220, GATGCAG at 2210, GAATCAG at 2159, AATCTGA at 2108, GGTGTAA at 1960, GAACCAA at 1950, AAATTGG at 1888, AGAGCGG at 1753, AGATCAG at 1748, GGTCCAG at 1593, GAACCGG at 1432, AGTTTAA at 1360, AGTCCAG at 1339, AAATTGA at 1291, AAACTAG at 1212, AGAACAG at 1197, AATGTGG at 1130, GGTCTGG at 1045, AGTGCAA at 900, AGTGCAA at 819, AGACCAG at 766, AATCCAG at 757, GATATAA at 671, GAACCAG at 615, GAAGCGA at 544, AGAATGA at 503, AGTACGG at 411, AGAGCAG at 381, GGAATAG at 376, AAATCAA at 136, AATCCAA at 131.
  9. Inrr7ci: 73, AAAATGA at 3964, AAAGCGG at 3936, AATTTAA at 3915, GGAACGA at 3868, AATACAA at 3783, GGTGCAG at 3769, AGACCAA at 3761, AGTTCAG at 3749, GGTTTGA at 3729, GAACCGG at 3658, AAACTGA at 3528, GGAGCGG at 3468, GAATTAG at 3412, GGAACGG at 3341, AAATCGA at 3310, GATATAA at 3056, AAATTAA at 2958, GAATCAA at 2947, GAACCAA at 2923, AGAACGG at 2916, AAAGCAA at 2893, GATTTAA at 2767, AGACTGA at 2696, GGACCAA at 2690, GGTGCAA at 2581, GGACTAA at 2566, GGATCGA at 2532, AATTTGA at 2397, GGTTCGA at 2381, AGAATAA at 2239, AGTCCAG at 2225, GGAGTGG at 2212, GGACTGA at 2072, AAAATAG at 2066, GAACCAG at 1975, AGTTTAG at 1960, GGATCAA at 1923, GGTTTGG at 1848, AAAATGA at 1835, AGTTCAG at 1754, AGAATGG at 1644, GAATTAG at 1639, AATGTGG at 1510, AGAATAA at 1445, AATATAA at 1413, AAAATGA at 1186, AAAATGA at 1114, AGAGTGA at 1000, AGAATAG at 995, AATACAG at 909, GAAGTGG at 652, AAATCGA at 510, AGAATAA at 418, AAAGCAA at 363, GATGTAA at 317, AAACTAA at 283, GGATTAA at 243, AAAATAA at 144, GATTTGG at 112.
  10. Inrr9ci: AAACCGG at 4043, GATTCGA at 3974, AATGCGG at 3961, GAAACGA at 3847, GAACTGA at 3842, AAAATGG at 3787, AGATTGG at 3722, GGTATGA at 3702, GGAATGA at 3678, GGACTAG at 3635, AATTTGA at 3580, GGTGCGG at 3535, AATTTGA at 3494, GAAATAG at 3467, AAAACGG at 3421, AAACTAG at 3310, GGTTTAA at 3284, AAATCGG at 3237, AGAGCGG at 3230, AAACTAA at 3070, GGAACAA at 3065, AATACAG at 2914, GAATTGG at 2889, AGAATAA at 2847, AAAATGG at 2625, GATTCAG at 2606, AATTTGG at 2564, GGTTTAG at 2542, AGAATGA at 2463, GGAGCGA at 2443, AAACTAA at 2391, AGAGTAA at 2381, AGATTGG at 2329, GATACAG at 2260, GGAATGA at 2231, GGTTTGA at 2217, GGTGTAA at 2205, AAAACGG at 2187, GAAGCGG at 2180, AAAACAA at 2091, AAAGCGG at 2053, GAAACGG at 1933, GAACTGG at 1872, GAAACGA at 1867, GGTACGG at 1809, AAACTGA at 1754, GAAACAA at 1748, GATTTGA at 1743, GGACTGG at 1712, AGTCCGA at 1567, AGAGCAA at 1513, GATGTGG at 1470, GGAGTGG at 1454, GATGCGG at 1426, GGACTGA at 1387, GAAATAA at 1379, GGAACAA at 1297, GGTACGG at 1217, AGTCTGG at 1208, GGTGTAG at 1175, AATTTAG at 1102, GGACCAA at 1059, AAAACGG at 1021, AATATAG at 941, AAAACGG at 895, AAAGCGG at 886, AAAACGG at 741, GGTCTAA at 663, AAACCGG at 596, GGTCCAA at 577, AAAATAG at 408, AAAATGG at 332, GAAACGG at 298, GAACTAA at 281, AAATTGA at 276, GGATTAA at 241, AAATTGA at 224, AAACCAG at 183, GGAGCAG at 43, AGTCTGG at 37, AGTACAG at 12.

Inrr alternate positive direction distal promoters

  1. Inrr0: CCATTTT at 4003, TCATATC at 3963, CTAAATT at 3926, CCGTATC at 3859, TCGCACT at 3751, TTAATTT at 3694, CTGGTCC at 3612, CCGCTCT at 3527, TTGCACC at 3477, CCACTTT at 3412, TCATTCC at 3390, CCAATTT at 3353, CTGCTTC at 3334, TTGTTTT at 3182, TTAGTCC at 3134, CCGTACC at 3058, TTAGTTT at 3030, TCAGTTT at 2966, TTAGATT at 2856, TCATATT at 2851, CTAAACC at 2659, CCAATTT at 2616, CCGTTTT at 2566, CTAAATC at 2559, TTAATCC at 2507, CCAATTT at 2494, TCGGTTT at 2479, TTGTTCC at 2431, TTATATT at 2373, CTAGTTT at 2343, CTGCTCC at 2299, CTAGACT at 2294, CCACACC at 2164, TTATATT at 2143, CTACTTT at 2109, TCAAATT at 1988, CTGTTCT at 1979, CCAGATC at 1949, CTATTTC at 1911, TTACTCT at 1906, TTGATTT at 1819, TCAGATT at 1774, TCGCTTT at 1768, TTGAACC at 1692, CCGGTTT at 1687, TTGTTTT at 1671, CTGGTTT at 1638, CTATTTT at 1569, TCGTATC at 1544, TCGCTCT at 1513, TTGATTT at 1459, CCAGTCT at 1373, TCACTTT at 1363, CCGCATT at 1265, TCGAACC at 1260, TCGTTCT at 1229, TCGTACC at 1176, CCACATT at 1165, TCAATCC at 1160, CCAAACC at 1087, CTACTTT at 1020, CCGGATT at 952, CCACTCC at 947, TCGAACT at 897, TCACATT at 889, CTGGTTC at 884, CCAAATT at 862, CCGAATT at 811, TCGTTCT at 651, TTGTTCC at 619, CTATTTT at 494, TCGCATT at 481, CTAATTT at 474, TTGCATC at 364, CCAAACC at 283, CCGTATT at 267, TTGTATC at 156, CCAGATT at 128, TTGGTTC at 84, TTGAATT at 79, CCATACT at 41, TCGAACC at 23.
  2. Inrr2: CCGTACT at 4001, CCAATCT at 3962, CTGATCT at 3762, CTACATT at 3722, CTGGTCC at 3599, TCGATTT at 3557, CTGGTTT at 3492, TCGCATT at 3483, CTGTATC at 3478, TCGGATC at 3436, CTATTTC at 3405, CCGGTCT at 3400, CCATTTT at 3381, CTAAACT at 3364, TCGCTTT at 3341, TTAATCC at 3231, CCGAATC at 2991, TTGGTCT at 2962, TCGGACC at 2888, CTGTATT at 2882, TCGTTCC at 2876, TCGCATT at 2752, CCGTTTT at 2528, CTAAATT at 2518, TCACACC at 2423, CTATATC at 2414, CCGTTTT at 2369, TCGTATC at 2356, TTGTTTC at 2349, CCGTATC at 2322, CCGAACT at 2237, CCACTTC at 2100, TCACTCC at 2090, TCACTTT at 2054, CTAGTCT at 1899, TTATACC at 1783, TTAAATT at 1769, CCAATTC at 1614, CCGGATT at 1580, CCGAACC at 1531, CTGTTCC at 1516, TCGGTCC at 1477, TTGGATC at 1418, TCGCATT at 1413, TTGGTCC at 1400, CTGTATT at 1372, TCAATCC at 1360, TTGGTCT at 1338, TTGTATT at 1273, CCGGTCT at 1214, TTGATCT at 1193, TTACATT at 1188, CCGTTTC at 1145, CCGCACT at 1105, CTGTTTC at 1099, TCAATTC at 1074, CTACACT at 900, CCACTTT at 888, CCAAACT at 745, TCGTTTC at 739, TCGTTTC at 718, CCGGTTC at 635, CCAGACC at 607, TCAATCC at 568, TCGTTTT at 446, CCAGACT at 355, CTATTTT at 34.
  3. Inrr4: TTAATCT at 3984, CTGGATC at 3922, CTACTCT at 3852, CCACTCT at 3839, CCGAACT at 3815, TTATATT at 3721, CTACATT at 3698, CTAATTT at 3672, TTGTTTT at 3533, TTATTTT at 3515, TTAAACC at 3374, TCGAATC at 3346, TCGGTCC at 3299, CTGTACT at 3290, TTAGATC at 3132, TTATTTC at 3018, CCAAACT at 2831, CTAAATC at 2816, CCGAATC at 2738, TCGGTCC at 2666, CCGTTTT at 2633, CCATTTC at 2547, TCGGTTC at 2372, TTATATT at 2296, TTATTTT at 2255, CTACTTT at 2127, TCATTCC at 2113, CCACTTC at 2081, TTACTTT at 2018, CTGGTTC at 1927, CTGCTCC at 1917, TTGTATT at 1795, CCAATTC at 1786, CCAATCT at 1763, CCACTTC at 1718, CCATTCC at 1552, CCGAATC at 1273, TTGTACC at 1217, TCGTTTC at 1187, CTGCTCT at 1066, CTACACT at 1053, TTGGATT at 840, CCGCTTT at 782, TTACACT at 624, TTAGTCT at 609, CCGCTTC at 589, TCAAATT at 478, TCAATTC at 440, TCGTTCT at 371, TCGTTTT at 361, CCATTCT at 310, CTAAACC at 303, CCAAACC at 231, CTGGTCC at 206, TCAATTT at 120, CCGCACC at 106, TCGATCT at 66.
  4. Inrr6: CCATTCT at 3999, CTGCACC at 3878, TCGAATT at 3810, TTATTCC at 3765, TTAGACC at 3731, TCAAATT at 3715, CCATTCC at 3690, CCACACC at 3630, CCATACT at 3597, CCGCTTC at 3591, CCAAATT at 3527, TCACTCC at 3495, TTGCTTC at 3479, TTATTCT at 3472, CCGGACC at 3415, CCGGATT at 3391, CCATACT at 3307, TTGATTC at 3177, CTGAACT at 3096, CTGTTCC at 3060, TCGTTTT at 3014, TCAATTT at 2974, TTGAATT at 2888, TCAAATT at 2740, TTGCATC at 2667, TCAATTT at 2649, CCGAATC at 2621, CCGGACC at 2615, TCGTATT at 2514, TCGGTTT at 2414, CCATATC at 2351, TTAGTTT at 2344, CCGGACC at 2269, CCGCTTT at 2252, CCAATTT at 2230, CTAGTTT at 2077, TTGGACC at 2015, CTACTTT at 1993, TCACACT at 1840, TCAATCT at 1741, CCGTATC at 1720, TCAGTTT at 1699, TTGCATT at 1660, TTAATTT at 1654, CCAATTT at 1648, CTAGACC at 1599, CTACATC at 1587, TTATTCC at 1490, CCGTTCT at 1383, CTGAATT at 1324, TTGAATT at 1304, CCACATT at 1299, CCACTTT at 1250, CTAAACC at 1208, TTAATCC at 1187, TTAGACC at 1177, CTATATT at 1138, CTGTTTC at 1000, TTGTATT at 914, TCAGTCC at 894, TTGATCT at 880, CTAGTTT at 875, CCAGACC at 854, CTGTTCT at 827, CTAATTT at 815, CCAAATC at 727, CCGATCT at 702, CCAATCT at 694, TTAATTC at 666, CCATATC at 626, TCGGACC at 518, TCGATCT at 509, CTAGTCC at 490, TTGATTC at 259, CTAAACT at 234, CTGGACT at 205, TCGAACC at 199.
  5. Inrr8: TCAAACC at 4001, CCAGTTT at 3956, CTATACT at 3945, CCAATCT at 3940, TTGGTTT at 3923, TTGAACC at 3797, CCGGTTC at 3652, CCGGACC at 3616, TTGCTTC at 3521, TTGCTTT at 3380, TTAATTC at 3354, TCATTTT at 3335, CTGGTCC at 3143, TTGGTTT at 3120, TTGCTTC at 3065, TTGGATC at 3016, TCGGACC at 3007, TCGCATT at 2959, CTGGTCC at 2876, CCAGTTC at 2793, CCAAATT at 2496, CCGGACC at 2491, CCGCACC at 2446, TTAATTC at 2437, TTGGATT at 2432, TTGTTTC at 2383, TCGAACC at 2314, TTGAACC at 2258, CCAATTT at 2221, CCGCTCC at 2216, CCACACC at 1948, TTAGTTT at 1924, CTGTTTC at 1889, TTAGTTT at 1850, TCGTTTT at 1833, TTACTTC at 1823, CCGATTT at 1811, CCATTTC at 1802, CTACTCT at 1772, CTGAACC at 1583, CCGTTTC at 1531, TCACATT at 1505, CCGGACT at 1464, CCAATTC at 1422, CTACACC at 1402, CTAGTTC at 1371, CTGAACT at 1342, TTGGTCC at 1258, CTGCTTT at 1164, CCGGATC at 1057, CCACACC at 1051, TCATATT at 877, CCGCACT at 763, TTATTTT at 740, CTAATTT at 728, CCGCTCT at 630, CCACTCC at 562, CCGTTCC at 426, CCGGTTT at 367, TTACACT at 199, TCAATTT at 194, CTATTTT at 166, CTAAATC at 140, CCGGTTC at 122, TCACTCC at 83, CCAAACT at 58, CCGGTTC at 52, CTGTATT at 32.
  6. Inrr0ci: GGATTAA at 3975, AAAGTAG at 3759, AGTACAA at 3727, GAATTAG at 3722, AAATTGA at 3569, GGAACAA at 3501, GAACTGG at 3496, AAAGTAG at 3485, GAAACGG at 3419, AAAGTAG at 3341, GGTTTAA at 3297, GAACTAG at 3232, GATACGG at 3221, AAAATGG at 3174, AGAATAG at 3151, AGTCCAG at 3136, GGAATAA at 3124, AGATTGA at 2858, GATCCGG at 2764, GGAACGA at 2754, GGTCCGG at 2749, AAAACGG at 2740, GGAGCAG at 2533, AATCCGA at 2509, AATTTGG at 2496, GGTTTGA at 2481, GATTTGA at 2417, GGTGCGG at 2324, AAAATAA at 2261, GAAACGG at 2202, GGAATAA at 2016, AAAACGG at 2011, AGATCGA at 1951, GGAATGG at 1898, AGAGTAG at 1873, AGATCAA at 1854, AATTTGA at 1816, AGATTGG at 1776, AATACAG at 1709, GGTTTGA at 1689, GGAACAG at 1534, AAAACGA at 1502, GGAATAA at 1490, GAAGCAG at 1481, GGTGCAG at 1401, AGTGCAG at 1339, AAACCAG at 1325, AGACCGA at 1317, GAACTAA at 1304, GGTACAG at 1284, AAACCAG at 1245, GGAATAA at 1103, AGAATGG at 935, GAACTGA at 899, AGTCCGA at 877, AAACTAA at 834, AAAATGG at 799, AATTCAA at 794, GATGTGG at 765, GATCTGA at 739, AAACCGA at 668, AAACTGG at 638, AGTTTGG at 565, GGTATAG at 385, AAAATAA at 197, GAAGTAA at 182, GAAGTGG at 109, GGATCAA at 96, GGTTCGA at 86, GAATTGG at 81.
  7. Inrr2ci: AATATAA at 3855, AAAATGA at 3800, AAATTAG at 3789, GATTCGA at 3773, GATCTGG at 3764, AAAGCGA at 3658, GGTCCAA at 3601, GGATCGA at 3438, AGAGTGA at 3334, AAACCAA at 3173, AGTACGA at 3157, AAAGCAG at 3152, GGTATGA at 3141, GAATTAA at 3031, GAACTAG at 3025, GATTTGG at 3004, GGTCCAA at 2984, AAAGCAA at 2916, AAAACGG at 2783, AATGTGG at 2765, GGTTCAA at 2662, GATTCGA at 2655, GGTCCGG at 2615, AATTTAG at 2521, GGAATAA at 2509, GGTATGA at 2475, AAACCAA at 2444, GGTACGG at 2401, AAATCAA at 2385, GGACTAG at 2335, AGACCGA at 2234, AGAGCAA at 2209, GGTGTGG at 2201, AAAACAA at 2191, AATTTAA at 2132, AAACTGG at 2072, GGTGCAA at 2006, GGTATGA at 1918, AAACTAA at 1853, AATTTAG at 1772, AAACCGA at 1747, GATTCAA at 1693, GGTTTAA at 1671, GATGTAG at 1650, AAACCAA at 1632, GGTCCGG at 1577, AAATTAA at 1554, GGTCTGA at 1340, AAATTGG at 1335, AATCTAG at 1170, AAAGTGG at 1023, GAAACAG at 948, AAAACAG at 942, AATCCGG at 797, GAAACGG at 581, GATGTGG at 528, AGTCCGA at 523, GATTCAG at 490, GGAGTAA at 385, AAAATGA at 371, AATCCAG at 352, AGATTAG at 338, AGTATAA at 239, AGTGTAA at 195, AAAATGG at 187, AAAGTGA at 147, GATTCGG at 137, GATTTAG at 130, GGTTCAA at 106.
  8. Inrr4ci: AGAGTGA at 4003, GAACCAA at 3966, AAACCAA at 3908, AGTGCGG at 3828, GGTACAA at 3775, GGAACAA at 3710, AGACTGG at 3620, AGTCCAG at 3600, GGTTTAG at 3471, AGTTCAA at 3440, AAACTAG at 3418, GAATCAG at 3348, AGTGCAG at 3336, GGTATGG at 3310, AAAGTGA at 3177, GAACCAA at 3172, GGTTCAA at 3150, GATCTAA at 3135, GGAGCAG at 3115, AGTATAG at 3009, GATGCAG at 2944, AAAGCAG at 2877, AATCCGG at 2819, GGAGTGA at 2777, GGTGTGG at 2761, GGAATGG at 2702, AAAGCGG at 2613, AAAATGG at 2517, GAATCGG at 2506, AGTGTAA at 2487, GGTGCGG at 2466, AGTTTAA at 2400, GATACGA at 2230, GGACTAA at 2056, GAAGCAA at 1853, GGTATGG at 1830, GATGTGG at 1824, GGTCCGG at 1815, AGAGCAA at 1806, GGACCAA at 1760, AGAATAA at 1695, GGACTAG at 1655, AAAGTAA at 1644, AGACCAG at 1575, GGTGCGG at 1509, GGTTTGG at 1339, AGTGCAA at 1332, AATGCAG at 1327, AATGTGA at 1317, GAATCGG at 1275, AATGTAG at 1103, AAAATGG at 1075, GGTCTAG at 982, AGATTAA at 947, AATCCGA at 894, GGAGCGG at 883, AAAGTGA at 737, GAAGTGG at 683, GGTCCGA at 678, GGTTCGG at 572, GAAACAA at 531, GATACGG at 513, GAAGCAA at 490, AAATTAA at 480, AATTCAA at 442, AATCCGA at 382, AATGTGG at 287, AGTCCAA at 228, AAAGCGG at 217, AGAATAG at 152, GGTGCAG at 147, GGTCCAA at 132, AATTTAG at 122, AAACCAG at 57.
  9. Inrr6ci: GATGCGA at 4036, GAAGTGG at 3973, GGTTTGA at 3934, AAAATGA at 3699, GGATTAA at 3611, AATTCAG at 3530, AAAGCGG at 3425, GGATTAA at 3393, AATCCGG at 3339, GATACGA at 3219, GAAATGA at 3190, AATTTGG at 2976, GGTGTGA at 2948, GATCTGA at 2939, GAATTAA at 2890, AAAGCAA at 2865, GGAGCGA at 2823, AATTTAG at 2764, AGTCCAG at 2635, AATATAG at 2608, GGTATGG at 2557, GATTCAA at 2542, AGAGCAA at 2507, GAATTGG at 2495, GAACCGG at 2427, GGTTCGG at 2411, AATATAA at 2239, GGATCGG at 2215, AAAGTAA at 2184, AGTTCGA at 2164, AAAGTGA at 2135, GGTTCGG at 1949, GATACAA at 1915, GGTATGG at 1853, GGTTTAG at 1822, GGTACAA at 1806, AGTTTAA at 1633, GGTGCGG at 1554, AGAACGA at 1524, GGTTCGA at 1430, GGAATAG at 1364, GAATTGG at 1326, AGACTAA at 1205, GAAGTGG at 1195, AATCCGG at 1189, GAAACAG at 1100, AGTTTGA at 1095, GGTATAA at 1063, GGTCTGG at 934, AGTTTGA at 877, AGTCTGA at 805, AGAGTAG at 800, AAAATGA at 749, AATCTGA at 730, AGTACAA at 714, AGAACAG at 647, AGTCCGG at 492, GGTTTAG at 449, GGTGTAA at 413, GATTCAA at 284, GGTTCGA at 215, GATACGG at 153, GGTTTAA at 139, GGTGTGA at 101, GATTTGG at 96, AAAATGG at 73, GAAACGG at 10.
  10. Inrr8ci: AATTCAA at 3998, GATATAA at 3982, AGTTTGG at 3958, AATTCGG at 3880, GGACCGG at 3728, GATCTGA at 3705, AGTACGG at 3688, GGACCGG at 3649, GGACCAG at 3618, GGAACGG at 3529, AAATTGA at 3500, AATTTGG at 3490, GGATTAG at 3483, AGTTTGA at 3398, GGTCTAA at 3364, AATTCGG at 3356, GATGCGG at 3285, GGTACGG at 3278, GAATCGA at 3205, GGAACAA at 3168, AAAATGG at 3106, AAAGTAA at 3099, GAAGTAA at 2924, AAAACGG at 2702, AGAATGA at 2649, AATTCGA at 2623, GGAGTGG at 2586, GGAATGA at 2537, GGACCAA at 2493, GGATTAA at 2434, GAACCGA at 2260, GATTTAG at 2245, GAAACAA at 2130, AAAACAA at 2103, GGTCCGG at 2077, GAATTGG at 1977, AATCCGA at 1941, GGTTTAG at 1934, AGTTTAA at 1926, GGAATAA at 1840, GGTCTAA at 1519, AATGTAA at 1430, AATTCGA at 1424, GGTCCAA at 1419, AGTTCAA at 1373, GAACTAA at 1344, GGAGCGG at 1330, AGAATGA at 1296, GAAGCGG at 1190, GAACCGA at 1071, AAAACGA at 1037, GATCCGA at 969, GAACTGA at 887, AGATCAG at 866, GAACCGA at 844, GGTACGG at 793, AATTTAG at 730, AAACTAA at 725, AAAGCGG at 712, GGACCAG at 691, GGTGTGA at 537, GATGTGG at 524, AGTGTGA at 440, GGTTTAA at 414, AATTCGG at 380, AGTATAG at 333, GGTGCGG at 295, AATTCAG at 278, GATGCAA at 174, GGAATGG at 158.

YYRNWYY (Juven-Gershon June 2008) analysis and results

The wider consensus sequence of YYRNWYY allows a G at the TSS but at most only allows two Gs in a row.[37]

Reals or randoms Promoters direction Numbers Strands Occurrences Averages (± 0.1)
Reals UTR negative 97 2 48.5 48.5 ± 4.5 (--53,+-44)
Randoms UTR arbitrary negative 273 10 27.3 29.1
Randoms UTR alternate negative 309 10 30.9 29.1
Reals Core negative 1 2 0.5 0.5 (--0,+-1)
Randoms Core arbitrary negative 4 10 0.4 0.45
Randoms Core alternate negative 5 10 0.5 0.45
Reals Core positive 10 2 5 5 (-+4,++6)
Randoms Core arbitrary positive 27 10 2.7 3.1
Randoms Core alternate positive 35 10 3.5 3.1
Reals Proximal negative 14 2 7 7 ± 2 (--9,+-5)
Randoms Proximal arbitrary negative 31 10 3.1 3.15
Randoms Proximal alternate negative 32 10 3.2 3.15
Reals Proximal positive 15 2 7.5 7.5 ± 0.5 (-+7,++8)
Randoms Proximal arbitrary positive 47 10 4.7 3.75
Randoms Proximal alternate positive 28 10 2.8 3.75
Reals Distal negative 175 2 87.5 87.5 ± 4.5 (--92,+-83)
Randoms Distal arbitrary negative 470 10 47.0 46.0
Randoms Distal alternate negative 450 10 45.0 46.0
Reals Distal positive 231 2 115.5 115.5 ± 20.5 (-+95,++136)
Randoms Distal arbitrary positive 712 10 71.2 70.65
Randoms Distal alternate positive 701 10 70.1 70.65

Comparison:

The occurrences of real YYRNWYY Inr UTRs, cores, proximals and distals are greater than the randoms. This suggests that the real YYRNWYY Inrs are likely active or activable.

BBCABW (Ngoc 20 January 2017) samplings

For the Basic programs (starting with SuccessablesInr2.bas) written to compare nucleotide sequences with the sequences on either the template strand (-), or coding strand (+), of the DNA, in the negative direction (-), or the positive direction (+), the programs are, are looking for, and found:

  1. Negative strand, negative direction: 44, GTCACA at 4359, CCCACT at 4353, GTCACT at 4319, TCCAGT at 4307, GTCACT at 4200, TTCACA at 3939, CTCATT at 3891, CTCATA at 3829, TCCACT at 3825, GTCATT at 3480, TTCACT at 3410, CCCACA at 3184, TCCACT at 3144, TTCACA at 2860, GTCACT at 2739, GTCACA at 2656, GTCACA at 2603, TCCAGT at 2585, CTCACT at 2447, GTCACT at 2404, TCCAGT at 2248, GTCACA at 2085, GTCACT at 1978, TGCAGA at 1774, GGCAGT at 1511, CTCAGA at 1444, GTCAGA at 1354, GTCACT at 1325, GGCACA at 1220, CTCACT at 1077, CCCACT at 1049, GTCACT at 1034, GCCACT at 868, GGCAGA at 754, TCCAGT at 712, TCCAGT at 576, TCCAGT at 568, TGCATT at 533, TCCAGT at 439, TTCACA at 322, GTCACT at 299, CTCAGA at 278, CCCAGT at 206, TCCATA at 179.
  2. Positive strand, negative direction: 59, TTCACA at 4531, CCCACT at 4485, TCCACT at 4459, CCCAGA at 4448, TCCACT at 4423, GCCAGT at 4415, TGCACT at 4340, TGCAGT at 4317, GCCAGA at 4233, CTCACA at 3965, CCCATA at 3856, TCCACA at 3692, GCCATT at 3686, CTCAGA at 3644, GGCACA at 3632, GTCAGA at 3625, GGCAGT at 3600, GGCAGA at 3589, GGCAGT at 3478, GGCATA at 3451, GGCATA at 3445, TGCAGA at 3431, TGCACT at 3289, GCCATT at 3284, GCCACT at 2756, TGCAGT at 2737, GGCACA at 2665, GCCAGT at 2654, TCCACT at 2632, TGCACT at 2426, TGCAGT at 2402, GCCAGT at 2211, TGCAGT at 2083, TGCACT at 2000, GCCACT at 1995, TGCAGT at 1976, GGCAGA at 1967, TGCACA at 1719, TCCAGT at 1532, CCCAGA at 1518, CTCACT at 1491, TGCAGT at 1472, CCCAGA at 1411, TCCATT at 1378, TCCAGT at 1352, TGCACT at 1347, TGCAGT at 1323, GGCAGA at 1314, CTCACA at 1126, GGCACA at 1116, TTCACT at 1056, TGCAGT at 1032, GGCAGA at 1023, GGCACA at 960, GGCACA at 518, GGCACA at 266, GTCACT at 208, TGCATT at 152, GCCATA at 39.
  3. Negative strand, positive direction: 87, CTCACT at 4338, CCCAGA at 4330, TGCAGA at 4317, CTCATT at 4309, GTCAGT at 4271, CTCAGA at 4195, TCCACT at 4013, GGCACT at 4006, TGCAGT at 3962, GTCACA at 3954, CGCAGA at 3916, TCCAGA at 3891, TGCAGA at 3831, GTCACA at 3822, TCCAGA at 3806, GCCACA at 3705, CTCACA at 3505, GGCAGA at 3473, TGCAGT at 3461, GGCACA at 3409, CCCACT at 3388, CCCAGT at 3379, TGCACT at 3343, CTCACT at 3317, TGCAGT at 3281, TGCAGT at 3232, GCCAGA at 3221, CTCACA at 3209, TCCACA at 3192, CCCAGA at 3091, CCCAGT at 3082, TGCATT at 3072, TGCACA at 2962, TTCAGT at 2936, GTCACT at 2929, CTCATT at 2902, CTCAGA at 2866, TGCAGA at 2859, CTCAGA at 2729, TGCAGA at 2721, CTCAGA at 2699, GTCAGA at 2609, GTCACT at 2425, TGCAGT at 2328, TTCACT at 2304, CTCAGA at 2239, GTCAGA at 2222, TGCATT at 2206, CTCATA at 2176, TTCAGT at 2098, GCCACT at 2072, TGCAGT at 2065, CTCAGT at 2060, TCCACA at 2029, CCCAGT at 2024, GGCACT at 1996, TGCAGA at 1937, TCCACT at 1912, TGCACA at 1822, CCCAGA at 1742, GGCATT at 1702, CGCACA at 1556, CCCACT at 1502, TGCACT at 1472, CGCAGA at 1416, TGCACT at 1372, CGCAGA at 1316, CCCAGT at 1250, TGCACA at 1220, CGCACA at 1136, CGCACA at 1052, GCCACA at 984, GCCAGA at 935, GCCACA at 884, GCCAGA at 835, CGCACA at 800, CGCACT at 686, TCCACA at 632, TGCACA at 548, CCCAGA at 468, TGCAGA at 438, CGCAGA at 396, GCCACA at 343, CCCAGA at 204, GTCACA at 155, GGCATT at 22, TCCAGA at 15.
  4. Positive strand, positive direction: 40, CCCAGA at 4414, CCCACT at 4399, CTCACT at 4350, TCCAGT at 4269, CGCAGA at 4056, GTCACA at 3964, TCCACT at 3934, TTCAGA at 3922, CTCACT at 3876, GTCACT at 3843, CCCAGT at 3820, TCCAGA at 3771, TCCATT at 3731, CTCACT at 3712, GCCAGA at 3608, CTCACA at 3592, TGCAGA at 3256, CTCAGA at 3187, TCCAGA at 3019, TCCATA at 2642, TTCAGT at 2618, CTCAGT at 2613, GTCAGT at 2607, CGCACT at 2555, TTCACT at 2511, CCCAGA at 2489, GTCACA at 2464, CGCAGT at 2423, TCCACT at 2375, TCCAGA at 2258, TCCAGT at 2220, TCCACT at 2128, GTCAGT at 2100, TCCACA at 1969, CCCAGA at 1958, CCCACA at 1803, CGCACT at 1720, CCCAGA at 1711, CGCACA at 1020, TCCAGT at 153.
  5. inverse complement, negative strand, negative direction: 46, TCTGGG at 4366, TGTGAC at 4336, TCTGCA at 4236, TCTGGG at 4205, AGTGAA at 4161, TCTGAG at 4054, AGTGAA at 4010, TGTGAA at 3983, TGTGGA at 3968, TATGGA at 3859, TATGCG at 3547, TATGAC at 3541, TCTGAC at 3425, AGTGCG at 3280, AGTGAA at 3240, AGTGAA at 3101, AGTGGG at 3057, TATGGA at 2994, ACTGAG at 2787, AGTGAA at 2578, TGTGAA at 2551, AGTGCG at 2207, ACTGGC at 2190, TATGAC at 2162, TCTGAG at 2026, AGTGCG at 1991, TCTGAC at 1934, AGTGCA at 1772, TCTGAA at 1617, TGTGAA at 1544, AGTGAC at 1492, TCTGAG at 1403, AATGAA at 1298, AGTGGA at 1171, TGTGGA at 1129, TCTGAG at 1082, AGTGAG at 1057, ACTGAA at 1052, TGTGCG at 963, TCTGAG at 916, TGTGGG at 749, AGTGCG at 663, TGTGCA at 531, TGTGCA at 342, TGTGGA at 62, TCTGAC at 16.
  6. inverse complement, positive strand, negative direction: 54, AATGAG at 4556, TGTGAG at 4362, ACTGCA at 4338, ACTGCA at 4330, AGTGAG at 4320, ACTGCA at 4315, AGTGAG at 4201, TGTGAG at 4093, AGTGAG at 4050, TGTGGC at 3960, ACTGCC at 3852, TCTGGA at 3836, AATGCA at 3771, ACTGGG at 3750, TGTGGG at 3712, AATGGG at 3660, TGTGCC at 3561, TGTGCA at 3429, AGTGAC at 3411, TGTGAG at 3268, AATGGC at 3005, TGTGCA at 2863, ACTGCA at 2759, AGTGAG at 2740, TGTGGC at 2606, AGTGAG at 2448, ACTGCA at 2424, AGTGAG at 2405, AATGAC at 2188, TGTGGC at 2066, ACTGCA at 1998, AGTGAG at 1979, AATGGC at 1949, ACTGAG at 1936, TATGGC at 1743, AATGCC at 1634, AATGAA at 1581, AGTGCA at 1535, ACTGCA at 1494, AGTGCA at 1470, TCTGGG at 1357, AGTGAG at 1326, AGTGGC at 1121, AGTGAG at 1078, AGTGAG at 1035, AGTGGA at 523, AGTGAA at 472, AGTGCG at 447, ACTGAC at 308, AGTGAG at 300, TATGAG at 275, ACTGAA at 131, TATGGG at 78, ACTGAA at 18.
  7. inverse complement, negative strand, positive direction: 94, TCTGGG at 4417, TGTGGG at 4395, AGTGAG at 4351, ACTGCA at 4341, AGTGCC at 4274, AATGAG at 4095, ACTGAA at 4090, AGTGGG at 4041, TGTGAC at 3972, TGTGCA at 3960, TCTGAA at 3925, TGTGAG at 3904, AGTGAG at 3877, AATGAA at 3836, AATGAC at 3783, ACTGAG at 3736, AGTGAC at 3713, TGTGAA at 3595, AATGAC at 3568, AGTGCA at 3464, AGTGGG at 3450, AATGAG at 3446, AATGAA at 3442, TGTGGA at 3437, AATGCC at 3431, TCTGGC at 3406, TCTGCC at 3359, ACTGGC at 3346, ACTGCA at 3320, TCTGCA at 3279, TCTGCA at 3268, TATGAG at 3261, AGTGCC at 3235, AATGGG at 3169, TATGGA at 3163, TCTGAG at 3124, ACTGGC at 3118, AATGCA at 3070, TCTGCA at 3061, TATGAC at 3028, AGTGCC at 3011, TCTGAG at 3007, TCTGGC at 2984, TGTGCA at 2960, TCTGAG at 2951, TCTGAC at 2944, AATGGG at 2911, TCTGGC at 2884, TCTGCA at 2857, AATGAC at 2842, ACTGCC at 2823, AGTGGA at 2712, TGTGCA at 2681, AGTGCA at 2326, ACTGCA at 2204, TATGGC at 2160, AGTGGC at 2068, AGTGCA at 2063, TCTGGC at 1993, TGTGGC at 1972, ACTGGG at 1954, TCTGGG at 1865, TGTGGA at 1806, AGTGCA at 1786, AGTGCG at 1725, AGTGCG at 1589, ACTGCA at 1505, TCTGCG at 1496, TCTGGC at 1477, AATGCG at 1421, TCTGCG at 1396, TCTGGC at 1377, AATGCG at 1321, ACTGAG at 1287, AGTGCG at 1253, AGTGCG at 1169, AGTGCG at 1160, AGTGCG at 1085, TGTGGC at 1023, ACTGCG at 1001, TGTGGC at 919, ACTGCC at 901, TGTGGC at 819, ACTGCG at 749, AGTGCG at 665, AGTGCG at 581, AGTGCG at 497, ACTGGG at 348, TCTGGA at 271, TCTGAG at 256, ACTGCC at 238, TGTGAA at 231, TCTGCA at 224, AGTGGG at 54.
  8. inverse complement, positive strand, positive direction: 47, AGTGAC at 4339, TGTGAG at 4335, AGTGGG at 4326, TCTGCG at 4320, TGTGCC at 4259, ACTGGG at 4217, AGTGGG at 4204, AGTGAG at 4127, AGTGAC at 4088, ACTGGA at 4019, ACTGGA at 3785, AGTGCC at 3748, AGTGGG at 3613, TCTGGA at 3551, TGTGGG at 3533, TGTGAG at 3508, AGTGAC at 3318, AGTGCA at 3254, ACTGAA at 3030, TGTGGG at 2965, ACTGAA at 2946, AGTGAC at 2930, TCTGGA at 2862, TATGAA at 2740, TGTGGA at 2431, TCTGAA at 2417, AGTGAC at 2341, AGTGGG at 2313, AGTGAG at 2305, AGTGGA at 2248, ACTGGC at 2214, AATGGG at 1889, TCTGAA at 1745, TGTGCC at 1698, ACTGGG at 1663, TGTGCC at 1559, TGTGCC at 1223, TGTGAC at 1139, TGTGCG at 987, TGTGCG at 887, TGTGCG at 803, TGTGCA at 569, AATGAA at 525, TCTGGC at 441, TCTGCC at 399, TGTGAC at 346, TCTGAC at 236.

BBC (4560-2846) UTRs

Negative strand BBC UTR

  1. Negative strand, negative direction: GTCACA at 4359, CCCACT at 4353, GTCACT at 4319, TCCAGT at 4307, GTCACT at 4200, TTCACA at 3939, CTCATT at 3891, CTCATA at 3829, TCCACT at 3825, GTCATT at 3480, TTCACT at 3410, CCCACA at 3184, TCCACT at 3144, TTCACA at 2860.
  2. Negative strand, negative direction: TCTGGG at 4366, TGTGAC at 4336, TCTGCA at 4236, TCTGGG at 4205, AGTGAA at 4161, TCTGAG at 4054, AGTGAA at 4010, TGTGAA at 3983, TGTGGA at 3968, TATGGA at 3859, TATGCG at 3547, TATGAC at 3541, TCTGAC at 3425, AGTGCG at 3280, AGTGAA at 3240, AGTGAA at 3101, AGTGGG at 3057, TATGGA at 2994.

Positive strand BBC UTR

  1. Positive strand, negative direction: TTCACA at 4531, CCCACT at 4485, TCCACT at 4459, CCCAGA at 4448, TCCACT at 4423, GCCAGT at 4415, TGCACT at 4340, TGCAGT at 4317, GCCAGA at 4233, CTCACA at 3965, CCCATA at 3856, TCCACA at 3692, GCCATT at 3686, CTCAGA at 3644, GGCACA at 3632, GTCAGA at 3625, GGCAGT at 3600, GGCAGA at 3589, GGCAGT at 3478, GGCATA at 3451, GGCATA at 3445, TGCAGA at 3431, TGCACT at 3289, GCCATT at 3284.
  2. Positive strand, negative direction: AATGAG at 4556, TGTGAG at 4362, ACTGCA at 4338, ACTGCA at 4330, AGTGAG at 4320, ACTGCA at 4315, AGTGAG at 4201, TGTGAG at 4093, AGTGAG at 4050, TGTGGC at 3960, ACTGCC at 3852, TCTGGA at 3836, AATGCA at 3771, ACTGGG at 3750, TGTGGG at 3712, AATGGG at 3660, TGTGCC at 3561, TGTGCA at 3429, AGTGAC at 3411, TGTGAG at 3268, AATGGC at 3005, TGTGCA at 2863.

BBC positive direction (4445-4265) core promoters

  1. Negative strand, positive direction: CTCACT at 4338, CCCAGA at 4330, TGCAGA at 4317, CTCATT at 4309, GTCAGT at 4271.
  2. Negative strand, positive direction: TCTGGG at 4417, TGTGGG at 4395, AGTGAG at 4351, ACTGCA at 4341, AGTGCC at 4274.
  3. Positive strand, positive direction: CCCAGA at 4414, CCCACT at 4399, CTCACT at 4350, TCCAGT at 4269.
  4. Positive strand, positive direction: AGTGAC at 4339, TGTGAG at 4335, AGTGGG at 4326, TCTGCG at 4320.

BBC negative direction (2811-2596) proximal promoters

  1. Negative strand, negative direction: GTCACT at 2739, GTCACA at 2656, GTCACA at 2603.
  2. Negative strand, negative direction: ACTGAG at 2787.
  3. Positive strand, negative direction: GCCACT at 2756, TGCAGT at 2737, GGCACA at 2665, GCCAGT at 2654, TCCACT at 2632.
  4. Positive strand, negative direction: ACTGCA at 2759, AGTGAG at 2740, TGTGGC at 2606.

BBC positive direction (4265-4050) proximal promoters

  1. Negative strand, positive direction: CTCAGA at 4195.
  2. Negative strand, positive direction: AATGAG at 4095, ACTGAA at 4090.
  3. Positive strand, positive direction: CGCAGA at 4056.
  4. Positive strand, positive direction: TGTGCC at 4259, ACTGGG at 4217, AGTGGG at 4204, AGTGAG at 4127, AGTGAC at 4088.

BBC negative direction (2596-1) distal promoters

Negative strand BBC distals

  1. Negative strand, negative direction: TCCAGT at 2585, CTCACT at 2447, GTCACT at 2404, TCCAGT at 2248, GTCACA at 2085, GTCACT at 1978, TGCAGA at 1774, GGCAGT at 1511, CTCAGA at 1444, GTCAGA at 1354, GTCACT at 1325, GGCACA at 1220, CTCACT at 1077, CCCACT at 1049, GTCACT at 1034, GCCACT at 868, GGCAGA at 754, TCCAGT at 712, TCCAGT at 576, TCCAGT at 568, TGCATT at 533, TCCAGT at 439, TTCACA at 322, GTCACT at 299, CTCAGA at 278, CCCAGT at 206, TCCATA at 179.
  2. Negative strand, negative direction: AGTGAA at 2578, TGTGAA at 2551, AGTGCG at 2207, ACTGGC at 2190, TATGAC at 2162, TCTGAG at 2026, AGTGCG at 1991, TCTGAC at 1934, AGTGCA at 1772, TCTGAA at 1617, TGTGAA at 1544, AGTGAC at 1492, TCTGAG at 1403, AATGAA at 1298, AGTGGA at 1171, TGTGGA at 1129, TCTGAG at 1082, AGTGAG at 1057, ACTGAA at 1052, TGTGCG at 963, TCTGAG at 916, TGTGGG at 749, AGTGCG at 663, TGTGCA at 531, TGTGCA at 342, TGTGGA at 62, TCTGAC at 16.

Positive strand BBC distals

  1. Positive strand, negative direction: TGCACT at 2426, TGCAGT at 2402, GCCAGT at 2211, TGCAGT at 2083, TGCACT at 2000, GCCACT at 1995, TGCAGT at 1976, GGCAGA at 1967, TGCACA at 1719, TCCAGT at 1532, CCCAGA at 1518, CTCACT at 1491, TGCAGT at 1472, CCCAGA at 1411, TCCATT at 1378, TCCAGT at 1352, TGCACT at 1347, TGCAGT at 1323, GGCAGA at 1314, CTCACA at 1126, GGCACA at 1116, TTCACT at 1056, TGCAGT at 1032, GGCAGA at 1023, GGCACA at 960, GGCACA at 518, GGCACA at 266, GTCACT at 208, TGCATT at 152, GCCATA at 39.
  2. Positive strand, negative direction: AGTGAG at 2448, ACTGCA at 2424, AGTGAG at 2405, AATGAC at 2188, TGTGGC at 2066, ACTGCA at 1998, AGTGAG at 1979, AATGGC at 1949, ACTGAG at 1936, TATGGC at 1743, AATGCC at 1634, AATGAA at 1581, AGTGCA at 1535, ACTGCA at 1494, AGTGCA at 1470, TCTGGG at 1357, AGTGAG at 1326, AGTGGC at 1121, AGTGAG at 1078, AGTGAG at 1035, AGTGGA at 523, AGTGAA at 472, AGTGCG at 447, ACTGAC at 308, AGTGAG at 300, TATGAG at 275, ACTGAA at 131, TATGGG at 78, ACTGAA at 18.

BBC positive direction (4050-1) distal promoters

Negative strand BBC distals positive direction

  1. Negative strand, positive direction: TCCACT at 4013, GGCACT at 4006, TGCAGT at 3962, GTCACA at 3954, CGCAGA at 3916, TCCAGA at 3891, TGCAGA at 3831, GTCACA at 3822, TCCAGA at 3806, GCCACA at 3705, CTCACA at 3505, GGCAGA at 3473, TGCAGT at 3461, GGCACA at 3409, CCCACT at 3388, CCCAGT at 3379, TGCACT at 3343, CTCACT at 3317, TGCAGT at 3281, TGCAGT at 3232, GCCAGA at 3221, CTCACA at 3209, TCCACA at 3192, CCCAGA at 3091, CCCAGT at 3082, TGCATT at 3072, TGCACA at 2962, TTCAGT at 2936, GTCACT at 2929, CTCATT at 2902, CTCAGA at 2866, TGCAGA at 2859, CTCAGA at 2729, TGCAGA at 2721, CTCAGA at 2699, GTCAGA at 2609, GTCACT at 2425, TGCAGT at 2328, TTCACT at 2304, CTCAGA at 2239, GTCAGA at 2222, TGCATT at 2206, CTCATA at 2176, TTCAGT at 2098, GCCACT at 2072, TGCAGT at 2065, CTCAGT at 2060, TCCACA at 2029, CCCAGT at 2024, GGCACT at 1996, TGCAGA at 1937, TCCACT at 1912, TGCACA at 1822, CCCAGA at 1742, GGCATT at 1702, CGCACA at 1556, CCCACT at 1502, TGCACT at 1472, CGCAGA at 1416, TGCACT at 1372, CGCAGA at 1316, CCCAGT at 1250, TGCACA at 1220, CGCACA at 1136, CGCACA at 1052, GCCACA at 984, GCCAGA at 935, GCCACA at 884, GCCAGA at 835, CGCACA at 800, CGCACT at 686, TCCACA at 632, TGCACA at 548, CCCAGA at 468, TGCAGA at 438, CGCAGA at 396, GCCACA at 343, CCCAGA at 204, GTCACA at 155, GGCATT at 22, TCCAGA at 15.
  2. Negative strand, positive direction: AGTGGG at 4041, TGTGAC at 3972, TGTGCA at 3960, TCTGAA at 3925, TGTGAG at 3904, AGTGAG at 3877, AATGAA at 3836, AATGAC at 3783, ACTGAG at 3736, AGTGAC at 3713, TGTGAA at 3595, AATGAC at 3568, AGTGCA at 3464, AGTGGG at 3450, AATGAG at 3446, AATGAA at 3442, TGTGGA at 3437, AATGCC at 3431, TCTGGC at 3406, TCTGCC at 3359, ACTGGC at 3346, ACTGCA at 3320, TCTGCA at 3279, TCTGCA at 3268, TATGAG at 3261, AGTGCC at 3235, AATGGG at 3169, TATGGA at 3163, TCTGAG at 3124, ACTGGC at 3118, AATGCA at 3070, TCTGCA at 3061, TATGAC at 3028, AGTGCC at 3011, TCTGAG at 3007, TCTGGC at 2984, TGTGCA at 2960, TCTGAG at 2951, TCTGAC at 2944, AATGGG at 2911, TCTGGC at 2884, TCTGCA at 2857, AATGAC at 2842, ACTGCC at 2823, AGTGGA at 2712, TGTGCA at 2681, AGTGCA at 2326, ACTGCA at 2204, TATGGC at 2160, AGTGGC at 2068, AGTGCA at 2063, TCTGGC at 1993, TGTGGC at 1972, ACTGGG at 1954, TCTGGG at 1865, TGTGGA at 1806, AGTGCA at 1786, AGTGCG at 1725, AGTGCG at 1589, ACTGCA at 1505, TCTGCG at 1496, TCTGGC at 1477, AATGCG at 1421, TCTGCG at 1396, TCTGGC at 1377, AATGCG at 1321, ACTGAG at 1287, AGTGCG at 1253, AGTGCG at 1169, AGTGCG at 1160, AGTGCG at 1085, TGTGGC at 1023, ACTGCG at 1001, TGTGGC at 919, ACTGCC at 901, TGTGGC at 819, ACTGCG at 749, AGTGCG at 665, AGTGCG at 581, AGTGCG at 497, ACTGGG at 348, TCTGGA at 271, TCTGAG at 256, ACTGCC at 238, TGTGAA at 231, TCTGCA at 224, AGTGGG at 54.

Positive strand BBC distals positive direction

  1. Positive strand, positive direction: GTCACA at 3964, TCCACT at 3934, TTCAGA at 3922, CTCACT at 3876, GTCACT at 3843, CCCAGT at 3820, TCCAGA at 3771, TCCATT at 3731, CTCACT at 3712, GCCAGA at 3608, CTCACA at 3592, TGCAGA at 3256, CTCAGA at 3187, TCCAGA at 3019, TCCATA at 2642, TTCAGT at 2618, CTCAGT at 2613, GTCAGT at 2607, CGCACT at 2555, TTCACT at 2511, CCCAGA at 2489, GTCACA at 2464, CGCAGT at 2423, TCCACT at 2375, TCCAGA at 2258, TCCAGT at 2220, TCCACT at 2128, GTCAGT at 2100, TCCACA at 1969, CCCAGA at 1958, CCCACA at 1803, CGCACT at 1720, CCCAGA at 1711, CGCACA at 1020, TCCAGT at 153.
  2. Positive strand, positive direction: ACTGGA at 4019, ACTGGA at 3785, AGTGCC at 3748, AGTGGG at 3613, TCTGGA at 3551, TGTGGG at 3533, TGTGAG at 3508, AGTGAC at 3318, AGTGCA at 3254, ACTGAA at 3030, TGTGGG at 2965, ACTGAA at 2946, AGTGAC at 2930, TCTGGA at 2862, TATGAA at 2740, TGTGGA at 2431, TCTGAA at 2417, AGTGAC at 2341, AGTGGG at 2313, AGTGAG at 2305, AGTGGA at 2248, ACTGGC at 2214, AATGGG at 1889, TCTGAA at 1745, TGTGCC at 1698, ACTGGG at 1663, TGTGCC at 1559, TGTGCC at 1223, TGTGAC at 1139, TGTGCG at 987, TGTGCG at 887, TGTGCG at 803, TGTGCA at 569, AATGAA at 525, TCTGGC at 441, TCTGCC at 399, TGTGAC at 346, TCTGAC at 236.

BBCABW (Ngoc 20 January 2017) random dataset samplings

  1. Inr2r0: 55, TCCAGA at 4545, CCCATA at 4384, TGCACT at 4356, GCCACT at 4254, TTCATT at 4098, GTCATA at 4056, GCCATT at 4001, GTCATA at 3961, GCCATA at 3796, CCCATT at 3775, CGCACT at 3751, TGCAGA at 3554, GGCATA at 3458, GGCATA at 3447, GCCACT at 3410, CCCACA at 3394, CTCATT at 3388, GCCATA at 3303, TCCAGT at 3286, GTCACT at 3190, GGCATT at 3048, CCCATT at 2922, TGCAGA at 2903, CTCATA at 2849, CTCACT at 2587, TCCATA at 2519, GGCACT at 2473, CGCACA at 2451, GGCAGT at 2445, GCCATT at 2389, GCCACA at 2162, TTCATA at 2149, CTCAGT at 2137, CCCATA at 2039, GCCAGA at 1947, TCCAGA at 1867, TGCACA at 1785, TTCAGA at 1772, CGCAGA at 1614, GGCACA at 1606, TCCAGA at 1476, CCCAGT at 1371, TTCACT at 1361, TGCAGT at 1340, CGCATT at 1265, GCCATT at 1184, TCCACA at 1163, TTCACA at 887, GTCAGA at 847, TCCATA at 539, GGCAGT at 528, CGCATT at 481, TTCATA at 313, CCCAGA at 126, CCCATA at 39.
  2. Inr2r1: 47, GGCACA at 4523, CTCATT at 4265, CTCACT at 4250, CCCACA at 4235, GGCACA at 4138, TTCAGA at 4128, CGCACT at 4117, GGCATT at 4049, TTCACT at 3898, CCCAGA at 3870, GCCAGT at 3852, GGCACT at 3677, TGCACT at 3596, CTCATT at 3553, GCCATT at 3407, GGCACT at 3205, CCCACT at 3055, CTCACA at 2696, TGCATA at 2367, CCCAGT at 2358, TCCATT at 2293, GTCATT at 2239, TTCACT at 2200, CCCAGT at 2149, TTCAGT at 2065, TTCATA at 1993, GCCATT at 1908, TTCAGA at 1796, GTCAGA at 1752, CTCAGT at 1650, TGCATA at 1603, CTCAGA at 1533, CGCATA at 1356, GGCACT at 1327, CCCATT at 1316, TTCACT at 1213, TGCATT at 1142, TGCATT at 1025, TCCACA at 985, GCCACT at 917, TTCATT at 772, GGCAGT at 480, CTCAGT at 427, GTCAGA at 359, CCCATT at 335, CCCACT at 239, TCCATA at 56.
  3. Inr2r2: 60, GGCAGA at 4549, GCCATA at 4542, CGCATA at 4528, GGCAGA at 4443, TCCACT at 4409, GTCACT at 4383, CCCACT at 4337, TTCACT at 4280, GTCATA at 4191, CCCAGT at 4138, TCCACA at 4066, TTCACT at 4050, CCCAGA at 3941, CGCAGT at 3847, CCCACA at 3836, TGCACA at 3732, CGCATT at 3483, TCCATT at 3379, GTCAGA at 3330, GGCATT at 3298, TGCAGT at 3243, CGCAGA at 2929, GTCAGA at 2901, CCCATT at 2893, TTCACA at 2829, CGCATT at 2752, CTCAGA at 2744, GTCACA at 2682, CCCATT at 2625, GCCATA at 2581, TGCAGA at 2540, TTCACA at 2421, CTCATT at 2241, TCCACT at 2148, GCCACT at 2098, TTCACT at 2088, TTCACT at 2052, GCCACA at 2026, CTCAGT at 1985, TTCACA at 1797, TTCACT at 1725, CCCATT at 1706, GCCATT at 1471, CGCATT at 1413, CTCACT at 1367, TCCACT at 1363, GGCACA at 1306, CGCACT at 1105, TTCACA at 1087, TCCATA at 972, GCCACT at 886, GTCATA at 866, GCCATA at 791, CGCATA at 777, GGCATA at 616, CCCAGA at 605, GTCAGA at 469, TCCAGA at 353, GGCAGA at 334, GGCATT at 114.
  4. Inr2r3: 54, CCCAGT at 4539, CCCACA at 4283, CTCATT at 4213, CTCATA at 4181, TGCAGT at 4130, TCCATT at 4025, CCCAGT at 3985, GTCACT at 3845, CCCATA at 3796, CGCATA at 3731, GGCACT at 3652, TCCAGT at 3568, CTCATT at 3470, CTCACT at 3196, CCCACT at 2996, GTCATA at 2856, CTCATT at 2840, CTCAGT at 2819, CGCACT at 2803, CCCATT at 2797, CTCAGA at 2771, GGCACA at 2697, TGCATA at 2611, GGCATT at 2537, TCCATT at 2503, TCCATT at 2351, GGCATT at 2346, TTCATT at 2325, GTCAGA at 2240, CCCAGA at 2179, CGCACT at 2168, CGCAGT at 2148, CTCAGA at 1998, GGCACT at 1882, TTCAGA at 1831, CGCATT at 1827, CCCATT at 1465, GCCATT at 1459, TCCAGA at 1386, CCCATT at 1335, CGCAGA at 1298, GCCACT at 1178, CTCAGT at 1156, CCCATT at 1065, TTCATT at 1046, TCCATT at 935, TTCATT at 923, TTCAGA at 688, GTCATT at 479, CGCAGA at 473, CCCACA at 434, GCCATT at 401, CCCAGA at 296, CTCAGA at 218.
  5. Inr2r4: 46, CCCATA at 4511, CCCAGT at 4417, CGCACA at 4394, CCCACA at 4198, TTCACT at 3946, TGCAGT at 3931, GCCACT at 3837, TGCATT at 3662, TCCAGT at 3601, GCCATT at 3574, TGCAGA at 3365, TGCAGA at 3337, CCCAGT at 3332, TGCACT at 3065, GCCATT at 3025, TGCAGT at 2945, CCCATT at 2916, CCCAGT at 2896, CCCAGT at 2748, GCCACT at 2684, CTCATT at 2567, CCCATT at 2545, CTCAGT at 2459, TTCATA at 2375, GCCAGA at 2334, GTCATT at 2250, TGCACA at 2197, CGCAGA at 2176, TCCACT at 2079, CCCATT at 1842, TCCACA at 1771, CCCACT at 1716, CTCATA at 1683, CTCATT at 1674, GCCATT at 1550, CTCATT at 1528, TGCAGT at 1418, TGCAGT at 1328, GCCATA at 1095, CCCATA at 920, CCCACA at 907, GCCATA at 463, TTCATA at 315, CCCATT at 308, CCCACA at 236, TGCAGA at 148.
  6. Inr2r5: 45, CCCACT at 4269, CCCATA at 4104, TTCACA at 4072, GTCATT at 4053, CTCAGA at 4028, GCCACT at 3913, CTCATT at 3808, TCCACT at 3699, TCCATT at 3624, TTCACA at 3430, CGCACA at 3416, TTCACA at 3343, CCCATA at 3211, CCCATT at 3146, TCCAGT at 3036, GCCATT at 2803, GGCAGT at 2795, TCCAGT at 2651, CCCATA at 2626, CCCATT at 2599, GGCAGA at 2593, CCCATA at 2515, CCCAGA at 2311, TTCAGT at 2192, CTCAGA at 1945, CCCATA at 1777, CCCAGT at 1693, TCCACT at 1671, CTCATA at 1653, TCCAGA at 1594, CGCATA at 1507, GGCAGT at 1497, GGCAGA at 1427, GTCAGT at 1356, TTCACA at 1282, GGCACT at 1219, GCCATT at 1202, TCCAGA at 1000, GGCACT at 970, TTCATA at 936, TCCAGA at 882, GGCAGA at 688, GCCACT at 439, GCCAGA at 339, GCCATT at 292.
  7. Inr2r6: 55, GCCACA at 4391, CGCAGA at 4308, CCCACT at 4179, TGCAGT at 4009, GCCATT at 3997, CTCACA at 3770, GGCATT at 3746, GCCATT at 3726, TCCATA at 3693, CCCACA at 3628, TCCATA at 3595, GTCACT at 3493, CGCAGT at 3327, CCCATT at 3231, TCCATT at 2911, TCCAGT at 2636, CCCATT at 2460, TCCACA at 2355, TCCATA at 2349, GGCAGA at 2323, CCCACA at 2312, GCCATT at 2292, GCCACT at 2260, GCCATA at 2203, CTCATT at 2177, GGCAGT at 2160, TCCACT at 2060, CGCACA at 1941, CCCACA at 1776, TCCACT at 1764, CTCAGT at 1697, CCCACA at 1670, TGCATT at 1660, GGCACT at 1582, GGCACT at 1568, CCCACA at 1297, GCCACT at 1248, CCCACT at 1181, CCCACA at 1166, TTCATA at 1122, GTCATT at 1111, TTCAGA at 1043, CGCATT at 982, GGCACA at 967, CCCAGA at 852, GTCATT at 840, TTCATT at 669, CCCATA at 624, TTCACA at 580, CCCAGT at 528, GGCATT at 468, CCCAGT at 391, TCCATT at 354, CGCAGT at 349, TTCATT at 146.
  8. Inr2r7: 56, TGCATA at 4290, CCCATA at 4220, TGCAGA at 4176, GGCATT at 4163, CGCAGA at 4144, TCCAGT at 4054, GTCACA at 3896, CGCACT at 3874, TGCAGT at 3770, CCCACA at 3641, CTCATT at 3620, GGCACT at 3501, TGCACT at 3479, CTCAGA at 3372, TCCACA at 3266, TGCATT at 3240, GCCACT at 3116, GGCACT at 3081, TCCAGA at 3051, GGCACT at 3027, CTCAGA at 2912, CCCAGT at 2864, TCCATA at 2847, CCCACA at 2680, CCCATA at 2671, TCCACT at 2647, CTCAGA at 2634, CCCAGA at 2622, GTCACT at 2587, GCCATT at 2444, GTCATA at 2326, GCCATT at 2270, TTCATT at 1892, TTCAGT at 1803, CTCAGT at 1750, TGCATA at 1683, TGCACT at 1486, CCCATT at 1340, GGCAGA at 1265, CCCACA at 1204, CCCATT at 1175, GGCACT at 1070, GTCATT at 1016, CCCACA at 814, TTCACT at 612, GCCAGA at 583, TCCATT at 569, GCCACT at 560, CGCAGT at 554, TCCATA at 547, CTCACT at 522, GTCATA at 204, CGCACA at 198, CGCATT at 168, TCCAGA at 125, TGCACT at 75.
  9. Inr2r8: 42, CGCAGT at 4460, GCCAGA at 4404, TCCACA at 4336, GGCATT at 4321, CGCAGA at 4257, TCCACT at 4157, GCCATT at 4021, GGCACA at 3962, CCCAGT at 3954, CTCAGA at 3819, GGCACT at 3739, GCCACA at 3711, GGCAGA at 3539, TTCATT at 3333, TGCATA at 3093, CGCATT at 2959, CGCAGT at 2911, GTCATT at 2854, TCCATT at 2809, TTCACT at 2796, GCCAGT at 2791, CCCACT at 2766, CCCAGA at 2724, GCCATT at 2461, GGCATT at 2426, TGCAGT at 2298, CCCACT at 2178, CCCAGA at 2005, CCCACA at 1903, GCCAGT at 1640, TCCACT at 1578, CCCAGA at 1382, TCCAGA at 1359, GGCATT at 1279, CCCACT at 1112, TTCATA at 875, CCCATA at 780, CGCACT at 763, GGCACT at 664, TGCACA at 658, TGCATT at 354, TTCAGA at 279.
  10. Inr2r9: 46, CCCAGA at 4483, GCCACT at 4099, GGCACA at 4050, TGCACA at 4031, TTCAGA at 4025, CCCAGT at 3947, CCCACT at 3829, GGCAGT at 3813, TGCATA at 3770, TCCATT at 3658, TTCACT at 3644, TCCATT at 3617, GGCAGT at 3601, GGCACT at 3539, CCCATT at 3475, CCCACA at 3449, CCCAGA at 3226, CGCATT at 3101, TGCACA at 2711, CTCATT at 2340, CCCATT at 2244, GGCATA at 2193, GTCAGA at 2058, GGCACA at 2021, GTCAGA at 1862, GCCACA at 1814, GTCATT at 1802, CCCACA at 1674, CGCATT at 1651, TGCACT at 1601, GCCATA at 1561, CCCACA at 1520, CGCATT at 1446, GGCATA at 1221, CTCACT at 1197, TGCATA at 1191, CCCAGT at 734, CGCACT at 693, TTCACT at 648, CCCATT at 616, TTCACA at 415, CTCAGA at 313, GGCACA at 302, GTCAGT at 108, CTCAGT at 104, CCCATT at 84.
  11. Inr2r0ci: 45, TGTGGC at 4536, AGTGGG at 4503, ACTGAA at 4359, TATGCC at 4291, ACTGCC at 4257, AATGGC at 4239, TGTGCA at 4108, TCTGCC at 4075, TGTGCG at 3933, TGTGCC at 3770, TGTGGC at 3671, AATGGA at 3582, ACTGGA at 3535, ACTGGA at 3497, ACTGGA at 3273, TCTGAC at 3269, TCTGGA at 2988, TGTGGC at 2844, TATGAC at 2729, AATGCC at 2159, TATGGA at 2153, TGTGGC at 2065, TGTGCC at 1944, TGTGCC at 1760, AGTGCC at 1681, AATGGA at 1580, AGTGCA at 1338, AGTGAA at 1329, ACTGGG at 1201, TGTGCC at 1061, TATGGA at 990, AATGGG at 936, ACTGAA at 900, AATGCC at 773, TGTGGA at 766, AGTGAA at 744, TCTGAG at 740, TGTGAC at 726, ACTGGG at 639, AGTGGG at 531, TATGGC at 486, TGTGAA at 463, TATGCC at 341, ACTGGC at 138, AGTGGG at 110.
  12. Inr2r1ci: 57, TATGAC at 4498, AGTGGG at 4360, AATGAA at 4338, TATGCC at 4212, TATGGG at 4166, AGTGGG at 4149, AATGGC at 3940, TATGAA at 3793, TCTGGC at 3705, ACTGGG at 3680, AGTGGG at 3673, AATGCA at 3594, AGTGCG at 3467, TGTGGC at 3403, AGTGAC at 3357, TCTGCA at 3336, AATGCC at 3306, TCTGCG at 3102, AATGAA at 3086, AGTGAA at 3069, AATGCC at 2917, TGTGAA at 2906, AATGCC at 2782, AGTGCC at 2635, TGTGCC at 2530, ACTGGG at 2485, AATGCA at 2365, AATGAA at 2271, AGTGGA at 2152, TATGGC at 2103, TGTGCC at 2027, TGTGCG at 1805, AATGCA at 1622, TATGCG at 1607, TGTGCG at 1585, AGTGGA at 1446, AATGCC at 1425, TCTGGA at 1339, TATGGC at 1283, AATGGA at 1257, TCTGCA at 1133, TATGGG at 1110, ACTGAA at 1038, TGTGGG at 873, TCTGAG at 834, TGTGGG at 699, TATGGA at 688, TGTGCA at 642, AATGAA at 457, TATGAG at 342, ACTGCG at 242, TATGCG at 171, ACTGAC at 153, TCTGGG at 62, TGTGGA at 46, TCTGGA at 27, TCTGCG at 10.
  13. Inr2r2ci: 56, TGTGGA at 4489, ACTGGG at 4429, TATGGG at 4388, AATGAG at 4358, ACTGGC at 4283, ACTGCC at 4053, ACTGGG at 4004, AATGAC at 3986, TGTGCC at 3896, AGTGCC at 3861, AATGCC at 3831, TCTGGG at 3765, AGTGGC at 3681, TCTGAC at 3579, TGTGCA at 3507, TCTGCC at 3346, TATGAA at 3142, TCTGCC at 3102, AATGAA at 3054, ACTGAG at 2837, AATGGG at 2733, TCTGCG at 2713, TATGCG at 2670, AGTGGG at 2606, ACTGCC at 2595, TCTGCA at 2538, AATGGA at 2505, TATGAA at 2476, TGTGGA at 2202, AGTGGG at 2141, ACTGGC at 2073, AGTGAG at 2060, ACTGCC at 2023, TGTGGG at 2001, TATGAC at 1919, TATGAG at 1759, ACTGGC at 1728, AATGCA at 1464, AATGGG at 1452, TCTGAC at 1341, ACTGCG at 913, ACTGGG at 839, TGTGCG at 728, TGTGCG at 704, TGTGGC at 529, TGTGGC at 460, AATGGC at 429, AATGAC at 372, TATGGG at 361, AATGGG at 284, AGTGAA at 280, TCTGCC at 205, AATGGG at 188, TGTGAG at 168, ACTGCG at 152, AGTGAC at 148.
  14. Inr2r3ci: 50, ACTGGC at 4534, TATGCG at 4490, TCTGGA at 4403, TATGAA at 4171, AGTGAG at 4133, TATGGG at 3893, AATGGA at 3882, TGTGGG at 3855, ACTGGC at 3649, TGTGAG at 3439, AATGGC at 3370, AATGAG at 3277, TCTGAG at 3253, AGTGGA at 3241, AATGCA at 3046, TGTGAA at 2970, AATGGA at 2938, AATGCG at 2706, AATGGC at 2694, TGTGAG at 2671, AGTGAC at 2636, TCTGCA at 2609, TGTGAC at 2448, AGTGAC at 2441, TCTGCA at 2415, TGTGAC at 2207, TATGCC at 2175, TCTGGG at 2162, ACTGAA at 2119, TCTGGG at 1986, ACTGCA at 1950, TCTGAG at 1772, TCTGAC at 1759, TCTGGA at 1686, TGTGCC at 1575, TCTGAA at 1433, ACTGCC at 1144, TCTGAC at 1140, TCTGAG at 946, AGTGGC at 813, TCTGGG at 774, ACTGAA at 719, AGTGAA at 714, AATGCA at 544, AATGAA at 540, TGTGGG at 513, AGTGAC at 268, AATGCA at 160, TGTGAA at 95, TCTGCC at 29.
  15. Inr2r4ci: 54, TCTGAA at 4481, TATGCG at 4466, TATGGG at 4226, AGTGAA at 4004, TCTGGG at 3855, TCTGGG at 3842, AGTGCG at 3827, ACTGGA at 3621, TGTGCC at 3541, AGTGAA at 3422, ACTGAG at 3400, AGTGCA at 3335, TATGGA at 3311, AATGGC at 3265, TGTGAA at 3252, AATGGG at 3220, AGTGAG at 3178, AATGCA at 3123, TGTGGC at 2983, AGTGAC at 2778, TGTGGG at 2762, AATGGG at 2703, ACTGGG at 2687, AGTGGG at 2606, AATGGG at 2518, ACTGGG at 2391, TCTGGG at 2383, TATGCC at 2331, AGTGCG at 2268, TGTGAG at 2245, AATGAA at 2038, TGTGAA at 1962, TATGGG at 1831, TGTGGG at 1825, TATGAC at 1687, ACTGCA at 1663, TGTGGC at 1457, TATGCC at 1373, AATGGC at 1365, TGTGCC at 1358, AGTGCA at 1331, AATGCA at 1326, TGTGAA at 1318, TATGCA at 1265, AATGCA at 1119, AATGGC at 1076, TCTGCA at 1069, AATGAG at 997, ACTGAA at 993, TCTGCC at 966, AGTGAG at 738, AATGAA at 720, TGTGGA at 288, TATGAG at 71.
  16. Inr2r5ci: 48, TATGCG at 4541, AATGGA at 4342, TATGCG at 4275, TGTGAA at 4043, AATGAC at 3966, ACTGAA at 3916, TATGGA at 3898, TGTGGC at 3882, AGTGAG at 3837, TCTGAA at 3796, TATGGC at 3787, AATGCC at 3634, AATGCG at 3187, AGTGGG at 3160, TATGGG at 3151, TATGGC at 3053, AGTGAC at 3039, AATGAC at 2920, TGTGCC at 2856, AGTGCG at 2798, AATGAA at 2769, TATGAG at 2711, TGTGGC at 2672, AATGCC at 2661, AGTGGG at 2425, AATGGA at 2410, AATGGA at 2342, TCTGGG at 2336, TCTGAG at 2109, TGTGAC at 1901, ACTGAG at 1876, ACTGCC at 1849, TGTGGG at 1723, TGTGCG at 1698, TCTGAG at 1625, AATGAA at 1567, ACTGGA at 1525, TCTGCA at 1322, AGTGGC at 1216, ACTGGG at 1182, TGTGGC at 1131, TCTGGC at 1046, AGTGCA at 899, AGTGCA at 818, AATGAC at 504, TATGGC at 243, TATGGG at 174, TATGCA at 151.
  17. Inr2r6ci: 43, TATGGG at 4345, ACTGAC at 4232, AGTGGA at 4074, TCTGCA at 4007, AATGAA at 3700, AGTGCG at 3457, AATGAC at 3191, AGTGGC at 3134, TCTGAA at 3094, AATGCC at 3080, AATGCG at 3005, TGTGAA at 2949, TCTGAA at 2940, TGTGCA at 2655, TCTGGC at 2574, TATGGC at 2558, TCTGAG at 2447, TGTGAA at 2423, TGTGAA at 2332, AGTGAC at 2136, AGTGGC at 2091, AATGCG at 2068, TCTGCG at 1965, AATGGC at 1958, TCTGCC at 1883, TATGGC at 1854, ACTGAG at 1811, TCTGGG at 1359, TGTGAC at 1344, TCTGAA at 1322, TATGCC at 1245, AGTGGG at 1196, ACTGGG at 1058, TCTGGC at 935, TCTGAC at 806, AATGGA at 743, TCTGCA at 705, AGTGAA at 561, AGTGGA at 534, AATGGC at 251, TATGAA at 247, TGTGAG at 102, AATGGC at 74.
  18. Inr2r7ci: 51, TCTGCG at 4537, TCTGAA at 4307, AGTGCA at 4288, TGTGGG at 4228, AGTGCC at 4057, TCTGCC at 4038, TCTGCG at 4007, TATGGG at 3990, AATGAC at 3965, ACTGGC at 3877, TATGCC at 3776, ACTGAA at 3529, TATGCC at 3495, TATGCA at 3477, AATGGC at 3389, TCTGGG at 3335, AATGCA at 3238, TATGCG at 3227, ACTGGA at 2969, TATGCA at 2903, AGTGGG at 2867, TATGGA at 2717, ACTGAG at 2697, AGTGCA at 2657, ACTGGG at 2613, ACTGAG at 2600, TCTGGA at 2562, AGTGGG at 2213, TCTGAG at 2171, TGTGCC at 2141, ACTGAG at 2073, AATGAA at 1954, AATGAG at 1836, TGTGCA at 1681, AATGGA at 1645, TGTGGA at 1511, TATGCA at 1484, TGTGCC at 1404, TGTGAC at 1329, AGTGGG at 1254, AATGAG at 1187, TCTGGG at 968, AATGGG at 955, TGTGGA at 919, TCTGAC at 701, AGTGGC at 653, ACTGCC at 615, AGTGCC at 557, TATGGA at 312, TCTGCA at 73, ACTGGC at 12.
  19. Inr2r8ci: 59, AGTGGC at 4528, ACTGCG at 4513, TGTGCG at 4238, AATGAG at 4203, AATGAA at 3986, TCTGCG at 3789, TCTGAG at 3706, AATGGG at 3597, AATGCC at 3582, TCTGGA at 3525, AATGGG at 3438, AATGGC at 3107, ACTGCA at 3091, ACTGAG at 3025, AATGGG at 2950, AGTGCA at 2844, TGTGCA at 2730, AATGAC at 2650, AGTGGG at 2587, TATGCC at 2569, AATGCC at 2515, AATGGG at 2468, AATGGC at 2423, TATGCA at 2296, ACTGAC at 2234, TATGGA at 2087, AATGGA at 2050, TCTGCA at 2038, TATGAG at 1963, ACTGAA at 1954, TATGGC at 1781, TGTGGC at 1715, AATGAG at 1673, TATGCG at 1647, ACTGAA at 1581, AGTGGG at 1566, TCTGGG at 1484, AATGAA at 1434, AATGCC at 1378, TCTGAA at 1340, AGTGGG at 1325, AATGAA at 1297, TGTGGA at 1286, AGTGGG at 1205, AATGAC at 1132, TATGAC at 911, ACTGAA at 888, AATGGG at 788, AGTGGC at 754, ACTGGG at 667, TATGCA at 656, AATGGC at 640, TATGGG at 571, TGTGAC at 538, AATGGG at 409, AGTGCA at 352, AATGGC at 159, TATGGG at 95, TCTGGG at 37.
  20. Inr2r9ci: 58, AGTGAC at 4471, AATGAG at 4414, TGTGGG at 4391, AATGGC at 4343, TATGAA at 4339, TATGCG at 4255, ACTGAC at 4208, TATGCG at 4134, AATGCG at 3960, ACTGAA at 3843, TCTGCG at 3818, AATGGC at 3788, ACTGCA at 3768, ACTGGA at 3760, TATGAA at 3745, TATGAC at 3703, AATGAC at 3679, ACTGAA at 3542, TCTGAC at 3520, TCTGGA at 3301, AGTGAA at 3215, TATGGG at 3178, TATGAA at 3120, TATGGA at 2863, TCTGCC at 2756, TCTGCG at 2651, AATGGA at 2626, TATGAA at 2350, AATGAC at 2232, AATGAC at 2079, ACTGAC at 1755, AATGGC at 1724, ACTGGG at 1713, ACTGGG at 1699, AATGAA at 1679, AGTGCC at 1670, ACTGCA at 1599, TGTGGA at 1471, AGTGGG at 1455, AATGAC at 1404, TATGAA at 1400, ACTGAA at 1388, TCTGGC at 1209, TCTGGA at 1055, ACTGGA at 866, ACTGCC at 843, TCTGCG at 825, TCTGCC at 730, ACTGGC at 628, ACTGGC at 487, TATGCA at 457, TATGGC at 443, ACTGGA at 397, AGTGCC at 361, AATGGC at 333, TCTGGA at 146, TCTGCG at 113, TCTGGG at 38.

Inr2r arbitrary UTRs

  1. Inr2r0: TCCAGA at 4545, CCCATA at 4384, TGCACT at 4356, GCCACT at 4254, TTCATT at 4098, GTCATA at 4056, GCCATT at 4001, GTCATA at 3961, GCCATA at 3796, CCCATT at 3775, CGCACT at 3751, TGCAGA at 3554, GGCATA at 3458, GGCATA at 3447, GCCACT at 3410, CCCACA at 3394, CTCATT at 3388, GCCATA at 3303, TCCAGT at 3286, GTCACT at 3190, GGCATT at 3048, CCCATT at 2922, TGCAGA at 2903, CTCATA at 2849.
  2. Inr2r2: GGCAGA at 4549, GCCATA at 4542, CGCATA at 4528, GGCAGA at 4443, TCCACT at 4409, GTCACT at 4383, CCCACT at 4337, TTCACT at 4280, GTCATA at 4191, CCCAGT at 4138, TCCACA at 4066, TTCACT at 4050, CCCAGA at 3941, CGCAGT at 3847, CCCACA at 3836, TGCACA at 3732, CGCATT at 3483, TCCATT at 3379, GTCAGA at 3330, GGCATT at 3298, TGCAGT at 3243, CGCAGA at 2929, GTCAGA at 2901, CCCATT at 2893.
  3. Inr2r4: CCCAGT at 4417, CGCACA at 4394, CCCACA at 4198, TTCACT at 3946, TGCAGT at 3931, GCCACT at 3837, TGCATT at 3662, TCCAGT at 3601, GCCATT at 3574, TGCAGA at 3365, TGCAGA at 3337, CCCAGT at 3332, TGCACT at 3065, GCCATT at 3025, TGCAGT at 2945, CCCATT at 2916, CCCAGT at 2896.
  4. Inr2r6: GCCACA at 4391, CGCAGA at 4308, CCCACT at 4179, TGCAGT at 4009, GCCATT at 3997, CTCACA at 3770, GGCATT at 3746, GCCATT at 3726, TCCATA at 3693, CCCACA at 3628, TCCATA at 3595, GTCACT at 3493, CGCAGT at 3327, CCCATT at 3231, TCCATT at 2911.
  5. Inr2r8: CGCAGT at 4460, GCCAGA at 4404, TCCACA at 4336, GGCATT at 4321, CGCAGA at 4257, TCCACT at 4157, GCCATT at 4021, GGCACA at 3962, CCCAGT at 3954, CTCAGA at 3819, GGCACT at 3739, GCCACA at 3711, GGCAGA at 3539, TTCATT at 3333, TGCATA at 3093, CGCATT at 2959, CGCAGT at 2911, GTCATT at 2854.
  6. Inr2r0ci: TGTGGC at 4536, AGTGGG at 4503, ACTGAA at 4359, TATGCC at 4291, ACTGCC at 4257, AATGGC at 4239, TGTGCA at 4108, TCTGCC at 4075, TGTGCG at 3933, TGTGCC at 3770, TGTGGC at 3671, AATGGA at 3582, ACTGGA at 3535, ACTGGA at 3497, ACTGGA at 3273, TCTGAC at 3269, TCTGGA at 2988.
  7. Inr2r2ci: TGTGGA at 4489, ACTGGG at 4429, TATGGG at 4388, AATGAG at 4358, ACTGGC at 4283, ACTGCC at 4053, ACTGGG at 4004, AATGAC at 3986, TGTGCC at 3896, AGTGCC at 3861, AATGCC at 3831, TCTGGG at 3765, AGTGGC at 3681, TCTGAC at 3579, TGTGCA at 3507, TCTGCC at 3346, TATGAA at 3142, TCTGCC at 3102, AATGAA at 3054.
  8. Inr2r4ci: TCTGAA at 4481, TATGCG at 4466, TATGGG at 4226, AGTGAA at 4004, TCTGGG at 3855, TCTGGG at 3842, AGTGCG at 3827, ACTGGA at 3621, TGTGCC at 3541, AGTGAA at 3422, ACTGAG at 3400, AGTGCA at 3335, TATGGA at 3311, AATGGC at 3265, TGTGAA at 3252, AATGGG at 3220, AGTGAG at 3178, AATGCA at 3123, TGTGGC at 2983.
  9. Inr2r6ci: TATGGG at 4345, ACTGAC at 4232, AGTGGA at 4074, TCTGCA at 4007, AATGAA at 3700, AGTGCG at 3457, AATGAC at 3191, AGTGGC at 3134, TCTGAA at 3094, AATGCC at 3080, AATGCG at 3005, TGTGAA at 2949, TCTGAA at 2940.
  10. Inr2r8ci: AGTGGC at 4528, ACTGCG at 4513, TGTGCG at 4238, AATGAG at 4203, AATGAA at 3986, TCTGCG at 3789, TCTGAG at 3706, AATGGG at 3597, AATGCC at 3582, TCTGGA at 3525, AATGGG at 3438, AATGGC at 3107, ACTGCA at 3091, ACTGAG at 3025, AATGGG at 2950.

Inr2r alternate UTRs

  1. Inr2r1: GGCACA at 4523, CTCATT at 4265, CTCACT at 4250, CCCACA at 4235, GGCACA at 4138, TTCAGA at 4128, CGCACT at 4117, GGCATT at 4049, TTCACT at 3898, CCCAGA at 3870, GCCAGT at 3852, GGCACT at 3677, TGCACT at 3596, CTCATT at 3553, GCCATT at 3407, GGCACT at 3205, CCCACT at 3055.
  2. Inr2r3: CCCAGT at 4539, CCCACA at 4283, CTCATT at 4213, CTCATA at 4181, TGCAGT at 4130, TCCATT at 4025, CCCAGT at 3985, GTCACT at 3845, CCCATA at 3796, CGCATA at 3731, GGCACT at 3652, TCCAGT at 3568, CTCATT at 3470, CTCACT at 3196, CCCACT at 2996, GTCATA at 2856.
  3. Inr2r5: CCCACT at 4269, CCCATA at 4104, TTCACA at 4072, GTCATT at 4053, CTCAGA at 4028, GCCACT at 3913, CTCATT at 3808, TCCACT at 3699, TCCATT at 3624, TTCACA at 3430, CGCACA at 3416, TTCACA at 3343, CCCATA at 3211, CCCATT at 3146, TCCAGT at 3036.
  4. Inr2r7: TGCATA at 4290, CCCATA at 4220, TGCAGA at 4176, GGCATT at 4163, CGCAGA at 4144, TCCAGT at 4054, GTCACA at 3896, CGCACT at 3874, TGCAGT at 3770, CCCACA at 3641, CTCATT at 3620, GGCACT at 3501, TGCACT at 3479, CTCAGA at 3372, TCCACA at 3266, TGCATT at 3240, GCCACT at 3116, GGCACT at 3081, TCCAGA at 3051, GGCACT at 3027, CTCAGA at 2912, CCCAGT at 2864, TCCATA at 2847.
  5. Inr2r9: CCCAGA at 4483, GCCACT at 4099, GGCACA at 4050, TGCACA at 4031, TTCAGA at 4025, CCCAGT at 3947, CCCACT at 3829, GGCAGT at 3813, TGCATA at 3770, TCCATT at 3658, TTCACT at 3644, TCCATT at 3617, GGCAGT at 3601, GGCACT at 3539, CCCATT at 3475, CCCACA at 3449, CCCAGA at 3226, CGCATT at 3101.
  6. Inr2r1ci: TATGAC at 4498, AGTGGG at 4360, AATGAA at 4338, TATGCC at 4212, TATGGG at 4166, AGTGGG at 4149, AATGGC at 3940, TATGAA at 3793, TCTGGC at 3705, ACTGGG at 3680, AGTGGG at 3673, AATGCA at 3594, AGTGCG at 3467, TGTGGC at 3403, AGTGAC at 3357, TCTGCA at 3336, AATGCC at 3306, TCTGCG at 3102, AATGAA at 3086, AGTGAA at 3069, AATGCC at 2917, TGTGAA at 2906.
  7. Inr2r3ci: ACTGGC at 4534, TATGCG at 4490, TCTGGA at 4403, TATGAA at 4171, AGTGAG at 4133, TATGGG at 3893, AATGGA at 3882, TGTGGG at 3855, ACTGGC at 3649, TGTGAG at 3439, AATGGC at 3370, AATGAG at 3277, TCTGAG at 3253, AGTGGA at 3241, AATGCA at 3046, TGTGAA at 2970, AATGGA at 2938.
  8. Inr2r5ci: TATGCG at 4541, AATGGA at 4342, TATGCG at 4275, TGTGAA at 4043, AATGAC at 3966, ACTGAA at 3916, TATGGA at 3898, TGTGGC at 3882, AGTGAG at 3837, TCTGAA at 3796, TATGGC at 3787, AATGCC at 3634, AATGCG at 3187, AGTGGG at 3160, TATGGG at 3151, TATGGC at 3053, AGTGAC at 3039, AATGAC at 2920, TGTGCC at 2856.
  9. Inr2r7ci: TCTGCG at 4537, TCTGAA at 4307, AGTGCA at 4288, TGTGGG at 4228, AGTGCC at 4057, TCTGCC at 4038, TCTGCG at 4007, TATGGG at 3990, AATGAC at 3965, ACTGGC at 3877, TATGCC at 3776, ACTGAA at 3529, TATGCC at 3495, TATGCA at 3477, AATGGC at 3389, TCTGGG at 3335, AATGCA at 3238, TATGCG at 3227, ACTGGA at 2969, TATGCA at 2903, AGTGGG at 2867.
  10. Inr2r9ci: AGTGAC at 4471, AATGAG at 4414, TGTGGG at 4391, AATGGC at 4343, TATGAA at 4339, TATGCG at 4255, ACTGAC at 4208, TATGCG at 4134, AATGCG at 3960, ACTGAA at 3843, TCTGCG at 3818, AATGGC at 3788, ACTGCA at 3768, ACTGGA at 3760, TATGAA at 3745, TATGAC at 3703, AATGAC at 3679, ACTGAA at 3542, TCTGAC at 3520, TCTGGA at 3301, AGTGAA at 3215, TATGGG at 3178, TATGAA at 3120, TATGGA at 2863.

Inr2r arbitrary negative direction core promoters

  1. Inr2r2: TTCACA at 2829.
  2. Inr2r0ci: TGTGGC at 2844.
  3. Inr2r2ci: ACTGAG at 2837.
  4. Inr2r8ci: AGTGCA at 2844.

Inr2r alternate negative direction core promoters

  1. Inr2r3: CTCATT at 2840, CTCAGT at 2819.

Inr2r arbitrary positive direction core promoters

  1. Inr2r1: CTCATT at 4265.
  2. Inr2r3: CCCACA at 4283.
  3. Inr2r5: CCCACT at 4269.
  4. Inr2r7: TGCATA at 4290.
  5. Inr2r1ci: AGTGGG at 4360, AATGAA at 4338.
  6. Inr2r3ci: TCTGGA at 4403.
  7. Inr2r5ci: AATGGA at 4342, TATGCG at 4275.
  8. Inr2r7ci: TCTGAA at 4307, AGTGCA at 4288.
  9. Inr2r9ci: AATGAG at 4414, TGTGGG at 4391, AATGGC at 4343, TATGAA at 4339.

Inr2r alternate positive direction core promoters

  1. Inr2r0: CCCATA at 4384, TGCACT at 4356.
  2. Inr2r2: GGCAGA at 4443, TCCACT at 4409, GTCACT at 4383, CCCACT at 4337, TTCACT at 4280.
  3. Inr2r4: CCCAGT at 4417, CGCACA at 4394.
  4. Inr2r6: GCCACA at 4391, CGCAGA at 4308.
  5. Inr2r8: GCCAGA at 4404, TCCACA at 4336, GGCATT at 4321.
  6. Inr2r0ci: ACTGAA at 4359, TATGCC at 4291.
  7. Inr2r2ci: ACTGGG at 4429, TATGGG at 4388, AATGAG at 4358, ACTGGC at 4283.
  8. Inr2r6ci: TATGGG at 4345.

Inr2r arbitrary negative direction proximal promoters

  1. Inr2r2: CGCATT at 2752, CTCAGA at 2744, GTCACA at 2682, CCCATT at 2625.
  2. Inr2r4: CCCAGT at 2748, GCCACT at 2684.
  3. Inr2r6: TCCAGT at 2636.
  4. Inr2r8: TCCATT at 2809, TTCACT at 2796, GCCAGT at 2791, CCCACT at 2766, CCCAGA at 2724.
  5. Inr2r0ci: TATGAC at 2729.
  6. Inr2r2ci: AATGGG at 2733, TCTGCG at 2713, TATGCG at 2670, AGTGGG at 2606.
  7. Inr2r4ci: AGTGAC at 2778, TGTGGG at 2762, AATGGG at 2703, ACTGGG at 2687, AGTGGG at 2606.
  8. Inr2r6ci: TGTGCA at 2655.
  9. Inr2r8ci: TGTGCA at 2730, AATGAC at 2650.

Inr2r alternate negative direction proximal promoters

  1. Inr2r1: CTCACA at 2696.
  2. Inr2r3: CGCACT at 2803, CCCATT at 2797, CTCAGA at 2771, GGCACA at 2697, TGCATA at 2611.
  3. Inr2r5: GCCATT at 2803, GGCAGT at 2795, TCCAGT at 2651, CCCATA at 2626, CCCATT at 2599.
  4. Inr2r7: CCCACA at 2680, CCCATA at 2671, TCCACT at 2647, CTCAGA at 2634, CCCAGA at 2622.
  5. Inr2r9: TGCACA at 2711.
  6. Inr2r1ci: AATGCC at 2782, AGTGCC at 2635.
  7. Inr2r3ci: AATGCG at 2706, AATGGC at 2694, TGTGAG at 2671, AGTGAC at 2636, TCTGCA at 2609.
  8. Inr2r5ci: AGTGCG at 2798, AATGAA at 2769, TATGAG at 2711, TGTGGC at 2672, AATGCC at 2661.
  9. Inr2r7ci: TATGGA at 2717, ACTGAG at 2697, AGTGCA at 2657, ACTGGG at 2613, ACTGAG at 2600.
  10. Inr2r9ci: TCTGCC at 2756, TCTGCG at 2651, AATGGA at 2626.

Inr2r arbitrary positive direction proximal promoters

  1. Inr2r1: CTCATT at 4265, CTCACT at 4250, CCCACA at 4235, GGCACA at 4138, TTCAGA at 4128, CGCACT at 4117.
  2. Inr2r3: CTCATT at 4213, CTCATA at 4181, TGCAGT at 4130.
  3. Inr2r5: CCCATA at 4104, TTCACA at 4072, GTCATT at 4053.
  4. Inr2r7: CCCATA at 4220, TGCAGA at 4176, GGCATT at 4163, CGCAGA at 4144, TCCAGT at 4054.
  5. Inr2r9: GCCACT at 4099, GGCACA at 4050.
  6. Inr2r1ci: TATGCC at 4212, TATGGG at 4166, AGTGGG at 4149.
  7. Inr2r3ci: TATGAA at 4171, AGTGAG at 4133.
  8. Inr2r7ci: TGTGGG at 4228, AGTGCC at 4057.
  9. Inr2r9ci: TATGCG at 4255, ACTGAC at 4208, TATGCG at 4134.

Inr2r alternate positive direction proximal promoters

  1. Inr2r0: GCCACT at 4254, TTCATT at 4098, GTCATA at 4056.
  2. Inr2r2: GTCATA at 4191, CCCAGT at 4138, TCCACA at 4066, TTCACT at 4050.
  3. Inr2r4: CCCACA at 4198.
  4. Inr2r6: CCCACT at 4179.
  5. Inr2r8: CGCAGA at 4257, TCCACT at 4157.
  6. Inr2r0ci: ACTGCC at 4257, AATGGC at 4239, TGTGCA at 4108, TCTGCC at 4075.
  7. Inr2r2ci: ACTGCC at 4053.
  8. Inr2r4ci: TATGGG at 4226.
  9. Inr2r6ci: ACTGAC at 4232, AGTGGA at 4074.
  10. Inr2r8ci: TGTGCG at 4238, AATGAG at 4203.

Inr2r arbitrary negative direction distal promoters

  1. Inr2r0: CTCACT at 2587, TCCATA at 2519, GGCACT at 2473, CGCACA at 2451, GGCAGT at 2445, GCCATT at 2389, GCCACA at 2162, TTCATA at 2149, CTCAGT at 2137, CCCATA at 2039, GCCAGA at 1947, TCCAGA at 1867, TGCACA at 1785, TTCAGA at 1772, CGCAGA at 1614, GGCACA at 1606, TCCAGA at 1476, CCCAGT at 1371, TTCACT at 1361, TGCAGT at 1340, CGCATT at 1265, GCCATT at 1184, TCCACA at 1163, TTCACA at 887, GTCAGA at 847, TCCATA at 539, GGCAGT at 528, CGCATT at 481, TTCATA at 313, CCCAGA at 126, CCCATA at 39.
  2. Inr2r2: GCCATA at 2581, TGCAGA at 2540, TTCACA at 2421, CTCATT at 2241, TCCACT at 2148, GCCACT at 2098, TTCACT at 2088, TTCACT at 2052, GCCACA at 2026, CTCAGT at 1985, TTCACA at 1797, TTCACT at 1725, CCCATT at 1706, GCCATT at 1471, CGCATT at 1413, CTCACT at 1367, TCCACT at 1363, GGCACA at 1306, CGCACT at 1105, TTCACA at 1087, TCCATA at 972, GCCACT at 886, GTCATA at 866, GCCATA at 791, CGCATA at 777, GGCATA at 616, CCCAGA at 605, GTCAGA at 469, TCCAGA at 353, GGCAGA at 334, GGCATT at 114.
  3. Inr2r4: CTCATT at 2567, CCCATT at 2545, CTCAGT at 2459, TTCATA at 2375, GCCAGA at 2334, GTCATT at 2250, TGCACA at 2197, CGCAGA at 2176, TCCACT at 2079, CCCATT at 1842, TCCACA at 1771, CCCACT at 1716, CTCATA at 1683, CTCATT at 1674, GCCATT at 1550, CTCATT at 1528, TGCAGT at 1418, TGCAGT at 1328, GCCATA at 1095, CCCATA at 920, CCCACA at 907, GCCATA at 463, TTCATA at 315, CCCATT at 308, CCCACA at 236, TGCAGA at 148.
  4. Inr2r6: CCCATT at 2460, TCCACA at 2355, TCCATA at 2349, GGCAGA at 2323, CCCACA at 2312, GCCATT at 2292, GCCACT at 2260, GCCATA at 2203, CTCATT at 2177, GGCAGT at 2160, TCCACT at 2060, CGCACA at 1941, CCCACA at 1776, TCCACT at 1764, CTCAGT at 1697, CCCACA at 1670, TGCATT at 1660, GGCACT at 1582, GGCACT at 1568, CCCACA at 1297, GCCACT at 1248, CCCACT at 1181, CCCACA at 1166, TTCATA at 1122, GTCATT at 1111, TTCAGA at 1043, CGCATT at 982, GGCACA at 967, CCCAGA at 852, GTCATT at 840, TTCATT at 669, CCCATA at 624, TTCACA at 580, CCCAGT at 528, GGCATT at 468, CCCAGT at 391, TCCATT at 354, CGCAGT at 349, TTCATT at 146.
  5. Inr2r8: GCCATT at 2461, GGCATT at 2426, TGCAGT at 2298, CCCACT at 2178, CCCAGA at 2005, CCCACA at 1903, GCCAGT at 1640, TCCACT at 1578, CCCAGA at 1382, TCCAGA at 1359, GGCATT at 1279, CCCACT at 1112, TTCATA at 875, CCCATA at 780, CGCACT at 763, GGCACT at 664, TGCACA at 658, TGCATT at 354, TTCAGA at 279.
  6. Inr2r0ci: AATGCC at 2159, TATGGA at 2153, TGTGGC at 2065, TGTGCC at 1944, TGTGCC at 1760, AGTGCC at 1681, AATGGA at 1580, AGTGCA at 1338, AGTGAA at 1329, ACTGGG at 1201, TGTGCC at 1061, TATGGA at 990, AATGGG at 936, ACTGAA at 900, AATGCC at 773, TGTGGA at 766, AGTGAA at 744, TCTGAG at 740, TGTGAC at 726, ACTGGG at 639, AGTGGG at 531, TATGGC at 486, TGTGAA at 463, TATGCC at 341, ACTGGC at 138, AGTGGG at 110.
  7. Inr2r2ci: ACTGCC at 2595, TCTGCA at 2538, AATGGA at 2505, TATGAA at 2476, TGTGGA at 2202, AGTGGG at 2141, ACTGGC at 2073, AGTGAG at 2060, ACTGCC at 2023, TGTGGG at 2001, TATGAC at 1919, TATGAG at 1759, ACTGGC at 1728, AATGCA at 1464, AATGGG at 1452, TCTGAC at 1341, ACTGCG at 913, ACTGGG at 839, TGTGCG at 728, TGTGCG at 704, TGTGGC at 529, TGTGGC at 460, AATGGC at 429, AATGAC at 372, TATGGG at 361, AATGGG at 284, AGTGAA at 280, TCTGCC at 205, AATGGG at 188, TGTGAG at 168, ACTGCG at 152, AGTGAC at 148.
  8. Inr2r4ci: AATGGG at 2518, ACTGGG at 2391, TCTGGG at 2383, TATGCC at 2331, AGTGCG at 2268, TGTGAG at 2245, AATGAA at 2038, TGTGAA at 1962, TATGGG at 1831, TGTGGG at 1825, TATGAC at 1687, ACTGCA at 1663, TGTGGC at 1457, TATGCC at 1373, AATGGC at 1365, TGTGCC at 1358, AGTGCA at 1331, AATGCA at 1326, TGTGAA at 1318, TATGCA at 1265, AATGCA at 1119, AATGGC at 1076, TCTGCA at 1069, AATGAG at 997, ACTGAA at 993, TCTGCC at 966, AGTGAG at 738, AATGAA at 720, TGTGGA at 288, TATGAG at 71.
  9. Inr2r6ci: TCTGGC at 2574, TATGGC at 2558, TCTGAG at 2447, TGTGAA at 2423, TGTGAA at 2332, AGTGAC at 2136, AGTGGC at 2091, AATGCG at 2068, TCTGCG at 1965, AATGGC at 1958, TCTGCC at 1883, TATGGC at 1854, ACTGAG at 1811, TCTGGG at 1359, TGTGAC at 1344, TCTGAA at 1322, TATGCC at 1245, AGTGGG at 1196, ACTGGG at 1058, TCTGGC at 935, TCTGAC at 806, AATGGA at 743, TCTGCA at 705, AGTGAA at 561, AGTGGA at 534, AATGGC at 251, TATGAA at 247, TGTGAG at 102, AATGGC at 74.
  10. Inr2r8ci: AGTGGG at 2587, TATGCC at 2569, AATGCC at 2515, AATGGG at 2468, AATGGC at 2423, TATGCA at 2296, ACTGAC at 2234, TATGGA at 2087, AATGGA at 2050, TCTGCA at 2038, TATGAG at 1963, ACTGAA at 1954, TATGGC at 1781, TGTGGC at 1715, AATGAG at 1673, TATGCG at 1647, ACTGAA at 1581, AGTGGG at 1566, TCTGGG at 1484, AATGAA at 1434, AATGCC at 1378, TCTGAA at 1340, AGTGGG at 1325, AATGAA at 1297, TGTGGA at 1286, AGTGGG at 1205, AATGAC at 1132, TATGAC at 911, ACTGAA at 888, AATGGG at 788, AGTGGC at 754, ACTGGG at 667, TATGCA at 656, AATGGC at 640, TATGGG at 571, TGTGAC at 538, AATGGG at 409, AGTGCA at 352, AATGGC at 159, TATGGG at 95, TCTGGG at 37.

Inr2r alternate negative direction distal promoters

  1. Inr2r1: TGCATA at 2367, CCCAGT at 2358, TCCATT at 2293, GTCATT at 2239, TTCACT at 2200, CCCAGT at 2149, TTCAGT at 2065, TTCATA at 1993, GCCATT at 1908, TTCAGA at 1796, GTCAGA at 1752, CTCAGT at 1650, TGCATA at 1603, CTCAGA at 1533, CGCATA at 1356, GGCACT at 1327, CCCATT at 1316, TTCACT at 1213, TGCATT at 1142, TGCATT at 1025, TCCACA at 985, GCCACT at 917, TTCATT at 772, GGCAGT at 480, CTCAGT at 427, GTCAGA at 359, CCCATT at 335, CCCACT at 239, TCCATA at 56.
  2. Inr2r3: GGCATT at 2537, TCCATT at 2503, TCCATT at 2351, GGCATT at 2346, TTCATT at 2325, GTCAGA at 2240, CCCAGA at 2179, CGCACT at 2168, CGCAGT at 2148, CTCAGA at 1998, GGCACT at 1882, TTCAGA at 1831, CGCATT at 1827, CCCATT at 1465, GCCATT at 1459, TCCAGA at 1386, CCCATT at 1335, CGCAGA at 1298, GCCACT at 1178, CTCAGT at 1156, CCCATT at 1065, TTCATT at 1046, TCCATT at 935, TTCATT at 923, TTCAGA at 688, GTCATT at 479, CGCAGA at 473, CCCACA at 434, GCCATT at 401, CCCAGA at 296, CTCAGA at 218.
  3. Inr2r5: GGCAGA at 2593, CCCATA at 2515, CCCAGA at 2311, TTCAGT at 2192, CTCAGA at 1945, CCCATA at 1777, CCCAGT at 1693, TCCACT at 1671, CTCATA at 1653, TCCAGA at 1594, CGCATA at 1507, GGCAGT at 1497, GGCAGA at 1427, GTCAGT at 1356, TTCACA at 1282, GGCACT at 1219, GCCATT at 1202, TCCAGA at 1000, GGCACT at 970, TTCATA at 936, TCCAGA at 882, GGCAGA at 688, GCCACT at 439, GCCAGA at 339, GCCATT at 292.
  4. Inr2r7: GTCACT at 2587, GCCATT at 2444, GTCATA at 2326, GCCATT at 2270, TTCATT at 1892, TTCAGT at 1803, CTCAGT at 1750, TGCATA at 1683, TGCACT at 1486, CCCATT at 1340, GGCAGA at 1265, CCCACA at 1204, CCCATT at 1175, GGCACT at 1070, GTCATT at 1016, CCCACA at 814, TTCACT at 612, GCCAGA at 583, TCCATT at 569, GCCACT at 560, CGCAGT at 554, TCCATA at 547, CTCACT at 522, GTCATA at 204, CGCACA at 198, CGCATT at 168, TCCAGA at 125, TGCACT at 75.
  5. Inr2r9: CTCATT at 2340, CCCATT at 2244, GGCATA at 2193, GTCAGA at 2058, GGCACA at 2021, GTCAGA at 1862, GCCACA at 1814, GTCATT at 1802, CCCACA at 1674, CGCATT at 1651, TGCACT at 1601, GCCATA at 1561, CCCACA at 1520, CGCATT at 1446, GGCATA at 1221, CTCACT at 1197, TGCATA at 1191, CCCAGT at 734, CGCACT at 693, TTCACT at 648, CCCATT at 616, TTCACA at 415, CTCAGA at 313, GGCACA at 302, GTCAGT at 108, CTCAGT at 104, CCCATT at 84.
  6. Inr2r1ci: TGTGCC at 2530, ACTGGG at 2485, AATGCA at 2365, AATGAA at 2271, AGTGGA at 2152, TATGGC at 2103, TGTGCC at 2027, TGTGCG at 1805, AATGCA at 1622, TATGCG at 1607, TGTGCG at 1585, AGTGGA at 1446, AATGCC at 1425, TCTGGA at 1339, TATGGC at 1283, AATGGA at 1257, TCTGCA at 1133, TATGGG at 1110, ACTGAA at 1038, TGTGGG at 873, TCTGAG at 834, TGTGGG at 699, TATGGA at 688, TGTGCA at 642, AATGAA at 457, TATGAG at 342, ACTGCG at 242, TATGCG at 171, ACTGAC at 153, TCTGGG at 62, TGTGGA at 46, TCTGGA at 27, TCTGCG at 10.
  7. Inr2r3ci: TGTGAC at 2448, AGTGAC at 2441, TCTGCA at 2415, TGTGAC at 2207, TATGCC at 2175, TCTGGG at 2162, ACTGAA at 2119, TCTGGG at 1986, ACTGCA at 1950, TCTGAG at 1772, TCTGAC at 1759, TCTGGA at 1686, TGTGCC at 1575, TCTGAA at 1433, ACTGCC at 1144, TCTGAC at 1140, TCTGAG at 946, AGTGGC at 813, TCTGGG at 774, ACTGAA at 719, AGTGAA at 714, AATGCA at 544, AATGAA at 540, TGTGGG at 513, AGTGAC at 268, AATGCA at 160, TGTGAA at 95, TCTGCC at 29.
  8. Inr2r5ci: AGTGGG at 2425, AATGGA at 2410, AATGGA at 2342, TCTGGG at 2336, TCTGAG at 2109, TGTGAC at 1901, ACTGAG at 1876, ACTGCC at 1849, TGTGGG at 1723, TGTGCG at 1698, TCTGAG at 1625, AATGAA at 1567, ACTGGA at 1525, TCTGCA at 1322, AGTGGC at 1216, ACTGGG at 1182, TGTGGC at 1131, TCTGGC at 1046, AGTGCA at 899, AGTGCA at 818, AATGAC at 504, TATGGC at 243, TATGGG at 174, TATGCA at 151.
  9. Inr2r7ci: TCTGGA at 2562, AGTGGG at 2213, TCTGAG at 2171, TGTGCC at 2141, ACTGAG at 2073, AATGAA at 1954, AATGAG at 1836, TGTGCA at 1681, AATGGA at 1645, TGTGGA at 1511, TATGCA at 1484, TGTGCC at 1404, TGTGAC at 1329, AGTGGG at 1254, AATGAG at 1187, TCTGGG at 968, AATGGG at 955, TGTGGA at 919, TCTGAC at 701, AGTGGC at 653, ACTGCC at 615, AGTGCC at 557, TATGGA at 312, TCTGCA at 73, ACTGGC at 12.
  10. Inr2r9ci: TATGAA at 2350, AATGAC at 2232, AATGAC at 2079, ACTGAC at 1755, AATGGC at 1724, ACTGGG at 1713, ACTGGG at 1699, AATGAA at 1679, AGTGCC at 1670, ACTGCA at 1599, TGTGGA at 1471, AGTGGG at 1455, AATGAC at 1404, TATGAA at 1400, ACTGAA at 1388, TCTGGC at 1209, TCTGGA at 1055, ACTGGA at 866, ACTGCC at 843, TCTGCG at 825, TCTGCC at 730, ACTGGC at 628, ACTGGC at 487, TATGCA at 457, TATGGC at 443, ACTGGA at 397, AGTGCC at 361, AATGGC at 333, TCTGGA at 146, TCTGCG at 113, TCTGGG at 38.

Inr2r arbitrary positive direction distal promoters

  1. Inr2r1: GGCATT at 4049, TTCACT at 3898, CCCAGA at 3870, GCCAGT at 3852, GGCACT at 3677, TGCACT at 3596, CTCATT at 3553, GCCATT at 3407, GGCACT at 3205, CCCACT at 3055, CTCACA at 2696, TGCATA at 2367, CCCAGT at 2358, TCCATT at 2293, GTCATT at 2239, TTCACT at 2200, CCCAGT at 2149, TTCAGT at 2065, TTCATA at 1993, GCCATT at 1908, TTCAGA at 1796, GTCAGA at 1752, CTCAGT at 1650, TGCATA at 1603, CTCAGA at 1533, CGCATA at 1356, GGCACT at 1327, CCCATT at 1316, TTCACT at 1213, TGCATT at 1142, TGCATT at 1025, TCCACA at 985, GCCACT at 917, TTCATT at 772, GGCAGT at 480, CTCAGT at 427, GTCAGA at 359, CCCATT at 335, CCCACT at 239, TCCATA at 56.
  2. Inr2r3: TCCATT at 4025, CCCAGT at 3985, GTCACT at 3845, CCCATA at 3796, CGCATA at 3731, GGCACT at 3652, TCCAGT at 3568, CTCATT at 3470, CTCACT at 3196, CCCACT at 2996, GTCATA at 2856, CTCATT at 2840, CTCAGT at 2819, CGCACT at 2803, CCCATT at 2797, CTCAGA at 2771, GGCACA at 2697, TGCATA at 2611, GGCATT at 2537, TCCATT at 2503, TCCATT at 2351, GGCATT at 2346, TTCATT at 2325, GTCAGA at 2240, CCCAGA at 2179, CGCACT at 2168, CGCAGT at 2148, CTCAGA at 1998, GGCACT at 1882, TTCAGA at 1831, CGCATT at 1827, CCCATT at 1465, GCCATT at 1459, TCCAGA at 1386, CCCATT at 1335, CGCAGA at 1298, GCCACT at 1178, CTCAGT at 1156, CCCATT at 1065, TTCATT at 1046, TCCATT at 935, TTCATT at 923, TTCAGA at 688, GTCATT at 479, CGCAGA at 473, CCCACA at 434, GCCATT at 401, CCCAGA at 296, CTCAGA at 218.
  3. Inr2r5: CTCAGA at 4028, GCCACT at 3913, CTCATT at 3808, TCCACT at 3699, TCCATT at 3624, TTCACA at 3430, CGCACA at 3416, TTCACA at 3343, CCCATA at 3211, CCCATT at 3146, TCCAGT at 3036, GCCATT at 2803, GGCAGT at 2795, TCCAGT at 2651, CCCATA at 2626, CCCATT at 2599, GGCAGA at 2593, CCCATA at 2515, CCCAGA at 2311, TTCAGT at 2192, CTCAGA at 1945, CCCATA at 1777, CCCAGT at 1693, TCCACT at 1671, CTCATA at 1653, TCCAGA at 1594, CGCATA at 1507, GGCAGT at 1497, GGCAGA at 1427, GTCAGT at 1356, TTCACA at 1282, GGCACT at 1219, GCCATT at 1202, TCCAGA at 1000, GGCACT at 970, TTCATA at 936, TCCAGA at 882, GGCAGA at 688, GCCACT at 439, GCCAGA at 339, GCCATT at 292.
  4. Inr2r7: GTCACA at 3896, CGCACT at 3874, TGCAGT at 3770, CCCACA at 3641, CTCATT at 3620, GGCACT at 3501, TGCACT at 3479, CTCAGA at 3372, TCCACA at 3266, TGCATT at 3240, GCCACT at 3116, GGCACT at 3081, TCCAGA at 3051, GGCACT at 3027, CTCAGA at 2912, CCCAGT at 2864, TCCATA at 2847, CCCACA at 2680, CCCATA at 2671, TCCACT at 2647, CTCAGA at 2634, CCCAGA at 2622, GTCACT at 2587, GCCATT at 2444, GTCATA at 2326, GCCATT at 2270, TTCATT at 1892, TTCAGT at 1803, CTCAGT at 1750, TGCATA at 1683, TGCACT at 1486, CCCATT at 1340, GGCAGA at 1265, CCCACA at 1204, CCCATT at 1175, GGCACT at 1070, GTCATT at 1016, CCCACA at 814, TTCACT at 612, GCCAGA at 583, TCCATT at 569, GCCACT at 560, CGCAGT at 554, TCCATA at 547, CTCACT at 522, GTCATA at 204, CGCACA at 198, CGCATT at 168, TCCAGA at 125, TGCACT at 75.
  5. Inr2r9: GGCACA at 4050, TGCACA at 4031, TTCAGA at 4025, CCCAGT at 3947, CCCACT at 3829, GGCAGT at 3813, TGCATA at 3770, TCCATT at 3658, TTCACT at 3644, TCCATT at 3617, GGCAGT at 3601, GGCACT at 3539, CCCATT at 3475, CCCACA at 3449, CCCAGA at 3226, CGCATT at 3101, TGCACA at 2711, CTCATT at 2340, CCCATT at 2244, GGCATA at 2193, GTCAGA at 2058, GGCACA at 2021, GTCAGA at 1862, GCCACA at 1814, GTCATT at 1802, CCCACA at 1674, CGCATT at 1651, TGCACT at 1601, GCCATA at 1561, CCCACA at 1520, CGCATT at 1446, GGCATA at 1221, CTCACT at 1197, TGCATA at 1191, CCCAGT at 734, CGCACT at 693, TTCACT at 648, CCCATT at 616, TTCACA at 415, CTCAGA at 313, GGCACA at 302, GTCAGT at 108, CTCAGT at 104, CCCATT at 84.
  6. Inr2r1ci: AATGGC at 3940, TATGAA at 3793, TCTGGC at 3705, ACTGGG at 3680, AGTGGG at 3673, AATGCA at 3594, AGTGCG at 3467, TGTGGC at 3403, AGTGAC at 3357, TCTGCA at 3336, AATGCC at 3306, TCTGCG at 3102, AATGAA at 3086, AGTGAA at 3069, AATGCC at 2917, TGTGAA at 2906, AATGCC at 2782, AGTGCC at 2635, TGTGCC at 2530, ACTGGG at 2485, AATGCA at 2365, AATGAA at 2271, AGTGGA at 2152, TATGGC at 2103, TGTGCC at 2027, TGTGCG at 1805, AATGCA at 1622, TATGCG at 1607, TGTGCG at 1585, AGTGGA at 1446, AATGCC at 1425, TCTGGA at 1339, TATGGC at 1283, AATGGA at 1257, TCTGCA at 1133, TATGGG at 1110, ACTGAA at 1038, TGTGGG at 873, TCTGAG at 834, TGTGGG at 699, TATGGA at 688, TGTGCA at 642, AATGAA at 457, TATGAG at 342, ACTGCG at 242, TATGCG at 171, ACTGAC at 153, TCTGGG at 62, TGTGGA at 46, TCTGGA at 27, TCTGCG at 10.
  7. Inr2r3ci: TATGGG at 3893, AATGGA at 3882, TGTGGG at 3855, ACTGGC at 3649, TGTGAG at 3439, AATGGC at 3370, AATGAG at 3277, TCTGAG at 3253, AGTGGA at 3241, AATGCA at 3046, TGTGAA at 2970, AATGGA at 2938, AATGCG at 2706, AATGGC at 2694, TGTGAG at 2671, AGTGAC at 2636, TCTGCA at 2609, TGTGAC at 2448, AGTGAC at 2441, TCTGCA at 2415, TGTGAC at 2207, TATGCC at 2175, TCTGGG at 2162, ACTGAA at 2119, TCTGGG at 1986, ACTGCA at 1950, TCTGAG at 1772, TCTGAC at 1759, TCTGGA at 1686, TGTGCC at 1575, TCTGAA at 1433, ACTGCC at 1144, TCTGAC at 1140, TCTGAG at 946, AGTGGC at 813, TCTGGG at 774, ACTGAA at 719, AGTGAA at 714, AATGCA at 544, AATGAA at 540, TGTGGG at 513, AGTGAC at 268, AATGCA at 160, TGTGAA at 95, TCTGCC at 29.
  8. Inr2r5ci: TGTGAA at 4043, AATGAC at 3966, ACTGAA at 3916, TATGGA at 3898, TGTGGC at 3882, AGTGAG at 3837, TCTGAA at 3796, TATGGC at 3787, AATGCC at 3634, AATGCG at 3187, AGTGGG at 3160, TATGGG at 3151, TATGGC at 3053, AGTGAC at 3039, AATGAC at 2920, TGTGCC at 2856, AGTGCG at 2798, AATGAA at 2769, TATGAG at 2711, TGTGGC at 2672, AATGCC at 2661, AGTGGG at 2425, AATGGA at 2410, AATGGA at 2342, TCTGGG at 2336, TCTGAG at 2109, TGTGAC at 1901, ACTGAG at 1876, ACTGCC at 1849, TGTGGG at 1723, TGTGCG at 1698, TCTGAG at 1625, AATGAA at 1567, ACTGGA at 1525, TCTGCA at 1322, AGTGGC at 1216, ACTGGG at 1182, TGTGGC at 1131, TCTGGC at 1046, AGTGCA at 899, AGTGCA at 818, AATGAC at 504, TATGGC at 243, TATGGG at 174, TATGCA at 151.
  9. Inr2r7ci: TCTGCC at 4038, TCTGCG at 4007, TATGGG at 3990, AATGAC at 3965, ACTGGC at 3877, TATGCC at 3776, ACTGAA at 3529, TATGCC at 3495, TATGCA at 3477, AATGGC at 3389, TCTGGG at 3335, AATGCA at 3238, TATGCG at 3227, ACTGGA at 2969, TATGCA at 2903, AGTGGG at 2867, TATGGA at 2717, ACTGAG at 2697, AGTGCA at 2657, ACTGGG at 2613, ACTGAG at 2600, TCTGGA at 2562, AGTGGG at 2213, TCTGAG at 2171, TGTGCC at 2141, ACTGAG at 2073, AATGAA at 1954, AATGAG at 1836, TGTGCA at 1681, AATGGA at 1645, TGTGGA at 1511, TATGCA at 1484, TGTGCC at 1404, TGTGAC at 1329, AGTGGG at 1254, AATGAG at 1187, TCTGGG at 968, AATGGG at 955, TGTGGA at 919, TCTGAC at 701, AGTGGC at 653, ACTGCC at 615, AGTGCC at 557, TATGGA at 312, TCTGCA at 73, ACTGGC at 12.
  10. Inr2r9ci: AATGCG at 3960, ACTGAA at 3843, TCTGCG at 3818, AATGGC at 3788, ACTGCA at 3768, ACTGGA at 3760, TATGAA at 3745, TATGAC at 3703, AATGAC at 3679, ACTGAA at 3542, TCTGAC at 3520, TCTGGA at 3301, AGTGAA at 3215, TATGGG at 3178, TATGAA at 3120, TATGGA at 2863, TCTGCC at 2756, TCTGCG at 2651, AATGGA at 2626, TATGAA at 2350, AATGAC at 2232, AATGAC at 2079, ACTGAC at 1755, AATGGC at 1724, ACTGGG at 1713, ACTGGG at 1699, AATGAA at 1679, AGTGCC at 1670, ACTGCA at 1599, TGTGGA at 1471, AGTGGG at 1455, AATGAC at 1404, TATGAA at 1400, ACTGAA at 1388, TCTGGC at 1209, TCTGGA at 1055, ACTGGA at 866, ACTGCC at 843, TCTGCG at 825, TCTGCC at 730, ACTGGC at 628, ACTGGC at 487, TATGCA at 457, TATGGC at 443, ACTGGA at 397, AGTGCC at 361, AATGGC at 333, TCTGGA at 146, TCTGCG at 113, TCTGGG at 38.

Inr2r alternate positive direction distal promoters

  1. Inr2r0: GCCATT at 4001, GTCATA at 3961, GCCATA at 3796, CCCATT at 3775, CGCACT at 3751, TGCAGA at 3554, GGCATA at 3458, GGCATA at 3447, GCCACT at 3410, CCCACA at 3394, CTCATT at 3388, GCCATA at 3303, TCCAGT at 3286, GTCACT at 3190, GGCATT at 3048, CCCATT at 2922, TGCAGA at 2903, CTCATA at 2849, CTCACT at 2587, TCCATA at 2519, GGCACT at 2473, CGCACA at 2451, GGCAGT at 2445, GCCATT at 2389, GCCACA at 2162, TTCATA at 2149, CTCAGT at 2137, CCCATA at 2039, GCCAGA at 1947, TCCAGA at 1867, TGCACA at 1785, TTCAGA at 1772, CGCAGA at 1614, GGCACA at 1606, TCCAGA at 1476, CCCAGT at 1371, TTCACT at 1361, TGCAGT at 1340, CGCATT at 1265, GCCATT at 1184, TCCACA at 1163, TTCACA at 887, GTCAGA at 847, TCCATA at 539, GGCAGT at 528, CGCATT at 481, TTCATA at 313, CCCAGA at 126, CCCATA at 39.
  2. Inr2r2: TTCACT at 4050, CCCAGA at 3941, CGCAGT at 3847, CCCACA at 3836, TGCACA at 3732, CGCATT at 3483, TCCATT at 3379, GTCAGA at 3330, GGCATT at 3298, TGCAGT at 3243, CGCAGA at 2929, GTCAGA at 2901, CCCATT at 2893, TTCACA at 2829, CGCATT at 2752, CTCAGA at 2744, GTCACA at 2682, CCCATT at 2625, GCCATA at 2581, TGCAGA at 2540, TTCACA at 2421, CTCATT at 2241, TCCACT at 2148, GCCACT at 2098, TTCACT at 2088, TTCACT at 2052, GCCACA at 2026, CTCAGT at 1985, TTCACA at 1797, TTCACT at 1725, CCCATT at 1706, GCCATT at 1471, CGCATT at 1413, CTCACT at 1367, TCCACT at 1363, GGCACA at 1306, CGCACT at 1105, TTCACA at 1087, TCCATA at 972, GCCACT at 886, GTCATA at 866, GCCATA at 791, CGCATA at 777, GGCATA at 616, CCCAGA at 605, GTCAGA at 469, TCCAGA at 353, GGCAGA at 334, GGCATT at 114.
  3. Inr2r4: TTCACT at 3946, TGCAGT at 3931, GCCACT at 3837, TGCATT at 3662, TCCAGT at 3601, GCCATT at 3574, TGCAGA at 3365, TGCAGA at 3337, CCCAGT at 3332, TGCACT at 3065, GCCATT at 3025, TGCAGT at 2945, CCCATT at 2916, CCCAGT at 2896, CCCAGT at 2748, GCCACT at 2684, CTCATT at 2567, CCCATT at 2545, CTCAGT at 2459, TTCATA at 2375, GCCAGA at 2334, GTCATT at 2250, TGCACA at 2197, CGCAGA at 2176, TCCACT at 2079, CCCATT at 1842, TCCACA at 1771, CCCACT at 1716, CTCATA at 1683, CTCATT at 1674, GCCATT at 1550, CTCATT at 1528, TGCAGT at 1418, TGCAGT at 1328, GCCATA at 1095, CCCATA at 920, CCCACA at 907, GCCATA at 463, TTCATA at 315, CCCATT at 308, CCCACA at 236, TGCAGA at 148.
  4. Inr2r6: TGCAGT at 4009, GCCATT at 3997, CTCACA at 3770, GGCATT at 3746, GCCATT at 3726, TCCATA at 3693, CCCACA at 3628, TCCATA at 3595, GTCACT at 3493, CGCAGT at 3327, CCCATT at 3231, TCCATT at 2911, TCCAGT at 2636, CCCATT at 2460, TCCACA at 2355, TCCATA at 2349, GGCAGA at 2323, CCCACA at 2312, GCCATT at 2292, GCCACT at 2260, GCCATA at 2203, CTCATT at 2177, GGCAGT at 2160, TCCACT at 2060, CGCACA at 1941, CCCACA at 1776, TCCACT at 1764, CTCAGT at 1697, CCCACA at 1670, TGCATT at 1660, GGCACT at 1582, GGCACT at 1568, CCCACA at 1297, GCCACT at 1248, CCCACT at 1181, CCCACA at 1166, TTCATA at 1122, GTCATT at 1111, TTCAGA at 1043, CGCATT at 982, GGCACA at 967, CCCAGA at 852, GTCATT at 840, TTCATT at 669, CCCATA at 624, TTCACA at 580, CCCAGT at 528, GGCATT at 468, CCCAGT at 391, TCCATT at 354, CGCAGT at 349, TTCATT at 146.
  5. Inr2r8: GCCATT at 4021, GGCACA at 3962, CCCAGT at 3954, CTCAGA at 3819, GGCACT at 3739, GCCACA at 3711, GGCAGA at 3539, TTCATT at 3333, TGCATA at 3093, CGCATT at 2959, CGCAGT at 2911, GTCATT at 2854, TCCATT at 2809, TTCACT at 2796, GCCAGT at 2791, CCCACT at 2766, CCCAGA at 2724, GCCATT at 2461, GGCATT at 2426, TGCAGT at 2298, CCCACT at 2178, CCCAGA at 2005, CCCACA at 1903, GCCAGT at 1640, TCCACT at 1578, CCCAGA at 1382, TCCAGA at 1359, GGCATT at 1279, CCCACT at 1112, TTCATA at 875, CCCATA at 780, CGCACT at 763, GGCACT at 664, TGCACA at 658, TGCATT at 354, TTCAGA at 279.
  6. Inr2r0ci: TGTGCG at 3933, TGTGCC at 3770, TGTGGC at 3671, AATGGA at 3582, ACTGGA at 3535, ACTGGA at 3497, ACTGGA at 3273, TCTGAC at 3269, TCTGGA at 2988, TGTGGC at 2844, TATGAC at 2729, AATGCC at 2159, TATGGA at 2153, TGTGGC at 2065, TGTGCC at 1944, TGTGCC at 1760, AGTGCC at 1681, AATGGA at 1580, AGTGCA at 1338, AGTGAA at 1329, ACTGGG at 1201, TGTGCC at 1061, TATGGA at 990, AATGGG at 936, ACTGAA at 900, AATGCC at 773, TGTGGA at 766, AGTGAA at 744, TCTGAG at 740, TGTGAC at 726, ACTGGG at 639, AGTGGG at 531, TATGGC at 486, TGTGAA at 463, TATGCC at 341, ACTGGC at 138, AGTGGG at 110.
  7. Inr2r2ci: ACTGGG at 4004, AATGAC at 3986, TGTGCC at 3896, AGTGCC at 3861, AATGCC at 3831, TCTGGG at 3765, AGTGGC at 3681, TCTGAC at 3579, TGTGCA at 3507, TCTGCC at 3346, TATGAA at 3142, TCTGCC at 3102, AATGAA at 3054, ACTGAG at 2837, AATGGG at 2733, TCTGCG at 2713, TATGCG at 2670, AGTGGG at 2606, ACTGCC at 2595, TCTGCA at 2538, AATGGA at 2505, TATGAA at 2476, TGTGGA at 2202, AGTGGG at 2141, ACTGGC at 2073, AGTGAG at 2060, ACTGCC at 2023, TGTGGG at 2001, TATGAC at 1919, TATGAG at 1759, ACTGGC at 1728, AATGCA at 1464, AATGGG at 1452, TCTGAC at 1341, ACTGCG at 913, ACTGGG at 839, TGTGCG at 728, TGTGCG at 704, TGTGGC at 529, TGTGGC at 460, AATGGC at 429, AATGAC at 372, TATGGG at 361, AATGGG at 284, AGTGAA at 280, TCTGCC at 205, AATGGG at 188, TGTGAG at 168, ACTGCG at 152, AGTGAC at 148.
  8. Inr2r4ci: AGTGAA at 4004, TCTGGG at 3855, TCTGGG at 3842, AGTGCG at 3827, ACTGGA at 3621, TGTGCC at 3541, AGTGAA at 3422, ACTGAG at 3400, AGTGCA at 3335, TATGGA at 3311, AATGGC at 3265, TGTGAA at 3252, AATGGG at 3220, AGTGAG at 3178, AATGCA at 3123, TGTGGC at 2983, AGTGAC at 2778, TGTGGG at 2762, AATGGG at 2703, ACTGGG at 2687, AGTGGG at 2606, AATGGG at 2518, ACTGGG at 2391, TCTGGG at 2383, TATGCC at 2331, AGTGCG at 2268, TGTGAG at 2245, AATGAA at 2038, TGTGAA at 1962, TATGGG at 1831, TGTGGG at 1825, TATGAC at 1687, ACTGCA at 1663, TGTGGC at 1457, TATGCC at 1373, AATGGC at 1365, TGTGCC at 1358, AGTGCA at 1331, AATGCA at 1326, TGTGAA at 1318, TATGCA at 1265, AATGCA at 1119, AATGGC at 1076, TCTGCA at 1069, AATGAG at 997, ACTGAA at 993, TCTGCC at 966, AGTGAG at 738, AATGAA at 720, TGTGGA at 288, TATGAG at 71.
  9. Inr2r6ci: TCTGCA at 4007, AATGAA at 3700, AGTGCG at 3457, AATGAC at 3191, AGTGGC at 3134, TCTGAA at 3094, AATGCC at 3080, AATGCG at 3005, TGTGAA at 2949, TCTGAA at 2940, TGTGCA at 2655, TCTGGC at 2574, TATGGC at 2558, TCTGAG at 2447, TGTGAA at 2423, TGTGAA at 2332, AGTGAC at 2136, AGTGGC at 2091, AATGCG at 2068, TCTGCG at 1965, AATGGC at 1958, TCTGCC at 1883, TATGGC at 1854, ACTGAG at 1811, TCTGGG at 1359, TGTGAC at 1344, TCTGAA at 1322, TATGCC at 1245, AGTGGG at 1196, ACTGGG at 1058, TCTGGC at 935, TCTGAC at 806, AATGGA at 743, TCTGCA at 705, AGTGAA at 561, AGTGGA at 534, AATGGC at 251, TATGAA at 247, TGTGAG at 102, AATGGC at 74.
  10. Inr2r8ci: AATGAA at 3986, TCTGCG at 3789, TCTGAG at 3706, AATGGG at 3597, AATGCC at 3582, TCTGGA at 3525, AATGGG at 3438, AATGGC at 3107, ACTGCA at 3091, ACTGAG at 3025, AATGGG at 2950, AGTGCA at 2844, TGTGCA at 2730, AATGAC at 2650, AGTGGG at 2587, TATGCC at 2569, AATGCC at 2515, AATGGG at 2468, AATGGC at 2423, TATGCA at 2296, ACTGAC at 2234, TATGGA at 2087, AATGGA at 2050, TCTGCA at 2038, TATGAG at 1963, ACTGAA at 1954, TATGGC at 1781, TGTGGC at 1715, AATGAG at 1673, TATGCG at 1647, ACTGAA at 1581, AGTGGG at 1566, TCTGGG at 1484, AATGAA at 1434, AATGCC at 1378, TCTGAA at 1340, AGTGGG at 1325, AATGAA at 1297, TGTGGA at 1286, AGTGGG at 1205, AATGAC at 1132, TATGAC at 911, ACTGAA at 888, AATGGG at 788, AGTGGC at 754, ACTGGG at 667, TATGCA at 656, AATGGC at 640, TATGGG at 571, TGTGAC at 538, AATGGG at 409, AGTGCA at 352, AATGGC at 159, TATGGG at 95, TCTGGG at 37.

BBCABW (Ngoc 20 January 2017) analysis and results

An analysis of 7670 transcription start sites showed that roughly 40% had an exact match to the BBCA+1BW Inr sequence, while 16% contained only one mismatch [7]

Reals or randoms Promoters direction Numbers Strands Occurrences Averages (± 0.1)
Reals UTR negative 78 2 39 39 ± 7 (--32,+-46)
Randoms UTR arbitrary negative 181 10 18.1 18.65 ± 0.55
Randoms UTR alternate negative 192 10 19.2 18.65 ± 0.55
Reals Core negative 0 2 0 0
Randoms Core arbitrary negative 4 10 0.4 0.3 ± 0.1
Randoms Core alternate negative 2 10 0.2 0.3 ± 0.1
Reals Core positive 18 2 9 9 ± 1 (-+10,++8)
Randoms Core arbitrary positive 15 10 1.5 1.8 ± 0.3
Randoms Core alternate positive 21 10 2.1 1.8 ± 0.3
Reals Proximal negative 12 2 6 6 ± 2 (--4,+-8)
Randoms Proximal arbitrary negative 25 10 2.5 3.1 ± 0.6
Randoms Proximal alternate negative 37 10 3.7 3.1 ± 0.6
Reals Proximal positive 9 2 4.5 4.5 ±1.5 (-+3,++6)
Randoms Proximal arbitrary positive 29 10 2.9 2.5 ± 0.4
Randoms Proximal alternate positive 21 10 2.1 2.5 ± 0.4
Reals Distal negative 113 2 56.5 56.5 ± 2.5 (--54,+-59)
Randoms Distal arbitrary negative 304 10 30.4 29.25 ± 1.15
Randoms Distal alternate negative 281 10 28.1 29.25 ± 1.15
Reals Distal positive 241 2 120.5 120.5 ± 47.5 (-+168,++73)
Randoms Distal arbitrary positive 461 10 46.1 46.1
Randoms Distal alternate positive 461 10 46.1 46.1

Comparison:

The occurrences of real BBCABW Inrs are greater than the randoms. This suggests that the real BBCABW Inrs are likely active or activable.

Drosophila melanogaster (Butler 2002) initiator element samplings

Copying a responsive elements consensus sequence TCA(G/T)T(C/T) and putting the sequence in "⌘F" finds none between ZNF497 and A1BG or none between ZSCAN22 and A1BG as can be found by the computer programs.

For the Basic programs testing consensus sequence TCA(G/T)T(C/T) (starting with SuccessablesInrDm.bas) written to compare nucleotide sequences with the sequences on either the template strand (-), or coding strand (+), of the DNA, in the negative direction (-), or the positive direction (+), the programs are, are looking for, and found:

  1. negative strand, negative direction: 3, TCATTC at 3892, TCATTT at 3481, TCATTC at 2502.
  2. positive strand, negative direction: 0.
  3. negative strand, positive direction: 2, TCATTT at 4119, TCAGTC at 2099.
  4. positive strand, positive direction: 4, TCAGTC at 2619, TCAGTT at 2614, TCAGTC at 2608, TCAGTC at 2101.
  5. inverse complement, negative strand, negative direction: 0.
  6. inverse complement, positive strand, negative direction: 9, GAATGA at 4555, AAATGA at 2187, GACTGA at 1935, AAATGA at 1700, AAATGA at 1663, AAATGA at 1580, AACTGA at 307, AACTGA at 130, GACTGA at 17.
  7. inverse complement, negative strand, positive direction: 9, AAATGA at 4094, GAATGA at 3835, GAATGA at 3782, AACTGA at 3735, GAATGA at 3567, GAATGA at 3445, GAATGA at 3441, GAATGA at 2841, GACTGA at 2674.
  8. inverse complement, positive strand, positive direction: 2, GACTGA at 2945, GAATGA at 524.

InrDm (4560-2846) UTRs

  1. Negative strand, negative direction: TCATTC at 3892, TCATTT at 3481.
  2. Positive strand, negative direction: GAATGA at 4555.

InrDm positive direction (4265-4050) proximal promoters

  1. Negative strand, positive direction: TCATTT at 4119.
  2. Negative strand, positive direction: AAATGA at 4094.

InrDm negative direction (2596-1) distal promoters

  1. Negative strand, negative direction: TCATTC at 2502.
  2. Positive strand, negative direction: AAATGA at 2187, GACTGA at 1935, AAATGA at 1700, AAATGA at 1663, AAATGA at 1580, AACTGA at 307, AACTGA at 130, GACTGA at 17.

InrDm positive direction (4050-1) distal promoters

  1. Negative strand, positive direction: TCAGTC at 2099.
  2. Negative strand, positive direction: GAATGA at 3835, GAATGA at 3782, AACTGA at 3735, GAATGA at 3567, GAATGA at 3445, GAATGA at 3441, GAATGA at 2841, GACTGA at 2674.
  3. Positive strand, positive direction: TCAGTC at 2619, TCAGTT at 2614, TCAGTC at 2608, TCAGTC at 2101.
  4. Positive strand, positive direction: GACTGA at 2945, GAATGA at 524.

Drosophila melanogaster (Butler 2002) initiator element random dataset samplings

  1. InrDmr0: 3, TCATTC at 3389, TCAGTT at 2965, TCAGTT at 2138.
  2. InrDmr1: 4, TCAGTC at 3948, TCATTC at 3554, TCAGTC at 1651, TCATTC at 773.
  3. InrDmr2: 1, TCAGTT at 1986.
  4. InrDmr3: 7, TCATTC at 3471, TCAGTC at 2820, TCATTT at 2326, TCAGTC at 1658, TCATTT at 1558, TCATTT at 924, TCATTC at 480.
  5. InrDmr4: 3, TCATTC at 2112, TCATTT at 1675, TCATTC at 1529.
  6. InrDmr5: 3, TCATTT at 4054, TCAGTC at 2193, TCAGTT at 13570.
  7. InrDmr6: 4, TCATTT at 3760, TCAGTT at 1698, TCAGTC at 893, TCATTC at 841.
  8. InrDmr7: 4, TCATTT at 1893, TCAGTT at 1804, TCAGTT at 1751, TCATTC at 1017.
  9. InrDmr8: 1, TCATTT at 3334.
  10. InrDmr9: 5, TCATTT at 2341, TCAGTT at 2286, TCAGTT at 1245, TCAGTC at 109, TCAGTC at 105.
  11. InrDmr0ci: 1, AACTGA at 899.
  12. InrDmr1ci: 5, AAATGA at 3442, GAATGA at 2270, AACTGA at 1037, GAATGA at 456, GACTGA at 152.
  13. InrDmr2ci: 3, GAATGA at 4357, AAATGA at 3800, AAATGA at 371.
  14. InrDmr3ci: 4, GACTGA at 3898, GACTGA at 2118, AACTGA at 718, AAATGA at 539.
  15. InrDmr4ci: 1, GAATGA at 996.
  16. InrDmr5ci: 6, AAATGA at 4286, AAATGA at 3965, AAATGA at 2768, AACTGA at 1875, AAATGA at 1566, GAATGA at 503.
  17. InrDmr6ci: 4, AAATGA at 3699, AAATGA at 3190, AACTGA at 1810, AAATGA at 749.
  18. InrDmr7ci: 9, AAATGA at 3964, AACTGA at 3528, GACTGA at 2696, GACTGA at 2072, AAATGA at 1953, AAATGA at 1835, AAATGA at 1186, AAATGA at 1114, AAATGA at 1037.
  19. InrDmr8ci: 7, GACTGA at 4186, GAATGA at 2649, GAATGA at 2537, GAATGA at 1672, GAATGA at 1296, AAATGA at 964, AACTGA at 887.
  20. InrDmr9ci: 10, GAATGA at 4413, GACTGA at 4207, AACTGA at 3842, GAATGA at 3678, GAATGA at 2463, GAATGA at 2231, AAATGA at 2078, AACTGA at 1754, GAATGA at 1403, GACTGA at 1387.

InrDmr arbitrary (evens) (4560-2846) UTRs

  1. InrDmr0: TCATTC at 3389, TCAGTT at 2965.
  2. InrDmr6: TCATTT at 3760.
  3. InrDmr8: TCATTT at 3334.
  4. InrDmr2ci: GAATGA at 4357, AAATGA at 3800.
  5. InrDmr6ci: AAATGA at 3699, AAATGA at 3190.
  6. InrDmr8ci: GACTGA at 4186.

InrDmr alternate (odds) (4560-2846) UTRs

  1. InrDmr1: TCAGTC at 3948, TCATTC at 3554.
  2. InrDmr3: TCATTC at 3471.
  3. InrDmr5: TCATTT at 4054.
  4. InrDmr1ci: AAATGA at 3442.
  5. InrDmr3ci: GACTGA at 3898, GACTGA at 2118, AACTGA at 718, AAATGA at 539.
  6. InrDmr5ci: AAATGA at 4286, AAATGA at 3965.
  7. InrDmr7ci: AAATGA at 3964, AACTGA at 3528, GACTGA at 2696, GACTGA at 2072, AAATGA at 1953, AAATGA at 1835, AAATGA at 1186, AAATGA at 1114, AAATGA at 1037.
  8. InrDmr9ci: GAATGA at 4413, GACTGA at 4207, AACTGA at 3842, GAATGA at 3678.

InrDmr alternate negative direction (odds) (2846-2811) core promoters

  1. InrDmr3: TCAGTC at 2820.

InrDmr arbitrary positive direction (odds) (4445-4265) core promoters

  1. InrDmr5ci: AAATGA at 4286.
  2. InrDmr9ci: GAATGA at 4413.

InrDmr alternate positive direction (evens) (4445-4265) core promoters

  1. InrDmr2ci: GAATGA at 4357.

InrDmr arbitrary negative direction (evens) (2811-2596) proximal promoters

  1. InrDmr8ci: GAATGA at 2649.

InrDmr alternate negative direction (odds) (2811-2596) proximal promoters

  1. InrDmr5ci: AAATGA at 2768.
  2. InrDmr7ci: GACTGA at 2696.
  3. InrDmr9ci: GAATGA at 2463.

InrDmr arbitrary positive direction (odds) (4265-4050) proximal promoters

  1. InrDmr5: TCATTT at 4054.
  2. InrDmr9ci: GACTGA at 4207.

InrDmr alternate positive direction (evens) (4265-4050) proximal promoters

  1. InrDmr8ci: GACTGA at 4186.

InrDmr arbitrary negative direction (evens) (2596-1) distal promoters

  1. InrDmr0: TCAGTT at 2138.
  2. InrDmr2: TCAGTT at 1986.
  3. InrDmr4: TCATTC at 2112, TCATTT at 1675, TCATTC at 1529.
  4. InrDmr6: TCAGTT at 1698, TCAGTC at 893, TCATTC at 841.
  5. InrDmr0ci: AACTGA at 899.
  6. InrDmr2ci: AAATGA at 371.
  7. InrDmr4ci: GAATGA at 996.
  8. InrDmr6ci: AACTGA at 1810, AAATGA at 749.
  9. InrDmr8ci: GAATGA at 2537, GAATGA at 1672, GAATGA at 1296, AAATGA at 964, AACTGA at 887.

InrDmr alternate negative direction (odds) (2596-1) distal promoters

  1. InrDmr1: TCAGTC at 1651, TCATTC at 773.
  2. InrDmr3: TCATTT at 2326, TCAGTC at 1658, TCATTT at 1558, TCATTT at 924, TCATTC at 480.
  3. InrDmr5: TCAGTC at 2193, TCAGTT at 1357.
  4. InrDmr7: TCATTT at 1893, TCAGTT at 1804, TCAGTT at 1751, TCATTC at 1017.
  5. InrDmr9: TCATTT at 2341, TCAGTT at 2286, TCAGTT at 1245, TCAGTC at 109, TCAGTC at 105.
  6. InrDmr1ci: GAATGA at 2270, AACTGA at 1037, GAATGA at 456, GACTGA at 152.
  7. InrDmr3ci: GACTGA at 2118, AACTGA at 718, AAATGA at 539.
  8. InrDmr5ci: AACTGA at 1875, AAATGA at 1566, GAATGA at 503.
  9. InrDmr7ci: GACTGA at 2072, AAATGA at 1953, AAATGA at 1835, AAATGA at 1186, AAATGA at 1114, AAATGA at 1037.
  10. InrDmr9ci: GAATGA at 2463, GAATGA at 2231, AAATGA at 2078, AACTGA at 1754, GAATGA at 1403, GACTGA at 1387.

InrDmr arbitrary positive direction (odds) (4050-1) distal promoters

  1. InrDmr1: TCAGTC at 3948, TCATTC at 3554, TCAGTC at 1651, TCATTC at 773.
  2. InrDmr3: TCATTC at 3471, TCAGTC at 2820, TCATTT at 2326, TCAGTC at 1658, TCATTT at 1558, TCATTT at 924, TCATTC at 480.
  3. InrDmr5: TCATTT at 4054, TCAGTC at 2193, TCAGTT at 1357.
  4. InrDmr7: TCATTT at 1893, TCAGTT at 1804, TCAGTT at 1751, TCATTC at 1017.
  5. InrDmr9: TCATTT at 2341, TCAGTT at 2286, TCAGTT at 1245, TCAGTC at 109, TCAGTC at 105.
  6. InrDmr1ci: AAATGA at 3442, GAATGA at 2270, AACTGA at 1037, GAATGA at 456, GACTGA at 152.
  7. InrDmr3ci: GACTGA at 3898, GACTGA at 2118, AACTGA at 718, AAATGA at 539.
  8. InrDmr5ci: AAATGA at 3965, AAATGA at 2768, AACTGA at 1875, AAATGA at 1566, GAATGA at 503.
  9. InrDmr7ci: AAATGA at 3964, AACTGA at 3528, GACTGA at 2696, GACTGA at 2072, AAATGA at 1953, AAATGA at 1835, AAATGA at 1186, AAATGA at 1114, AAATGA at 1037.
  10. InrDmr9ci: AACTGA at 3842, GAATGA at 3678, GAATGA at 2463, GAATGA at 2231, AAATGA at 2078, AACTGA at 1754, GAATGA at 1403, GACTGA at 1387.

InrDmr alternate positive direction (evens) (4050-1) distal promoters

  1. InrDmr0: TCATTC at 3389, TCAGTT at 2965, TCAGTT at 2138.
  2. InrDmr2: TCAGTT at 1986.
  3. InrDmr4: TCATTC at 2112, TCATTT at 1675, TCATTC at 1529.
  4. InrDmr6: TCATTT at 3760, TCAGTT at 1698, TCAGTC at 893, TCATTC at 841.
  5. InrDmr8: TCATTT at 3334.
  6. InrDmr0ci: AACTGA at 899.
  7. InrDmr2ci: AAATGA at 3800, AAATGA at 371.
  8. InrDmr4ci: GAATGA at 996.
  9. InrDmr6ci: AAATGA at 3699, AAATGA at 3190, AACTGA at 1810, AAATGA at 749.
  10. InrDmr8ci: GAATGA at 2649, GAATGA at 2537, GAATGA at 1672, GAATGA at 1296, AAATGA at 964, AACTGA at 887.

Drosophila melanogaster (Butler 2002) initiator element analysis and results

Dm: TCA(G/T)T(C/T).[38]

Reals or randoms Promoters direction Numbers Strands Occurrences Averages (± 0.1)
Reals UTR negative 3 2 1.5 1.5 ± 0.5 (--2,+-1)
Randoms UTR arbitrary negative 9 10 0.9 1.65 ± 0.75
Randoms UTR alternate negative 24 10 2.4 1.65 ± 0.75
Reals Core negative 0 2 0 0
Randoms Core arbitrary negative 0 10 0 0.05
Randoms Core alternate negative 1 10 0.1 0.05
Reals Core positive 0 2 0 0
Randoms Core arbitrary positive 2 10 0.2 0.15
Randoms Core alternate positive 1 10 0.1 0.15
Reals Proximal negative 0 2 0 0
Randoms Proximal arbitrary negative 1 10 0.1 0.2
Randoms Proximal alternate negative 3 10 0.3 0.2
Reals Proximal positive 2 2 1 1 ± 1 (-+2,++0)
Randoms Proximal arbitrary positive 2 10 0.2 0.15
Randoms Proximal alternate positive 1 10 0.1 0.15
Reals Distal negative 9 2 4.5 4.5 ± 3.5 (--1,+-8)
Randoms Distal arbitrary negative 18 10 1.8 2.9 ± 1.1
Randoms Distal alternate negative 40 10 4.0 2.9 ± 1.1
Reals Distal positive 15 2 7.5 7.5 ± 1.5 (-+9,++6)
Randoms Distal arbitrary positive 54 10 5.4 4.0 ± 1.4
Randoms Distal alternate positive 26 10 2.6 4.0 ± 1.4

Comparison:

The occurrences of real Drosophila melanogaster (Butler 2002) initiator element positive direction proximals and distals are greater than the randoms, UTRs are likely random, and negative distals are outside the randoms. This suggests that the real Drosophila melanogaster (Butler 2002) initiator elements are likely active or activable, UTRs are likely random.

Comparisons of Inrs for UTRs nn

Juven-Gershon (2008) Ngoc (2017) Butler 2002 Matsumoto (2020) Parry (2010)
YYRNWYY BBCABW TCAKTY TTCTCT YCTYTYY
UTR nn(4560-2846) UTR nn(4560-2846) UTR nn(4560-2846) UTR nn(4560-2846) UTR nn(4560-2846)
TTACTCC at 4557 - - - -
ciAGTGTAA at 4533 - - ciAGAGAA at 4527 -
- - - - TCTTTTC at 4391
- - - - TCTTTTT at 4384
- ciTCTGGG at 4366 - - -
TCACACT at 4361 GTCACA at 4359 - - -
- CCCACT at 4353 - - -
TCGGACC at 4349 - - - -
- ciTGTGAC at 4336 - - -
- GTCACT at 4319 - - -
CCAGTTT at 4309 TCCAGT at 4307 - - -
TCGGACC at 4300 - - - -
ciGGTCCGA at 4255 - - - -
CTGCACC at 4238 ciTCTGCA at 4236 - - -
TCGGTCT at 4233 - - - -
- ciTCTGGG at 4205 - - -
TCACTCT at 4202 GTCACT at 4200 - - -
TCGAACC at 4188 - - - -
CCGGTCC at 4170 - - - -
- ciAGTGAA at 4161 - - -
ciAGTACGG at 4118 - - - -
CCGTACC at 4107 - - - -
CCGGTCC at 4102 - - - -
TTACACT at 4092 - - - TCTTTTT at 4087
- - - - TCTTTCT at 4083
TCACTCT at 4051 ciTCTGAG at 4054 - - -
TTGTATC at 4046 - - - -
TCGGACC at 4037 ciAGTGAA at 4010 - - -
- ciTGTGAA at 3983 - - -
ciAGTGTGG at 3967 - ciTGTGGA at 3968 - -
CCGGTCC at 3951 - - - -
- TTCACA at 3939 - - -
CTACTTT at 3922 - - - -
TCATTCT at 3893 CTCATT at 3891 TCATTC at 3892 - -
ciGGTCCGG at 3873 - - - -
CTGGTCC at 3871 - - - -
ciGGTATGG at 3858 ciTATGGA at 3859 - - -
- CTCATA at 3829 - - -
- TCCACT at 3825 - - -
CTACACC at 3810 - - - -
CTGTTCT at 3759 - - - -
- - - - CCTTTCT at 3665
- ciTATGCG at 3547 - - -
- ciTATGAC at 3541 - - -
ciGGTCTAG at 3488 - - - -
TTGGTCT at 3486 - - - -
- GTCATT at 3480 TCATTT at 3481 - -
TTGATCT at 3463 - - - -
CCGTATC at 3446 - - - -
- ciTCTGAC at 3425 - - -
- - TTCACT at 3410 - TCTCTTC at 3407
CCGAACT at 3401 - - - -
ciAGTCCGA at 3398 - - - -
- - - - TCTCTCC at 3386
- - - TTCTCT at 3380 TCTCTCT at 3384
- - - TTCTCT at 3380 TCTCTCT at 3382
- - - TTCTCT at 3380 TCTTTCT at 3378
TCGTTCT at 3374 - - - -
TTGTTCT at 3340 - - - TCTTTTC at 3344
TCGTTTT at 3313 - - - -
TTGTTCT at 3307 - - - -
TCGGACC at 3298 - - - -
ciAGTGCGG at 3281 ciAGTGCG at 3280 - - -
TCGGTTC at 3273 - - - -
- ciAGTGAA at 3240 - - -
CCACACC at 3186 - CCCACA at 3184 - -
TTGTATT at 3169 - - - -
CCACTTT at 3146 - TCCACT at 3144 - -
TTGTTCC at 3141 - - - -
ciGGACCGG at 3130 - - - -
TCGGACC at 3128 - - - -
- ciAGTGAA at 3101 - - -
- ciAGTGGG at 3057 - - -
CCGCACC at 3047 - - - -
TTGATTC at 3031 - - - -
ciGATTCGA at 3033 - - - -
CCGATTT at 3009 - - - -
- ciTATGGA at 2994 - - -
- - - - CCTTTTT at 2928
TTGATTC at 2914 - - - -
ciAAAGTAG at 2887 - - - -
- TTCACA at 2860 - - -

Comparisons of Inrs for UTRs pn

Juven-Gershon (2008) Ngoc (2017) Butler (2002) Matsumoto (2020) Parry (2010)
YYRNWYY BBCABW TCAKTY TTCTCT YCTYTYY
UTR pn(4560-2846) UTR (4560-2846) UTR (4560-2846) UTR (4560-2846) UTR (4560-2846)
ciGGAATGA at 4555 ciAATGAG at 4556 ciGAATGA at 4555 - -
TTAATTC at 4542 - - - -
TCACATT at 4533 TTCACA at 4531 - - TCTCTTC at 4528
- - - - CCTCTCT at 4526
ciAGTCCAA at 4502 - - - -
- CCCACT at 4485 - - -
CCACTTT at 4461 TCCACT at 4459 - - -
- CCCAGA at 4448 - - -
CCACTCC at 4425 TCCACT at 4423 - - -
CCAGTTC at 4417 GCCAGT at 4415 - - -
- - - - ciGAAAAGA at 4392
- - - - ciAAAAAGA at 4387
- - - - ciAAAAAGA at 4380
ciAGTGTGA at 4361 ciTGTGAG at 4362 - - -
CTGCACT at 4340 TGCACT at 4340 - - -
- - ciACTGCA at 4338 - -
CCGGACT at 4327 ciACTGCA at 4330 - - -
- ciAGTGAG at 4320 - - -
- ciACTGCA at 4315 - - -
- TGCAGT at 4317 - - -
ciAAAATAA at 4221 GCCAGA at 4233 - - -
- ciAGTGAG at 4201 - - -
ciAGTTCAA at 4177 - - - -
ciAATGTGA at 4092 ciTGTGAG at 4093 - - -
ciAAAATAA at 4071 - - - -
ciAGACCAG at 4032 ciAGTGAG at 4050 - - -
ciAGTTCAA at 4026 - - - -
TCACACC at 3967 CTCACA at 3965 - - -
- ciTGTGGC at 3960 - - -
- - - - ciAAAGAGG at 3926
ciGGAGTAA at 3891 - - - -
ciGGACCAG at 3870 - - - -
CCATACC at 3858 CCCATA at 3856 - - -
- ciACTGCC at 3852 - - -
- ciTCTGGA at 3836 - - -
ciGATGTGG at 3810 - - - -
ciAATGCAG at 3772 ciAATGCA at 3771 - - -
CTGAACC at 3784 - - - -
ciGGACTGG at 3749 ciACTGGG at 3750 - - -
CTGGACT at 3747 - - - -
ciGGAACAG at 3725 - - - -
- ciTGTGGG at 3712 - - -
CCATTTC at 3688 TCCACA at 3692 - - -
ciAATCCAG at 3681 GCCATT at 3686 - - -
- ciAATGGG at 3660 - - -
- CTCAGA at 3644 - - -
- GGCACA at 3632 - - -
- GTCAGA at 3625 - - -
- GGCAGT at 3600 - - -
CTGCTCC at 3582 GGCAGA at 3589 - - -
- ciTGTGCC at 3561 - - -
CCAGATC at 3488 - - - -
ciAAACCAG at 3485 GGCAGT at 3478 - - -
ciGAACTAG at 3462 - - - -
- GGCATA at 3451 - - -
- GGCATA at 3445 - - -
- TGCAGA at 3431 - - -
- ciTGTGCA at 3429 - - -
ciGAAGTGA at 3410 ciAGTGAC at 3411 - ciAGAGAA at 3406 -
- - - - ciAAAGAGA at 3380
ciAAATTGA at 3358 - - - -
- - - - ciGAAAAGG at 3345
ciAGAGCAA at 3311 - - - -
TTGCACT at 3289 TGCACT at 3289 - - -
- GCCATT at 3284 - - -
- ciTGTGAG at 3268 - - -
TTGAACC at 3245 - - - -
ciGGTGTGG at 3186 - - - -
ciAGACCAG at 3123 - - - -
ciAAACTAA at 3030 - - - -
ciAAAATAA at 3013 - - - -
ciAGAATGG at 3004 ciAATGGC at 3005 - - -
- ciTGTGCA at 2863 - - -

Comparisons of Inrs for core promoters nn

Juven-Gershon (2008) Ngoc (2017) Butler (2002) Matsumoto (2020) Parry (2010)
YYRNWYY BBCABW TCAKTY TTCTCT YCTYTYY
core nn(2846-2811) core nn(2846-2811) core nn(2846-2811) core nn(2846-2811) core nn(2846-2811)
- - - - TCTTTTT at 2833
- - - TTCTCT at 2826 TCTCTTC at 2828
- - - TTCTCT at 2826 TCTTTTC at 2823
- - - - TCTTTTT at 2816
- - - - TCTCTTC at 2811

Comparisons of Inrs for core promoters pn

Juven-Gershon (2008) Ngoc (2017) Butler (2002) Matsumoto (2020) Parry (2010)
YYRNWYY BBCABW TCAKTY TTCTCT YCTYTYY
core pn(2846-2811) core pn(2846-2811) core pn(2846-2811) pncore (2846-2811) core pn(2846-2811)
ciAAAACAA at 2842 - - - -
- - - - ciAAAAAGA at 2836
- - - ciAGAGAA at 2827 ciAAAGAGA at 2826
- - - ciAGAGAA at 2827 ciGAAAAGA at 2824
- - - - ciAAAAAGA at 2819

Comparisons of Inrs for core promoters np

Table includes complement inverses (ci)

Juven-Gershon (2008) Ngoc (2017) Butler (2002) Matsumoto (2020) Parry (2010)
YYRNWYY BBCABW TCAKTY TTCTCT YCTYTYY
core np(4445-4265) core np(4445-4265) core np(4445-4265) core np(4445-4265) core (4445-4265)
ciGGAACAG at 4445 - - - -
ciGGTCTGG at 4416 ciTCTGGG at 4417 - - -
- ciTGTGGG at 4395 - - -
- - - TTCTCT at 4386 TCTTTCT at 4384
ciGGAGTGA at 4350 ciAGTGAG at 4351 - - -
CTGCACC at 4343 CTCACT at 4338 - - -
- CCCAGA at 4330 - - -
- TGCAGA at 4317 - - -
- CTCATT at 4309 - - -
- GTCAGT at 4271 - - ciGAAGAGG at 4266

Comparisons of Inrs for core promoters pp

Table includes complement inverses (ci)

Juven-Gershon (2008) Ngoc (2017) Butler (2002) Matsumoto (2020) Parry (2010)
YYRNWYY BBCABW TCAKTY TTCTCT YCTYTYY
core pp(4445-4265) core pp(4445-4265) core pp(4445-4265) core pp(4445-4265) core (4445-4265)
CCAGACC at 4416 CCCAGA at 4414 - - -
CCACTCC at 4401 CCCACT at 4399 - - -
ciAGAACGA at 4390 - - ciAGAGAA at 4387 ciAAAGAGA at 4386
- - - - ciAGAAAGA at 4384
ciGGTACGA at 4372 - - - -
ciAGTACAG at 4366 - - - -
- CTCACT at 4350 - - -
- ciAGTGAC at 4339 - - -
- ciTGTGAG at 4335 - - -
- ciAGTGGG at 4326 - - -
- ciTCTGCG at 4320 - - -
ciGGAGTAA at 4309 - - - -
- TCCAGT at 4269 - - -

Comparisons of Inrs for proximal promoters nn

Table includes complement inverses (ci)

Juven-Gershon (2008) Ngoc (2017) Butler (2002) Matsumoto (2020) Parry (2010)
YYRNWYY BBCABW TCAKTY TTCTCT YCTYTYY
proximal nn(2811-2596) proximal nn(2811-2596) proximal nn(2811-2596) proximal nn(2811-2596) proximal nn(2811-2596)
- - - TTCTCT at 2809 TCTCTTC at 2811
- - - - TCTTTTC at 2806
TCGTACT at 2784 ciACTGAG at 2787 - - TCTTTCT at 2802
TCGGACC at 2770 - - - -
ciAGTACGG at 2753 - - - -
- GTCACT at 2739 - - -
TTGGACC at 2720 - - - -
TCACACC at 2658 GTCACA at 2656 - - -
CCACTTT at 2619 - - - -
TTGTACC at 2614 - - - -
TCACACC at 2605 GTCACA at 2603 - - -

Comparisons of Inrs for proximal promoters pn

Table includes complement inverses (ci)

Juven-Gershon (2008) Ngoc (2017) Butler (2002) Matsumoto (2020) Parry (2010)
YYRNWYY BBCABW TCAKTY TTCTCT YCTYTYY
proximal pn(2811-2596) proximal pn(2811-2596) proximal pn(2811-2596) proximal pn(2811-2596) proximal pn(2811-2596)
- - - ciAGAGAA at 2810 ciAAAGAGA at 2809
- - - ciAGAGAA at 2810 ciGAAAAGA at 2807
- - - - ciAAAAAGA at 2798
CTGCACC at 2761 ciACTGCA at 2759 - - -
- GCCACT at 2756 - - -
- ciAGTGAG at 2740 - - -
- TGCAGT at 2737 - - -
TTGAACC at 2717 - - - -
TTGAATC at 2708 - - - -
- GGCACA at 2665 - - -
- GCCAGT at 2654 - - -
ciAAATCAG at 2649 - - - -
- TCCACT at 2632 - - CCTCTCC at 2629
- ciTGTGGC at 2606 - - -
ciAGACCAG at 2600 - - - -
- - - - - -

Comparisons of Inrs for proximal promoters np

Table includes complement inverses (ci)

Juven-Gershon (2008) Ngoc (2017) Butler (2002) Matsumoto (2020) Parry (2010)
YYRNWYY BBCABW TCAKTY TTCTCT YCTYTYY
proximal np(4265-4050) proximal np(4265-4050) proximal np(4265-4050) proximal np(4265-4050) proximal np(4265-4050)
- CTCAGA at 4195 - - -
TTAGTTT at 4139 - - - -
ciGATTTAG at 4136 - - - -
TTGATTT at 4134 - - - -
TCACTCT at 4128 - - - -
TCATTTT at 4120 - TCATTT at 4119 - -
ciGAAATGA at 4094 ciAATGAG at 4095 ciAAATGA at 4094 - -
- ciACTGAA at 4090 - - -
ciAGAACAG at 4069 - - - -

Comparisons of Inrs for proximal promoters pp

Table includes complement inverses (ci)

Juven-Gershon (2008) Ngoc (2017) Butler (2002) Matsumoto (2020) Parry (2010)
YYRNWYY BBCABW TCAKTY TTCTCT YCTYTYY
proximal pp(4265-4050) proximal pp(4265-4050) proximal pp(4265-4050) proximal pp(4265-4050) proximal pp(4265-4050)
- ciTGTGCC at 4259 - - -
ciGGACTGG at 4216 ciACTGGG at 4217 - - -
ciGAAACGG at 4210 - - - -
- ciAGTGGG at 4204 - - -
ciAAATCAA at 4138 - - - -
ciGAACTAA at 4133 - - - -
ciAAAATAG at 4123 ciAGTGAG at 4127 - - -
CTACTCC at 4102 - - - -
TTACTCC at 4096 ciAGTGAC at 4088 - - -
- CGCAGA at 4056 - - -

Comparisons of Inrs for distal promoters nn

Table includes complement inverses (ci)

Juven-Gershon (2008) Ngoc (2017) Butler (2002) Matsumoto (2020) Parry (2010)
YYRNWYY BBCABW TCAKTY TTCTCT YCTYTYY
distal nn(2596-1) distal nn(2596-1) distal nn(2596-1) distal nn(2596-1) distal nn(2596-1)
CCAGTCC at 2587 TCCAGT at 2585 - - -
- ciAGTGAA at 2578 - - -
- ciTGTGAA at 2551 - - -
ciAGTACGG at 2535 - - - -
CCGGTCC at 2519 - - - -
TCATTCT at 2503 - TCATTC at 2502 - -
TTGTTTT at 2490 - - - -
TCGTTTT at 2476 - - - -
- - - - CCTTTTT at 2460
TCACTCT at 2449 CTCACT at 2447 - - -
TCGGACC at 2435 - - - -
ciAGTGTGG at 2418 - - - -
- GTCACT at 2404 - - -
TTGGACC at 2385 - - - -
CCACTTT at 2282 - - - -
TCGTACC at 2277 - - - -
TCGGACC at 2268 - - - -
TCAAACT at 2257 - - - -
CCAGTCC at 2250 TCCAGT at 2248 - - -
ciAGTGCGG at 2208 ciAGTGCG at 2207 - - -
- ciACTGGC at 2190 - - -
- ciTATGAC at 2162 - - -
CCGCTTT at 2157 - - - -
TTGTACC at 2152 - - - -
TCAAACT at 2141 - - - -
TCACATT at 2087 GTCACA at 2085 - - -
CCGGTCC at 2077 - - - -
TTACACC at 2065 - - - -
- - - - TCTTTTT at 2057
TCGTTCT at 2023 ciTCTGAG at 2026 - - -
TCGGACC at 2009 - - - -
ciAGTGCGG at 1992 ciAGTGCG at 1991 - - -
- GTCACT at 1978 - - -
TTGGACC at 1959 - - - -
CCGTACT at 1953 - - - -
- ciTCTGAC at 1934 - - -
CCGCACC at 1897 - - - -
ciGGACCGA at 1843 - - - -
- TGCAGA at 1774 - - -
ciAGTGCAG at 1773 ciAGTGCA at 1772 - - -
TTATACC at 1742 - - - -
ciAAAATAG at 1730 - - - -
TTAATTT at 1697 - - - -
- ciTCTGAA at 1617 - - -
ciAGAACGG at 1608 - - - -
TTGGATT at 1591 - - - -
TTACTTT at 1582 - - - -
CCGTTTT at 1561 - - - -
TTGCTTC at 1555 - - - -
- ciTGTGAA at 1544 - - -
ciGATATAG at 1528 - - - -
- GGCAGT at 1511 - - -
- ciAGTGAC at 1492 - - -
CCACACT at 1479 - - - -
ciGGTCCGA at 1462 - - - -
ciAGAGCGA at 1448 CTCAGA at 1444 - - -
- ciTCTGAG at 1403 - - -
TTGTTTT at 1394 - - - -
TCGTTTT at 1371 - - - -
TTATTCT at 1365 - - - -
TCAGACC at 1356 GTCAGA at 1354 - - -
- GTCACT at 1325 - - -
TTGGATC at 1306 - - - -
- ciAATGAA at 1298 - - -
CCGCACC at 1244 - - - -
CCACTTT at 1212 - - - -
- GGCACA at 1220 - - -
TTGTACC at 1207 - - - -
ciGGACCGG at 1200 - - - -
TCGGACC at 1198 - - - -
- ciAGTGGA at 1171 - - -
ciAGTGTGG at 1128 ciTGTGGA at 1129 - - -
- ciTCTGAG at 1082 - - -
TCACTCT at 1079 CTCACT at 1077 - - -
ciGAAGTGA at 1056 ciAGTGAG at 1057 - - -
- ciACTGAA at 1052 - - -
- CCCACT at 1049 - - -
- GTCACT at 1034 - - -
TTGGACC at 1015 - - - -
TTAGTCC at 984 - - - -
- ciTGTGCG at 963 - - -
CCGTACC at 953 - - - -
TCGGTCC at 948 - - - -
TCGCTCT at 913 ciTCTGAG at 916 - - -
TCGGACC at 899 - - - -
ciAGTGTGG at 882 - - - -
TCGGTTC at 874 - - - -
- GCCACT at 868 - - -
CTACACC at 787 - - - -
- GGCAGA at 754 - - -
- ciTGTGGG at 749 - - -
TCGCACC at 741 - - - -
ciGGACTGG at 734 - - - -
TCGGACT at 732 - - - -
CCAGTCC at 714 TCCAGT at 712 - - -
CCGGTTC at 692 - - - -
ciAGTGCGG at 664 ciAGTGCG at 663 - - -
CCGGTCC at 648 - - - -
- - - - ciGAAGAGA at 622
TTATACC at 605 - - - -
ciGGACCGA at 598 - - - -
CCAGTCC at 578 TCCAGT at 576 - - -
- TCCAGT at 568 - - -
CCGGTTC at 556 - - - -
- TGCATT at 533 - - -
- ciTGTGCA at 531 - - -
TCGGACC at 508 - - - -
TCACTTT at 473 - - - -
TTGTATC at 468 - - - -
TCGGACC at 459 - - - -
CCAGTCC at 441 TCCAGT at 439 - - -
CCGGTTC at 419 - - - -
- ciTGTGCA at 342 - - -
- TTCACA at 322 - - -
CTGCTTT at 312 - - - -
TCACTCT at 301 GTCACT at 299 - - -
TTATACT at 274 CTCAGA at 278 - - -
TTGGTCC at 262 - - - -
CTACATT at 247 - - - -
ciGATACAA at 213 - - - -
- CCCAGT at 206 - - -
CCATATT at 181 TCCATA at 179 - - -
CCGTACT at 124 - - - -
- - - - TCTTTTC at 104
CCGTTTC at 93 - - - -
CTATACC at 77 - - - -
TTGTTCC at 71 - - - -
- ciTGTGGA at 62 - - -
- - - - TCTTTTC at 54
- ciTCTGAC at 16 - - -

Comparisons of Inrs for distal promoters pn

Table includes complement inverses (ci)

Juven-Gershon (2008) Ngoc (2017) Butler (2002) Matsumoto (2020) Parry (2010)
YYRNWYY BBCABW TCAKTY TTCTCT YCTYTYY
distal pn(2596-1) distal pn(2596-1) distal pn(2596-1) distal pn(2596-1) distal pn(2596-1)
ciAAAACAA at 2509 - - - -
ciAAAGCAA at 2480 - - - -
ciAAAGCAA at 2474 - - - -
ciGATTCGG at 2454 - - - -
ciAGAGTGA at 2447 ciAGTGAG at 2448 - - -
CTGCACT at 2426 TGCACT at 2426 - - -
- TGCAGT at 2402 - - -
TTGAACC at 2382 - - - -
CTACTCC at 2352 - - - -
ciAAACTAG at 2313 - - - -
ciAATACAA at 2305 - - - -
ciAGACCAG at 2263 - - - -
- GCCAGT at 2211 - - -
ciGGTGCGG at 2197 - - - -
ciAAAATGA at 2187 ciAATGAC at 2188 ciAAATGA at 2187 - -
ciGATACAA at 2180 - - - -
ciAGACCAA at 2147 - - - -
ciAGTTTGA at 2141 - - - -
ciAGTGTAA at 2087 - - - -
ciGGTGCAG at 2082 TGCAGT at 2083 - - -
ciAATGTGG at 2065 ciTGTGGC at 2066 - - -
- - - - ciAAAAAGA at 2053
ciAGAGCAA at 2021 - - - -
CTGCACT at 2000 TGCACT at 2000 - - -
- GCCACT at 1995 - - -
- TGCAGT at 1976 - - -
- GGCAGA at 1967 - - -
ciAGAATGG at 1948 ciAATGGC at 1949 - - -
ciAGACTGA at 1935 ciACTGAG at 1936 ciGACTGA at 1935 - -
ciAAATTAG at 1887 - - - -
ciAATACAA at 1878 - - - -
- TGCAGA at 1774 - - -
ciAATATGG at 1742 ciTATGGC at 1743 - - -
TTATTTT at 1727 - - - -
- TGCACA at 1719 - - -
ciGAATTAA at 1696 - ciAAATGA at 1700 - -
ciAAAGCGG at 1680 - - - -
ciGAAATGA at 1663 - ciAAATGA at 1663 - -
- ciAATGCC at 1634 - - -
- - - - ciGAAAAGA at 1628
ciGAAACAA at 1585 ciAATGAA at 1581 ciAAATGA at 1580 - -
ciAATACAG at 1566 - - - -
ciAGAACGA at 1553 - - - -
ciAGTGCAA at 1536 ciAGTGCA at 1535 - - -
- TCCAGT at 1532 - - -
- CCCAGA at 1518 - - -
- CTCACT at 1491 - - -
ciGGTGTGA at 1479 - - - -
AGTGCAG at 1471 TGCAGT at 1472 - - -
TCGCTCT at 1450 - - - -
- - - - TCTTTTT at 1420
- CCCAGA at 1411 - - -
- - - - ciAAAAAGA at 1400
ciAAAACAA at 1388 - - - -
CCATTTC at 1380 TCCATT at 1378 - - -
ciAGAGCAA at 1369 - - - -
ciAGTCTGG at 1356 ciTCTGGG at 1357 - - -
CCAGTCT at 1354 TCCAGT at 1352 - - -
TTGCACT at 1347 TGCACT at 1347 - - -
TTGCACC at 1339 - - - -
- TGCAGT at 1323 - - -
- GGCAGA at 1314 - - -
TTGAACC at 1303 - - - -
ciAAATTAG at 1234 - - - -
- CTCACA at 1126 - - -
- GGCACA at 1116 - - -
- - - - ciAAAAAGG at 1107
ciAGAGTGA at 1077 ciAGTGAG at 1078 - - -
TCACTCC at 1058 TTCACT at 1056 - - -
ciAGATTGG at 1045 - - - -
- TGCAGT at 1032 - - -
- GGCAGA at 1023 - - -
TTGAACC at 1012 - - - -
ciGATCCAG at 975 - - - -
- GGCACA at 960 - - -
ciAGAGCGA at 911 - - - -
TCACACC at 882 - - - -
TTGAACC at 846 - - - -
ciGATGTGG at 787 - - - -
ciAAATTAG at 777 - - - -
ciAATACAA at 769 - - - -
ciAGACCAG at 727 - - - -
ciAGTTCGA at 721 - - - -
ciAAATTGG at 643 - - - -
ciAATACAA at 635 - - - -
- - - TTCTCT at 622 -
ciAATATGG at 605 - - - -
ciAGATTGA at 585 - - - -
- ciAGTGGA at 523 - - -
- GGCACA at 518 - - -
ciAAATTAG at 499 - - - -
ciAATACGA at 492 - - - -
- ciAGTGAA at 472 - - -
ciAGTGCGA at 448 ciAGTGCG at 447 - - -
ciGGTGCGG at 380 - - - -
- - - - ciGAAGAGG at 370
ciAAACTGA at 307 ciACTGAC at 308 ciAACTGA at 307 - -
- ciAGTGAG at 300 - - -
ciAGAACAG at 288 - - - -
ciAATATGA at 274 ciTATGAG at 275 - - -
- GGCACA at 266 - - -
ciAAACCAG at 261 - - - -
ciAGTTCAA at 255 - - - -
ciGATGTAA at 247 - - - -
ciGAAACAA at 229 - - - -
- GTCACT at 208 - - -
GGTATAA at 181 - - - -
ciAAAACAG at 167 - - - -
CTGCATT at 152 TGCATT at 152 - - -
ciAAACTGA at 130 ciACTGAA at 131 ciAACTGA at 130 - -
- - - - ciGAAAAGG at 105
ciGATATGG at 77 ciTATGGG at 78 - - -
ciAAAACAA at 69 - - - -
- - - - ciGAAAAGA at 55
- GCCATA at 39 - - -
ciGGACCAG at 34 - - - -
TTGGACC at 32 - - - -
- - - - TCTTTTT at 27
CTGAATT at 20 - - - -
ciAGACTGA at 17 ciACTGAA at 18 ciGACTGA at 17 - -

Comparisons of Inrs for distal promoters np

Table includes complement inverses (ci)

Juven-Gershon (2008) Ngoc (2017) Butler (2002) Matsumoto (2020) Parry (2010)
YYRNWYY BBCABW TCAKTY TTCTCT YCTYTYY
distal np(4050-1) distal np(4050-1) distal np(4050-1) distal np(4050-1) distal np(4050-1)
ciAGAGTGG at 4040 ciAGTGGG at 4041 - - -
- TCCACT at 4013 - - -
- GGCACT at 4006 - - -
ciGGTGTGA at 3971 ciTGTGAC at 3972 - - -
ciAGTGTGG at 3966 TGCAGT at 3962 - - -
- GTCACA at 3954 - - -
- ciTGTGCA at 3960 - - -
- - - - ciAAAGAGG at 3931
ciAGTCTGA at 3924 ciTCTGAA at 3925 - - ciGAAAAGA at 3929
- CGCAGA at 3916 - - -
- ciTGTGAG at 3904 - - -
- TCCAGA at 3891 - - -
ciAGAGTGA at 3876 ciAGTGAG at 3877 - - -
ciGAACCAG at 3840 - - - -
ciAGAATGA at 3835 ciAATGAA at 3836 ciGAATGA at 3835 - -
ciAGAATGA at 3835 TGCAGA at 3831 - - -
TCACACC at 3824 GTCACA at 3822 - - -
ciAATCCGA at 3799 TCCAGA at 3806 - - -
- ciAATGAC at 3783 ciGAATGA at 3782 - -
- ciACTGAG at 3736 ciAACTGA at 3735 - -
- ciAGTGAC at 3713 - - -
- GCCACA at 3705 - - -
ciGAAGCGG at 3670 - - - -
- - - - ciGGAGAGG at 3652
CTGTTCC at 3625 - - - -
ciAGTGTGA at 3594 ciTGTGAA at 3595 - - -
ciGGAATGA at 3567 ciAATGAC at 3568 ciGAATGA at 3567 - -
CCAGACC at 3550 - - - -
ciGGACCAG at 3547 - - - -
TCACACT at 3507 CTCACA at 3505 - - -
- GGCAGA at 3473 - - -
ciAGTGCAG at 3465 ciAGTGCA at 3464 - - -
ciGATGCAG at 3460 TGCAGT at 3461 - - -
- ciAGTGGG at 3450 - - -
- ciAATGAG at 3446 ciGAATGA at 3445 - -
ciGGAATGA at 3441 ciAATGAA at 3442 ciGAATGA at 3441 - -
- ciTGTGGA at 3437 - - -
- ciAATGCC at 3431 - - -
- GGCACA at 3409 - - -
- ciTCTGGC at 3406 - - -
TTGCATC at 3402 - - - -
- CCCACT at 3388 - - -
- CCCAGT at 3379 - - -
- ciTCTGCC at 3359 - - -
CTGTTCC at 3352 - - - -
- ciACTGGC at 3346 - - -
TTGCACT at 3343 TGCACT at 3343 - - -
CCGCATC at 3328 - - - -
CTGCACC at 3322 ciACTGCA at 3320 - - -
- CTCACT at 3317 - - -
CTGCTCC at 3309 - - - -
CTGGTCT at 3299 - - - -
- TGCAGT at 3281 - - -
TCGCTCT at 3276 ciTCTGCA at 3279 - - -
- ciTCTGCA at 3268 - - -
- ciTATGAG at 3261 - - -
CTGGTCT at 3245 - - - -
- TGCAGT at 3232 ciAGTGCC at 3235 - -
- GCCAGA at 3221 - - -
- CTCACA at 3209 - - -
- TCCACA at 3192 - - -
ciGGACCAA at 3174 - - - -
ciGAAATGG at 3168 ciAATGGG at 3169 - - -
ciAATATGG at 3162 ciTATGGA at 3163 - - -
- ciTCTGAG at 3124 - - -
- ciACTGGC at 3118 - - -
- CCCAGA at 3091 - - -
CCAGTCC at 3084 CCCAGT at 3082 - - -
- TGCATT at 3072 - - -
- ciAATGCA at 3070 - - -
- ciTCTGCA at 3061 - - TCTCTTC at 3057
- ciTATGAC at 3028 - - -
ciGGTCTGG at 3021 - - - -
- ciAGTGCC at 3011 - - -
- ciTCTGAG at 3007 - - -
CCAGTCC at 2998 - - - -
- ciTCTGGC at 2984 - - -
CTGCTCC at 2978 - - - -
- TGCACA at 2962 - - -
- ciTGTGCA at 2960 - - -
ciTCTGAG at 2951 - - -
ciGGTCTGA at 2943 ciTCTGAC at 2944 - - -
- TTCAGT at 2936 - - -
- GTCACT at 2929 - - -
- ciAATGGG at 2911 - - -
- CTCATT at 2902 - - -
- ciTCTGGC at 2884 - - -
ciGATTTGA at 2871 - - - -
TCAGATT at 2868 CTCAGA at 2866 - - -
- TGCAGA at 2859 - - -
- ciTCTGCA at 2857 - - -
ciAGAATGA at 2841 ciAATGAC at 2842 ciGAATGA at 2841 - -
- ciACTGCC at 2823 - - -
ciGGTGCAA at 2801 - - - -
- CTCAGA at 2729 - - -
- TGCAGA at 2721 - - -
ciAAAGTGG at 2711 ciAGTGGA at 2712 - - -
ciAGAGCAA at 2705 - - - -
- CTCAGA at 2699 - - -
- ciTGTGCA at 2681 - - -
ciGGACTGA at 2674 - ciGACTGA at 2674 - -
ciGATATAA at 2662 - - - -
CCACACT at 2636 - - - -
ciGAAATAG at 2626 - - - -
- GTCAGA at 2609 - - -
CCACACC at 2602 - - - -
TTATACC at 2590 - - - -
CCGCACC at 2566 - - - -
CTAATTT at 2440 - - - -
CTACACC at 2430 GTCACT at 2425 - - -
ciGGTGCAA at 2335 - - - -
ciAGTGCAG at 2327 TGCAGT at 2328 - - -
ciAGTGCAG at 2327 ciAGTGCA at 2326 - - -
TCACTCT at 2306 TTCACT at 2304 - - -
- - - - TCTTTTT at 2280
CTGTTTC at 2263 - - - -
TCAATCT at 2235 CTCAGA at 2239 - - -
ciAGATCAA at 2232 - - - -
CCAGATC at 2230 - - - -
ciGAACCAG at 2227 - - - -
- GTCAGA at 2222 - - -
CTGCATT at 2206 TGCATT at 2206 - - -
CTGCATT at 2206 ciACTGCA at 2204 - - -
TCATATT at 2178 CTCATA at 2176 - - -
- ciTATGGC at 2160 - - -
TCGCTTC at 2095 TTCAGT at 2098 TCAGTC at 2099 - -
- GCCACT at 2072 - - -
- ciAGTGGC at 2068 - - -
ciAGTGCAG at 2064 TGCAGT at 2065 - - -
ciAGTGCAG at 2064 ciAGTGCA at 2063 - - -
- CTCAGT at 2060 - - -
CCAGTCC at 2026 TCCACA at 2029 - - -
CCAGTCC at 2026 CCCAGT at 2024 - - -
ciAAAGCAG at 2007 - - - -
- GGCACT at 1996 - - -
- GGCACT at 1996 ciTCTGGC at 1993 - -
- - ciTCTGGC at 1993 TTCTCT at 1990 -
CTATTTC at 1978 - - - -
ciGGTGTGG at 1971 ciTGTGGC at 1972 - - -
ciGAACTGG at 1953 ciACTGGG at 1954 - - -
- TGCAGA at 1937 - - -
CCACTTC at 1914 TCCACT at 1912 - - -
- ciTCTGGG at 1865 - - -
- TGCACA at 1822 - - -
ciGGTGTGG at 1805 ciTGTGGA at 1806 - - -
ciAGTGCAG at 1787 ciAGTGCA at 1786 - - -
ciGGTGCGG at 1764 - - - -
CCAGACT at 1744 CCCAGA at 1742 - - -
- ciAGTGCG at 1725 - - -
- GGCATT at 1702 - - -
ciGAAGCGG at 1636 - - - -
ciAGTGCGG at 1590 ciAGTGCG at 1589 - - -
- CGCACA at 1556 - - -
- ciACTGCA at 1505 - - -
- CCCACT at 1502 - - -
- ciTCTGCG at 1496 - - -
- ciTCTGGC at 1477 - - -
CTGCACT at 1472 TGCACT at 1472 - - -
ciAATGCGG at 1422 ciAATGCG at 1421 - - -
- CGCAGA at 1416 - - -
- ciTCTGCG at 1396 - - -
- ciTCTGGC at 1377 - - -
CTGCACT at 1372 TGCACT at 1372 - - -
ciAATGCGG at 1322 ciAATGCG at 1321 - - -
- CGCAGA at 1316 - - -
- ciACTGAG at 1287 - - -
ciAGTGCGG at 1254 ciAGTGCG at 1253 - - -
ciAGTGCGG at 1254 CCCAGT at 1250 - - -
- TGCACA at 1220 - - -
ciAGTGCGG at 1170 ciAGTGCG at 1169 - - -
- ciAGTGCG at 1160 - - -
- CGCACA at 1136 - - -
- - - - ciGGAAAGG at 1091
ciAGTGCGG at 1086 ciAGTGCG at 1085 - - -
- CGCACA at 1052 - - -
- ciTGTGGC at 1023 - - -
- ciACTGCG at 1001 - - -
- GCCACA at 984 - - -
- GCCAGA at 935 - - -
- ciTGTGGC at 919 - - -
- ciACTGCC at 901 - - -
- GCCACA at 884 - - -
- GCCAGA at 835 - - -
- ciTGTGGC at 819 - - -
- CGCACA at 800 - - -
ciGGTGCAG at 784 - - - -
CCGGACT at 746 ciACTGCG at 749 - - -
- CGCACT at 686 - - -
ciAGTGCGG at 666 ciAGTGCG at 665 - - -
- TCCACA at 632 - - -
ciAGTGCGG at 582 ciAGTGCG at 581 - - -
- TGCACA at 548 - - -
ciAGTGCGG at 498 ciAGTGCG at 497 - - -
ciGGTGCGG at 489 - - - -
- CCCAGA at 468 - - CCTCTCC at 464
ciAGACCGG at 442 - - - -
- TGCAGA at 438 - - -
ciGGAGCGA at 429 - - - -
- CGCAGA at 396 - - -
- ciACTGGG at 348 - - -
CCACACT at 345 GCCACA at 343 - - -
- ciTCTGGA at 271 - - -
- ciTCTGAG at 256 - - CCTCTCT at 253
- ciACTGCC at 238 - - -
ciAATGTGA at 230 ciTGTGAA at 231 - - -
- ciTCTGCA at 224 - - -
- CCCAGA at 204 - - -
- GTCACA at 155 - - -
CTGTTTT at 147 - - - -
- - - TTCTCT at 139 TCTCTCC at 141
- - - - CCTTTTC at 136
TTGTATT at 115 - - TTCTCT at 119 -
ciAGAGTGG at 53 ciAGTGGG at 54 - - -
- GGCATT at 22 - - -
- TCCAGA at 15 - - -

Comparisons of Inrs for distal promoters pp

Table includes complement inverses (ci)

Juven-Gershon (2008) Ngoc (2017) Butler (2002) Matsumoto (2020) Parry (2010)
YYRNWYY BBCABW TCAKTY TTCTCT YCTYTYY
distal pp(4050-1) distal pp(4050-1) distal pp(4050-1) distal pp(4050-1) distal pp(4050-1)
ciGAACTGG at 4018 ciACTGGA at 4019 - - -
CCACACT at 3971 - - - -
TCACACC at 3966 GTCACA at 3964 - - -
- TCCACT at 3934 - - -
TCAGACT at 3924 TTCAGA at 3922 - - -
TCACTCC at 3878 CTCACT at 3876 - - -
- - - - CCTCTTC at 3851
- GTCACT at 3843 - - -
ciAGTGTGG at 3824 CCCAGT at 3820 - - -
CTGGACC at 3787 ciACTGGA at 3785 - - -
- TCCAGA at 3771 - - -
CCGGACC at 3758 - - - -
- ciAGTGCC at 3748 - - -
- TCCATT at 3731 - - -
- CTCACT at 3712 - - -
ciGGACCGG at 3681 - - - -
CCGGACC at 3679 - - - -
CCACTCC at 3647 - - - CCTCTCC at 3652
ciAGAGTGG at 3612 ciAGTGGG at 3613 - - -
ciAGAGTGG at 3612 GCCAGA at 3608 - - -
TCACACT at 3594 CTCACA at 3592 - - -
ciGGTCTGG at 3550 ciTCTGGA at 3551 - - -
CTGGTCT at 3548 - - - -
- ciTGTGGG at 3533 - - -
ciGATCCGA at 3524 - - - -
TCGATCC at 3522 - - - -
ciAGTGTGA at 3507 ciTGTGAG at 3508 - - -
CCGATCC at 3484 - - - -
ciGGAACGG at 3375 - - - -
ciAGAGTGA at 3317 ciAGTGAC at 3318 - - -
ciGGACCAG at 3298 - - - -
ciAGTGCAG at 3255 TGCAGA at 3256 - - -
ciAGTGCAG at 3255 ciAGTGCA at 3254 - - -
ciGAAGTAG at 3250 - - - -
TCGGTCT at 3221 - - - -
TCGGTCT at 3221 - - - -
- CTCAGA at 3187 - - -
CTGGTTT at 3175 - - - -
TTATACC at 3162 - - - -
- - - ciAGAGAA at 3056 -
ciGGACCAA at 3049 - - - -
ciAGTCCGG at 3036 - - - -
- ciACTGAA at 3030 - - -
ciAGACCAA at 3023 - - - -
CCAGACC at 3021 - - - -
ciGGTCCAG at 3018 TCCAGA at 3019 - - -
ciGGAACAG at 3003 - - - -
ciGGACCGG at 2990 - - - -
CCGGACC at 2988 - - - -
ciAGACCGG at 2985 - - - -
- ciTGTGGG at 2965 - - -
ciAGACTGA at 2945 ciACTGAA at 2946 - - -
CCAGACT at 2943 - ciGACTGA at 2945 - -
- ciAGTGAC at 2930 - - -
ciGGAGTAA at 2902 - - - -
ciAGACCGA at 2885 - - - -
ciGGTCCGG at 2878 - - - -
CTGGTCC at 2876 - - - -
ciAAACTGG at 2873 - - - -
CTAAACT at 2871 - - - -
ciAGTCTAA at 2868 - - - -
- ciTCTGGA at 2862 - - -
TTGCTCC at 2806 - - - -
TCGATTC at 2789 - - - -
- ciTATGAA at 2740 - - -
TCGTTTT at 2707 - - - -
TCAATCC at 2668 - - - -
CTATATT at 2662 - - - -
- TCCATA at 2642 - - -
ciGGTGTGA at 2636 - - - -
TCAGTCC at 2620 TTCAGT at 2618 TCAGTC at 2619 - -
ciAGTTCAG at 2617 - - - -
TCAGTTC at 2615 CTCAGT at 2613 TCAGTT at 2614 - -
TCAGTCT at 2609 GTCAGT at 2607 TCAGTC at 2608 - -
ciGGTGTGG at 2602 - - - -
ciAATATGG at 2590 - - - -
CCGGTCC at 2574 - - - -
ciGGACCGG at 2571 - - - -
CCGCACT at 2555 CGCACT at 2555 - - -
- TTCACT at 2511 - - -
- CCCAGA at 2489 - - -
ciGGTACAA at 2475 - - - -
ciAGAGTGG at 2470 - - - -
- GTCACA at 2464 - - -
ciGGACCGA at 2435 - - - -
ciGATGTGG at 2430 ciTGTGGA at 2431 - - -
- CGCAGT at 2423 - - -
- ciTCTGAA at 2417 - - -
- TCCACT at 2375 - - -
ciAATCCGA at 2368 - - - -
- ciAGTGAC at 2341 - - -
ciGGTCCGA at 2318 - - - -
- ciAGTGGG at 2313 - - -
ciAAAGTGA at 2304 ciAGTGAG at 2305 - - -
- - - - ciGAAAAGA at 2276
- TCCAGA at 2258 - - -
ciAGAGTGG at 2247 ciAGTGGA at 2248 - - -
ciGGTCTAG at 2230 - - - -
TTGGTCT at 2228 - - - -
CCAGTCT at 2222 TCCAGT at 2220 - - -
ciGGACTGG at 2213 ciACTGGC at 2214 - - -
CCGTTCT at 2190 - - - -
ciAGTATAA at 2178 - - - -
CTACTTT at 2146 - - - -
TTGTACT at 2141 - - - -
TCAATTT at 2136 - - - -
- TCCACT at 2128 - - -
ciGAAGTAG at 2110 - - - -
- GTCAGT at 2100 TCAGTC at 2101 - -
CCACACC at 1971 TCCACA at 1969 - - -
- CCCAGA at 1958 - - -
CCGTTCT at 1948 - - - -
CCGCTCT at 1921 - - - -
ciAGAATGG at 1888 ciAATGGG at 1889 - - -
ciGGTCCGG at 1857 - - - -
ciGGACCGA at 1817 - - - -
CCACACC at 1805 CCCACA at 1803 - - -
ciGGTCTGA at 1744 ciTCTGAA at 1745 - - -
CCGCACT at 1720 CGCACT at 1720 - - -
- CCCAGA at 1711 - - -
- ciTGTGCC at 1698 - - -
ciGGACTGG at 1662 ciACTGGG at 1663 - - -
ciGATGCGA at 1576 - - - -
CCGCTCT at 1565 - - - -
- ciTGTGCC at 1559 - - -
ciAATTCGG at 1541 - - - -
TCGTTCC at 1511 - - - -
CCGCTCT at 1481 - - - -
CCGTTCC at 1427 - - - -
ciGAAGCGG at 1408 - - - -
CCGCTCT at 1381 - - - -
CCGTTCC at 1327 - - - -
ciGAAGCGG at 1308 - - - -
CCGTTCC at 1259 - - - -
CCGCTCT at 1229 - - - -
- ciTGTGCC at 1223 - - -
ciAAAGCAG at 1183 - - - -
ciGGTCCGA at 1177 - - - -
CCGGTCC at 1175 - - - -
- ciTGTGAC at 1139 - - -
TCGCTCT at 1061 - - - CCTTTCC at 1091
TCGCTCT at 1061 - - - -
- CGCACA at 1020 - - -
CCGTTCC at 1007 - - - -
- ciTGTGCG at 987 - - -
ciGGACCGG at 949 - - - -
TTGGACC at 947 - - - -
TCGGTCT at 935 - - - -
CCGTTCC at 923 - - - -
- ciTGTGCG at 887 - - -
ciGGACCGG at 849 - - - -
TTGGACC at 847 - - - -
TCGGTCT at 835 - - - -
CCGTTCC at 823 - - - -
- ciTGTGCG at 803 - - -
ciGGTGCGA at 777 - - - -
CCGGACT at 725 - - - -
CCGTTCC at 671 - - - -
ciGATGCGA at 652 - - - -
CCGCTCT at 641 - - - -
ciGAAGCGG at 595 - - - -
CCGTTCC at 587 - - - -
- ciTGTGCA at 569 - - -
CCGCTCT at 557 - - - -
ciAGAATGA at 524 ciAATGAA at 525 ciGAATGA at 524 - -
TCGGTCC at 515 - - - -
CCGTTCC at 503 - - - -
ciGAAGCGG at 459 - - - -
- ciTCTGGC at 441 - - -
- ciTCTGCC at 399 - - -
ciGGTGTGA at 345 ciTGTGAC at 346 - - -
CCGGACC at 286 - - - -
- ciTCTGAC at 236 - - -
TTACACT at 230 - - - -
ciGGTCCAG at 217 - - - -
CCGGTCC at 215 - - - -
ciAATCCAG at 152 TCCAGT at 153 - - -
- - - - ciAAAGAGA at 139
- - - - ciGAAAAGA at 137
ciAGTCCGG at 92 - - - -
- - - - CCTCTTC at 48
CTGGACC at 40 - - - -
ciGGTCCGA at 10 - - - -

Acknowledgements

The content on this page was first contributed by: Henry A. Hoff.

Initial content for this page in some instances came from Wikiversity.

See also

References

  1. Smale, Stephen T.; Baltimore, David (1989-04-07). "The "initiator" as a transcription control element". Cell. 57 (1): 103–113. doi:10.1016/0092-8674(89)90176-1. PMID 2467742.
  2. 2.0 2.1 Gershenzon, Naum I.; Ioshikhes, Ilya P. (2005-04-15). "Synergy of human Pol II core promoter elements revealed by statistical sequence analysis". Bioinformatics. 21 (8): 1295–1300. doi:10.1093/bioinformatics/bti172.
  3. Lim, Chin Yan; Santoso, Buyung; Boulay, Thomas; Dong, Emily; Ohler, Uwe; Kadonaga, James T. (2004-07-01). "The MTE, a new core promoter element for transcription by RNA polymerase II". Genes & Development. 18 (13): 1606–1617. doi:10.1101/gad.1193404. PMC 443522. PMID 15231738.
  4. Kaufmann, J.; Smale, S. T. (1994-04-01). "Direct recognition of initiator elements by a component of the transcription factor IID complex". Genes & Development. 8 (7): 821–829. doi:10.1101/gad.8.7.821. PMID 7926770.
  5. O'Shea-Greenfield, A.; Smale, S. T. (1992-01-15). "Roles of TATA and initiator elements in determining the start site location and direction of RNA polymerase II transcription". The Journal of Biological Chemistry. 267 (2): 1391–1402. PMID 1730658.
  6. 6.0 6.1 6.2 6.3 6.4 Yang, Chuhu; Bolotin, Eugene; Jiang, Tao; Sladek, Frances M.; Martinez, Ernest (2007-03-01). "Prevalence of the Initiator over the TATA box in human and yeast genes and identification of DNA motifs enriched in human TATA-less core promoters". Gene. 389 (1): 52–65. doi:10.1016/j.gene.2006.09.029. PMC 1955227. PMID 17123746.
  7. 7.0 7.1 Ngoc, Long Vo; Cassidy, California Jack; Huang, Cassidy Yunjing; Duttke, Sascha H. C.; Kadonaga, James T. (2017-01-20). "The human initiator is a distinct and abundant element that is precisely positioned in focused core promoters". Genes & Development. doi:10.1101/gad.293837.116. PMC 5287114. PMID 28108474.
  8. Javahery, R; Khachi, A; Lo, K; Zenzie-Gregory, B; Smale, S T (1994-01-01). "DNA sequence requirements for transcriptional initiator activity in mammalian cells". Molecular and Cellular Biology. 14 (1): 116–127. doi:10.1128/mcb.14.1.116. PMC 358362. PMID 8264580.
  9. 9.0 9.1 Gillian E. Chalkley and C. Peter Verrijzer (September 1, 1999). "DNA binding site selection by RNA polymerase II TAFs: a TAFII250-TAFII150 complex recognizes the Initiator" (PDF). The EMBO Journal. 18 (17): 4835–45. PMID 10469661. Retrieved 2012-04-26.
  10. 10.0 10.1 J. Carcamo, L. Buckbinder, D. Reinberg (1991). "The initiator directs the assembly of a transcription factor IID-dependent transcription complex". Proceedings of the National Academy of Sciences USA. 88 (18): 8052–6. doi:10.1073/pnas.88.18.8052. Retrieved 2012-05-29.
  11. L. Weis and D. Reinberg (1997). "Accurate positioning of RNA polymerase II on a natural TATA-less promoter is independent of TATA-binding protein associated factors and initiator-binding proteins" (PDF). Molecular and Cellular Biology. 17: 2973–84. Retrieved 2012-04-26.
  12. RefSeq (May 2009). "BRCA1 BRCA1, DNA repair associated [ Homo sapiens (human) ]". 8600 Rockville Pike, Bethesda MD, 20894 USA: National Center for Biotechnology Information, U.S. National Library of Medicine. Retrieved 22 December 2018.
  13. 13.0 13.1 RefSeq (February 2010). "BRCA1 BRCA1, DNA repair associated [ Homo sapiens (human) ]". 8600 Rockville Pike, Bethesda MD, 20894 USA: National Center for Biotechnology Information, U.S. National Library of Medicine. Retrieved 22 December 2018.
  14. Schlegel BP, Starita LM, Parvin JD (February 2003). "Overexpression of a protein fragment of RNA helicase A causes inhibition of endogenous BRCA1 function and defects in ploidy and cytokinesis in mammary epithelial cells". Oncogene. 22 (7): 983–91. doi:10.1038/sj.onc.1206195. PMID 12592385.
  15. Anderson SF, Schlegel BP, Nakajima T, Wolpin ES, Parvin JD (July 1998). "BRCA1 protein is linked to the RNA polymerase II holoenzyme complex via RNA helicase A". Nature Genetics. 19 (3): 254–6. doi:10.1038/930. PMID 9662397.
  16. Lee CG, Hurwitz J (August 1993). "Human RNA helicase A is homologous to the maleless protein of Drosophila". The Journal of Biological Chemistry. 268 (22): 16822–30. PMID 8344961.
  17. Zhang S, Grosse F (April 1997). "Domain structure of human nuclear DNA helicase II (RNA helicase A)". The Journal of Biological Chemistry. 272 (17): 11487–94. doi:10.1074/jbc.272.17.11487. PMID 9111062.
  18. Archambault J, Chambers RS, Kobor MS, Ho Y, Cartier M, Bolotin D, Andrews B, Kane CM, Greenblatt J (February 1998). "An essential component of a C-terminal domain phosphatase that interacts with transcription factor IIF in Saccharomyces cerevisiae". Proceedings of the National Academy of Sciences USA. 94 (26): 14300–5. Bibcode:1997PNAS...9414300A. doi:10.1073/pnas.94.26.14300. PMC 24951. PMID 9405607.
  19. 19.0 19.1 Archambault J, Pan G, Dahmus GK, Cartier M, Marshall N, Zhang S, Dahmus ME, Greenblatt J (November 1998). "FCP1, the RAP74-interacting subunit of a human protein phosphatase that dephosphorylates the carboxyl-terminal domain of RNA polymerase IIO". J Biol Chem. 273 (42): 27593–601. doi:10.1074/jbc.273.42.27593. PMID 9765293.
  20. 20.0 20.1 RefSeq (February 2011). "CTDP1 CTD phosphatase subunit 1 [ Homo sapiens (human) ]". 8600 Rockville Pike, Bethesda MD, 20894 USA: National Center for Biotechnology Information, U.S. National Library of Medicine. Retrieved 22 December 2018.
  21. RefSeq (February 2011). "CTDP1 CTD (carboxy-terminal domain, RNA polymerase II, polypeptide A) phosphatase, subunit 1". 8600 Rockville Pike, Bethesda MD, 20894 USA: National Center for Biotechnology Information, U.S. National Library of Medicine.
  22. Licciardo, Paolo; Amente Stefano; Ruggiero Luca; Monti Maria; Pucci Piero; Lania Luigi; Majello Barbara (February 2003). "The FCP1 phosphatase interacts with RNA polymerase II and with MEP50 a component of the methylosome complex involved in the assembly of snRNP". Nucleic Acids Research. 31 (3): 999–1005. doi:10.1093/nar/gkg197. PMC 149217. PMID 12560496.
  23. Scully, R; Anderson S F; Chao D M; Wei W; Ye L; Young R A; Livingston D M; Parvin J D (May 1997). "BRCA1 is a component of the RNA polymerase II holoenzyme". Proc. Natl. Acad. Sci. U.S.A. 94 (11): 5605–10. Bibcode:1997PNAS...94.5605S. doi:10.1073/pnas.94.11.5605. PMC 20825. PMID 9159119.
  24. 24.0 24.1 RefSeq (September 2011). "DDX53 DEAD-box helicase 53 [ Homo sapiens (human)". 8600 Rockville Pike, Bethesda MD, 20894 USA: National Center for Biotechnology Information, U.S. National Library of Medicine. Retrieved 22 December 2018.
  25. DR Liston, PJ Johnson (March 1999). "Analysis of a Ubiquitous Promoter Element in a Primitive Eukaryote: Early Evolution of the Initiator Element". Molecular and Cellular Biology. 19 (3): 2380–8. PMID 10022924. Retrieved 2012-04-06.
  26. JE Purdy, BJ Mann, LT Pho, WA Petri Jr (July 19, 1994). "Transient transfection of the enteric parasite Entamoeba histolytica and expression of firefly luciferase". Proceedings of the National Academy of Science USA. 91 (15): 7099–103. PMID 8041752. Retrieved 2012-06-10.
  27. 27.0 27.1 Hualin Xi, Yong Yu, Yutao Fu, Jonathan Foley, Anason Halees, and Zhiping Weng (June 2007). "Analysis of overrepresented motifs in human core promoters reveals dual regulatory roles of YY1" (PDF). Genome Research. 17 (6): 798–806. doi:10.1101/gr.5754707. PMC 1891339. PMID 17567998.
  28. R. Javahery, A. Khachi, K. Lo, B. Zenzie-Gregory, S. T. Smale (January 1994). "DNA Sequence Requirements for Transcriptional Initiator Activity in Mammalian Cells" (PDF). Molecular and Cellular Biology. 14 (1): 116–27. PMID 8264580. Retrieved 2012-04-06.
  29. Ananda L. Roy (August 2001). "Biochemistry and biology of the inducible multifunctional transcription factor TFII-I" (PDF). Gene. 274 (1–2): 1–13. doi:10.1016/S0378-1119(01)00625-4. Retrieved 2012-04-06.
  30. HGNC:11535 (March 24, 2012). "TAF1 RNA polymerase II, TATA box binding protein (TBP)-associated factor, 250kDa". Bethesda, Maryland: NCBI. Retrieved 2012-04-09.
  31. ST Smale (March 1997). "Transcription initiation from TATA-less promoters within eukaryotic protein-coding genes". Biochimica & Biophysica Acta. 1351 (1–2): 73–88. doi:10.1016/S0167-4781(96)00206-0. PMID 9116046. Retrieved 2012-04-06.
  32. KH Emami, A Jain, ST Smale (1997). "Mechanism of synergy between TATA and initiator: synergistic binding of TFIID following a putative TFIIA-induced isomerization". Genes & Development. 11: 3007–19. Retrieved 2012-05-20.
  33. 33.0 33.1 33.2 Benjamin Lewin (2004). Genes VIII. Upper Saddle River, NJ: Pearson Prentice Hall. pp. 636–637. Text " Template:Isbn " ignored (help)
  34. Ananda L. Roy, Michael Meisterernst, Philippe Pognonec & Robert G. Roeder (21 November 1991). "Cooperative interaction of an initiator-binding transcription initiation factor and the helix–loop–helix activator USF". Nature. 354: 245–8. Retrieved 2012-05-29.
  35. Ananda L. Roy, Sohail Malik, Michael Meisterernst & Robert G. Roeder (1993). "An alternative pathway for transcription initiation involving TFII-I". Nature. 365: 355–9. Retrieved 2012-05-29.
  36. Stephen T. Smale and James T. Kadonaga (July 2003). "The RNA Polymerase II Core Promoter" (PDF). Annual Review of Biochemistry. 72 (1): 449–79. doi:10.1146/annurev.biochem.72.121801.161520. PMID 12651739. Retrieved 2012-05-07.
  37. 37.0 37.1 Tamar Juven-Gershon, Jer-Yuan Hsu, Joshua W. M. Theisen, and James T. Kadonaga (June 2008). "The RNA Polymerase II Core Promoter – the Gateway to Transcription". Current Opinion in Cell Biology. 20 (3): 253–9. doi:10.1016/j.ceb.2008.03.003. Retrieved 2013-02-13.
  38. Jennifer E.F. Butler, James T. Kadonaga (October 15, 2002). "The RNA polymerase II core promoter: a key component in the regulation of gene expression". Genes & Development 16 (20): 2583–292. doi:10.1101/gad.1026202. PMID 12381658.

Further reading

External links

{{Phosphate biochemistry}}