F box gene transcriptions
Associate Editor(s)-in-Chief: Henry A. Hoff
"The F-box is a protein motif of approximately 50 amino acids that functions as a site of protein-protein interaction."[1]
Gene transcriptions
"The F-box of Elongin A binds Elongin C (El C). The association of Elongins B and C with A increases Elongin A transcriptional activity."[1]
Def. that "part of a biomolecule (such as a protein) that is the target of an immune response"[2] is called an epitope.
"SCF complexes generally recognize substrates after they are phosphorylated on specific epitopes [10]. Phosphorylation is one of the major mechanisms used by cells to rapidly transduce signals. SCF complexes are therefore ideal for dynamic processes that require an abrupt change to be made irreversible (at least in the short term) via the degradation of key proteins. Examples of such processes are cell-cycle phase transitions - during which the cell-cycle regulators that were required for the previous phase are degraded as the cell enters the new phase - and shifts in transcription that last for a longer time period than otherwise because a transcriptional inhibitor is degraded. There is a wide variety of SCF targets that include cell-cycle regulators, for example, G1-phase cyclins, cyclin-dependent kinase inhibitors, DNA replication factors, and transcription factors that promote cell-cycle progression, as well as non-cell-cycle functions, such as a cytoskeletal regulator, cell-surface receptors, transcription-factor inhibitors, and non-cell-cycle transcription factors (Table 2)."[1]
"Second, Elongin A, the transcriptionally active subunit of the Elongin (SIII) complex - which facilitates transcription elongation by RNA polymerase II [16] - is an F-box protein (Figure 2c). Elongin A was isolated by virtue of its ability to increase the catalytic rate of transcript elongation by RNA polymerase II in vitro [16]. Binding of the other components of the complex, Elongin B and C, increases the specific activity of Elongin A. The F-box motif of Elongin A is in the smallest region shown to be sufficient for Elongin A to bind Elongin C in both yeast and humans [17,18]. Elongin C has homology to Skp1; the F-box-Elongin C interaction may therefore be evolutionarily conserved."[1]
"In order to examine the factors involved in regulating telomere length homoeostasis, Liu et al.25 carried out a genome-wide screen of deletion mutations that impacted telomere length in S. pombe25. Deletion of 168 genes was found to alter telomere length, with four causing ‘very short’ telomeres (>150 bp shorter than wild type). These were est1, trt1 and ccq1, which are essential for telomerase activity, and pof8, of unknown function. Pof8 is a putative F-box protein, which bind to Skp1 and Cullin to form an SCF E3 ubiquitin ligase complex26. Previous studies identified Pof8 as a Skp1 binding protein but the association with Cullin was not determined27,28, implying a function distinct from a canonical F-box protein. Here we redefine Pof8 as an RNA-binding protein of the ancient LARP7 family (Laribonucleoprotein domain family member 7 or La-related protein 7) and thus rename it Lar7. LARP7 proteins are characterised by an N-terminal RNA-binding domain termed a La-motif and two RNA-recognition motifs (RRMs). LARP7 proteins associate with non-coding RNAs transcribed by RNA polymerase III, which have a hallmark 3′-UUU-OH tag29. Here we report that Lar7 interacts with TER1 and plays a crucial role during its biogenesis. Lar7 stabilises the interaction between TER1 and the Lsm2–8 complex and facilitates assembly with Trt1. Loss of Lar7 leads to disassembly of telomerase and exosomal degradation of TER1, resulting in impaired telomerase activity and maintenance of very short telomeres. Further, we report a high degree of conservation between Lar7 and both the human protein hLARP7 and Tetrahymena thermophila telomerase RNA-binding protein p65. In human cells, loss of hLARP7 has recently been shown to cause telomere shortening30. Thus, LARP7-familiy proteins are universally crucial for telomere maintenance."[3]
"Lar7 is a member of the LARP7 RNA-binding protein family. In order to assess the function of Lar7 in telomere biology, we first analysed its predicted structure for key domains or moieties that may allude to its function. Using the bioinformatics tool HHpred (homology detection and structure prediction by HMM-HMM) and the Protein Homology/Analogy Recognition Engine v2.0 (Phyre2)33,34, we identified two likely RNA-recognition motifs (RRMs), which have the format β1-α1-β2-β3-α2-β4, in four fission yeast species [...]. Additionally, we identified structural similarity between Lar7 and the human protein hLARP7 (La-related protein 7 or La ribonucleoprotein domain family member 7), which is part of the wider LARP7 family of non-coding RNA-binding proteins35. Remarkably, a well-described member of this family and orthologue of hLARP7 is the telomerase-binding protein p65 in ciliate T. thermophila. All LARP-family proteins (including hLARP7 and p65) have a conserved N-terminal La-motif followed by an RRM (together termed the ‘La-module’) and a second RRM toward the C-terminus [...]. The alignment of S. pombe Lar7 with p65 and hLARP7 and a comparison of Lar7 with the published structures of hLARP7 and p6536–38 revealed considerable sequence and structural similarity between the three proteins [...]. The reported secondary structure of a La-motif is α1-β1-α2-α3-β2-β339. According to the HHpred analysis, the N-terminus of S. pombe Lar7 corresponds to this predicted structure (amino acids 64–138), preceding RRM1 (amino acids 148–230). We have therefore assigned this region a La-motif. Interestingly, the domain assigned as an F-box in Lar7 (amino acids 66–10627,28) lies within the La-motif. Due to this striking conservation and the criteria defining LARP7 family proteins29, in addition to the function of Lar7 in telomerase RNA binding (defined in this report), we concluded that the annotated Pof8 is a member of the LARP7 family and accordingly renamed it Lar7 (La-related protein 7)."[3]
F box genes
Gene ID: 22992 is KDM2A lysine demethylase 2A on 11q13.2: "This gene encodes a member of the F-box protein family which is characterized by an approximately 40 amino acid motif, the F-box. The F-box proteins constitute one of the four subunits of ubiquitin protein ligase complex called SCFs (SKP1-cullin-F-box), which function in phosphorylation-dependent ubiquitination. The F-box proteins are divided into 3 classes: Fbws containing WD-40 domains, Fbls containing leucine-rich repeats, and Fbxs containing either different protein-protein interaction modules or no recognizable motifs. The protein encoded by this gene belongs to the Fbls class and, in addition to an F-box, contains at least six highly degenerated leucine-rich repeats. This family member plays a role in epigenetic silencing. It nucleates at CpG islands and specifically demethylates both mono- and di-methylated lysine-36 of histone H3. Alternative splicing results in multiple transcript variants."[4]
- NP_001243334.1 lysine-specific demethylase 2A isoform b: "Transcript Variant: This variant (3) lacks several 5' exons but contains an alternate 5' exon, uses an internal downstream promoter, and differs in its 5' UTR and 5' coding region, compared to variant 1. The encoded isoform (b, also known as SF-KDM2A or short-form KDM2A) has a distinct and shorter N-terminus, compared to isoform a."[4]
- NP_036440.1 lysine-specific demethylase 2A isoform a: "Transcript Variant: This variant (1) represents the longest transcript and encodes the longer isoform (a)."[4]
Gene ID: 23194 is FBXL7 F-box and leucine rich repeat protein 7 on 5p15.1: "This gene encodes a member of the F-box protein family which is characterized by a 42-48 amino acid motif, the F-box, which binds to the S-phase kinase-associated protein 1 (Skp1) protein. The F-box proteins constitute one of the four subunits of E3 ubiquitin protein ligases called SCFs (SKP1-Cul1-F-box), which play a role in phosphorylation-dependent ubiquitination of proteins. The F-box proteins are divided into 3 subfamilies based on the other domain in the protein: F-box proteins that also have a WD-40 domain (Fbws subfamily), F-box proteins that also have leucine-rich repeats (Fbls subfamily) and F-box proteins that contain other motifs or lack known protein-interaction domains (Fbxs subfamily). The protein encoded by this gene belongs to the Fbls subfamily. Alternative splicing results in multiple transcript variants of this gene."[5]
- NP_001265246.1 F-box/LRR-repeat protein 7 isoform 2: "Transcript Variant: This variant (2) differs in the 5' UTR and initiates translation at a downstream start codon, compared to variant 1. It encodes isoform 2 which is shorter at the N-terminus compared to isoform 1."[5]
- NP_036436.1 F-box/LRR-repeat protein 7 isoform 1: "Transcript Variant: This variant (1) represents the longer transcript and encodes the longer isoform (1)."[5]
Gene ID: 25827 is FBXL2 F-box and leucine rich repeat protein 2 on 3p22.3: "This gene encodes a member of the F-box protein family which is characterized by an approximately 40 amino acid motif, the F-box. The F-box proteins constitute one of the four subunits of ubiquitin protein ligase complex called SCFs (SKP1-cullin-F-box), which function in phosphorylation-dependent ubiquitination. The F-box proteins are divided into 3 classes: Fbws containing WD-40 domains, Fbls containing leucine-rich repeats, and Fbxs containing either different protein-protein interaction modules or no recognizable motifs. The protein encoded by this gene belongs to the Fbls class and, in addition to an F-box, contains 12 tandem leucine-rich repeats. Alternatively spliced transcript variants encoding different isoforms have been found for this gene."[6]
- NP_001336245.1 F-box/LRR-repeat protein 2 isoform 3: "Transcript Variant: This variant (3) encodes the longest isoform (3)."[6]
- NP_001336248.1 F-box/LRR-repeat protein 2 isoform 4: "Transcript Variant: This variant (4), as well as variants 5 and 6, encodes isoform 4."[6]
- NP_001336249.1 F-box/LRR-repeat protein 2 isoform 4: "Transcript Variant: This variant (5), as well as variants 4 and 6, encodes isoform 4."[6]
- NP_001336250.1 F-box/LRR-repeat protein 2 isoform 4: "Transcript Variant: This variant (6), as well as variants 4 and 5, encodes isoform 4."[6]
- NP_001336251.1 F-box/LRR-repeat protein 2 isoform 5 precursor: "Transcript Variant: This variant (7) encodes isoform 5."[6]
- NP_001336252.1 F-box/LRR-repeat protein 2 isoform 6: "Transcript Variant: This variant (8), as well as variants 9, 10, and 11, encodes isoform 6."[6]
- NP_001336253.1 F-box/LRR-repeat protein 2 isoform 6: "Transcript Variant: This variant (9), as well as variants 8, 10, and 11, encodes isoform 6."[6]
- NP_001336254.1 F-box/LRR-repeat protein 2 isoform 6: "Transcript Variant: This variant (10), as well as variants 8, 9, and 11, encodes isoform 6."[6]
- NP_001336255.1 F-box/LRR-repeat protein 2 isoform 6: "Transcript Variant: This variant (11), as well as variants 8, 9, and 10, encodes isoform 6."[6]
- NP_036289.3 F-box/LRR-repeat protein 2 isoform 1: "Transcript Variant: This variant (1) encodes isoform 1."[6]
Gene ID: 26224 is FBXL3 F-box and leucine rich repeat protein 3 on 13q22.3: "This gene encodes a member of the F-box protein family which is characterized by an approximately 40 amino acid motif, the F-box. The F-box proteins constitute one of the four subunits of ubiquitin protein ligase complex called SCFs (SKP1-cullin-F-box), which function in phosphorylation-dependent ubiquitination. The F-box proteins are divided into 3 classes: Fbws containing WD-40 domains, Fbls containing leucine-rich repeats, and Fbxs containing either different protein-protein interaction modules or no recognizable motifs. The protein encoded by this gene belongs to the Fbls class and, in addition to an F-box, contains several tandem leucine-rich repeats and is localized in the nucleus."[7]
Gene ID: 23232 is FBXO2 F-box protein 2 on 1p36.22: "This gene encodes a member of the F-box protein family which is characterized by an approximately 40 amino acid motif, the F-box. The F-box proteins constitute one of the four subunits of the ubiquitin protein ligase complex called SCFs (SKP1-cullin-F-box), which function in phosphorylation-dependent ubiquitination. The F-box proteins are divided into 3 classes: Fbws containing WD-40 domains, Fbls containing leucine-rich repeats, and Fbxs containing either different protein-protein interaction modules or no recognizable motifs. The protein encoded by this gene belongs to the Fbxs class. This protein is highly similar to the rat NFB42 (neural F Box 42 kDa) protein which is enriched in the nervous system and may play a role in maintaining neurons in a postmitotic state."[8]
Gene ID: 26233 is FBXL6 F-box and leucine rich repeat protein 6 on 8q24.3: "This gene encodes a member of a family of proteins that are characterized by an F-box motif. The encoded protein also contains leucine-rich repeats. F-box-containing proteins comprise one of the subunits of the SCF (SKP1-cullin-F-box) complex, which functions in phosphorylation-dependent ubiquitination. Alternative splicing results in multiple transcript variants."[9]
- NP_036294.2 F-box/LRR-repeat protein 6 isoform 1: "Transcript Variant: This variant (1) represents the longer transcript and encodes the longer isoform (1)."[9]
- NP_078831.4 F-box/LRR-repeat protein 6 isoform 2: "Transcript Variant: This variant (2) uses an alternate splice site in the coding region, but maintains the reading frame, compared to variant 1. The encoded isoform (2) is shorter than isoform 1."[9]
Gene ID: 26234 is FBXL5 F-box and leucine rich repeat protein 5 on 4p15.32: "This gene encodes a member of the F-box protein family which is characterized by an approximately 40 amino acid motif, the F-box. The F-box proteins constitute one of the four subunits of ubiquitin protein ligase complex called SCFs (SKP1-cullin-F-box), which function in phosphorylation-dependent ubiquitination. The F-box proteins are divided into 3 classes: Fbws containing WD-40 domains, Fbls containing leucine-rich repeats, and Fbxs containing either different protein-protein interaction modules or no recognizable motifs. The protein encoded by this gene belongs to the Fbls class and, in addition to an F-box, contains several tandem leucine-rich repeats. Alternatively spliced transcript variants have been described for this locus."[10]
- NP_001180463.1 F-box/LRR-repeat protein 5 isoform 3: "Transcript Variant: This variant (3) uses an alternate in-frame splice site in the 5' coding region, compared to variant 1. This results in a shorter protein (isoform 3), compared to isoform 1."[10]
- NP_001180464.1 F-box/LRR-repeat protein 5 isoform 4: "Transcript Variant: This variant (4) uses an alternate in-frame splice site in the 5' coding region, compared to variant 1. This results in a shorter protein (isoform 4), compared to isoform 1."[10]
- NP_036293.1 F-box/LRR-repeat protein 5 isoform 1: "Transcript Variant: This variant (1) encodes the longest isoform (1)."[10]
Gene ID: 26235 is FBXL4 F-box and leucine rich repeat protein 4 on 6q16.1-q16.2: "This gene encodes a member of the F-box protein family, which are characterized by an approximately 40 amino acid motif, the F-box. F-box proteins constitute one subunit of modular E3 ubiquitin ligase complexes, called SCF complexes, which function in phosphorylation-dependent ubiquitination. The F-box domain mediates protein-protein interactions and binds directly to S-phase kinase-associated protein 1. In addition to an F-box domain, the encoded protein contains at least 9 tandem leucine-rich repeats. The ubiquitin ligase complex containing the encoded protein may function in cell-cycle control by regulating levels of lysine-specific demethylase 4A. Alternative splicing results in multiple transcript variants."[11]
- NP_001265645.1 F-box/LRR-repeat protein 4: "Transcript Variant: This variant (2) differs in the 5' UTR compared to variant 1. Both variants 1 and 2 encode the same protein."[11]
- NP_036292.2 F-box/LRR-repeat protein 4: "Transcript Variant: This variant (1) encodes the functional protein. Both variants 1 and 2 encode the same protein."[11]
Gene ID: 54620 is FBXL19 F-box and leucine rich repeat protein 19 on 16p11.2: "This gene encodes a member of the Skp1-Cullin-F-box family of E3 ubiquitin ligases. The encoded protein is reported to bind to the transmembrane receptor interleukin 1 receptor-like 1 and regulate its ubiquitination and degradation. This protein has been linked to the regulation of pulmonary inflammation and psoriasis. Alternative splicing results in multiple transcript variants."[12]
- NP_001093254.2 F-box/LRR-repeat protein 19 isoform 1: "Transcript Variant: This variant (1) represents the longer transcript and encodes the longer isoform (1)."[12]
- NP_001269280.1 F-box/LRR-repeat protein 19 isoform 3: "Transcript Variant: This variant (3) represents use of an alternate promoter and thus differs in the 5' UTR and 5' coding region compared to variant 1. These differences cause translation initiation at a downstream AUG, and the resulting isoform (3) has a shorter N-terminus compared to isoform 1."[12]
Gene ID: 54850 is FBXL12 F-box and leucine rich repeat protein 12 on 19p13.2: "Members of the F-box protein family, such as FBXL12, are characterized by an approximately 40-amino acid F-box motif. SCF complexes, formed by SKP1 (MIM 601434), cullin (see CUL1; MIM 603134), and F-box proteins, act as protein-ubiquitin ligases. F-box proteins interact with SKP1 through the F box, and they interact with ubiquitination targets through other protein interaction domains (Jin et al., 2004 [PubMed 15520277])."[13]
- NP_001303865.1 F-box/LRR-repeat protein 12 isoform b: "Transcript Variant: This variant (2) uses an alternate splice junction and lacks an alternate exon compared to variant 1. The resulting isoform (b) has the same N- and C-termini but is shorter compared to isoform a."[13]
- NP_001303866.1 F-box/LRR-repeat protein 12 isoform c: "Transcript Variant: This variant (3) uses an alternate splice junction and a downstream AUG start codon compared to variant 1. The resulting isoform (c) is shorter at the N-terminus compared to isoform a. Variants 3, 4, 5, 6, 7, and 8 all encode the same isoform (c)."[13]
- NP_001303867.1 F-box/LRR-repeat protein 12 isoform c: "Transcript Variant: This variant (4) contains an alternate exon and uses a downstream AUG start codon compared to variant 1. The resulting isoform (c) is shorter at the N-terminus compared to isoform a. Variants 3, 4, 5, 6, 7, and 8 all encode the same isoform (c)."[13]
- NP_001303868.1 F-box/LRR-repeat protein 12 isoform c: "Transcript Variant: This variant (5) uses an alternate splice junction, contains an alternate exon, and uses a downstream AUG start codon compared to variant 1. The resulting isoform (c) is shorter at the N-terminus compared to isoform a. Variants 3, 4, 5, 6, 7, and 8 all encode the same isoform (c)."[13]
- NP_001303869.1 F-box/LRR-repeat protein 12 isoform c: "Transcript Variant: This variant (6) uses an alternate splice junction and a downstream AUG start codon compared to variant 1. The resulting isoform (c) is shorter at the N-terminus compared to isoform a. Variants 3, 4, 5, 6, 7, and 8 all encode the same isoform (c)."[13]
- NP_001303870.1 F-box/LRR-repeat protein 12 isoform c: "Transcript Variant: This variant (7) lacks an alternate exon and uses a downstream AUG start codon compared to variant 1. The resulting isoform (c) is shorter at the N-terminus compared to isoform a. Variants 3, 4, 5, 6, 7, and 8 all encode the same isoform (c)."[13]
- NP_001303871.1 F-box/LRR-repeat protein 12 isoform c: "Transcript Variant: This variant (8) uses an alternate splice junction, contains an alternate exon, and uses a downstream AUG start codon compared to variant 1. The resulting isoform (c) is shorter at the N-terminus compared to isoform a. Variants 3, 4, 5, 6, 7, and 8 all encode the same isoform (c)."[13]
- NP_060173.1 F-box/LRR-repeat protein 12 isoform a: "Transcript Variant: This variant (1) encodes the longest isoform (a)."[13]
Gene ID: 55336 is FBXL8 F-box and leucine rich repeat protein 8 on 16q22.1: "This gene encodes a member of the F-box protein family which is characterized by an approximately 40 amino acid motif, the F-box. The F-box proteins constitute one of the four subunits of the ubiquitin protein ligase complex called SCFs (SKP1-cullin-F-box), which function in phosphorylation-dependent ubiquitination. The F-box proteins are divided into 3 classes: Fbws containing WD-40 domains, Fbls containing leucine-rich repeats, and Fbxs containing either different protein-protein interaction modules or no recognizable motifs. The protein encoded by this gene belongs to the Fbls class. It shares 78% sequence identity with the mouse protein."[14]
See also
References
- ↑ 1.0 1.1 1.2 1.3 Edward T Kipreos and Michele Pagano (10 November 2000). "The F-box protein family". Genome Biology. 1 (5): 3002. doi:10.1186/gb-2000-1-5-reviews3002. Retrieved 2017-02-13.
- ↑ SemperBlotto (30 August 2006). "epitope". San Francisco, California: Wikimedia Foundation, Inc. Retrieved 7 March 2020.
- ↑ 3.0 3.1 Laura C. Collopy, Tracy L. Ware, Tomas Goncalves, Sunnvør í Kongsstovu, Qian Yang, Hanna Amelina, Corinne Pinder, Ala Alenazi, Vera Moiseeva, Siân R. Pearson, Christine A. Armstrong & Kazunori Tomita (2018). "LARP7 family proteins have conserved function in telomerase assembly" (PDF). Nature Communications. 9 (557): 1–8. doi:10.1038/s41467-017-02296-4. Retrieved 1 August 2019.
- ↑ 4.0 4.1 4.2 RefSeq (January 2012). "KDM2A lysine demethylase 2A [ Homo sapiens (human) ]". 8600 Rockville Pike, Bethesda MD, 20894 USA: National Center for Biotechnology Information, U.S. National Library of Medicine. Retrieved 8 May 2020.
- ↑ 5.0 5.1 5.2 RefSeq (May 2013). "FBXL7 F-box and leucine rich repeat protein 7 [ Homo sapiens (human) ]". 8600 Rockville Pike, Bethesda MD, 20894 USA: National Center for Biotechnology Information, U.S. National Library of Medicine. Retrieved 8 May 2020.
- ↑ 6.00 6.01 6.02 6.03 6.04 6.05 6.06 6.07 6.08 6.09 6.10 RefSeq (January 2010). "FBXL2 F-box and leucine rich repeat protein 2 [ Homo sapiens (human) ]". 8600 Rockville Pike, Bethesda MD, 20894 USA: National Center for Biotechnology Information, U.S. National Library of Medicine. Retrieved 8 May 2020.
- ↑ RefSeq (July 2008). "FBXL3 F-box and leucine rich repeat protein 3 [ Homo sapiens (human) ]". 8600 Rockville Pike, Bethesda MD, 20894 USA: National Center for Biotechnology Information, U.S. National Library of Medicine. Retrieved 8 May 2020.
- ↑ RefSeq (July 2008). "FBXO2 F-box protein 2 [ Homo sapiens (human) ]". 8600 Rockville Pike, Bethesda MD, 20894 USA: National Center for Biotechnology Information, U.S. National Library of Medicine. Retrieved 9 May 2020.
- ↑ 9.0 9.1 9.2 RefSeq (July 2013). "FBXL6 F-box and leucine rich repeat protein 6 [ Homo sapiens (human) ]". 8600 Rockville Pike, Bethesda MD, 20894 USA: National Center for Biotechnology Information, U.S. National Library of Medicine. Retrieved 9 May 2020.
- ↑ 10.0 10.1 10.2 10.3 RefSeq (August 2010). "FBXL5 F-box and leucine rich repeat protein 5 [ Homo sapiens (human) ]". 8600 Rockville Pike, Bethesda MD, 20894 USA: National Center for Biotechnology Information, U.S. National Library of Medicine. Retrieved 9 May 2020.
- ↑ 11.0 11.1 11.2 RefSeq (July 2013). "FBXL4 F-box and leucine rich repeat protein 4 [ Homo sapiens (human) ]". 8600 Rockville Pike, Bethesda MD, 20894 USA: National Center for Biotechnology Information, U.S. National Library of Medicine. Retrieved 9 May 2020.
- ↑ 12.0 12.1 12.2 RefSeq (August 2013). "FBXL19 F-box and leucine rich repeat protein 19 [ Homo sapiens (human) ]". 8600 Rockville Pike, Bethesda MD, 20894 USA: National Center for Biotechnology Information, U.S. National Library of Medicine. Retrieved 9 May 2020.
- ↑ 13.0 13.1 13.2 13.3 13.4 13.5 13.6 13.7 13.8 OMIM (March 2008). "FBXL12 F-box and leucine rich repeat protein 12 [ Homo sapiens (human) ]". 8600 Rockville Pike, Bethesda MD, 20894 USA: National Center for Biotechnology Information, U.S. National Library of Medicine. Retrieved 9 May 2020.
- ↑ RefSeq (July 2008). "GATA1 GATA binding protein 1 [ Homo sapiens (human) ]". 8600 Rockville Pike, Bethesda MD, 20894 USA: National Center for Biotechnology Information, U.S. National Library of Medicine. Retrieved 9 May 2020.