Complex locus A1BG and ZNF497: Difference between revisions

Jump to navigation Jump to search
Line 825: Line 825:


<!-- footer categories -->
<!-- footer categories -->
[[Category:Resources last modified in January 2021]]
[[Category:Resources last modified in May 2021]]

Revision as of 18:07, 6 May 2021

Associate Editor(s)-in-Chief: Henry A. Hoff

Alpha-1-B glycoprotein is a 54.3 kDa protein in humans that is encoded by the A1BG gene.[1] The protein encoded by this gene is a plasma glycoprotein of unknown function. The protein shows sequence similarity to the variable regions of some immunoglobulin supergene family member proteins.

A1BG was located on the DNA strand of chromosome 19.[2] Additionally, A1BG, in current nucleotide numbering (58,345,183-58,353,492), is located adjacent to the ZSCAN22 gene (58,326,994-58,342,332) on the positive DNA strand, as well as the ZNF837 (58,367,623 - 58,381,030, complement) and ZNF497 (58,354,357 - 58,362,751, complement) genes on the negative strand.[2] In the current nucleotide numbering, the A1BG untranslated region (UTR) has been expanded so that with ZSCAN22 ending at 58,342,332, the nucleotides used in this study are 58,342,347 to 58,346,897 on both strands, with the current UTR for A1BG beginning at 58,345,183.

Introduction

"Many important disease-related pathways utilize transcription factors that specifically bind DNA (e.g., c-Myc, HIF-1, TCF1, p53) as key nodes or endpoints in complex signaling networks. In such cases the transcription factor itself is often the most attractive target. However, drugging transcription factors is challenging owing to an absence of small ligand binding sites in their DNA-binding domain and the presence of a highly charged DNA-binding surface [1]."[3]

If a specific gene appears to be involved in a disease-related or deleterious pathway being able to alter its expression so as to improve the person's health may be needed. To alter its expression constructively may require knowing what regulatory elements exist in the gene's nearby promoters.

Response elements

Identifying a bona fide response element is more difficult than a simple inspection. In order to attribute the response element to a candidate sequence, some observations have to be conducted using molecular, biological and biophysical methods and functional approaches. Findings may indicate that response element in the promoter is a functional element.[4]

A likely response element found by simple inspection may also be inactive due to methylation.

Response Elements: "Nucleotide sequences, usually upstream, which are recognized by specific regulatory transcription factors, thereby causing gene response to various regulatory agents. These elements may be found in both promoter and enhancer regions."[5]

"Under conditions of stress, a transcription activator protein binds to the response element and stimulates transcription. If the same response element sequence is located in the control regions of different genes, then these genes will be activated by the same stimuli, thus producing a coordinated response."[6]

Abscissic acid (ABA) response elements

In A1BG response element positive results, abscissic acid (ABA) response elements (ABREs) have been identified for example in the UTR for A1BG between ZSCAN22 and A1BG. If these response elements are active then A1BG can be transcribed as a key cis-regulatory element in ABA signaling. "However, for ABA responsive transcription to occur, a single copy of the ABRE is not sufficient. In barley, the combination of an ABRE and one of two known coupling elements CE1 (TGCCACCGG) and CE3 (GCGTGTC) constitutes an ABA responsive complex (ABRC) in the regulation of the ABA‐inducible genes HVA1 and HVA22 (Shen and Ho 1995; Shen et al. 1996). It was also shown that a pair of ABREs can function as an ABRC with the second ABRE playing the role of the coupling element in rice (Hobo et al. 1999), barley (Shen et al. 1996) and Arabidopsis (Nakashima et al. 2006). Coupling of two CE3s is much less active in conferring ABA response to the minimal promoter (Shen et al. 2004). Interestingly, CE3 appears to be specific to monocots. In Arabidopsis, the CE3 element is practically absent; thus, Arabidopsis relies on paired ABREs to form ABRCs (Gomez‐Porras et al. 2007) or on the coupling of a DRE (TACCGACAT) with ABRE (Narusaka et al. 2003; Nakashima et al. 2006)."[7]

No coupling elements occur in the negative direction between ZSCAN22 and A1BG. Two ABREs occur in both directions suggesting pairs may be available to function as an ABRC on either side of A1BG, subject to needed proximity.

No DRE (TACCGACAT) occurs in either direction of A1BG.

"Receptor for activated C kinase (RACK1) is a highly conserved, eukaryotic protein of the WD-40 repeat family. [...] During Phaseolus vulgaris root development, RACK1 (PvRACK1) mRNA expression was induced by auxins, abscissic acid, cytokinin, and gibberellic acid."[8]

Auxin response factors

In A1BG response element positive results, auxin response factors (ARFs) have been identified in the UTR for A1BG between ZSCAN22 and A1BG and between ZNF497 and A1BG but not in this side's core promoter or proximal promoter. If these response elements are active then A1BG can be transcribed as a key cis-regulatory element in auxin signaling. The "genome binding of two ARFs (ARF2 and ARF5/Monopteros [MP]) differ largely because these two factors have different preferred ARF binding site (ARFbs) arrangements (orientation and spacing)."[9]

Cytokinins

In A1BG response element positive results, several of the cytokinin response regulators: ARR10s, ARR12s, and those of Rashotte et al. (2003) have occurrences in the UTR of A1BG from the ZSCAN22 side and in the proximal promoters of A1BG from the ZNF497 side.

Any of the randomly generated nucleotide data sets (0 through 9) can be used to represent any of the real data for the response element of interest.

Every response element ARR1 has 6 in the distal promoter (2 negative direction and 4 positive direction). Random samplings ranges from 0-3. For the proximal promoter there is one for two datasets out of 20. No random in the core promoter. And, four for two datasets in the UTR out of 20.

ARR10 has only one element in the negative direction in the UTR. Four of the 10 random datasets had 1.25 response elements in the UTR. No sequences in the core promoters, but one sequence in the proximal promoter and many in the distal promoters. The only response element in the A1BG promoters is an inverse complement and five of the six random UTR response elements are inverse complements.

For ARR12, the random samples have about 2.5 UTR elements, but ARR12 has only one in the UTR. The A1BG has no core promoters on either side but one random sample out of ten has one on the positive direction from ZNF497. The ARR12 in A1BG proximal promoters have none while two out of ten random datasets have one each. In the distal promoters the negative directions averaged four while the random datasets average 1.3. The positive direction has only one and the random datasets averaged 1.3.

The Rashotte1 ARR (ARRR1) results differ from random datasets: one in the UTR on the ZSCAN22 side versus two (random). No core promoter elements versus one in two of ten (random). Three proximal promoters versus none (random). Distal promoters: one in the negative direction and three in the positive and one to three (random).

For (G/A)GAT(T/C) in ARRR2 (Rashotte2) UTRs, there are none for the negative strand, negative direction, but each random data set finds 2-8. For the positive strand, negative direction, there are seven versus 2-8 from the randomly generated nucleotide data sets (0 through 9).

For (A/G)ATC(C/T) in ARRR2 (Rashotte2) UTRs, all four of the negative strand, negative direction results are inverse complements, where the random data sets have (4-9). For the positive strand, negative direction there are eleven, but the random datasets have 2-9.

Neither direction for ARRR2 (Rashotte2) has core promoter elements, while the random results have an average of (1 per data set, 0-1, negative direction, 0-3, positive direction).

For the negative direction and the proximal promoter, ARRR2 (Rashotte2) produced only one. And, the random datasets have an average of (1 per data set, 0-4).

The positive direction, ARRR2 (Rashotte2) has six. The random datasets have about 1 per dataset (0-3).

The distal promoters, negative direction, ARRR2 (Rashotte2) has 37 and the positive direction 25. The random datasets cover 8-20.

Ethylene response factors

Gibberellic acid response elements

The TAACAAA box (GARE) has an inverse complement TTTGTTA at 230 nucleotides from ZSCAN22 toward A1BG. This is in the distal promoter for A1BG or is a response element for ZSCAN22 rather than A1BG. The GARE-like 1 TTAACA(A/G)A occurs as an inverse complement TTTGTTA at 230 nucleotides from the gene end of ZSCAN22 and may be a response element for ZSCAN22.

ZSCAN22

  1. Gene ID: 342945 is ZSCAN22 zinc finger and SCAN domain containing 22 on 19q13.43.[10] ZSCAN22 is transcribed in the negative direction from LOC100887072.[10]
  2. Gene ID: 102465484 is MIR6806 microRNA 6806 on 19q13.43: "microRNAs (miRNAs) are short (20-24 nt) non-coding RNAs that are involved in post-transcriptional regulation of gene expression in multicellular organisms by affecting both the stability and translation of mRNAs. miRNAs are transcribed by RNA polymerase II as part of capped and polyadenylated primary transcripts (pri-miRNAs) that can be either protein-coding or non-coding. The primary transcript is cleaved by the Drosha ribonuclease III enzyme to produce an approximately 70-nt stem-loop precursor miRNA (pre-miRNA), which is further cleaved by the cytoplasmic Dicer ribonuclease to generate the mature miRNA and antisense miRNA star (miRNA*) products. The mature miRNA is incorporated into a RNA-induced silencing complex (RISC), which recognizes target mRNAs through imperfect base pairing with the miRNA and most commonly results in translational inhibition or destabilization of the target mRNA. The RefSeq represents the predicted microRNA stem-loop."[11] MIR6806 is transcribed in the negative direction from LOC105372480.[11]

Of the some 111 gaps between genes on chromosome locus 19q13.43 as of 4 August 2020, gap number 88 is between ZSCAN22 and A1BG. But, there is no gap between ZNF497 and A1BG.

Promoters

The core promoter begins approximately -35 nts upstream from the transcription start site (TSS). For the numbered nucleotides between ZSCAN22 and A1BG the core promoter extends from 4425 nts up to 4460 nts (TSS). The proximal promoter extends from approximately -250 to the TSS or 4210 nts up to 4460 nts. The distal promoter begins at about 2460 nts and extends to about 4210 nts.

From the ZNF497 side the core promoter begins about 4265 nts up to 4300 nts, the proximal promoter from 4050 nts to 4265 nts, and the distal promoter from 2300 nts to 4050 nts.

Alpha-1-B glycoprotein

Def. "a substance that induces an immune response, usually foreign"[12] is called an antigen.

Def. any "substance that elicits [an] immune response"[13] is called an immunogen.

An antigen "or immunogen is a molecule that sometimes stimulates an immune system response."[14] But, "the immune system does not consist of only antibodies",[14] instead it "encompasses all substances that can be recognized by the adaptive immune system."[14]

Def. "a protein produced by B-lymphocytes that binds to [a specific antigen or][15] an antigen"[16] is called an antibody.

Five different antibody isotypes are known in mammals, which perform different roles, and help direct the appropriate immune response for each different type of foreign object they encounter.[17]

Although the general structure of all antibodies is very similar, a small region, known as the hypervariable region, at the tip of the protein is extremely variable, allowing millions of antibodies with slightly different tip structures to exist, where each of these variants can bind to a different target, known as an antigen.[18]

Def. "any of the glycoproteins in blood serum that respond to invasion by foreign antigens and that protect the host by removing pathogens;"[19] "an antibody"[20] is called an immunoglobulin.

Gene ID: 1 is A1BG alpha-1-B glycoprotein on 19q13.43, a 54.3 kDa protein in humans that is encoded by the A1BG gene.[21] A1BG is transcribed in the positive direction from ZNF497.[21] "The protein encoded by this gene is a plasma glycoprotein of unknown function. The protein shows sequence similarity to the variable regions of some immunoglobulin supergene family member proteins."[21]

  1. NP_570602.2 alpha-1B-glycoprotein precursor, cd05751 Location: 401 → 493 Ig1_LILRB1_like; First immunoglobulin (Ig)-like domain found in Leukocyte Ig-like receptors (LILR)B1 (also known as LIR-1) and similar proteins, smart00410 Location: 218 → 280 IG_like; Immunoglobulin like, pfam13895 Location: 210 → 301 Ig_2; Immunoglobulin domain and cl11960 Location: 28 → 110 Ig; Immunoglobulin domain.[21]

Patients who have pancreatic ductal adenocarcinoma show an overexpression of A1BG in pancreatic juice.[22]

Immunoglobulin supergene family

"𝛂1B-glycoprotein(𝛂1B) [...] consists of a single polypeptide chain N-linked to four glucosamine oligosaccharides. The polypeptide has five intrachain disulfide bonds and contains 474 amino acid residues. [...] 𝛂1B exhibits internal duplication and consists of five repeating structural domains, each containing about 95 amino acids and one disulfide bond. [...] several domains of 𝛂1B, especially the third, show statistically significant homology to variable regions of certain immunoglobulin light and heavy chains. 𝛂1B [...] exhibits sequence similarity to other members of the immunoglobulin supergene family such as the receptor for transepithelial transport of IgA and IgM and the secretory component of human IgA."[23]

"Some of the domains of 𝛂1B show significant homology to variable (V) and constant (C) regions of certain immunoglobulins. Likewise, there is statistically significant homology between 𝛂1B and the secretory component (SC) of human IgA (15) and also with the extracellular portion of the rabbit receptor for transepithelial transport of polymeric immunoglobulins (IgA and IgM). Mostov et al. (16) have called the later protein the poly-Ig receptor or poly-IgR and have shown that it is the precursor of SC."[23]

The immunoglobulin supergene family is "the group of proteins that have immunoglobulin-like domains, including histocompatibility antigens, the T-cell antigen receptor, poly-IgR, and other proteins involved in the vertebrate immune response (17)."[23]

"The internal homology in primary structure [...] and the presence of an intrasegment disulfide bond suggest that 𝛂1B is composed of five structural domains that arose by duplication of a primordial gene coding for about 95 amino acid residues."[23]

"Unlike immunoglobulins (25), ceruloplasmin (6), and hemopexin (7), 𝛂1B is not subject to limited interdomain cleavage by proteolytic enzymes. At least, we were not able to produce such fragments by use of a variety of proteases. This stability of 𝛂1B is probably associated with the frequency of proline in the sequences linking the domains [...]."[23]

"A peptide identified in the late and early milk proteomes showed homology to eutherian alpha 1B glycoprotein (A1BG), a plasma protein with unknown function46, as well as venom inhibitors characterised in the Southern opossum Didelphis marsupialis (DM43 and DM4647,48,49), all members of the immunoglobulin superfamily. To characterise the relationship between the peptide sequence identified in koala, A1BG, DM43 and DM46, a phylogenetic tree was constructed [...] including all marsupial and monotreme homologs (identified by BLAST), three phylogenetically representative eutherian sequences, with human IGSF1 and TARM1, related members of the immunoglobulin super family, used as outgroups. This phylogeny indicates that A1BG-like proteins in marsupials and the Didelphis antitoxic proteins are homologs of eutherian A1BG, with excellent bootstrap support (98%). The marsupial A1BG-like sequences and the Didelphis antitoxic proteins formed a single clade with strong bootstrap support (97%)."[24]

"Human TARM1 and IGSF1, related members of the immunoglobulin superfamily are used as outgroups. The tree was constructed using the maximum likelihood approach and the JTT model with bootstrap support values from 500 bootstrap tests. Bootstrap values less than 50% are not displayed. Accession numbers: Tasmanian devil (Sarcophilus harrisii; XP_012402143), Wallaby (Macropus eugenii; FY619507), Possum (Trichosurus vulpecula; DY596639) Virginia opossum (Didelphis virginiana; AAA30970, AAN06914), Southern opossum (Didelphis marsupialis; AAL82794, P82957, AAN64698), Human (Homo sapiens; P04217, B6A8C7, Q8N6C5), Platypus (Ornithorhychus anatinus; ENSOANP00000000762), Cow (Bos taurus; Q2KJF1), Alpaca (Vicugna pacos; XP_015107031)."[24]

"The sequences of 𝛂1B-glycoprotein (38) and chicken N-CAM (neural cell-adhesion molecule) (39) have been shown to be related to the immunoglobulin supergene family."[25]

A1BG contains the immunoglobulin domain: cl11960 and three immunoglobulin-like domains: pfam13895, cd05751 and smart00410.

"Immunoglobulin (Ig) domain [cl11960] found in the Ig superfamily. The Ig superfamily is a heterogenous group of proteins, built on a common fold comprised of a sandwich of two beta sheets. Members of this group are components of immunoglobulin, neuroglia, cell surface glycoproteins, such as, T-cell receptors, CD2, CD4, CD8, and membrane glycoproteins, such as, butyrophilin and chondroitin sulfate proteoglycan core protein. A predominant feature of most Ig domains is a disulfide bridge connecting the two beta-sheets with a tryptophan residue packed against the disulfide bond."[26]

"This domain [pfam13895] contains immunoglobulin-like domains."[27]

"Ig1_LILR_KIR_like: [cd05751] domain similar to the first immunoglobulin (Ig)-like domain found in Leukocyte Ig-like receptors (LILRs) and Natural killer inhibitory receptors (KIRs). This group includes LILRB1 (or LIR-1), LILRA5 (or LIR9), an activating natural cytotoxicity receptor NKp46, the immune-type receptor glycoprotein VI (GPVI), and the IgA-specific receptor Fc-alphaRI (or CD89). LILRs are a family of immunoreceptors expressed on expressed on T and B cells, on monocytes, dendritic cells, and subgroups of natural killer (NK) cells. The human LILR family contains nine proteins (LILRA1-3,and 5, and LILRB1-5). From functional assays, and as the cytoplasmic domains of various LILRs, for example LILRB1 (LIR-1), LILRB2 (LIR-2), and LILRB3 (LIR-3) contain immunoreceptor tyrosine-based inhibitory motifs (ITIMs) it is thought that LIR proteins are inhibitory receptors. Of the eight LIR family proteins, only LIR-1 (LILRB1), and LIR-2 (LILRB2), show detectable binding to class I MHC molecules; ligands for the other members have yet to be determined. The extracellular portions of the different LIR proteins contain different numbers of Ig-like domains for example, four in the case of LILRB1 (LIR-1), and LILRB2 (LIR-2), and two in the case of LILRB4 (LIR-5). The activating natural cytotoxicity receptor NKp46 is expressed in natural killer cells, and is organized as an extracellular portion having two Ig-like extracellular domains, a transmembrane domain, and a small cytoplasmic portion. GPVI, which also contains two Ig-like domains, participates in the processes of collagen-mediated platelet activation and arterial thrombus formation. Fc-alphaRI is expressed on monocytes, eosinophils, neutrophils and macrophages; it mediates IgA-induced immune effector responses such as phagocytosis, antibody-dependent cell-mediated cytotoxicity and respiratory burst."[28]

"IG domains [smart00410] that cannot be classified into one of IGv1, IGc1, IGc2, IG."[29] "𝛂1B-glycoprotein(𝛂1B) [...] consists of a single polypeptide chain N-linked to four glucosamine oligosaccharides. The polypeptide has five intrachain disulfide bonds and contains 474 amino acid residues. [...] 𝛂1B exhibits internal duplication and consists of five repeating structural domains, each containing about 95 amino acids and one disulfide bond. [...] several domains of 𝛂1B, especially the third, show statistically significant homology to variable regions of certain immunoglobulin light and heavy chains. 𝛂1B [...] exhibits sequence similarity to other members of the immunoglobulin supergene family such as the receptor for transepithelial transport of IgA and IgM and the secretory component of human IgA."[23]

A1BG protein species

Def. a "group of plants or animals having similar appearance"[30] or "the largest group of organisms in which [any][31] two individuals [of the appropriate sexes or mating types][31] can produce fertile offspring, typically by sexual reproduction"[32] is called a species.

The gene contains 20 distinct introns.[33] Transcription produces 15 different mRNAs, 10 alternatively spliced variants and 5 unspliced forms.[33] There are 4 probable alternative promoters, 4 non overlapping alternative last exons and 7 validated alternative polyadenylation sites.[33] The mRNAs appear to differ by truncation of the 5' end, truncation of the 3' end, presence or absence of 4 cassette exons, overlapping exons with different boundaries, splicing versus retention of 3 introns.[33]

Variants or isoforms

Def. a "different sequence of a gene (locus)"[34] is called a variant.

Def. any "of several different forms of the same protein, arising from either single nucleotide polymorphisms,[35] differential splicing of mRNA, or post-translational modifications (e.g. sulfation, glycosylation, etc.)"[36] is called an isoform.

Regarding additional isoforms, mention has been made of "new genetic variants of A1BG."[37]

"Proteomic analysis revealed that [a circulating] set of plasma proteins was α 1 B-glycoprotein (A1BG) and its post-translationally modified isoforms."[38]

Pharmacogenomic variants have been reported.[39]

Genotypes

Def. the "part (DNA sequence) of the genetic makeup of an organism which determines a specific characteristic (phenotype) of that organism"[40] or a "group of organisms having the same genetic constitution" [41]is called a genotype.

There are A1BG genotypes.[39]

A1BG has a genetic risk score of rs893184.[39]

"A genetic risk score, including rs16982743, rs893184, and rs4525 in F5, was significantly associated with treatment-related adverse cardiovascular outcomes in whites and Hispanics from the INVEST study and in the Nordic Diltiazem study (meta-analysis interaction P=2.39×10−5)."[39]

Polymorphs

Def. the "regular existence of two or more different genotypes within a given species or population; also, variability of amino acid sequences within a gene's protein"[42] is called polymorphism.

Def. "one of a number of alternative forms of the same gene occupying a given position, [or locus],[43] on a chromosome"[44] is called an allele.

"rs893184 causes a histidine (His) to arginine (Arg) [nonsynonymous single nucleotide polymorphism (nsSNP), A (minor) for G (major)] substitution at amino acid position 52 in A1BG."[39]

"Genetic polymorphism of human plasma (serum) alpha 1B-glycoprotein (alpha 1B) was observed using one-dimensional horizontal polyacrylamide gel electrophoresis (PAGE) pH 9.0 of plasma samples followed by Western blotting with specific antiserum to alpha 1B."[45]

A1B*5 is a "new allele [...] of human plasma 𝜶1B-glycoprotein [...]."[46]

"Genetic polymorphism of human plasma 𝜶1B-glycoprotein (𝜶1B) was reported first, in brief, by Altland et al. [1983; also given in Altkand and Hacklar, 1984]. A detailed description of human 𝜶1B polymorphism was reported in subsequent studies [Gahne et al., 1987; Juneja et al., 1988, 1989]. Five different 𝜶1B alleles (A1B*1, A1B*2, A1B*3, A1B*4 and A1B*5) were reported. In Caucasian whites, the frequencies of A1B*1 and ''A1B*2 were about 0.95 and 0.05, respectively. A1B*4 was observed in 2 related Czech individuals. In American blacks, A1B*1 and A1B*2 occurred with a frequency of 0.73 and 0.21, respectively, while a new allele, viz, A1B*3 had a frequency of 0.06. A1B*5 was observed only in Swedish Lapps and in Finns with a frequency of 0.04 and 0.007, respectively."[47]

"The frequency of A1B*1 varied from 0.89 to 0.91 and that of A1B*2 from 0.08 to 0.10. The A1B*3 allele, reported previously only in American blacks, was observed with a frequency range of 0.003-0.01 in 3 of the Chinese populations, in Koreans and in Malays. A new 𝜶1B allele (A1B*6) was observed in 2 Chinese individuals."[47]

Phenotypes

Def. the "appearance of an organism based on a single trait [multifactorial combination of genetic traits and environmental factors][48], especially used in pedigrees"[49] or any "observable characteristic of an organism, such as its morphological, developmental, biochemical or physiological properties, or its behavior"[50] is called a phenotype.

"The three different phenotypes of α1B observed (designated 1-1, 1-2, and 2-2) were apparently identical to those reported by Altland et al. (1983), who used double one-dimensional electrophoresis. Family data supported the hypothesis that the three α1B phenotypes are determined by two codominant alleles at an autosomal locus, designated A1B. Allele frequencies in a Swedish population were: A1B *1, 0.937; A1B *2, 0.063; PIC, 0.111."[45]

Protein species

"Both protein species of [alpha 1-beta glycoprotein] A1B (A1Ba, p = 0.008; f.c.= +1.62, A1Bb, p = 0.003; f.c. = +1.82) [...] were apparently overexpressed in patients with PTCa [...]."[51]

A1BG is mainly produced in the liver, and is secreted to plasma to levels of approximately 0.22 mg/mL.[23]

CRISPs

The human cysteine-rich secretory protein (CRISP3) "is present in exocrine secretions and in secretory granules of neutrophilic granulocytes and is believed to play a role in innate immunity."[52] CRISP3 has a relatively high content in human plasma.[52]

"The A1BG-CRISP-3 complex is noncovalent with a 1:1 stoichiometry and is held together by strong electrostatic forces."[52] "Similar [complex formation] between toxins from snake venom and A1BG-like plasma proteins ... inhibits the toxic effect of snake venom metalloproteinases or myotoxins and protects the animal from envenomation."[52]

Opossums have a remarkably robust immune system, and show partial or total immunity to the venom of rattlesnakes, Agkistrodon piscivorus, cottonmouths, and other Crotalinae, pit vipers.[53][54]

"Crisp3 [is] mainly [expressed] in the salivary glands, pancreas, and prostate."[55] "CRISP3 is highly expressed in the human cauda epididymidis and ampulla of vas deferens (Udby et al. 2005)."[55]

ZNF497

Gene ID: 503538 is A1BG-AS1 A1BG antisense RNA 1.[56] A1BG-AS1 is transcribed in the negative direction from ZSCAN22.[56]

Gene ID: 162968 is ZNF497 zinc finger protein 497.[57] ZNF497 is transcribed in the positive direction from RNA5SP473.[57]

  1. NP_001193938.1 zinc finger protein 497: "Transcript Variant: This variant (2) lacks an alternate exon in the 5' UTR, compared to variant 1. Variants 1 and 2 encode the same protein."[57]
  2. NP_940860.2 zinc finger protein 497: "Transcript Variant: This variant (1) is the longer transcript. Variants 1 and 2 encode the same protein."[57]

Gene ID: 100419840 is LOC100419840 zinc finger protein 446 pseudogene.[58] LOC100419840 may be transcribed in the positive direction from LOC105372483.[58]

Gene ID: 105372483 is LOC105372483 uncharacterized LOC105372483 ncRNA.[59] LOC105372483 is transcribed in the negative direction from LOC100419840.[59]

Gene ID: 106479017 is RNA5SP473 RNA, 5S ribosomal pseudogene 473.[60] RNA5SP473 may be transcribed in the negative direction from ZNF497.[60]

GC contents

Approximately "76% of human core promoters lack TATA-like elements, have a high GC content, and are enriched in Sp1 binding sites."[61]

CpG islands typically occur at or near the transcription start site of genes, particularly housekeeping genes, in vertebrates.[62]

The number of CG or GC pairs near the TSS for A1BG appears to be low: between ZSCAN22 and A1BG are 8.2 % CG/GC and between ZNF497 and A1BG are 15 % CG/GC.

19q13.43

Regulatory elements and regions

Functions of A1BG

"Receptors of the leukocyte receptor cluster (LRC) play a range of important functions in the human immune system."[63]

"The leukocyte receptor cluster (LRC) is a family of structurally related genes for immunoregulatory receptors. Originally, the term LRC was introduced to emphasize the linkage of the genes encoding killer immunoglobulin-like receptors (KIRs), leukocyte Ig-like receptors (LILRs), and FcαR on human chromosome 19q13.4 (Wagtmann et al. 1997; Wende et al. 1999). Subsequently, it has been found that the region contains some other structurally related genes, such as NCR1, GPVI, LAIR1, LAIR2, and OSCAR (Meyaard et al. 1997; Sivori et al. 1997; Clemetson et al. 1999; Kim et al. 2002). Most recently, the LRC has been further extended by adding two more genes named VSTM1/SIRL1 and TARM1 (Steevels et al. 2010; Radjabova et al. 2015)."[63]

"Except for LAIR2, which is a secreted protein, all human LRC products are type I cell surface receptors with extracellular regions composed of 1–4 C2-type Ig-like domains."[63]

The "eutherian LRC family, in addition to commonly recognized members, includes two new, IGSF1 and alpha-1-B glycoprotein (A1BG)."[63]

"Nucleotide sequences were retrieved and analyzed using utilities at the NCBI (https://www.ncbi.nlm.nih.gov/, last accessed May 20, 2019) and Ensemble (http://www.ensembl.org, last accessed May 20, 2019) websites."[63]

"In our previous studies, it was observed that the Ig-like domains of the frog and chicken LRC proteins reproducibly showed homology not only to known LRC members but also to the products of four mammalian genes that to our knowledge have never been considered in the phylogenetic analyses of LRC. These genes are VSTM1, TARM1, A1BG, and IGSF1. VSTM1 and TARM1 are the most recently identified members of the human LRC (Steevels et al. 2010; Radjabova et al. 2015). A1BG encodes alpha-1 B glycoprotein, a soluble component of mammalian blood plasma that is known for half a century (Schultze et al. 1963). The protein is composed of five Ig-like domains and has been shown to bind to CRISP-3, a small polypeptide that is present in exocrine secretions of neutrophilic granulocytes and that is believed to play a role in innate immunity (Udby et al. 2004). In the human genome, A1BG maps to 19q13.4 some 3.3 Mb away from GPVI [...]."[63]

"The attribution of IGSF1 and A1BG domains to the LRC was supported by their 3D structures predicted using homology modeling [...]."[63]

"Noteworthy is that the D1 and D6 domains of IgSF1 fall into one clade with the N-terminal (d1) domains of A1BG and OSCAR (cluster B1). Closer relationship of A1BG and OSCAR was supported by clustering of the d2–d5 domains of A1BG with membrane-proximal (d2) domain of OSCAR (cluster B2)."[63]

"Altogether, these results support the attribution of IGSF1 and A1BG to the LRC and suggest their relatedness to OSCAR, TARM1, and VSTM1."[63]

"Clustering of the N-terminal domains of OSCAR, IGSF1, and A1BG with each other and with IGSF1 d6 was also reproduced. Finally, the d2 domains of OSCAR cluster with the d2–d5 domains of A1BG (fig. 5). These results further justify grouping IGSF1, A1BG, OSCAR, TARM1, and VSTM1 into a distinct group B."[63]

Hypotheses

  1. Downstream core promoters may work as transcription factors even as their complements or inverses.
  2. In addition to the DNA binding sequences listed above, the transcription factors that can open up and attach through the local epigenome need to be known and specified.
  3. Each DNA binding domain serving as a transcription factor for the promoter of any immunoglobulin supergene family member, also serves or is present in the promoters for A1BG.
  4. The function of A1BG is the same as other immunoglobulin genes possessing the immunoglobulin domain cl11960 and/or any of three immunoglobulin-like domains: pfam13895, cd05751 and smart00410 in the order and nucleotide sequence: cd05751 Location: 401 → 493, smart00410 Location: 218 → 280, pfam13895 Location: 210 → 301 and cl11960 Location: 28 → 110.

See also

References

  1. "Entrez Gene: Alpha-1-B glycoprotein". Retrieved 2012-11-09.
  2. 2.0 2.1 "A1BG alpha-1-B glycoprotein". Retrieved May 10, 2013.
  3. Qingliang Li, Rezaul M. Karim, Mo Cheng, Mousumi Das, Lihong Chen, Chen Zhang, Harshani R. Lawrence, Gary W. Daughdrill, Ernst Schonbrunn, Haitao Ji and Jiandong Chen (July 2020). "Inhibition of p53 DNA binding by a small molecule protects mice from radiation toxicity". Oncogene. 39 (29): 5187–5200. doi:10.1038/s41388-020-1344-y. PMID 32555331 Check |pmid= value (help). Retrieved 29 August 2020.
  4. Ruoyi Gu, Jun Xu, Yixiang Lin, Jing Zhang, Huijun Wang, Wei Sheng, Duan Ma, Xiaojing Ma & Guoying Huang (July 2016). "Liganded retinoic acid X receptor α represses connexin 43 through a potential retinoic acid response element in the promoter region". Pediatric Research. 80 (1): 159–168. doi:10.1038/pr.2016.47. PMID 26991262. Retrieved 7 September 2020.
  5. U.S. National Library of Medicine (2008/07/08). "Response Elements MeSH Descriptor Data 2021". 8600 Rockville Pike, Bethesda, MD 20894: National Institutes of Health. Retrieved 22 April 2021. Check date values in: |date= (help)
  6. Benjamin A. Pierce (24 December 2004). Control of Gene Expression, In: Genetics Solutions and Problem Solving MegaManual. Macmillan. p. 221. Retrieved 22 April 2021.
  7. Kenneth A. Watanabe, Arielle Homayouni, Lingkun Gu, Kuan‐Ying Huang, Tuan‐Hua David Ho, Qingxi J. Shen (18 June 2017). "Transcriptomic analysis of rice aleurone cells identified a novel abscisic acid response element". Plant, Cell & Environment. 40 (9): 2004–2016. doi:10.1111/pce.13006. Retrieved 5 October 2020.
  8. Tania Islas-Flores, Gabriel Guillén, Xóchitl Alvarado-Affantranger, Miguel Lara-Flores, Federico Sánchez, and Marco A. Villanueva (2011). "PvRACK1 Loss-of-Function Impairs Cell Expansion and Morphogenesis in Phaseolus vulgaris L. Root Nodules". Molecular Plant-Microbe Interactions. 24 (7): 819–826. doi:10.1094/MPMI-11-10-0261. Retrieved 25 April 2021.
  9. Arnaud Stigliani, Raquel Martin-Arevalillo, Jérémy Lucas, Adrien Bessy, Thomas Vinos-Poyo, Victoria Mironova, Teva Vernoux, Renaud Dumas and François Parcy (3 June 2019). "Capturing Auxin Response Factors Syntax Using DNA Binding Models". Molecular Plant. 12 (6): 822–832. doi:10.1016/j.molp.2018.09.010. PMID 30336329. Retrieved 29 August 2020.
  10. 10.0 10.1 HGNC (13 March 2020). "ZSCAN22 zinc finger and SCAN domain containing 22 [ Homo sapiens (human) ]". U.S. National Library of Medicine, 8600 Rockville Pike, Bethesda MD, 20894 USA: National Center for Biotechnology Information. Retrieved 2019-12-18.
  11. 11.0 11.1 RefSeq (10 September 2009). "MIR6806 microRNA 6806 [ Homo sapiens (human) ]". U.S. National Library of Medicine, 8600 Rockville Pike, Bethesda MD, 20894 USA: National Center for Biotechnology Information. Retrieved 2019-12-18.
  12. Jag123 (7 March 2005). "antigen". San Francisco, California: Wikimedia Foundation, Inc. Retrieved 7 March 2020. Vancouver style error: non-Latin character (help)
  13. SemperBlotto (21 April 2008). "immunogen". San Francisco, California: Wikimedia Foundation, Inc. Retrieved 8 March 2020.
  14. 14.0 14.1 14.2 C. Michael Gibson (27 April 2008). "Antigen". Boston, Massachusetts: WikiDoc Foundation. Retrieved 8 March 2020.
  15. Williamsayers79 (26 February 2007). "antibody". San Francisco, California: Wikimedia Foundation, Inc. Retrieved 7 March 2020. Vancouver style error: non-Latin character (help)
  16. Jag123 (7 March 2005). "antibody". San Francisco, California: Wikimedia Foundation, Inc. Retrieved 7 March 2020. Vancouver style error: non-Latin character (help)
  17. Eleonora Market, F. Nina Papavasiliou (2003). "V(D)J Recombination and the Evolution of the Adaptive Immune System". PLoS Biology. 1 (1): e16. doi:10.1371/journal.pbio.0000016.
  18. Charles A Janeway, Jr, Paul Travers, Mark Walport, and Mark J Shlomchik (2001). Immunobiolog (5th ed. ed.). Garland Publishing. ISBN 0-8153-3642-X.
  19. SemperBlotto (25 February 2006). "immunoglobulin". San Francisco, California: Wikimedia Foundation, Inc. Retrieved 7 March 2020.
  20. SemperBlotto (28 April 2008). "immunoglobulin". San Francisco, California: Wikimedia Foundation, Inc. Retrieved 7 March 2020.
  21. 21.0 21.1 21.2 21.3 RefSeq (10 December 2019). "A1BG alpha-1-B glycoprotein [ Homo sapiens (human) ]". U.S. National Library of Medicine, 8600 Rockville Pike, Bethesda MD, 20894 USA: National Center for Biotechnology Information. Retrieved 2019-12-18.
  22. Mei Tian, Ya-Zhou Cui, Guan-Hua Song, Mei-Juan Zong, Xiao-Yan Zhou, Yu Chen, Jin-Xiang Han (2008). "Proteomic analysis identifies MMP-9, DJ-1 and A1BG as overexpressed proteins in pancreatic juice from pancreatic ductal adenocarcinoma patients". BMC Cancer. 8: 241. doi:10.1186/1471-2407-8-241. PMC 2528014. PMID 18706098.
  23. 23.0 23.1 23.2 23.3 23.4 23.5 23.6 Noriaki Ishioka, Nobuhiro Takahashi, and Frank W. Putnam (April 1986). "Amino acid sequence of human plasma 𝛂1B-glycoprotein: Homology to the immunoglobulin supergene family" (PDF). Proceedings of the National Academy of Sciences USA. 83 (8): 2363–7. doi:10.1073/pnas.83.8.2363. PMID 3458201. Retrieved 9 March 2020.
  24. 24.0 24.1 Katrina M. Morris, Denis O’Meally, Thiri Zaw, Xiaomin Song, Amber Gillett, Mark P. Molloy, Adam Polkinghorne, and Katherine Belova (7 October 2016). "Characterisation of the immune compounds in koala milk using a combined transcriptomic and proteomic approach". Scientific Reports. 6: 35011. doi:10.1038/srep35011. PMID 27713568. Retrieved 14 March 2020.
  25. R. J. Paxton, G. Mooser, H. Pande, T. D. Lee, and J. E. Shively (1 February 1987). "Sequence analysis of carcinoembryonic antigen: identification of glycosylation sites and homology with the immunoglobulin supergene family" (PDF). Proceedings of the National Academy of Sciences USA. 84 (4): 920–924. doi:10.1073/pnas.84.4.920. PMID 3469650. Retrieved 26 March 2020.
  26. NCBI (2 February 2016). "Conserved Protein Domain Family cl11960: Ig Superfamily". 8600 Rockville Pike, Bethesda MD, 20894 USA: National Center for Biotechnology Information, U.S. National Library of Medicine. Retrieved 22 May 2020.
  27. NCBI (5 August 2015). "Conserved Protein Domain Family pfam13895: Ig_2". 8600 Rockville Pike, Bethesda MD, 20894 USA: National Center for Biotechnology Information, U.S. National Library of Medicine. Retrieved 24 May 2020.
  28. NCBI (16 August 2016). "Conserved Protein Domain Family cd05751: Ig1_LILR_KIR_like". 8600 Rockville Pike, Bethesda MD, 20894 USA: National Center for Biotechnology Information, U.S. National Library of Medicine. Retrieved 24 May 2020.
  29. NCBI (16 January 2013). "Conserved Protein Domain Family smart00410: IG_like". 8600 Rockville Pike, Bethesda MD, 20894 USA: National Center for Biotechnology Information, U.S. National Library of Medicine. Retrieved 24 May 2020.
  30. 24.98.118.180 (28 February 2007). "species". San Francisco, California: Wikimedia Foundation, Inc. Retrieved 25 March 2020. Vancouver style error: non-Latin character (help)
  31. 31.0 31.1 Peter coxhead (22 August 2018). "Species". San Francisco, California: Wikimedia Foundation, Inc. Retrieved 25 March 2020.
  32. Chiswick Chap (1 December 2016). "Species". San Francisco, California: Wikimedia Foundation, Inc. Retrieved 25 March 2020.
  33. 33.0 33.1 33.2 33.3 "AceView: A1BG". Retrieved May 11, 2013.
  34. Pdeitiker (26 July 2008). "variant". San Francisco, California: Wikimedia Foundation, Inc. Retrieved 25 March 2020.
  35. SemperBlotto (6 January 2007). "isoform". San Francisco, California: Wikimedia Foundation, Inc. Retrieved 2 December 2018.
  36. 72.178.245.181 (30 November 2008). "isoform". San Francisco, California: Wikimedia Foundation, Inc. Retrieved 2 December 2018. Vancouver style error: non-Latin character (help)
  37. H Eiberg, ML Bisgaard, J Mohr (1 December 1989). "Linkage between alpha 1B-glycoprotein (A1BG) and Lutheran (LU) red blood group system: assignment to chromosome 19: new genetic variants of A1BG". Clinical genetics. 36 (6): 415–8. PMID 2591067. Retrieved 2017-10-08.
  38. John R. Stehle Jr., Mark E. Weeks, Kai Lin, Mark C. Willingham, Amy M. Hicks, John F. Timms, Zheng Cui (January 2007). "Mass spectrometry identification of circulating alpha-1-B glycoprotein, increased in aged female C57BL/6 mice". Biochimica et Biophysica Acta (BBA) - General Subjects. 1770 (1): 79–86. doi:10.1016/j.bbagen.2006.06.020. PMID 16945486. Retrieved 2017-10-08.
  39. 39.0 39.1 39.2 39.3 39.4 Caitrin W. McDonough, Yan Gong, Sandosh Padmanabhan, Ben Burkley, Taimour Y. Langaee, Olle Melander, Carl J. Pepine, Anna F. Dominiczak, Rhonda M. Cooper-DeHoff, and Julie A. Johnson (June 2013). "Pharmacogenomic Association of Nonsynonymous SNPs in SIGLEC12, A1BG, and the Selectin Region and Cardiovascular Outcomes" (PDF). Hypertension. 62 (1): 48–54. doi:10.1161/HYPERTENSIONAHA.111.00823. PMID 23690342. Retrieved 2017-10-08.
  40. DTLHS (10 January 2018). "genotype". San Francisco, California: Wikimedia Foundation, Inc. Retrieved 25 March 2020.
  41. SemperBlotto (22 October 2005). "genotype". San Francisco, California: Wikimedia Foundation, Inc. Retrieved 25 March 2020.
  42. Widsith (28 March 2012). "polymorphism". San Francisco, California: Wikimedia Foundation, Inc. Retrieved 25 March 2020.
  43. 217.105.66.98 (8 September 2016). "allele". San Francisco, California: Wikimedia Foundation, Inc. Retrieved 25 March 2020. Vancouver style error: non-Latin character (help)
  44. 138.130.33.215 (7 April 2004). "allele". San Francisco, California: Wikimedia Foundation, Inc. Retrieved 25 March 2020. Vancouver style error: non-Latin character (help)
  45. 45.0 45.1 B. Gahne, R. K. Juneja, and A. Stratil (June 1987). "Genetic polymorphism of human plasma alpha 1B-glycoprotein: phenotyping by immunoblotting or by a simple method of 2-D electrophoresis". Human Genetics. 76 (2): 111–5. doi:10.1007/bf00284904. PMID 3610142. Retrieved 25 March 2020.
  46. R.K. Juneja, G. Beckman, M. Lukka, B. Gahne, and C. Ehnholm (1989). "Plasma α1B-Glycoprotein Allele Frequencies in Finns and Swedish Lapps: Evidence for a New α1B Allele". Human Heredity. 39 (1): 32–36. doi:10.1159/000153828. PMID 2759622. Retrieved 25 March 2020.
  47. 47.0 47.1 R.K. Juneja, N. Saha, B. Gahne and J.S.H. Tay (1989). "Distribution of Plasma Alpha-1-B-Glycoprotein Phenotypes in Several Mongoloid Populations of East Asia". Human Heredity. 39: 218–222. doi:10.1159/000153863. PMID 2583734. Retrieved 25 March 2020.
  48. 24.235.196.118 (23 September 2007). "phenotype". San Francisco, California: Wikimedia Foundation, Inc. Retrieved 2016-10-04. Vancouver style error: non-Latin character (help)
  49. SemperBlotto (14 February 2005). "phenotype". San Francisco, California: Wikimedia Foundation, Inc. Retrieved 2016-10-04.
  50. N2e (3 July 2008). "phenotype". San Francisco, California: Wikimedia Foundation, Inc. Retrieved 2016-10-04. Vancouver style error: non-Latin character (help)
  51. Mardiaty Iryani Abdullah, Ching Chin Lee, Sarni Mat Junit, Khoon Leong Ng, and Onn Haji Hashim (13 September 2016). "Tissue and serum samples of patients with papillary thyroid cancer with and without benign background demonstrate different altered expression of proteins". Peer J. 4: e2450. doi:10.7717/peerj.2450. PMID 27672505. Retrieved 15 March 2020.
  52. 52.0 52.1 52.2 52.3 Udby L, Sørensen OE, Pass J, Johnsen AH, Behrendt N, Borregaard N, Kjeldsen L (12 October 2004). "Cysteine-rich secretory protein 3 is a ligand of alpha1B-glycoprotein in human plasma". Biochemistry. 43 (40): 12877–86. doi:10.1021/bi048823e. PMID 15461460. Retrieved 2011-11-28. Vancouver style error: punctuation (help)
  53. "The Opossum: Our Marvelous Marsupial, The Social Loner". Wildlife Rescue League.
  54. Journal Of Venomous Animals And Toxins – Anti-Lethal Factor From Opossum Serum Is A Potent Antidote For Animal, Plant And Bacterial Toxins. Retrieved 2009-12-29.
  55. 55.0 55.1 B Haendler, J Krätzschmar, F Theuring and W D Schleuning (July 1993). "Transcripts for cysteine-rich secretory protein-1 (CRISP-1; DE/AEG) and the novel related CRISP-3 are expressed under androgen control in the mouse salivary gland". Endocrinology. 133 (1): 192–8. doi:10.1210/en.133.1.192. PMID 8319566. Retrieved 2012-02-20.
  56. 56.0 56.1 HGNC (10 December 2019). "A1BG-AS1 A1BG antisense RNA 1 [ Homo sapiens (human) ]". U.S. National Library of Medicine, 8600 Rockville Pike, Bethesda MD, 20894 USA: National Center for Biotechnology Information. Retrieved 2019-12-18.
  57. 57.0 57.1 57.2 57.3 HGNC (10 December 2019). "ZNF497 zinc finger protein 497 [ Homo sapiens (human) ]". U.S. National Library of Medicine, 8600 Rockville Pike, Bethesda MD, 20894 USA: National Center for Biotechnology Information. Retrieved 2019-12-18.
  58. 58.0 58.1 HGNC (10 December 2019). "LOC100419840 zinc finger protein 446 pseudogene [ Homo sapiens (human) ]". U.S. National Library of Medicine, 8600 Rockville Pike, Bethesda MD, 20894 USA: National Center for Biotechnology Information. Retrieved 2019-12-18.
  59. 59.0 59.1 HGNC (10 December 2019). "LOC105372483 uncharacterized LOC105372483 [ Homo sapiens (human) ]". U.S. National Library of Medicine, 8600 Rockville Pike, Bethesda MD, 20894 USA: National Center for Biotechnology Information. Retrieved 2019-12-18.
  60. 60.0 60.1 HGNC (10 December 2019). "RNA5SP473 RNA, 5S ribosomal pseudogene 473 [ Homo sapiens (human) ]". U.S. National Library of Medicine, 8600 Rockville Pike, Bethesda MD, 20894 USA: National Center for Biotechnology Information. Retrieved 2019-12-18.
  61. Chuhu Yang, Eugene Bolotin, Tao Jiang, Frances M. Sladek, Ernest Martinez. (2007). "Prevalence of the initiator over the TATA box in human and yeast genes and identification of DNA motifs enriched in human TATA-less core promoters". Gene. 389 (1): 52–65. doi:10.1016/j.gene.2006.09.029. PMID 17123746. Unknown parameter |month= ignored (help)
  62. Saxonov S, Berg P, Brutlag DL (2006). "A genome-wide analysis of CpG dinucleotides in the human genome distinguishes two distinct classes of promoters". Proc Natl Acad Sci USA. 103 (5): 1412–1417. doi:10.1073/pnas.0510310103. PMC 1345710. PMID 16432200.
  63. 63.0 63.1 63.2 63.3 63.4 63.5 63.6 63.7 63.8 63.9 Sergey V Guselnikov and Alexander V Taranin (1 June 2019). "Unraveling the LRC Evolution in Mammals: IGSF1 and A1BG Provide the Keys". Genome Biology and Evolution. 11 (6): 1586–1601. doi:10.1093/gbe/evz102. PMID 31106814. |access-date= requires |url= (help)

External links

{{Phosphate biochemistry}}