BEND2 (protein)


BEND2 is a protein that in humans is encoded by the BEND2 gene. It is also found in other vertebrates, including mammals, birds, and reptiles. The expression of BEND2 in Homo sapiens is regulated and occurs at high levels in the skeletal muscle tissue of the male testis and in the bone marrow. The presence of the BEN domains in the BEND2 protein indicates that this protein may be involved in chromatin modification and regulation.

Gene

Common aliases

BEND2 stands for BEN domain containing 2 and is also known as CXorf20.

Locus and size

The locus for BEND2 is on the minus strand of the X chromosome at Xp22.13. The gene is approximately 58 kilobases in length.

mRNA

Alternative splicing

BEND2 contains 14 exons which undergo alternative splicing to create five transcript variants that vary from 4,720 base pairs to 2,144 bp in the mature mRNA. The longest and most complete transcript of the gene, variant 1, encodes isoform 1 of the BEND2 protein.

5' and 3'UTR

The untranslated regions flanking the coding sequence of BEND2 at the 5' and 3' end of the mature mRNA molecule contain sites for RNA-binding proteins, including RBMX, pum2, and EIF4B as well as microRNA binding sites. The 5'UTR also contains an upstream in-frame stop codon and the 3'UTR contains a polyadenylation signal sequence.

Protein (Isoform 1)

Molecular weight and internal composition

The predicted molecular weight is 87.9 kDal.
The predicted isoelectric point is pH 5.07.
The internal composition is enriched for serine residues.

Isoforms

Corresponding to the five alternative transcripts of BEND2, the protein encoded by this gene is found in two isoforms as well as three predicted structures. These isoforms range from 813 to 645 amino acids in length. Isoform 1 is 799 amino acids in length.

Subcellular location

The presence of nuclear localization signals within the amino acid sequence or primary structure of the BEND2 protein leads to a prediction of subcellular localization in the nucleus. The pat7 signal and a nuclear bipartite signal are both found near the N-terminus of the protein.

Structure

The secondary structure for BEND2 is unclear, in particular at the N-terminus, which is poorly conserved between orthologs. The C-terminus contains two BEN domains, which are predicted to form a series of alpha helices.

Post-translational modifications

Based on its primary structure, BEND2 is predicted to undergo N-terminus acetylation, glycation of several lysine residues, SUMOlation, a SUMO interaction at the N-terminus, S-palmitoylation, and extensive phosphorylation.

Interacting Proteins

BEND2 is found to interact with the following proteins through experimental yeast two-hybrid screens or pull down assays.
Experiment typeProteinProtein FunctionAssociated diseases
Two-hybrid screenAtaxin 1Chromatin-binding factor; RNA metabolismSpinocerebellar ataxia 1/spinocerebellar degeneration
Two-hybrid screenSplicing factor 3A subunit 2Activation of U2 snRNP; microtubule-binding protein
Two-hybrid screenLIM Homeobox 2 Transcriptional regulator for cell differentiation; sequence-specific DNA bindingSchizencephaly
Two-hybrid screenProline Rich 20D Unknown function
Pull down assayAmyolid precursor protein Cell surface receptor in neurons; cleaved to form transcriptional activatorsCerebral amyloid angiopathy; Alzheimer's disease

BEN Domains (protein feature)

BEND2 has two BEN domains at its C-terminus. BEN domains are found in a diverse array of proteins and are predicted to be important for chromatin remodeling as well as for the recruitment of chromatin-modifying factors utilized during the process of transcriptional regulation of gene expression. BEN domains are predicted to form four alpha helices that allow this domain to interact with its DNA target.
showed that the Drosophila melanogaster Insensitive gene and corresponding protein has no domains of known chemical function yet it contains a single BEN domain. They illustrated the activity of the Insv protein in transcriptional regulation of genes and obtained a crystal structure of two Insv BEN domains interacting with their DNA target site.

Expression

Tissue expression pattern

The expression of the BEND2 gene is regulated and it is therefore not ubiquitously expressed in the human body. High expression occurs in the testis and in the bone marrow. The NCBI EST profile for this gene shows expression only in the testis and in the muscle.

Transcriptional regulation of expression

The promoter regulating expression of BEND2 is 1255 base pairs in length and is located directly upstream of the BEND2 gene. It regulates transcription of all five transcriptional variants of BEND2. Genomatix's MatInspector program predicted 418 transcription factor binding sites within the BEND2 promoter, including for SRY, neurogenin, interferon regulatory factor-3, Ikaros2, and TCF/LEF-1.

Homology

Paralogs

The BEND2 protein has no known paralogs within the human genome.

BEN-domain containing gene family

The BEND2 gene belongs to a family of human genes known as . This includes BANP, BEND3, BEND4, BEND5, BEND6, BEND7, NACC1, and NACC2. The loci for these genes are spread throughout the human genome. Each of these genes contains between one and four BEN domains. Except for at these motifs, the genes of the BEN family do not have similar sequences.

Orthologs

The BEND2 gene is conserved across evolutionary time as it has 114 known orthologs in a wide range of vertebrate species including mammals, birds, crocodilia, and amphibians. The BEND2 protein has 42 known orthologs. The C-terminus of the protein, the location of its BEN domains, is highly conserved; however, the N-terminus is not well conserved, even within the order of Primates.
Genus/speciesCommon nameOrderDate of divergence from H. sapiens Accession numberSequence lengthWhole sequence identity C-terminus identity
Homo sapiensHumanPrimates0NP_699177.27991.0001.000
Pongo abeliiOrangutanPrimates15.76--7840.9210.854
Macaca nemestrinaSouthern pig-tailed macaquePrimates29.44XP_011733709.18230.6940.828
Vicugna pacosAlpacaArtiodactyla96XP_015106214.17400.4330.512
Ceratotherium simum simumWhite rhinocerosPerissodactyla96XP_014646569.18640.4120.527
Loxodonta africanaAfrican bush elephantProboscidea105XP_010594135.18290.3820.489
Canis lupus familiarisDogCarnivora96XP_013967473.19000.3620.445
Ailuropoda melanoleucaGiant panadaCarnivora96XP_019665441.18520.3530.460
Rhinolophus sinicusChinese horseshoe batChiroptera96XP_019610944.18080.3450.459
Dasypus novemcinctusNine-banded armadilloCingulata105XP_012377569.18860.3420.500
Trichechus manatus latirostrisManateeSirenia105XP_012412857.19500.3350.475
Chrysochloris asiaticaCape golden moleAfrosoricida105XP_006835746.16830.3300.443
Oryctolagus cuniculusEuropean rabbitLagomorpha90XP_017205124.18110.3050.438
Monodelphis domesticaGray short-tailed opossumDidelphimorphia159XP_007500895.17280.3030.443
Ornithorhynchus anatinusPlatypusMonotremata177XP_007668655.17150.3020.429
Gavialis gangeticusFish-eating crocodileCrocodilia312XP_019380828.16970.3090.458
Chelonia mydasGreen sea turtleTestudines312XP_007070584.17490.2970.453
Apteryx australis mantelliNorth Island brown kiwiApterygiformes312XP_013807123.16470.2950.444
Columba liviaRock doveColumbiformes312XP_005509980.16680.2870.442
Pygoscelis adeliaeAdelie penguinSphenisciformes312XP_009323754.16570.2820.458
Nanorana parkeriTibet frogAnura352XP_018417228.15860.2600.376

Function

BEND2 is predicted to be a DNA-binding protein due to the presence of BEN domains at its C-terminus, a hypothesis supported by its localization to the nucleus, the transcription factors found in its promoter region, and the nature of the proteins it interacts with. Though the precise function of the BEND2 protein is not yet well understood by the scientific community, BEN domains have been found to be important regulators of transcription.

Clinical significance

The diseases that have been linked to BEND2 are related to the central nervous system though expression of the gene is not highly observed in these tissues.