C21orf140
| C21orf140 | |||||||||||||||||||||||||||||||||||||||||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Identifiers | |||||||||||||||||||||||||||||||||||||||||||||||||||
| Aliases | C21orf140, chromosome 21 open reading frame 140, FAM243A, family with sequence similarity 243 member A | ||||||||||||||||||||||||||||||||||||||||||||||||||
| External IDs | MGI: 1922578; HomoloGene: 57100; GeneCards: C21orf140; OMA:C21orf140 - orthologs | ||||||||||||||||||||||||||||||||||||||||||||||||||
| |||||||||||||||||||||||||||||||||||||||||||||||||||
| |||||||||||||||||||||||||||||||||||||||||||||||||||
| |||||||||||||||||||||||||||||||||||||||||||||||||||
| |||||||||||||||||||||||||||||||||||||||||||||||||||
| Wikidata | |||||||||||||||||||||||||||||||||||||||||||||||||||
| |||||||||||||||||||||||||||||||||||||||||||||||||||
Chromosome 21, open reading frame 140 (C21orf140)[5] is a protein which in humans is encoded by the C21orf140 gene.[6] Its primary alias is Family With Sequence Similarity 243 Member A (FAM243A).[7]
Gene
C21orf140 is 961 base pairs long, located in chromosome 21 of the human genome at the coordinates 21q22.12 on the minus strand, position 34,400,112 to 34,401,072. It contains only one exon.[6]
Expression patterns
C21orf140 is widely expressed in most tissues, with the stomach and retina showing the highest expression levels. Little to no expression has been found in the parathyroid gland, rectum, epididymis, seminal vesicle, smooth muscle, tonsil, and thymus. Additionally, expression is seen in many regions of the brain, with the highest levels in the white matter and choroid plexus.[8]
Transcript and regulation
C21orf140 has a single transcript, an mRNA that is 961 nucleotides long.[9] It has no 5' untranslated region (UTR).
Despite its lack of a 5' UTR, there are RBMX and PABPC1 RNA binding protein sites in the 60 nucleotides upstream of the C21orf140 start codon.[10]
In the coding region, sequences conserved in mammalian orthologs overlap with some identified transcription factor binding sites, including JUN, TFAP4, ETV1, and ZBTB18.[11]
The 3' UTR contains binding sites for the microRNAs miR-9-5p, miR-1294, miR-9986, miR-647, miR-628-3p, and miR-590-3p.[10]
Protein
The C21orf140 protein contains 251 amino acids and a calculated molecular weight of ~29 kDa. Its theoretical isoelectric point is pH = 8.7.[12] There is one known isoform. It is the sole member of the FAM243 protein family in humans, according to a UniProt query.[13] It contains less alanine (3.6%) and less histidine (0.8%) than the average human protein.[14]
Protein motifs
The C21orf140 protein is predicted to contain mixed charge cluster from residues 110 to 141. The periodic pattern "LXX" repeats four times from residues 27 to 38. Another periodic pattern, “SXXXXLXXX," repeats four times from residues 154 to 193 (the letter X stands in for any amino acid). A predicted bipartite nuclear localization signal spans from residues 113 to 127.
Post-translational modifications
The human C21orf140 protein is bioinformatically predicted to contain two phosphorylated serine residues, three phosphorylatedthreonine residues, and an N-glycosylated asparagine.[15] Additionally, four cysteine pairs are predicted to form disulfide bonds.[16] No protein modifications have been experimentally confirmed.[5]
| Predicted PTM | Residue Location(s) | Possible Function |
|---|---|---|
| phosphoserine | 16, 188 | activation or signaling |
| phosphothreonine | 32, 57, 236 | activation or signaling |
| N-glycosylated asparagine | 228 | protein sorting or folding |
| cysteine-cysteine disulfide bond | 26/73, 109/119, 157/176, 201/213 | protein tertiary structure formation |
Though it has not been found yet, there may be a cleavage site to separate the portion of the protein containing the bipartite nuclear localization signal from the portion containing the N-glycosylated asparagine, allowing the former to be sorted to the nucleus and the latter to exist in the extracellular space.
Secondary and tertiary structures
The C21orf140 protein is predicted to contain a globular domain from residues 2 to 167. Residues 168 to 182 and 243 to 251 are predicted to be disordered regions.[19]
Protein interactions
The C21orf140 protein has no known interactions with other human proteins other than co-mentions in an article listed in PubMed.[20]
Evolutionary history
Paralogs
An NCBI protein BLAST search against the human genome shows no paralogs of C21orf140 in humans.[21]
Orthologs
An NCBI protein BLAST search[21] shows orthologs of C21orf140 in mammals, birds, reptiles, amphibians, bony and cartilaginous fish, bivalves, insects, and sponges. A list of selected orthologs is shown below. C21orf140 has been moderately conserved across its evolutionary history. C21orf140 likely appeared around 758 million years ago in an ancient invertebrate ancestor, as evidenced by the most distant orthologs found in sponges of the phylum Porifera.
| Scientific name | Common name | Taxonomic group | Median date of human divergence (MYA) | NCBI Accession number | Protein size (aa) | Sequence identity to human (%) | Sequence similarity to human (%) |
| Homo sapiens | Human | Primates | 0 | NP_001269466 | 251 | 100 | 100 |
| Mus musculus | House mouse | Rodentia | 87 | NP_083528 | 251 | 70.5 | 85.3 |
| Equus caballus | Domestic horse | Perissodactyla | 94 | XP_014591979 | 243 | 78.1 | 86.1 |
| Phocoena sinus | Vaquita | Cetacea | 94 | XP_032487142 | 256 | 77 | 87.5 |
| Indicator indicator | Greater honeyguide | Aves | 319 | XP_054247325 | 253 | 47.4 | 68.4 |
| Gallus gallus | Red junglefowl | Aves | 319 | XP_004934593 | 316 | 40.2 | 57.6 |
| Crocodylus porosus | Saltwater crocodile | Reptilia | 319 | XP_019384587 | 252 | 57.5 | 73.8 |
| Gekko japonicus | Schlegel's Japanese gecko | Reptilia | 319 | XP_015278925 | 252 | 56.6 | 73.4 |
| Ascaphus truei | Coastal tailed frog | Amphibia | 352 | XP_075449410 | 250 | 49.4 | 67.7 |
| Microcaecilia unicolor | N/A | Amphibia | 352 | XP_030058389 | 255 | 47.7 | 67.3 |
| Huso huso | Beluga sturgeon | Actinopterygii | 429 | KAK6473583 | 253 | 39.8 | 54.2 |
| Tachysurus fulvidraco | Yellow catfish | Actinopterygii | 429 | XP_026991742 | 252 | 27.6 | 43.5 |
| Heterodontus francisci | Horn shark | Chondrichthyes | 462 | XP_067903089 | 257 | 43.1 | 59.9 |
| Hemitrygon akajei | Red stingray | Chondrichthyes | 462 | XP_072915967 | 273 | 36.6 | 53.3 |
| Vanessa atalanta | Red admiral | Insecta | 686 | XP_047544623 | 487 | 13.9 | 23.9 |
| Aphis gossypii | Cotton aphid | Insecta | 686 | XP_027839080 | 460 | 13.6 | 22.9 |
| Mytilus galloprovincialis | Mediterranean mussel | Bivalvia | 686 | VDI40316 | 549 | 14.2 | 22.6 |
| Eriocheir sinensis | Chinese mitten crab | Crustacea | 686 | XP_050722331 | 530 | 11.5 | 20.4 |
| Dysidea avara | N/A | Porifera | 758 | XP_065883736 | 252 | 20.1 | 34 |
| Amphimedon queenslandica | N/A | Porifera | 758 | XP_011406833 | 457 | 16.2 | 26.5 |
Clinical significance
C21orf140 has been mentioned in the scientific literature in contexts with implications for human health. A 2024 proteomics study found that C21orf140 is downregulated in individuals with obsessive-compulsive disorder.[22] A 2024 GWAS study mapped C21orf140 and its SNPs as one of twenty genes in a shared genetic risk cluster linked with conditions including schizophrenia, substance abuse disorders, bipolar disorder, and depression.[23] A 2022 oncogenomic study associated tumoral C21orf140 mutations with a lack of lasting benefit from hyperthermic intraperitoneal chemotherapy for gastric cancer.[24]
References
- ^ a b c GRCh38: Ensembl release 89: ENSG00000222018 – Ensembl, May 2017
- ^ a b c GRCm38: Ensembl release 89: ENSMUSG00000051728 – Ensembl, May 2017
- ^ "Human PubMed Reference:". National Center for Biotechnology Information, U.S. National Library of Medicine.
- ^ "Mouse PubMed Reference:". National Center for Biotechnology Information, U.S. National Library of Medicine.
- ^ a b "uncharacterized protein C21orf140 [Homo sapiens] - Protein - NCBI". www.ncbi.nlm.nih.gov. Retrieved 2025-10-16.
- ^ a b "C21orf140 chromosome 21 open reading frame 140 [Homo sapiens (human)] - Gene - NCBI". www.ncbi.nlm.nih.gov. Retrieved 2025-10-16.
- ^ "C21orf140 Gene". GeneCards Human Gene Database. Archived from the original on 2022-12-01. Retrieved 2025-10-16.
- ^ "C21orf140 transcriptomics data - The Human Protein Atlas". www.proteinatlas.org. Retrieved 2025-12-04.
- ^ "Homo sapiens chromosome 21 open reading frame 140 (C21orf140), mRNA - Nucleotide - NCBI". www.ncbi.nlm.nih.gov. 28 April 2025. Retrieved 2025-10-16.
- ^ a b "miRDB - MicroRNA Target Prediction Database". mirdb.org. Retrieved 2025-12-04.
- ^ "UCSC Genome Browser". genome.ucsc.edu. Retrieved 2025-12-04.
- ^ "Expasy - Compute pI/Mw tool". web.expasy.org. Retrieved 2025-12-02.
- ^ "UniProt". UniProt. Retrieved 2025-10-16.
- ^ Institute EB. "Job Dispatcher homepage". EMBL-EBI. Retrieved 2025-12-02.
- ^ "ScanProsite". prosite.expasy.org. Retrieved 2025-12-04.
- ^ "DIpro". Scratch Protein Predictor. 2006-01-01. Retrieved 2025-12-03.
- ^ "AlphaFold Protein Structure Database". alphafold.ebi.ac.uk. Retrieved 2025-12-04.
- ^ "iCn3D: Web-based 3D Structure Viewer". www.ncbi.nlm.nih.gov. Retrieved 2025-12-04.
- ^ "ELM - unknown". elm.eu.org. Retrieved 2025-12-03.
- ^ "FAM243A protein (human) - STRING interaction network". string-db.org. Retrieved 2025-12-04.
- ^ a b "BLAST: Basic Local Alignment Search Tool". blast.ncbi.nlm.nih.gov. Retrieved 2025-10-16.
- ^ Deng M, Li X, Shi D, Fan Q, Zhang H, Wang Z, et al. (August 2024). "iTRAQ-Based Serum Proteomic Analysis Reveals Multifactorial Cellular Function Impairment and Aggravated Systematic Inflammation in Drug-free Obsessive-Compulsive Disorders". ACS Chemical Neuroscience. 15 (16): 3053–3063. doi:10.1021/acschemneuro.4c00317. PMID 39120470.
- ^ Han X, Shen Q, Hou C, Yang H, Chen W, Zeng Y, et al. (February 2024). "Disease clusters subsequent to anxiety and stress-related disorders and their genetic determinants". Nature Communications. 15 (1) 1209. Bibcode:2024NatCo..15.1209H. doi:10.1038/s41467-024-45445-2. PMC 10853285. PMID 38332132.
- ^ Zeng L, Huang X, Tian Y, Huang J, Liu H, Wen J, et al. (2022-03-08). "Tumor Mutational Burden Associated With Response to Hyperthermic Intraperitoneal Chemotherapy". Frontiers in Oncology. 12 796263. doi:10.3389/fonc.2022.796263. PMC 8958003. PMID 35350562.