C21orf140

C21orf140
Identifiers
AliasesC21orf140, chromosome 21 open reading frame 140, FAM243A, family with sequence similarity 243 member A
External IDsMGI: 1922578; HomoloGene: 57100; GeneCards: C21orf140; OMA:C21orf140 - orthologs
Orthologs
SpeciesHumanMouse
Entrez

101928147

75328

Ensembl

ENSG00000222018

ENSMUSG00000051728

UniProt

P0DPQ4

Q8CDS7

RefSeq (mRNA)

NM_001282537

NM_029252

RefSeq (protein)

NP_001351640
NP_001269466

NP_083528

Location (UCSC)Chr 21: 34.4 – 34.4 MbChr 16: 92.12 – 92.12 Mb
PubMed search[3][4]
Wikidata
View/Edit HumanView/Edit Mouse

Chromosome 21, open reading frame 140 (C21orf140)[5] is a protein which in humans is encoded by the C21orf140 gene.[6] Its primary alias is Family With Sequence Similarity 243 Member A (FAM243A).[7]

Gene

C21orf140 is 961 base pairs long, located in chromosome 21 of the human genome at the coordinates 21q22.12 on the minus strand, position 34,400,112 to 34,401,072. It contains only one exon.[6]

Expression patterns

C21orf140 is widely expressed in most tissues, with the stomach and retina showing the highest expression levels. Little to no expression has been found in the parathyroid gland, rectum, epididymis, seminal vesicle, smooth muscle, tonsil, and thymus. Additionally, expression is seen in many regions of the brain, with the highest levels in the white matter and choroid plexus.[8]

Transcript and regulation

C21orf140 has a single transcript, an mRNA that is 961 nucleotides long.[9] It has no 5' untranslated region (UTR).

Despite its lack of a 5' UTR, there are RBMX and PABPC1 RNA binding protein sites in the 60 nucleotides upstream of the C21orf140 start codon.[10]

In the coding region, sequences conserved in mammalian orthologs overlap with some identified transcription factor binding sites, including JUN, TFAP4, ETV1, and ZBTB18.[11]

The 3' UTR contains binding sites for the microRNAs miR-9-5p, miR-1294, miR-9986, miR-647, miR-628-3p, and miR-590-3p.[10]

Protein

The C21orf140 protein contains 251 amino acids and a calculated molecular weight of ~29 kDa. Its theoretical isoelectric point is pH = 8.7.[12] There is one known isoform. It is the sole member of the FAM243 protein family in humans, according to a UniProt query.[13] It contains less alanine (3.6%) and less histidine (0.8%) than the average human protein.[14]

Protein motifs

The C21orf140 protein is predicted to contain mixed charge cluster from residues 110 to 141. The periodic pattern "LXX" repeats four times from residues 27 to 38. Another periodic pattern, “SXXXXLXXX," repeats four times from residues 154 to 193 (the letter X stands in for any amino acid). A predicted bipartite nuclear localization signal spans from residues 113 to 127.

Post-translational modifications

The human C21orf140 protein is bioinformatically predicted to contain two phosphorylated serine residues, three phosphorylatedthreonine residues, and an N-glycosylated asparagine.[15] Additionally, four cysteine pairs are predicted to form disulfide bonds.[16] No protein modifications have been experimentally confirmed.[5]

Predicted Human C21orf140 PTMs
Predicted PTM Residue Location(s) Possible Function
phosphoserine 16, 188 activation or signaling
phosphothreonine 32, 57, 236 activation or signaling
N-glycosylated asparagine 228 protein sorting or folding
cysteine-cysteine disulfide bond 26/73, 109/119, 157/176, 201/213 protein tertiary structure formation

Though it has not been found yet, there may be a cleavage site to separate the portion of the protein containing the bipartite nuclear localization signal from the portion containing the N-glycosylated asparagine, allowing the former to be sorted to the nucleus and the latter to exist in the extracellular space.

Secondary and tertiary structures

The C21orf140 protein is predicted to contain a globular domain from residues 2 to 167. Residues 168 to 182 and 243 to 251 are predicted to be disordered regions.[19]

Protein interactions

The C21orf140 protein has no known interactions with other human proteins other than co-mentions in an article listed in PubMed.[20]

Evolutionary history

Paralogs

An NCBI protein BLAST search against the human genome shows no paralogs of C21orf140 in humans.[21]

Orthologs

An NCBI protein BLAST search[21] shows orthologs of C21orf140 in mammals, birds, reptiles, amphibians, bony and cartilaginous fish, bivalves, insects, and sponges. A list of selected orthologs is shown below. C21orf140 has been moderately conserved across its evolutionary history. C21orf140 likely appeared around 758 million years ago in an ancient invertebrate ancestor, as evidenced by the most distant orthologs found in sponges of the phylum Porifera.

Selected Orthologs of Human C21orf140 Protein
Scientific name Common name Taxonomic group Median date of human divergence (MYA) NCBI Accession number Protein size (aa) Sequence identity to human (%) Sequence similarity to human (%)
Homo sapiens Human Primates 0 NP_001269466 251 100 100
Mus musculus House mouse Rodentia 87 NP_083528 251 70.5 85.3
Equus caballus Domestic horse Perissodactyla 94 XP_014591979 243 78.1 86.1
Phocoena sinus Vaquita Cetacea 94 XP_032487142 256 77 87.5
Indicator indicator Greater honeyguide Aves 319 XP_054247325 253 47.4 68.4
Gallus gallus Red junglefowl Aves 319 XP_004934593 316 40.2 57.6
Crocodylus porosus Saltwater crocodile Reptilia 319 XP_019384587 252 57.5 73.8
Gekko japonicus Schlegel's Japanese gecko Reptilia 319 XP_015278925 252 56.6 73.4
Ascaphus truei Coastal tailed frog Amphibia 352 XP_075449410 250 49.4 67.7
Microcaecilia unicolor N/A Amphibia 352 XP_030058389 255 47.7 67.3
Huso huso Beluga sturgeon Actinopterygii 429 KAK6473583 253 39.8 54.2
Tachysurus fulvidraco Yellow catfish Actinopterygii 429 XP_026991742 252 27.6 43.5
Heterodontus francisci Horn shark Chondrichthyes 462 XP_067903089 257 43.1 59.9
Hemitrygon akajei Red stingray Chondrichthyes 462 XP_072915967 273 36.6 53.3
Vanessa atalanta Red admiral Insecta 686 XP_047544623 487 13.9 23.9
Aphis gossypii Cotton aphid Insecta 686 XP_027839080 460 13.6 22.9
Mytilus galloprovincialis Mediterranean mussel Bivalvia 686 VDI40316 549 14.2 22.6
Eriocheir sinensis Chinese mitten crab Crustacea 686 XP_050722331 530 11.5 20.4
Dysidea avara N/A Porifera 758 XP_065883736 252 20.1 34
Amphimedon queenslandica N/A Porifera 758 XP_011406833 457 16.2 26.5

Clinical significance

C21orf140 has been mentioned in the scientific literature in contexts with implications for human health. A 2024 proteomics study found that C21orf140 is downregulated in individuals with obsessive-compulsive disorder.[22] A 2024 GWAS study mapped C21orf140 and its SNPs as one of twenty genes in a shared genetic risk cluster linked with conditions including schizophrenia, substance abuse disorders, bipolar disorder, and depression.[23] A 2022 oncogenomic study associated tumoral C21orf140 mutations with a lack of lasting benefit from hyperthermic intraperitoneal chemotherapy for gastric cancer.[24]

References

  1. ^ a b c GRCh38: Ensembl release 89: ENSG00000222018Ensembl, May 2017
  2. ^ a b c GRCm38: Ensembl release 89: ENSMUSG00000051728Ensembl, May 2017
  3. ^ "Human PubMed Reference:". National Center for Biotechnology Information, U.S. National Library of Medicine.
  4. ^ "Mouse PubMed Reference:". National Center for Biotechnology Information, U.S. National Library of Medicine.
  5. ^ a b "uncharacterized protein C21orf140 [Homo sapiens] - Protein - NCBI". www.ncbi.nlm.nih.gov. Retrieved 2025-10-16.
  6. ^ a b "C21orf140 chromosome 21 open reading frame 140 [Homo sapiens (human)] - Gene - NCBI". www.ncbi.nlm.nih.gov. Retrieved 2025-10-16.
  7. ^ "C21orf140 Gene". GeneCards Human Gene Database. Archived from the original on 2022-12-01. Retrieved 2025-10-16.
  8. ^ "C21orf140 transcriptomics data - The Human Protein Atlas". www.proteinatlas.org. Retrieved 2025-12-04.
  9. ^ "Homo sapiens chromosome 21 open reading frame 140 (C21orf140), mRNA - Nucleotide - NCBI". www.ncbi.nlm.nih.gov. 28 April 2025. Retrieved 2025-10-16.
  10. ^ a b "miRDB - MicroRNA Target Prediction Database". mirdb.org. Retrieved 2025-12-04.
  11. ^ "UCSC Genome Browser". genome.ucsc.edu. Retrieved 2025-12-04.
  12. ^ "Expasy - Compute pI/Mw tool". web.expasy.org. Retrieved 2025-12-02.
  13. ^ "UniProt". UniProt. Retrieved 2025-10-16.
  14. ^ Institute EB. "Job Dispatcher homepage". EMBL-EBI. Retrieved 2025-12-02.
  15. ^ "ScanProsite". prosite.expasy.org. Retrieved 2025-12-04.
  16. ^ "DIpro". Scratch Protein Predictor. 2006-01-01. Retrieved 2025-12-03.
  17. ^ "AlphaFold Protein Structure Database". alphafold.ebi.ac.uk. Retrieved 2025-12-04.
  18. ^ "iCn3D: Web-based 3D Structure Viewer". www.ncbi.nlm.nih.gov. Retrieved 2025-12-04.
  19. ^ "ELM - unknown". elm.eu.org. Retrieved 2025-12-03.
  20. ^ "FAM243A protein (human) - STRING interaction network". string-db.org. Retrieved 2025-12-04.
  21. ^ a b "BLAST: Basic Local Alignment Search Tool". blast.ncbi.nlm.nih.gov. Retrieved 2025-10-16.
  22. ^ Deng M, Li X, Shi D, Fan Q, Zhang H, Wang Z, et al. (August 2024). "iTRAQ-Based Serum Proteomic Analysis Reveals Multifactorial Cellular Function Impairment and Aggravated Systematic Inflammation in Drug-free Obsessive-Compulsive Disorders". ACS Chemical Neuroscience. 15 (16): 3053–3063. doi:10.1021/acschemneuro.4c00317. PMID 39120470.
  23. ^ Han X, Shen Q, Hou C, Yang H, Chen W, Zeng Y, et al. (February 2024). "Disease clusters subsequent to anxiety and stress-related disorders and their genetic determinants". Nature Communications. 15 (1) 1209. Bibcode:2024NatCo..15.1209H. doi:10.1038/s41467-024-45445-2. PMC 10853285. PMID 38332132.
  24. ^ Zeng L, Huang X, Tian Y, Huang J, Liu H, Wen J, et al. (2022-03-08). "Tumor Mutational Burden Associated With Response to Hyperthermic Intraperitoneal Chemotherapy". Frontiers in Oncology. 12 796263. doi:10.3389/fonc.2022.796263. PMC 8958003. PMID 35350562.