C4orf17
| C4orf17 | |||||||||||||||||||||||||||||||||||||||||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Identifiers | |||||||||||||||||||||||||||||||||||||||||||||||||||
| Aliases | C4orf17, chromosome 4 open reading frame 17 | ||||||||||||||||||||||||||||||||||||||||||||||||||
| External IDs | MGI: 1914991; HomoloGene: 12962; GeneCards: C4orf17; OMA:C4orf17 - orthologs | ||||||||||||||||||||||||||||||||||||||||||||||||||
| |||||||||||||||||||||||||||||||||||||||||||||||||||
| |||||||||||||||||||||||||||||||||||||||||||||||||||
| |||||||||||||||||||||||||||||||||||||||||||||||||||
| Wikidata | |||||||||||||||||||||||||||||||||||||||||||||||||||
| |||||||||||||||||||||||||||||||||||||||||||||||||||
Chromosome 4 open reading frame 17 (C4orf17), is a protein-coding gene in humans. C4orf17 (accession: NP_115525.2) spans approximately 31,289 base pairs on the plus strand of chromosome 4 at gene locus 4q23.[4] It has a molecular weight of approximately 39.6 kDa.[5]
C4orf17 Identifiers Aliases[6] dkfzp434g072, Q53FE4, chromosome 4 open reading frame 17, Ci-Spatial External IDs UniProt: Q53FE4; GeneCards: GC04P099511; HGNC ID: HGNC:25274; NCBI Gene ID: 84103; Ensembl: ENSG00000138813 Accession numbers[7] (Homo sapiens) NM_032149 (mRNA) and NP_115525 (protein) Gene location (human) Chromosome Chromosome 4 (human)[6] Band 4q23 Start chr4: 99,511,012[6] bp Orientation Plus (+) strand End chr4: 99,542,303[6] bp Gene location[8] (mouse) Chromosome Chromosome 3 (mouse) Band 3 G3; 3 64.1 cM Start chr3: 137,869,839 bp Orientation Complement (-) strand End chr3: 137,899.892 bp RNA Expression pattern Human Mouse (ortholog) Bgee[9] Top expressed in
- Left ventricle myocardium
- Buccal mucosa cell
- Sperm
- Male germline stem cell (in testes)
Top expressed in
- Left ventricle myocardium
- Buccal mucosa cell
- Sperm
- Male germline stem cell (in testes)
BioGPS[10]
- Testes
- Sperm
- Testes
mRNA
C4orf17 has a total of three known splice variants.[11][5] There are 9 exons found within C4orf17's mRNA sequence. The two additional isoforms have 10 and 7 exons, respectively.
| mRNA variant | Variant identifier | mRNA length | Protein length | Exons |
|---|---|---|---|---|
| NM_032149.3 | Q53FE4 | ~31K bp | 359 aa | 9 |
| XM_011532315.3 | Q53FE4-1 | ~31K bp | 186 aa | 10 |
| XM_054350982.1 | Q53FE4-2 | ~23K bp | 93 aa | 7 |
Protein
The C4orf17 gene encodes the Q53FE4 protein, otherwise knowns as the C4orf17 protein.[13]
Expression
In Ciona intestinalis, Ci-Spatial/C4orf17 is an essential part of beta-catenin signaling pathways in early embryonic development. Researchers isolated various cDNA clones and injected them with specific morpholinos. As a result, this study suggests that it is possible that Ci-Spatial/C4orf17 is involved in nuclear translocation of Ci-beta-catenin or enhancement of transcriptional activation.
In humans, there is notable expression of C4orf17 in the testes and early spermatids.[12]
Promoter
One of the promoters for the human C4orf17 gene is Alcohol Dehydrogenase 6 (ADH6),[5] located on chr4:99,404,578-99,405,078 (500 nucleotides.) It acts as a catalyst dependent on nicotinamide adenine dinucleotide (NAD) oxidation of primary alcohols to corresponding aldehydes. It also oxidizes secondary alcohols to the corresponding ketones.
Localization
In humans, C4orf17 is specific to the testes.[13] During human fetal development, however, C4orf17 expression appeared in adrenal, intestinal, and renal tissues[14] at 20 weeks after conception.
Interactions
The C4orf17 protein interacts with numerous other proteins, such as ATRX, CHD3, KAT2B, KDM1A, PRMT1, and SUV39H1. These specific interactions suggest that C4orf17’s function is associated with chromatin modifications through histone methylation.[15]
In a 2006 study[16] regarding Abetalipoproteinemia (ABL,) an autosomal recessive disorder caused by mutations of the MTP gene (and thus the microsomal triglyceride transfer protein,) researchers hypothesized their subject was homozygous for a genomic deletion. This deletion included the entire MTP gene, spanning the 4q23 region, which ultimately deleted C4orf17 among eight other genes. Many of these genes are a part of an alcohol dehydrogenase (ADH) gene cluster that essentially metabolizes ethanol, retinol, aliphatic alcohols, hydroxysteroids, and lipid peroxidation products. However, C4orf17 is not included in this cluster.
A 2022 study,[17] nevertheless, further investigated the ADH gene cluster. According to the researchers, “Nucleotide variation in ADH genes can affect the catalytic properties of these enzymes and is associated with a variety of traits, including alcoholism and cancer,” (McQuillan et al., 2022). C4orf17 is one of the many genetic mutations listed as having a strong effect on ADH traits.
Evolutionary aspects
Orthologs
C4orf17 has a total of 86 orthologs recorded in the Ensembl database as well as 331 listed in NCBI. There were no orthologs found among invertebrates, protists, fungi, plants, or bacteria and archaea.[18] The C4orf17 gene likely arose approximately 463 million years ago[18]
| Taxonomic Class | Genus and species | Organism Common Name | NCBI Accession | Percent Identity (%) | Percent Similarity (%) | Length (aa) | Date of Divergence (MYA) |
|---|---|---|---|---|---|---|---|
| Mammalia | Homo sapiens | Human | NP_115525 | 100 | 100 | 359 | 0 |
| Reptilia | Anolis carolinensis | Green anole | XP_062838383 | 25.1 | 86 | 327 | 319 |
| Amphibia | Ambystoma mexicanum | Axolotl | XP_069486409.1 | 25.9 | 86 | 396 | 352 |
| Aves | Apteryx mantelli | North Island brown kiwi | XP_013806968.2 | 33 | 85 | 372 | 319 |
| Chondrichthyes | Heptranchias perlo | Sharpnose sevengill shark | XP_067828475.1 | 28.8 | 55 | 379 | 462 |
| Mammalia | Petaurus breviceps papuanus | Sugar glider | XP_068955299.1 | 45.9 | 62 | 333 | 160 |
| Mammalia
(Primate) |
Papio anubis | Olive baboon | XP_017814371.1 | 79.7 | 100 | 313 | 28.8 |
| Mammalia
(Rodentia) |
Castor canadensis | North American beaver | XP_020036598.2 | 67.2 | 99 | 366 | 87 |
Table 2: Percent identity accounts for exact matches, such as a DNA base or amino acids. Percent similarity accounts for identical and biochemically similar substitutions, such as substituted amino acids.
Paralogs
The C4orf17 protein is a SPATIAL (stromal protein associated with thymic and lymph node) domain-containing protein,[19] specifically at the interval 27-223 aa.[20] This suggests that it may be involved in spermatid differentiation. This family may also be referred to as “TBATA” or “TBATA-like.”
Protein Analysis Through Evolutionary Relationships (PANTHER) Classification System categorizes C4orf17 into the PTHR33772:SF2 protein family.[21]
Protein Divergence
The figure to the right named "Date of Divergence..." exhibits the evolution of human C4orf17, cytochrome c, and fibrinogen alpha chain. Cytochrome C has a weaker slope indicating fewer mutations over time, while the fibrinogen alpha chain has a taller slope, meaning it has evolved quicker than both human C4orf17 and cytochrome c. [13]
Multiple sequence alignment
Figure 4 displays the most conserved regions. Accession numbers on the left side of the images indicate the species including sea lamprey, turbot, cyclopterus, sharpnose sevengill shark, giant oceanic manta, rhinatrema, axolotl, Montequma quail, Norther Island brown kiwi, Komodo dragon, green anole, red-eared slider, Chinese softshell turtle, sugar glider, koala, Bennet's brown lemur, gray mouse lemur, North American beaver, Olive baboon, and human.
Conceptual translation
Figure 5 exhibits a conceptual translation of C4orf17 in humans. It is annotated to highlight exons, start and stop codons, and disordered regions among other genetic features.
SNPs
C4orf17 has a total of 14,681 SNPs recorded in the NCBI Variant Viewer, though only two are clinically significant.[22] Variants rs1397320372 and rs1417597081 are both single nucleotide, missense variants.
| SNP | Position | Alleles | Type |
|---|---|---|---|
| rs1397320372 | chr4:99529854 | A>C / A>G | missense |
| rs1417597081 | chr4:99529903 | G>A / G>C / G>T | missense |
References
- ^ a b c GRCm38: Ensembl release 89: ENSMUSG00000012042 – Ensembl, May 2017
- ^ "Human PubMed Reference:". National Center for Biotechnology Information, U.S. National Library of Medicine.
- ^ "Mouse PubMed Reference:". National Center for Biotechnology Information, U.S. National Library of Medicine.
- ^ a b "UniProt". UniProt. Retrieved 2025-09-19.
- ^ a b c GeneCards Human Gene Database. "C4orf17 Gene - GeneCards | CD017 Protein | CD017 Antibody". www.genecards.org. Archived from the original on 2024-05-09. Retrieved 2025-12-12.
- ^ a b c d e "Gene: C4orf17 (ENSG00000138813) - Gene expression - Homo_sapiens - Ensembl genome browser 115". useast.ensembl.org. Retrieved 2025-12-03.
- ^ "Genes for Homo sapiens (human)". NCBI. Retrieved 2025-09-20.
- ^ "4930579F01Rik RIKEN cDNA 4930579F01 gene [Mus musculus (house mouse)] - Gene - NCBI". www.ncbi.nlm.nih.gov. Retrieved 2025-12-04.
- ^ "C4orf17 ENSG00000138813 expression in Homo sapiens (human)". Bgee. Retrieved 2025-12-12.
- ^ "BioGPS - your Gene Portal System". biogps.org. Retrieved 2025-12-12.
- ^ "Homo sapiens chromosome 4 open reading frame 17 (C4orf17), mRNA". 2025-04-28.
- ^ a b "AceView: Gene:C4orf17, a comprehensive annotation of human, mouse and worm genes with mRNAs or ESTsAceView". www.ncbi.nlm.nih.gov. Retrieved 2025-12-12.
- ^ a b c "Chromosome 4 open reading frame 17 [Homo sapiens] - Protein - NCBI". www.ncbi.nlm.nih.gov. Retrieved 2025-12-12.
- ^ "C4orf17 chromosome 4 open reading frame 17 [Homo sapiens (human)] - Gene - NCBI". www.ncbi.nlm.nih.gov. Retrieved 2025-12-14.
- ^ "AlphaFold Protein Structure Database". alphafold.ebi.ac.uk. Retrieved 2025-12-14.
- ^ Benayoun L, Granot E, Rizel L, Allon-Shalev S, Behar DM, Ben-Yosef T (April 2007). "Abetalipoproteinemia in Israel: evidence for a founder mutation in the Ashkenazi Jewish population and a contiguous gene deletion in an Arab patient". Molecular Genetics and Metabolism. 90 (4): 453–457. doi:10.1016/j.ymgme.2006.12.010. PMID 17275380.
- ^ McQuillan MA, Ranciaro A, Hansen ME, Fan S, Beggs W, Belay G, et al. (October 2022). Falush D (ed.). "Signatures of Convergent Evolution and Natural Selection at the Alcohol Dehydrogenase Gene Region are Correlated with Agriculture in Ethnically Diverse Africans". Molecular Biology and Evolution. 39 (10) msac183. doi:10.1093/molbev/msac183. PMC 9547508. PMID 36026493.
- ^ a b "C4orf17 orthologs". NCBI. Retrieved 2025-09-20.
- ^ "CDSearch: Navigate results". www.ncbi.nlm.nih.gov. Retrieved 2025-12-12.
- ^ "CDSearch: Navigate results". www.ncbi.nlm.nih.gov. Retrieved 2025-12-14.
- ^ "Transcript: ENST00000326581.9 (C4orf17-201) - Protein summary - Homo_sapiens - Ensembl genome browser 109". feb2023.archive.ensembl.org. Retrieved 2025-12-12.
- ^ "Variation Viewer". www.ncbi.nlm.nih.gov. Retrieved 2025-12-12.