Are Peptideins the Missing Regulators of Cancer Cell Viability? Three Testable Hypotheses from the Dark Proteome

Three testable hypotheses on how non-coding microproteins (peptideins) regulate cancer survival via splicing, scaffolding, and degradation pathways.


Scientific Hypothesis Generation

Are Peptideins the Missing Regulators of Cancer Cell Viability? Three Testable Hypotheses from the Dark Proteome

Over 1,700 previously hidden microproteins, encoded by non-coding genomic regions and now termed "peptideins," have been identified by large-scale proteogenomic analyses. CRISPR screens show that 10-17% of these non-canonical translation products are essential for cancer cell survival, a rate comparable to canonical protein-coding genes. Three hypotheses emerge from the BioSkepsis literature synthesis, each targeting a different layer of regulation: splicing-dependent expression, competitive complex occupancy, and protein quality control escape.

Hypothesis 1: The SF3B1-SRSF2 Splicing Switch Enables Selective Microprotein Translation in Cancer

Cancer-associated splicing mutations bypass canonical start codons to unlock a hidden pro-survival proteome encoded in lncRNAs and 5' UTRs.

The Gap

Thousands of non-canonical ORFs in lncRNAs and 5' UTRs are translated at detectable levels, yet only a fraction produce stable, functionally essential microproteins. The mechanism determining which microproteins are selectively expressed in cancer, but not in normal tissue, remains poorly characterised. It is unclear whether a single upstream regulatory axis governs this selectivity.

The Claim

The SF3B1-SRSF2 splicing axis functions as a universal stoichiometric regulator of cancer cell viability by producing alternative transcript isoforms that exclude canonical upstream start codons, thereby permitting translation of essential microproteins such as DEDD2-SEP and APPLE (PMID: 39484585). SF3B1 mutations (e.g., K700E) drive aberrant lncRNA biogenesis and the production of neopeptides essential for leukemia cell survival. Once translated, these microproteins function as stoichiometric adaptors for critical oncogenic complexes: APPLE scaffolds the MAPK axis by binding ERK1/2 and preventing PP1/PP2A dephosphorylation (PMID: 40646641); ASDURF acts as a subunit of the prefoldin-like chaperone complex in MYC-driven medulloblastoma (PMID: 38176414). Pharmacological restoration of canonical splicing should decrease microprotein-scaffolded complex assembly and reduce viability in peptidein-dependent tumours.

Why It's Testable Now

Isogenic SF3B1-WT vs. K700E cell lines (Nalm-6) are available. Long-read ISO-seq can quantify alternative transcript ratios. TMT-based quantitative proteomics and Ribo-seq confirm microprotein levels. Start-codon mutant cDNAs (ATG to ATT/AGG) distinguish protein-mediated from RNA-mediated effects.

The Intriguing Outcome

If confirmed, this would establish splicing factors as druggable gatekeepers of the entire non-canonical proteome, not just individual transcript targets. It would reframe SF3B1 inhibitors (e.g., pladienolide B) as tools for collapsing microprotein-dependent signalling hubs in myeloid malignancies. It would also imply that the dark proteome is not constitutively expressed but conditionally unlocked by cancer-specific splicing events.

Thesis Entry Points

  1. Perform ISO-seq in isogenic Nalm-6 SF3B1-WT vs. K700E cells to quantify the ratio of microprotein-encoding alternative transcripts (DEDD2-SEP, APPLE) relative to canonical counterparts.
  2. Knock down SRSF2 by siRNA in the same lines, then run Ribo-seq and TMT proteomics to measure changes in microprotein translation and abundance.
  3. Express start-codon mutant (ATG-to-ATT) cDNAs of DEDD2-SEP and APPLE to confirm that pro-survival effects require the protein product; validate by co-IP for complex assembly (ERK-PP2A, prefoldin-like) and CRISPR-dropout fitness scores.

Novelty Signal

Open field: Individual microproteins (APPLE, ASDURF) have been characterised, but no study has tested whether a single splicing axis governs the expression of the entire essential microprotein repertoire in cancer.

Hypothesis 2: Microproteins Sustain Oncogenic Signalling by Competitively Displacing Canonical Complex Subunits

Peptideins do not merely participate in signalling complexes; they lock them into a constitutively active state by occupying regulatory grooves that would otherwise bind phosphatases or tumour suppressors.

The Gap

Several microproteins (APPLE, EMBOW, SMIMP) have been shown to bind core signalling scaffolds such as ERK1/2, WDR5, and cohesin subunit SMC1A. However, it remains unknown whether these microproteins compete stoichiometrically with canonical regulatory subunits for the same binding interface, and whether this competitive displacement is the mechanism by which they sustain signalling.

The Claim

Cancer-associated SF3B1-SRSF2 mutations establish a "microprotein-enforced" signalling hub by preferentially producing alternative transcript isoforms that encode scaffolding microproteins. Once translated, these microproteins (APPLE, EMBOW) competitively displace canonical regulatory subunits from protein complexes, preventing signalling deactivation. APPLE binds the ERK-PP1/PP2A interface and blocks dephosphorylation (PMID: 40646641). EMBOW occupies the WDR5 WIN-site, displacing canonical ligands (PMID: 37725512). Disruption of these specific interfaces by small molecules (WIN-site inhibitors, PP2A activators) should restore canonical dephosphorylation and induce apoptosis. The hypothesis is falsified if microprotein-encoding isoforms are produced at equivalent levels in wild-type cells but are translationally silent, or if pharmacological activation of PP1/PP2A fails to induce cell death in APPLE-overexpressing tumours.

Why It's Testable Now

WIN-site inhibitors are commercially available. Co-immunoprecipitation and competition assays can quantify microprotein:target occupancy ratios. TMT proteomics in SF3B1-mutant isogenic lines enables direct measurement of displacement dynamics.

The Intriguing Outcome

If validated, this would establish microprotein occupancy as a previously unrecognised mechanism of therapeutic resistance. It would explain why some cancers are refractory to MEK or HDAC inhibitors: the microprotein physically blocks the site where the canonical regulator would restore homeostasis. It would also prioritise PPI disruptors over kinase inhibitors for peptidein-dependent tumours.

Thesis Entry Points

  1. Quantify the stoichiometric ratio of APPLE to PP1/PP2A and EMBOW to WDR5 using TMT-multiplexed co-IP proteomics in isogenic Nalm-6 and HCC cell lines (SF3B1-WT vs. K700E).
  2. Treat microprotein-high cell lines with WIN-site inhibitors or PP2A activators (DT-061) and measure the kinetics of canonical subunit re-engagement by time-resolved co-IP, coupled with viability assays (CCK-8, Annexin V).
  3. Knock out the APPLE or EMBOW start codon (ATG to ATT) while preserving host transcript structure; rescue with synonymous-codon-recoded microprotein cDNA to confirm protein-dependent complex displacement.

Novelty Signal

Frontier: No published study has directly measured stoichiometric competition between a microprotein and a canonical regulatory subunit at a shared binding interface in cancer cells.

Hypothesis 3: The BAG6 Degradation Pathway Is a Targetable Gatekeeper That Selectively Permits Oncogenic Microprotein Accumulation

The cell's own protein quality control system normally destroys non-canonical translation products, but cancer-specific transcript recoding bypasses this filter to allow scaffolding microproteins through.

The Gap

CRISPR screens identify thousands of translated non-canonical ORFs, yet only a small fraction produce stable, functional microproteins at detectable levels. The BAG6 mitigation pathway degrades non-canonical translation products with hydrophobic C-terminal domains (PMID: 40245354, PMID: 39753408). It is unknown whether oncogenic microproteins escape BAG6 degradation through a specific mechanism, or whether they simply outcompete the pathway by overproduction.

The Claim

The BAG6 pathway functions as a selective stoichiometric filter that permits accumulation of oncogenic scaffolding microproteins (APPLE, SMIMP, ASDURF) only when their synthesis is driven by alternative transcript isoforms that lack upstream inhibitory open reading frames (uORFs). This establishes a "recoding-dependent" threshold: the microprotein is produced in sufficient quantity to saturate BAG6 capacity and accumulate at signalling hubs only when the alternative isoform is the dominant transcript. The hypothesis predicts that pharmacological inhibition of BAG6 will paradoxically stabilise microproteins and increase oncogenic complex occupancy in microprotein-dependent cell lines. Conversely, forced reinsertion of upstream uORFs should restore translational repression and trigger BAG6-mediated clearance. The hypothesis is falsified if microprotein accumulation occurs regardless of the presence or absence of upstream uORFs, or if loss of signalling complex occupancy does not result in reduced cancer cell viability.

Why It's Testable Now

BAG6 knockout cell lines are available. TMT proteomics can quantify the global stabilisation of non-canonical translation products upon BAG6 loss. CRISPR-mediated uORF insertion and deletion in endogenous loci is feasible with paired gRNA designs.

The Intriguing Outcome

If confirmed, this would establish BAG6 as a druggable node, but with a counter-intuitive therapeutic logic: activating BAG6 (rather than inhibiting it) would collapse the microprotein pool and dismantle peptidein-enforced signalling hubs. It would also reframe protein quality control as a context-dependent tumour suppressor that is overwhelmed, not mutated, in peptidein-dependent cancers.

Thesis Entry Points

  1. Generate BAG6 knockout in isogenic cancer cell lines and perform TMT proteomics to quantify global stabilisation of non-canonical microproteins; identify which essential candidates (APPLE, SMIMP, ASDURF) accumulate upon BAG6 loss.
  2. Measure microprotein occupancy at the WDR5 WIN-site and PP1/PP2A grooves by co-IP in BAG6-WT vs. BAG6-KO cells to test whether increased microprotein stability translates to increased complex engagement.
  3. Use CRISPR to insert a synthetic uORF upstream of the APPLE or SMIMP coding sequence in the endogenous locus; measure the impact on microprotein translation (Ribo-seq), BAG6-mediated degradation (cycloheximide chase), and cell viability (EdU incorporation, CRISPR-dropout score).

Novelty Signal

Frontier: No published study has tested whether BAG6 degradation capacity acts as a stoichiometric threshold for oncogenic microprotein accumulation, or whether uORF-dependent translational repression is the switch that determines which microproteins escape this filter.

Frequently asked questions

What are peptideins?

Peptideins are a newly proposed category of microproteins encoded by non-canonical open reading frames (ncORFs) in genomic regions previously considered non-coding, such as lncRNAs and 5' UTRs. They are typically shorter than 100 amino acids and were identified through large-scale proteogenomic analyses. Unlike conventional microproteins, many peptideins do not resemble known protein folds, and their functions are only beginning to be characterised.

How were peptideins discovered to be essential for cancer cell survival?

Systematic CRISPR/Cas9 loss-of-function screens targeting non-canonical ORFs revealed that approximately 10% to 17% of translated microproteins are required for cancer cell fitness, a rate comparable to canonical protein-coding genes. In a screen of 553 small ORFs, 57 were identified as essential for cancer cell survival (PMID: 33510483).

What is the SF3B1-SRSF2 splicing axis and why does it matter for microproteins?

SF3B1 and SRSF2 are splicing factors frequently mutated in cancers, particularly myeloid malignancies. Mutations in these factors can alter transcript isoform production, bypassing upstream canonical start codons that normally repress the translation of internal microprotein-encoding ORFs. This creates a splicing-dependent mechanism for selective microprotein expression in cancer cells.

What is the BAG6 mitigation pathway?

The BAG6 pathway is a protein quality control system that triages and degrades non-canonical translation products bearing hydrophobic C-terminal domains. It normally prevents the accumulation of aberrant peptides. The hypothesis proposes that oncogenic recoding via alternative transcript isoforms may override this filter, allowing essential scaffolding microproteins to accumulate selectively in cancer cells.

Which microproteins are named as druggable candidates?

Several candidates have been identified. GREP1, a secreted microprotein in breast cancer, is accessible to neutralizing antibodies. APPLE binds the ERK-PP1/PP2A interface, offering a target for small-molecule PPI disruptors. EMBOW occupies the WDR5 WIN-site, which is already targeted by existing inhibitors. Non-canonical ORF-derived peptides also serve as tumour-specific antigens for immunotherapy in tumours with low mutational burden (PMID: 38985879).

How do researchers distinguish microprotein function from host lncRNA function?

The standard control is a start-codon mutation (ATG to ATT or AGG) in the microprotein ORF while preserving the host transcript RNA structure. If the phenotype is lost upon abolishing translation but the RNA remains intact, the effect is protein-dependent. Rescue with a synonymous-codon-recoded microprotein cDNA provides additional confirmation.

Can these hypotheses be tested in mouse models?

Partially. Many peptideins lack sequence conservation beyond primates, limiting the utility of conventional mouse models for some candidates. Patient-derived xenograft (PDX) models and humanised systems offer alternatives. For conserved candidates like ASDURF, genetic mouse models remain viable.

Generate Your Own Testable Hypotheses with BioSkepsis

Every hypothesis above was synthesised from PubMed-indexed literature by BioSkepsis, with automated citation verification. Ask your own research question, get PMID-grounded answers, and generate falsifiable hypotheses with named experimental readouts.

Start free

Sources and further reading

  1. Chen J et al. Pervasive functional translation of noncanonical human open reading frames. Science. 2020;367(6482):1140-1146. PMID: 32139545
  2. Prensner JR et al. Noncanonical open reading frames encode functional proteins essential for cancer cell survival. Nat Biotechnol. 2021;39(6):697-707. PMID: 33510483
  3. Kovalski JR et al. A uORF-encoded microprotein is an essential subunit of the prefoldin-like complex in MYC-driven medulloblastoma. Nature. 2024;625(7993):112-119. PMID: 38176414
  4. Li Y et al. The non-canonical proteome uniquely populates the proteome or immunopeptidome. Cell Rep. 2024;48(9):114409. PMID: 39379206
  5. Liu Y et al. APPLE promotes hepatocellular carcinoma through sustained MAPK activation by scaffolding ERK1/2 and preventing PP1/PP2A dephosphorylation. Nat Commun. 2025;16(1):5445. PMID: 40646641
  6. Chiu HS et al. The microprotein EMBOW competes with WDR5 ligands. Mol Cell. 2023;83(21):3838-3852. PMID: 37725512
  7. Lu S et al. SMIMP promotes colorectal cancer by epigenetic silencing of tumour suppressors via cohesin subunit SMC1A. Mol Cancer. 2023;22(1):192. PMID: 37932451
  8. Huang N et al. iORF-dependent DEDD2-SEP translation is essential for cancer cell survival. Nat Struct Mol Biol. 2024;31(12):1898-1908. PMID: 39484585
  9. Tay LS et al. BAG6 mitigation of non-canonical translation products. Nature. 2025;639(8055):205-214. PMID: 39753408
  10. Hsu PY et al. BAG6 triages non-canonical ORF products. Science. 2025;387(6739):eadq1424. PMID: 40245354
  11. Zhao X et al. Non-canonical ORFs as tumour-specific antigens in HCC. J Hepatol. 2024;81(4):679-692. PMID: 38985879
  12. Xu W et al. TPM3P9 promotes ccRCC via RBM4-mediated oncogenic splicing. Adv Sci. 2025;12(6):e2412375. PMID: 39865075