Case Study · May 5, 2026 · ~16 min read

From 406 compounds to 26 translational leads: an end-to-end DRUG-seq dose-response decoder

Rebuilding the analytical pipeline on Ginkgo Datapoints' GDPx4 DRUG-seq screen — from raw count matrix to tiered, target-annotated compound hypotheses.

Source dataset

Ginkgo Datapoints GDPx4 DRUG-seq screen: 406 compounds × 6 doses (0.03–10 µM) in synchronised HEK293 across 14 plates, with four positive-control anchors (BFA, TSA, Rigosertib, Staurosporine) and DMSO sham wells. Raw matrix 78,778 × 5,264 wells.

High-throughput transcriptomic screens are a chemical biologist's dream and a translational lead's nightmare. They are dense, plate-confounded, replicate-poor, and stuffed with cytotoxic noise. The promise — recover mechanism of action (MoA) and rank compounds by the cleanliness of their dose-response — is real, but realising it requires a pipeline that holds together from raw count to target nomination.

We rebuilt that pipeline end-to-end on the Ginkgo Datapoints GDPx4 dataset. The 78,778 × 5,264 raw matrix went through seven analytical steps and emerged as a tiered, target-annotated, indication-mapped table of compound hypotheses. This post walks through what the analysis found, what it concluded, and — most importantly for biopharma readers — what those conclusions mean for translational decision-making.

TL;DR for the translational reader
  • Analytical validity is established. Three of four positive controls (BFA → UPR, TSA → HDAC chromatin, Rigosertib → PLK1/cell-cycle) were independently recovered through five orthogonal computational routes. Staurosporine's complete QC failure is itself a biological readout (apoptosis-driven mitochondrial RNA release).
  • 26 Tier-1 compounds survive the conjunctive filter of clean dose-response × coherent MoA cluster × multi-source target evidence. These are the highest-confidence leads.
  • 75 Tier-2 compounds are the novel-MoA hypothesis pool — clean dose-response, but cluster heterogeneity or missing target annotation prevents full triangulation. Ouabain, emetine, mitoxantrone, omaveloxolone lead this list.
  • Cluster 12 antimetabolites are systematically under-ranked by monotonicity weighting — the strongest pathway-enrichment signal in the screen sits in Tier 3 because cytostatic plateau-shaped responses are not what the cleanliness score rewards. A structural artefact, not a biological one, with direct implications for how you set up the next screen.

QC, or, why losing Staurosporine is good news

DRUG-seq plates fail in characteristic ways: low UMI counts (sequencing artefact), inflated mitochondrial fraction (cell stress / apoptosis), bimodal duplication (library prep failure). MAD-based thresholds on per-well metrics retained 3,716 of 5,264 wells (70.6%) and isolated 7 systematically degraded plates. Plate08 was a sequencing/alignment failure, Plate12 a library-prep failure, Plate14 multi-modal, plates 01/09/10 high-mito only.

The most diagnostic finding of the QC step is that all 28 Staurosporine wells failed on high mitochondrial fraction. Staurosporine is a pan-kinase inhibitor whose dose range here is overtly apoptotic, and apoptosis releases mitochondrial RNA into the cytoplasm. The QC threshold did exactly what it should — it flagged the wells whose transcriptional state is incompatible with a steady-state DE comparison to DMSO. The "failure" is biology.

Plate-effect correction (CPM+log1p with OLS plate regression and Pearson-residual VST) collapsed the DMSO PC1–PC2 plate silhouette to –0.39 and dropped |r(PC1, total UMI)| from 0.733 to 0.025. Crucially, UMI regression was not propagated to compound wells — that preserves the cytotoxicity-driven depth signal that some MoAs require.

Per-plate QC pass rates across the 14 plates
Per-plate QC pass rates. Seven plates (01, 08, 09, 10, 12, 14) retain <80%; Plate08 and Plate12 are dominated by single distinct failure modes.
DMSO PC1 versus total UMI after plate and depth regression
After plate+UMI regression, DMSO PC1 is decorrelated from sequencing depth (|r| = 0.025), confirming clean confounder removal.

Per-plate limma-voom recovers canonical biology

A structural feature of the design — each compound occupies exactly one plate — is usually a confound. Here it lets us run per-plate cell-means voom with empirical-Bayes shrinkage on ~181 residual degrees of freedom per plate, eliminating the need for a plate covariate inside any compound contrast. The model produced ~29.1M per-test records (399 compounds × 6 doses × 14,429 genes).

The positive controls came back exactly as the literature predicts:

AnchorBest doseSig. genes (FDR<0.05)Hallmark recoveryCanonical interpretation
BFA1–10 µM4,865–5,907HSPA5/BiP, DDIT3/CHOP, ATF3UPR / ER stress (ARF-GEF inhibition)
TSA0.1–1 µM4,607–4,733HDAC chromatin, CCND1, CDKN1APan-HDAC inhibition
Rigosertib0.1–10 µM4,310–5,242PLK1, CCNB1, CDK1, CDC20G2/M & PLK1 / cell-cycle

Each anchor recovered 12–14 of 15 curated hallmark genes. TSA at 10 µM failed QC, consistent with HEK293 cytotoxicity at high HDAC dose.

Dose-response of curated hallmark genes for the three evaluable positive controls
Dose-response of curated hallmark genes for the three evaluable positive controls — monotonic and on-target.
Top-50-gene heatmap across positive controls
Top-50-gene heatmap across positive controls. Distinct UPR (BFA), HDAC (TSA), and cell-cycle (Rigosertib) blocks.
BFA 10 µM volcano plot
BFA 10 µM volcano — 5,907 significant genes, UPR signature dominant.

Compound-level clustering and NMF: independent recovery of MoA

The 399 compound-level signatures were embedded and clustered with Leiden, k-means, and hierarchical at multiple resolutions. Leiden was the most representation-stable (ARI 0.58–0.82 between dose-aggregated and dose-resolved inputs), and grouped all three evaluable positive controls in cluster 0 at every resolution.

The translationally meaningful detail is in the neighbours. Rigosertib's top-5 nearest neighbours by cosine similarity are exclusively microtubule-destabilising agents (colchicine 0.521, thiocolchicine 0.397, podophyllotoxin 0.381, lexibulin 0.374, plinabulin 0.334). This is a textbook independent reproduction of the Jost et al. 2017 CRISPRi/a result that Rigosertib is a microtubule destabiliser, not (only) a PLK1 / RAS-mimetic — and it falls out of an unsupervised analysis with no tubulin prior.

Independent NMF at rank-15 (cosine seed-stability 1.000, 65.3% variance explained) produced 15 reproducible programs. The biology re-emerged through a different lens:

  • P01 (G2M / E2F / MYC, GSEA NES=3.08, FDR<0.001) — top compound activity from Rigosertib (1.074); BFA also lands here (4th highest).
  • P02 (chromatin / TSA-specific) — top loadings include CCND1, CDKN1A, HDAC1.
  • P11 + P14 (translation initiation / elongation) — capture ribosome biology with danusertib and ancitabine as top compounds.

The same biology arrives by Leiden clustering and by NMF program decomposition. That redundancy is what gives the next steps confidence to commit.

Leiden clusters in UMAP space
Leiden clusters in UMAP space. Positive controls and tubulin binders co-localise; cluster 12 (antimetabolites) sits as a distinct island.
Rigosertib's top nearest neighbours dominated by tubulin-destabilising agents
Rigosertib's top nearest neighbours are dominated by tubulin-destabilising agents — unsupervised reproduction of the published CRISPR result.
Compound by NMF program activity heatmap
Compound × NMF program activity. Cell-cycle (P01), chromatin (P02), and translation (P11/P14) blocks are visible.
GSEA Hallmark NES across NMF programs
GSEA Hallmark NES across NMF programs — P01/P02/P11/P14 are the interpretable MoA backbone.

A defensible composite cleanliness score

The translational question is not "which compound is loudest?" — it is "which compound has a clean dose-response that a chemist can prosecute?" We built a 5-component cleanliness score:

ComponentWeightWhat it captures
Dose-response monotonicity25%Spearman r(log dose, mean |logFC|)
Gene-level monotonicity25%Per-gene monotonic recruitment across doses
Replicate concordance20%Within-dose Pearson between replicates
Signature strength at best dose20%Number of FDR<0.05 genes; max |logFC|
SNR vs DMSO sham null10%Bootstrap from 700 DMSO half-splits

The DMSO sham null (50 random half-splits × 14 plates = 700 samples) was deliberately conservative on plates with few DMSO wells (Plate08 n=2, Plate12 n=3 inflate null variance). That motivated the down-weighted SNR contribution — monotonicity, not raw magnitude, drives the composite.

All three evaluable positive controls land in the top 9% of the library: BFA rank 16/399 (0.839), TSA rank 17/399 (0.837), Rigosertib rank 38/399 (0.763). The top library compound, ouabain (0.969), has Spearman r = 1.0 between log-dose and signature magnitude across all 6 doses.

Rank-ordered cleanliness waterfall for all 399 compounds
Rank-ordered cleanliness waterfall for all 399 compounds. Positive controls (highlighted) sit in the top 9%.
Positive-control spotlight: per-component scores
Positive-control spotlight: per-component scores for BFA, TSA, Rigosertib across the 5 axes.

Top-15 cleanest library compounds (positive controls excluded)

#CompoundCompositeBest dose (µM)Sig. genesPutative target
1Ouabain0.969105,347ATP1A1 (Na⁺/K⁺-ATPase)
2Emetine0.956105,275EIF4A / 40S ribosome
3(S)-Camptothecin0.9430.11,647TOP1
4BI-25360.922101,112PLK1
5Mitoxantrone0.8980.31,359TOP2A
6PMA0.89732,266PKC family
7Nanchangmycin0.87432,919Unknown
8Irinotecan0.8660.31,414TOP1
9Vistusertib0.8641564mTORC1/2
10Volasertib0.8623634PLK1
11Methotrexate0.85510820DHFR
12Rabusertib0.8530.3131CHEK1
13Calcimycin0.848Ca²⁺ ionophore
14DactolisibPI3K/mTOR
15BorussertibAKT

The top of the cleanliness ranking is dominated by mechanistically potent agents — Na⁺/K⁺-ATPase, protein synthesis, TOP1/TOP2, PLK1, mTOR, PKC, DHFR, CHEK1.

Pathway enrichment & DGIdb target mapping

DGIdb queries recovered canonical targets for all evaluable positive controls: TSA → HDAC1–11 (DTC + GuideToPharmacology, 27 PMIDs), Rigosertib → PLK1/PIK3CA/KRAS-NRAS (TTD, TALC, CKB-CORE, CIViC). BFA's GBF1/Sec7-domain ARF-GEF target is absent from DGIdb — a known database gap, not a recovery failure.

Cluster-level enrichment is structurally weak: averaging 28–30 mechanistically diverse compounds cancels MoA-specific signal. Cluster 12 is the only cluster reaching FDR<0.05 at the consensus level — and it does so emphatically, with 10 GSEA terms FDR<0.001 spanning KEGG_RIBOSOME, REACTOME_RESPONSE_OF_EIF2AK4_GCN2_TO_AMINO_ACID_DEFICIENCY, and REACTOME_EUKARYOTIC_TRANSLATION_ELONGATION. The same biology is recovered independently by NMF programs P11 and P14. This triple convergence (cluster + two NMF programs) is the single strongest biological theme in the screen.

Of 53 target nominations across top-ranked compounds and cluster representatives:

  • 60% had ≥2 independent DGIdb sources for direct biochemical interaction.
  • 19% had a single direct source.
  • 15% relied on pharmacological-class inference.
  • 6% remained unknown (nanchangmycin, BO-264).
Positive-control target recovery via DGIdb
Positive-control target recovery via DGIdb. TSA → HDAC family and Rigosertib → PLK1/PIK3CA/RAS triangulate canonically.
Cluster-level target family annotations
Cluster-level target family annotations — 5 of 14 clusters carry coherent pharmacological themes.

Tiering: 26 leads, 75 hypotheses, 309 deferred

The integration step applies a conjunctive policy:

  • Tier 1 (n=26) — top-quartile cleanliness AND coherent cluster MoA AND target evidence (or positive-control anchor).
  • Tier 2 (n=75) — top-quartile cleanliness, but cluster heterogeneity OR missing target annotation prevents triangulation.
  • Tier 3 (n=309) — below-cleanliness or QC-excluded.

Tier 1 spans canonical oncology MoA backbones — TOP1 (camptothecin, irinotecan), PLK1 (BI-2536, volasertib), mTOR (vistusertib), DHFR (methotrexate), CHEK1 (rabusertib), PI3K/mTOR (dactolisib), AKT (borussertib), microtubule (multiple). All four positive controls placed in Tier 1, with Staurosporine flagged QC-excluded.

Tier distribution across the library
Tier distribution across the library: 26 / 75 / 309. Conservative by design.
Tier 1 compound by score-component heatmap
Tier 1 compound × score-component heatmap. Cleanliness drivers vary by MoA (signature strength dominates for cytotoxics; monotonicity dominates for kinase inhibitors).
Per-cluster tier composition
Per-cluster tier composition. Cluster 12 (antimetabolites) has zero Tier 1/2 members despite the strongest enrichment signal in the screen — a structural under-ranking.

Top Tier-2 novel-MoA candidates

CompoundScoreClusterTop NMFTarget / hypothesisWhy Tier 2
Ouabain0.9698 — DNA Damage / WNT / MixedP01 (G2M)ATP1A1 confirmed (multi-source)Cluster 8 heterogeneity
Emetine0.9561 — KRAS-Dn / Diverse (unannotated)P01 (G2M)EIF4A / ribosomeCluster unannotated
Mitoxantrone0.8984 — ECM / GPCR / Ion-channelP01 (G2M)TOP2ACluster mixed
PMA0.8978 — DNA Damage / WNT / MixedP15 (WNT-βcat)PRKC familyCluster heterogeneity
Nanchangmycin0.8748 — DNA Damage / WNT / MixedP01 (G2M)UnknownTarget unknown
Cevipabulin0.8404 — ECM / GPCR / Ion-channelP01 (G2M)Tubulin (VDA class)Cluster mixed
Omaveloxolone0.82313 — Pro-inflammatory / TGF-βP02 (chromatin)NRF2 / NFE2L2 activatorCluster unannotated
PF-4777360.8059 — ErbB / GAG / IL6-JAK-STATP06 (KRAS-Dn)CHEK1 / WEE1Cluster heterogeneity
Capivasertib0.8039 — ErbB / GAG / IL6-JAK-STATP06 (KRAS-Dn)AKT1/2/3Cluster heterogeneity
GSK23344700.8019 — ErbB / GAG / IL6-JAK-STATP06 (KRAS-Dn)PDK1Cluster heterogeneity
Plinabulin0.7989 — ErbB / GAG / IL6-JAK-STATP01 (G2M)TubulinCluster heterogeneity
Mocetinostat0.7939 — ErbB / GAG / IL6-JAK-STATP06 (KRAS-Dn)HDACCluster heterogeneity
Bimiralisib0.7659 — ErbB / GAG / IL6-JAK-STATP06 (KRAS-Dn)PI3KCluster heterogeneity

Tier 2 is the highest-confidence pool of novel-MoA hypotheses where cluster heterogeneity (not biology) blocked Tier 1 placement. These are the rows a translational lead should circle.

Translational hypotheses generated by this analysis

This is the section a discovery lead actually cares about. The pipeline produced named, falsifiable hypotheses that map directly onto downstream decision points — orthogonal validation, target deconvolution, indication selection, and screen redesign.

  1. Rigosertib's microtubule activity is the dominant transcriptional driver in HEK293. Rigosertib's nearest neighbours are exclusively tubulin destabilisers. This is independent transcriptomic confirmation of the Jost 2017 CRISPR result and argues that biomarker strategies for Rigosertib (currently MDS/AML) should incorporate tubulin-binder co-response signatures rather than relying solely on PLK1/RAS readouts.
  2. Ouabain has a translatable signature beyond cardiac-glycoside cytotoxicity. Highest cleanliness score in the entire library (0.969), confirmed ATP1A1 target, but cluster 8 is mixed. The combination of clean dose-response with strong G2M program activity suggests ATP1A1 modulation re-purposable for cell-cycle-driven indications is worth following up — particularly given recent literature on Na⁺/K⁺-ATPase as an oncology target. Compound-level (not cluster-level) MoA scoring would have moved this to Tier 1.
  3. Cluster 12 antimetabolites converge on GCN2/eIF2AK4 amino-acid- deficiency response. Triple-convergent biology (cluster + P11 + P14): methotrexate, 5-azacytidine, amsacrine, ancitabine. The integrated stress response via GCN2 is a known but under-prosecuted oncology axis; this screen identifies which library scaffolds engage it most cleanly. Therapeutic implication: combination with GCN2 modulators or amino-acid-restriction strategies in tumour types with elevated translation flux.
  4. Cluster 9 is an actionable AKT/PI3K/mTOR pharmacology hub. Capivasertib, GSK2334470, bimiralisib, mocetinostat and 11+ other clinical-stage kinase inhibitors co-cluster with coherent KRAS-Dn (P06) program activity. For a translational team, this is a ready-made co-response panel for AKT/PI3K- pathway combination scouting — the cluster effectively behaves as a computational synthetic-lethal screen of pathway redundancy.
  5. Omaveloxolone's NRF2 signature is recoverable from a 2-replicate well. Cleanly placed in cluster 13 (pro-inflammatory / TGF-β) with P02 chromatin-program loading. NRF2 activators are a hard-to-screen MoA because the readout is often pathway-restricted; DRUG-seq-style transcriptional screening looks like a viable surrogate for NRF2 functional confirmation in a high- throughput format.

What this analysis tells you about screen design

Two structural lessons that biopharma teams running their own DRUG-seq programs should internalise:

  1. Compound-plate confounding is a feature when exploited correctly. Per-plate cell-means voom with EB shrinkage gives you stable variance from n=2 replicates that no pooled-plate model can match. Fix one compound per plate; don't randomise across plates.
  2. Monotonicity-weighted cleanliness penalises cytostatic MoAs. Cluster 12 — the most enrichment-significant theme in the entire screen — has zero Tier 1/2 members because antimetabolite trajectories plateau. Run a parallel cytostatic scoring track (e.g. signature divergence onset + plateau strength) to surface this MoA class.

And one limitation that should be stated plainly: HEK293 is not a disease-relevant cell line. Every biological inference here is a transcriptional MoA hypothesis, not an indication claim. The natural next step is to replay the Tier 1 / top Tier 2 set in two or three disease-relevant lines (e.g. cancer panels for the cell-cycle/topoisomerase compounds; iPSC-derived cardiomyocytes for ouabain) and ask whether the cleanliness ranking transfers.

The pipeline reconstructed canonical MoA biology for three of four positive controls through five independent computational routes — DE volcanoes, Leiden cluster co-assignment, NMF program loadings, GSEA enrichment, and DGIdb target lookup. That redundancy is what licenses the downstream tiering: when a Tier 1 call is made, it rests on five concordant pieces of evidence, not one. When a Tier 2 hypothesis is proposed (ouabain, emetine, omaveloxolone), the gap to Tier 1 is named — cluster heterogeneity, missing target annotation, plateau response — and therefore actionable.

For biopharma translational teams, the value of a DRUG-seq screen is not the count of significant genes per compound. It is the size and quality of the falsifiable hypothesis pool the screen generates. This analysis produced 26 high-confidence leads, 75 named hypotheses, and a clearly-flagged structural blind spot — the right shape of output for a translational portfolio decision.

Acknowledgements. This re-analysis builds on the publicly released Ginkgo Datapoints GDPx4 DRUG-seq dataset. The original screen is the foundation; everything here is a downstream re-read.

Interested in this kind of analysis?

See how Inflexa runs end-to-end DRUG-seq pipelines from raw counts to tiered, target-annotated hypothesis tables.