CORTEX

Patient Stratification

Identify molecularly distinct subgroups within a clinical cohort and characterize what makes each one different. Cortex performs dimensionality reduction, unsupervised clustering, and subgroup-specific differential analysis to reveal stratification that may not be visible from clinical variables alone. Each stratum is characterized by its distinguishing molecular features, enriched pathways, and, critically, whether those molecular differences translate to outcome differences.

What You Get

Deliverables

Subgroup definitions with molecular features

PCA-based dimensionality reduction reveals ancestry-driven sample separation in the TCGA-BRCA cohort. Each subgroup is characterized by its distinguishing genomic and transcriptomic features: African American patients show higher TP53 mutation rates (46% vs 31%), elevated 16p loss (OR 3.87), and enriched IFNα response pathways (NES +1.97). European American patients show higher PIK3CA rates (37% vs 21%) and different pathway profiles.

PC2 8.1%PC1 12.3%-101234-2-1012345AA (n=120)EA (n=200)

Survival analysis across subgroups

Kaplan-Meier and Cox regression analysis tests whether molecular subgroups correspond to outcome differences. For the ancestry comparison, the result is a null survival effect after clinical adjustment (HR 1.046, p=0.837), a critically important finding. This demonstrates that molecular stratification must be interpreted carefully: not every molecular difference drives clinical outcomes. The platform reports this with the same rigor as positive findings.

Surv. prob.Months0.00.20.40.60.81.0020406080100120AAEA

Distinguishing features per stratum

Immune checkpoint markers show large effect sizes between groups: OX40/TNFRSF4 (Cohen's d=0.82), PD-1, CTLA-4, and LAG-3 all elevated in African American patients. The prognostic signature overlap between groups is remarkably low, only 1.2% (2 shared genes out of 167), suggesting that population-specific biomarker panels may be needed rather than one-size-fits-all approaches.

CDH1PIK3CATP530%10%20%30%40%50%5%21%46%15%37%31%AAEA
DECISION ENABLED

Define inclusion criteria for enrichment trials or identify subpopulations for targeted therapies. Use molecular stratification to design more precise clinical trials, but only when molecular differences translate to meaningful outcome differences.

Sample Output

Breast cancer ancestry stratification: PCA, genomics, survival

PCA: Ancestry-Driven Sample Separation1,084 patients
PC2 (8.1% variance)PC1 (12.3% variance)-101234-2-1012345African AmericanEuropean American
Subgroup Molecular Featuresper-ancestry comparison
GroupTop PathwayTP53PIK3CA
African AmericanIFNα response (NES +1.97)46%21%
European AmericanProtein secretion (NES -1.93)31%37%
Distinguishing Genomic Featuresmutations + CNA
FeatureEffectDirectionType
16p lossOR 3.87AA > EACNA
TP53 mutation46% vs 31%AA > EAMutation
PIK3CA mutation21% vs 37%EA > AAMutation
Immune Checkpoint DifferencesCohen's d
MarkerCohen's dDirection
OX40/TNFRSF4+0.82AA > EA
PD-1+0.65AA > EA
CTLA-4+0.58AA > EA
LAG-3+0.54AA > EA
Survival Analysis: Overall Survival by AncestryKaplan-Meier
Survival probabilityMonths00.20.40.60.81020406080100120African AmericanEuropean American

HR 1.046 (p=0.837): null survival difference after clinical adjustment. Molecular differences do not always translate to outcome differences. Prognostic signature overlap: 1.2% (2 shared genes out of 167).

ATOPIC DERMATITIS PATIENT SUBTYPES (GSE157194)

AD Patient Clusters: 3 Molecular Subtypes57 patients · silhouette 0.131
C1: Low-inflammation34%C2: Moderate41%C3: High-inflammation25%
ClusterLabelPatientsKey Feature
C1Low-inflammation19 (34%)Minimal barrier disruption
C2Moderate23 (41%)Mixed Th2/Th17 signature
C3High-inflammation14 (25%)Strong S100A/SPRR upregulation

C1 (low-inflammation, 34%) represents a potential non-responder subgroup. These patients may not benefit from aggressive immunosuppression.

ATG7 MICROGLIA BIFURCATION (GSE286094-GSE286095)

Homeostatic Microglia State TransitionHM1 depletion / HM2 enrichment
logFCHM1 (MAPK/PI3K hyperactive)HM2 (p53/NFκB senescent)-2.5-2-1012-2.00+1.21
ATG7: Immune Cell Composition Shiftsdeconvolution
Cell TypeChangeOdds RatioDirection
Neutrophil-45%0.55decreased
T cell+76%1.76increased

ATG7 knockout drives HM1 (MAPK/PI3K hyperactive) depletion (logFC -2.00, 806 cells lost) with compensatory HM2 (p53/NFκB senescent) enrichment (+1.21, 541 cells gained). Peripheral immune: neutrophil loss (-45%, OR 0.55) paired with T cell infiltration (+76%, OR 1.76), a shift from innate to adaptive immunity.

How It Works

Methodology

STEP 1

Dimensionality reduction

PCA on normalized expression data reveals the major axes of variation in the dataset. For the 1,084-patient TCGA-BRCA cohort, the first two principal components captured 12.3% and 8.1% of variance respectively, with ancestry as a visible separation axis.

STEP 2

Unsupervised clustering

Clustering algorithms identify molecularly distinct groups without relying on known labels. The resulting clusters are compared against clinical annotations (ancestry, subtype, stage) to assess what drives the molecular separation.

STEP 3

Subgroup characterization

Each subgroup is profiled for differentially expressed genes, enriched pathways, mutation frequencies, copy number alterations, and immune marker levels. This produces a comprehensive molecular fingerprint for each stratum.

STEP 4

Survival analysis with covariate adjustment

Cox regression models test whether subgroup membership predicts outcomes after adjusting for clinical covariates (stage, grade, treatment). This separates molecular associations from clinical confounders.

STEP 5

Feature importance ranking

Features that best distinguish each stratum are ranked by effect size, statistical significance, and biological interpretability. This prioritized list guides downstream validation experiments and biomarker panel design.

Who This Is For

Target personas

Clinical trial designer

Use molecular stratification to define enrichment criteria and design more precise inclusion/exclusion criteria.

Precision medicine lead

Identify subpopulations with distinct molecular profiles that may respond differently to targeted therapies.

Biostatistician

Assess whether molecular subgroups translate to outcome differences with rigorous covariate-adjusted survival analysis.