Model Performance
Three ML models (Logistic Regression, Random Forest, MLP) trained on TCGA RNA-seq data with 5-fold stratified cross-validation.
β
Best Accuracy
bal. accuracy
β
Best AUC
across cancers
β
Avg Specificity Gain
percentage points
5
Cancer Types
TCGA cohorts
3
Models
LR Β· RF Β· MLP
Specificity Improvements by Cancer Type
MLP Performance Dashboard
| Cancer | Bal. Accuracy | Specificity | Sensitivity | AUC | MCC | Architecture | Samples (T/N) |
|---|---|---|---|---|---|---|---|
| Loading⦠| |||||||
Task Γ Model Results
| Task | Model | Accuracy | Precision | Recall | ROC AUC |
|---|---|---|---|---|---|
| Loading⦠| |||||
Limitations
- Near-perfect AUC (0.99+) reflects the intrinsic separability of tumor vs. normal transcriptomes, not signature-specific discriminatory power. Random gene sets of similar size may achieve comparable performance.
- PRAD specificity (73.5%) is the lowest across cancers. Prostate adjacent-normal tissue is known to contain tumor cell contamination, creating a harder classification boundary. PRAD results should be considered exploratory.
- UCEC has only 201 samples (smallest dataset) yet achieves AUC = 1.000. This should be interpreted with caution given the small test set size per fold.
- SMOTE oversampling is applied for PRAD and BLCA within CV folds. Class weighting may be more appropriate for high-dimensional data.
All metrics are averaged over 5-fold stratified cross-validation.
Architecture is selected dynamically: 512β256β128 for datasets with
n > 600 samples, 256β128 for smaller datasets.