DenseNet121 chest X-ray classifier compressed 51.43% (6.97M → 3.38M parameters) at constant accuracy (AUROC +0.0003). Compression makes channel attribution atomic enough for surgical correction: 5-channel weight zeroing reduces a target false positive by ∆prob −0.13 with zero true-positive loss and exactly zero AUROC change on the other 13 pathologies. Polarized channels reframed as bipolar discriminative axes exploiting label mutual exclusivity (Jaccard < 0.1 in 89 of 100 polarized channels, zero architectural conflicts).
Insights on intelligence
Works
Diagnose transformer prediction failures from weights alone, then correct them by retraining one layer. GPT-2 capital-city accuracy 2/8 → 8/8 with zero side effects, achievable through any of attention, FFN, or V-only (590K params).
Layer-level analysis of BERT on the five GLUE tasks, applying a forward-primary learning framework. Three findings: (1) separability-guided layer skip with compensation classifier — lossless compression on 3 of 5 GLUE tasks; (2) FFN's role decomposed as 92% structural (norm normalization) and 8% classification, explaining why FFN removal hurts even when individual layers look classification-harmful; (3) 60–93% of misclassifications are high-confidence errors — the BERT CLS vector itself is the fundamental limitation.
A redefinition of narrative as the spatiotemporal design of cognitive energy. Same five-dimensional measurement applies across text, music, dance, education, art, architecture, and UI.
Notes
- 2026-05-12
Why one person can now go broad and deep
For two centuries, you couldn't be both broad and deep. You had to choose. That trade-off is collapsing — not because anyone is suddenly smarter, but because the cost of operating in a field has dropped by an order of magnitude. A note from one bottega trying to live in that change.
- 2026-05-10
Deep learning is editable
We treat deep learning as something you train, not something you fix. That's a habit, not a fact. I edited 590,000 parameters in GPT-2 — half a percent of the model — and corrected six wrong answers without damaging anything else.