Publications
Publication artifacts from the platform's investigations.
Artifacts
| Title | Type | Investigation | Status |
|---|---|---|---|
| SAE Replication Crisis — preprint draft | Paper (preprint) | Investigation #3 | Draft — needs multi-model + GPU scale-up |
| SAE Replication Crisis — tweet thread | 12-tweet thread | Investigation #3 | Draft ready to post |
Publication pipeline
The platform is designed to go from investigation to artifact with minimal friction:
- Run the investigation: `mech run` / `mech sweep`
- Generate the report: `mech report --run-id <id>`
- Write the narrative: under `docs/investigations/`
- Produce artifacts: paper draft + tweet thread under `docs/publications/`
The SAE replication crisis is the first investigation to reach the publication stage. The refusal audit (Investigation #1) and factual recall story (Investigation #2) are candidates for the next round once the platform's multi-model sweep infrastructure is exercised at publishable scale.
Scale gap
The current results cover a single model, a single corpus, and a small dictionary size. The publishable-minimum version of the SAE replication crisis requires:
- 3+ models (GPT-2 small, GPT-2 medium, Pythia-1.4B at minimum)
- 5 layers per model
- 4 dictionary sizes (128, 256, 512, 1024)
- 20+ seeds
- ~1200 training runs, ~10 GPU-hours on A100
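The run count above follows from multiplying out the sweep dimensions. A minimal sketch of that arithmetic, assuming exactly 3 models, 5 layers, 4 dictionary sizes, and 20 seeds (the stated minimums); the model labels and the `sweep_grid` helper are illustrative names, not part of the `mech` CLI:

```python
from itertools import product

# Minimum values from the list above. The model labels are illustrative
# strings, not identifiers the platform is known to accept.
MODELS = ["gpt2-small", "gpt2-medium", "pythia-1.4b"]
LAYERS_PER_MODEL = 5                 # which layers to probe is not specified here
DICT_SIZES = [128, 256, 512, 1024]
N_SEEDS = 20


def sweep_grid():
    """Return one (model, layer_slot, dict_size, seed) tuple per training run."""
    return list(
        product(MODELS, range(LAYERS_PER_MODEL), DICT_SIZES, range(N_SEEDS))
    )


grid = sweep_grid()
total_runs = len(grid)               # 3 * 5 * 4 * 20

# Average budget per run implied by the ~10 GPU-hour estimate.
gpu_hours = 10
seconds_per_run = gpu_hours * 3600 / total_runs

print(total_runs, seconds_per_run)   # prints: 1200 30.0
```

At these minimums the ~10 GPU-hour estimate implies an average of roughly 30 seconds per training run; adding models or seeds beyond the minimum scales the total linearly.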
The analysis pipeline (`mech analyze-sae-stability --live-only`) is ready.
What remains is compute and model diversity.