I Rebuilt Qwen3 From Scratch and Pretrained It on a University Supercomputer

Act 1 — The Hook

Placeholder spine. The five acts go here. The first time a glossed term appears, wrap it in <Term id="...">term</Term> — for example, models learn by predicting the next token.

Act 2 — Architecture

Component-by-component reconstruction goes here.

Act 3 — Data

13B-token curated pipeline goes here.

Act 4 — Training

Longleaf A100 run, loss curves, ablations go here.

Act 5 — What I’d Do Differently

Self-critique goes here.