About
Taylor Reese.
Data Science · UNC Chapel Hill · Class of 2028
What I do
I’m a sophomore studying Data Science at UNC Chapel Hill, but most of what I actually do sits closer to ML systems work — building and training models from first principles to understand how they work.
The frame I keep coming back to: using a model isn’t the same as understanding it. Most of my time is spent rebuilding architectures component-by-component, then putting them through real pretraining on real hardware to see what breaks.
Currently
Pretraining a 751M-parameter Qwen3 reconstruction on UNC’s Longleaf A100 cluster. The architecture and data pipeline are done; the full training run is underway.
Toolkit
-
LANGUAGES
- Python
- TypeScript
- Shell
-
MACHINE LEARNING
- PyTorch
- HuggingFace
- NumPy
-
INFRASTRUCTURE
- SLURM
- CUDA
- Longleaf HPC
-
DATA
- pandas
- memmap shards
- GTFS
-
WEB
- Astro
- Svelte
- Tailwind
-
GENERAL
- Git
- Make
- Obsidian
Background
I got into this through the usual path — PyTorch tutorials, then architectures-from-papers exercises. Around the same time I started using UNC’s Longleaf cluster for a project that became RQwen3, and the gap between “I read about how transformers work” and “I have a SLURM job stuck in the queue at 2am because the GRES string is wrong” clarified what I find interesting about this field.
I’m drawn to pretraining methodology, data curation, and how small-model behavior diverges from scaled-up versions of the same architecture — particularly the work coming out of groups like UNC’s MURGe-Lab.
Looking for
- Summer 2027 Machine-learning internship — research, infrastructure, or model-side.
- Beyond Graduate study in NLP / ML. Interested in groups working on pretraining methodology, data curation, and small-model behavior.
Contact
Reach out anytime.
treese2028@gmail.com