Research
AI systems close to the machine.
I work around the point where AI ideas meet practical systems: memory limits, local hardware, inference runtimes, agent workflows, and the tooling needed to study them.
The work moves between papers, implementation probes, and research tools, with a bias toward things that can be tested on real machines.
Open questions
The thread keeps moving.
- What can local constraints reveal about useful AI systems?
- How should inference systems handle memory pressure and long context?
- How can agent workflows become more reliable without becoming opaque?