I am an ML Researcher and the Head of Frontier Engineering at InstaLily.
My work focuses on the deep intersection of LLM deployment infrastructure, model compression, and high-performance algorithmic design. I specialize in designing fine-tuning pipelines (SFT, DPO) and building real-time multimodal voice agentic systems using PyTorch, vLLM, and raw CUDA.
I was a Research Assistant at the Penn Computer Assisted Surgery and Outcomes Laboratory, where I built novel Vision Transformer architectures for identifying surgical phases in robotic procedures. I hold an M.S. in Electrical Engineering from the University of Pennsylvania.
Selected Writing
- Apr 12, 2026
First Post: Site Foundation
A brief overview of the goals for this blog.