# work
what i ship.
i keep finding myself at very small companies right when they're deciding what to build. currently a stealth-stage ai startup; previously runanywhere (yc w26) and tegore-ai (yc x25). early-stage is where i'm sharpest — small teams, real users, weekly cadence.
the unstructured version of the same restless mind lives at /about; long-form project writeups at /projects.
Stealth Startup
Software Engineer · Feb 2026 → present
San Francisco · forward-deployed at an enterprise client
Building an end-to-end multi-agent AI platform that autonomously processes insurance claims — voice calls, browser automation, and LLM reasoning, orchestrated.
- LLM orchestration layer that loads claim state, plans next actions, and dispatches to specialized agents (voice / browser / sheets). Durable execution on Temporal with Redis-backed distributed locking and retry for multi-call campaigns.
- Real-time voice pipeline on ElevenLabs Conversational AI + Telnyx SIP: live payer calls with claim context dynamically injected, structured outcomes parsed back from transcripts.
- Vision-language browser agents on Playwright that drive insurance portals (Availity, Kern), fill multi-step forms, and programmatically clear MFA via the Microsoft Graph API.
- Forward-deployed at an enterprise client — owning the relationship and integrating the pipeline into legacy order processing.
# previously
past full-time and internships, newest first.
RunAnywhere (YC W26) · Software Engineer Intern
Dec 2025 — Feb 2026San Francisco, CA
Local-first inference SDK — bringing real models to the browser and the edge instead of the data center.
- Extended the core inference SDK with multi-provider model routing, optimized tokenization, and request batching. LoRA-fine-tuned models down to a footprint that fits edge devices.
- Built the on-device browser agent — VLM-driven web automation in a Chrome extension with Transformers.js, ONNX runtime over CDN, and a state-machine-first architecture that survives restricted pages and MV3 CSP.
- Shipped Android Use Agent into the SDK Playground — VLM-on-Android wired into a real interaction loop, plus x86_64 ABI support so it runs on emulators, not just hardware.
- Cross-platform Playground (Swift + Android + on-device browser) so partners and customers could touch the SDK without standing up infra.
merged into RunanywhereAI/runanywhere-sdks
shipped in RunanywhereAI/on-device-browser-agent
TypeScriptSwiftKotlinWebGPUTransformers.jsONNXVLMsPyTorchLoRATegore-AI (YC X25) · Software Engineer Intern
Oct 2025 — Dec 2025San Francisco, CA
AI tutoring that draws on a whiteboard mid-conversation. Voice and text in the same session, sharing state.
- Designed an LLM tool-calling architecture that dynamically renders interactive React components mid-conversation — so the tutor can draw a diagram or quiz the student without leaving the chat.
- FastAPI / TypeScript / Postgres backend with prompt-chaining, contextual retrieval, and concurrent workers. +27% tutor accuracy, −35% latency.
PythonTypeScriptFastAPIPostgresOpenAIElevenLabsCartesiaThe Mind Company · Machine Learning Engineer Intern
Sep 2024 — Dec 2024San Jose, CA
Real-time brain-computer interface — EEG signals to motor-imagery control, sub-50ms.
- CNN + Common Spatial Patterns classifier hit 92% on a 4-class motor imagery task. INT8 quantization let it run on a Raspberry Pi at sub-50ms inference.
- Rebuilt the signal-processing pipeline with ICA artifact rejection and adaptive bandpass filtering — +12 dB SNR. Mixed-precision PyTorch training cut iteration time 3×.
PythonPyTorchNumPyEEG / DSPRaspberry PiQuantization
# say hi
building something where this résumé would be useful?
i'm most useful around multi-agent systems, voice + browser automation, and infra for AI that has to actually run in production. write to me.
send me an email