RAG Knowledge Assistant
Production-ready retrieval pipeline with hybrid search, reranking, and cited responses. Handles multi-format ingestion and streams answers under 200ms TTFT.
available for new projects
I build production-grade LLM applications — retrieval pipelines, agentic workflows, and fine-tuned models that ship real value. Currently exploring the edges of context engineering and multi-modal RAG.
01. stack
A pragmatic toolkit for taking LLM ideas from notebook to production — model orchestration, retrieval, evals, and the infrastructure to glue it all together.
02. work
A few things I've built. Each one solved a real problem and taught me something I now bring to the next one.
Production-ready retrieval pipeline with hybrid search, reranking, and cited responses. Handles multi-format ingestion and streams answers under 200ms TTFT.
Multi-step agent runtime with tool calling, memory, and human-in-the-loop checkpoints. Built for reliability with full trace logging and replay.
03. contact
Working on an LLM problem? Want to collaborate? Drop a message and I'll get back to you within a day.