Building intelligent RAG & agentic AI systems.

I help enterprises build AI that actually works in production. Ahad Khan, Agentic AI Engineer at Capgemini, turning RAG prototypes into reliable, hallucination-free pipelines.

View Projects Read Blog

Core Technologies

Python

FastAPI

MCP

RAG

LLMs

Azure

LangGraph

Docker

Tool Calling

SQL / DB

Data Pipelines

Currently Building

Featured Project

Agentic RAG for Manufacturing

Multi-agent RAG system for manufacturing Q&A achieving 90%+ retrieval accuracy. Router → Retriever → Generator pipeline answering queries from equipment manuals and safety SOPs — fully offline, zero cloud dependency.

PythonFastAPIMilvusOllamaLangChain

View Case Study

Recent Work

View all projects ->

AI Gym Memory System

Conversational workout tracker with sub-second intent extraction via Gemini Flash. Log exercises in natural language and query history semantically — 'What did I train last Tuesday?' — with ChromaDB vector retrieval.

PythonFastAPIGemini Flash

Document Intelligence RAG

Enterprise RAG system handling 10K+ document pages with Azure AI Search. Hybrid retrieval (BM25 + semantic) reduced hallucinations by 85% while maintaining sub-2s response times. Source-attributed answers via GPT-4.

PythonFastAPIAzure AI Search

MiA-RAG: Mindscape-Aware RAG

Paper-accurate implementation of Mindscape-Aware RAG (arXiv:2512.17220) achieving +12% recall over baseline retrievers. Uses MiA-Emb-0.6B with hierarchical summarization and residual score fusion for context-enriched retrieval.

PythonStreamlitPyTorch

Latest Posts

View all posts ->

LLMAgentic AI

I Run an AI Agent on a VPS. Here's My Actual Setup

A walkthrough of my real OpenClaw deployment: 13 Telegram topics, GPT-5.2 on Azure free tier, Playwright browser automation with anti-bot bypass, and a massive ecosystem of over 5,700 ClawHub skills powering my Second Brain. Pulled directly from my live droplet.

Mar 5

Zero-Cloud Agentic AI: Running Milvus and Local LLMs On-Prem

Sending sensitive internal data to closed APIs wasn't an option. Here is the exact architecture I used to build a fully local, autonomous agentic pipeline using Milvus, Ollama, and open-source embeddings.

Mar 5

LLMAgentic AI

How I Set Up an On-Prem Agentic AI Stack with Open-Source Embeddings and Fully Local Inference

A practical guide to building a fully on-prem agentic AI system using open-source embeddings and local LLM inference — no APIs, no cloud, complete data control.

Mar 3