AI Engineer.
Systems
Architect.
Building AI-native infrastructure, LLM agent systems, and production-grade platforms. Currently within the G42 / Presight AI ecosystem in Abu Dhabi.
How I can
help your team.
Design and build multi-agent systems, tool-use pipelines, and autonomous workflows using Claude, OpenAI, or open-source LLMs. From POC to production-grade infrastructure.
Build retrieval-augmented generation pipelines, embedding stores, and hybrid search systems. Optimize for cost, latency, and accuracy at scale.
Generative Engine Optimization for Arabic-first markets. Help your brand appear in AI-generated answers across ChatGPT, Perplexity, and Gemini in MENA.
Articles &
Perspectives.
The tension between serving dynamic, role-aware content and keeping inference costs low isn't a framework problem — it's an architectural one. Here's how to resolve it.
Read article →Selected
Work.
Provider-agnostic agent framework with unified tool interfaces, cost tracking, and support for Claude, GPT-4, and Gemini in a single opinionated loop.
RAG pipeline using Anthropic's Contextual Retrieval method with batched chunk enhancement, reducing embedding costs by ~97% vs naive approaches.
Production voice agent platform evaluation and deployment. Compared FastRTC vs LiveKit architectures, shipping a reference implementation on Railway.
Real-time trading signal monitoring system with time-series storage, client-side visualization, and automated alerting built on Railway PaaS.
problem to solve?