// Live Demo
A production multi-tenant AI agent platform with RAG, intelligent handoffs, and booking automation. Try the live demo below — it's trained as "Rapid Flow Plumbing" to showcase real-world capabilities.
Built by Alex Turner · Self-hosted on AWS
// Interactive Demo
This is a fully functional AI assistant trained as a fictional plumbing business. Ask about services, pricing, or try booking an appointment.
The agent uses RAG (Retrieval-Augmented Generation) with Gemini embeddings stored in Supabase's pgvector. It searches a knowledge base of 16 entries about "Rapid Flow Plumbing" services, then generates responses using Claude/GPT/Gemini via dynamic LLM routing.
Status: ● ONLINE
// Architecture
Multi-LLM Support: The system dynamically routes between Claude (Anthropic), GPT-4 (OpenAI), Gemini (Google), and Llama 3.3 (via Groq) based on client configuration. RAG embeddings use Gemini's 768-dimensional model for semantic search. Full source available on GitHub.