Multi-Modal WhatsApp AI Agent Workflow by rafay ahmad hussainMulti-Modal WhatsApp AI Agent Workflow by rafay ahmad hussain

Multi-Modal WhatsApp AI Agent Workflow

rafay ahmad  hussain

rafay ahmad hussain

Multi-Modal WhatsApp AI Agent Workflow in n8n (Audio, Image, Text)
This is a multi-modal AI agent workflow using n8n and OpenAI for WhatsApp automation: 1.Triggered on incoming WhatsApp audio, image, or text 2.Transcribes audio and analyzes content for intent 3. Processes and interprets image-based prompts 4. Routes inputs to an AI Agent (OpenAI) with memory support 5. Responds in text or generated audio, depending on user preference 6. Tech Stack: n8n, OpenAI, Whisper, Code node, WhatsApp API
Automates real-time, intelligent user interactions across media types.
Like this project

Posted Jan 30, 2026

Multi-Modal WhatsApp AI Agent Workflow in n8n (Audio, Image, Text) This is a multi-modal AI agent workflow using n8n and OpenAI for WhatsApp automation: 1.Tr...