📱🤖 Create Stunning AI Images Directly from WhatsApp with Gemini
This workflow transforms your WhatsApp into a personal AI image generation studio. Simply send a text message with your idea, and this bot will use the advanced prompt engineering capabilities of Gemini 2.5 Pro to craft a detailed, artistic prompt. It then uses Gemini 2.0 Flash to generate a high-quality image from that prompt and sends it right back to your chat.
It's a powerful yet simple way to bring your creative ideas to life, all from the convenience of your favorite messaging app!
What this workflow does
-
Listens for WhatsApp Messages : The workflow starts automatically when you send a message to your connected WhatsApp number.
-
Enhances Your Idea with AI : It takes your basic text (eg, "a knight on a horse") and uses Gemini 2.5 Pro to expand it into a rich, detailed prompt perfect for image generation (eg, "A cinematic, full-body shot of a stoic knight in gleaming, ornate silver armor, riding a powerful black warhorse through a misty, ancient forest. The scene is lit by ethereal morning sunbeams piercing the canopy, creating dramatic volumetric lighting and long shadows. Photorealistic, 8K, ultra-detailed, award-winning fantasy concept art.").
-
Generates a Unique Image : It sends this enhanced prompt to the Google Gemini 2.0 Flash image generation API.
-
Prepares the Image : The API returns the image in Base64 format, and the workflow instantly converts it into a binary file.
-
Sends it Back to You : The final, high-quality image is sent directly back to you in the same WhatsApp chat.
Nodes Used
-
🟢 WhatsApp Trigger : The entry point that listens for incoming messages.
-
🧠 LangChain Chain (LLM) : Uses Gemini 2.5 Pro for advanced prompt engineering.
-
➡️ HTTP Request : Calls the Google Gemini 2.0 Flash API to generate the image.
-
🔄 Convert to File : Converts the Base64 image data into a sendable file format.
-
💬 WhatsApp : Sends the final image back to the user.
Prerequisites
To use this workflow, you will need:
- An n8n instance .
- A WhatsApp Business Account connected to n8n. You can find instructions on how to set this up in the n8n docs .
- A Google Gemini API Key . You can get one for free from Google AI Studio .
How to use this workflow
-
Get your Google Gemini API Key : Visit the Google AI Studio and create a new API key.
-
Configure the Gemini 2.5 Pro Node :
- In the n8n workflow, select the Gemini 2.5 Pro node.
- Under 'Connect your account', click 'Create New' to add a new credential.
- Paste your Gemini API key from the previous step and save.
-
Configure the Generate Image (HTTP Request) Node :
- Select the Generate Image node.
- In the properties panel on the right, find the Query Parameters section.
- In the 'Value' field for the
key
parameter, replace "Your API Key"
with your actual Google Gemini API Key.
-
Connect WhatsApp :
- Select the WhatsApp Trigger node.
- Follow the instructions to connect your WhatsApp Business Account credential. If you haven't created one, the node will guide you through the process.
-
Activate and Test :
- Save the workflow using the button at the top right.
- Activate the workflow using the toggle switch.
- Send a message to your connected WhatsApp number (eg, "A futuristic city in the style of Van Gogh"). The bot will process your request and send a stunning AI-generated image right back to you!