Image models are available in the Generate Image, Edit Image, and Upscale Image nodes.

Generation and editing models

| Model | Modes | Description |
|---|---|---|
| Nano Banana Pro | Generate, Edit | Google’s state-of-the-art image generation and editing model |
| Seedream V4.5 | Generate, Edit | ByteDance Seedream 4.5 for text-to-image and image editing |
| Nano Banana | Generate, Edit | Fast-paced, dynamic rendering model for generation and editing |
| Seedream V4 | Generate, Edit | ByteDance Seedream 4 for generating high-quality images |
| Flux 2 | Generate, Edit | Great aesthetics and prompt adherence for text-to-image generation |
| Ideogram Character | Generate, Edit | Character-focused Ideogram generation and editing |
| ChatGPT Image | Generate, Edit | OpenAI’s most advanced image model for realistic images |
| Flux Pro Kontext | Generate, Edit | Flux 1.0 Pro Kontext with text and reference image inputs for editing |
| Flux Pro Kontext Max | Generate, Edit | Premium editing with stronger prompt adherence and typography |
| Flux Pro Kontext Max Multi | Generate, Edit | Experimental multi-reference image editing for complex compositions |
| Qwen Image | Generate, Edit | Text-to-image with parallel CFG, LoRA, and turbo acceleration |
| Flux Pro 1.1 Ultra | Generate, Edit | Advanced version of Flux Pro 1.1 for detailed, realistic scenes |
| Ideogram V3 | Generate, Edit | Advanced typography handling for intricate text elements and design visuals |
| ImagineArt 1.5 | Generate | Great aesthetics and prompt adherence for high-quality images |
| Z Image Turbo | Generate | Latest image generation model from Alibaba with high-speed results |
| Seedream V3 | Generate | ByteDance Seedream 3.0 for text-to-image generation |
| Dreamina 3.1 | Generate | ByteDance Dreamina 3.1 with rich styles and enhanced prompt features |
| Flux Pro 1.1 | Generate | High-quality text-to-image offering realistic and detailed outputs |
| Flux Dev | Generate | Flexible and customizable model ideal for experimental and conceptual design |
| Recraft V3 | Generate | Text-to-image designed for vector and typography-friendly outputs |
| Minimax Image 01 | Generate | Fast, efficient text-to-image for quick, high-quality generation |
| Ideogram V2 Turbo | Generate | Fast, high-efficiency model for typography-heavy content |

Upscale models

Used in the Upscale Image node.

| Model | Description |
|---|---|
| ImagineArt Subtle | Enhances resolution with subtle refinement and natural detail |
| Topaz Upscale | Industry-leading upscaling technology enhancing texture while reducing noise |
| ImagineArt Creative | Creative upscaling with artistic detail and visual style enhancement |
| FreePik Upscaler Creative | AI-driven upscaling introducing stylistic detail and artistic enhancement |
| FreePik Precision v1 | Focuses on image fidelity, sharpness, and natural color preservation |
| FreePik Precision v2 | Advanced precision upscaling with adaptive content-aware algorithms |

For maximum multi-reference editing capability, use Nano Banana Pro, Nano Banana 2, or Seedream V4.5. These models support uploading 10+ reference images in the Edit Image node.

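
If you track model choices in your own tooling, the tables and the multi-reference guidance above can be captured in a small lookup. The following is a minimal sketch, not part of the product: `IMAGE_MODEL_MODES`, `MULTI_REFERENCE_MODELS`, `models_for`, and `supports_many_references` are hypothetical names, and the dictionary copies only a few rows from the generation and editing table.

```python
# Hypothetical lookup copied from a few rows of the tables on this page.
# This is an illustration only, not an official API or a complete list.
IMAGE_MODEL_MODES = {
    "Nano Banana Pro": {"Generate", "Edit"},
    "Seedream V4.5": {"Generate", "Edit"},
    "Flux 2": {"Generate", "Edit"},
    "Z Image Turbo": {"Generate"},
    "Flux Dev": {"Generate"},
}

# Models this page names as accepting 10+ reference images in the Edit Image node.
MULTI_REFERENCE_MODELS = {"Nano Banana Pro", "Nano Banana 2", "Seedream V4.5"}


def models_for(mode):
    """Return the sorted names of listed models that support the given mode."""
    return sorted(name for name, modes in IMAGE_MODEL_MODES.items() if mode in modes)


def supports_many_references(model):
    """Return True if the page recommends the model for multi-reference editing."""
    return model in MULTI_REFERENCE_MODELS


if __name__ == "__main__":
    print(models_for("Edit"))
    print(supports_many_references("Flux Dev"))
```

Keeping the mode data and the multi-reference list separate mirrors how the page presents them: the table states which modes a model supports, while the recommendation is a narrower note about the Edit Image node.
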

Text models power the AI Copilot node. These are large language models (LLMs) from leading AI providers, available for generating, analyzing, and transforming text within your workflows.

| Model | Use cases |
|---|---|
| Gemini 3.0 Pro Preview | Multimodal powerhouse for complex reasoning, creative text, image, and video generation |
| Gemini 2.5 Pro | Google’s model for deep text analysis, multi-step reasoning, and complex tasks |
| GPT-5.1 | OpenAI’s top-tier model for high-performance problem-solving and creative text |
| GPT-5 Mini | Optimized for fast response times; ideal for high-volume text generation |
| GPT-4o | Omni model with speed and multimodal capabilities for text, voice, and video tasks |
| Claude Sonnet 4.5 | Balanced workhorse for content generation, analysis, and general-purpose workflows |
| Grok 4 | Distinctive personality, real-time knowledge integration, and dialogue |
| GPT-5 Nano | Small, efficient model for basic text processing and simple tasks |
| GPT-4o Mini | Fast, cost-effective multimodal model for quick text generation |
| Gemini 2.5 Flash | Speed-optimized for high-quality multimodal tasks; ideal for large text content |
| Gemini 2.0 Flash | Efficient model for common tasks like blog posts, social media, and scripts |
| Gemini 2.5 Flash Lite | Compact model for rapid, low-latency text responses in real-time applications |
| Claude Opus 4.1 | Ideal for complex reasoning, creative writing, and detailed content generation |
| Claude Haiku 4.5 | Fast, cost-effective model for quick tasks: dialogue, captions, and summaries |

For workflows that require analyzing images or videos as inputs to the AI Copilot node, choose a multimodal model such as Gemini 3.0 Pro Preview, GPT-4o, or Gemini 2.5 Flash.