Quick picks by use case
Best overall
ImagineArt 2.0 — ImagineArt’s flagship proprietary model. Best photorealism, comprehension, and composition control.
Best for typography
Ideogram v3 — ~90–95% text accuracy. Best for posters, signage, and branded layouts.
Best open-source
Recraft v4 — #1 HuggingFace Arena (ELO 1172). Only model with native SVG vector output.
Fastest generation
Nano Banana 2 — ~4× faster than Nano Banana Pro, up to 14 references, 4K output.
By what you’re building
Posters, banners, and print-ready assets
Product photography and e-commerce
Priority: consistency + accuracy + material rendering
Nano Banana Pro (Google Gemini 3 Pro Image) is built for production product work: it preserves the identity of up to 5 subjects for consistent brand asset creation and uses Google Search grounding for accurate product rendering. Nano Banana 2 handles up to 14 references at near-real-time speed for high-volume batches.
Seedream v4.5 supports up to 14 references at 4K, which makes it ideal for multi-product campaign sets. Flux.2 Max accepts up to 10 references with real-time web grounding.
| Model | Reference images | Output quality | Best for |
|---|---|---|---|
| Nano Banana Pro | 5 subjects | 4K | Complex product composites |
| Nano Banana 2 | Up to 14 | 4K | High-volume batch product |
| Seedream v4.5 | Up to 14 | Up to 8K | Multi-product campaign sets |
| Flux.2 Max | Up to 10 | Up to 4MP | Premium product with grounding |
Portrait and lifestyle photography
Priority: skin fidelity + lighting + anatomy
ImagineArt 2.0 is ImagineArt’s highest-capability proprietary model and the best choice for premium portraits and lifestyle photography where the output needs to stand alongside actual photography. ImagineArt 1.5 Pro is the strong middle option at 4K native. ImagineArt 1.5 is cost-efficient for draft exploration.
Minimax Image also excels at lifelike skin rendering, with cinematic lighting inherited from MiniMax’s video model lineage.
| Model | Portrait quality | Speed | Cost tier |
|---|---|---|---|
| ImagineArt 2.0 | Flagship | Fast | Premium |
| ImagineArt 1.5 Pro | Excellent | Fast | Standard |
| Minimax Image | Excellent | Fast | — |
| ImagineArt 1.5 | Good | Fast | Lower |
Text and typography in images
Priority: spelling accuracy + layout control
Ideogram v3 is the clear leader, with ~90–95% text accuracy and support for stylized lettering, complex multi-line layouts, and multilingual text in six languages. Use the DESIGN style type for layout-aware graphic work.
ChatGPT Image 2 delivers 99%+ text accuracy across multilingual scripts including CJK and Indic, making it the strongest option when text accuracy and language coverage both matter. ChatGPT Image 1.5 (and the original ChatGPT Image) are best when your text references real-world knowledge such as logos, flags, and diagrams. Recraft v4 delivers production-quality text with native SVG output. Z Image Turbo and Qwen Image both excel at bilingual Chinese-English layouts.
| Model | Text accuracy | Multilingual | Best for |
|---|---|---|---|
| ChatGPT Image 2 | 99%+ | CJK, Indic, and more | Multilingual layouts, infographics, knowledge-grounded |
| Ideogram v3 | ~90–95% | 6 languages | Posters, packaging, brand typography |
| ChatGPT Image 1.5 | Superior (dense) | Limited | Infographics, fast knowledge-grounded |
| Recraft v4 | Production-grade | Limited | Design, SVG, signage |
| Qwen Image | Excellent (complex) | ZH + EN | Bilingual layouts, East Asian markets |
| Z Image Turbo | Excellent (lowest WER) | EN + ZH | Bilingual advertising, fast iteration |
| Flux.2 Pro | Production-grade | Limited | Commercial campaigns with text |
Character consistency and brand mascots
Priority: identity locking + cross-scene variation
Nano Banana Pro preserves up to 5 distinct subject identities simultaneously, making it the strongest option for mascot or character consistency across scenes. Seedream v4.5 supports up to 14 reference images for multi-character compositions. Flux.2 Max accepts up to 10 references.
| Model | Reference images | Consistency | Training needed |
|---|---|---|---|
| Nano Banana Pro | 5 subjects | Excellent | None |
| Seedream v4.5 | Up to 14 | Excellent | None |
| Flux.2 Max | Up to 10 | Strong | None |
| Nano Banana 2 | Up to 14 | Good | None |
Cinematic, atmospheric, and concept art
Priority: lighting depth + mood + stylistic range
Dreamina Image 3.1 is designed for cinematic aesthetics: rich lighting, atmospheric depth, and vibrant color palettes across a style range from photorealism to anime. Midjourney V7 delivers richer textures and improved anatomical accuracy through its completely rebuilt architecture.
Flux Dev offers strong artistic diversity as the most popular open-weight model for creative experimentation.
| Model | Cinematic quality | Style range | Best for |
|---|---|---|---|
| Midjourney V7 | Excellent | Wide | Artistic, textured, atmospheric |
| Dreamina Image 3.1 | Excellent | Photo to anime | Cinematic portraits, stylized art |
| Seedream v4 | Strong | Wide | Concept boards, branded imagery |
| Flux Dev | Strong | Bold / experimental | Open-weight creative exploration |
Marketing, editorial, and commercial campaigns
Priority: photorealism + scalability + consistency
ImagineArt 2.0 is the flagship for high-stakes commercial imagery. Seedream v4.5 handles up to 14 references and generates at 4K in seconds, which makes it ideal for consistent multi-output campaigns. Flux.2 Max with real-time web grounding is best when you need current-subject accuracy.
| Model | Photorealism | Speed | Consistency | Best for |
|---|---|---|---|---|
| ImagineArt 2.0 | Flagship | Fast | Excellent | Hero shots, premium commercial |
| Seedream v4.5 | Excellent | 8–14s | Excellent | Multi-output campaign sets |
| Flux.2 Max | Excellent | 4–10s | Strong | Grounded commercial imagery |
| Nano Banana Pro | High | Fast | Excellent | High-volume consistent campaigns |
Illustration, stylized art, and worldbuilding
Priority: style control + detail + creative range
Qwen Image is ranked #1 on the AI Arena leaderboard for stylized illustration and is exceptional for detailed concept art, scientific illustrations, and complex text-in-image layouts. Dreamina Image 3.1 supports styles from anime to Baroque oil painting. Ideogram v3 has 58 style presets, including Oil Painting, Cyberpunk, and Art Deco.
| Model | Illustration quality | Style presets | Best for |
|---|---|---|---|
| Qwen Image | Excellent | Open-ended | Concept art, scientific illustrations |
| Dreamina Image 3.1 | Excellent | Wide | Anime, fantasy, cinematic stills |
| Ideogram v3 | Strong | 58 presets | Styled graphic design, mood art |
| Flux Dev | Strong | Experimental | Bold creative, LoRA-based styles |
Quick drafts and rapid iteration
Priority: speed + low cost
Nano Banana 2 is the fastest, at ~4× the speed of Nano Banana Pro. Seedream v5 Lite generates in 3–5 seconds with intelligent prompt interpretation. Z Image Turbo runs ~4× faster than FLUX and adds bilingual capability.
A recommended workflow: iterate on Seedream v5 Lite or Nano Banana 2 to validate direction, then move to ImagineArt 1.5 Pro, Seedream v4.5, or ImagineArt 2.0 for the final output.
| Model | Speed | Best for |
|---|---|---|
| Seedream v5 Lite | 3–5 seconds | Exploration, intelligent prompting |
| Nano Banana 2 | ~4× faster than Pro | High-volume, reference-based |
| Z Image Turbo | ~4× faster than FLUX | Open-source, bilingual drafts |
| ImagineArt 1.5 | Fast | Lower-cost photorealistic drafts |
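The draft-then-finalize workflow recommended above can be sketched as a two-stage function. This is a minimal illustration only: `generate`, `pick_best`, and `fake_generate` are hypothetical stand-ins, not part of any ImagineArt API, and the two model choices are just one pairing from the options listed.

```python
from typing import Callable, List

# Model names are taken from this guide; the pairing is one possible choice.
DRAFT_MODEL = "Seedream v5 Lite"
FINAL_MODEL = "ImagineArt 2.0"


def draft_then_finalize(
    prompts: List[str],
    generate: Callable[[str, str], str],
    pick_best: Callable[[List[str]], int],
) -> str:
    """Render every prompt on the fast model, then re-render the winner on the final model.

    `generate(model, prompt)` and `pick_best(drafts)` are hypothetical callables
    supplied by the caller; `pick_best` returns the index of the preferred draft.
    """
    drafts = [generate(DRAFT_MODEL, prompt) for prompt in prompts]
    winner = pick_best(drafts)
    return generate(FINAL_MODEL, prompts[winner])


def fake_generate(model: str, prompt: str) -> str:
    """Stand-in for a real image-generation call; returns a label instead of an image."""
    return f"[{model}] {prompt}"


if __name__ == "__main__":
    variations = ["red sneaker on concrete", "red sneaker on white studio backdrop"]
    # Pretend a reviewer preferred the second draft.
    print(draft_then_finalize(variations, fake_generate, lambda drafts: 1))
```

Passing the generation call in as a parameter keeps the sketch independent of any particular client library: only the final, selected prompt is sent to the slower, higher-cost model.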
High-resolution for print (4MP+)
Priority: pixel count + detail at scale
Seedream v4.5 supports up to 8192×8192, the highest raw resolution available. Flux.2 Max and Flux 1.1 Ultra both generate at 4MP. Recraft v4 Pro delivers 4MP raster output alongside native SVG vector output.
| Model | Max resolution | Key advantage |
|---|---|---|
| Seedream v4.5 | Up to 8192×8192 | Highest raw resolution + 14 refs |
| Flux.2 Max | 4MP | Web grounding + 10 refs |
| Flux 1.1 Ultra | 4MP | Raw mode, architectural |
| Recraft v4 Pro | 4MP | SVG vector + raster at 4MP |
| ImagineArt 1.5 Pro | Native 4K | Composition + text at 4K |
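As a rough guide to what these pixel counts mean on paper, the sketch below converts pixel dimensions into a maximum print size. The 300 DPI default is a common print convention assumed here, not a figure from this guide, and the 2000×2000 shape used for a 4MP image is also an assumption, since 4MP can come in any aspect ratio.

```python
from typing import Tuple


def megapixels(width_px: int, height_px: int) -> float:
    """Total pixel count in millions."""
    return width_px * height_px / 1_000_000


def max_print_inches(width_px: int, height_px: int, dpi: int = 300) -> Tuple[float, float]:
    """Largest print size in inches that still provides `dpi` pixels per inch."""
    return (width_px / dpi, height_px / dpi)


# Seedream v4.5 at its stated maximum of 8192x8192
print(round(megapixels(8192, 8192), 1))           # 67.1
print(round(max_print_inches(8192, 8192)[0], 1))  # 27.3

# A square 4MP image, assumed here to be 2000x2000
print(round(megapixels(2000, 2000), 1))           # 4.0
print(round(max_print_inches(2000, 2000)[0], 1))  # 6.7
```

Under these assumptions, a 4MP image covers roughly a 6.7-inch square at 300 DPI, while 8192×8192 covers about 27 inches per side, which is why the highest raw resolution matters for large-format print.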
Full model comparison
| Model | Resolution | Text | Multi-ref | Speed | Best for |
|---|---|---|---|---|---|
| ImagineArt 2.0 | 2K | Good | — | Fast | Flagship photorealism, commercial hero |
| ImagineArt 1.5 Pro | Native 4K | Strong | — | Fast | Posters, product visuals, professional |
| ImagineArt 1.5 | 2K | Good | — | Fast | Cost-efficient photorealistic drafts |
| Nano Banana 2 | Up to 4K | Good | 14 | Ultra-fast | High-volume, rapid iteration |
| Nano Banana Pro | Up to 4K | Excellent | 14 | Fast | Complex compositions, production |
| Nano Banana | Up to 2K | Strong | 4 | Near real-time | Quick edits, product swaps |
| Seedream v5 Lite | Up to 4K | Good (titles) | 14 | 3–5s | Rapid exploration, intelligent prompting |
| Seedream v4.5 | Native 4K | Excellent (dense) | 4 | 8–14s | Production deliverables, large-format |
| Seedream v4 | Native 4K | Multilingual | 4 | Near real-time | Commercial campaigns |
| Recraft v4 Pro | 4MP | Excellent | — | ~30s | Print design, SVG vector |
| Recraft v4 | View tooltip | Excellent | — | ~10s | Design, branding, SVG |
| ChatGPT Image 2 | Up to 4K | 99%+, multilingual | 1 | Fastest | Multilingual text, reasoning, character consistency |
| ChatGPT Image 1.5 | 1536×1024 | Superior (dense) | 4 | Fast (4× v1) | Infographics, knowledge-grounded |
| ChatGPT Image | 1536×1024 | Best-in-class | 4 | Up to 2 min | Complex knowledge-grounded |
| Midjourney V7 | View tooltip | Good | — | Fast / Draft | Artistic, cinematic, atmospheric |
| xAI Grok Imagine | 1K | Good | 3 | Fast | Real-entity accuracy, diverse styles |
| Flux.2 Max | Up to 4MP | Best-in-class | 4 | 4–10s | Maximum quality, web grounding |
| Flux.2 Pro | View tooltip | Production-grade | 4 | Fast | Commercial campaigns with text |
| Flux 1.1 Ultra | 2K | Good | — | ~10s | High-res commercial, Raw mode |
| Flux Dev | 1K | Decent | — | ~7–18s | Concept art, open-weight creative |
| Z Image Turbo | View tooltip | Excellent (EN+ZH) | — | ~4× faster than FLUX | Rapid generation, bilingual advertising |
| Dreamina Image 3.1 | 2K | EN + ZH | — | Under 20s | Cinematic portraits, stylized art |
| Ideogram v3 | 1K | ~90–95% | — | Flash to Quality | Typography, branding, posters |
| Minimax Image | 1K | Limited | — | Fast | Portraits, product shots |
| Qwen Image | 1K | Excellent (ZH+EN) | — | Fast | Illustrations, bilingual, concept art |
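One way to use the comparison table programmatically is to encode rows as data and filter on the columns that matter for a given job. The sketch below copies a handful of rows from the table; the `ModelRow` type and `shortlist` helper are hypothetical names introduced for illustration, and a missing multi-reference value is treated as zero.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass(frozen=True)
class ModelRow:
    name: str
    max_refs: Optional[int]  # None where the table lists no multi-reference value
    text: str                # value of the "Text" column


# A subset of rows copied from the comparison table above.
ROWS = [
    ModelRow("ImagineArt 2.0", None, "Good"),
    ModelRow("Nano Banana 2", 14, "Good"),
    ModelRow("Nano Banana Pro", 14, "Excellent"),
    ModelRow("Nano Banana", 4, "Strong"),
    ModelRow("Seedream v5 Lite", 14, "Good (titles)"),
    ModelRow("ChatGPT Image 1.5", 4, "Superior (dense)"),
]


def shortlist(rows: List[ModelRow], min_refs: int = 0, text_prefix: str = "") -> List[str]:
    """Return names of models meeting a reference-count floor and a text-rating prefix."""
    return [
        row.name
        for row in rows
        if (row.max_refs or 0) >= min_refs and row.text.startswith(text_prefix)
    ]


print(shortlist(ROWS, min_refs=10))
# ['Nano Banana 2', 'Nano Banana Pro', 'Seedream v5 Lite']
print(shortlist(ROWS, min_refs=10, text_prefix="Excellent"))
# ['Nano Banana Pro']
```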

