Documentation Index
Fetch the complete documentation index at: https://docs.imagine.art/llms.txt
Use this file to discover all available pages before exploring further.

The flagship of FLUX.2
FLUX.2 [max] combines the 32-billion parameter rectified flow transformer architecture with a Mistral-3 24B vision-language model, creating a 46,864-token context window that enables dramatically more nuanced prompt understanding than any previous FLUX model. The result is stronger adherence to complex multi-element prompts, more consistent character and style identity across references, and grounded generation that can visualize real-world trending subjects. FLUX.2 Max is the top tier in the FLUX.2 family — above FLUX.2 Pro — for workflows where maximum quality and prompt fidelity are the priority.
Capabilities
Grounded generation
Real-time web search integration enables accurate visualization of trending products, current events, and recently released subjects without detailed descriptions.
4-megapixel output
Generates up to 4MP (approximately 2752×1536) — print-ready resolution for commercial, editorial, and large-format work.
46,864-token context
Exceptionally large context window enables nuanced, multi-paragraph creative briefs and highly specific compositional instructions.



Specifications
| Feature | Details |
|---|---|
| Architecture | 32B rectified flow transformer + Mistral-3 24B VLM |
| Context window | 46,864 tokens |
| Resolution | Up to 4MP |
| Reference images | Up to 10 |
| Web grounding | Yes (real-time search) |
| Generation time | 8–10s (4MP), 4–6s (1MP) |
| Released | November 25, 2025 |
How to use
Write your prompt
Be as detailed as needed. With 46,864 tokens of context, Flux.2 Max can handle exhaustive creative briefs — describe spatial relationships, lighting, materials, mood, and style precisely.
Add reference images (optional)
Upload up to 10 reference images for character consistency or style anchoring.

Prompting tips
- Maximize the context window — Unlike smaller models, Flux.2 Max won’t drop details from long prompts. Use the space to be precise about lighting, materials, spatial composition, and style.
- Use reference images for consistency — For campaigns or series, passing reference images maintains visual identity far better than text descriptions alone.
- Web grounding works for current subjects — Reference recently launched products or current events by name; grounded generation handles the visual accuracy.

Example prompts
A high-end residential kitchen with Calacatta marble countertops, brushed brass fixtures, integrated appliances, warm pendant lighting over a central island, architectural photography style, shot at dusk.
A futuristic electric sports car (based on [reference image]) in a wind-tunnel test environment, dramatic directional lighting, photorealistic CGI quality.
Compare models
| Model | Parameters | Resolution | References | Best for |
|---|---|---|---|---|
| Flux.2 Max | 32B | Up to 4MP | 10 | Maximum quality, complex campaigns |
| Flux.2 Pro | 32B | Up to 4MP | 10 | Production-grade, text rendering |
| Flux 1.1 Ultra | — | 4MP | Image-to-image | Raw mode, high-res commercial |
| Flux Dev | 12B | ~1MP | — | Open-weight, creative exploration |
Flux.2 Max sits at the top of the FLUX.2 lineup. For production-grade output with strong text rendering where the absolute ceiling on prompt adherence isn’t needed, Flux.2 Pro is a slightly faster and more cost-efficient option.

