Skip to main content

Documentation Index

Fetch the complete documentation index at: https://docs.imagine.art/llms.txt

Use this file to discover all available pages before exploring further.

VIDEO MODELby Kling AIKling O series

Kling O3

Kling AI’s most advanced video model — 1080p at 60 FPS, advanced physics engine simulating gravity, collision, and fluid dynamics, 6 generation modes, and up to 6 distinct shots in a single generation.

Resolution
1080p
Frame rate
60 FPS
Generation modes
6 modes
Physics
Advanced engine

Six modes, one model

Kling O3 is Kling AI’s most capable unified model — designed to handle every stage of a production workflow in a single system. Six distinct generation modes (text-to-video, image-to-video, video-to-video, frames-to-video, motion control, and reference-to-video) eliminate the need to switch between models for different tasks. The advanced physics engine is the headline capability that separates Kling O3 from earlier models: gravity, balance, deformation, collision, and inertia are all simulated accurately. Characters fall with real weight, water flows with physical plausibility, and rigid objects interact with correct momentum.

Capabilities

Advanced physics engine

Simulates gravity, balance, deformation, collision, and inertia — objects fall, splash, and interact with physical accuracy rarely seen in generative video.

1080p at 60 FPS

1080p resolution at 60 frames per second — high-quality output without upscaling.

6 generation modes

Text-to-video, image-to-video, video-to-video, frames-to-video, motion control, and reference-to-video — a complete production toolkit in one model.

Up to 6 shots per generation

Generate up to 6 distinct cinematographic shots within a single 15-second output — structured multi-shot storytelling from one prompt.

10+ reference images

Accepts more than 10 reference images for character appearance, style anchoring, and multi-subject scene construction.

Generation modes

ModeDescription
Text-to-videoGenerate video directly from a text prompt
Image-to-videoAnimate a reference image with described motion
Video-to-videoRestyle or transform an existing video
Frames-to-videoSpecify start, middle, and/or end frames
Motion controlApply specific camera trajectories and subject movements
Reference-to-videoAnchor generation to reference subjects and styles

Specifications

FeatureDetails
DeveloperKling AI (Kuaishou)
Resolution1080p
Frame rate60 FPS
DurationUp to 15 seconds
Shots per generationUp to 6
Generation modes6
AudioNo native audio
Max reference images10+

How to use

1

Open the AI Video Generator

Log into ImagineArt and go to the AI Video Generator.
2

Select Kling O3

Choose Kling O3 from the model dropdown.
3

Choose your generation mode

Select the mode that fits your workflow — text-to-video, image-to-video, video-to-video, frames-to-video, motion control, or reference-to-video.
4

Structure multi-shot prompts

For multi-shot output, describe each shot with explicit shot size, camera movement, duration, and scene content.
5

Add references (optional)

Upload reference images, video clips, or motion references to anchor characters, styles, or movement patterns.
6

Generate

Click Generate for 1080p, 60 FPS output.

Prompting tips

  • Leverage physics descriptions — “A glass falls off a table and shatters, water splashing realistically” will be rendered with physically accurate simulation. Be explicit about the physical behavior you want.
  • Use mode strategically — For restyling existing footage, use video-to-video. For maximum control over a sequence, use frames-to-video with defined start and end frames.
  • Multi-shot structure — “SHOT 1 (wide, 3s): … SHOT 2 (close-up, 2s): …” is interpreted as discrete cinematographic cuts by Kling O3.
  • Use references for character identity — Upload multiple reference angles of a character to improve identity consistency across shots.

Example prompts

A professional boxer trains in a gym. SHOT 1 (4s, wide): Boxer shadowboxes in the ring, motion blur on fast punches. SHOT 2 (3s, close-up): Sweat flies off the boxer’s face as they throw a right hook. SHOT 3 (3s, medium): The boxer catches their breath, hands on the rope. 1080p, 60 FPS.
A raindrop falls onto a still pond surface. Extreme close-up macro shot. Water ripples expand outward with physically accurate fluid dynamics. Slow motion, 60 FPS, 5 seconds.

Compare models

ModelResolutionFPSPhysicsModesBest for
Kling O34K60Advanced6Max capability, physics-heavy
Kling 3.0 Pro4K60StandardText + ImageCinematic 4K, multi-shot
Kling O11080pStandardUnifiedEditing + generation workflows
Kling 2.6 Pro1080p48StandardText + ImageAudio-synced, fast motion
Kling O3 is the right choice when physical realism — fluid dynamics, object collisions, realistic falls — is central to your scene. For 4K cinematic output without the physics complexity, Kling 3.0 Pro covers most professional needs.