> ## Documentation Index
> Fetch the complete documentation index at: https://docs.imagine.art/llms.txt
> Use this file to discover all available pages before exploring further.

# Kling 3 0 pro

<div style={{background: "linear-gradient(135deg, #00080f 0%, #001a3a 55%, #000812 100%)", borderRadius: "20px", padding: "3.5rem 3rem 3rem", marginBottom: "2.5rem", overflow: "hidden", position: "relative"}}>
  <div style={{position: "absolute", inset: "0", background: "radial-gradient(ellipse at 75% 20%, rgba(124,0,251,0.2) 0%, transparent 55%), radial-gradient(ellipse at 15% 80%, rgba(0,100,255,0.14) 0%, transparent 50%)", pointerEvents: "none"}} />

  <div style={{position: "relative"}}>
    <div style={{display: "flex", gap: "0.5rem", marginBottom: "1.5rem", flexWrap: "wrap"}}>
      <span style={{background: "rgba(0,80,200,0.3)", border: "1px solid rgba(0,100,255,0.4)", borderRadius: "100px", padding: "0.3rem 1rem", fontSize: "0.72rem", color: "#7eb8ff", fontWeight: "500", letterSpacing: "0.06em"}}>VIDEO MODEL</span>
      <span style={{background: "rgba(255,255,255,0.06)", border: "1px solid rgba(255,255,255,0.12)", borderRadius: "100px", padding: "0.3rem 1rem", fontSize: "0.72rem", color: "rgba(255,255,255,0.45)", fontWeight: "400"}}>by Kling AI</span>
      <span style={{background: "rgba(255,255,255,0.06)", border: "1px solid rgba(255,255,255,0.12)", borderRadius: "100px", padding: "0.3rem 1rem", fontSize: "0.72rem", color: "rgba(255,255,255,0.45)", fontWeight: "400"}}>Kling 3 family</span>
    </div>

    <h1 style={{fontSize: "clamp(2.5rem, 5vw, 3.75rem)", fontWeight: "700", color: "#ffffff", lineHeight: "1.1", letterSpacing: "-0.025em", margin: "0 0 1.1rem 0"}}>Kling 3.0 Pro</h1>
    <p style={{fontSize: "1.1rem", color: "rgba(255,255,255,0.52)", maxWidth: "580px", lineHeight: "1.7", marginBottom: "2.25rem"}}>Kling AI's most advanced video model — 1080p at 60 FPS, Omni Native Audio with multilingual dialogue and environmental soundscapes, and the ability to generate up to 6 distinct shots in a single 15-second output.</p>

    <div style={{display: "flex", gap: "0.75rem", flexWrap: "wrap"}}>
      <div style={{background: "rgba(255,255,255,0.06)", borderRadius: "14px", padding: "0.875rem 1.5rem", border: "1px solid rgba(255,255,255,0.1)"}}>
        <div style={{fontSize: "0.62rem", color: "rgba(255,255,255,0.32)", textTransform: "uppercase", letterSpacing: "0.1em", marginBottom: "0.3rem"}}>Resolution</div>
        <div style={{fontSize: "1rem", color: "#ffffff", fontWeight: "600"}}>1080p</div>
      </div>

      <div style={{background: "rgba(255,255,255,0.06)", borderRadius: "14px", padding: "0.875rem 1.5rem", border: "1px solid rgba(255,255,255,0.1)"}}>
        <div style={{fontSize: "0.62rem", color: "rgba(255,255,255,0.32)", textTransform: "uppercase", letterSpacing: "0.1em", marginBottom: "0.3rem"}}>Frame rate</div>
        <div style={{fontSize: "1rem", color: "#ffffff", fontWeight: "600"}}>60 FPS</div>
      </div>

      <div style={{background: "rgba(255,255,255,0.06)", borderRadius: "14px", padding: "0.875rem 1.5rem", border: "1px solid rgba(255,255,255,0.1)"}}>
        <div style={{fontSize: "0.62rem", color: "rgba(255,255,255,0.32)", textTransform: "uppercase", letterSpacing: "0.1em", marginBottom: "0.3rem"}}>Duration</div>
        <div style={{fontSize: "1rem", color: "#ffffff", fontWeight: "600"}}>Up to 15 seconds</div>
      </div>

      <div style={{background: "rgba(255,255,255,0.06)", borderRadius: "14px", padding: "0.875rem 1.5rem", border: "1px solid rgba(255,255,255,0.1)"}}>
        <div style={{fontSize: "0.62rem", color: "rgba(255,255,255,0.32)", textTransform: "uppercase", letterSpacing: "0.1em", marginBottom: "0.3rem"}}>Shots per generation</div>
        <div style={{fontSize: "1rem", color: "#ffffff", fontWeight: "600"}}>Up to 6</div>
      </div>
    </div>
  </div>
</div>

## Kling 3.0 Pro

Kling 3.0 Pro marks Kling AI's most significant architectural leap — 1080p output at 60 frames per second with Omni Native Audio and multi-shot storyboarding in a single generation.

The Multi-modal Visual Language (MVL) architecture unifies text, image, video, and audio inputs into a single model, enabling true multi-shot storyboarding — up to 6 distinct shots, each with specified duration, shot size, perspective, narrative, and camera movement, all generated from one prompt.

## Capabilities

<CardGroup cols={3}>
  <Card title="1080p at 60 FPS" icon="expand">
    Generates 1080p video at 60 frames per second — smooth, high frame-rate output for cinematic and action-heavy content.
  </Card>

  <Card title="Omni Native Audio" icon="music">
    Multilingual audio generation including English, Japanese, Korean, Spanish, and environmental soundscapes — generated natively alongside the video.
  </Card>

  <Card title="Multi-shot storyboarding" icon="clapperboard">
    Specify up to 6 shots in a single 15-second generation — each with its own duration, shot size, perspective, camera movement, and narrative.
  </Card>

  <Card title="MVL architecture" icon="microchip">
    Multi-modal Visual Language architecture natively processes text, images, video, and audio as unified inputs for coherent multimodal output.
  </Card>

  <Card title="Up to 10 reference images" icon="images">
    Accepts up to 10 reference images for subject appearance, style, and composition anchoring across a multi-shot sequence.
  </Card>

  <Card title="Complex action accuracy" icon="person-running">
    Handles fast, intricate physical actions — martial arts, dance, sports — with consistent body mechanics and no ghosting artifacts.
  </Card>
</CardGroup>

## Specifications

| Feature                  | Details                                   |
| ------------------------ | ----------------------------------------- |
| **Developer**            | Kling AI (Kuaishou)                       |
| **Base credits**         | 300                                       |
| **Resolution**           | 1080p                                     |
| **Frame rate**           | 60 FPS                                    |
| **Duration**             | Up to 15 seconds                          |
| **Shots per generation** | Up to 6                                   |
| **Audio**                | Omni Native Audio — dialogue, SFX, music  |
| **Languages**            | English, Japanese, Korean, Spanish + more |
| **Max reference images** | 10                                        |
| **Architecture**         | Multi-modal Visual Language (MVL)         |

## How to use

<Steps>
  <Step title="Open the AI Video Generator">
    Log into ImagineArt and go to the **AI Video Generator**.
  </Step>

  <Step title="Select Kling 3.0 Pro">
    Choose **Kling 3.0 Pro** from the model dropdown.
  </Step>

  <Step title="Structure your prompt for multi-shot">
    For multi-shot output, describe each shot with explicit transitions: "SHOT 1 (3s, wide, establishing): ... SHOT 2 (2s, close-up): ..." Kling 3.0 Pro interprets these cues to generate distinct cinematographic cuts.
  </Step>

  <Step title="Add reference images (optional)">
    Upload up to 10 reference images for character appearance, environment style, or composition guidance.
  </Step>

  <Step title="Include audio direction">
    Describe the audio landscape — dialogue lines, ambient environment, music style — within the prompt for Omni Native Audio.
  </Step>

  <Step title="Generate">
    Click **Generate**. Kling 3.0 Pro produces a 1080p, 60 FPS output with synchronized audio.
  </Step>
</Steps>

## Prompting tips

* **Structure shots explicitly** — "SHOT 1: wide establishing exterior, 3 seconds, slow pan right. SHOT 2: medium close-up on protagonist, 2 seconds, static camera." Kling 3.0 Pro follows cinematographic structure in prompts.
* **Specify language for dialogue** — If your scene requires characters speaking a specific language, state it clearly: "The character speaks in Japanese with a formal tone."
* **Reference images anchor identity** — For character consistency across shots, upload a reference image and describe the character consistently in each shot description.
* **Use technical camera terms** — "Shallow depth of field," "Dutch angle," "rack focus," and "tracking shot" all meaningfully influence the cinematic output.

### Example prompts

> SHOT 1 (4s, wide, cinematic): A samurai stands at the edge of a misty forest at dawn. Slow pan left, revealing a village in the distance. Traditional Japanese ambient sounds. SHOT 2 (3s, close-up): The samurai's hand grips a sword hilt. Rain begins to fall. SHOT 3 (3s, medium): The samurai turns and walks into the mist.

> A professional basketball player dribbles through defenders and dunks. Wide angle, 60 FPS, 5 seconds. Arena crowd roaring in the background, sneakers squeaking on hardwood.

## Compare models

| Model                                           | Resolution | FPS | Audio       | Shots   | Best for                             |
| ----------------------------------------------- | ---------- | --- | ----------- | ------- | ------------------------------------ |
| **Kling 3.0 Pro**                               | 1080p      | 60  | Omni Native | Up to 6 | Multi-shot storytelling, 60 FPS      |
| [Kling O3](/ai-models/video/kling-o3)           | 4K         | 60  | Yes         | Up to 6 | Advanced physics, 6 generation modes |
| [Kling 2.6 Pro](/ai-models/video/kling-2-6-pro) | 1080p      | 48  | Lip-sync    | —       | Audio-synced content, fast motion    |
| [Kling 2.5 Pro](/ai-models/video/kling-2-5-pro) | 1080p      | —   | No          | —       | Cost-efficient HD production         |

<Tip>
  Kling 3.0 Pro is the right choice when you need structured multi-shot storytelling at 60 FPS with native audio in a single generation. For 4K output, compare with [Kling O3](/ai-models/video/kling-o3).
</Tip>
