🎬 AI Cinema Director Skill
A specialized skill for AI Agents to direct high-fidelity cinematic video.
The Cinema Director skill translates high-level creative intent into technical cinematographic directives for state-of-the-art video models (Veo3, Kling, Luma).
Core Competencies
- 1. Shot Composition Analysis: Mapping emotional beats to appropriate framing (e.g., Extreme Close-Up for intimacy, Wide Shot for isolation).
- Camera Movement Orchestration: Directing complex physical movements (Dolly, Truck, Crane) and lens-based effects (Rack Focus, Dolly Zoom).
- Lighting & Atmosphere Design: Specifying temporal and stylistic lighting (Golden Hour, Chiaroscuro, Volumetric God Rays).
- Technical Parameter Optimization: Automatically selecting optimal frame rates, aspect ratios, and model-specific biases.
🏗️ Technical Specification
1. Intent Mapping Table
| Creative Intent | Framing | Movement | Lighting |
|---|
| Heroic Reveal | Low Angle / Wide | Crane Up / Orbit | Rim Lighting / High Contrast |
| Tense/Uneasy |
Dutch Angle | Handheld Shake | Low Key / Harsh Shadows |
|
Introspective | Close-Up | Slow Push In | Soft Rembrandt / Window Light |
|
Majestic/Epic | Extreme Wide | Drone Flyover | Golden Hour / Volumetric |
|
Melancholic | Profile / Medium | Slow Pull Out | Blue Hour / Desaturated |
2. Physical Camera Movements
- -
Dolly In/Out: Physical camera movement on a track toward/away from the subject. - INLINECODE1 : Lateral physical movement.
- INLINECODE2 : Sweeping vertical movement from a height.
- INLINECODE3 : Circular movement around a center point.
- INLINECODE4 : Vertical elevation change (without tilting).
3. Lens & Optical Controls
- -
Shallow DOF: Background blur (Bokeh). - INLINECODE6 : Horizontal flares and wide-screen cinematic feel.
- INLINECODE7 : Shifting focus between planes within the shot.
🧠 Prompt Optimization Protocol (Agent Instruction)
Before calling the script, the Agent MUST expand the user's raw prompt using these Director's Rules:
- 1. TECHNICAL INFUSION: Transform "cool action" into a technical shot.
-
Formula:
[Shot Type] + [Subject/Action] + [Environment] + [Lighting] + [Camera Movement] + [Lens Effect]
- 2. MOTION DYNAMICS: Use cinematic verbs: Dolly In (intimacy), Crane Up (majestic), Orbit (heroic), Truck (parallel motion).
- LIGHTING RECIPES: Apply specific illumination: Volumetric God Rays, Teal-and-Orange Grade, Cyberpunk Rim Lighting, Rembrandt Portrait Lighting.
- PHYSICS LOGIC: Describe light relationships (e.g., "Neon reflections shimmering on rain-slicked asphalt") to trigger model reasoning.
🚀 Protocol: Using the Cinema Director
Step 1: Define the Creative Brief
Provide the agent with a subject and a "Director's Intent."
Step 2: Invoke the Script
The
generate-film.sh script accepts a
--brief which it expands using its internal knowledge of cinematography.
CODEBLOCK0
Step 3: Handle the Async Response
Video generation is asynchronous. Use the returned
request_id to poll for completion via
core/platform/check-result.sh.
⚠️ Constraints & Guardrails
- - Temporal Consistency: Avoid complex subject transformations in a single shot (e.g., "man turns into a bird").
- Movement Collisions: Do not combine contradictory movements (e.g., "Dolly In" and "Dolly Out" simultaneously).
- Physical Realism: Prefer movements possible with real-world equipment for a more professional "film" look.
- Model Bias:
-
Veo3: Best for slow, high-quality aesthetic shots.
-
Kling: Best for complex character motion and physics.
-
Luma: Best for fast-paced, high-action cinematic sequences.
⚙️ Implementation Details
This skill acts as an "Expert Translator" for the
core/media/generate-video.sh primitive. It maintains a dictionary of cinematic styles and injects technical directives into the prompt before execution.
🎬 AI电影导演技能
一项专门为AI智能体设计的高保真电影级视频导演技能。
电影导演技能将高层次的创意意图转化为尖端视频模型(Veo3、Kling、Luma)的技术性电影摄影指令。
核心能力
- 1. 镜头构图分析:将情感节拍映射到适当的取景方式(例如,极端特写表现亲密感,广角镜头表现孤独感)。
- 摄影机运动编排:指导复杂的物理运动(推轨、横移、摇臂)和基于镜头的效果(焦点转换、推拉变焦)。
- 灯光与氛围设计:指定时间性和风格化的照明(黄金时刻、明暗对比、体积光柱)。
- 技术参数优化:自动选择最佳帧率、宽高比和模型特定偏好。
🏗️ 技术规范
1. 意图映射表
| 创意意图 | 取景方式 | 运动方式 | 灯光效果 |
|---|
| 英雄登场 | 低角度/广角 | 摇臂上升/环绕 | 轮廓光/高对比度 |
| 紧张/不安 |
荷兰角 | 手持抖动 | 低调光/硬阴影 |
|
内省沉思 | 特写 | 缓慢推进 | 柔和伦勃朗光/窗光 |
|
壮丽/史诗 | 极端广角 | 无人机航拍 | 黄金时刻/体积光 |
|
忧郁伤感 | 侧面/中景 | 缓慢拉出 | 蓝色时刻/低饱和度 |
2. 物理摄影机运动
- - 推轨前进/后退:摄影机在轨道上向被摄主体靠近或远离的物理运动。
- 横移左/右:横向的物理运动。
- 摇臂/吊臂:从高处进行的扫掠式垂直运动。
- 环绕:围绕中心点的圆周运动。
- 升降:垂直高度变化(不倾斜)。
3. 镜头与光学控制
- - 浅景深:背景虚化(散景)。
- 变形宽银幕:水平光晕和宽银幕电影感。
- 焦点转换:在镜头内不同平面之间切换焦点。
🧠 提示词优化协议(智能体指令)
在调用脚本之前,智能体必须使用以下导演规则扩展用户的原始提示词:
- 1. 技术注入:将酷炫动作转化为技术性镜头。
-
公式:[镜头类型] + [主体/动作] + [环境] + [灯光] + [摄影机运动] + [镜头效果]
- 2. 运动动态:使用电影化动词:推轨前进(亲密感)、摇臂上升(壮丽感)、环绕(英雄感)、横移(平行运动)。
- 灯光配方:应用特定照明:体积光柱、青橙色调、赛博朋克轮廓光、伦勃朗肖像光。
- 物理逻辑:描述光线关系(例如,霓虹灯反射在雨水浸湿的柏油路上闪烁)以触发模型推理。
🚀 协议:使用电影导演
第一步:定义创意简报
向智能体提供主体和导演意图。
第二步:调用脚本
generate-film.sh 脚本接受一个 --brief 参数,该参数利用其内部的电影摄影知识进行扩展。
bash
导演一个场景
bash scripts/generate-film.sh
--subject 暴风雪中的孤独武士
--intent 史诗登场
--model kling-master
第三步:处理异步响应
视频生成是异步的。使用返回的 request_id 通过 core/platform/check-result.sh 轮询完成状态。
⚠️ 约束与护栏
- - 时间一致性:避免在单个镜头中出现复杂的主体变换(例如,人变成鸟)。
- 运动冲突:不要组合矛盾的运动(例如,同时推轨前进和推轨后退)。
- 物理真实感:优先使用现实设备可能实现的运动,以获得更专业的电影效果。
- 模型偏好:
- Veo3:最适合缓慢、高质量的美学镜头。
- Kling:最适合复杂的人物动作和物理效果。
- Luma:最适合快节奏、高动作的电影序列。
⚙️ 实现细节
该技能作为 core/media/generate-video.sh 原语的专家翻译器。它维护一个电影风格词典,并在执行前将技术指令注入到提示词中。