Art process video generation
Time-compressed journey from pencil sketch to final piece—lines grow denser, color goes from none to full, detail stacks layer by layer, the last stroke lands, and the finished work is revealed. The process is the content; no on-screen copy needed. Fits illustrators, designers, and art accounts.
Dependencies: scripts/video_gen.js in this directory + WERYAI_API_KEY in the environment + Node.js 18+. No dependency on other Cursor skills.
Prerequisites
- -
WERYAI_API_KEY must be set in the environment before running video_gen.js. - Node.js 18+ is required. Image inputs must be public
https URLs (no local file paths). - Each successful
wait run consumes WeryAI credits; re-running creates new paid tasks.
Security, secrets, and API hosts
- -
WERYAI_API_KEY: Treat as a secret. Only configure it if you trust this skill's source; it is listed in OpenClaw metadata as requires.env / primaryEnv so installers know it is mandatory at runtime (never commit it inside the skill package). - Optional URL overrides (
WERYAI_BASE_URL, WERYAI_MODELS_BASE_URL): video_gen.js defaults to https://api.weryai.com and https://api-growth-agent.weryai.com. Overrides are intended for testing or approved alternate endpoints. If these variables are set in your environment, confirm they point to hosts you trust—otherwise prompts, images, and your bearer token could be sent elsewhere. - Higher assurance: Run generation in a short-lived or isolated environment (separate account or container), and review
scripts/video_gen.js (HTTPS submit + poll loop) before production use.
Prompt expansion (mandatory)
INLINECODE15 does not expand prompts. Before every wait --json, turn the user's short or vague brief into a full English production prompt.
When: The user gives only keywords, one line, or loose intent—or asks for richer video language. Exception: They paste a finished long prompt within the model's prompt_length_limit and ask you not to rewrite; still show the full text in the confirmation table.
Always add (video language): shot scale and angle; camera move or lock-off; light quality and motivation; subject action paced to duration; one clear payoff for this niche; state 9:16 vertical when this skill defaults to vertical.
Length: Obey prompt_length_limit for the chosen model_key when this doc lists it; trim filler adjectives before removing core action, lens, or light clauses.
Confirmation: The pre-submit table must include the full expanded prompt (never a one-line summary). Wait for confirm or edits.
Niche checklist
- - Process arc: sketch → base color → detail → final frame much richer than the opening blank or rough lines.
- Medium & subject: name tablet/paper/watercolor and what is being painted or designed; use time-lapse / speed-ramp language when
duration is short. - Camera & light: top-down, eye-level screen, or macro brush; motivated desk/canvas/monitor light; show stroke motion and rising saturation.
### Example prompts at the top of this file are short triggers only—always expand from the user's actual request.
Workflow
- 1. Confirm the user request matches this skill's scenario (text-to-video and/or image-to-video as documented).
- Collect the user's brief, optional image URL(s), tier (best / good / fast) or an explicit
model key. - Expand prompt (mandatory): Unless the user supplied a finished long prompt and explicitly asked not to rewrite it, expand the brief into a full English production
prompt using ## Prompt expansion (mandatory) below. Do not call the API with only the user's minimal words. - Check the expanded
prompt against the selected model's prompt_length_limit in the frozen tables in this document (when present); shorten if needed. - Verify
duration, aspect_ratio, resolution, generate_audio, negative_prompt, and other fields against the frozen tables and API notes in this SKILL.md. - Show the pre-submit parameter table including the full expanded
prompt; wait for confirm or edits. - After confirmation, run
node {baseDir}/scripts/video_gen.js wait --json '...' with the expanded prompt. - Parse stdout JSON and return video URLs; on failure, surface
errorCode / errorMessage and suggest parameter fixes.
CLI reference
CODEBLOCK0
Definition of done
Done when the user receives at least one playable video URL from the API response, or a clear failure explanation with next steps. All parameters used must fall within the selected model's allowed sets in this document. The submitted prompt must be the expanded production prompt unless the user explicitly supplied a finished long prompt and asked not to rewrite it.
Boundaries (out of scope)
- - Does not review platform compliance, copyright, or portrait rights; does not guarantee commercial usability of outputs.
- Does not provide non-WeryAI offline rendering, traditional edit timelines, or API field combinations not documented here.
- Does not hard-code absolute paths in the skill doc;
{baseDir} means the skill package root (same level as SKILL.md).
Example prompts
- - INLINECODE42
- INLINECODE43
- INLINECODE44
- INLINECODE45
Default parameters
| Field | Value |
|---|
| Model | KLINGV30_PRO |
| Aspect ratio |
9:16 (fixed) |
| Duration | 10 seconds (
duration: 10—enough time to show layered progression) |
| Audio | Off |
| Visual style | Top-down or eye-level close-up; canvas or tablet in frame; clear brush motion; color fills progressively; time-compressed timeline |
API validity (default KLING_V3_0_PRO): Text-to-video duration only 5 / 10 / 15; aspect_ratio only 9:16, 1:1, 16:9. Image-to-video aspect_ratio only 9:16, 16:9, 1:1. No resolution field—do not send it. For fast tier with VEO: text-to-video VEO_3_1_FAST, image-to-video CHATBOT_VEO_3_1_FAST, with duration fixed at 8, aspect_ratio only 9:16 or 16:9. When switching model_key, follow the allowed sets in this section’s model/API constraints and the API validity note above; do not send resolution to models that do not support it.
Full creative process: sketch to finish
Purpose: Show one illustration or design from blank to done. Pace is time-lapse compression: stroke density rises with progress, saturation climbs from low to high, and the final frame should feel dramatically richer than the starting blank.
The user should provide:
- - Subject (bear / cat / bunny / or a theme like “forest spirit,” “cyber girl”)
- Style (dreamy watercolor / line-art B&W / thick oil-like paint / flat design / cyberpunk illustration)
- Medium (paper + pencil / tablet + stylus / watercolor paper / digital screen)
Generation flow:
- 1. Confirm subject, style, and medium.
- Build the prompt in three phases: light sketch lines → base color blocking → detail polish and finish, with clear visual beats per phase.
- If parameters are unspecified, show defaults and wait for confirmation:
You must show all parameters in a table and wait for explicit user confirmation before submitting:
> 📋 Ready to generate—please confirm:
>
> | Parameter | This run | Notes |
> |-----------|----------|-------|
> | model | KLING_V3_0_PRO | Best tier default; say “cheap / draft / fast” → WAN_2_6; say “balanced” → KLING_V3_0_STA; or name a model directly |
> | aspect_ratio | 9:16 | Default KLING: 9:16, 1:1, 16:9 only; if you change model, check that model’s aspect_ratios in the table |
> | duration | 10s | KLING family: 5 / 10 / 15; VEO fast: duration 8 only |
> | generate_audio | false | Whether to auto-generate audio |
> | prompt | Full expanded English prompt (entire text for this run) | Revise before confirm |
> | Loop seam | No | Reply “loop” to enable seamless loop |
>
> Reply “confirm” to start, or list what to change.
- 4. After confirmation, run in the terminal (
{baseDir} is the skill root):
CODEBLOCK1
INLINECODE72 , duration, generate_audio, and model must match the table; add resolution only if the model supports it. Parse stdout videos for URLs.
Parameter configuration:
| Field | Value |
|---|
| model | KLINGV30PRO |
| aspectratio |
9:16 |
| duration | 10 |
| generate_audio | false |
Expanded prompt: Compose at generation time per ## Prompt expansion (mandatory) from the user's actual brief—do not reuse fixed sample paragraphs.
Expected outcome: Clear three-beat rhythm (blocking color, detail pass, final reveal), rising visual density from sparse to rich, final frame clearly stronger than the start—good for process content on art accounts.
Single “brushstroke” hero moment
Focus on one stunning beat: one highlight dot makes the eye “wake up,” a gradient snaps into place, one line closes the piece—better for punchy shorts than a full process.
Say which “one stroke” moment to show; build the prompt directly:
Parameters are shown before generation; wait for confirmation before submit.
Expanded prompt: Compose at generation time per ## Prompt expansion (mandatory) from the user's actual brief—do not reuse fixed sample paragraphs.
Prompt tips
Process rhythm: layer by layer, stage by stage with visible progress, time-lapse with speed ramp at details phase, INLINECODE83
Finish impact: final reveal is dramatically more complex than the starting blank, the completed piece has gallery-quality depth, INLINECODE86
Brush feel: stylus catches tablet light, pencil line weight variation visible, watercolor bleeds naturally at paper texture, INLINECODE90
Note: Ten seconds is already tight for a full process; say time-lapse compression in the prompt so the model treats it as accelerated display, not real-time drawing speed.
艺术创作过程视频生成
从铅笔草图到最终作品的压缩时间旅程——线条逐渐密集,色彩从无到有,细节层层叠加,最后一笔落下,完成的作品呈现。过程就是内容;无需屏幕文字说明。适合插画师、设计师和艺术类账号。
依赖项:此目录中的 scripts/videogen.js + 环境变量中的 WERYAIAPI_KEY + Node.js 18+。不依赖其他 Cursor 技能。
前置条件
- - 运行 videogen.js 前,必须在环境中设置 WERYAIAPI_KEY。
- 需要 Node.js 18+。图片输入必须是公开的 https URL(不支持本地文件路径)。
- 每次成功的 wait 运行都会消耗 WeryAI 积分;重新运行会创建新的付费任务。
安全性、密钥和 API 主机
- - WERYAIAPIKEY:视为机密信息。仅在你信任此技能来源时进行配置;它在 OpenClaw 元数据中被列为 requires.env / primaryEnv,因此安装程序知道它在运行时是必需的(切勿将其提交到技能包内)。
- 可选的 URL 覆盖(WERYAIBASEURL、WERYAIMODELSBASEURL):videogen.js 默认使用 https://api.weryai.com 和 https://api-growth-agent.weryai.com。覆盖用于测试或经批准的备用端点。如果在环境中设置了这些变量,请确认它们指向您信任的主机——否则提示词、图片和您的 bearer token 可能被发送到其他地方。
- 更高保障:在短期或隔离环境(独立账户或容器)中运行生成,并在生产使用前审查 scripts/video_gen.js(HTTPS 提交 + 轮询循环)。
提示词扩展(强制要求)
video_gen.js 不扩展提示词。在每次 wait --json 之前,将用户的简短或模糊需求转化为完整的英文制作 prompt。
何时扩展: 用户只给出关键词、一行文字或模糊意图——或要求更丰富的视频语言。例外: 用户粘贴了符合模型 promptlengthlimit 的完整长提示词,并要求您不要重写;但仍需在确认表格中显示完整文本。
始终添加(视频语言): 镜头景别和角度;相机运动或固定;光线质量和动机;与 duration 匹配的主体动作节奏;一个明确的亮点适合此细分领域;当此技能默认为竖屏时,注明 9:16 竖屏。
长度: 当本文档列出 promptlengthlimit 时,遵守所选 model_key 的限制;在删除核心动作、镜头或光线从句之前,先修剪填充性形容词。
确认: 提交前表格必须包含完整的扩展 prompt(绝不能是一行摘要)。等待确认或修改。
细分领域检查清单
- - 过程弧线: 草图 → 基础色 → 细节 → 最终画面比开头的空白或粗略线条丰富得多。
- 媒介与主体: 注明数位板/纸张/水彩以及正在绘制或设计的内容;当 duration 较短时,使用延时摄影/速度变速语言。
- 相机与光线: 俯拍、平视屏幕或微距画笔;有动机的桌面/画布/显示器光线;展示笔触运动和饱和度上升。
### 示例提示词 位于此文件顶部,仅为简短触发词——始终根据用户的实际请求进行扩展。
工作流程
- 1. 确认用户请求符合此技能的场景(文本转视频和/或图片转视频,如文档所述)。
- 收集用户的需求、可选的图片 URL、层级(最佳 / 良好 / 快速)或明确的 model 键。
- 扩展提示词(强制要求): 除非用户提供了完整的长提示词并明确要求不要重写,否则使用下方的 ## 提示词扩展(强制要求) 将需求扩展为完整的英文制作 prompt。不要仅用用户的最小化词语调用 API。
- 对照本文档固定表格中选定模型的 promptlengthlimit(如有)检查扩展后的 prompt;必要时缩短。
- 对照此 SKILL.md 中的固定表格和 API 说明,验证 duration、aspectratio、resolution、generateaudio、negativeprompt 等字段。
- 显示提交前参数表格,包括完整的扩展 prompt;等待确认或修改。
- 确认后,使用扩展后的提示词运行 node {baseDir}/scripts/videogen.js wait --json ...。
- 解析 stdout JSON 并返回视频 URL;失败时,显示 errorCode / errorMessage 并建议参数修复。
CLI 参考
sh
node {baseDir}/scripts/videogen.js wait --json {model:…,prompt:…,duration:5,aspectratio:9:16}
node {baseDir}/scripts/video_gen.js wait --json … --dry-run
node {baseDir}/scripts/video_gen.js status --task-id
完成标准
当用户从 API 响应中收到至少一个可播放的视频 URL,或收到明确的失败说明及后续步骤时,即视为完成。使用的所有参数必须在本文档中选定模型的允许范围内。提交的 prompt 必须是扩展后的制作提示词,除非用户明确提供了完整的长提示词并要求不要重写。
边界(超出范围)
- - 不审查平台合规性、版权或肖像权;不保证输出的商业可用性。
- 不提供非 WeryAI 的离线渲染、传统编辑时间线或本文档未记录的 API 字段组合。
- 不在技能文档中硬编码绝对路径;{baseDir} 表示技能包根目录(与 SKILL.md 同级)。
示例提示词
- - 在数位板上从铅笔草图到全彩的角色插画,竖屏加速过程
- 使用此线稿图像制作逐步添加色彩、光影直到最终作品的动画
- 从网格到最终标识的海报字体设计过程,艺术账号风格
- 数字绘画过程 9:16,图层逐步构建,令人满足的笔触
默认参数
9:16(固定) |
| 时长 | 10 秒(duration:10——足够展示分层进展) |
| 音频 | 关闭 |
| 视觉风格 | 俯拍或平视特写;画布或数位板在画面中;清晰的笔触运动;色彩逐步填充;时间压缩 |
API 有效性(默认 KLINGV30PRO): 文本转视频 duration 仅 5 / 10 / 15;aspectratio 仅 9:16, 1:1, 16:9。图片转视频 aspectratio 仅 9:16, 16:9, 1:1。无 resolution 字段——请勿发送。 对于使用 VEO 的快速层级:文本转视频 VEO31FAST,图片转视频 CHATBOTVEO31FAST,duration 固定为 8,aspectratio 仅 9:16 或 16:9。切换 modelkey 时,遵循本节模型/API 约束中的允许集合及上述 API 有效性说明;不要向不支持 resolution 的模型发送该字段。
完整创作过程:从草图到完成
目的: 展示从空白到完成的插画或设计。节奏为延时压缩:笔触密度随进展增加,饱和度从低到高,最终画面应比开头空白丰富得多。
用户应提供:
- - 主体(熊 / 猫 / 兔子 / 或主题如森林精灵、赛博女孩)
- 风格(梦幻水彩 / 线稿黑白 / 厚油画质感 / 扁平设计 / 赛博朋克插画)
- 媒介(纸张 + 铅笔 / 数位板 + 触控笔 / 水彩纸 / 数字屏幕)
生成流程:
- 1. 确认主体、风格和媒介。
- 分三个阶段构建提示词:轻草图线条 → 基础色块 → 细节润色和完成,每阶段有清晰的视觉节拍。
- 如果参数未指定,显示默认值并等待确认:
您必须以表格形式显示所有参数,并在提交前等待用户明确确认:
> 📋