generate-image

Constraints

- The model's final user-facing response should match the user's input language, default English.
Must request user confirmation before triggering any paid/high-cost image generation call.
After confirmed, must call CreatOK Open Skills proxy and wait until completion.
Avoid technical wording in the user-facing reply unless the user explicitly needs details for debugging.

Model Selection Rules

- INLINECODE0

- actual model id: seedream-5.0-lite - faster and lighter, good for quick iteration - resolutions: 2K, 4K only

- INLINECODE2

- actual model id: nano-banana-pro - highest quality, best for photorealistic portraits and product shots - resolutions: 1K, 2K, 4K

- INLINECODE4

- actual model id: nano-banana-2 - latest Nano Banana, best overall quality - resolutions: 1K, 2K, 4K

The model should recommend a model before generation based on the use case:

- portraits / photorealistic people → INLINECODE6
product shots / e-commerce → INLINECODE7
general illustration / concept art → INLINECODE8
quick preview / iteration → INLINECODE9
user explicitly wants 1K → nano-banana-pro or nano-banana-2 (Seedream does not support 1K)

Inputs to clarify (ask if missing)

- ask only for what is necessary to generate a good image
if resolution or aspect ratio is not specified, use sensible defaults (2K, square)
if the prompt is vague, offer to refine it before confirming generation
reference images are optional — ask only if the user implies style transfer or subject reference
when reference images are used, upload the local image file first and submit the returned uploaded reference with the generation task

Workflow

1. Confirmation gate (mandatory)

- Summarize:

- model - resolution - number of images (n) - aspect ratio if specified - estimated cost/credits if available

- Ask for a simple confirmation before submitting.
Do not submit the generation task until user says yes.

2. Submit image generation

- Call CreatOK: INLINECODE13

3. Poll status until completion

- Call CreatOK: INLINECODE14

4. Persist artifacts + respond

- Write:

- outputs/result.json with task_id/status/images/raw - outputs/result.md

- Persist the task_id immediately after submission so the user can recover later.
Return the final image URLs verbatim.

Existing Task Recovery

- If the user already has a task_id, continue from that task instead of starting a new one.
In recovery mode, do not ask the user to restate the prompt if the task id is already available.
The model can either check status once or keep polling if the user wants to wait.
If the task succeeded, return the final image URLs verbatim.
If the task is still queued or running, explain clearly and offer to keep checking.
If the task failed, explain the failure and suggest next steps.

Artifacts

All artifacts under generate-image/.artifacts/<run_id>/....

Thin Client Boundary

- This skill submits generation jobs, polls status, and persists fixed-format outputs.
The model should not make the user restate their idea if the direction is already clear from the conversation.

生成图像

约束条件

- 模型最终面向用户的回复应与用户输入语言一致，默认为英语。
在触发任何付费/高成本图像生成调用前，必须请求用户确认。
确认后，必须调用CreatOK开放技能代理并等待执行完成。
在面向用户的回复中避免使用技术性措辞，除非用户明确需要调试细节。

模型选择规则

- Seedream 5.0 Lite

- 实际模型ID：seedream-5.0-lite - 速度更快、更轻量，适合快速迭代 - 分辨率：仅支持2K、4K

- Nano Banana Pro

- 实际模型ID：nano-banana-pro - 质量最高，最适合逼真人像和产品拍摄 - 分辨率：1K、2K、4K

- Nano Banana 2

- 实际模型ID：nano-banana-2 - 最新版Nano Banana，综合质量最佳 - 分辨率：1K、2K、4K

模型应根据使用场景在生成前推荐模型：

- 人像/逼真人物 → nano-banana-2
产品拍摄/电商 → nano-banana-pro
通用插画/概念艺术 → nano-banana-2
快速预览/迭代 → seedream-5.0-lite
用户明确要求1K分辨率 → nano-banana-pro或nano-banana-2（Seedream不支持1K）

需澄清的输入项（缺失时询问）

- 仅询问生成优质图像所必需的信息
若未指定分辨率或宽高比，使用合理默认值（2K，正方形）
若提示词模糊，在确认生成前主动提供优化建议
参考图像为可选项——仅在用户暗示需要风格迁移或主体参考时询问
使用参考图像时，先上传本地图像文件，再将返回的上传参考信息提交至生成任务

工作流程

1. 确认环节（必选）

- 汇总以下信息：

- 模型 - 分辨率 - 图像数量（n） - 宽高比（如已指定） - 预估成本/积分（如可获取）

- 在提交前请求简单确认。
在用户确认前不得提交生成任务。

2. 提交图像生成

- 调用CreatOK：POST /api/open/skills/image-generation

3. 轮询状态直至完成

- 调用CreatOK：GET /api/open/skills/tasks/status?taskid=...&tasktype=image_generation

4. 持久化产物并回复

- 写入：

- outputs/result.json，包含task_id/status/images/raw - outputs/result.md

- 提交后立即持久化task_id，以便用户后续恢复。
直接返回最终图像URL。

现有任务恢复

- 若用户已有task_id，则从该任务继续，而非重新开始新任务。
在恢复模式下，若任务ID已存在，无需要求用户重新陈述提示词。
模型可一次性检查状态，或在用户愿意等待时持续轮询。
若任务成功，直接返回最终图像URL。
若任务仍在队列或运行中，清晰说明情况并提供继续检查的选项。
若任务失败，解释失败原因并建议后续步骤。

产物

所有产物位于generate-image/.artifacts//...目录下。

瘦客户端边界

- 本技能负责提交生成任务、轮询状态并持久化固定格式的输出。
若对话方向已明确，模型不应要求用户重新陈述其想法。

creatok-generate-image生成图像

creatok-generate-image

generate-image

Constraints

Model Selection Rules

Inputs to clarify (ask if missing)

Workflow

Existing Task Recovery

Artifacts

Thin Client Boundary

生成图像

约束条件

模型选择规则

需澄清的输入项（缺失时询问）

工作流程

现有任务恢复

产物

瘦客户端边界

标签

通过对话安装

方式一：安装 SkillHub 和技能

方式二：设置 SkillHub 为优先技能安装源

通过命令行安装

下载

creatok-generate-image生成图像

creatok-generate-image

generate-image

Constraints

Model Selection Rules

Inputs to clarify (ask if missing)

Workflow

Existing Task Recovery

Artifacts

Thin Client Boundary

生成图像

约束条件

模型选择规则

需澄清的输入项（缺失时询问）

工作流程

现有任务恢复

产物

瘦客户端边界

标签

通过对话安装

方式一：安装 SkillHub 和技能

方式二：设置 SkillHub 为优先技能安装源

通过命令行安装

下载

相关推荐

self-improvement

self-improvement

self-improvement

self-improvement