generate-image
Constraints
- - The model's final user-facing response should match the user's input language, default English.
- Must request user confirmation before triggering any paid/high-cost image generation call.
- After confirmed, must call CreatOK Open Skills proxy and wait until completion.
- Avoid technical wording in the user-facing reply unless the user explicitly needs details for debugging.
Model Selection Rules
- actual model id:
seedream-5.0-lite
- faster and lighter, good for quick iteration
- resolutions:
2K, 4K only
- actual model id:
nano-banana-pro
- highest quality, best for photorealistic portraits and product shots
- resolutions:
1K, 2K, 4K
- actual model id:
nano-banana-2
- latest Nano Banana, best overall quality
- resolutions:
1K, 2K, 4K
The model should recommend a model before generation based on the use case:
- - portraits / photorealistic people → INLINECODE6
- product shots / e-commerce → INLINECODE7
- general illustration / concept art → INLINECODE8
- quick preview / iteration → INLINECODE9
- user explicitly wants 1K →
nano-banana-pro or nano-banana-2 (Seedream does not support 1K)
Inputs to clarify (ask if missing)
- - ask only for what is necessary to generate a good image
- if resolution or aspect ratio is not specified, use sensible defaults (2K, square)
- if the prompt is vague, offer to refine it before confirming generation
- reference images are optional — ask only if the user implies style transfer or subject reference
- when reference images are used, upload the local image file first and submit the returned uploaded reference with the generation task
Workflow
- 1. Confirmation gate (mandatory)
- model
- resolution
- number of images (
n)
- aspect ratio if specified
- estimated cost/credits if available
- - Ask for a simple confirmation before submitting.
- Do not submit the generation task until user says yes.
- 2. Submit image generation
- - Call CreatOK: INLINECODE13
- 3. Poll status until completion
- - Call CreatOK: INLINECODE14
- 4. Persist artifacts + respond
-
outputs/result.json with
task_id/status/images/raw
-
outputs/result.md
- - Persist the
task_id immediately after submission so the user can recover later. - Return the final image URLs verbatim.
Existing Task Recovery
- - If the user already has a
task_id, continue from that task instead of starting a new one. - In recovery mode, do not ask the user to restate the prompt if the task id is already available.
- The model can either check status once or keep polling if the user wants to wait.
- If the task succeeded, return the final image URLs verbatim.
- If the task is still queued or running, explain clearly and offer to keep checking.
- If the task failed, explain the failure and suggest next steps.
Artifacts
All artifacts under generate-image/.artifacts/<run_id>/....
Thin Client Boundary
- - This skill submits generation jobs, polls status, and persists fixed-format outputs.
- The model should not make the user restate their idea if the direction is already clear from the conversation.
生成图像
约束条件
- - 模型最终面向用户的回复应与用户输入语言一致,默认为英语。
- 在触发任何付费/高成本图像生成调用前,必须请求用户确认。
- 确认后,必须调用CreatOK开放技能代理并等待执行完成。
- 在面向用户的回复中避免使用技术性措辞,除非用户明确需要调试细节。
模型选择规则
- 实际模型ID:seedream-5.0-lite
- 速度更快、更轻量,适合快速迭代
- 分辨率:仅支持
2K、4K
- 实际模型ID:nano-banana-pro
- 质量最高,最适合逼真人像和产品拍摄
- 分辨率:
1K、2K、4K
- 实际模型ID:nano-banana-2
- 最新版Nano Banana,综合质量最佳
- 分辨率:
1K、2K、4K
模型应根据使用场景在生成前推荐模型:
- - 人像/逼真人物 → nano-banana-2
- 产品拍摄/电商 → nano-banana-pro
- 通用插画/概念艺术 → nano-banana-2
- 快速预览/迭代 → seedream-5.0-lite
- 用户明确要求1K分辨率 → nano-banana-pro或nano-banana-2(Seedream不支持1K)
需澄清的输入项(缺失时询问)
- - 仅询问生成优质图像所必需的信息
- 若未指定分辨率或宽高比,使用合理默认值(2K,正方形)
- 若提示词模糊,在确认生成前主动提供优化建议
- 参考图像为可选项——仅在用户暗示需要风格迁移或主体参考时询问
- 使用参考图像时,先上传本地图像文件,再将返回的上传参考信息提交至生成任务
工作流程
- 1. 确认环节(必选)
- 模型
- 分辨率
- 图像数量(n)
- 宽高比(如已指定)
- 预估成本/积分(如可获取)
- - 在提交前请求简单确认。
- 在用户确认前不得提交生成任务。
- 2. 提交图像生成
- - 调用CreatOK:POST /api/open/skills/image-generation
- 3. 轮询状态直至完成
- - 调用CreatOK:GET /api/open/skills/tasks/status?taskid=...&tasktype=image_generation
- 4. 持久化产物并回复
- outputs/result.json,包含task_id/status/images/raw
- outputs/result.md
- - 提交后立即持久化task_id,以便用户后续恢复。
- 直接返回最终图像URL。
现有任务恢复
- - 若用户已有task_id,则从该任务继续,而非重新开始新任务。
- 在恢复模式下,若任务ID已存在,无需要求用户重新陈述提示词。
- 模型可一次性检查状态,或在用户愿意等待时持续轮询。
- 若任务成功,直接返回最终图像URL。
- 若任务仍在队列或运行中,清晰说明情况并提供继续检查的选项。
- 若任务失败,解释失败原因并建议后续步骤。
产物
所有产物位于generate-image/.artifacts//...目录下。
瘦客户端边界
- - 本技能负责提交生成任务、轮询状态并持久化固定格式的输出。
- 若对话方向已明确,模型不应要求用户重新陈述其想法。