Theta EdgeCloud Skill (Cloud API Runtime)

Dedicated inference status (revalidated 2026-03-11)

- Dedicated OpenAI-compatible inference (theta.inference.models, theta.inference.chat) was revalidated successfully after Developer Plan / quota upgrade.
Important readiness nuance: endpoint creation is not instantly probe-ready.
Observed post-upgrade warm-up pattern during successful live validation:

- early authenticated GET /v1/models returned transient 404, then transient 502, then succeeded - authenticated POST /v1/chat/completions succeeded after warm-up

- Operational guidance:

- use authenticated readiness retries for ~1-2 minutes before declaring dedicated endpoint failure - prefer vm_gt1 first when allocator capacity is available - if vm_gt1 is capacity-blocked, fall back to vm_gt2 or V100-backed options

Credential scope model (important)

This skill is command-scoped: only provide the credentials needed for the command family you use.

- Controller/deployment/project commands: THETA_EC_API_KEY, INLINECODE10
Balance command: add INLINECODE11
On-demand inference commands: THETA_ONDEMAND_API_TOKEN or INLINECODE13
Inference endpoint commands: THETA_INFERENCE_ENDPOINT + auth (THETA_INFERENCE_AUTH_TOKEN or user/pass)

Credentials above are not globally required all at once.

Quick setup (new users)

1) Log in at https://www.thetaedgecloud.com/. 2) Go to Account -> Projects and select your project. 3) Click Create API Key and copy the key. 4) Provide these on install/setup prompt: - THETA_EC_API_KEY - THETA_EC_PROJECT_ID 5) (Recommended for on-demand image/video generation) create On-demand API key/token and set: - THETA_ONDEMAND_API_KEY (or THETA_ONDEMAND_API_TOKEN) 6) For dedicated deployments / GPU-backed serving, check Account -> Quota and click Increase Quota if needed. - Default machine-type quotas may be too low (or zero) for dedicated GPU deployment creation. - Current live operator guidance: add at least $20 in credits first so the organization can reach Developer Plan, then retry Increase Quota for fuller functionality. - Higher quota tiers may still exist beyond Developer Plan.

If a command says a key is missing, run theta.auth.capabilities to see exactly what to configure.

This runtime artifact is scoped to cloud API operations only.

Security behavior (explicit)

- Runtime command handlers do not execute local shell commands.
Runtime does not read local files for upload operations.
Runtime does not call localhost/default local RPC endpoints.
Dedicated inference endpoint override via args.endpoint is disabled in runtime commands; use THETA_INFERENCE_ENDPOINT.
Runtime secret resolution uses OpenClaw secret provider first, then env fallback for:

- THETA_ONDEMAND_API_TOKEN - THETA_INFERENCE_AUTH_TOKEN - THETA_INFERENCE_AUTH_USER / THETA_INFERENCE_AUTH_PASS

- Paid/mutating operations are user-triggered and can be gated by THETA_DRY_RUN=1.

Credential model (what users actually need)

Users need a Theta EdgeCloud account with billing/credits enabled for paid operations.

Use only credentials required for the feature set you plan to call:

- Deployments API:

- THETA_EC_API_KEY - THETA_EC_PROJECT_ID

- Dedicated inference endpoint:

- THETA_INFERENCE_ENDPOINT - EITHER basic auth: - THETA_INFERENCE_AUTH_USER - THETA_INFERENCE_AUTH_PASS - OR bearer token auth: - THETA_INFERENCE_AUTH_TOKEN

- On-demand model API:

- THETA_ONDEMAND_API_TOKEN

- Theta Video API:

- THETA_VIDEO_SA_ID - INLINECODE37

Runtime-only package

This ClawHub artifact is a dist/docs bundle intended for transparent inspection and low scanner surface.

Env knobs (selected)

- INLINECODE38
INLINECODE39
INLINECODE40
INLINECODE41
INLINECODE42
INLINECODE43
INLINECODE44
INLINECODE45
INLINECODE46
INLINECODE47
INLINECODE48
INLINECODE49
INLINECODE50
INLINECODE51
INLINECODE52

AI Services coverage

- Deployments API: list + create + stop + delete
Dedicated model templates: standard + custom
On-demand model APIs: live discovery + infer/status/poll
Dedicated inference endpoint: models + chat
Dedicated deployments listing
Jupyter notebook listing
GPU node and GPU cluster listing
Persistent storage listing
Agentic AI (chatbot) listing
Theta Video APIs: list/upload/video/stream/ingestor operations

Theta-only OpenClaw operating options (no other subscriptions)

If Theta is the only paid AI backend, this skill can still cover most OpenClaw execution routes:

- Content generation:

- image/logo/creative generation (flux, stable_diffusion_*) via theta.ondemand.infer - image enhancement/upscale (esrgan) - identity-preserving generation (instant_id) - virtual try-on/product visualization (stable_viton) - video generation (step_video) and talking avatars (talking_head)

- Website AI features:

- chatbot/support/Q&A/rewrite pipelines using on-demand LLMs (llama_3_8b, llama_3_1_70b)

- Vision/media intelligence:

- captioning/alt-text (blip), object detection (grounding_dino), transcription (whisper)

- Video infrastructure:

- upload/video/stream/ingestor operations via theta.video.*

- Compute/ops:

- VM/deployment lifecycle + GPU/storage listings + capability/balance checks via theta.deployments.*, theta.ai.*, theta.auth.capabilities, INLINECODE70

Recommended reliability route:

- Prefer on-demand + video/controller flows for production automation.
Current validated daily-use routes include flux image generation, step_video on-demand video generation, and dedicated OpenAI-compatible inference after readiness warm-up.
Dedicated endpoint commands are now valid when the project has quota/plan support, but should use authenticated readiness retries instead of immediate fail-fast assumptions.

Organization & Project scope

- Theta dashboard uses Organization + Project context.
Runtime commands are project-scoped and require explicit projectId where relevant.
Org membership/invite/session management endpoints are web-dashboard auth flows and not included in this skill runtime.

API key vs user/password auth (validated)

- API key (THETA_EC_API_KEY) + project/org IDs can access project-scoped controller APIs and org balance.
API key is sufficient for key runtime operations; dashboard username/password is not required for these flows.
Username/password session auth is still required for account-management endpoints (org/project membership, invite, charge usage history APIs).

On-demand API key alias

The runtime accepts either THETA_ONDEMAND_API_TOKEN or THETA_ONDEMAND_API_KEY for on-demand model API auth.

Reliability behavior

- theta.ai.dedicatedDeployments.list now emits warning metadata if serving-template catalog calls fail; this avoids silent empty output.
INLINECODE78 returns service entries tagged by source (live or catalog) and includes fallback reason/warning fields when live discovery is unavailable.
INLINECODE81 now uses service-aware timeout behavior:

- submit timeout is automatically raised above generic HTTP defaults - poll/completion timeout auto-scales from requested video size when the request status exposes frame/fps metadata - sizing rounds up with extra variance buffer to reduce false local timeouts under load

Auth diagnostics

Use theta.auth.capabilities to quickly see which command families are available with the current credential set and which env vars are missing.

First-run setup command

Use theta.setup to get a one-screen checklist for new users, including where to create API keys and which env vars to set.

Theta EdgeCloud 技能（云 API 运行时）

专用推理状态（2026-03-11 重新验证）

- 专用 OpenAI 兼容推理（theta.inference.models、theta.inference.chat）在开发者计划/配额升级后已成功重新验证。
重要就绪细节：端点创建后并非立即可探测就绪。
在成功实时验证期间观察到的升级后预热模式：

- 早期经过身份验证的 GET /v1/models 返回临时 404，然后临时 502，最后成功 - 经过身份验证的 POST /v1/chat/completions 在预热后成功

- 操作指南：

- 在声明专用端点故障前，使用经过身份验证的就绪重试约 1-2 分钟 - 当分配器容量可用时，优先使用 vm_gt1 - 如果 vmgt1 容量受限，则回退到 vmgt2 或基于 V100 的选项

凭证范围模型（重要）

此技能按命令范围划分：仅提供您使用的命令族所需的凭证。

- 控制器/部署/项目命令：THETAECAPIKEY、THETAECPROJECTID
余额命令：添加 THETAORGID
按需推理命令：THETAONDEMANDAPITOKEN 或 THETAONDEMANDAPIKEY
推理端点命令：THETAINFERENCEENDPOINT + 身份验证（THETAINFERENCEAUTH_TOKEN 或用户名/密码）

上述凭证并非一次性全局必需。

快速设置（新用户）

1) 登录 https://www.thetaedgecloud.com/。 2) 进入账户 -> 项目并选择您的项目。 3) 点击创建 API 密钥并复制密钥。 4) 在安装/设置提示时提供以下内容： - THETAECAPI_KEY - THETAECPROJECT_ID 5) （推荐用于按需图像/视频生成）创建按需 API 密钥/令牌并设置： - THETAONDEMANDAPIKEY（或 THETAONDEMANDAPITOKEN） 6) 对于专用部署/GPU 后端服务，检查账户 -> 配额，如有需要点击增加配额。 - 默认机器类型配额可能过低（或为零），无法创建专用 GPU 部署。 - 当前实时操作指南：首先至少添加 $20 的积分，使组织达到开发者计划，然后重试增加配额以获得更完整的功能。 - 开发者计划之上可能还存在更高的配额层级。

如果某个命令提示缺少密钥，请运行 theta.auth.capabilities 查看需要配置的具体内容。

此运行时制品仅限云 API 操作范围。

安全行为（明确）

- 运行时命令处理程序不执行本地 shell 命令。
运行时不为上传操作读取本地文件。
运行时不调用 localhost/默认本地 RPC 端点。
运行时命令中禁用通过 args.endpoint 覆盖专用推理端点；请使用 THETAINFERENCEENDPOINT。
运行时密钥解析首先使用 OpenClaw 密钥提供程序，然后回退到环境变量，适用于：

- THETAONDEMANDAPI_TOKEN - THETAINFERENCEAUTH_TOKEN - THETAINFERENCEAUTHUSER / THETAINFERENCEAUTHPASS

- 付费/变更操作由用户触发，可通过 THETADRYRUN=1 进行门控。

凭证模型（用户实际需要）

用户需要一个已启用计费/积分的 Theta EdgeCloud 账户，用于付费操作。

仅使用您计划调用的功能集所需的凭证：

- 部署 API：

- THETAECAPI_KEY - THETAECPROJECT_ID

- 专用推理端点：

- THETAINFERENCEENDPOINT - 基本身份验证： - THETAINFERENCEAUTH_USER - THETAINFERENCEAUTH_PASS - 或 Bearer 令牌身份验证： - THETAINFERENCEAUTH_TOKEN

- 按需模型 API：

- THETAONDEMANDAPI_TOKEN

- Theta 视频 API：

- THETAVIDEOSA_ID - THETAVIDEOSA_SECRET

仅运行时包

此 ClawHub 制品是一个分发/文档包，旨在实现透明检查和低扫描器攻击面。

环境变量（精选）

- THETADRYRUN
THETAECAPIKEY
THETAECPROJECTID
THETAORGID
THETAINFERENCEENDPOINT
THETAINFERENCEAUTHUSER
THETAINFERENCEAUTHPASS
THETAINFERENCEAUTHTOKEN
THETAONDEMANDAPITOKEN
THETAONDEMANDAPIKEY
THETAVIDEOSAID
THETAVIDEOSASECRET
THETAHTTPTIMEOUTMS
THETAHTTPMAXRETRIES
THETAHTTPRETRYBACKOFF_MS

AI 服务覆盖范围

- 部署 API：列出 + 创建 + 停止 + 删除
专用模型模板：标准 + 自定义
按需模型 API：实时发现 + 推理/状态/轮询
专用推理端点：模型 + 聊天
专用部署列表
Jupyter 笔记本列表
GPU 节点和 GPU 集群列表
持久存储列表
代理 AI（聊天机器人）列表
Theta 视频 API：列出/上传/视频/流/摄取器操作

仅 Theta OpenClaw 操作选项（无其他订阅）

如果 Theta 是唯一的付费 AI 后端，此技能仍可覆盖大多数 OpenClaw 执行路径：

- 内容生成：

- 图像/标志/创意生成（flux、stablediffusion*）通过 theta.ondemand.infer - 图像增强/放大（esrgan） - 身份保持生成（instant_id） - 虚拟试穿/产品可视化（stable_viton） - 视频生成（stepvideo）和说话头像（talkinghead）

- 网站 AI 功能：

- 使用按需 LLM（llama38b、llama31_70b）的聊天机器人/支持/问答/重写管道

- 视觉/媒体智能：

- 字幕/替代文本（blip）、目标检测（grounding_dino）、转录（whisper）

- 视频基础设施：

- 通过 theta.video.* 的上传/视频/流/摄取器操作

- 计算/运维：

- 通过 theta.deployments.、theta.ai.、theta.auth.capabilities、theta.billing.balance 的 VM/部署生命周期 + GPU/存储列表 + 能力/余额检查

推荐可靠性路径：

- 生产自动化优先使用按需 + 视频/控制器流程。
当前已验证的日常使用路径包括 flux 图像生成、step_video 按需视频生成，以及就绪预热后的专用 OpenAI 兼容推理。
当项目具有配额/计划支持时，专用端点命令现在有效，但应使用经过身份验证的就绪重试，而不是立即快速失败假设。

组织与项目范围

- Theta 仪表板使用组织 + 项目上下文。
运行时命令按项目范围划分，在相关情况下需要显式 projectId。
组织成员/邀请/会话管理端点是 Web 仪表板身份验证流程，不包含在此技能运行时中。

API 密钥与用户名/密码身份验证（已验证）

- API 密钥（THETAECAPI_KEY）+ 项目/组织 ID 可以访问项目范围的控制器 API 和组织余额。
API 密钥足以满足关键运行时操作；这些流程不需要仪表板用户名/密码。
账户管理端点（组织/项目成员、邀请、计费使用历史 API）仍需要用户名/密码会话身份验证。

按需 API 密钥别名

运行时接受 THETAONDEMANDAPITOKEN 或 THETAONDEMANDAPIKEY 用于按需模型 API 身份验证。

可靠性行为

- theta.ai.dedicatedDeployments.list 现在在服务模板目录调用失败时发出警告元数据；这避免了静默空输出。
theta.ondemand.listServices 返回按来源（live 或 catalog）标记的服务条目，并在实时发现不可用时包含回退原因/警告字段。

theta-edgecloud-skillTheta边缘云技能