Theta EdgeCloud Skill (Cloud API Runtime)
Dedicated inference status (revalidated 2026-03-11)
- Dedicated OpenAI-compatible inference (theta.inference.models, theta.inference.chat) was revalidated successfully after Developer Plan / quota upgrade.
- Important readiness nuance: endpoint creation is not instantly probe-ready.
- Observed post-upgrade warm-up pattern during successful live validation:
  - early authenticated GET /v1/models returned transient 404, then transient 502, then succeeded
  - authenticated POST /v1/chat/completions succeeded after warm-up
- use authenticated readiness retries for ~1-2 minutes before declaring dedicated endpoint failure
- prefer vm_gt1 first when allocator capacity is available
- if vm_gt1 is capacity-blocked, fall back to vm_gt2 or V100-backed options
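The warm-up guidance above can be sketched as a small retry loop. This is a minimal sketch under stated assumptions, not the skill's actual implementation: wait_until_ready, the probe callable, and the 90-second / 5-second defaults are hypothetical. Only the transient 404/502 pattern and the ~1-2 minute window come from the validation notes. The probe is assumed to perform an authenticated GET /v1/models and return the HTTP status code.

```python
import time
from typing import Callable

# Status codes observed as transient during endpoint warm-up (from the notes above).
TRANSIENT_STATUSES = {404, 502}


def wait_until_ready(
    probe: Callable[[], int],
    timeout_s: float = 90.0,   # assumption: inside the ~1-2 minute window
    interval_s: float = 5.0,   # assumption: illustrative polling interval
    sleep: Callable[[float], None] = time.sleep,
    clock: Callable[[], float] = time.monotonic,
) -> bool:
    """Return True once probe() reports 200; False on timeout or a non-transient status."""
    deadline = clock() + timeout_s
    while True:
        status = probe()
        if status == 200:
            return True
        if status not in TRANSIENT_STATUSES:
            # For example 401/403: retrying will not help, so stop early.
            return False
        if clock() + interval_s > deadline:
            # The next attempt would land past the deadline.
            return False
        sleep(interval_s)
```

Injecting sleep and clock keeps the loop testable without real waiting; in normal use only probe needs to be supplied.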
Credential scope model (important)
This skill is command-scoped: only provide the credentials needed for the command family you use.
- Controller/deployment/project commands: THETA_EC_API_KEY, THETA_EC_PROJECT_ID
- Balance command: add THETA_ORG_ID
- On-demand inference commands: THETA_ONDEMAND_API_TOKEN or THETA_ONDEMAND_API_KEY
- Inference endpoint commands: THETA_INFERENCE_ENDPOINT + auth (THETA_INFERENCE_AUTH_TOKEN or user/pass)
The credentials above are not all required at once; each command family needs only its own set.
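To illustrate the command-scoped model, here is a minimal sketch of a per-family credential check. The family labels ("controller", "balance", "ondemand", "inference") and the function missing_credentials are hypothetical names chosen for this example, and the table assumes the balance command needs the controller credentials plus THETA_ORG_ID. It is not the logic behind theta.auth.capabilities, only an approximation of the same idea.

```python
import os
from typing import Mapping

# Each requirement is a list of alternatives; an alternative is a tuple of
# variables that must all be set. Family names are hypothetical labels.
EC = [[("THETA_EC_API_KEY",)], [("THETA_EC_PROJECT_ID",)]]
REQUIREMENTS = {
    "controller": EC,
    "balance": EC + [[("THETA_ORG_ID",)]],
    "ondemand": [[("THETA_ONDEMAND_API_TOKEN",), ("THETA_ONDEMAND_API_KEY",)]],
    "inference": [
        [("THETA_INFERENCE_ENDPOINT",)],
        [
            ("THETA_INFERENCE_AUTH_TOKEN",),
            ("THETA_INFERENCE_AUTH_USER", "THETA_INFERENCE_AUTH_PASS"),
        ],
    ],
}


def missing_credentials(family: str, env: Mapping[str, str] = os.environ) -> list[str]:
    """Describe each unmet requirement for one command family."""
    unmet = []
    for alternatives in REQUIREMENTS[family]:
        satisfied = any(all(env.get(name) for name in alt) for alt in alternatives)
        if not satisfied:
            unmet.append(" or ".join(" + ".join(alt) for alt in alternatives))
    return unmet
```

An empty list means the family is usable with the current environment; otherwise each entry names what is still needed.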
Quick setup (new users)
1) Log in at https://www.thetaedgecloud.com/.
2) Go to Account -> Projects and select your project.
3) Click Create API Key and copy the key.
4) Provide these at the install/setup prompt:
   - THETA_EC_API_KEY
   - THETA_EC_PROJECT_ID
5) (Recommended for on-demand image/video generation) Create an on-demand API key/token and set:
   - THETA_ONDEMAND_API_KEY (or THETA_ONDEMAND_API_TOKEN)
6) For dedicated deployments / GPU-backed serving, check Account -> Quota and click Increase Quota if needed.
   - Default machine-type quotas may be too low (or zero) for dedicated GPU deployment creation.
   - Current live operator guidance: add at least $20 in credits first so the organization can reach Developer Plan, then retry Increase Quota for fuller functionality.
   - Higher quota tiers may still exist beyond Developer Plan.
If a command says a key is missing, run theta.auth.capabilities to see exactly what to configure.
This runtime artifact is scoped to cloud API operations only.
Security behavior (explicit)
- Runtime command handlers do not execute local shell commands.
- Runtime does not read local files for upload operations.
- Runtime does not call localhost/default local RPC endpoints.
- Dedicated inference endpoint override via args.endpoint is disabled in runtime commands; use THETA_INFERENCE_ENDPOINT.
- Runtime secret resolution uses OpenClaw secret provider first, then env fallback for:
  - THETA_ONDEMAND_API_TOKEN
  - THETA_INFERENCE_AUTH_TOKEN
  - THETA_INFERENCE_AUTH_USER / THETA_INFERENCE_AUTH_PASS
- Paid/mutating operations are user-triggered and can be gated by THETA_DRY_RUN=1.
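The provider-first secret resolution and the dry-run gate described above might look roughly like the sketch below. The provider argument is a hypothetical stand-in for the OpenClaw secret provider (its real interface is not documented here), and resolve_secret, run_mutating, and the returned dictionary shape are illustrative names only.

```python
import os
from typing import Any, Callable, Mapping, Optional


def resolve_secret(
    name: str,
    provider: Callable[[str], Optional[str]],  # hypothetical provider lookup
    env: Mapping[str, str] = os.environ,
) -> Optional[str]:
    """Try the secret provider first, then fall back to the environment."""
    value = provider(name)
    if value:
        return value
    return env.get(name) or None


def run_mutating(action: Callable[[], Any], env: Mapping[str, str] = os.environ) -> dict:
    """Skip a paid/mutating action when THETA_DRY_RUN=1 is set."""
    if env.get("THETA_DRY_RUN") == "1":
        return {"dryRun": True, "executed": False}
    return {"dryRun": False, "executed": True, "result": action()}
```

With THETA_DRY_RUN=1 the action callable is never invoked, which is the property that makes the gate safe for paid operations.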
Credential model (what users actually need)
Users need a Theta EdgeCloud account with billing/credits enabled for paid operations.
Use only credentials required for the feature set you plan to call:
- THETA_EC_API_KEY
- THETA_EC_PROJECT_ID
- Dedicated inference endpoint:
  - THETA_INFERENCE_ENDPOINT
  - EITHER basic auth:
    - THETA_INFERENCE_AUTH_USER
    - THETA_INFERENCE_AUTH_PASS
  - OR bearer token auth:
    - THETA_INFERENCE_AUTH_TOKEN
- THETA_ONDEMAND_API_TOKEN
- THETA_VIDEO_SA_ID
- THETA_VIDEO_SA_SECRET
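For the dedicated inference endpoint, the either/or between basic auth and bearer token auth can be sketched as follows. The function name build_auth_header is hypothetical, and checking the bearer token before the user/password pair is an assumption made for this example; the source does not state which takes precedence when both are set.

```python
import base64
import os
from typing import Mapping


def build_auth_header(env: Mapping[str, str] = os.environ) -> dict[str, str]:
    """Build an Authorization header from whichever credential set is present."""
    token = env.get("THETA_INFERENCE_AUTH_TOKEN")
    if token:
        # Assumption: bearer token is preferred when both styles are configured.
        return {"Authorization": f"Bearer {token}"}
    user = env.get("THETA_INFERENCE_AUTH_USER")
    password = env.get("THETA_INFERENCE_AUTH_PASS")
    if user and password:
        raw = f"{user}:{password}".encode("utf-8")
        return {"Authorization": "Basic " + base64.b64encode(raw).decode("ascii")}
    raise ValueError(
        "set THETA_INFERENCE_AUTH_TOKEN, or both "
        "THETA_INFERENCE_AUTH_USER and THETA_INFERENCE_AUTH_PASS"
    )
```

The returned dictionary can be merged into the headers of any HTTP client request sent to THETA_INFERENCE_ENDPOINT.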
Runtime-only package
This ClawHub artifact is a dist/docs bundle intended for transparent inspection and low scanner surface.
Env knobs (selected)
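- THETA_DRY_RUN
- THETA_EC_API_KEY
- THETA_EC_PROJECT_ID
- THETA_ORG_ID
- THETA_INFERENCE_ENDPOINT
- THETA_INFERENCE_AUTH_USER
- THETA_INFERENCE_AUTH_PASS
- THETA_INFERENCE_AUTH_TOKEN
- THETA_ONDEMAND_API_TOKEN
- THETA_ONDEMAND_API_KEY
- THETA_VIDEO_SA_ID
- THETA_VIDEO_SA_SECRET
- THETA_HTTP_TIMEOUT_MS
- THETA_HTTP_MAX_RETRIES
- THETA_HTTP_RETRY_BACKOFF_MS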
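As an illustration of how the HTTP knobs (THETA_HTTP_TIMEOUT_MS, THETA_HTTP_MAX_RETRIES, THETA_HTTP_RETRY_BACKOFF_MS) could be consumed, here is a minimal sketch. The default values, the HttpSettings type, and the exponential backoff schedule are all assumptions for this example; the runtime's real defaults and retry policy are not stated in this document.

```python
import os
from dataclasses import dataclass
from typing import Mapping


@dataclass(frozen=True)
class HttpSettings:
    timeout_ms: int
    max_retries: int
    retry_backoff_ms: int


def _int_from_env(env: Mapping[str, str], name: str, default: int) -> int:
    """Parse a non-negative integer, falling back to the default on bad input."""
    raw = env.get(name, "").strip()
    return int(raw) if raw.isdigit() else default


def load_settings(env: Mapping[str, str] = os.environ) -> HttpSettings:
    # Defaults below are hypothetical placeholders, not the runtime's values.
    return HttpSettings(
        timeout_ms=_int_from_env(env, "THETA_HTTP_TIMEOUT_MS", 30_000),
        max_retries=_int_from_env(env, "THETA_HTTP_MAX_RETRIES", 2),
        retry_backoff_ms=_int_from_env(env, "THETA_HTTP_RETRY_BACKOFF_MS", 500),
    )


def backoff_delays_ms(settings: HttpSettings) -> list[int]:
    """One delay per retry, doubling each time (assumed schedule)."""
    return [settings.retry_backoff_ms * (2 ** i) for i in range(settings.max_retries)]
```

Falling back to a default on unparsable input keeps a typo in one variable from breaking every command.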
AI Services coverage
- Deployments API: list + create + stop + delete
- Dedicated model templates: standard + custom
- On-demand model APIs: live discovery + infer/status/poll
- Dedicated inference endpoint: models + chat
- Dedicated deployments listing
- Jupyter notebook listing
- GPU node and GPU cluster listing
- Persistent storage listing
- Agentic AI (chatbot) listing
- Theta Video APIs: list/upload/video/stream/ingestor operations
Theta-only OpenClaw operating options (no other subscriptions)
If Theta is the only paid AI backend, this skill can still cover most OpenClaw execution routes:
- image/logo/creative generation (flux, stable_diffusion_*) via theta.ondemand.infer
- image enhancement/upscale (esrgan)
- identity-preserving generation (instant_id)
- virtual try-on/product visualization (stable_viton)
- video generation (step_video) and talking avatars (talking_head)
- chatbot/support/Q&A/rewrite pipelines using on-demand LLMs (llama_3_8b, llama_3_1_70b)
- Vision/media intelligence:
  - captioning/alt-text (blip), object detection (grounding_dino), transcription (whisper)
- upload/video/stream/ingestor operations via theta.video.*
- VM/deployment lifecycle + GPU/storage listings + capability/balance checks via theta.deployments.*, theta.ai.*, theta.auth.capabilities, theta.billing.balance
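One way to picture the Theta-only routing above is a lookup from task type to on-demand model. The task labels and the pick_model helper are hypothetical; only the model names come from the list. Choosing llama_3_1_70b over llama_3_8b for chat is an arbitrary choice for the example, and the argument shape expected by theta.ondemand.infer is deliberately not shown because it is not documented here.

```python
# Task labels are hypothetical; model names are taken from the list above.
TASK_TO_MODEL = {
    "image_generation": "flux",
    "image_upscale": "esrgan",
    "identity_preserving": "instant_id",
    "virtual_try_on": "stable_viton",
    "video_generation": "step_video",
    "talking_avatar": "talking_head",
    "chat": "llama_3_1_70b",
    "captioning": "blip",
    "object_detection": "grounding_dino",
    "transcription": "whisper",
}


def pick_model(task: str) -> str:
    """Return the on-demand model name for a task, or raise if unsupported."""
    try:
        return TASK_TO_MODEL[task]
    except KeyError:
        supported = ", ".join(sorted(TASK_TO_MODEL))
        raise ValueError(f"unsupported task {task!r}; expected one of: {supported}") from None
```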
Recommended reliability route:
- Prefer on-demand + video/controller flows for production automation.
- Current validated daily-use routes include flux image generation, step_video on-demand video generation, and dedicated OpenAI-compatible inference after readiness warm-up.
- Dedicated endpoint commands are now valid when the project has quota/plan support, but should use authenticated readiness retries instead of immediate fail-fast assumptions.
Organization & Project scope
- Theta dashboard uses Organization + Project context.
- Runtime commands are project-scoped and require explicit projectId where relevant.
- Org membership/invite/session management endpoints are web-dashboard auth flows and are not included in this skill runtime.
API key vs user/password auth (validated)
- API key (THETA_EC_API_KEY) + project/org IDs can access project-scoped controller APIs and org balance.
- API key is sufficient for key runtime operations; dashboard username/password is not required for these flows.
- Username/password session auth is still required for account-management endpoints (org/project membership, invite, charge usage history APIs).
On-demand API key alias
The runtime accepts either THETA_ONDEMAND_API_TOKEN or THETA_ONDEMAND_API_KEY for on-demand model API auth.
Reliability behavior
- theta.ai.dedicatedDeployments.list now emits warning metadata if serving-template catalog calls fail; this avoids silent empty output.
- theta.ondemand.listServices returns service entries tagged by source (live or catalog) and includes fallback reason/warning fields when live discovery is unavailable.
- INLINECODE81 now uses service-aware timeout behavior:
  - submit timeout is automatically raised above generic HTTP defaults
  - poll/completion timeout auto-scales from requested video size when the request status exposes frame/fps metadata
  - sizing rounds up with extra variance buffer to reduce false local timeouts under load
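The timeout auto-scaling described above can be approximated with a short calculation. This is a sketch only: poll_timeout_s and every constant in it (60 seconds of processing per second of video, a 25% variance buffer, a 120-second floor, rounding up to 30-second steps) are hypothetical values chosen to show the shape of the computation, not the runtime's real parameters.

```python
import math
from typing import Optional


def poll_timeout_s(
    frames: Optional[int],
    fps: Optional[float],
    seconds_per_video_second: float = 60.0,  # assumption: processing cost factor
    variance_buffer: float = 0.25,           # assumption: extra headroom under load
    floor_s: float = 120.0,                  # assumption: baseline timeout
    round_to_s: int = 30,                    # assumption: rounding step
) -> float:
    """Scale the poll timeout from video length, pad it, and round up."""
    if not frames or not fps or fps <= 0:
        # No frame/fps metadata exposed: keep the baseline timeout.
        return floor_s
    video_seconds = frames / fps
    padded = video_seconds * seconds_per_video_second * (1.0 + variance_buffer)
    rounded = math.ceil(padded / round_to_s) * round_to_s
    return max(floor_s, float(rounded))
```

Under these assumed constants, a 100-frame request at 25 fps (4 seconds of video) gets a 300-second poll timeout, while a request without metadata keeps the 120-second floor.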
Auth diagnostics
Use theta.auth.capabilities to quickly see which command families are available with the current credential set and which env vars are missing.
First-run setup command
Use theta.setup to get a one-screen checklist for new users, including where to create API keys and which env vars to set.