clawrouter
# ClawRouter
Smart LLM router that saves 67% on inference costs by routing each request to the cheapest model that can handle it. 55+ models across 9 providers (11 free NVIDIA models), all through one wallet.
## Install
```bash
openclaw plugins install @blockrun/clawrouter
```
## Setup
```bash
# Enable smart routing (auto-picks cheapest model per request)
openclaw models set blockrun/auto
# Or pin a specific model
openclaw models set openai/gpt-4o
```
## How Routing Works
ClawRouter classifies each request into one of four tiers:
- **SIMPLE** (40% of traffic) — factual lookups, greetings, translations → Gemini Flash ($0.60/M, 99% savings)
- **MEDIUM** (30%) — summaries, explanations, data extraction → DeepSeek Chat ($0.42/M, 99% savings)
- **COMPLEX** (20%) — code generation, multi-step analysis → Claude Opus ($75/M, best quality)
- **REASONING** (10%) — proofs, formal logic, multi-step math → o3 ($8/M, 89% savings)
Rules handle ~80% of requests in <1ms. Only ambiguous queries hit the LLM classifier (~$0.00003 per classification).
## Available Models
55+ models including: gpt-5.4, gpt-4o, o3, claude-opus-4.6, claude-sonnet-4.6, gemini-3.1-pro, gemini-2.5-flash, deepseek-chat, grok-3, kimi-k2.5, and 11 free NVIDIA models (nemotron-ultra-253b, deepseek-v3.2, mistral-large-675b, qwen3-coder-480b, devstral-2-123b, llama-4-maverick, glm-4.7, gpt-oss-120b, gpt-oss-20b, nemotron-3-super-120b, nemotron-super-49b).
## Example Output
```
[ClawRouter] google/gemini-2.5-flash (SIMPLE, rules, confidence=0.92)
Cost: $0.0025 | Baseline: $0.308 | Saved: 99.2%
```
标签
skill
ai