Automatic LLM routing that optimizes cost and speed
Factory Router automatically selects the best AI model for each coding task, reducing costs by up to 25% while maintaining frontier performance. It uses a classifier to score models and routes to cost-efficient options without sacrificing quality. Available in private preview.
Router
Frontier performance at lower cost
Automatic model selection for every Droid session. Factory Router picks the right model for each task, maintains frontier performance, and cuts cost by up to 25%.
$ droid --model router "refactor auth middleware"
Refactor auth middleware to use JWT validation
Droid is routing…
Auto-Model · Auto · MCP (3) · Skills (12)
router-classifier (classifier · ~2s)
Reads the first user message, recent tool calls and repo signals, then emits a scalar quality probability for each model.
Signal          Weight   Score
message         0.30     0.84
recent tools    0.20     0.62
repo size       0.15     0.77
language mix    0.20     0.91
difficulty      0.15     0.88
Final Score              0.80
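One way to read the table above is as a weighted average: each signal contributes its score multiplied by its weight, and the weights sum to 1. The sketch below reproduces that arithmetic with the example numbers. It is an illustration only; the actual classifier is not described in this detail, and the names `SIGNALS` and `weighted_score` are hypothetical.

```python
# Hypothetical reading of the example table: signal -> (weight, score).
# The numbers are copied from the example; the formula is an assumption.
SIGNALS = {
    "message": (0.30, 0.84),
    "recent tools": (0.20, 0.62),
    "repo size": (0.15, 0.77),
    "language mix": (0.20, 0.91),
    "difficulty": (0.15, 0.88),
}


def weighted_score(signals):
    """Return the weighted average of the per-signal scores."""
    total_weight = sum(weight for weight, _ in signals.values())
    if abs(total_weight - 1.0) > 1e-9:
        raise ValueError(f"weights must sum to 1, got {total_weight}")
    return sum(weight * score for weight, score in signals.values())


final_score = weighted_score(SIGNALS)
print(f"final score: {final_score:.4f}")
```

With these inputs the weighted average comes out at about 0.8055, which is in line with the 0.80 shown as the final score.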
candidate scoring (quality_threshold: 0.70), sorted cheapest → most expensive
Model             Provider    Cost   Score
Kimi K2.6         Moonshot    $      0.81
MiniMax-M2.7      MiniMax     $$     0.88
Claude Opus 4.7   Anthropic   $$$    0.95
Kimi K2.6
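The selection shown here is consistent with a simple rule: walk the candidates from cheapest to most expensive and take the first one whose score clears the quality threshold. The sketch below implements that rule with the example values. The `Candidate` type, the `pick_model` function, the numeric cost tiers, and the fallback to the highest-scoring model when nothing clears the threshold are all assumptions made for illustration, not Factory Router's documented behavior.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Candidate:
    name: str
    provider: str
    cost_tier: int  # hypothetical encoding: 1 = "$", 2 = "$$", 3 = "$$$"
    score: float    # quality probability from the classifier


# Values copied from the example above.
CANDIDATES = [
    Candidate("Claude Opus 4.7", "Anthropic", 3, 0.95),
    Candidate("Kimi K2.6", "Moonshot", 1, 0.81),
    Candidate("MiniMax-M2.7", "MiniMax", 2, 0.88),
]


def pick_model(candidates, quality_threshold=0.70):
    """Return the cheapest candidate whose score meets the threshold."""
    for candidate in sorted(candidates, key=lambda c: c.cost_tier):
        if candidate.score >= quality_threshold:
            return candidate
    # Assumed fallback: if nothing clears the bar, use the highest score.
    return max(candidates, key=lambda c: c.score)


print(pick_model(CANDIDATES).name)
```

At the example threshold of 0.70 every candidate qualifies, so the cheapest one, Kimi K2.6, is chosen. Raising the threshold pushes the choice toward the more expensive models.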
streaming
›Reading src/auth/middleware.ts...
›Found legacy session cookie validation
›Replacing with JWT verify (RS256)
›Generated 7 tests covering edge cases
›PR #418 opened — ready for review
prompt → classify → score → run
The problem
AI coding costs are rising across organizations.
Enterprise AI costs are climbing, and a bigger token bill does not mean more work is getting done. To avoid sacrificing quality, engineers usually default to the most capable model for every task. Simple questions, mechanical refactors, documentation updates, small bug fixes, and search-heavy investigations end up on the same premium path as work that truly needs frontier performance. Budgets run out without a clear increase in organization-level output.
For engineers
Stop choosing a model for every task.
Today you pick a model per task and lean on the most expensive one to be safe. With Factory Router, you choose once and it picks the best model for each session.
Same prompts. Different cost.
Without Routing (always Claude Opus 4.7)
“reset my password”         Claude Opus 4.7   $0.00
“add a copyright header”    Claude Opus 4.7   $0.00
“design a caching layer”    Claude Opus 4.7   $0.00
Total session cost: $0.00
With Factory Router (routed per task)
“reset my password”         Kimi K2.6         $0.00
“add a copyright header”    MiniMax-M2.7      $0.00
“design a caching layer”    Kimi K2.6         $0.00
Total session cost: $0.00
Savings on identical work: 0%
Benchmarks
On our enterprise engineering benchmarks.
Compared with Claude Opus 4.7, Factory Router maintains frontier performance at lower cost per session. At enterprise scale, those savings apply across every Droid session, with spend tied to the work being done rather than a blanket default to the most expensive model.
Reliability
Reliability you can count on.
When a provider degrades, rate limits hit, or capacity gets constrained, your sessions keep going. Factory Router routes across models, providers, and capacity to deliver 99.9%+ request reliability.
Claude Opus 4.7 · Bedrock · degraded
→ reroute
Claude Opus 4.7 · Vertex · healthy
Provider failover
If a provider path degrades, Factory Router keeps the session running on the same model through a healthy provider.
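As a rough illustration of the idea, the sketch below keeps the model fixed and picks the first healthy provider path for it, using the Bedrock and Vertex example. The `ProviderPath` type, the `pick_provider` function, the boolean health flag, and the list-order preference are hypothetical; how Factory Router actually measures provider health is not described here.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ProviderPath:
    model: str
    provider: str
    healthy: bool  # hypothetical health flag; real health checks are not specified


def pick_provider(model, paths):
    """Return the first healthy path serving `model`, in list order."""
    for path in paths:
        if path.model == model and path.healthy:
            return path
    raise RuntimeError(f"no healthy provider path for {model}")


# Example state: Bedrock is degraded, Vertex is healthy.
PATHS = [
    ProviderPath("Claude Opus 4.7", "Bedrock", healthy=False),
    ProviderPath("Claude Opus 4.7", "Vertex", healthy=True),
]

chosen = pick_provider("Claude Opus 4.7", PATHS)
print(f"{chosen.model} via {chosen.provider}")
```

The session stays on the same model; only the provider path changes.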
Dedicated TPM
Enterprise customers get reserved throughput for critical work instead of relying only on shared public capacity.
Rich frontier models
Factory Router keeps frontier models available as they come online, so high-complexity work gets the strongest model class.
US-hosted open-source models
Route eligible work to US-hosted open-source models when you need cost-efficient or controlled options.
For admins
Routing that reflects how your organization works.
Routing guidance brings your team's context into Factory Router, so automatic model selection reflects how work actually happens inside your organization. The same policy surfaces that govern other Factory models apply here, so admins manage access, compliance, and eligibility without a separate control plane.
Admin routing guidance
Automatic model selection for every Droid session · Enabled org-wide
Routing rules & context
- Routine refactors, formatting, and doc updates → favor cost-efficient models
- auth/ and payments/ need deeper reasoning → keep on frontier models
- Search-heavy investigation → route to open-source models
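The guidance above is written in plain language, and how Factory Router interprets it is not specified. Purely to make the intent concrete, the sketch below encodes the three example rules as a lookup: path-based rules take precedence, then the task kind decides the preferred model class. The task-kind labels, the `preferred_tier` function, and the precedence order are all assumptions.

```python
# Hypothetical encoding of the three example rules; not a real Factory config format.
FRONTIER_PATH_PREFIXES = ("auth/", "payments/")

TIER_BY_TASK_KIND = {
    "refactor": "cost-efficient",
    "formatting": "cost-efficient",
    "docs": "cost-efficient",
    "search": "open-source",
}


def preferred_tier(task_kind, touched_paths):
    """Return the preferred model class, or None if no rule applies."""
    # Assumed precedence: sensitive paths always stay on frontier models.
    if any(path.startswith(FRONTIER_PATH_PREFIXES) for path in touched_paths):
        return "frontier"
    return TIER_BY_TASK_KIND.get(task_kind)


print(preferred_tier("refactor", ["src/utils/format.ts"]))
print(preferred_tier("refactor", ["auth/middleware.ts"]))
```

Returning `None` when no rule matches leaves the decision to automatic selection, which matches the idea that guidance informs routing without replacing it.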
Available today
Use Factory Router in the Factory CLI and Desktop App.
Factory Router is in private research preview in the Factory CLI and Desktop App. Once enabled for your org, it appears in the model picker for every user with no setup required. Mission workers can use it too, so long-running autonomous work gets the same automatic model selection and savings as interactive and headless sessions.