Financial Circuit Breaker
Automatically halts all AI agent traffic when request rates exceed your budget ceiling. Requires human approval to resume. No exceptions.
When AI agents operate autonomously, a single misconfigured prompt or recursive tool chain can generate thousands of API requests in minutes. The Financial Circuit Breaker monitors request velocity across your entire account and trips the moment your defined threshold is exceeded — immediately blocking all further tool calls until a human explicitly approves resumption.
This is not a soft rate limit. It is a hard stop with a human-in-the-loop gate.
Why this matters now
AI agents are becoming more autonomous. They chain tool calls, retry on failure, and explore APIs without human supervision. A single recursive loop can turn a $5/month API bill into a $500 surprise overnight. Traditional rate limiting doesn't solve this — it slows requests down but never stops them entirely:
How it works
The breaker operates at the infrastructure layer. It cannot be bypassed, negotiated, or ignored by the AI agent — the proxy simply stops forwarding requests.
Configuration
The circuit breaker is controlled from Settings → Circuit Breaker in the Vinkius Cloud dashboard. All parameters apply globally across your account.
Using the Circuit Breaker from the Dashboard
The circuit breaker is managed entirely from the Vinkius Cloud dashboard. Here's how to configure and monitor it:
Setting up
Responding to a trip
When the circuit breaker trips, a prominent amber alert appears at the top of the Circuit Breaker settings panel:
You have two options:
- Manual resumption — Click Approve Resumption to immediately restore traffic. Use this when you've identified and resolved the cause of the spike.
- Wait for auto-reset — After the cooldown period expires, traffic resumes automatically. Use this when you want the system to self-heal while you investigate.
BEST PRACTICE
After a trip, check the Audit Logs on your server detail pages to identify which tool or agent caused the spike. Common culprits include recursive tool chains, retry loops, and misconfigured polling agents.
Circuit breaker vs. traditional rate limiting
The circuit breaker is fundamentally different from per-endpoint rate limiting. Rate limiting throttles individual endpoints — the circuit breaker protects your entire budget:
When the breaker trips
429 Too Many Requests with a Retry-After header indicating when the cooldown expires. Active SSE connections remain open but will not process further tool calls.Real-world examples
Concrete scenarios where the circuit breaker protects you — and the recommended settings for each.
GET /users (returns 2,000 users), then loops PATCH /users/{id}/avatar for every single one. That's 2,001 requests in under 60 seconds from a single prompt. 429. You receive an alert, approve resumption after reviewing, and only 10% of your quota was used.GET /deals?status=open to check for new opportunities. At 2 AM, a misconfigured retry policy starts calling the endpoint 10 times per second instead. By morning, it has made 288,000 requests. Your monthly quota is gone. Quick reference
START WITH THE DEFAULTS
If unsure, keep the defaults — 5,000 requests / 5 min window / 15 min cooldown. This configuration catches runaway behavior within seconds while allowing normal production workflows. You can always adjust after observing your actual traffic patterns in the Audit Logs.
Next steps
Frequently Asked Questions
What is the Financial Circuit Breaker?
The Financial Circuit Breaker automatically halts all AI agent traffic when request rates exceed your defined budget ceiling within a configurable time window. It requires explicit human approval to resume — preventing runaway AI agents from generating unexpected costs.
How does the circuit breaker detect runaway behavior?
A sliding time window tracks the number of MCP requests across all your servers. When the count exceeds your configured max_requests threshold within the window, the breaker trips instantly. All subsequent requests receive a 429 response until a human approves resumption.
Can the circuit breaker reset automatically?
Yes. After the configured cooldown period expires, the circuit breaker resets automatically and traffic resumes. You can also manually approve resumption at any time from the dashboard.
Does the circuit breaker apply per-server or globally?
The circuit breaker applies globally across all servers on your account. This ensures that runaway behavior on any single server cannot exhaust your overall budget.
What happens to active AI sessions when the breaker trips?
Active SSE connections remain open but all tool call requests return a 429 Too Many Requests response with a Retry-After header. The AI client receives an error message explaining that the budget ceiling has been reached.
Can I configure different thresholds for different servers?
Currently, the circuit breaker operates at the account level with a single set of thresholds. This prevents runaway behavior on any server from bypassing budget controls by spreading requests across multiple servers.