Gloo AI Gateway Hands-On Lab: Rate Limiting and Model Failover

Sign up for the free, hands-on technical labs.

Rate Limiting and Usage Management

Control token usage within LLM provider APIs
Implement rate limiting to enforce budget constraints
Set per-user rate limits based on JWT claims
Monitor usage metrics with Grafana to optimize resource allocation

Model Failover with Gloo AI Gateway

Ensure uninterrupted service in LLM provider APIs with failover
Configure upstreams and RouteOptions to redirect requests to alternative models for reliability and resilience

Take the course

Gloo AI Gateway Hands-On Lab: Rate Limiting and Model Failover

Take the course

Gloo AI Gateway Hands-On Lab: Rate Limiting and Model Failover

Lab

Take the course

Additional Resources

BackstageCon 2024: A Recap from Paris

How to make your API gateway easier, faster, and more cost effective

A Guide to Authorization Policy in Ambient Mesh

Cloud connectivity done right