Perimattic
LiteLLM logo

Managed LiteLLM Hosting

AI & ML

LLM proxy gateway for 100+ providers

LiteLLM is a unified proxy gateway that routes requests across 100+ LLM providers using a single OpenAI-compatible API. ManageStacks deploys LiteLLM with persistent configuration, key management, and usage tracking.

About LiteLLM

LiteLLM is an open-source LLM proxy that provides a unified OpenAI-compatible interface for over 100 LLM providers including OpenAI, Anthropic, Azure, AWS Bedrock, Google Vertex AI, and self-hosted models. It enables teams to switch between providers, set budgets, track usage, and manage API keys from a single control plane.

With built-in load balancing, fallback routing, spend tracking, and virtual key management, LiteLLM is the ideal gateway for organizations using multiple AI providers. It simplifies vendor management and provides centralized observability across all LLM consumption.

Key Features

  • Unified API for 100+ LLM providers
  • Budget limits and spend tracking per key and team
  • Load balancing and fallback routing across models
  • Virtual API key management for teams
  • Request logging and usage analytics dashboard
  • Caching layer to reduce costs and latency

How ManageStacks Helps

ManageStacks deploys LiteLLM with its PostgreSQL database, Redis cache, and admin UI pre-configured. Centralize your LLM spend tracking and provider routing without managing proxy infrastructure.

Frequently Asked Questions

Can LiteLLM on ManageStacks route to both cloud and local models?+
Yes. LiteLLM can route to cloud providers like OpenAI and Anthropic as well as self-hosted Ollama or LocalAI instances running on ManageStacks. You configure all endpoints in a single proxy configuration.
How does ManageStacks handle LiteLLM's API key storage?+
ManageStacks deploys LiteLLM with a PostgreSQL database for secure key and configuration storage. All data is encrypted at rest and included in automated daily backups.
Can I set per-team spending limits with LiteLLM on ManageStacks?+
Yes. LiteLLM supports virtual keys with budget limits per key, per team, and per model. The ManageStacks deployment includes the admin dashboard for managing these controls.