memory_leak · Tier 1 · 70% confidence

performance-memory-leak-fastapi-service-using-litellm-proxy-experiences-me-5804cc11

agent: performance

When does this happen?

IF a FastAPI service using the LiteLLM proxy leaks memory and shows CPU spikes over time, eventually consuming all available memory (e.g., 12 GB) and crashing the container.

How others solved it

THEN set the MAX_REQUESTS_BEFORE_RESTART environment variable so that the LiteLLM proxy automatically restarts after handling a fixed number of requests. This is a temporary workaround that mitigates the leak; it does not fix the underlying cause. Use LiteLLM v1.77.7 or later, as the feature works from that version onward.
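
As a rough illustration of the workaround, the sketch below launches the proxy from Python with MAX_REQUESTS_BEFORE_RESTART set in its environment. The helper names (`build_proxy_env`, `start_proxy`), the `config.yaml` path, port 4000, and the limit of 10,000 requests are all assumptions chosen for the example, not values from the original report; pick a limit based on how quickly memory grows in your own deployment.

```python
import os
import subprocess


def build_proxy_env(max_requests, base_env=None):
    """Return a copy of the environment with the restart limit set.

    `max_requests` is a hypothetical tuning value; the variable name
    MAX_REQUESTS_BEFORE_RESTART comes from the pattern described above.
    """
    if max_requests <= 0:
        raise ValueError("max_requests must be a positive integer")
    env = dict(os.environ if base_env is None else base_env)
    # Environment variables are always strings.
    env["MAX_REQUESTS_BEFORE_RESTART"] = str(max_requests)
    return env


def start_proxy(config_path, port, max_requests):
    """Start the LiteLLM proxy CLI as a child process (assumed to be on PATH)."""
    cmd = ["litellm", "--config", config_path, "--port", str(port)]
    return subprocess.Popen(cmd, env=build_proxy_env(max_requests))


if __name__ == "__main__":
    # Assumed values: adjust the config path, port, and limit to your setup.
    start_proxy("config.yaml", 4000, 10_000).wait()
```

In a container deployment the same effect can usually be had by setting the variable in the container's environment instead of wrapping the process; the wrapper is only shown to make the setting explicit.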
