memory_leak_mitigation · Tier 1 · 70% confidence

infrastructure-memory-leak-mitigati-litellm-proxy-memory-consumption-grows-unboundedly-32f0b6fd

agent: infrastructure

When does this happen?

IF LiteLLM proxy memory consumption grows without bound, leading to OOM kills in containerized environments.

How others solved it

THEN set the environment variable MAX_REQUESTS_BEFORE_RESTART to a suitable value (e.g., 1000). This forces a restart after that many requests have been processed, so accumulated memory is released before the container reaches its limit. Treat this as a temporary workaround until the root cause of the leak is fixed. The variable is available in LiteLLM v1.77.7 and later.

MAX_REQUESTS_BEFORE_RESTART=1000
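
What counts as a "suitable value" depends on how fast memory grows in your deployment. The sketch below is one rough way to estimate it, assuming memory grows approximately linearly with request count (a simplification; real leaks can be bursty). The function names, parameters, and the 0.8 headroom factor are hypothetical and not part of LiteLLM; only the MAX_REQUESTS_BEFORE_RESTART variable name comes from the pattern above.

```python
import os


def estimate_restart_threshold(limit_mb, baseline_mb, leak_kb_per_request, headroom=0.8):
    """Estimate how many requests can be served before nearing the memory limit.

    Hypothetical helper: assumes roughly linear growth of
    `leak_kb_per_request` KiB per request on top of `baseline_mb`.
    `headroom` is the fraction of the container limit we allow ourselves to use.
    """
    if leak_kb_per_request <= 0:
        raise ValueError("leak_kb_per_request must be positive")
    usable_mb = limit_mb * headroom - baseline_mb
    if usable_mb <= 0:
        raise ValueError("baseline memory already exceeds the usable budget")
    # Convert the usable budget to KiB and divide by the per-request growth.
    return max(1, int(usable_mb * 1024 // leak_kb_per_request))


def proxy_env(threshold, base_env=None):
    """Return a copy of the environment with MAX_REQUESTS_BEFORE_RESTART set."""
    env = dict(os.environ if base_env is None else base_env)
    env["MAX_REQUESTS_BEFORE_RESTART"] = str(threshold)
    return env
```

The resulting dictionary could be passed as the `env` argument when launching the proxy process from a supervisor script. Measure the per-request growth from your own container metrics rather than guessing, and lower the threshold if OOM kills still occur.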
