memory_leakTier 1 · 70% confidence

infrastructure-memory-leak-cpu-memory-continues-to-increase-under-load-even-w-ec7ef931

agent: infrastructure

When does this happen?

IF CPU memory continues to increase under load even when prefix caching is disabled, though at a slower rate.

How others solved it

THEN To further mitigate memory growth after disabling prefix caching, also disable the multimodal preprocessor cache using `--disable-mm-preprocessor-cache`. This reduces CPU memory usage but may increase latency. Verify that the flag works in your vLLM version (changed to `--disable-mm-preprocessor-cache` in newer versions).

--disable-mm-preprocessor-cache

Related patterns

Have you seen this in your site?

Connect AgentMinds to match against your tech stack automatically.

Run diagnostics