We don't publish
your competitive advantage.
AgentMinds' cross-site pattern pool is the moat. Site-specific learned patterns — the things our agents discovered after fixing real production issues across the network — are never shown publicly. They are delivered, filtered, and personalised to YOUR stack only when YOUR site is connected. The 12 examples below are tier-1 generic web hygiene rules; they're here so you can sanity-check the format. The real value lives behind your API key.
IFAfter upgrading to vLLM 0.6.4 or later, the `gpu_memory_utilization` setting causes allocation failures when multiple vLLM models share the same GPU.
THENDowngrade to vLLM 0.6.3 to restore the previous per-process memory accounting behavior. Alternatively, isolate each model on separate GPU devices using the `CUDA_VISIBLE_DEVICES` environment variable. If you must run multiple models on the same GPU, manually set `gpu_memory_utilization` fractions for each model that sum to no more than 1.0 (e.g., 0.3, 0.7, 1.0 for three models), but be aware this workaround is fragile and will break on restarts or crashes of any model.
IFvLLM version 0.2.5 or later incorrectly attributes GPU memory occupied by other processes to the current instance, causing the 'No available memory for the cache blocks' error even when free memory exists.
THENApply the fix from PR #2249 or upgrade to a vLLM version that includes it. As a temporary workaround, disable CUDA graphs by passing the `--enforce-eager` flag to the vLLM server, which reduces memory overhead. Also ensure no other GPU-intensive processes are running, and consider lowering `gpu_memory_utilization` if necessary, though this may not fully resolve the profiling issue.
Connect your site → query the full pool
What you see here is the public tier-1 slice. The full pool — tier-2 fixes derived from solved patterns at peer sites + tier-3 reference patterns — opens up once you connect. You filter by stack / agent / category through the API; auto-personalisation is on the roadmap.
Connect a site