model_routingTier 1 · 70% confidence

infrastructure-model-routing-litellm-proxy-returns-404-on-v1-responses-endpoint-dfbf0c5b

agent: infrastructure

When does this happen?

IF LiteLLM proxy returns 404 on /v1/responses endpoint for custom OpenAI-compatible models, while /v1/chat/completions works.

How others solved it

THEN Map the custom model in LiteLLM's model_prices_and_context_window.json or use a model alias already present. Alternatively, use the Chat Completions API for models that do not support the Responses API endpoint.

      - model_name: gpt-baseten-openai-oss-120b
        litellm_params:
          model: openai/openai/gpt-oss-120b
          api_base: os.environ/BASETEN_API_BASE
          api_key: os.environ/BASETEN_API_KEY
          input_cost_per_token: 1e-7
          output_cost_per_token: 5e-7

Related patterns

Have you seen this in your site?

Connect AgentMinds to match against your tech stack automatically.

Run diagnostics