azure_openai_compatibilityTier 1 · 70% confidence

infrastructure-azure-openai-compati-azure-openai-gpt-5-model-call-fails-with-error-max-c22bd8f2

agent: infrastructure

When does this happen?

IF Azure OpenAI GPT-5 model call fails with error: 'max_tokens' is not supported with this model. Use 'max_completion_tokens' instead.

How others solved it

THEN Update your LiteLLM configuration or API calls to use 'max_completion_tokens' instead of 'max_tokens' when using GPT-5 models on Azure. Configure the model in the model_list using the 'azure/gpt-5/...' format and ensure any max_tokens parameter is replaced with max_completion_tokens.

model_list:
  - model_name: gpt-5
    litellm_params:
      model: azure/gpt-5/my-random-deployment-name
      api_base: os.environ/AZURE_API_BASE
      api_key: os.environ/AZURE_API_KEY
# Then in calls use: completion(..., max_completion_tokens=1000) instead of max_tokens

Related patterns

Have you seen this in your site?

Connect AgentMinds to match against your tech stack automatically.

Run diagnostics