content-processing-solution-accelerator icon indicating copy to clipboard operation
content-processing-solution-accelerator copied to clipboard

docs: Add documentation for deploying with limited Azure OpenAI quota

Open Kanchan-Microsoft opened this issue 3 months ago • 0 comments

Purpose

This pull request adds a new documentation file to help users deploy the Content Processing Solution Accelerator when their Azure OpenAI quota is limited. The guide explains how to adjust or bypass quota validation in the deployment process, ensuring users with lower token-per-minute (TPM) quotas can still deploy successfully.

Documentation additions and deployment guidance:

  • Added docs/DeployWithLimitedQuota.md with step-by-step instructions for deploying with reduced Azure OpenAI quota, including options to remove quota validation or modify quota thresholds in infra/main.bicep.
  • Provided configuration steps and example commands for setting deployment capacity and running the deployment with azd.
  • Included performance considerations and recommendations for optimal TPM values to maintain application performance.
  • Linked to related resources for further deployment, customization, and quota management guidance.
  • Explained the rationale behind quota validation and why these steps are necessary for successful deployment with limited quota.

Does this introduce a breaking change?

  • [ ] Yes
  • [x] No

Golden Path Validation

  • [ ] I have tested the primary workflows (the "golden path") to ensure they function correctly without errors.

Deployment Validation

  • [ ] I have validated the deployment process successfully and all services are running as expected with this change.

What to Check

Verify that the following are valid

  • ...

Other Information

Kanchan-Microsoft avatar Sep 29 '25 15:09 Kanchan-Microsoft