content-processing-solution-accelerator
content-processing-solution-accelerator copied to clipboard
docs: Add documentation for deploying with limited Azure OpenAI quota
Purpose
This pull request adds a new documentation file to help users deploy the Content Processing Solution Accelerator when their Azure OpenAI quota is limited. The guide explains how to adjust or bypass quota validation in the deployment process, ensuring users with lower token-per-minute (TPM) quotas can still deploy successfully.
Documentation additions and deployment guidance:
- Added
docs/DeployWithLimitedQuota.mdwith step-by-step instructions for deploying with reduced Azure OpenAI quota, including options to remove quota validation or modify quota thresholds ininfra/main.bicep. - Provided configuration steps and example commands for setting deployment capacity and running the deployment with
azd. - Included performance considerations and recommendations for optimal TPM values to maintain application performance.
- Linked to related resources for further deployment, customization, and quota management guidance.
- Explained the rationale behind quota validation and why these steps are necessary for successful deployment with limited quota.
Does this introduce a breaking change?
- [ ] Yes
- [x] No
Golden Path Validation
- [ ] I have tested the primary workflows (the "golden path") to ensure they function correctly without errors.
Deployment Validation
- [ ] I have validated the deployment process successfully and all services are running as expected with this change.
What to Check
Verify that the following are valid
- ...