Skip to content
Open
Show file tree
Hide file tree
Changes from all commits
Commits
File filter

Filter by extension

Filter by extension


Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
10 changes: 5 additions & 5 deletions docs/quota_check.md
Original file line number Diff line number Diff line change
@@ -1,7 +1,7 @@
## Check Quota Availability Before Deployment

Before deploying the accelerator, **ensure sufficient quota availability** for the required model.
> **For Global Standard | GPT-4o - the capacity to at least 30K tokens for optimal performance.**
> **For Global Standard | GPT-4o - the capacity to at least 100K tokens for optimal performance.**

### Login if you have not done so already
```
Expand All @@ -11,7 +11,7 @@ azd auth login

### 📌 Default Models & Capacities:
```
gpt-4o:30
gpt-4o:100
```
### 📌 Default Regions:
```
Expand All @@ -37,19 +37,19 @@ eastus, uksouth, eastus2, northcentralus, swedencentral, westus, westus2, southc
```
✔️ Check specific model(s) in default regions:
```
./quota_check_params.sh --models gpt-4o:30
./quota_check_params.sh --models gpt-4o:100
```
✔️ Check default models in specific region(s):
```
./quota_check_params.sh --regions eastus,westus
```
✔️ Passing Both models and regions:
```
./quota_check_params.sh --models gpt-4o:30 --regions eastus,westus2
./quota_check_params.sh --models gpt-4o:100 --regions eastus,westus2
```
✔️ All parameters combined:
```
./quota_check_params.sh --models gpt-4:30 --regions eastus,westus --verbose
./quota_check_params.sh --models gpt-4:100 --regions eastus,westus --verbose
```

### **Sample Output**
Expand Down
2 changes: 1 addition & 1 deletion infra/scripts/quota_check_params.sh
Original file line number Diff line number Diff line change
Expand Up @@ -47,7 +47,7 @@ log_verbose() {
}

# Default Models and Capacities (Comma-separated in "model:capacity" format)
DEFAULT_MODEL_CAPACITY="gpt-4o:30"
DEFAULT_MODEL_CAPACITY="gpt-4o:100"
# Convert the comma-separated string into an array
IFS=',' read -r -a MODEL_CAPACITY_PAIRS <<< "$DEFAULT_MODEL_CAPACITY"

Expand Down
6 changes: 3 additions & 3 deletions src/ContentProcessorAPI/requirements.txt
Original file line number Diff line number Diff line change
Expand Up @@ -379,9 +379,9 @@ typer==0.15.1 \
typing-extensions==4.12.2 \
--hash=sha256:04e5ca0351e0f3f85c6853954072df659d0d13fac324d0072316b67d7794700d \
--hash=sha256:1a7ead55c7e559dd4dee8856e3a88b41225abfe1ce8df57b7c13915fe121ffb8
urllib3==2.3.0 \
--hash=sha256:1cee9ad369867bfdbbb48b7dd50374c0967a0bb7710050facf0dd6911440e3df \
--hash=sha256:f8c5449b3cf0861679ce7e0503c7b44b5ec981bec0d1d3795a07f1ba96f0204d
urllib3==2.5.0 \
--hash=sha256:3fc47733c7e419d4bc3f6b3dc2b4f890bb743906a30d56ba4a5bfa4bbff92760 \
--hash=sha256:e6b01673c0fa6a13e374b50871808eb3bf7046c4b125b216f6bf1cc604cff0dc
uvicorn==0.34.0 \
--hash=sha256:023dc038422502fa28a09c7a30bf2b6991512da7dcdb8fd35fe57cfc154126f4 \
--hash=sha256:404051050cd7e905de2c9a7e61790943440b3416f49cb409f965d9dcd0fa73e9
Expand Down
Loading