Skip to content

Commit 992e6d9

Browse files
committed
Misc changes
1 parent fd55001 commit 992e6d9

File tree

6 files changed

+5
-9
lines changed

6 files changed

+5
-9
lines changed

dev/versions.md

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -118,7 +118,7 @@ see https://github.com/moby/moby/issues/39302#issuecomment-639687466_
118118
1. Update the link at the top of the file to the URL you copied from
119119
1. Check that your diff is reasonable (and put back any of our modifications, e.g. the image path, rolling update
120120
strategy, resource requests, tolerations, node selector, priority class, etc)
121-
1. Confirm GPUs work for PyTorch, TensorFlow, and ONNX models
121+
1. Confirm GPUs work
122122

123123
## Inferentia device plugin
124124

docs/workloads/batch/jobs.md

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -202,7 +202,7 @@ RESPONSE:
202202
},
203203
"worker_counts": { # worker counts are only available while a job is running
204204
"pending": <int>, # number of workers that are waiting for compute resources to be provisioned
205-
"initializing": <int>, # number of workers that are initializing (downloading images or running your handler's init function)
205+
"initializing": <int>, # number of workers that are initializing
206206
"running": <int>, # number of workers that are actively working on batches from the queue
207207
"succeeded": <int>, # number of workers that have completed after verifying that the queue is empty
208208
"failed": <int>, # number of workers that have failed

docs/workloads/realtime/autoscaling.md

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -12,7 +12,7 @@ In addition to the autoscaling configuration options (described below), there ar
1212

1313
<br>
1414

15-
**`max_queue_length`** (default: 100): The maximum number of requests which will be queued by the replica (beyond `max_concurrency`) before requests are rejected with HTTP error code 503. For long-running APIs, decreasing `max_replica_concurrency` and configuring the client to retry when it receives 503 responses will improve queue fairness accross replicas by preventing requests from sitting in long queues.
15+
**`max_queue_length`** (default: 100): The maximum number of requests which will be queued by the replica (beyond `max_concurrency`) before requests are rejected with HTTP error code 503. For long-running APIs, decreasing `max_queue_length` and configuring the client to retry when it receives 503 responses will improve queue fairness accross replicas by preventing requests from sitting in long queues.
1616

1717
<br>
1818

manager/manifests/grafana/grafana-dashboard-realtime.yaml

Lines changed: 1 addition & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -964,9 +964,7 @@ data:
964964
}
965965
},
966966
{
967-
"aliasColors": {
968-
"iris-classifier": "light-green"
969-
},
967+
"aliasColors": {},
970968
"bars": false,
971969
"dashLength": 10,
972970
"dashes": false,

pkg/lib/regex/regex_test.go

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -596,7 +596,7 @@ func TestValidDockerImage(t *testing.T) {
596596
match: false, // Support this as valid?
597597
},
598598
{
599-
input: "680880929103.dkr.ecr.eu-central-1.amazonaws.com/cortexlabs/python-handler-cpu:latest",
599+
input: "680880929103.dkr.ecr.eu-central-1.amazonaws.com/cortexlabs/async-gateway:latest",
600600
match: true,
601601
},
602602
}

pkg/types/spec/utils.go

Lines changed: 0 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -24,8 +24,6 @@ import (
2424
"github.com/cortexlabs/cortex/pkg/types/userconfig"
2525
)
2626

27-
type modelValidator func(paths []string, prefix string, versionedPrefix *string) error
28-
2927
func FindDuplicateNames(apis []userconfig.API) []userconfig.API {
3028
names := make(map[string][]userconfig.API)
3129

0 commit comments

Comments
 (0)