Skip to content

Add scheduler plugins helm chart #17

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Open
wants to merge 1 commit into
base: main
Choose a base branch
from

Conversation

carlory
Copy link
Contributor

@carlory carlory commented May 29, 2025

Test:

(base) ➜  scheduler-plugins git:(install-as-a-second-scheduler) helm upgrade --install llmaz-scheduler --set scheduler.image.repository=ghcr.io/carlory/inftyai/scheduler-plugins --set scheduler.image.tag=latest  ./chart
Release "llmaz-scheduler" has been upgraded. Happy Helming!
NAME: llmaz-scheduler
LAST DEPLOYED: Thu May 29 18:11:25 2025
NAMESPACE: default
STATUS: deployed
REVISION: 9
TEST SUITE: None

(base) ➜  scheduler-plugins git:(install-as-a-second-scheduler) kubectl get po
NAME                                                            READY   STATUS    RESTARTS   AGE
llmaz-scheduler-6899f8dc7d-fljfj                                1/1     Running   0          8m15s

(base) ➜  scheduler-plugins git:(install-as-a-second-scheduler) kubectl get po -owide
NAME                                                            READY   STATUS    RESTARTS   AGE     IP            NODE                           NOMINATED NODE   READINESS GATES
llmaz-scheduler-6899f8dc7d-fljfj                                1/1     Running   0          9m15s   10.244.3.24   kind-worker                    <none>           <none>
qwen2-0--5b-demo-0                                              1/1     Running   0          5m50s   10.244.10.2   quirky-hofstadter-2901052955   <none>           <none>

(base) ➜  scheduler-plugins git:(install-as-a-second-scheduler) kubectl get openmodel qwen2-0--5b -oyaml
...
  inferenceConfig:
    flavors:
    - limits:
        cpu: 100m
        memory: 10Mi
      name: 0x
      nodeSelector:
        node.kubernetes.io/instance-type: c-0x-amd64-linux
    - limits:
        cpu: 100m
        memory: 10Mi
      name: 2x
      nodeSelector:
        node.kubernetes.io/instance-type: c-2x-amd64-linux
    - limits:
        cpu: 100m
        memory: 10Mi
      name: 1x
      nodeSelector:
        node.kubernetes.io/instance-type: c-1x-amd64-linux
...

(base) ➜  scheduler-plugins git:(install-as-a-second-scheduler) kubectl get po qwen2-0--5b-demo-0 -oya
ml
apiVersion: v1
kind: Pod
metadata:
  annotations:
    leaderworkerset.sigs.k8s.io/size: "1"
    llmaz.io/inference-service-flavors: 1x

(base) ➜  scheduler-plugins git:(install-as-a-second-scheduler) kubectl get nodeclaim
NAME            TYPE               CAPACITY   ZONE          NODE                           READY   AGE
default-tgf4f   c-1x-amd64-linux   spot       test-zone-a   quirky-hofstadter-2901052955   True    3h56m

@InftyAI-Agent InftyAI-Agent added needs-triage Indicates an issue or PR lacks a label and requires one. needs-priority Indicates a PR lacks a label and requires one. do-not-merge/needs-kind Indicates a PR lacks a label and requires one. labels May 29, 2025
@InftyAI-Agent InftyAI-Agent requested a review from kerthcet May 29, 2025 10:15
Comment on lines +7 to +10
image:
repository: inftyai/scheduler-plugins
tag: v0.0.1
pullPolicy: IfNotPresent
Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

no avaliable image.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
do-not-merge/needs-kind Indicates a PR lacks a label and requires one. needs-priority Indicates a PR lacks a label and requires one. needs-triage Indicates an issue or PR lacks a label and requires one.
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants