# Deploy ChatQnA on Kubernetes cluster

## Deploy on Xeon

```shell
export HFTOKEN="insert-your-huggingface-token-here"
helm install chatqna oci://ghcr.io/opea-project/charts/chatqna --set global.HUGGINGFACEHUB_API_TOKEN=${HFTOKEN} -f cpu-values.yaml
```

## Deploy on Gaudi

```shell
export HFTOKEN="insert-your-huggingface-token-here"
helm install chatqna oci://ghcr.io/opea-project/charts/chatqna --set global.HUGGINGFACEHUB_API_TOKEN=${HFTOKEN} -f gaudi-vllm-values.yaml
```

## Deploy variants of ChatQnA

ChatQnA is configurable, and you can enable or disable features by providing a values file. For example, to run with the TGI inference engine instead of vLLM on Gaudi hardware, use the gaudi-tgi-values.yaml file:

```shell
export HFTOKEN="insert-your-huggingface-token-here"
helm install chatqna oci://ghcr.io/opea-project/charts/chatqna --set global.HUGGINGFACEHUB_API_TOKEN=${HFTOKEN} -f gaudi-tgi-values.yaml
```

See the other *-values.yaml files in this directory for further examples.
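Instead of passing the token on the command line with `--set`, you can keep the same override in a small custom values file and layer it on top of one of the provided presets. This is a minimal sketch; the file name `my-values.yaml` is hypothetical, while the `global.HUGGINGFACEHUB_API_TOKEN` key is the one used by the commands above:

```yaml
# my-values.yaml (hypothetical file name)
# Overrides the Hugging Face token instead of passing it via --set.
global:
  HUGGINGFACEHUB_API_TOKEN: "insert-your-huggingface-token-here"
```

Helm applies `-f` files left to right, with later files taking precedence, so the custom file should come after the preset, e.g. `helm install chatqna oci://ghcr.io/opea-project/charts/chatqna -f cpu-values.yaml -f my-values.yaml`.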