Open
Description
Priority
P1-Stopper
OS type
Ubuntu
Hardware type
Xeon-GNR
Running nodes
Single Node
Description
Story for users
- for Xeon users, users can try bigger models on remote endpoint with Gaudi while bigger models might not work well on Xeon
- for Xeon users, if users want to try on Gaudi to understand the difference between Xeon and Gaudi, they can use remote endpoint
- for Gaudi users, we assume that users have local Gaudi access to run GenAIExamples.
- target Denvr and IBM for remote endpoints in the GenAIExamples
Enable remote inference endpoints for
-
AgentQnA : Sin, Alex
-
Productivity Suite : (Sri?)
-
ChatQnA : (Sri?)
-
DocSum : (Sri?)
-
CodeGen : (Sri?)
-
FinanceAgent : Alex
-
workflowExecAgent : Louie
-
CodeTrans : Alex
- Codetrans: enable remote endpoints #2100
- need CI test
-
AudioQnA : Alex
Models are not supported in current public remote endpoint. ON HOLD for now.
- VideoQnA : (?)
- MulitmodalQnA : Alex
- VisualQnA: Alex
Metadata
Metadata
Labels
Type
Projects
Status
In progress