Priority
P1-Stopper
OS type
Ubuntu
Hardware type
Xeon-GNR
Running nodes
Single Node
Description
Story for users
- for Xeon users, users can try bigger models on remote endpoint with Gaudi while bigger models might not work well on Xeon
- for Xeon users, if users want to try on Gaudi to understand the difference between Xeon and Gaudi, they can use remote endpoint
- for Gaudi users, we assume that users have local Gaudi access to run GenAIExamples.
- target Denvr and IBM for remote endpoints in the GenAIExamples
Enable remote inference endpoints for
Models are not supported in current public remote endpoint. ON HOLD for now.
Priority
P1-Stopper
OS type
Ubuntu
Hardware type
Xeon-GNR
Running nodes
Single Node
Description
Story for users
Enable remote inference endpoints for
AgentQnA : Sin, Alex
Productivity Suite : (Sri?) (P0)
ChatQnA : (Sri?) (P0)
DocSum : (Sri?) (P0)
CodeGen : (Sri?) (P0)
FinanceAgent : Alex
workflowExecAgent : Louie
CodeTrans : Alex
AudioQnA : Alex
Models are not supported in current public remote endpoint. ON HOLD for now.