llm-d incubation
Incubating components of llm-d, a Kubernetes-native, high-performance distributed LLM inference framework.
Popular repositories
- llm-d-modelservice (Public): Helm charts for deploying models with llm-d
- workload-variant-autoscaler (Public): Variant optimization autoscaler for distributed inference workloads
Repositories
8 repositories, including:
- llm-d-fast-model-actuation (Public)
- offline-batch-gateway (Public): An llm-d-compatible implementation of the OpenAI Batch inference API (see the sketch after this list)
- workload-variant-autoscaler (Public): Variant optimization autoscaler for distributed inference workloads
- llm-d-ci (Public)
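Since offline-batch-gateway implements the OpenAI Batch inference API, a client can submit jobs with the standard OpenAI Python SDK pointed at the gateway. The sketch below illustrates the general Batch API flow (upload a JSONL file of requests, then create a batch); the gateway base URL, API key, and model name are assumptions for illustration, not values taken from the repository.

```python
"""Minimal sketch: submitting an offline batch to an OpenAI-compatible
Batch API endpoint such as the offline-batch-gateway is described as providing.
Base URL, API key, and model name below are hypothetical."""
import json

from openai import OpenAI  # standard OpenAI Python client

# Point the client at the gateway instead of api.openai.com (URL is hypothetical).
client = OpenAI(
    base_url="http://offline-batch-gateway.example.svc:8000/v1",
    api_key="not-needed-for-a-local-gateway",
)

# The Batch API takes a JSONL input file where each line is one request.
requests_jsonl = "\n".join(
    json.dumps({
        "custom_id": f"req-{i}",
        "method": "POST",
        "url": "/v1/chat/completions",
        "body": {
            "model": "example-model",  # hypothetical model name
            "messages": [{"role": "user", "content": prompt}],
        },
    })
    for i, prompt in enumerate(["Hello", "Summarize llm-d in one sentence"])
)

# Upload the request file, then create the batch job against it.
input_file = client.files.create(
    file=("batch.jsonl", requests_jsonl.encode()), purpose="batch"
)
batch = client.batches.create(
    input_file_id=input_file.id,
    endpoint="/v1/chat/completions",
    completion_window="24h",
)

# Results are fetched later, e.g. via client.batches.retrieve(batch.id).
print(batch.id, batch.status)
```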