Prerequisites

This section contains the prerequisites for llmaz.

Requirements:

  • Kubernetes version >= 1.27.

    LWS (LeaderWorkerSet) requires Kubernetes v1.27 or higher. If you are running an older Kubernetes version and most of your workloads rely on single-node inference, we may consider replacing LWS with a Deployment-based fallback, in which Kubernetes Deployments manage the single-node inference workloads. See #32 for details and updates.

  • Helm 3, see installation.
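
The two requirements above can be checked before installing. The following is a minimal sketch, not part of llmaz: the helper names version_ge and check_prereqs are hypothetical, and it assumes kubectl, helm, jq, and GNU sort (for sort -V) are available on the PATH.

```shell
#!/usr/bin/env bash
# Sketch only: version_ge and check_prereqs are hypothetical helpers,
# not something shipped with llmaz.

# Succeeds when version $1 >= version $2 (a leading "v" is ignored).
# Relies on GNU sort's -V (version sort) ordering.
version_ge() {
  local have="${1#v}" want="${2#v}"
  [ "$(printf '%s\n%s\n' "$want" "$have" | sort -V | head -n1)" = "$want" ]
}

# Reads the cluster and Helm versions and compares them with the
# minimums listed above. Needs access to a running cluster.
check_prereqs() {
  local k8s helm_v
  k8s="$(kubectl version -o json | jq -r '.serverVersion.gitVersion')"
  helm_v="$(helm version --template '{{.Version}}')"

  if [ -z "$k8s" ] || [ "$k8s" = "null" ]; then
    echo "could not read the Kubernetes server version" >&2
    return 1
  fi
  if ! version_ge "$k8s" 1.27; then
    echo "Kubernetes >= 1.27 required, found $k8s" >&2
    return 1
  fi
  case "$helm_v" in
    v3.*) ;;
    *) echo "Helm 3 required, found ${helm_v:-none}" >&2; return 1 ;;
  esac
  echo "prerequisites satisfied: Kubernetes $k8s, Helm $helm_v"
}
```

Source the file and run check_prereqs against the target cluster; it returns a non-zero status and prints the reason when a requirement is not met.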

Note that the llmaz Helm chart installs the following by default:

  • LWS as the default inference workload in the llmaz-system namespace. If you have already installed it or want to deploy it in another namespace, append --set leaderWorkerSet.enabled=false to the command below.
  • Envoy Gateway and Envoy AI Gateway as the front end in the llmaz-system namespace. If you have already installed these two components or want to deploy them in another namespace, append --set envoy-gateway.enabled=false --set envoy-ai-gateway.enabled=false to the command below.
  • Open WebUI as the default chatbot. To disable it, append --set open-webui.enabled=false to the command below.
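
The flags from the list above can be combined. As a rough sketch, the Bash array below collects all four so they can be appended to the install command as a group; the variable name EXTRA_FLAGS is an assumption made for illustration, and the install command itself is deliberately not reproduced here.

```shell
#!/usr/bin/env bash
# Sketch only: EXTRA_FLAGS is a hypothetical variable name.
# Each entry is one of the --set flags from the list above;
# delete the lines for components you still want the chart to install.
EXTRA_FLAGS=(
  --set leaderWorkerSet.enabled=false   # LWS already installed or managed elsewhere
  --set envoy-gateway.enabled=false     # Envoy Gateway already installed
  --set envoy-ai-gateway.enabled=false  # Envoy AI Gateway already installed
  --set open-webui.enabled=false        # skip the Open WebUI chatbot
)

# Print the flags for review; append "${EXTRA_FLAGS[@]}" to the
# helm install command when running it.
printf '%s ' "${EXTRA_FLAGS[@]}"
echo
```

Keeping the flags in an array preserves each argument as a separate word, which avoids quoting mistakes when the list is edited.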