MLOps Platform Engineering

GitOps and Infrastructure for AI Agents on GKE

Learn how to set up a robust, GitOps-driven Kubernetes cluster for running AI workloads using Helm and ArgoCD.

8 min read
The "App of Apps" Pattern

Managing modern AI infrastructure requires a declarative approach. By using ArgoCD and Helm, we can define our entire cluster state in Git. This includes core infrastructure like Istio, Cert-Manager, and Prometheus, as well as our AI stack including Vector Databases and LLM Gateways.

Istio for Traffic Management

Instead of exposing services directly, an Istio Ingress Gateway provides Layer 7 capabilities like advanced routing, TLS termination, and traffic control. This is crucial for securely exposing AI APIs to internal applications.

yaml
apiVersion: networking.istio.io/v1alpha3
kind: VirtualService
metadata:
  name: dify
spec:
  hosts:
  - "dify.ai-platform.local"
  http:
  - route:
    - destination:
        host: dify-api-svc.dify.svc.cluster.local
        port:
          number: 80
Traffic Flow
graph TD; Client-->LB[GCP Load Balancer]; LB-->Istio[Istio Ingress Gateway]; Istio-->LLM[LiteLLM Gateway]; Istio-->VDB[Vector DB];

More Recent Posts