AI Platform Engineering Scaling Agentic AI, Serverless & Kubernetes

AI Platform Engineer Lead with 8+ years spanning AI/ML and cloud infrastructure across banking, fintech, healthcare, and manufacturing. Currently architecting production-grade AI Agent deployment systems on GCP, from serverless Cloud Run and GKE clusters to Terraform-managed IaC and CI/CD automation.

Read My Blog

Explore More Insights

Dive deeper into my collection of articles covering MLOps, artificial intelligence, cloud infrastructure, and software engineering.

Browse all posts

Gen-AI

Hello World: Vibe Coding This Blog with Gemini

Nov 20, 2025 • 8 min read

Cloud Platform Engineering

Routing AI Traffic: GKE Istio vs Cloud Run Load Balancers

Apr 22, 2026 • 11 min read

Cloud Platform Engineering

Eliminating Dockerfiles with Cloud Native Buildpacks

Apr 22, 2026 • 8 min read

Featured Projects

All Projects

GKE GitOps Cluster Provisioning

Complete GitOps configuration for a Kubernetes AI platform — Istio ingress with VirtualService routing, Cert-Manager with private CA, ArgoCD self-healing sync, plus infrastructure for Qdrant, Dify, LiteLLM, and OpenTelemetry.

Kubernetes Helm ArgoCD Istio Cert-Manager GKE Prometheus

Coming Soon

Cloud Run Infrastructure Automation

Terraform modules for provisioning Internal HTTPS Load Balancers with dynamic backend services, Serverless NEGs, URL Maps, SSL certificate rotation, and HTTP-to-HTTPS redirect — all driven by environment-specific tfvars files.

Terraform GCP Cloud Run Internal Load Balancing Serverless NEGs IAM

Coming Soon

Enterprise AI Knowledge Base Platform

A FastAPI-based document retrieval and ingestion platform powering AI Agent RAG workflows. Features hybrid search (RRF combining BM25 + KNN), async ingestion via Pub/Sub and DocumentAI, neighbor-context enrichment, and multi-embedding support (Vertex AI, Qwen).

FastAPI Python Elasticsearch Qdrant PostgreSQL Pub/Sub Vertex AI DocumentAI

Coming Soon