Platform-Engineering-Lab

Built a full hybrid environment spanning AWS, GCP, and on-prem ProxMox — complete with multi-tenant Kubernetes, Terraform IaC, and observability pipelines.

View the Project on GitHub

🧩 Cloud-Native Platform Engineering Lab

Declarative Kubernetes and Terraform lab demonstrating hybrid operations across GCP and on-prem ProxMox via Minikube. Built and validated entirely on local Minikube for proof-of-concept before scaling to GKE in future iterations.

⏱️ Terraform cut provisioning 6 h → 5 m · 🧩 3-namespace multi-tenant deployment on bare-metal Ubuntu · 🔐 SSH-free Zero Trust · 📊 Prometheus observability and Istio service mesh


I. Core Competency Showcase

Pillar Key Achievement Demonstrated Proficiency
Hybrid IaC Provisioned and tore down compute resources declaratively across GCP and ProxMox using Terraform. GCP (IAM, Billing, Secrets), ProxMox RBAC, Multi-Cloud Automation.
Resilience & HA Engineered and validated self-healing application lifecycle and robust Persistent Volume (PV/PVC) storage. Deployments (Rollbacks, Rollouts), ReplicaSets, Pod Troubleshooting, Stateful Workloads.
Observability & Service Mesh Implemented a Service Mesh (Istio) for L7 traffic control and established a comprehensive Prometheus observability stack. Istio Gateways, Service Accounts, Certificate Management, Bash Automation.
Security & Zero Trust Eliminated SSH ingress and enforced granular role-based access policies across GCP and Kubernetes. GCP (Secrets Manager, Service Accounts), Certificate Management, Bash Automation.

II. Technical Deep Dive (Proof of Work)

Detailed, timestamped documentation of all operational procedures, troubleshooting steps, and successful validations can be found in the accompanying OPERATIONS_LOG.md file.


III. Highlights

📑 Full operational logs and troubleshooting steps are available in OPERATIONS_LOG.md


IV. Repository Structure

├── README.md
├── story.md
├── Artifacts/                # Visual documentation of all tasks
│   ├── Deployments/
│   ├── Docker/
│   ├── GCP/
│   ├── Istio/
│   ├── Minikube/
│   ├── Namespaces/
│   ├── Networking/
│   ├── OnPremIaC/
│   ├── Pods/
│   ├── Prometheus/
│   ├── ReplicaSets/
│   ├── Security/
│   ├── ServiceAccounts/
│   ├── Scripts/
│   ├── Storage/
│   └── Terraform/
├── Docs/                        # OPERATIONS_LOG and diagrams
├── Manifests/                   # YAML manifests for Kubernetes resources
├── Deployments/                 # Deployment manifests & testing (rollouts, scaling, rollback)
├── Docker/                      # Docker installation, container management, volumes
├── GCP/                         # GCP provisioning, storage, VM management, billing, IAM
├── Istio/                       # Service mesh setup, Bookclub demo app, gateway & virtual services
├── Minikube/                    # Local Kubernetes cluster setup, load balancer, filesystem management
├── Namespaces/                  # Namespace creation, management, and taint validation
├── Networking/                  # Network policies and troubleshooting
├── OnPremIaC/                   # OnPrem Hypervisor & VM management, configuration and troubleshooting
├── Pods/                        # Pod creation, validation, troubleshooting, volume testing
├── Prometheus/                  # Monitoring and observability solution, scrapes for pod uptime
├── ReplicaSets/                 # ReplicaSet manifests, pod scaling, HA validation
├── Security/                    # Certificates, JSON keys, Kubernetes secrets
├── ServiceAccounts/             # Service account creation, JWT retrieval, pod attachment
├── Scripts/                     # Bash scripting to automate Prometheus Kubernetes deployments and administration
├── Storage/                     # PersistentVolume (PV) and PersistentVolumeClaim (PVC) creation & validation
└── Terraform/                   # Terraform manifests, IaC & troubleshooting