São Paulo – Brazil
📧 adonai.costa@gmail.com
🔗 https://linkedin.com/in/adonaicosta
Architecting resilient, scalable, AI-augmented cloud platforms operating at scale.
Principal-level Site Reliability Engineer and Solutions Architect with 15+ years of experience in infrastructure, distributed systems, and cloud-native platforms.
Operating ~120 Kubernetes clusters across hybrid and multi-cloud environments, supporting 200–1500 deployments.
Focused on:
Actively applying AI-augmented engineering (vibe coding) using Gemini, Claude, and Antigravity to accelerate architectural design, troubleshooting, IaC, CI/CD, and incident response — reducing MTTR by ~50% in critical incidents.
Kubernetes Expertise
On-Prem · Hybrid · Managed (GKE · EKS · AKS · OKE · DOKS)
Cloud Native Architecture
Platform Engineering · GitOps · DevSecOps · Service Mesh
Reliability Engineering
SLO/SLI · Incident Response · Scalability · 1M req/sec traffic peaks
AI-Augmented Operations
LLM-driven troubleshooting · Manifest generation · Incident diagnostics · Automation acceleration
Multi-Cloud Strategy
GCP · AWS · Azure · OCI
In a Big Big Big Kubernetes and Cloud Operations in Brazil
In a Bank in Brazil
Google Cloud Professional Architect
CKA · CKS · LFCE / LFCS · LPIC 1 & 2
Kubernetes · GKE · EKS · AKS · OpenShift · Rancher
Terraform · Ansible · Argo CD · Flux CD
Prometheus · Grafana · Loki · Tempo - ElasticSearch
Istio · Linkerd · Kyverno · Cert-Manager
Docker · Linux · Python
GCP · AWS · Azure · OCI
Generative AI · AIOps · AI-Assisted Coding
Designing mission-critical Kubernetes platforms with AI-accelerated engineering.