Job Title
Senior DevOps Engineer with Python (Kubernetes expert)
Roles & Responsibilities
Cloud & Core Infrastructure
- Architect, build, and operate highly available, scalable, and secure cloud infrastructure primarily on AWS.
- Design VPCs, IAM, compute, storage, and load balancing solutions following AWS best practices.
- Define and implement infrastructure scalability, high availability, and disaster recovery strategies.
- Support multi-AZ and multi-region architectures for production workloads.
Infrastructure as Code (IaC)
- Design and maintain Terraform-based infrastructure using modules, workspaces, and remote state backends.
- Integrate Terraform workflows into CI/CD pipelines with automated validation and provisioning.
- Drive infrastructure standardization, governance, and reusability.
Containers & Kubernetes Platform
- Minimum 6 years of experience with Docker and Kubernetes.
- Design, deploy, and operate Kubernetes clusters (EKS and on-premises).
- Own Kubernetes control-plane and node lifecycle management, including safe and repeatable upgrades.
- Manage and upgrade EKS add-ons such as VPC CNI, CoreDNS, kube-proxy, metrics-server, CSI drivers, and ALB Ingress Controller.
- Design and operate highly available Kubernetes clusters across multiple availability zones.
- Implement and manage multi-cluster architectures including workload placement, regional failover, and cross-cluster service discovery.
- Package, deploy, and manage Kubernetes applications using Helm charts.
- Design and maintain reusable, versioned Helm charts with environment-specific values and override strategies.
- Manage Helm-based lifecycle operations including installs, upgrades, rollbacks, dependency management, and chart repositories.
Kubernetes Security & Governance
- Implement Pod Security Standards (baseline/restricted).
- Enforce policies using OPA Gatekeeper or Kyverno.
- Design and maintain RBAC governance with least-privilege access models.
- Secure the container supply chain using image scanning, SBOM generation, Cosign signing/verification, and registry governance.
GitOps & CI/CD
- Implement GitOps workflows using ArgoCD or FluxCD for declarative deployments and environment promotion.
- Design and operate CI/CD pipelines using Jenkins or GitHub Actions with integrated security and compliance checks.
- Enable automated rollbacks, drift detection, and environment consistency.
Development, Scripting & Automation
- Develop automation, tooling, and platform services using Python or Golang.
- Write Shell scripts for infrastructure automation, cluster operations, and tooling integration.
Linux, Operations & Incident Management
- Perform advanced Linux administration and troubleshooting across compute, networking, and storage layers.
- Lead and participate in production incident management, including triage, mitigation, and root-cause analysis.
- Diagnose complex failures across cloud infrastructure, Kubernetes, networking, and CI/CD systems.
Observability & Service Mesh (Good to Have)
- Implement monitoring, logging, and alerting using Prometheus/Grafana, Alertmanager, Loki/ELK, Datadog, or CloudWatch.
- Configure and operate Istio for traffic management, mTLS, retries, timeouts, and secure service-to-service communication.
- Understand Envoy proxy behavior within Kubernetes workloads.