Job Title

Senior DevOps Engineer with Python (Kubernetes expert)

Roles & Responsibilities



Cloud & Core Infrastructur

e- Architect, build, and operate highly available, scalable, and secure cloud infrastructure primarily on AWS

- Design VPCs, IAM, compute, storage, and load balancing solutions following AWS best practices

- Define and implement infrastructure scalability, high availability, and disaster recovery strategies

- Support multi-AZ and multi-region architectures for production workloads

Infrastructure as Code (IaC

)- Design and maintain Terraform-based infrastructure using modules, workspaces, and remote state backends

- Integrate Terraform workflows into CI/CD pipelines with automated validation and provisioning

- Drive infrastructure standardization, governance, and reusability

Containers & Kubernetes Platfor

m- Minimum 6 years of experience in Docker & Kubernete

s- Design, deploy, and operate Kubernetes clusters (EKS and On-Premise)

- Own Kubernetes control-plane and node lifecycle management, including safe and repeatable upgrades

- Manage and upgrade EKS add-ons such as VPC CNI, CoreDNS, kube-proxy, metrics-server, CSI drivers, and ALB Ingress Controller

- Design and operate highly available Kubernetes clusters across multiple availability zones

- Implement and manage multi-cluster architectures including workload placement, regional failover, and cross-cluster service discovery

- Package, deploy, and manage Kubernetes applications using Helm charts

- Design and maintain reusable, versioned Helm charts with environment-specific values and override strategies

- Manage Helm-based lifecycle operations including installs, upgrades, rollbacks, dependency management, and chart repositories

Kubernetes Security & Governanc

e- Implement Pod Security Standards (baseline/restricted)

- Enforce policies using OPA Gatekeeper or Kyverno

- Design and maintain RBAC governance with least-privilege access models

- Secure the container supply chain using image scanning, SBOM generation, Cosign signing/verification, and registry governance

GitOps & CI/C

D- Implement GitOps workflows using ArgoCD or FluxCD for declarative deployments and environment promotion

- Design and operate CI/CD pipelines using Jenkins or GitHub Actions with integrated security and compliance checks

- Enable automated rollbacks, drift detection, and environment consistency

Development, Scripting & Automatio

n- Develop automation, tooling, and platform services using Python or Golang

- Write Shell scripts for infrastructure automation, cluster operations, and tooling integration

Linux, Operations & Incident Managemen

t- Perform advanced Linux administration and troubleshooting across compute, networking, and storage layers

- Lead and participate in production incident management, including triage, mitigation, and root-cause analysis

- Diagnose complex failures across cloud infrastructure, Kubernetes, networking, and CI/CD systems

Observability & Service Mesh (Good to Have

)- Implement monitoring, logging, and alerting using Prometheus/Grafana, Alertmanager, Loki/ELK, Datadog, or CloudWatch

- Configure and operate Istio for traffic management, mTLS, retries, timeouts, and secure service-to-service communication

- Understand Envoy proxy behavior within Kubernetes workloads


.
Similar jobs

Devops with Python (Kubernetes expert)

Apply Now
Back to search page