Job Title
Senior Dev Ops Engineer with Python (Kubernetes expert)
Roles & Responsibilities
Cloud & Core Infrastructur
e- Architect, build, and operate highly available, scalable, and secure cloud infrastructure primarily on AWS
- Design VPCs, IAM, compute, storage, and load balancing solutions following AWS best practices
- Define and implement infrastructure scalability, high availability, and disaster recovery strategies
- Support multi-AZ and multi-region architectures for production workloads
Infrastructure as Code (Ia C
)- Design and maintain Terraform-based infrastructure using modules, workspaces, and remote state backends
- Integrate Terraform workflows into CI/CD pipelines with automated validation and provisioning
- Drive infrastructure standardization, governance, and reusability
Containers & Kubernetes Platfor
m- Minimum 6 years of experience in Docker & Kubernete
s- Design, deploy, and operate Kubernetes clusters (EKS and On-Premise)
- Own Kubernetes control-plane and node lifecycle management, including safe and repeatable upgrades
- Manage and upgrade EKS add-ons such as VPC CNI, Core DNS, kube-proxy, metrics-server, CSI drivers, and ALB Ingress Controller
- Design and operate highly available Kubernetes clusters across multiple availability zones
- Implement and manage multi-cluster architectures including workload placement, regional failover, and cross-cluster service discovery
- Package, deploy, and manage Kubernetes applications using Helm charts
- Design and maintain reusable, versioned Helm charts with environment-specific values and override strategies
- Manage Helm-based lifecycle operations including installs, upgrades, rollbacks, dependency management, and chart repositories
Kubernetes Security & Governanc
e- Implement Pod Security Standards (baseline/restricted)
- Enforce policies using OPA Gatekeeper or Kyverno
- Design and maintain RBAC governance with least-privilege access models
- Secure the container supply chain using image scanning, SBOM generation, Cosign signing/verification, and registry governance
Git Ops & CI/C
D- Implement Git Ops workflows using Argo CD or Flux CD for declarative deployments and environment promotion
- Design and operate CI/CD pipelines using Jenkins or Git Hub Actions with integrated security and compliance checks
- Enable automated rollbacks, drift detection, and environment consistency
Development, Scripting & Automatio
n- Develop automation, tooling, and platform services using Python or Golang
- Write Shell scripts for infrastructure automation, cluster operations, and tooling integration
Linux, Operations & Incident Managemen
t- Perform advanced Linux administration and troubleshooting across compute, networking, and storage layers
- Lead and participate in production incident management, including triage, mitigation, and root-cause analysis
- Diagnose complex failures across cloud infrastructure, Kubernetes, networking, and CI/CD systems
Observability & Service Mesh (Good to Have
)- Implement monitoring, logging, and alerting using Prometheus/Grafana, Alertmanager, Loki/ELK, Datadog, or Cloud Watch
- Configure and operate Istio for traffic management, m TLS, retries, timeouts, and secure service-to-service communication
- Understand Envoy proxy behavior within Kubernetes workloads
.
Similar jobs

Devops with python (kubernetes expert)

Apply Now
Back to search page