Skill Set
AWS cloud infrastructure (compute,storage,networking,security) Terraform (Infrastructure as Code) Serverless architectures (e.g.,AWS Lambda) Container technologies (Docker,Kubernetes) Linux system administration DevOps & CI/CD tools (GitLab,Airflow,or similar) Observability (monitoring,logging,alerting,SLOs) AWS networking (VPCs,subnets,security groups,load balancers,DNS) IAM & cloud security (least-privilege access,best practices) Relational databases (MySQL,PostgreSQL,MariaDB) Large-scale data handling & infrastructure impact AI/ML fundamentals & familiarity with AI-assisted tools
Skill to Evaluate
AWS cloud architecture & operations Terraform (Infrastructure as Code) Containers & orchestration (Docker,Kubernetes) CI/CD & DevOps practices Observability & SRE (monitoring,logging,SLOs) AWS networking & IAM security
Job Description
Basic Qualifications our Role:
Manage AWS cloud environments, including compute, networking, data, and security
Design and provision infrastructure for analytical and ML workloads using Terraform
Define and enforce infrastructure standards, patterns, and best practices across the team
Troubleshoot production issues and drive incident resolution
Monitor system health, define SLOs, and maintain observability across services
Manage cloud costs - right-size resources, identify waste, and implement cost-optimization strategies
Design and maintain network architecture including VPCs, subnets, and security groups
Partner with developers to deploy and operationalize custom analytic applications
Partner with Decision Science team to design and deploy data science advancements into end-to-end solutions
Partner with Data Engineering team to establish scalable, efficient, and automated data processes
Job Requirements:
Minimum of 5 years of experience in cloud infrastructure and systems engineering
Strong experience managing AWS environments in production
Experience with Infrastructure as Code practices and tooling (Terraform required)
Experience with serverless architectures and container technologies (Docker, Kubernetes)
Strong Linux administration skills
Proficiency with DevOps and CI/CD tooling (GitLab, Airflow, or similar)
Experience with observability - monitoring, logging, alerting, and SLO-based reliability practices
Understanding of AWS networking - VPCs, subnets, security groups, load balancers, and DNS
Experience with IAM design, security best practices, and least-privilege access patterns
Familiarity with AI/ML concepts and comfort using AI-assisted tools in day-to-day workflows (e.g., code generation, troubleshooting, documentation)
Experience with relational databases such as MySQL, MariaDB, or PostgreSQL
Experience working with complex and large volumes of data and the impacts on infrastructure
Strong written and verbal communication skills with the ability to work cross-functionally with technical and non-technical teams Preferred Qualifications
How To Stand Out:
Deep Terraform expertise - module development, state management, and CI-driven plan/apply workflows
Experience with AWS AI/ML services (Bedrock, SageMaker, or similar)
AWS Certified Solutions Architect (Associate or Professional)
Experience with cloud cost optimization and FinOps practices
Experience with observability platforms (Datadog, New Relic, or CloudWatch)
Experience with Snowflake cloud data warehouse
Experience with non-relational, NoSQL, and semi-structured databases
Experience creating application and network architecture diagrams
Understanding of data science concepts and experience supporting ML workloads
Experience with Azure or GCP in addition to AWS (multi-cloud awareness)
Experience with entertainment, media, or related industries
Required Education BA/BS Degree Comp Sci/IS or related field