Objectives of this Role
Run the production environment by monitoring availability and taking a holistic view of system health
Build software and systems to manage platform infrastructure and applications
Improve reliability, quality, and time-to-market of our suite of software solutions
Measure and optimize system performance, with an eye towards pushing our capabilities forward, getting ahead of customer needs, and innovating to continually improve
Provide primary operational support and engineering for multiple large, distributed software applications
Gather and analyze metrics from both operating systems and applications to assist in performance tuning and fault finding
Partner with development teams to improve services through rigorous testing and release procedures
Participate in system design consulting and capacity planning
Create sustainable systems and services through automation
Balance feature development speed and reliability with well-defined service level objectives
Required Skills and Qualifications
Bachelor's Degree in Computer Science, CIS, or related field (or equivalent work experience in a related field)
3+ years of experience as an SRE, devops engineer, operations engineer or similar.
Experience in cloud technologies such as architecting, developing or maintaining cloud solutions in public cloud environment (Google Cloud).
Should have hands-on experience with spinnaker, Istio, Kubernetes, Docker, Zookeeper on GCP.
Experience deploying applications in SaaS, IaaS and PaaS cloud environments
Experience with newer NOSQL and key-value based systems like Couchbase ,Cassandra, Neo4j, etc.
Experience working in micro services Architecture.
Experience in defining define service level indicators (SLIs), objectives(SLOs), and agreements (SLAs) for micro-services
Excellent knowledge and familiarity with the configuration and usage of Apache and other open source systems software
Application tuning knowledge, capacity concepts, benchmarking, trending, and monitoring.
Scripting experience in mainstream languages - JAVA, shell and python.
Familiar with using Git, Jenkins, and similar CI/CD tools.
Understanding of platform level concerns, such as configuration management, network request routing, blue/green or canary deployments
Broad knowledge of application servers and web servers, networks, firewalls, switches, load balancers.
Previous success in technical engineering
Coding experience beyond simple scripts