Site Reliability Engineer (DevOps)

33 days ago: Thomson Reuters

Job Description

Site Reliability Engineers (SREs) are Software & Systems Engineers who are responsible for ensuring the global availability of our mission-critical customer-facing products and services.

We place a high degree of trust and responsibility in the hands of our SREs and expect them to follow DevOps principles and practices. While it is likely that no SRE will possess all of the Software Engineering & Systems Engineering skills on this page, we seek candidates who have the required core skills, will master some, and are willing to learn the vast majority.

We ensure every individual receives ample coaching to improve their skills and to advance their careers within Technology Operations, in Technology Development, or more broadly across the organization.

Will you join us?  

The SRE Role

  • SREs are engineers with the right mix of knowledge and skills in software engineering (i.e. programming, data structures and algorithms) and systems engineering (i.e. applying scientific principles of experimentation and observation to entire systems to improve reliability, performance and operability).
  • We constantly evaluate products and services before and after production releases to prevent, identify and fix problems that impact service availability in deploying, configuring, monitoring, recovering, and scaling.
  • We work closely with development groups in a collaborative DevOps environment to maintain the highest level of service uptime.
  • We participate in on-call rotations to monitor and support our products and services, taking recovery actions prior to and after disruptions.
  • We dedicate at least 50% of our time to 'engineering away' problems, both directly and through pairing with and coaching our team.
  • We work side-by-side with SREs in our team applying software engineering principles to resolve problems impacting service uptime or our operational efficiency.

Our SRE Culture

  • To accomplish our mission and continue to build our internal DevOps culture, we embrace and are strong advocates of the CALMS framework.
  • We seek to eliminate manual and repetitive operations tasks at every opportunity by exploiting open source tools, contributing to open source projects and building new tools when required.
  • We value technical aptitude, innovative thinking and a great learning ability above proficiency with a specific toolset.

Core Software & Systems Engineering Skills 

To be successful as an SRE, an individual must have the required core skills listed below, and/or be willing to learn quickly.

Required Core Skills for all SREs

  • Programming in at least one language such as Java, C#, JavaScript, Python or Ruby. Experience with other languages such as shell scripting, PowerShell, Perl or PHP is also valuable.
  • Systems configuration and administration: Windows or Linux.
  • Analyzing and discovering how all components of a distributed system work together using a broad range of skills and tools.

Possess or will learn quickly

  • Applying an evidence-based approach to solving system problems under pressure and in real time to provide the fastest path to service recovery.
  • System and software configuration management using tools such as Puppet, Chef or Ansible.
  • Cloud technologies and platforms such as AWS or Azure using API or configuration tools.

Skills for SRE Specializations (SRE-SWE and SRE-SE)

SREs come from diverse backgrounds such as software development and systems administration, so their experience is often weighted towards software engineering (SRE-SWE) or systems engineering (SRE-SE). We strongly value the breadth and depth of skills and diversity of thinking this brings to our team.

While it is likely that no SRE will possess all of the skills on this page, we seek candidates who have the required core skills, will master some, and are willing to learn the vast majority.

SRE-SWE

  • Object-Oriented design, design patterns and programming following clean coding practices.
  • Agile/lean development practices such as Scrum, XP and agile design.
  • Data structures and algorithms.
  • Software testing frameworks that support TDD and BDD.
  • Automating software build and testing using tools such as Jenkins.
  • Database programming, schema design and query optimization (relational and NoSQL).

SRE-SE

  • Writing code to drive system engineering activity such as system testing, load generation, instrumentation, log analysis, performance monitoring, error simulation and deep discovery of system properties.
  • Conducting investigation across any system component and related systems to discover and rectify performance bottlenecks and sources of unreliability.
  • Applying scientific principles of experimentation and measurement to system components to identify improvements to the configuration and architecture which improve reliability, performance and operability.
  • Network flow analysis and troubleshooting.
  • Selecting, designing and tuning storage systems for reliability and performance.
  • Configuring, analyzing and tuning (relational and NoSQL) database systems to improve reliability and performance.
  • Configuring and tuning web servers, application containers, message queueing systems and other middleware to improve reliability and performance.

The Financial and Risk Business of Thomson Reuters is now Refinitiv. Refinitiv equips the financial community with access to an open platform that uncovers opportunity and catalyzes change. With a dynamic combination of data, insights, technology, and news from Reuters, our customers can access solutions for every challenge, including a breadth of applications, tools, and content—all supported by human expertise. At Refinitiv, we facilitate the connections that propel people and organizations to find new possibilities to move forward.

As a global business, we rely on diversity of culture and thought to deliver on our goals. Therefore we seek talented, qualified employees in all our operations around the world, regardless of race, color, sex/gender, including pregnancy, gender identity and expression, national origin, religion, sexual orientation, disability, age, marital status, citizen status, veteran status, or any other protected classification under country or local law. Refinitiv is proud to be an Equal Employment Opportunity/Affirmative Action Employer providing a drug-free workplace.
 

Intrigued by a challenge as large and fascinating as the world itself? Come join us.


Locations
Nottingham-United Kingdom
