About Xebia
Desplácese hacia abajo para encontrar los detalles completos de la oferta de trabajo, incluyendo la experiencia requerida y las funciones y tareas asociadas.
For more than 25 years, our global network of passionate technologists and pioneering craftspeople has delivered cutting-edge technology and game-changing consulting to companies on the brink of AI driven digital transformation. Since 2001, we have grown into a full service digital consulting company with 6000+ professionals working on a worldwide ambition. Driven by the desire to make a difference, we keep innovating. Fuelling the growth of our company with our knowledge worker culture. When teaming up with Xebia, expect in-depth expertise based on an authentic, value-led, and high quality way of working that inspires all we do.
At Xebia, we put ‘People First’—committed to attracting diverse talent and fostering an inclusive, respectful workplace where everyone is valued for their contributions. We welcome all individuals and evaluate solely on the quality of their work and teamwork.
Job Description — Platform / DevOps Engineer (Event-Driven Platforms)
Role Overview
We are looking for a Platform or DevOps Engineer with strong experience in event-driven architectures, and cloud-native platforms.
This role focuses on enabling resilient, observable, and continuously deployable event-driven systems operating at scale. The successful candidate will help deliver core platform capabilities including end-to-end distributed tracing, experimentation frameworks, workflow orchestration, and operational tooling across multiple environments.
The ideal candidate will bring practical experience operating event-driven systems in production environments, with particular emphasis on observability, deployment strategies, and platform reliability within GCP-based ecosystems.
Key Responsibilities
Design, build, and maintain DevOps and platform engineering capabilities within an event-driven, n8n-based architecture
Implement end-to-end traceability across distributed workflows, tracking requests from client entry points through workflow execution and downstream services
Deliver distributed tracing and observability capabilities using OpenTelemetry (OTel)
Support multi-environment observability, monitoring, logging, and operational diagnostics
Implement experimentation and continuous learning capabilities including:
A/B testing
canary deployments
progressive rollout strategies
Enable execution and orchestration of multiple workflows in parallel
Develop and maintain deployment definitions, release automation, and operational tooling
Generate and publish SDKs from Avro schemas across multiple programming languages
Implement resilience patterns within shared libraries and platform components, including:
retries
circuit breakers
flow control
Collaborate with engineering teams to establish platform standards, reusable tooling, and operational best practices
Provide ongoing platform support, maintenance, troubleshooting, and continuous improvement
Required Skills & Experience
Strong experience in Platform Engineering and DevOps within cloud-native environments
Hands-on experience with event-driven architectures and distributed systems at scale
Minimum 2 years of experience with GCP, particularly Pub/Sub-based systems
Strong xkdbapo understanding of distributed tracing and observability concepts
Experience designing resilient distributed systems and fault-tolerant integration patterns
Familiarity with canary deployments, progressive delivery, and experimentation frameworks
Experience with CI/CD automation and deployment orchestration
Understanding of workflow orchestration and asynchronous processing patterns
Strong troubleshooting, operational support, and production engineering skills
Experience with n8n workflow automation platforms
Nice to have
Experience working with Avro schemas and schema registry solutions
Familiarity with shared platform libraries and internal developer platforms
Experience supporting high-throughput messaging or event streaming systems
Exposure to SRE practices and platform reliability engineering
Experience building multi-language developer tooling and SDK publishing pipelines
Experience with SDK generation or schema-driven development approaches
Experience implementing OpenTelemetry (OTel) instrumentation and tracing

Platform / DevOps Engineer

Apply Now
Back to search page