Join the team building Oracle Cloud Infrastructure's state-of-the-art observability platform, powering visibility and operational intelligence for both OCI's internal cloud services and customers running mission-critical workloads on OCI. OCI Monitoring and Logging serve as foundational platforms used by OCI engineering teams to operate and troubleshoot hundreds of cloud services while also enabling customers to monitor, analyze, and gain insights into their own applications and infrastructure. This unique position offers the opportunity to build observability solutions that operate at massive scale, serving the demanding needs of OCI's own services as well as a global customer base. Our team tackles some of the industry's most challenging distributed systems problems, including high-throughput telemetry ingestion, large-scale data processing, cost-efficient storage, low-latency query execution, multi-tenant reliability, and operational excellence. If you are passionate about building cloud-native observability platforms that power both the cloud itself and the customers who depend on it, we'd love to talk to you.
Lead the design, development, and operation of cloud-scale observability platforms supporting metrics, logs, traces, and related telemetry data.
Architect and implement highly scalable, resilient, and cost-efficient telemetry collection, ingestion, processing, storage, and query systems.
Drive the evolution of end-to-end observability pipelines, from instrumentation and data collection through real-time analytics and long-term retention.
Design and optimize distributed systems capable of ingesting and processing massive volumes of telemetry data with stringent latency and availability requirements.
Develop scalable storage and indexing solutions for high-cardinality metrics, large-scale log analytics, and distributed tracing workloads.
Build and enhance query, search, and retrieval services that deliver fast, reliable, and intuitive access to observability data.
Collaborate with product management, architects, SREs, and engineering teams to define and deliver next-generation observability capabilities.
Identify and resolve performance bottlenecks across the observability stack, including ingestion, storage, indexing, aggregation, and query execution.
Design systems with a strong focus on reliability, fault tolerance, scalability, security, and operational excellence.
Drive technical strategy and architectural decisions for observability services operating in hyperscale cloud environments.
Mentor senior and junior engineers, provide technical leadership, and foster engineering best practices across the organization.
Partner with service teams to improve instrumentation, telemetry quality, and operational visibility across cloud services.
Establish and monitor key service health, scalability, performance, and cost-efficiency metrics for observability platforms.
Lead troubleshooting and root-cause analysis efforts for complex distributed systems and large-scale production environments.
Stay current with emerging trends, technologies, and best practices in observability, distributed systems, data processing, and cloud-native architectures.
Career Level - IC5
Only Oracle brings together the data, infrastructure, applications, and expertise to power everything from industry innovations to life-saving care. And with AI embedded across our products and services, we help customers turn that promise into a better future for all. Discover your potential at a company leading the way in AI and cloud solutions that impact billions of lives.
True innovation starts when everyone is empowered to contribute. That’s why we’re committed to growing a workforce that promotes opportunities for all with competitive benefits that support our people with flexible medical, life insurance, and retirement options. We also encourage employees to give back to their communities through our volunteer programs.
We’re committed to including people with disabilities at all stages of the employment process. If you require accessibility assistance or accommodation for a disability at any point, let us know by emailing [email protected] or by calling 1-888-404-2494 in the United States.
Oracle is an Equal Employment Opportunity Employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, national origin, sexual orientation, gender identity, disability and protected veterans’ status, or any other characteristic protected by law. Oracle will consider for employment qualified applicants with arrest and conviction records pursuant to applicable law.