Design, develop, and maintain robust ETL pipelines that ingest, transform, and store network telemetry and operational metrics at petabyte scale.
Collaborate with network engineers, data scientists, and software teams to define data models, schemas, and dashboards that drive incident response and performance optimization.
Implement and optimize data processing workflows using AWS services (Glue, Redshift, Athena, S3) and Spark/Databricks for high‑throughput analytics.
Ensure data quality, lineage, and security compliance through automated testing, monitoring, and documentation.
Mentor junior engineers and contribute to best‑practice guidelines for data engineering within the network operations domain.
Requirements
5+ years of experience in data engineering, with a strong background in Python, SQL, and AWS data services.
Proven track record building scalable ETL pipelines and data models for large‑scale operational data.
Experience with Spark/Databricks, Redshift, and Athena for batch and real‑time analytics.
Solid understanding of network operations concepts and the ability to translate telemetry into actionable insights.
Excellent communication skills and a collaborative mindset in a fast‑paced, cross‑functional environment.