Senior data engineer (python) - remote

Remote Friendly, Full-time, SQL, Hybrid, Flat Files, Architecture, Data Science
This is us
At Avenga, we believe that human creativity empowers technology that matters. Operating globally, our 6000+ specialists provide a full spectrum of services, including business and tech advisory, enterprise solutions, CX, UX and UI design, managed services, product development, and software development.

This is the job
In Mexico (CDMX) within the Data & Analytics industry, we are actively seeking a Senior Data Engineer to strengthen our team dedicated to building and optimizing end-to-end data pipelines under a Medallion architecture. Your mission will be to enable data flow for both traditional analytics (BI/ML) and advanced search architectures. This is a hybrid position; candidates must be based in CDMX.

This is you
Bachelor's degree in Systems Engineering, Computer Engineering, Software Engineering, or related fields.
Experience as a Data Engineer (5+ years), with at least 4 years working in Cloudera/Hadoop environments.
Expert-level proficiency in Spark (PySpark/Scala) for distributed processing.
Solid experience in relational and dimensional data modeling.
Hands-on experience working with Kerberos for users and services.
Proficiency in the Cloudera Ecosystem: Hive, Impala, HBase, HDFS, Kafka, Oozie, and Hue.
Advanced SQL skills (complex joins, window functions) and experience with Change Data Capture (CDC).
Native Spanish speaker.

Nice-to-have skills:
Experience in administration and troubleshooting at the Cloudera Manager level.
Knowledge of Search & Indexing tools: Solr, vector databases, and Apache Iceberg.
Experience with Security & Governance tools: Apache Ranger and Apache Atlas.
Familiarity with Orchestration & DevOps: Apache Airflow and Docker.

This is your role
Build robust multi-source ingestion pipelines from RDBMS, Web Services (APIs), and flat files into HDFS.
Implement Medallion Architecture layers (Bronze, Silver, Gold) for BI and Data Science consumption.
Design data flows for semantic search, integrating data into vector and indexed databases.
Manage complex orchestration workflows and containers.
Configure access controls and data lineage, and ensure security/authentication in Kerberos-protected environments.
Tune Spark and Impala processes to optimize efficiency in distributed processing.

At Avenga, everyone matters. We provide equal opportunities in recruitment, career development, and leadership, regardless of race, ethnicity, gender identity, sexual orientation, disability, age, religion, or any other characteristic. We are committed to fostering a work environment where our diverse community of employees, candidates, and business partners actively shapes our growth. By bringing together people from different backgrounds and experiences, we build a workplace where everyone feels free to be themselves while honoring the boundaries of others.