Key Responsibilities:
- Diagnose, analyze, and resolve production incidents, ensuring mínimal downtime and data integrity.
- Develop and maintain scripts and tools for monitoring, automation, and reporting.
- Document troubleshooting steps, solutions, and best practices for knowledge sharing.
- Participate in on-call rotation and provide timely responses to critical incidents.
- Communicate effectively with stakeholders, providing updates and post-incident reports.
- Collaborate with development teams to identify root causes and implement long-term solutions.
Required Skills & Qualifications:
- Bachelor’s degree in Computer Science, Information Technology, or related field.
- 3+ years of experience in L3 support or similar roles.
- Strong proficiency in Python programming, experience with FastAPI framework and scripting.
- In-depth knowledge of MongoDB.
- Experience with monitoring tools, log analysis, and incident management.
- Familiarity with ETL processes and data streaming concepts.
- Excellent problem-solving and communication skills.
- Ability to work independently and as part of a team.
Preferred:
- Experience with cloud platforms (AWS).
- Knowledge of key management and encryption best practices.
- Familiarity with CI/CD pipelines and DevOps practices
Advanced english