Engineering Manager - PSRE
On-site
Full Time
#Technology
#SRE
#Cloud Computing
#Kubernetes
#Agile
#Incident Response
#Observability
#Automation
#Engineering
#Disaster Recovery
#Problem Solving
Arcesium is a global financial technology firm dedicated to solving intricate, data-driven problems for the world's most sophisticated financial institutions. We are currently in an exciting phase of growth, leveraging our established market presence to pursue strategic opportunities and drive innovation. We believe in fostering a culture of intellectual curiosity and proactive ownership, where you are empowered to make a meaningful impact from your first day. We are currently looking for an experienced Engineering Manager to join our team in India on a full-time, on-site basis to lead our Site Reliability Engineering efforts.
Key outcomes
- Direct and mentor a team of engineers, providing technical guidance, performance management, and career development support.
- Oversee the design, implementation, and execution of our SRE processes and practices.
- Collaborate with cross-functional engineering teams to ensure our systems remain scalable, secure, and highly reliable.
- Act as the primary representative for the SRE team when engaging with internal stakeholders.
- Manage operational workflows, including the oversight of on-call rotations to ensure 24-hour system coverage.
- Handle daily support tasks, such as monitoring dashboards, responding to outages, and triaging escalated cases.
- Drive efficiency by identifying opportunities to automate manual processes and service catalog items.
- Maintain operational excellence by tracking technical KPIs and proactively remediating stability issues.
Requirements
- At least 10 years of professional experience within SRE or a closely related technical discipline.
- A deep understanding of SRE principles and modern operational practices.
- Proven experience in managing and mentoring engineering teams.
- Strong proficiency with cloud computing platforms, specifically AWS.
- Hands-on experience working with Kubernetes.
- Technical expertise in observability tools and incident management.
- Experience working within Agile environments, particularly Scrum.
- Excellent problem-solving, analytical, and troubleshooting capabilities.
- Strong communication and presentation skills in English.
Preferred qualifications
- Exposure to Chaos Engineering and reliability frameworks, including disaster recovery planning.
How to apply
If you are a driven leader with a passion for reliability and automation, we would love to hear from you. Please submit your application through our official portal to begin the process. We are committed to building a diverse team and encourage qualified individuals from all backgrounds to apply.
Arcesium LLC
4 views





