Site Reliability Engineer
Hybrid
Full Time
#Engineering
#Cloud
#Web3
#Loki
#Grafana
#Kubernetes
#Incident Management
#Automation
We are Impossible Cloud, a B2B cloud platform on a mission to become the premier cloud provider in Europe and beyond. Founded by serial entrepreneurs with a track record of building billion-euro tech companies, we are currently bridging the gap between decentralized infrastructure and mainstream B2B cloud use cases. We are looking for a Lead Site Reliability Engineer to join our team on a full-time basis in Germany. In this role, you will help us push the boundaries of distributed systems as we build the next generation of the internet.
Responsibilities
- Define and monitor Service Level Indicators (SLIs) and Service Level Objectives (SLOs), providing regular reports on system availability and error rates.
- Design and maintain self-hosted observability tools to ensure full visibility into logs, metrics, and system performance.
- Lead incident management efforts, including participating in on-call rotations, conducting root cause analysis, and mentoring team members to improve long-term system stability.
Requirements
- Extensive experience with the Loki, Grafana, Tempo, and Mimir stack for log, metric, and trace management.
- Strong technical expertise in Kubernetes and managing services within containerized environments.
- Proven ability to create and follow incident response runbooks while proactively identifying opportunities for automation.
- Excellent analytical and collaboration skills, with a focus on mentoring others and driving scalability across core components.
- Fluency in English is required for this role.
What we offer
- Equity compensation through our ESOP and token participation program.
- A hybrid work environment with monthly collaborative meet-up weeks at our Hamburg headquarters.
- Access to a free gym membership to support your well-being.
- The opportunity to work with a passionate team on cutting-edge Web3 and cloud technologies.