Job Description
The Role
As a Site Reliability Engineer (SRE) at Air Apps, you will be responsible for ensuring the reliability, availability, and scalability of our systems. You will work at the intersection of software development and operations, implementing automation, monitoring, and performance optimization strategies to minimize downtime and improve system resilience.
This is a fully onsite position, based at our office in Lisbon, where you will collaborate closely with cross‑functional teams in person and contribute to a dynamic and fast‑paced environment. We are open to providing relocation support.
Responsibilities
Design and implement scalable, reliable, and fault‑tolerant systems across cloud environments.
Develop and maintain observability tools, including monitoring, logging, and alerting (e.g., Prometheus, Grafana, Datadog, ELK).
Automate infrastructure ...