Responsibilities
- Manages, maintains and monitors all clusters 24/7
- Manages, maintains and monitors the global Lagoon infrastructure 24/7
- Monitors all production sites 24/7
- Reacts to infrastructure alerts
- Reacts to outages reported by customers via the Support Team
- Provides out-of-office-hours emergency phone support for customers
- Continuously improves the amazee.io platform
- Coordinates with external partners (for example AWS, GCP, Azure, Fastly) to ensure stable platforms
- Guarantees that platform and website uptime SLAs are met
- Coordinates with the Lagoon Team on Lagoon features, releases and issues
- Coordinates with and supports the amazee.io Security Team
- Monitors, analyzes and optimizes infrastructure costs, drawing on knowledge from the Business Operations Team and tooling from the Internal IT Ops Team