Overview join to apply for the site reliability engineering level 2 gcp and azure role at ibm. Responsibilities ensure the reliability and uptime of cloud solutions and services, aligned with user needs support pre-launch activities including system design consulting, platform development, capacity planning, and launch reviews monitor and enhance live services by tracking availability, latency, and overall system health scale systems sustainably through automation and drive improvements in reliability and delivery velocity assess and optimize existing infrastructure within geoscience workflows collaborate with network and security teams to ensure secure and reliable application operations develop and document best practices for new projects and services leverage service management systems to share lessons learned and best practices across the technical community participate in incident response and conduct blameless postmortems required technical and professional expertise proficiency in gcp and microsoft azure experience with observability tools such as grafana, prometheus, thanos, loki knowledge of google stackdriver/azure monitoring azure ci/cd pipeline expertise strong scripting and automation capabilities familiarity with microservice architecture experience with azure/gcp postgresql experience with cloud storage such as azure and google storage solutions experience of container registries very good english communication level at least b2 seniority level mid-senior level employment type full-time job function engineering and information technology industries it services and it consulting j-18808-ljbffr