Join to apply for the Production Service Support Engineer (Incident Management) - Pacific Time Zone role at Groupon
Groupon is a marketplace where customers discover new experiences and services every day, helping local businesses thrive. We have worked with over a million merchant partners worldwide, connecting over 16 million customers with deals across various categories. We stand out as a platform committed to helping local businesses succeed on a performance basis.
Groupon is on a journey to transform our business with a relentless pursuit of results. With thousands of employees across multiple continents, we maintain a culture that inspires innovation, rewards risk-taking, and celebrates success. Our scale allows for immediate impact, and our culture fosters autonomy and meaningful contributions.
We are looking for a Site Reliability Engineer (Incident Management) to support and optimize internal systems that span business and engineering departments.
* Docker and Kubernetes
* Pingdom, Opsgenie, Kibana, Wavefront/Grafana monitoring tools
* GitHub and Jira
* Java, Ruby, Node.js, Next.js
* MySQL and PostgreSQL databases
* Redis and Memcached
Role details:
* Utilize site reliability engineering best practices and the ITIL framework to develop incident management strategies.
* Serve as incident commander, change manager, and senior technical resource responsible for preventing, identifying, triaging, documenting, investigating, mitigating, and recovering from incidents across Groupon’s services.
* Facilitate post-mortems and oversee problem management.
* Participate in engaging projects.
* Work as part of the Incident Management team (Monday-Friday shifts, with one weekend as primary on-call every 6 weeks).
Qualifications:
* 4+ years of experience with Linux system administration and root cause analysis.
* 4+ years of experience with web application operations and incident analysis.
* 4+ years of experience creating Splunk or Kibana queries to resolve incidents and outages, and owning all impacting events through resolution, including coordination, documentation, and post-mortems.
* 4+ years of experience developing policies and procedures to improve stability.
* Strong communication, consulting, and collaboration skills.
* Experience with one or more programming languages (Python, Ruby, Java).
* Preferred: a degree in computer science or a related field.
* Preferred: experience designing tools for site and service management.
Desired traits:
* Customer-focused
* Team player
* Fast learner
* Pragmatic
* Ownership mindset
Groupon is an AI-first company: we encourage leveraging AI tools during the hiring process and value candidates interested in AI and technology-driven solutions.
Groupon’s purpose is to build strong communities through thriving small businesses. To learn more, visit our website and recent news, and explore our DEI initiatives. If this role fits your interests, click apply and join us in our mission to become the top destination for local experiences and services.
Seniority level
* Mid-Senior level
Employment type
* Full-time
Job function
* Information Technology
Industries
* Software Development