Unlock your future with ai
we're on a mission to redefine industries with innovative technologies. Our limitless potential of artificial intelligence is transforming businesses, streamlining processes, and driving growth.
about the role
we're actively seeking a senior site reliability engineer focused on our integration layer. You'll be responsible for the reliability, availability, and performance of apis, data pipelines, and services that power communication between key systems.
key responsibilities
* own reliability and performance for our integration platform, including apis, event streams, and third-party connections
* design and implement robust monitoring, alerting, and incident response processes for the integration layer
* collaborate with developers and architects to influence design decisions for reliability, scalability, and observability
* build automation for deployment, scaling, failover, and remediation
* lead root cause analysis and post-mortem processes for integration-related incidents
requirements
* 5+ years of experience in site reliability engineering, devops, or related fields
* strong experience with apis, messaging systems (e.g., kafka, rabbitmq), and integration frameworks
* proficiency in cloud platforms (aws, azure, or gcp), container orchestration (kubernetes), and infrastructure-as-code (terraform, cloudformation)
* hands-on experience with observability tools (e.g., datadog, new relic, prometheus, grafana)
* strong scripting or programming skills (python, go, or similar)
* understanding of ci/cd pipelines and automated deployment best practices
what we offer
* stock options
* remote work options
* flexible working hours
* benefits above the law
* mentorship and opportunities to learn and level up