The SRE role at Acxiom LLC is an agile, development-oriented, cloud-first role focused on delivering quality solutions, including infrastructure as code and performance efficiencies. The role encompasses process automation and level-3 support: building, maintaining, and supporting code through continuous integration / continuous delivery systems with robust version control and tracking. Your assignments may include delivering, operating, and/or maintaining solutions in support of the company’s client-facing products and solutions as well as the internal business processes associated with core information technology tools and services.
You’ll be expected to have a holistic understanding of customer requirements, project deliverables, IT infrastructure, and the implementation of various development, testing, and automation tools. You’ll establish service level indicators (SLIs) that allow for setting service level objectives (SLOs) and the resulting service level agreements (SLAs).
Location: Lomas Atlas, Mexico
What you will do:
- work closely with other engineers to guide design and reliability discussions to maintain fault-tolerant systems
- keep a watchful eye on our various internal and external monitoring tools and systems to track SLIs along with production capacity and performance
- automate, automate and automate
- participate in capacity planning/analysis and performance analysis activities
- identify workload management improvements to increase cost efficiencies
- serve as a key contributor to enabling our continuous delivery model
- combine software development / project delivery and cloud operations practices to shorten the system / project / service delivery life cycle
- enable multi-tenant, multi-datacenter, and multi-cloud environments for the company's complex, high-traffic, business-critical internet site communications and/or network-based (cloud) product systems
- develop installation scripts, unit tests, and programs for the installation of operating systems and products
- mentor other associates to improve their sre skills
- serve as the ultimate SME for one or more services: you will either know how to fix it, have a vendor contact who can fix it, or be able to tap someone at Acxiom with specialized knowledge
- identify security exposures and recommend corrective action using specialized domain knowledge and developed business expertise
- write run books and standard operating procedures to allow consistent operations
What you will need:
- minimum of 5 years of related experience with a bachelor's degree
- at least one active public cloud (including VMware) certification above foundational / fundamental / practitioner
- 3+ years of experience with Linux/Unix/BSD and/or Windows and Microsoft services
- practical knowledge of command-line scripting and at least one of the following scripting languages - Python, Ruby, Perl, PowerShell, or Ansible
- capable of prioritizing efforts without constant management involvement
- track record of practical problem-solving and excellent communication skills
- excellent analytical abilities, coupled with a strong sense of ownership, urgency, and drive
- software automation experience
- familiarity with system management tools - e.g., Puppet, Chef, or Ansible
- ability to handle periodic on-call duty
- experience mitigating security risks found in complex technology projects
What will set you apart:
- experience with end-to-end solutions, including code, hardware, and networks
- passion for open source communities and philosophies
- software engineering background with experience in Java, .NET
- experience with Hadoop, Aerospike, Vertica
- Amazon Web Services, Google Cloud, or Azure experience
Primary locations:
Mexico City