Req id: 375270
position: site reliability engineer
location
guadalajara, jalisco (mx-jal), mexico
responsibilities
* perform l1.5 activities such as monitoring, deployment, and rollback.
* monitor azure cloud system performance to prevent outages and initiate an incident management bridge when needed.
* troubleshoot azure resources and, when necessary, lift tickets to level3 (software development team).
* participate in 24/7 operations; shift rotation may occur every 1–2 months.
qualifications
* understanding of microsoft azure cloud – ideally azure fundamentals certified or a degree in computer science/information systems management.
* experience with paas and iaas components such as vms, storage, eventhub, service fabric cluster, azure kubernetes service, cosmosdb, sqlserver, iothub, databricks, keyvault, and data lake.
* knowledge of iot concepts, including telemetry, ingestion, processing, data storage, and reporting.
* familiarity with tools such as octopus, bamboo, terraform, azuredevops, jenkins, github, and ansible.
* experience with container orchestration platforms, e.g., kubernetes.
* proficiency with scripting languages: powershell and python.
* understanding of nosql vs. Sql databases and their maintenance.
* experience with monitoring and logging systems such as log analytics, splunk, elk, prometheus, nagios, and zabbix.
* independent thinker who proactively identifies and addresses issues.
work hours
24/7 operations it support team; rotations may occur every 1–2 months.
equal employment opportunity
nttdata is an equal opportunity employer. Qualified applicants will receive consideration for employment without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, disability or protected veteran status.
#j-18808-ljbffr