The work:
join us in creating a vibrant environment where innovation thrives
you will be a subject matter expert, collaborating with various teams to contribute to key decisions and provide solutions to challenges that arise. Your expertise in server operating systems will be invaluable as you engage with multiple teams and help shape the future of our operations. We are excited to see how your contributions will make a difference in our organization
* ensure the availability and optimal performance of production systems.
* drive the resolution of incidents and outages while maintaining clear communication.
* facilitate the restoration of services in the production environment.
* establish and maintain disaster recovery procedures.
* implement and uphold data retention practices.
here's what you will need:
core technical skills
* operating systems: windows server, linux redhat (administration, processes, patching).
* virtualization: vmware.
* networking: tcp/ip, vlans, dns, dhcp, firewalls, load balancing.
* storage & backup: san/nas, raid, snapshots, replication.
* monitoring: dynatrace.
* scripting & automation: powershell, bash, python.
* security: os hardening, active directory, kerberos, pki.
cloud skills
* virtual machines in azure.
* cloud security (secure images, baselines).
* patch automation with cloud-native tools.
* backup & archival with cloud solutions.
* object and block storage (azure files, blob storage).
* virtual networks, vpn, expressroute.
* finops: vm right-sizing.
* high availability: failover, snapshots, replication.
* cloud-native logging and monitoring.
devops for infrastructure
* infrastructure as code (iac): terraform.
* configuration management: puppet.
* ci/cd: gitops azure.
* containerization: docker, kubernetes, openshift.
soft skills
* collaboration with developers, dbas, and cloud architects.
* documentation: runbooks, architecture diagrams, terraform docs