Tata consultancy services is an equal opportunity employer, our commitment to diversity & inclusion drives our efforts to provide equal opportunity to all candidates who meet our required knowledge & competency needs, irrespective of any socio-economic background, race, color, national origin, religion, sex, gender identity/expression, age, marital status, disability, sexual orientation or any others. We encourage anyone interested to build a career in tcs to participate in our recruitment & selection process.
tcs is seeking skilled professionals to join our team as sre.
technical/functional skills:
observability & monitoring
* develop proactive alerting and dashboarding strategies to detect and resolve issues before they impact customers and store operations
* define and manage service level objectives (slos), service level agreements (slas), and error budgets for critical store applications
* lead critical incident recovery and postmortem processes to drive continuous improvement
performance & reliability engineering
* identify and eliminate bottlenecks in development and deployment workflows to improve lead time and reduce change failure rates
* partner with development teams to embed site reliability engineering (sre) principles into the software development lifecycle
* support and optimize applications deployed across cvs retail and pharmacy locations
* collaborate with infrastructure and store operations teams to ensure high availability and performance of store systems
microservices & deployments
* champion containerization and orchestration using openshift and kubernetes in hybrid cloud environments
* leverage ci/cd pipelines to enable automated deployments at scale
* understanding of microservices architecture
minimum qualifications:
* 5+ years of experience in sre, devops, or related technology roles
* 3+ years of experience in delivering software in a large-scale environment with reliability and resilience concepts (multi-region, multi-cloud, containerization, etc.)
* 2+ years of experience with programming languages/frameworks
* 2+ years of experience on cloud technologies (aws, microsoft azure, google cloud), microservices concepts, and capabilities like rancher, docker, kubernetes, and web api's
* 2+ years of experience with source control and continuous integration tools like github, bitbucket, or jenkins
* experience with observability and monitoring tools such as splunk, dynatrace, datadog, prometheus, grafana, etc.
* proficiency in scripting and automation frameworks
* understanding of microservices architecture and cloud-native technologies
* experience in incident management, change management, infrastructure support, and problem management concepts and processes
* excellent interpersonal and communication skills, including the ability to engage technical and non-technical stakeholder
*work modality: hybrid*
candidate must be located in or willing to relocate to querétaro, cdmx, monterrey, guadalajara, it will be requested to attend office at least 3 days per week.
boost your career and send your resume to: alejandra.galicia@tcs.com