We are seeking an accomplished *chief ai platform engineer* to lead the development and management of advanced systems that empower stream-aligned teams to deliver secure, scalable, and high-performing ai-driven solutions.
in this role, you will establish strategic direction for the platform, shape cross-functional collaboration, and drive production-grade deployment of machine learning models at scale. Additionally, you will spearhead innovation by evaluating and integrating emerging technologies.
*responsibilities*
- define and execute the architectural strategy for scalable backend systems leveraging mlflow, kubernetes, databricks, and docker
- lead the development of high-performance apis tailored for advanced data processing and seamless machine learning model integration
- set vision for large-scale operationalization and deployment of machine learning models, leveraging mlflow frameworks and best practices
- establish organizational strategies to deliver scalable, robust, and reusable cloud-based infrastructures
- drive the creation of sophisticated automated cloud configuration workflows optimized for large-scale environments
- oversee cross-domain optimizations to enhance systems, processes, and tools for advanced functionality
- provide authoritative guidance on cloud architecture in areas such as automation, orchestration, security, resilience, and operability
- ensure continuous alignment of cloud platform initiatives with long-term business priorities and evolving technological demands
- guide executive-level discussions and build consensus among diverse technical and business stakeholders for impactful delivery outcomes
- cultivate organizational talent by mentoring mid-level engineers and data scientists in leveraging complex frameworks and tools
- lead evaluations and integration of cutting-edge products, services, and technologies for enhanced platform capability
- encourage collaboration and inspire innovation by setting the tone in ceremonies, strategic discussions, and retrospectives
- act as the highest escalation authority during major incident responses, influencing long-term improvements to system reliability
- shape mitigation strategies for risks and navigate technical complexities while driving communication of outcomes across stakeholders
- maintain keen foresight on technological trends to anticipate industry shifts and inform critical strategy development
*requirements*:
- bachelor’s degree in computer science, software engineering, or a related field; a master’s degree is preferred
- 7+ years of experience architecting and managing highly available, scalable infrastructure, solutions, and services in complex environments
- at least 2 years of experience in a technical leadership role directing engineering or platform teams
- proven mastery of leading cloud (iaas, paas, saas) services and solutions across major providers
- expert-level proficiency in programming languages such as python and modern jvm languages including java, scala, or kotlin
- deep expertise in all stages of machine learning workflows, including advanced algorithms, frameworks like tensorflow, pytorch, or scikit-learn, and production-scale deployment
- extensive experience in distributed data processing frameworks such as apache spark, with strong proficiency in delta lake and parquet formats
- expert knowledge of agile methodologies and enterprise-level ci/cd processes using tools like gitlab, terraform, or similar platforms
- established track record of architecting and managing machine learning models at scale within mission-critical production environments
- advanced competency in data structures, algorithmic problem-solving, and principles of enterprise-scale system design
- exceptional proficiency in notebook-based workflows using tools like jupyter or databricks for large-scale data experimentation
- aws certified solutions architect professional or comparable elite-level cloud architecture certifications
- excellent analytical and strategic communication skills with the ability to influence diverse audiences, from technical talent to senior executives
- fluent english proficiency at a minimum of c1 level to drive collaboration across global, multidisciplinary teams
*nice to have*
- proven hands-on experience across the extended aws ecosystem, including aws s3, ec2, rds, emr, redshift, glue, sagemaker, lambda, dynamodb, and cloudwatch
- expert familiarity with apache parquet and data lake transformation workflows utilizing next-generation solutions
*we offer*
- career plan and real growth opportunities
- unlimited access to linkedin learning solutions
- constant training, mentoring, online corporate courses, elearning and more
- english classes with a certified teacher
- support for employee’s initiatives (algorithms club, toastmasters, agile club and more)
- enjoyable working environment (gaming room, napping area, amenities, events, sport teams and more)
- flexible work schedule and d