Join to apply for the Remote Senior AI Cloud Engineer role at Oracle
Get AI-powered advice on this job and more exclusive features.
Direct message the job poster from Oracle
Join the Oracle Applications Labs (OAL) team as a Senior AI Cloud Engineer, where you’ll design, build, and operate scalable, intelligent, and automated cloud solutions across Oracle’s enterprise applications.
This role bridges AI-driven automation with cloud infrastructure engineering, giving you the opportunity to work across Oracle Cloud Infrastructure (OCI), middleware technologies, and AI-powered operations. You’ll collaborate with engineers and product teams to enhance reliability, observability, and performance through smart automation and modern cloud practices.
Qualifications
- A degree (or equivalent experience) in Computer Science, Engineering, Mathematics, Physics, Statistics, or a related field.
- 3+ years of experience in cloud infrastructure, middleware engineering, or DevOps roles.
- Strong hands-on experience with Oracle Middleware (WebLogic, ODI, OTD).
- Skilled in OCI, OKE, Terraform, Ansible, and Python for automation and orchestration.
- Excellent troubleshooting and root cause analysis skills for complex distributed systems.
- Familiar with AI/ML concepts and their application in system automation or observability.
- Knowledge of Oracle Fusion Applications (SaaS) architecture and integration points.
Preferred Qualifications
- Experience integrating AI/ML models or AI services into enterprise operational workflows.
- Familiarity with OCI AI Services, Data Science, or similar cloud AI offerings.
- Experience with observability tools (Prometheus, Grafana, ELK/EFK, OCI Monitoring).
- Understanding of networking, cloud security, and DevOps/MLOps principles.
- Strong analytical, problem-solving, and collaboration skills.
Build and contribute to AI-Powered Applications
- Help design and develop features in AI-driven applications, primarily using Python or similar languages.
- Collaborate with cross-functional teams to deliver scalable and reliable solutions.
- Design, deploy, and manage solutions using Oracle Middleware products - WebLogic, ODI, and OTD.
- Deploy and troubleshoot containerized applications on Oracle Kubernetes Engine (OKE).
- Design and implement architectures using OCI Services such as Load Balancer (LBaaS), API Gateway, OCI Streams, and IDCS.
- Manage OCI networking, ensuring secure, scalable, and high-performance configurations.
- Perform advanced troubleshooting and root cause analysis for complex cloud and middleware issues.
Automation & Infrastructure as Code
- Build and maintain Ansible and Terraform automation for environment provisioning and configuration management.
- Develop Python-based automation scripts to streamline deployments, monitoring, and incident response.
- Create intelligent automation workflows that leverage AI insights to reduce manual interventions and improve operational efficiency.
AI-Powered Operations & Integration
- Integrate AI/ML capabilities into cloud operations - including anomaly detection, predictive monitoring, and automated remediation.
- Collaborate with data and AI teams to deploy lightweight models or AI services that enhance system reliability and decision-making.
- Utilize OCI AI Services, OCI Data Science, or equivalent tools for operational intelligence and optimization use cases.
Learn and Grow in MLOps & LLMs
- Get hands-on experience with tools and practices in MLOps: model deployment, monitoring, and versioning.
- Explore foundational concepts in large language models (LLMs), including prompting techniques and task automation.
- Learn to implement basic agentic workflows using tools like LangGraph or similar frameworks.
Monitoring and Observability
- Design and implement AI-driven observability frameworks combining traditional metrics with intelligent insights.
- Leverage machine learning models for anomaly detection, predictive alerting, and log/event correlation across systems.
- Use OCI AI Services, OCI Data Science, or similar platforms to develop and deploy models that automatically identify performance degradation, error spikes, and capacity risks.
- Integrate Prometheus, Grafana, ELK/EFK, and OCI Monitoring with AI analytics pipelines for real-time insights and trend prediction.
- Implement self-healing mechanisms that use AI recommendations to trigger automated corrective actions.
Understand Responsible AI
- Develop awareness of AI ethics, privacy, fairness, and secure coding practices in real-world AI development.
Why Join Us?
- Start your AI career with real impact — your code and ideas will contribute to solutions used by businesses worldwide.
- Learn from experts — work closely with experienced engineers, data scientists, and product teams.
- Grow your skills — we’ll support your development in AI/ML, MLOps, cloud technologies, and more.
- Work on diverse projects — from conversational interfaces to enterprise-scale integrations.
- Be yourself — we value diversity, inclusion, and creating a safe space for all team members to grow and thrive.
Seniority level
Associate
Employment type
Full-time
Job function
Information Technology
Industries
IT Services and IT Consulting
#J-18808-Ljbffr