Job Overview
Oracle Health is expanding its OCI Operations team and is hiring a Site Reliability Developer 3 (IC3). This role supports the EHR and Clinical AI Agent services, which power next-generation healthcare automation using advanced AI, analytics, and cloud technologies.
You’ll work on a large-scale, self-healing, cloud-native platform built with Kubernetes, Docker, Prometheus, Grafana, and other modern tools. The role focuses on reliability, scalability, security, automation, and solving complex system challenges.
Visa/work permit sponsorship is not available for this position.
Key Responsibilities
Service Ownership
- Take ownership of operational aspects of OCI services under the EHR/Clinical Agent portfolio.
- Understand end-to-end system configuration, dependencies, and performance behavior.
- Maintain service availability, reliability, and performance.
- Participate in LiveSite operations and mitigate issues quickly.
Service Design
- Design solutions to deploy software and security updates with zero downtime.
- Collaborate with product and development teams to build platform automation.
- Analyze system failures and define rapid response processes.
Operations Engineering
- Evaluate cloud deployments across commercial and government datacenters.
- Monitor degradation, performance, and scaling needs under load.
- Resolve security vulnerabilities in line with corporate and government standards.
Automation
- Identify and automate SRE procedures to minimize human error.
- Build tools, utilities, and automation workflows for production environments.
Technical Expertise
- Troubleshoot complex production issues using deep technical knowledge.
- Work with AI-driven systems such as Clinical Digital Assistant services.
- Act as SME during major incidents and help implement preventive measures.
Minimum Requirements
- 5+ years as a Site Reliability Engineer or similar role.
- BS/MS in IT, Computer Engineering, or equivalent.
- Strong troubleshooting and problem-solving skills for distributed systems.
- Experience with production operations and safe deployment practices.
- Experience with public cloud platforms (OCI, AWS, GCP, Azure).
- Hands-on experience with Python, Perl, and/or Shell scripting.
- Strong knowledge of Terraform or Shepherd (IaC tools).
- Experience with Kubernetes and cloud-native monitoring (Docker, Helm, Prometheus, Grafana, ELK/EFK, Jaeger).
- Experience with Git and Linux/Unix environments.
Role Details
- Position: Site Reliability Developer 3
- Career Level: IC3
- Job Category: Product Development
- Experience: 5+ years
- Language: English
About Oracle
Oracle is a global leader in cloud technology with a strong commitment to innovation and diversity. The company offers competitive benefits, growth opportunities, and a supportive work environment.
Disclaimer
This job information is sourced from the official Oracle careers website and shared for informational purposes only. Always verify details and apply directly through the official career page. No hiring guarantees are provided.