Oracle is hiring a Principal Site Reliability Developer (SRE) for its Oracle Cloud Infrastructure (OCI) team in India. This is a senior-level role focused on large-scale distributed systems, cloud infrastructure reliability, automation, incident management, and high availability.
If you have deep expertise in Linux, Python/Java, cloud computing, and SRE practices, this is a high-impact opportunity to work on mission-critical OCI Compute services.
πΉ Job Overview
- Company: Oracle
- Job Title: Principal Site Reliability Developer
- Job ID: 317375
- Location: India
- Job Type: Full-Time | Regular Employee
- Experience: 6 to 10+ Years
- Career Level: IC4
- Job Category: Product Development
- Posted Date: 26 November 2025
- Language: English
- Visa Sponsorship: Not Available
πΉ Job Description
As a Principal Site Reliability Developer, you will design, build, and operate highly available, scalable, and secure cloud infrastructure services for Oracle Cloud Infrastructure. The role blends software engineering and systems engineering to automate operations, prevent incident recurrence, and improve overall service reliability.
You will work closely with OCI Compute and SRE teams, owning services end-to-end and acting as the final escalation point for complex production issues.
πΉ Key Responsibilities
- Own end-to-end reliability, performance, and operability of OCI services
- Design and build automation to eliminate manual processes
- Collaborate with SRE and development teams on full-stack service ownership
- Act as escalation point for critical and complex production incidents
- Maintain high availability and zero-downtime deployments
- Perform incident response, root cause analysis (RCA), and change management
- Drive capacity planning, demand forecasting, and performance tuning
- Ensure strong production security posture and monitoring
- Build and maintain deployment tools, SOPs, and documentation
- Participate in on-call rotations, including shift-based support
- Improve service architecture with focus on scale, resiliency, and security
πΉ Required Qualifications
- Bachelorβs degree in Computer Science, Engineering, or related field
- 6+ years operating large-scale, highly available distributed systems
- 5+ years of hands-on Linux system engineering
- 4+ years experience in Python or Java for infrastructure automation
- Strong knowledge of cloud computing, networking, and load balancers
- Experience with CI/CD pipelines and cloud platforms
- Hands-on experience with monitoring & instrumentation tools
(Prometheus, Grafana, etc.) - Excellent troubleshooting, incident management, and RCA skills
πΉ Required Skills
- Cloud Computing
- Linux Administration
- Python Programming
- Java Programming
- Incident Management
- Root Cause Analysis
π Apply Now
π Official Oracle Job Link: click here
Disclaimer
This job post is shared for informational purposes only. We are not affiliated with Oracle Corporation. Job details are sourced from the official Oracle Careers portal and may change at any time. Candidates must apply directly through the official Oracle website. No recruitment or application fees are charged.