Role: Sr. SRE Engineer
Location: Naperville IL (Hybrid )
Key Skills: – GCP OR Oracle Cloud Infrastructure ( OCI ), CI/CD, Oracle Database, Scripting, infrastructure as code (IaC) tools, such as Terraform or Puppet, Docker and Kubernetes.
Key Responsibilities:
- Design, build, and maintain scalable and reliable infrastructure solutions.
- Implement automation tools and processes to streamline operations and improve efficiency.
- Monitor system performance and troubleshoot issues to ensure high availability and reliability.
- Collaborate with development teams to design and deploy applications in production environments.
- Conduct root cause analysis (RCA) and implement preventive measures to minimize downtime and outages.
- Develop and maintain documentation, runbooks, and playbooks for operational processes.
- Participate in on-call rotations and provide timely response to incidents and emergencies.
- Implement best practices for security, compliance, and disaster recovery.
- Continuously evaluate and improve system performance, reliability, and scalability.
Skills and Qualifications:
- Bachelor's degree in Computer Science, Engineering, or related field.
- Proven experience as a Site Reliability Engineer or similar role.
- Strong knowledge of OCI cloud platforms, Oracle database and must have held SRE role for over 10+ years minimum
- Experience with infrastructure as code (IaC) tools, such as Terraform or Puppet.
- Any scripting and programming languages knowledge - such as Python, Go, or Bash.
- Hands-on experience with monitoring and observability tools, such as NewRelic, Grafana, or Kibana.
- Solid understanding of containerization technologies, such as Docker and Kubernetes.
- Excellent troubleshooting and problem-solving skills.
- Strong communication and collaboration skills.
- Ability to work effectively in a fast-paced and dynamic environment.
Thank you,
Dheeraj T
Astir IT Solutions, Inc
50 Cragwood Rd., Suite 219 | South Plainfield, NJ 07080
Email: