Job Title: Site Reliability Engineer
Location: San Diego, CA 92127, United States
Payrate: $70/Hr - $98.55/Hr
Shift: 8:00am to 5:00pm
It is an exciting time to be part of SIE’s CI/CD and Cloud Site Reliability Engineering (SRE) team. SREs operate right at the intersection of Software Engineering and Infrastructure Engineering. The SRE team strives to make PlayStation highly reliable, scalable, operable, and secure throughout the entire platform.
RESPONSIBILITIES:
- Contributes to a team of Engineers to deliver and support highly available, self-service CI/CD capabilities.
- Showcases uncompromising ownership of outcomes and deliverables
- Adheres to software development best practices
- Serves as a role model for customer-focused delivery to both internal and external customers
- AWS Experience REQUIRED
- Energetically and effectively works across organizational boundaries, collaborating to deliver awesome developer and platform capabilities.
- Experienced Engineer who drives Operational Excellence within the team
- Builds and fosters agile engineering capabilities and quality engineering practices
- Forward-looking Engineer with the execution know-how to take SIE to the next level of CI/CD
- Hands-on application management and support for AWS cloud environments, including full-stack diagnosis, fault resolution and root cause analysis.
- Drive and implement monitoring tools, metrics, and reports for tracking application and service performance.
- Collaborate with engineering and system teams to drive changes and ensure optimal application performance and resiliency.
- Lead service and system performance analysis, service capacity planning, and service continuity validation for multiple applications.
- Review and influence design, architecture, standards, and methods for deploying, monitoring and operating services and applications.
- Actively participate in, and commit to, the execution of tasks required to meet milestones and deliverables set by the Scrum team throughout the release cycle.
- Provide rotational on-call support.
QUALIFICATIONS:
- BS in Computer Science or equivalent experience
- 3+ years of professional Site Reliability experience operating microservices at scale
- 2+ years of hands-on AWS experience deploying, supporting, and managing applications
- Experience with Docker, Kubernetes, and in particular EKS
- Extensive use of automation and configuration management tools such as Ansible or Chef, with an obsessive desire to automate
- Strong development experience in one of these languages: Java, Python, or Go
- Experienced user of one or more source code management tools, preferably Git
- Experience with continuous integration and continuous delivery/deployment tools such as Jenkins, Spinnaker, or similar