Role Description:
- Map an application's deployment architecture including cloud infrastructure and dependencies.
- Experience with Chaos testing scenarios (using Gremlin preferably).
- Ability to identify Failure Modes with end-to-end journeys (across UI, authentication layer, Application code, 3rd party systems, Databases, Data, Capacity, Infrastructure, Firewall and Network).
- Integrate SRE practices into Incident Management and Change Deployment process.
- Implementation of SRE practices in line with AWS security best practices and Well Architected Frameworks.
- Develop and Maintain SRE runbooks.
- Understand and share resiliency architectures.
- Strong understanding of SLO, SLI, Error Budgets and their implementation into SRE areas.
Competencies:
- Digital: Amazon Web Service (AWS) Cloud Computing
- Advanced Java Concepts
- Digital: Microservices
Diverse Lynx LLC is an Equal Employment Opportunity employer. All qualified applicants will receive due consideration for employment without any discrimination. All applicants will be evaluated solely on the basis of their ability, competence and their proven capability to perform the functions outlined in the corresponding role. We promote and support a diverse workforce across all levels in the company.
#J-18808-Ljbffr