(Canada) Site Reliability Engineer (Contract)
Contract (9 months 4 days)
Published 3 days ago
New Relic
Data Dog
Site Reliability Engineer - in the Service Management Organization
Do you have experience in IT Service Management, working with cloud providers, software development, and technology infrastructure?
The Site Reliability Engineer will analyze chronic and major issues, evaluate products and their services, make recommendations to improve service outcomes, design solutions in partnership with product, engineering, and architecture teams, build, test, and operationalize tools and applications to improve customer experience and reduce costs. Additionally, the Lead Site Reliability Engineer will provide oversight and coaching to engineers and be an escalation point for our global command center engineers. We have multiple opportunities at different levels of seniority.
About the Role
In this opportunity as Site Reliability Engineer, you will be responsible for:
- Service design, service operations, release, development, and support
- Leading the work to drive efficiencies and reduce service operations risks
- Leading the research of new capabilities, testing new solutions, recommending and implementing new technologies to improve customer experience and reduce costs
- Collaborating and partnering with cross-functional teams to solve intractable problems and devise solutions to improve the products and services we offer our customers
About You
You're a fit for the role of Site Reliability Engineer if:
- You are proficient in cloud technologies, services, use of their APIs, and configuration tools
- You use AI/ML tools to help improve service, reduce costs, and have worked with AI-Operations solutions
- You are familiar with programming languages such as Python, Java, C#
- You have designed and supported scalable systems and services
- You are proficient with Networking, Windows, Linux, Containers, PostgreSQL, or related infrastructure services at scale
- You can automate tasks to improve service operations and support
- You use configuration management tools to manage configuration at scale
- You apply the scientific method to system components to identify improvements
- You are proficient in Observability tools such as Data Dog or New Relic
- You are proficient in data analysis from sources such as SQL, S3, Athena, etc.
#J-18808-Ljbffr