(Canada) Site Reliability Engineer

job
  • Thomson Reuters
Sorry the Job you are looking for is no Longer available

Job Summary
Location
,ON
Job Type
Contract
Visa
Any Valid Visa
Salary
PayRate
Qualification
BCA
Experience
2Years - 10Years
Posted
12 Nov 2024
Share
Job Description

(Canada) Site Reliability Engineer (Contract)

Contract (9 months 4 days)

Published 3 days ago

New Relic

Data Dog

Site Reliability Engineer - in the Service Management Organization

Do you have experience in IT Service Management, working with cloud providers, software development, and technology infrastructure?

The Site Reliability Engineer will analyze chronic and major issues, evaluate products and their services, make recommendations to improve service outcomes, design solutions in partnership with product, engineering, and architecture teams, build, test, and operationalize tools and applications to improve customer experience and reduce costs. Additionally, the Lead Site Reliability Engineer will provide oversight and coaching to engineers and be an escalation point for our global command center engineers. We have multiple opportunities at different levels of seniority.

About the Role

In this opportunity as Site Reliability Engineer, you will be responsible for:

  • Service design, service operations, release, development, and support
  • Leading the work to drive efficiencies and reduce service operations risks
  • Leading the research of new capabilities, testing new solutions, recommending and implementing new technologies to improve customer experience and reduce costs
  • Collaborating and partnering with cross-functional teams to solve intractable problems and devise solutions to improve the products and services we offer our customers

About You

You're a fit for the role of Site Reliability Engineer if:

  • You are proficient in cloud technologies, services, use of their APIs, and configuration tools
  • You use AI/ML tools to help improve service, reduce costs, and have worked with AI-Operations solutions
  • You are familiar with programming languages such as Python, Java, C#
  • You have designed and supported scalable systems and services
  • You are proficient with Networking, Windows, Linux, Containers, PostgreSQL, or related infrastructure services at scale
  • You can automate tasks to improve service operations and support
  • You use configuration management tools to manage configuration at scale
  • You apply the scientific method to system components to identify improvements
  • You are proficient in Observability tools such as Data Dog or New Relic
  • You are proficient in data analysis from sources such as SQL, S3, Athena, etc.
#J-18808-Ljbffr