(Canada) Site Reliability Engineer

  • Thomson Reuters
Job Summary
Location
ON
Job Type
Contract
Visa
Any Valid Visa
Qualification
BCA
Experience
2 Years - 10 Years
Posted
16 Nov 2024
Job Description

(Canada) Site Reliability Engineer (Contract)

Contract (5 months 29 days)

Published 8 months ago

CLOSED

GCP

Site Reliability Engineer - in the Service Management Organization

Do you have experience in IT Service Management, working with cloud providers, software development, and technology infrastructure?

The Site Reliability Engineer will analyze chronic and major issues, evaluate products and their services, and make recommendations to improve service outcomes. The role also designs solutions in partnership with product, engineering, and architecture teams, and builds, tests, and operationalizes tools and applications to improve customer experience and reduce costs. Additionally, the Lead Site Reliability Engineer will provide oversight and coaching to engineers and serve as an escalation point for our global command center engineers. We have multiple opportunities at different levels of seniority.

About the Role

In this opportunity as Site Reliability Engineer, you will be responsible for:

  • Service design, service operations, release, development, and support
  • Leading the work to drive efficiencies and reduce service operations risks
  • Leading the research of new capabilities, testing new solutions, recommending and implementing new technologies to improve customer experience and reduce costs
  • Collaborating and partnering with cross-functional teams to solve intractable problems and devising solutions to improve the products and services we offer our customers
  • Supporting existing services running in GCP
  • Assisting with the migration of services to their destination platform in 2024
  • Ensuring all services are in line with TR standards

About You

You're a fit for the role of Site Reliability Engineer if:

  • You are proficient in cloud technologies, services, use of their APIs, and configuration tools
  • You use AI/ML tools to help improve service and reduce costs, and you have worked with AI-Operations solutions
  • You are familiar with programming languages such as Python, Java, and C#
  • You have designed and supported scalable systems and services
  • You are proficient with Networking, Windows, Linux, Containers, PostgreSQL, or related infrastructure services at scale
  • You can automate tasks to improve service operations and support
  • You use configuration management tools to manage configuration at scale
  • You apply the scientific method to system components to identify improvements
  • You have experience with GCP; experience with AWS or Azure would be nice to have
  • You have experience with Kubernetes and Helm
  • You have experience with Python; familiarity with Java or Node.js would be nice to have as well
  • You have experience using tools like Terraform and Ansible
  • You are familiar with Heroku (a plus)