Job Intersection | Contract Job on Site Reliability Engineer in Stefanini North America and APAC

Job Summary

Location

Toronto, ON C6A

Job Type

Contract

Visa

Any Valid Visa

Salary

PayRate

Qualification

BCA

Experience

2 Years - 10 Years

Posted

14 Jan 2025

Job Description

Join our team as a Site Reliability Engineer, where you'll work with a diverse range of technologies, from IoT communication protocols to cloud-scalable connectivity and big data processing. You will be instrumental in ensuring the constant uptime, seamless scalability, and robust foundation for our critical systems, enabling the growth of new applications and services. This role goes beyond traditional operations, collaborating closely with developers and architects to enhance stability, security, and scalability from the design phase onwards.

Responsibilities:

Collaborate with developers and architects to improve system design and implementation for enhanced stability, security, and scalability.
Implement and enhance AI infrastructure and application monitoring and observability capabilities to maximize reliability.
Partner with application engineering teams to improve service operability, reliability, on-call efficiency, incident management, and post-mortem analysis.
Drive production readiness and improve key areas such as capacity planning, configuration management, and observability.
Design and refine architectures for new and existing systems based on reliability and high availability principles, incorporating comprehensive logging and observability.
Develop and apply expertise in client infrastructure and best practices to enhance platforms for world-class distributed system performance.
Develop tooling and automation to streamline infrastructure and application operations.
Gather and analyze metrics from operating systems and applications to optimize performance and facilitate fault finding.
Lead deep-dive troubleshooting of production issues and actively participate in diagnostic calls.

Qualifications:

Bachelor's degree in Computer Science, Engineering, or a related technical field.
5+ years of experience supporting internet-facing production services and distributed systems.
Strong expertise in AWS managed services, including Kafka, ElastiCache (Redis), PostgreSQL, and AMQP brokers.
Hands-on experience with Pulumi, Terraform, and Terragrunt for Infrastructure as Code (IaC).
Advanced proficiency in Kubernetes, with hands-on experience managing large-scale, production-grade clusters, optimizing workloads, and implementing best practices for scalability and high availability.
Experience with ArgoCD and GitLab for CI/CD pipelines.
Expertise in Linux systems, particularly Red Hat and Debian distributions.
Experience implementing Prometheus for monitoring and observability.
Strong scripting skills (Bash, PowerShell) and command-line interface proficiency.
Proven ability to troubleshoot complex technical problems in distributed systems, networking, and security, especially on AWS.
Self-starter with a focus on continuous improvement and operational optimization.
Strong programming skills in at least one interpreted, dynamically typed language (e.g., Python, JavaScript on Node.js) and one compiled, statically typed language (e.g., C#, Java).
Relevant SRE training and certifications are a plus.
Excellent verbal and written communication skills in English.

This is an excellent opportunity to contribute to a cutting-edge AI product and work with a talented team. If you are passionate about reliability, scalability, and automation, we encourage you to apply.

Other Similar Jobs


Java Full Stack Developer Toronto, ON 12 Days ago
Deskside Technician Ingersoll, ON 8 Hours ago
Engineering Manager Toronto, ON 8 Hours ago
Senior Data Science Engineer Toronto, ON 8 Hours ago
Senior Java Developer Toronto, ON 8 Hours ago
GCP Front End Developer Southfield, MI 10 Days ago
Solutions Architect Southfield, MI 6 Days ago
Network Engineer Richmond, VA 23 Hours ago
GCP Data Engineer Southfield, MI 12 Days ago
Full Stack Engineer Southfield, MI 12 Days ago
Cyber Security Engineer Wilmington, DE 12 Days ago
SAP Project Manager Southfield, MI 12 Days ago
Engineering Project Manager Greensboro, NC 23 Hours ago
Learning and Development Coordinator Parma, ID 12 Days ago
Automotive Product Engineer Salem, VA 9 Days ago
