SRE - Site Reliability Engineer - Senior

job
  • Aquent
Job Summary
Location
Lone Tree ,IA 52755
Job Type
Contract
Visa
Any Valid Visa
Salary
PayRate
Qualification
BCA
Experience
2Years - 10Years
Posted
15 Mar 2025
Share
Job Description

SRE - Site Reliability Engineer - Senior

We seek a strong technical lead with management background to help support and oversee our technical team, provide direction and guidance for development projects, and ensure that all projects meet the highest quality standards. The ideal candidate will be a creative thinker to provide innovative solutions to complex development challenges. The candidate should possess a broad technical understanding of disparate tech stack, has IBM knowledge, or has been a technical project manager.

The role incorporates aspects of software engineering and operations. We are combining these skills to come up with better ways of managing and operating applications. Fundamental to the role is managing the performance, availability, and security of the network systems and applications, using tools and techniques such as monitoring, testing, automation, and incident response.

Qualifications:

  • Bachelor’s degree in computer science, Information Technology, or related field (or equivalent experience)
  • 5+ years of experience in software development and technical leadership
  • Ability to work onsite and provide leadership in a hands-on, responsive capacity
  • Proven experience working with teams across disparate locations including offshore resources.
  • Strong understanding of cloud computing, networking, Linux systems administration, containerization (e.g., Docker, Kubernetes, PCF), and infrastructure as code
  • Strong understanding of IT security practices, including vulnerability management, patching, and system hardening, Change Management and Production Control
  • Ability to analyze complex systems, troubleshoot issues, and devise effective solutions under pressure
  • Excellent communication skills, both written and verbal, with the ability to provide clear, concise reports and updates
  • Demonstrated ability to manage multiple tasks, projects, and priorities in a fast-paced, dynamic environment
  • Strong customer service mindset, with a focus on responsiveness and building positive relationships with clients and stakeholders
  • Experience with providing 24x7x365 support and working in environments with high uptime and availability requirements
  • Solid project management and organizational skills - Project Management Professional (PMP) or equivalent certification a plus
  • ITIL certification or similar for service management best practices including working knowledge of ITIL and Service Availability concepts
  • Demonstrated knowledge of software development methodologies and techniques
  • Strong analytical, troubleshooting and problem-solving skills

Responsibilities:

  • Manage issues for software engineers, providing technical direction and guidance for development projects
  • Provide leadership and guidance in incident response practices within the designated business area, overseeing response efforts and driving continuous improvement in incident resolution and service restoration.
  • Oversee server infrastructure across multiple platforms, including M/F, Unix, Linux, Solaris, Windows, and cloud environments
  • Ensure the security, availability, and performance of all server systems by implementing best practices in system administration, patch management, and monitoring
  • Collaborate with security teams to enforce security policies, manage vulnerabilities, and improve overall system security
  • Develop and maintain operational procedures and documentation, ensuring that all team members adhere to Standard Operating Procedures (SOPs)
  • Provide comprehensive reports on team performance, system health, and project status to senior leadership
  • Develop and execute detailed project plans, managing resources, timelines, and deliverables for system upgrades, migrations, and other initiatives
  • Act as the primary point of contact for escalations and high-impact incidents, ensuring swift resolution and communication with relevant stakeholders
  • Deliver outstanding customer service by maintaining strong relationships with both internal teams and external clients, ensuring that all issues and requests are addressed promptly
  • Continuously evaluate and improve team workflows, processes, and toolsets to enhance efficiency and service delivery
#J-18808-Ljbffr
Other Smiliar Jobs
 
  • San Francisco, CA
  • 3 Days ago
  • Austin, TX
  • 2 Days ago
  • Vancouver, WA
  • 2 Days ago
  • San Francisco, CA
  • 2 Days ago
  • Portland, OR
  • 2 Days ago
  • Denver, CO
  • 2 Days ago
  • Greenwood Village, CO
  • 2 Days ago
  • Dallas, TX
  • 2 Days ago
  • San Francisco, CA
  • 2 Days ago
  • Seattle, WA
  • 2 Days ago
  • Seattle, WA
  • 2 Days ago
  • Denver, CO
  • 2 Days ago
  • Southlake, TX
  • 2 Days ago
  • Ann Arbor, MI
  • 2 Days ago
  • Beaverton, OR
  • 2 Days ago