Position Summary: The Director of DevOps is responsible for overseeing the company's DevOps strategy, ensuring the seamless integration, migration, and deployment of software across all platforms. This role requires a visionary leader who can drive the adoption of best practices, foster a culture of automation, continuous improvement and collaboration, and ensure the reliability, scalability, and security of our infrastructure.
Key Responsibilities:
- Leadership and Strategy:
- Develop and execute the DevOps strategy, aligning with business objectives and technology goals.
- Lead, mentor, and grow a high-performing DevOps team, fostering a culture of collaboration, continuous improvement, and innovation.
- Collaborate with engineering, product, infosec, and operations teams to ensure seamless integration and delivery of applications.
- Team Management:
- Build, manage, and mentor a high-performing DevOps team, including DevOps Engineers, Site Reliability Engineers (SREs), Cloud Engineers, and Release Managers.
- Foster a culture of continuous learning and improvement within the team.
- Infrastructure Management:
- Manage container infrastructure hosted in cloud environments (AWS, Azure, GCP) and container orchestration platforms (e.g., Kubernetes, Docker Swarm).
- Oversee the design, implementation, and maintenance of scalable, reliable, and secure infrastructure using container orchestration tools.
- Implement infrastructure as code (IaC) practices using tools like Terraform, Ansible, or CloudFormation.
- Ensure infrastructure as code (IaC) practices are followed and automated across all environments.
- Continuous Integration and Delivery (CI/CD):
- Develop and maintain robust CI/CD pipelines to automate the build, test, and deployment processes.
- Work closely with development and infosec teams to integrate testing and security into the CI/CD pipelines.
- Ensure the reliability and stability of deployment processes, reducing downtime and increasing release frequency.
- Monitoring and Performance:
- Implement comprehensive monitoring, logging, and alerting systems to ensure infrastructure and application performance.
- Perform regular performance tuning, capacity planning, and disaster recovery planning.
- Utilize tools such as New Relic, Prometheus, Grafana, ELK Stack, and Datadog for monitoring and observability.
- Drive incident management processes, ensuring timely resolution and root cause analysis.
- Security and Compliance:
- Ensure infrastructure security by implementing best practices for identity and access management, network security, and data protection.
- Ensure DevOps practices adhere to security standards and compliance requirements.
- Conduct regular security audits and vulnerability assessments, implementing necessary remediation measures.
- Collaborate with the security team to implement DevSecOps practices, integrating security into the CI/CD pipeline.
- Collaboration and Communication:
- Communicate DevOps goals, progress, and challenges to executive leadership.
- Communicate effectively with stakeholders, providing regular updates on infrastructure status, project progress, and potential risks.
- Promote a DevOps culture across the organization, advocating for best practices in automation, collaboration, and continuous improvement.
- Work closely with engineering, product, and QA teams to ensure alignment on goals and priorities.
- Vendor and Tool Management:
- Evaluate and manage relationships with third-party vendors and tools relevant to DevOps practices.
- Ensure the adoption of best-in-class tools and technologies.
- Budget Management:
- Develop and manage the DevOps budget, ensuring efficient allocation of resources.
- Identify cost-saving opportunities without compromising on quality and reliability.
Qualifications:
- Education:
- Bachelor’s degree or higher in Computer Science, Engineering, or a closely related field.
- Experience:
- 10+ years of experience in DevOps, infrastructure, and operations roles, with the majority in DevOps.
- 5+ years of leadership experience managing DevOps.
- Proven experience with container orchestration tools (e.g., Kubernetes, Docker Swarm).
- Strong background in CI/CD pipeline development and management.
- Experience with Azure cloud (AWS, GCP, others a plus).
- Technical Skills:
- Strong expertise in cloud platforms (Azure, AWS, GCP) and container orchestration (Kubernetes, Docker Swarm).
- Proficiency in CI/CD tools (e.g., TeamCity, Jenkins, GitLab CI, CircleCI) and infrastructure as code (Terraform, Ansible, CloudFormation).
- Experience with monitoring and observability tools (e.g., New Relic, Prometheus, Grafana, ELK Stack, Datadog).
- Strong scripting and automation skills (e.g., PowerShell, Python, Bash).
- Soft Skills:
- Excellent leadership and team management skills.
- Strong problem-solving and analytical skills.
- Effective communication and collaboration abilities.
- Strategic thinking and ability to align technical initiatives with business goals.
Preferred Qualifications:
- Certifications in cloud platforms (e.g., AWS Certified Solutions Architect, Google Cloud Professional DevOps Engineer, etc.).