Devops Lead

The DevOps Technical Lead will be responsible for leading, mentoring, and managing the DevOps team while driving cloud infrastructure, automation, CI/CD pipelines, security, and reliability across AWS and Azure cloud environments. The role requires strong technical expertise combined with leadership and stakeholder management skills to ensure high availability, scalability, and performance of enterprise applications.
Leadership & Team Management:
- Lead and mentor DevOps engineers and cloud engineers.
- Define DevOps best practices, standards, and governance frameworks.
- Plan resource allocation and manage sprint deliverables.
- Conduct technical reviews and provide architectural guidance.
- Collaborate with development, QA, security, and operations teams.
Cloud Infrastructure Management (AWS & Azure):
- Design, implement, and manage scalable, secure cloud architectures.
- Maintain high availability and disaster recovery strategies.
- Optimize cloud costs using best practices.
- Implement monitoring, alerting, and logging solutions.
- Manage networking, identity access management, and security configurations.
CI/CD & Automation:
- Design and maintain CI/CD pipelines using tools such as Jenkins, GitHub Actions, GitLab CI/CD, or Azure DevOps.
- Automate infrastructure using Infrastructure as Code (IaC) tools such as Terraform, CloudFormation, ARM Templates, or Bicep.
- Implement containerization and orchestration using Docker and Kubernetes (EKS, AKS).
- Improve deployment strategies including blue-green, canary, and rolling deployments.
Security & Compliance:
- Implement DevSecOps practices.
- Ensure security best practices including vulnerability scanning, secrets management, and access controls.
- Support compliance requirements such as ISO, SOC2, GDPR where applicable.
- Work with security teams to perform audits and risk assessments.
Monitoring & Reliability:
- Implement observability solutions using tools such as Prometheus, Grafana, ELK, CloudWatch, Azure Monitor.
- Manage incident response and root cause analysis.
- Ensure SLA, SLO, and reliability standards are met.
Documentation & Process Improvement:
- Maintain architecture diagrams and operational documentation.
- Improve release management and deployment processes.
- Introduce performance optimization and automation enhancements.
You'll be redirected to
the company's application page