Lead DevOps
Gradera
Hyderabad, Telangana, IN
Job Description
DevOps Lead
Overview
We are seeking a hands-on, strategic Lead DevOps Engineer to architect, implement, and continuously improve the infrastructure, automation, and operational practices that underpin our agentic platform. As Lead DevOps Engineer, you will drive the design and delivery of scalable, secure, and highly available cloud-native environments, CI/CD pipelines, and observability solutions. You will collaborate closely with engineering, security, and product teams to enable rapid iteration, reliability, and operational excellence across all platform components (UI, API, AI/Data).
Key Responsibilities
- Architect, implement, and maintain cloud-native infrastructure (e.g., AWS, GCP, Azure, Kubernetes)
- Design and manage CI/CD pipelines for automated build, test, and deployment across all teams
- Ensure platform scalability, reliability, and high availability through infrastructure as code and automation
- Drive adoption of best practices in monitoring, alerting, logging, and incident response
- Champion security, compliance, and cost optimization in all DevOps practices
- Collaborate with engineering teams to ensure seamless integration and deployment of platform components
- Lead root cause analysis, incident management, and postmortem processes
- Mentor and support DevOps and engineering team members in operational excellence
- Evaluate and implement new tools, frameworks, and cloud services to improve efficiency and reliability
- Document infrastructure, processes, and operational runbooks
- Foster a culture of continuous improvement, automation, and knowledge sharing
Core Qualities & Skills
- Proven experience in DevOps, SRE, or infrastructure engineering roles
- Deep expertise in cloud platforms, containerization, and orchestration (e.g., Docker, Kubernetes)
- Strong skills in infrastructure as code (e.g., Terraform, CloudFormation)
- Experience designing and operating CI/CD pipelines (e.g., GitHub Actions, GitLab CI, Jenkins)
- Strong understanding of monitoring, observability, and incident response best practices
- Experience with security, compliance, and cost management in cloud environments
- Excellent collaboration and communication skills across disciplines
- Ability to troubleshoot complex distributed systems and drive root cause analysis
- Commitment to engineering excellence, security, and responsible practices
- Willingness to learn, adapt, and drive change in a fast-paced environment
Preferred Qualifications
- 7+ years of hands-on DevOps, SRE, or infrastructure engineering experience
- Track record of delivering and scaling cloud-native platforms
- Experience with multi-cloud and hybrid cloud architectures
- Experience with service mesh, API gateways, and advanced networking
- Experience working in cross-functional, agile teams
Highly Desirable
- Experience supporting GenAI, LLM, or agentic platforms
- Experience with advanced observability, monitoring, and analytics tools
- Experience with regulatory, compliance, or privacy-driven environments
- Experience building and scaling DevOps teams in startup or high-growth settings
Success Metrics
- Platform uptime, reliability, and availability
- Deployment frequency and lead time for changes
- Mean time to detect (MTTD) and mean time to resolve (MTTR) incidents
- Infrastructure cost efficiency and optimization
- Security and compliance posture
- Developer velocity and satisfaction with DevOps processes
- Automation coverage and reduction of manual operations
- Incident response and postmortem quality
- Team growth, engagement, and knowledge sharing
Details
- Department: Pro-Services
- Work Type: unknown
- Locations: Hyderabad, Telangana, IN
- Posted: January 26, 2026
- Source: bamboohr