Senior Manager Software Engineering SRE Job at Optum

Job Summary

About the Role:

As a Senior Site Reliability Engineering (SRE) Team Lead at Optum, you'll play a pivotal role in ensuring the reliability, performance, and scalability of our critical systems. You will lead a high-performing team, drive strategic initiatives, and collaborate with cross-functional teams to deliver exceptional results.

Key Responsibilities:

  • Team Leadership: Build, develop, and lead a world-class SRE team, fostering a culture of innovation, collaboration, and continuous improvement.
  • Technical Leadership: Provide technical guidance and mentorship to the team, ensuring adherence to best practices and industry standards.
  • Incident Management: Lead incident response efforts, drive root cause analysis, and implement effective solutions to prevent future occurrences.
  • Automation and Tooling: Champion automation initiatives, leveraging tools like Terraform, Ansible, and CI/CD pipelines to streamline operations and reduce manual effort.
  • Performance Optimization: Monitor system performance, identify bottlenecks, and implement optimizations to maintain performance and scalability.
  • Infrastructure as Code: Drive the adoption of infrastructure as code practices to improve consistency, reliability, and efficiency.
  • Cloud Expertise: Leverage cloud platforms (AWS, Azure, GCP) to build and maintain scalable and resilient infrastructure.
  • Collaboration: Work closely with engineering, product, and operations teams to align on goals, strategies, and priorities.
  • Strategic Planning: Develop and execute a long-term SRE strategy that aligns with the organization's business objectives.

Required Qualifications:

  • Bachelor's degree in Computer Science, Engineering, or a related field.
  • 8+ years of experience in Site Reliability Engineering, DevOps, or related roles.
  • 3+ years of experience leading and managing technical teams.
  • Strong understanding of cloud technologies (AWS, Azure, GCP) and infrastructure automation tools (Terraform, Ansible).
  • Proficiency in scripting languages (Python, Bash) and configuration management tools (Ansible, Puppet, Chef).
  • Experience with monitoring and logging tools (Datadog, Splunk, Prometheus, Grafana, ELK stack).
  • Solid understanding of CI/CD pipelines and DevOps practices.
  • Excellent problem-solving, troubleshooting, and communication skills.
  • Ability to work effectively in a fast-paced, dynamic environment.

Preferred Qualifications:

  • Experience with containerization technologies (Docker, Kubernetes).
  • Knowledge of chaos engineering and resilience engineering principles.
  • Certifications in relevant technologies (AWS, Azure, GCP, etc.).

Qualification:

Bachelor's degree in Computer Science, Engineering, or a related field (or equivalent experience).

Experience Required:

Minimum 8 years

Vacancy:

2-4 hires
