Site Reliability Engineering-product Owner Job in Autorabit
Site Reliability Engineering-product Owner
- Hyderabad, Telangana
- Not Disclosed
- Full-time
- Permanent
Job Role
Site Reliability Engineering (SRE) at AutoRabit combines software development, systems engineering and customer support to build, operate and support the AutoRABIT Software-as-a-Service offerings. These SaaS based products are highly reliable (self-healing) and highly available (self-scaling), ensuring that delivery of AutoRabit's customer-facing services has the extreme high availability that our customer base, containing many Fortune 500 companies, can comfortably rely upon.
The SRE team monitors our systems capacity and performance so that they can respond to trouble at the first sign, rather than face an outage. Software development within the SRE team focuses on optimizing and building cloud infrastructure, supporting software platforms and eliminating manual work through automation.
The SRE team s ultimate goal is to deliver AutoRabit services in a high quality and professional way, while aiming to ensure the long-term success of our business and the happiness and confidence of our customers.
As a Product Owner for SRE, you will develop the strategy & roadmap focused on creating ultra-scalable-and highly reliable Software-as-a-Service delivery and support system. In partnership with the engineering and program management teams, you will lead the execution of associated projects. You have strong experience in Product Management, Software Engineering, Site Reliability Engineering, Infrastructure and systems management. You enjoy operating on quick iteration cycles while working with demanding customers and multiple stakeholders.
Roles & Responsibilities
- Define, design, measure availability and key top-level Operational Intelligence (OI) metrics, plug instrumentation holes required for the metrics
- Define and develop leading indicator monitoring solutions and establish SLA monitoring to protect the top-level metrics
- Build the Service Improvement Plan and partner with Product team to land on it
- Partner with Product team in building the deployment architecture
- Responsible for landing on Service Level Objectives for the Service level Indicators
- Generate actionable insights from operational data for improving end user quality of service
- Research, architect, develop and deliver solutions in an agile development environment.
- DevOps support for web SaaS applications hosted on AWS and IBM Cloud.
- Help design, build and maintain configuration management automation and deployment automation with Ansible and Python.
- Deployment of applications with Spring Boot, Kubernetes, Tomcat, Apache, and nginx.
- Build auto-scaled systems with Kubernetes, Lambda, and Ansible.
- Tune server and application-level performance monitoring and alerting.
- Server-level troubleshooting of TimeTrade applications.
- Manage SMTP mail flows to service providers.
- Configure firewalls, VPNs, and routing for web application hosting.
- CI/CD build pipeline with Jenkins and Artifactory
- Provide and maintain system documentation.
- Maintain best practice for OS, network, and application hardening.
- Continual evaluation of processes and technologies we use and suggesting areas for improvement.
- Participate in on-call rotation as primary Operations contact (typically 1 week every 6 weeks).
- Excellent written and verbal English communication skills.
- Adhere to set internal controls.
Desired Skills and Experience
- Ability to think strategically and craft a compelling platform vision.
- Experience working with remote teams in vastly different time zones. Ability to work with customers and employees during their business hours, even when halfway around the world.
- Prior DevOps experience is strongly preferred.
- Prior experience and familiarity with industry standard ALM platforms, defect tracking tools and SDLC practices.
- Demonstrated experience with cloud infrastructure management, automation, monitoring or cost optimization is strongly preferred.
- Demonstrated experience with service support products like service desks, and incident response tools is desirable.
- Strong knowledge of cloud architecture, microservices management, Kubernetes in production is desirable
- Experience with AWS infrastructure such as ELB and AWS autoscaling.
- Must have a great, positive attitude.
- Strong analytical skills and an eye for detail. Excellent written and verbal communication skills in English are a necessity.
- Must be able to work independently and as part of a team, but always productively.
Education and Qualifications
- B.S. degree in computer science, engineering, mathematics, hard science or equivalent experience is REQUIRED.
- 5+ years experience in product management
- 2+ years working in SaaS or managed services platforms
- AWS Professional Certification is desired.
2 to 5 Years
2 - 4 Hires