Site Reliability Engineer 2 Job in Qrata
Site Reliability Engineer 2
- Bengaluru, Bangalore Urban, Karnataka
- Not Disclosed
- Full-time
Job Description
SRE 2
Experience : 3 - 5 Years
Location : Bangalore (WFO)
About :
The company was founded in 2015 with the vision of changing the way Indians approach their finances,
no matter where they work, what they earn, or how tech-savvy they are. We are a Series C
funded company with over 6 million users already on board. Our aim is to make both travel and
managing finances simpler, smarter, and safer.
We currently operate in 4 major business lines:
Bharat - a prepaid salary card and digital banking product for blue-collar workers
Global - a zero forex markup travel card that works anywhere in the world
X - a co-branded savings account with industry-best interest rates, 0 maintenance charges, and a host of exciting features
Money - a wealth management platform with 0 commission mutual funds, domestic stocks, and more
What you need to know about the role :
If you have the passion to build highly scalable, reliable systems, then this is the place to be.
We give high priority to highly available systems, as we have an SLA of 99.99%.
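For a sense of scale, a 99.99% target leaves very little room for downtime. The following is a minimal sketch of that arithmetic, assuming the SLA is measured as uptime over a fixed window. The function name and the 30-day and 365-day windows are illustrative assumptions, not details from the role description.

```python
def allowed_downtime_minutes(sla_percent: float, period_days: float) -> float:
    """Return the downtime (in minutes) permitted by an availability SLA.

    Assumes availability is measured as uptime over a fixed window of
    `period_days` days; both parameters are illustrative.
    """
    total_minutes = period_days * 24 * 60
    # The error budget is the fraction of the window that may be unavailable.
    error_budget_fraction = 1 - sla_percent / 100
    return total_minutes * error_budget_fraction


if __name__ == "__main__":
    # Hypothetical measurement windows: one 30-day month and one 365-day year.
    for days in (30, 365):
        minutes = allowed_downtime_minutes(99.99, days)
        print(f"{days} days at 99.99%: about {minutes:.2f} minutes of downtime")
```

Under these assumptions, the budget works out to roughly 4.3 minutes per 30 days, or about 52.6 minutes per year.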
Must Haves :
1. Should have hands-on experience in building and managing production-grade Kubernetes
clusters from scratch.
2. Should have handled at least 2 migrations in their career (cloud, databases,
container orchestration, API gateways, etc.).
3. Automation: should have built workflows to automate dev requirements, e.g. Kong route
creation, building mutable infra, and database inserts, deletes, and updates.
4. 3+ years of experience in a tech-first, product-first company.
5. Should have at least 2 years of experience in AWS.
6. Should have experience in managing cloud infrastructure for SaaS companies.
7. Should have a deep understanding of, and experience in, Docker and Kubernetes orchestration.
8. Should have experience in managing microservices.
9. Should have experience in DR (Disaster Recovery) setup.
10. Should have a deep understanding of cloud infrastructure security and should have been
responsible for, or at least assisted in, security and compliance audits.
11. Should have seen and handled low-latency and high-request-volume requirements.
Roles and Responsibilities:
In partnership with engineering leadership, define the Service Level Indicators (SLIs), Service Level Objectives (SLOs), Service Level Agreements (SLAs), and error budgets.
Manage 24/7 production support, ensuring all production issues are resolved quickly; carry out RCA and implement fixes so that issues do not recur.
Latency
Maximum concurrent API calls
Availability, DR, and Business Continuity.
Data sizing.
Design and leverage best-in-class DevOps practices including CI/CD, monitoring and alerting, auto-scaling, etc.
Work hand-in-hand with the frontend and backend engineering teams to reduce or eliminate repetitive or manual tasks and to improve the health and performance of the business's sites and software systems.
Ensure that development environments (local, dev, staging, QA, etc.) are all set up and updated automatically.
Infrastructure Maintenance, Security & Compliance
Do capacity planning and cost optimization.
Build and own highly secure and available cloud infrastructure.
Work closely with Information Security organizations to ensure the highest levels of security and to respond swiftly to new and emerging vulnerabilities and security threats.
Ensure Disaster Recovery and Business Continuity are handled.
Assist the compliance team in ensuring that audit requirements are met.
Nice To Have:
Logging: Graylog with Elasticsearch backend
Monitoring: Datadog
CI/CD: Bitbucket / GitLab / Jenkins
Container Orchestration: Kubernetes
IaC: Terraform
Languages: Python/Flask
API Gateway: Kong
Configuration Management: AWS Systems Manager
Serverless: Lambda, Fargate, etc.
Qualification : Any Graduate
3 to 5 Years
2 - 4 Hires