Site Reliability Engineer 2 Job in Qrata

Site Reliability Engineer 2

Apply Now
Job Summary

Job Description

SRE 2


Exp : 3 - 5 Years


Location : Bangalore (WFO)


About :


The company was founded in 2015 with the vision of changing the way Indians approach their finances -

no matter where they work, what they earn, or how tech-savvy they are. We are a Series C

funded company with over 6 million users already on board, our aim is to make both travel and

managing finance simpler, smarter and safer.


We currently operate in 4 major business lines:

Bharat - a prepaid salary card and digital banking product for blue-collar workers

Global - a zero forex markup travel card that works anywhere in the world

X - co-branded savings account with industry-best interest rates, 0 maintenance charges, and a host of exciting features

Money - a wealth management platform with 0 commission mutual funds, domestic stocks, and more.


What you need to know about the role :

If you have the passion to build highly scalable, reliable, systems then this is the place to be.

we give high preference to highly available systems as we have a sla of 99.99%.


Must Haves :

1. Should have hands on experience in building and managing production grade Kubernetes

clusters from scratch

2. Should have handled at least 2 migrations in their careers migrations ( Cloud, databases,

Container Orchestration, Api Gateways etc )

3. Automation (Should have build workflows to automate dev requirements Ex: Kong Routes

Creation, Building mutable infra ,Database inserts deletes updates.

4. 3+ years experience in Tech-First Product-First company

5. Should have at least 2 yrs of experience in AWS

6. Should have experience in managing the cloud infrastructure for SAAS companies

7. Should have deep understanding and experience in docker and Kubernetes orchestration

8. Should have experience in managing microservices.

9. Should have experience in DR (Disaster Recovery) setup.

10. Should have a deep understanding of cloud infrastructure security and should have been

responsible or at least assisted in security and compliance audits

11. Should have seen and handled low latency and high request volume requirements.


Roles and Responsibilities:

In Partnership with engineering leadership, will work to build the Service level indicators (SLI), Service Level Objectives (SLO), Service level agreements (SLA s), and Error budgets

Manage 24/7 production support ensuring all production issues are resolved quickly; ensure RCA and fixes to ensure these do not recur in future

Latency

Maximum concurrent API calls

Availability, DR, and Business Continuity.

Data sizing.

Design and leverage best-in-class DevOps practices including CI/CD, monitoring and alerting, auto-scaling, etc.

Work hand-in-hand with the frontend and backend engineering teams to reduce or eliminate any repetitive or manual tasks, improving health and performance issues of the businesses' sites/software systems.

Ensure that development environments (local, dev, staging,QA, etc) are all setup and updated automatically.

Infrastructure Maintenance, Security & Compliance

Do capacity planning, cost optimization.

Build and own highly secure and available cloud infrastructure.

Work closely with Information Security organizations ensuring the highest levels of security and responding swiftly to any new and emerging vulnerabilities and security threats.

Ensure Disaster Recovery and Business Continuity are handled.

Assist the compliance team to ensure the audit requirements for the compliances are met.


Nice To Have:


Logging : Graylog with Elastic Search Backend

Monitoring : Datadog

Ci/Cd : BitBucket / Gitlab / Jenkins

Container Orchestration: Kubernetes

IAAS: Terraform

Languages: Python/Flask

Api Gateway: Kong

Configuration Management: Aws System Manager

Serverless: Lamda , Fargate , etc



Qualification :
Any Graduate
Experience Required :

3 to 5 Years

Vacancy :

2 - 4 Hires

Similar Jobs for you

See more recommended jobs