Job Description

what is CRED?

CRED is an exclusive community for India’s most trustworthy and CREDitworthy individuals, where the members are rewarded for good financial behavior. CRED was born out of a need to bring back the focus on a long lost virtue, one of trust, the idea being to create a community centered around this virtue. a community that constantly strives to become more virtuous in this regard till they finally scale their behavior to create a utopia where being trustworthy is the norm and not the exception. to build a community like this requires a community of its own; a community special in its own way, working towards making this vision come true

here’s a thought experiment: what do you get when you put a group of incredibly passionate and driven people and entrust them with the complete freedom to chase down their goals in a completely uninhibited manner? answer: you get something close to what we have at CRED; CRED just has it better

at CRED, technology is the backbone that fuels problem-solving at speed and scale with smartness. we have a good share of left-brain/right-brain engineers who solve complex problems, yet at creative best in crafting great consumer experiences. engineers at CRED are highly empowered and are entrusted to own, understand, challenge, and create an upstream impact in Product and Business

if you are a go-getter, passionate about solving real problems, and enjoy the company and camaraderie of some of the best minds in the game, you should definitely explore CRED

here’s what will be in store for you at CRED once you join as a site reliability engineer

what you will do?

design, implement, and manage scalable, fault-tolerant cloud infrastructure

work closely with engineering teams to translate business requirements into reliable infrastructure systems

operate containerized workloads on AWS using ECS and EKS

build and maintain observability to understand system health and performance

ensure reliability by diagnosing production issues and restoring services under real-world load

automate infrastructure and operations using Infrastructure as Code and CI/CD pipelines

operate highly compliant financial services infrastructure and ensure adherence to PCI-DSS, ISO 27001, RBI data-localization, and NPCI guidelines

participate in on-call rotations and incident response, owning problems end-to-end

you should apply if you:

have 2–5 years of experience working with production infrastructure or backend systems

bring strong Linux fundamentals and a genuine interest in operating systems

are comfortable troubleshooting across systems, containers, and networks

have hands-on experience with cloud platforms, preferably AWS

have worked with or been exposed to container orchestration platforms such as ECS or Kubernetes

are deeply curious about microservice ecosystems, spanning Linux, containers, networking, infrastructure as code, CI/CD, datastores, and observability

enjoy managing large, complex distributed systems in production

demonstrate strong problem-solving skills and proficiency in at least one programming language

may have had exposure to data or platform workloads like Spark, Airflow, Flink, Kafka, or AWS Batch

may have experience running stateful or data-intensive workloads on Kubernetes

have some understanding of data pipelines, batch vs. streaming workloads, and resource / capacity tuning

have operated or interacted with observability stacks such as Loki, VictoriaMetrics, Prometheus, or Grafana

have seen or contributed to infrastructure or workload cost optimisation in cloud environments