Job Description

what is CRED?

CRED is an exclusive community for India’s most trustworthy and CREDitworthy individuals, where the members are rewarded for good financial behavior. CRED was born out of a need to bring back the focus on a long lost virtue, one of trust, the idea being to create a community centered around this virtue. a community that constantly strives to become more virtuous in this regard till they finally scale their behavior to create a utopia where being trustworthy is the norm and not the exception. to build a community like this requires a community of its own; a community special in its own way, working towards making this vision come true

here’s a thought experiment: what do you get when you put a group of incredibly passionate and driven people and entrust them with the complete freedom to chase down their goals in a completely uninhibited manner? answer: you get something close to what we have at CRED; CRED just has it better

at CRED, technology is the backbone that fuels problem-solving at speed and scale with smartness. we have a good share of left-brain/right-brain engineers who solve complex problems, yet at creative best in crafting great consumer experiences. engineers at CRED are highly empowered and are entrusted to own, understand, challenge, and create an upstream impact in Product and Business

if you are a go-getter, passionate about solving real problems, and enjoy the company and camaraderie of some of the best minds in the game, you should definitely explore CRED

here’s what will be in store for you at CRED once you join as a site reliability engineer
what you will do?
  • design, implement, and manage scalable, fault-tolerant cloud infrastructure
  • work closely with engineering teams to translate business requirements into reliable infrastructure systems
  • operate containerized workloads on AWS using ECS and EKS
  • build and maintain observability to understand system health and performance
  • ensure reliability by diagnosing production issues and restoring services under real-world load
  • automate infrastructure and operations using Infrastructure as Code and CI/CD pipelines
  • operate highly compliant financial services infrastructure and ensure adherence to PCI-DSS, ISO 27001, RBI data-localization, and NPCI guidelines
  • participate in on-call rotations and incident response, owning problems end-to-end
  • you should apply if you:
  • have 2–5 years of experience working with production infrastructure or backend systems
  • bring strong Linux fundamentals and a genuine interest in operating systems
  • are comfortable troubleshooting across systems, containers, and networks
  • have hands-on experience with cloud platforms, preferably AWS
  • have worked with or been exposed to container orchestration platforms such as ECS or Kubernetes
  • are deeply curious about microservice ecosystems, spanning Linux, containers, networking, infrastructure as code, CI/CD, datastores, and observability
  • enjoy managing large, complex distributed systems in production
  • demonstrate strong problem-solving skills and proficiency in at least one programming language
  • may have had exposure to data or platform workloads like Spark, Airflow, Flink, Kafka, or AWS Batch
  • may have experience running stateful or data-intensive workloads on Kubernetes
  • have some understanding of data pipelines, batch vs. streaming workloads, and resource / capacity tuning
  • have operated or interacted with observability stacks such as Loki, VictoriaMetrics, Prometheus, or Grafana
  • have seen or contributed to infrastructure or workload cost optimisation in cloud environments