Careem is building the Everything App for the Middle East, aiming to simplify everyday life through seamless transportation, food and grocery delivery, payment solutions, and more. Since its founding in 2012, Careem has enabled more than 2.5 million Captains to earn an income and has served more than 70 million customers in over 70 cities across 10 countries, from Morocco to Pakistan. As the company moves into a new era driven by artificial intelligence, it is looking for innovative and curious AI professionals to develop impactful tools, automate workflows, and improve operational efficiency. The Senior Site Reliability Engineer II (L10) position within the Storage & Infrastructure team is an opportunity to build, scale, and automate the core data services that support both traditional and AI-driven workloads.
Key Responsibilities
- Deploy, scale, and maintain cloud-native data systems on AWS, ensuring high availability and optimal performance.
- Automate storage operations using Infrastructure as Code (IaC) tools such as Terraform and Pulumi.
- Support AI infrastructure components, including vector databases and embedding stores like Milvus, Weaviate, and Pinecone.
- Collaborate closely with machine learning engineers and platform teams to support services powered by large language models (LLMs).
- Monitor and optimize system performance using tools such as Prometheus, Grafana, and OpenTelemetry.
- Participate in on-call rotations and contribute to post-incident reviews to maintain operational excellence.
- Design secure, scalable, and cost-efficient environments that are AI-ready and future-proof.
Required Qualifications
- 5 to 8 years of experience managing distributed systems at scale.
- Proficiency in one or more programming languages such as Go, Python, or Bash.
- Strong expertise in cloud infrastructure, preferably AWS.
- Hands-on experience with Infrastructure as Code (IaC) and Continuous Integration/Continuous Deployment (CI/CD) pipelines.
- Familiarity with distributed data systems including Kafka, Redis, Cassandra, MySQL, Postgres, and OpenSearch.
Preferred Qualifications and Benefits
- Experience with AI infrastructure components such as vector stores, model serving platforms like Ray, and LLM application frameworks like LangChain and LlamaIndex.
- A strong curiosity and eagerness to learn about integrating infrastructure with AI agents and LLM-based applications.
Careem offers a dynamic and supportive work environment where employees can make a meaningful impact across the region while continuously developing their skills. The company fosters collaboration with inspiring peers and provides opportunities to work on cutting-edge AI-integrated infrastructure. The work arrangement is hybrid, with four days in the office and one day remote per week, along with the option to work remotely from any country for up to 30 days annually. Additional benefits include unlimited vacation days, healthcare coverage, and fitness reimbursements for activities such as gym memberships and training classes. This role is based in Pakistan, with offices located in Islamabad, Karachi, and Lahore.
Joining Careem means becoming part of a purposeful organization committed to innovation and regional impact, offering ample opportunities for professional growth and collaboration on some of the most advanced engineering platforms in the Middle East.