Get to know the Role
The Site Reliability Team is responsible for getting things to production in the most efficient and fastest way possible. We architect solutions, tools, and platforms around provisioning, configuration, CI/CD, monitoring, SLA, performance and uptime. Our team is passionate about the details and we work very closely with a wide range of stakeholders.
The Day-to-Day Activities
Design and implement the architecture of our next generation of automated infrastructure following the Infrastructure as Code model
Apply best practices to deliver high-quality code and uphold code quality across the whole project
Request and conduct code reviews
Use and promote the company’s development standards
Be involved in change, release, and incident management
Identify and resolve problems relating to critical service operations, and prevent their recurrence through automation
Help improve reliability and stability, and tackle scalability challenges with engineering teams
Write and maintain technical documentation relevant to the project
Optimize existing systems, build infrastructure, and reduce manual work through automation
Participate in planning and estimation of efforts to implement, test, and maintain features
Participate in code and design reviews to maintain high development standards
Mentor other engineers, define our technical culture, and help build a fast-growing team
Engage with the development team to help develop software for reliability and scale, ensuring minimal refactoring or changes
The Must-Haves
Preferably a degree in computer science, software engineering, information technology or related fields
Experience (4+ years) with Linux and Windows environments (administration, advanced networking, security)
Experience with infrastructure automation and provisioning tools (e.g. Terraform and Ansible/Chef/Puppet)
Experience with one or more cloud environments (AWS, Azure preferred)
Fluency in English is a must
The Nice-to-Haves
Experience with implementing and improving CI/CD processes (build & deployment pipelines)
Experience with containerization technologies (e.g. Docker) and container orchestration platforms (e.g. Kubernetes)
Familiarity with database administration and the ELK stack
Familiar with infrastructure and application monitoring tools
Comfortable with managing and monitoring CI/CD tools (e.g. Jenkins)
Ability to pick up new technologies easily and eagerness to expand your knowledge
Experience working with microservice architectures
Enjoyment of scripting in languages like Python and Bash
Get to know Grab
Grab is more than just the leading ride-hailing and mobile payments platform in Southeast Asia. We use data and technology to improve everything from transportation to payments and financial services across a region of more than 620 million people. We work with governments, drivers, passengers, merchants, and the community, to solve critical problems in Southeast Asia.
Grab began as a taxi-hailing app in 2012, but we have since extended our product platform to include GrabCar, GrabShare, GrabBike, GrabHitch, GrabExpress, GrabFood, GrabCoach, GrabShuttle, and GrabCycle. We recently launched our fintech platform, GrabFinancial, which consists of payments, lending, and insurance. Our latest addition is GrabVentures, an in-house incubation platform. We are focused on pioneering new commuting and payment alternatives for drivers and passengers with an emphasis on convenience, safety, and reliability. Currently, we offer services in 8 countries. Our R&D offices are in Singapore, Seattle, Beijing, Bangalore, Jakarta, Kuala Lumpur, Ho Chi Minh City, and Cluj-Napoca. We aspire to unlock the true potential of Southeast Asia and look for like-minded individuals to join us on this ride.
If you share our vision of driving Southeast Asia forward, apply to join our team today!