Job Description
Job Brief:
As a Site Reliability Engineer, you will ensure the reliability, scalability, and performance of our cloud-based systems. You will collaborate with product engineering squads to design resilient solutions, improve system stability, and empower teams to deliver features with confidence. Join an innovative tech company revolutionizing software testing with an AI-powered, codeless automation platform.
Responsibilities:
-
Design and implement cloud-native solutions that meet user needs and business requirements.
-
Propose and execute architectural or system improvements to enhance reliability, stability, and throughput.
-
Collaborate with the SRE team to plan and refine cloud infrastructure enhancements.
-
Respond to system incidents, perform root cause analysis, and implement preventive measures.
-
Build, maintain, and monitor infrastructure using Terraform and AWS, while supporting Java backend systems.
-
Work closely with product teams to ensure robust delivery of features with built-in observability and reliability.
-
Lead incident management activities, including post-incident reviews and documentation.
-
Identify and implement initiatives to improve system reliability, performance, and efficiency.
-
Provide guidance and support to cross-functional teams in areas such as monitoring, CI/CD pipelines, release engineering, and infrastructure as code.
Requirements & Skills:
-
Minimum 5 years of experience with open-source technologies and cloud computing environments, preferably AWS.
-
Experience with containerized and/or serverless architectures (e.g., ECS).
-
Understanding of relational databases (RDBMS), asynchronous message processing pipelines, and software release/migration processes.
-
Programming experience in at least one major language.
-
Ability to document knowledge, conduct workshops, and support team upskilling.
-
Master’s degree (or equivalent work experience) in engineering, computer science, or software discipline.
-
Experience working within a business environment, managing stakeholders, and delivering reliable solutions.
Next Steps:
- Do you consider yourself the ideal candidate for this role? If so, take the next step and apply now. Our team will take care of the rest!