Site Reliability Engineer (SRE) (Remote)
Posted on 2025-03-29
| Employer | Datafin |
|---|---|
| Salary | 0 |
| Category | IT Computer |
| Location | Gauteng / Johannesburg |
Job Summary
Durban – KwaZulu Natal – South Africa, Johannesburg – Gauteng – South Africa, Cape Town – Western Cape – South Africa, Remote
- Design, build, and maintain highly available and scalable infrastructure using cloud platforms (OCI, AWS, GCP, Azure) and on-premises environments.
- Implement and maintain monitoring, logging, and alerting systems to detect and respond to system issues promptly.
- Lead incident response efforts and perform root cause analysis.
- Develop and deploy automation tools to streamline operations, reduce manual intervention, and improve system reliability.
- Analyse system performance metrics and make recommendations to improve application and infrastructure performance.
- Ensure systems meet security, compliance, and regulatory requirements by implementing best practices and conducting regular audits.
- Work closely with Development teams to ensure new features and services are scalable, reliable, and maintainable.
- Develop and maintain Disaster Recovery plans, including data backups and system redundancy strategies.
- Identify areas for improvement in the existing infrastructure, then propose and implement solutions to enhance system reliability and performance.
- Create and maintain detailed documentation for system configurations, procedures, and processes.
- Minimum of 5 years of experience in Site Reliability Engineering, DevOps, or a related field.
- Extensive experience with cloud services such as OCI, AWS, Google Cloud, or Azure.
- Proficiency in Scripting languages (Python, Bash, etc.) and Configuration Management tools (Terraform, Ansible, Chef, Puppet).
- Experience with monitoring tools (Zabbix, Prometheus, Grafana, Wazuh) and logging systems (ELK stack, Splunk, Elastic).
- Strong understanding of networking concepts, including DNS, load balancing, firewalls, and VPNs.
- Experience with Containerization (Docker) and Orchestration tools (Kubernetes).
- Familiarity with continuous integration/continuous deployment (CI/CD) pipelines and tools like Jenkins, GitLab CI, or CircleCI.
- Strong background in Linux/Unix system administration.
- Experience with IT Service Management platforms, including optimizing and supporting tools such as JIRA and Freshdesk.
- Proven ability to handle high-pressure incidents and provide clear communication to stakeholders.
- Master's or Bachelor's degree in Computer Science, Engineering, or a related field.
- Certifications: Relevant certifications such as Oracle Cloud Infrastructure Architect Associate, AWS Certified Solutions Architect, or Google Cloud Professional DevOps Engineer.
- Programming: Experience with software development in languages such as Python, Go, Java, or Ruby.
- Database Management: Experience managing and optimizing databases (OracleDB, SQL).
- Experience in High-Traffic Environments: Prior experience working in environments with large-scale, high-traffic systems.
- Excellent communication and problem-solving skills.
- An analytical thinker who can work effectively in a fast-paced environment.
Site Reliability Engineer (SRE) (Remote) position available in Gauteng, Johannesburg. This position was posted by Datafin as a premium ad on 2025-03-29 at 16:10:37 in the IT Computer category.