Menu Close

Johannesburg: Site Reliability Engineer (Sre) (Remote) posted by Datafin

Site Reliability Engineer (Sre) (Remote)

Posted on 2025-03-29

Employer Datafin
Salary 0
Category It Computer
Location Gauteng  /  Johannesburg

Job Summary

Site Reliability Engineer (SRE) (Remote)Engineering/Technical ~ IT – Software Development
Durban – KwaZulu Natal – South Africa, Johannesburg – Gauteng – South Africa, Cape Town – Western Cape – South Africa, Remote

ENVIRONMENT:
AN analytical thinking & solutions-driven Site Reliability Engineer is sought to join the Remote team of a dynamic provider of a unique and powerful range of LegalTech Solutions. Your core role will entail being responsible for
ensuring the reliability, scalability, and performance of our infrastructure, collaborating with Development teams, and driving continuous improvement in system operations. The ideal candidate must preferably have a Masters/Bachelors Degree in Computer Science, Engineering, or a similar qualification with relevant Certifications and at least 5 years work experience in Site Reliability Engineering, DevOps, or a related field. You will also require
extensive experience with cloud services such as OCI, AWS, Google Cloud, or Azure & be proficient in Scripting languages (Python, Bash, etc.) and Configuration Management tools (Terraform, Ansible, Chef, Puppet).
DUTIES:
Infrastructure Management
  • Design, build, and maintain highly available and scalable infrastructure using cloud platforms (OCI, AWS, GCP, Azure) and on-premises environments.
Monitoring & Incident Response
  • Implement and maintain monitoring, logging, and alerting systems to detect and respond to system issues promptly.
  • Lead incident response efforts and perform root cause analysis.
Automation
  • Develop and deploy automation tools to streamline operations, reduce manual intervention, and improve system reliability.
Performance Optimization
  • Analyse system performance metrics and make recommendations to improve application and infrastructure performance.
Security & Compliance
  • Ensure systems meet security, compliance, and regulatory requirements by implementing best practices and conducting regular audits.
Collaboration
  • Work closely with Development teams to ensure new features and services are scalable, reliable, and maintainable.
Disaster Recovery
  • Develop and maintain Disaster Recovery plans, including data backups and system redundancy strategies.
Continuous Improvement
  • Identify areas for improvement in the existing infrastructure, propose, and implement solutions to enhance system reliability and performance.
Documentation
  • Create and maintain detailed documentation for system configurations, procedures, and processes.
REQUIREMENTS:
  • Minimum of 5 years of experience in Site Reliability Engineering, DevOps, or a related field.
  • Extensive experience with cloud services such as OCI, AWS, Google Cloud, or Azure.
  • Proficiency in Scripting languages (Python, Bash, etc.) and Configuration Management tools (Terraform, Ansible, Chef, Puppet).
  • Experience with monitoring tools (Zabbix, Prometheus, Grafana, Wazuh) and logging systems (ELK stack, Splunk, Elastic).
  • Strong understanding of networking concepts, including DNS, load balancing, firewalls, and VPNs.
  • Experience with Containerization (Docker) and Orchestration tools (Kubernetes).
  • Familiarity with continuous integration/continuous deployment (CI/CD) pipelines and tools like Jenkins, GitLab CI, or CircleCI.
  • Strong background in Linux/Unix system administration.
  • Experience with IT Service Management platforms, optimizing and supporting tools like JIRA, Freshdesk.
  • Proven ability to handle high-pressure incidents and provide clear communication to stakeholders.
Preferred to haves
  • Masters or Bachelors Degree in Computer Science, Engineering, or a related field.
  • Certifications: Relevant certifications such as Oracle Cloud Infrastructure Architect Associate, AWS Certified Solutions Architect or Google Cloud Professional DevOps Engineer.
  • Programming: Experience with software development in languages such as Python, Go, Java, or Ruby.
  • Database Management: Experience managing and optimizing databases (OracleDB, SQL).
  • Experience in High-Traffic Environments: Prior experience working in environments with large-scale, high-traffic systems.
ATTRIBUTES:
  • Excellent communication and problem-solving skills.
  • Someone who is an analytic thinker, who can work effectively in a fast-paced environment.
Apply for this Job

Site Reliability Engineer (Sre) (Remote) position available in Gauteng, Johannesburg. This job position was posted by Datafin. The job has been posted as a premium ad on 2025-03-29 at 16:10:37 in the It Computer category

Click Go Apply to apply online!

View Job  Johannesburg: Senior Sql Dba

You might also like these jobs in the same area.

Apply directly for this position. Please read all instructions carefully.

We do not process job applications; we simply aggregate and display job listings.

More related positions


Pretoria: Senior Site Reliability Engineer posted by WatersEdge Solutions

Senior Site Reliability Engineer (SSRE) Remote (12-Month Contract)We are looking for an experienced Senior Site Reliability Engineer (SSRE) to join a dynamic and innovative team. This is a fully remote contract role where you will be responsible for build


View Job
Senior Site Reliability Engineer

Cape Town: Senior Site Reliability Engineer Cpt posted by Datafin

Senior Site Reliability Engineer - CPTEngineering/TechnicalCape Town - Western Cape - South AfricaENVIRONMENT: A globally recognized brand with a strong strategic vision, dedicated to enhancing people`s lives through innovative technology, is seeking a Sen


View Job
Senior Site Reliability Engineer Cpt

Johannesburg: Site Reliability Engineer (Sre) (Remote) posted by Datafin

Site Reliability Engineer (SRE) (Remote)Engineering/Technical ~ IT - Software DevelopmentDurban - KwaZulu Natal - South Africa, Johannesburg - Gauteng - South Africa, Cape Town - Western Cape - South Africa, RemoteENVIRONMENT: AN analytical thinking & solu


View Job
Site Reliability Engineer (Sre) (Remote)

Cape Town: Site Reliability Engineer (Mid) Cpt posted by Datafin

Site Reliability Engineer (Mid) - CPTEngineering/TechnicalCape Town - Western Cape - South AfricaENVIRONMENT: A globally recognized brand with a strong strategic vision, dedicated to enhancing lives through cutting-edge technology, is looking for a Mid-Lev


View Job
Site Reliability Engineer (Mid) Cpt

South Africa: Site Reliability Engineer Remote posted by Datafin

Site Reliability Engineer - RemoteEngineering/TechnicalSouth Africa, United Kingdom, RemoteENVIRONMENT: For over a century, our client has been helping generous individuals and the causes they care about make a lasting impact. Today, they support over 30,0

View Job  Sandton: Senior Tailings Engineer posted by Hire Resolve

View Job
Site Reliability Engineer Remote

Pretoria: Systems Engineer/ Site Reliability Engineer posted by Hire Resolve

Position: Systems Engineer/ Site Reliability EngineerHire Resolves client is seeking a skilled and experienced Systems Engineer/Site Reliability Engineer to join their team in Pretoria, Gauteng. The successful candidate will be responsible for ensuring the


View Job
Systems Engineer/ Site Reliability Engineer

Pretoria: Site Reliability Engineer – Midrand / Semi-Remote – Contract – R600 Per Hour

A role for a Site Reliability Engineer has been made available for a candidate that has Java development experience of at least 1 year . (OCA preferable, OCP more so). You will be coordinate with internal and external team members, including QA and BA, and


View Job
Site Reliability Engineer – Midrand / Semi-Remote – Contract – R600 Per Hour

Error making API request.
Share this to someone who needs a job:
Posted in Jobs in Gauteng, Jobs in Johannesburg

More Jobs in Your Area