Site Reliability Engineer (Sre) (Ch1078) – Fully Remote
Posted on 2025-01-31
Job Summary
Our client is an innovative cloud-based company that leverages its software to address the legal contracting, compliance, and legal practice challenges faced by listed companies and multinationals. They are seeking a Site Reliability Engineer to join their dynamic team of professionals, delivering transformative growth by creating intelligent tech solutions that revolutionize the practice of law.
The ideal candidate will have a background in software engineering, system administration, and experience managing large-scale, high-availability systems. The SRE will be responsible for ensuring the reliability, scalability, and performance of our infrastructure, collaborating with development teams, and driving continuous improvement in system operations.
This role offers a fantastic opportunity to work in a professional environment while enjoying the flexibility of working from home.
Key
Responsibilities:
- Infrastructure Management: Design, build, and maintain highly available and scalable infrastructure using cloud platforms (OCI, AWS, GCP, Azure) and on-premises environments.
- Monitoring & Incident Response: Implement and maintain monitoring, logging, and alerting systems to detect and respond to system issues promptly. Lead incident response efforts and perform root cause analysis.
- Automation: Develop and deploy automation tools to streamline operations, reduce manual intervention, and improve system reliability.
- Performance Optimization: Analyze system performance metrics and make recommendations to improve application and infrastructure performance.
- Security & Compliance: Ensure systems meet security, compliance, and regulatory requirements by implementing best practices and conducting regular audits.
- Collaboration: Work closely with development teams to ensure new features and services are scalable, reliable, and maintainable.
- Disaster Recovery: Develop and maintain disaster recovery plans, including data backups and system redundancy strategies.
- Continuous Improvement: Identify areas for improvement in the existing infrastructure, propose, and implement solutions to enhance system reliability and performance.
- Documentation: Create and maintain detailed documentation for system configurations, procedures, and processes.
We are looking for someone with excellent communication and problem-solving skills, someone who is an analytic thinker, who can work effectively in a fast-paced environment.
Required Skills:
- Experience: Minimum of 5 years of experience in Site Reliability Engineering, DevOps, or a related field.
- Cloud Platforms: Extensive experience with cloud services such as OCI, AWS, Google Cloud, or Azure.
- Automation & Scripting: Proficiency in scripting languages (Python, Bash, etc.) and configuration management tools (Terraform, Ansible, Chef, Puppet).
- Monitoring & Logging: Experience with monitoring tools (Zabbix, Prometheus, Grafana, Wazuh) and logging systems (ELK stack, Splunk, Elastic).
- Networking: Strong understanding of networking concepts, including DNS, load balancing, firewalls, and VPNs.
- Containers & Orchestration: Experience with containerization (Docker) and orchestration tools (Kubernetes).
- CI/CD: Familiarity with continuous integration/continuous deployment (CI/CD) pipelines and tools like Jenkins, GitLab CI, or CircleCI.
- System Administration: Strong background in Linux/Unix system administration.
- ITSM: Experience with IT Service Management platforms, optimizing and supporting tools like JIRA, Freshdesk.
- Incident Management: Proven ability to handle high-pressure incidents and provide clear communication to stakeholders.
Preferred Qualifications:
- Education: bachelors or masters degree in computer science, Engineering, or a related field.
- Certifications: Relevant certifications such as Oracle Cloud Infrastructure Architect Associate, AWS Certified Solutions Architect or Google Cloud Professional DevOps Engineer.
- Programming: Experience with software development in languages such as Python, Go, Java, or Ruby.
- Database Management: Experience managing and optimizing databases (OracleDB, SQL).
- Experience in High-Traffic Environments: Prior experience working in environments with large-scale, high-traffic systems.
General:
- Only shortlisted candidates will be contacted. Should you not hear from us after 30 days you may consider your application unsuccessful
- In keeping with our clients employment equity requirements, only South African citizens will be considered.
- Please include your current salary and salary expectations.
Site Reliability Engineer (Sre) (Ch1078) – Fully Remote position available in Western Cape, Stellenbosch. This job position was posted by Capital H Staffing and Advisory Solutions. The job has been posted as a premium ad on 2025-01-31 at 13:07:00 in the Engineering category
Click Go Apply to apply online!
You might also like these jobs in the same area.
Apply directly for this position. Please read all instructions carefully.
We do not process job applications; we simply aggregate and display job listings.
More related positions
Cape Town: Site Reliability Engineer posted by Tasiso Consulting
What Youll Do:? Lead the Site Reliability Engineering (SRE) and IT Telescope Operations Team.? Collaborate globally with stakeholders.? Manage operations, service delivery, and infrastructure for telescope construction and deployment.? Support advanced IT
View Job
Site Reliability Engineer
Cape Town City Centre: Site Reliability Engineering Manager
Role Overview: As the Site Reliability Engineering Manager, you will oversee the SRE team and work closely with engineering, product, and infrastructure teams to ensure the continuous operation of our platform. You will be responsible for defining and driv
View Job
Site Reliability Engineering Manager
Midrand: Site Reliability Engineer Snr 1917
What Youll Bring to the Table Essential Skills: Container Expertise: Skilled in Kubernetes or similar container orchestration platforms. Unix/Linux Knowledge: Strong understanding of Unix/Linux internals, administration, and networking stack. Networking Ma
View Job
Site Reliability Engineer Snr 1917
Cape Town City Centre: Site Reliability Engineer (Remote)
What Youll Be Doing: As a Site Reliability Engineer, youll be the backbone of our infrastructure, responsible for designing, maintaining, and optimizing high-availability systems Your role will include: Building Scalable Infrastructure: Craft and manage ro
View Job
Site Reliability Engineer (Remote)
Menlyn: Site Reliability Engineer (Advanced) 2076
What Youll Bring to the Table: Essential Skills: Java 11 with strong Object-Oriented Programming skills. Spring Boot for robust application development. Containerization expertise with Kubernetes and Docker . Proficiency in Git/GitHub version control. Comp
View Job
Site Reliability Engineer (Advanced) 2076
Centurion: Site Reliability Engineer
A leading company in the financial industry is looking for a highly skilled Site Reliability Engineer to join their growing IT team. The ideal candidate will bring 8-10 years of experience in software engineering, platform engineering, and working with cro
View Job
Site Reliability Engineer
Menlyn: Site Reliability Engineer (Senior) 2228
Your Journey Starts Here Contract Start Date : 1 March 2025 Contract End Date : 31 December 2027 Location : South Africa Eligibility : South African citizens or valid work permit holders preferred. Why Youll Love This Role Innovate Daily : Work with the la
View Job
Site Reliability Engineer (Senior) 2228
Pretoria: Site Reliability Engineer – Midrand / Centurion/ Semi-Remote – Contract – R582 Per Hour
A role for a Site Reliability Engineer has been made available for a candidate that has Java development experience of at least 1 year . (OCA preferable, OCP more so). You will be coordinate with internal and external team members, including QA and BA, and
View Job
Site Reliability Engineer – Midrand / Centurion/ Semi-Remote – Contract – R582 Per Hour
Stellenbosch: Site Reliability Engineer (Sre) (Ch1078) – Fully Remote posted by Capital H Staffing and Advisory Solutions
Our client is an innovative cloud-based company that leverages its software to address the legal contracting, compliance, and legal practice challenges faced by listed companies and multinationals. They are seeking a Site Reliability Engineer to join their
View Job
Site Reliability Engineer (Sre) (Ch1078) – Fully Remote
South Africa: Site Reliability Engineer – Sandton/ Remote – R1.2M Pa posted by E-Merge
An opportunity has been made available with one the leading banks offering a role as an Multi - Discipline Specialist to join this dynamic team.Looking for Multi - Discipline Specialist with Site Reliability Engineering capabilities to provide guidance and
View Job
Site Reliability Engineer – Sandton/ Remote – R1.2M Pa
Johannesburg: Site Reliability Engineer (Sre) (Remote) posted by Datafin
Site Reliability Engineer (SRE) (Remote)Engineering/Technical ~ IT - Software DevelopmentCape Town - Western Cape ~ Johannesburg - Gauteng ~ Durban - KwaZulu Natal ~ RemoteENVIRONMENT: AN analytical thinking & solutions-driven Site Reliability Engineer is
View Job
Site Reliability Engineer (Sre) (Remote)
Midrand: Site Reliability Engineer Snr 1917 posted by Opensource
What You’ll Bring to the TableEssential Skills:Container Expertise: Skilled in Kubernetes or similar container orchestration platforms.Unix/Linux Knowledge: Strong understanding of Unix/Linux internals, administration, and network
View Job
Site Reliability Engineer Snr 1917
Western Cape: Site Reliability Engineering Manager posted by One Connect Solutions
Role Overview:As the Site Reliability Engineering Manager, you will oversee the SRE team and work closely with engineering, product, and infrastructure teams to ensure the continuous operation of our platform. You will be responsible for defining and drivi
View Job
Site Reliability Engineering Manager
Cape Town City Centre: Site Reliability Engineer (Remote)
As a Site Reliability Engineer, youll be the backbone of our infrastructure, responsible for designing, maintaining, and optimizing high-availability systems. Your role will include: Building Scalable Infrastructure: Craft and manage robust cloud and on-pr
View Job
Site Reliability Engineer (Remote)
Pretoria: Systems Engineer/ Site Reliability Engineer posted by Hire Resolve
Position: Systems Engineer/ Site Reliability EngineerHire Resolves client is seeking a skilled and experienced Systems Engineer/Site Reliability Engineer to join their team in Pretoria, Gauteng. The successful candidate will be responsible for ensuring the
View Job
Systems Engineer/ Site Reliability Engineer
Gauteng: Site Reliability Engineer (Senior) 2228 posted by Opensource
Your Journey Starts HereContract Start Date: 1 March 2025Contract End Date: 31 December 2027Location: South AfricaEligibility: South African citizens or valid work permit holders preferred.Why You’ll Love This RoleInnovate Daily: Work with the latest
View Job
Site Reliability Engineer (Senior) 2228
Gauteng: Site Reliability Engineer (Advanced) 2076 posted by Opensource
What You’ll Bring to the Table:Essential Skills:Java 11+ with strong Object-Oriented Programming skills.Spring Boot for robust application development.Containerization expertise with Kubernetes and Docker.Proficiency in Git/GitHub version control.Com
View Job
Site Reliability Engineer (Advanced) 2076
Western Cape: Site Reliability Engineer (Remote) posted by Communicate Finance
As a Site Reliability Engineer, you’ll be the backbone of our infrastructure, responsible for designing, maintaining, and optimizing high-availability systems. Your role will include:Building Scalable Infrastructure: Craft and manage robust cloud and
View Job
Site Reliability Engineer (Remote)
Centurion: Site Reliability Engineer posted by Network Finance
A leading company in the financial industry is looking for a highly skilled Site Reliability Engineer to join their growing IT team. The ideal candidate will bring 8-10 years of experience in software engineering, platform engineering, and working with cro
View Job
Site Reliability Engineer
Pretoria: Site Reliability Engineer – Midrand / Semi-Remote – R650 Per Hour
An opportunity for a Site Reliability Engineer with DevOps and Jira experience to join a global leading manufacturing business You will be a team member of a larger product team that focusses on the development and support of a several mission-critical com
View Job
Site Reliability Engineer – Midrand / Semi-Remote – R650 Per Hour
Pretoria: Senior Site Reliability Engineer
Senior Site Reliability Engineer (SSRE) – Remote (12-Month Contract) We are looking for an experienced Senior Site Reliability Engineer (SSRE) to join a dynamic and innovative team. This is a fully remote contract role where you will be responsible for bui
View Job
Senior Site Reliability Engineer
Pretoria: Senior Site Reliability Engineer posted by WatersEdge Solutions
Senior Site Reliability Engineer (SSRE) Remote (12-Month Contract)We are looking for an experienced Senior Site Reliability Engineer (SSRE) to join a dynamic and innovative team. This is a fully remote contract role where you will be responsible for build
View Job
Senior Site Reliability Engineer
Western Cape: Site Reliability Engineer (Remote) posted by Communicate Finance
What You’ll Be Doing:As a Site Reliability Engineer, you’ll be the backbone of our infrastructure, responsible for designing, maintaining, and optimizing high-availability systemsYour role will include:Building Scalable Infrastructure: Craft an
View Job
Site Reliability Engineer (Remote)
Pretoria: Site Reliability Engineer posted by Sabenza IT & Recruitment
Primary responsibility is DevOps, with a strong focus on infrastructure, monitoring, debugging, fault-finding and continuous improvement to ensure a stable and reliable service (sub product / software application).Coordinate with internal and external team
View Job
Site Reliability Engineer