Menu Close

Johannesburg: Site Reliability Engineer

Advertisement



Job Description

Job Purpose The Site Reliability Engineer is responsible for ensuring the high availability, reliability, and performance of AWS-centric microservices platform supporting analytics and market-data products delivered to global brokers. This role is deeply technical, requiring strong AWS expertise and Python proficiency to automate operations, debug production services, optimize performance, and support continuous delivery in a 24×7 financial services environment where uptime is mission-critical. Reporting to the IT Operations Manager, this position demands independent technical decision-making and the ability to exercise sound judgment when responding to critical incidents. The SRE operates with significant autonomy in assessing system performance, diagnosing complex issues, and making critical determinations that impact service availability across a high-traffic, globally distributed infrastructure.  Responsibilities AWS Infrastructure Monitoring and Incident Response Monitor and manage AWS services supporting production workloads (ECS/EKS, EC2, Lambda, API Gateway, SQS/SNS, RDS, ElastiCache, CloudFront) Respond to alerts from CloudWatch, Datadog, and custom monitoring scripts with urgency and precision Exercise independent judgment in assessing incident severity and determining appropriate response strategies Diagnose scaling, networking, and performance issues in distributed AWS systems Perform incident response, ensuring rapid recovery and minimal downtime Coordinate with development teams during critical incidents and outages, serving as the technical authority for infrastructure decisions Python-Driven Troubleshooting and Automation Write Python scripts and tools to automate operational tasks, system checks, and data validation routines. Analyze Python microservice behavior by reading logs, debugging issues, and profiling performance Build or enhance internal CLI tools to improve support workflows Use Python to interrogate APIs, AWS resources (via boto3), and production data sources Independently assess automation opportunities and implement solutions to reduce manual workload Production Systems and Data Flow Stability Maintain stability across charting engines, data ingestion pipelines, market-data feeds, scanning engines, and sentiment analysis services Investigate failures in REST APIs, WebSocket streams, and asynchronous workers Validate deployments and configurations for AWS-based microservices Ensure data completeness and accuracy across instruments, markets, and broker-specific configurations Make real-time decisions on system changes, maintenance windows, and emergency response procedures Collaboration and Continuous Operations Improvement Work with DevOps engineers to refine CI/CD pipelines, infrastructure-as-code workflows, and AWS deployment patterns Collaborate with backend teams to improve microservice reliability and observability Provide feedback on Python code, error-handling logic, and operational robustness Contribute to post-incident root cause analyses and propose architectural or automation improvements Participate in an on-call rotation to provide round-the-clock infrastructure support Documentation and Runbook Management Maintain detailed operational documentation, AWS service runbooks, and troubleshooting guides Build automated checks and self-healing routines where feasible Drive SRE best practices across the team Document configurations, standards, and operational procedures that align with industry best practices Experience Requirements 2 years in production support, SRE, or DevOps with a strong AWS and Python footprint Demonstrated ability to exercise independent judgment in high-pressure situations and make critical decisions affecting system availability Strong Python scripting and debugging skills (must be able to analyse stack traces, write scripts, automate workflows) Strong analytical mindset and exceptional problem-solving ability Calm, structured communication during incidents Ability to work cross-functionally with DevOps, developers, QA, and product staff Keen attention to detail and strong ownership of production systems Comfortable working in a high-availability, high-traffic environment Off-hours support and coverage as part of on-call rotation  Technical Expertise AWS Services: ECS or EKS (service deployments, scaling behaviour, debugging containers) EC2, Lambda, API Gateway SQS/SNS messaging patterns RDS (PostgreSQL/MySQL), DynamoDB S3 and CloudFront IAM, KMS, networking (VPC, subnets, security groups) Monitoring Observability: CloudWatch, Datadog, Grafana, OpenSearch/Kibana Infrastructure DevOps: Docker containerization Infrastructure-as-code (CloudFormation, Terraform) CI/CD pipelines (CodePipeline, GitHub Actions, GitLab CI, or similar) Development Data: REST APIs, WebSocket protocols, asynchronous workers, distributed system behaviour SQL proficiency and performance investigation for relational databases MongoDB with JavaScript proficiency  Preferred Qualifications Bachelor’s degree in Computer Science, Information Systems, Engineering, or equivalent practical experience Experience in fintech, trading systems, or market-data streaming Python experience with data processing, concurrency (asyncio), or task queues (Celery/RQ) Exposure to Kinesis, Kafka, or other event-streaming platforms Familiarity with FastAPI-based microservices Experience with cost optimisation and AWS Well-Architected practices Understanding of foreign exchange markets and trading platform requirements  The ideal candidate will demonstrate: Operational Excellence: Reduced production incidents and improved uptime through proactive monitoring and rapid incident response Automation Focus: Faster MTTD and MTTR through automation and AWS-driven improvements Technical Impact: Operational tooling and Python automation that significantly reduces manual workload Collaboration: Positive feedback from internal teams and external broker partners Ownership: Strong sense of accountability for production system health and reliability

View Job  Cape Town Region: Senior Software Engineer posted by Hire Resolve

How to Apply

Click the green “Go Apply” button below to apply directly online with the employer.

About IT / Computing / Software Jobs in Gauteng

In Gauteng, the IT and computing industry is a significant contributor to the province’s economy, with a steady demand for skilled professionals. Typically, this field is characterized by rapid technological advancements, innovative projects, and a high level of job satisfaction among employees. Generally, career prospects in this sector are strong, with a range of opportunities available across various industries.

Salaries for IT and computing professionals in Gauteng can vary widely depending on factors such as experience, company size, and industry sector. While it is common to see salaries ranging from R400 000 to R1,200 000 per annum, these figures are only a general guideline and actual salaries may differ significantly. For instance, senior executives or technical leads with extensive experience may command higher salaries, while entry-level positions may start at lower levels.

Common skills required for IT and computing roles in Gauteng include proficiency in programming languages such as Java, Python, or C++, knowledge of cloud platforms like AWS or Azure, and experience with agile development methodologies. Additionally, understanding of data analysis and interpretation, cybersecurity principles, and IT project management are often essential. Familiarity with popular software applications, such as Microsoft Office 365 or Google Workspace, is also beneficial.

The financial services sector, technology industry, manufacturing sector, and government departments are among the common industries that employ IT and computing professionals in Gauteng. These sectors require a range of skills and expertise, from data analysis to network administration, making them attractive options for career development.

View Job  Johannesburg: JUNIOR GRAPHIC DESIGNER & CONTENT CREATOR (JOHANNESBURG)

For those seeking a career in IT and computing, there are numerous opportunities for professional growth and development. Typically, career progression involves moving into senior roles or specialized positions, such as technical lead, solutions architect, or data scientist. Opportunities for further education and training, such as certifications or postgraduate studies, can also enhance job prospects. Generally, a strong understanding of industry trends, networking, and continuous learning are essential for success in this field.


This information provides general career guidance. Actual salaries and requirements vary by employer.



GO APPLY NOW

Safe & secure application process

Advertisement


Advertisement



Get Similar Job Alerts
Enter your WhatsApp number (e.g., +27740908132)
You'll receive alerts for new jobs in Johannesburg, Gauteng


Job Seeker Tip

Use the STAR method (Situation, Task, Action, Result) when answering behavioral interview questions.

How to Apply

Click “GO APPLY NOW” to visit the company’s application page.
Follow their instructions carefully.

JVR Jobs connects you with employers – we don’t process applications directly.

Latest Job Opportunities

Tokai: Fitness Instructor posted by Virgin Active South Africa Pty Ltd

Jobs in Cape Peninsula March 1, 2026
Customer service support Virgin Active South Africa Pty Ltd

Your Purpose To inspire people to live active lives The role of the Fitness Instructor is to promote our world-class…

View Job

Sandton: Field Sales Consultant posted by Palanga Publishing & Training Academy

Jobs in Central Johannesburg Remote March 1, 2026
Palanga Publishing & Training Academy Sales

Job Title: Field Sales Representative Salary: It is a basic plus sales commission and performance bonuses. The basic is R14,000.00.…

View Job

Cape Town Region: Personal Trainer posted by Virgin Active South Africa Pty Ltd

Jobs in Cape Peninsula March 1, 2026
Customer service support Virgin Active South Africa Pty Ltd

Virgin Active is seeking certified and passionate Personal Trainers who want to build a successful fitness business within our world-class…

View Job

Bellville: Quality Controller posted by Isilumko Staffing

Jobs in Bellville March 1, 2026
Engineering Isilumko Staffing

Job Title: Quality Controller Location: Bellville South Job Summary: We are seeking a detail-oriented and experienced Quality Controller to join…

View Job

Cape Town Region: Intermediate .Net Developer (with Blazor) posted by Rosstone

Jobs in Cape Peninsula March 1, 2026
Computing IT Rosstone Software

We are seeking a skilled and motivated Intermediate level .Net Developer to join our dynamic development team. The ideal candidate…

View Job

Western Cape: Junior Merchandiser | Managed People Solutions | Worcester Western Cape posted by Managed People Solutions

Jobs in South Africa March 1, 2026
general Managed People Solutions Other

Are you eager to learn, energetic, and ready to gain real workplace experience? Were looking for young, motivated Junior Merchandisers…

View Job

Browse Employers

View All

Job Alerts


Share this to someone who needs a job:
Posted in Jobs in Central Johannesburg, Jobs in Gauteng, Jobs in Johannesburg, Jobs in South Africa

More Jobs in Your Area