Menu Close

Cape Town City Centre: Web Scraping Engineer posted by PST.AG

Advertisement



Job Description

Role Overview: Data Extraction Engineer designs extraction systems (and not just scripts). They build and maintain a next-generation data acquisition platform that treats web scraping as a declarative, specification-driven discipline. Instead of hard-coding XPaths for every site, Web Scraping Developer defines what data is needed—using schemas, natural language descriptions, or visual blueprints—and lets intelligent pipelines figure out how to get it. Key Responsibilities: Specification-Driven Extraction Engineering- Design and maintain declarative extraction specifications—using Pydantic models, JSON schemas, or domain-specific languages—that describe exactly which fields to capture, their types, and validation rules. Implement pipelines that translate these specifications into executable extraction plans, leveraging both classical (Scrapy, Playwright) and AI-augmented (LLM-based semantic parsing) backends. Build reusable specification libraries for recurring data types (product prices, tariff codes, regulatory texts) to accelerate onboarding of new sources. Autonomous & Self-Healing Systems- Deploy self-healing spiders that automatically detect website layout changes and repair themselves using Model Context Protocol (MCP) servers (e.g., Scrapy MCP Server, Playwright MCP). Integrate semantic extraction (Scrapy-LLM, custom LLM pipelines) to eliminate selector brittleness—spiders rely on field descriptions, not fragile XPaths. Orchestrate complex, multi-step browsing workflows with agentic frameworks (BMAD/TEA, AutoGPT-like agents) that reason about page state, adapt to anti-bot measures, and correct their own behaviour in real time. Platform Thinking & Reusability- Move beyond one-off scrapers: build a component-based extraction platform where selectors, login handlers, and pagination logic are shared, versioned, and tested. Implement monitoring, alerting, and automatic rollback for failed extraction runs. Champion ethical crawling by design—rate limiting, robots.txt respect, and compliance with GDPR/CCPA are built into the specification layer, not retrofitted. Collaboration & Continuous Innovation- Partner with data scientists and domain experts to refine extraction specifications for complex, unstructured domains (e.g., legal texts, tariff classifications). Evaluate and pilot emerging tools to push automation coverage beyond 90%. Document and evangelise specification-driven best practices across the engineering organisation. Candidate Profile: Education and Experience – Bachelor’s degree in Computer Science 3 years of experience in web scraping or data extraction Skills and competences- Specification-Driven Extraction – Experience defining extraction requirements via schemas (Pydantic, JSON Schema) and executing them through both traditional crawlers and LLM-based semantic parsers. Self‑Healing & Semantic Extraction – Hands‑on use of Scrapy‑LLM, Scrapy MCP Server, or similar systems that decouple field definitions from page structure. Agentic Workflows – Familiarity with frameworks that give LLMs browser control (Playwright MCP, BMAD/TEA) to handle complex, non‑deterministic crawling tasks. Classical Scraping Fundamentals – You still know how to write a Scrapy spider or a Playwright script when needed, but you actively seek to replace that work with reusable, specification-driven components. Data Validation & Storage – Ability to define validation rules within specifications and land clean data into SQL/NoSQL databases or data lakes. Python proficiency: the focus is on an extraction engineer who happens to use Python. HTTP, DOM, XPath, CSS. Basic API integration and authentication flows. Preferred / Nice-to-Have Skills: Contributions to open-source scraping or AI-automation projects. Experience training or fine-tuning small LLMs for domain-specific extraction. Familiarity with data privacy engineering (GDPR, CCPA) baked into specification design. DevOps light – Docker, CI/CD for testing extraction specifications. Mindset & Approach (Non-Negotiable): Strong belief that the future of scraping is declarative, not imperative. You’d rather write a schema that says “extract the price” than debug an XPath when a website redesigns. Looking to shift from “code that scrapes” to “systems that understand extraction”.

View Job  Pretoria: Electronic Hardware Engineer

How to Apply

Click the green “Go Apply” button below to apply directly online with the employer.

About Engineering Jobs in Cape Town Region

In the Cape Town region, South Africa, the engineering job market is generally thriving, with a strong demand for skilled professionals across various industries. The field of engineering is in high demand due to the country’s growing infrastructure development and manufacturing sectors. Typically, engineering jobs in this region require a combination of technical expertise, problem-solving skills, and excellent communication abilities.

When it comes to salary expectations, it’s common to expect broad ranges depending on factors like experience, company size, and industry sector. For example, engineers with 0-5 years of experience can typically expect salaries ranging from R400 000 to R700 000 per annum, while those with 10+ years of experience may earn between R800 000 and R1 500 000 per year. However, these figures are general estimates and may vary widely depending on individual circumstances.

Common skills for engineering roles in the Cape Town region include proficiency in CAD software, programming languages like Python or Java, and strong analytical and problem-solving skills. Many employers also place a high value on collaboration and teamwork, as well as excellent communication and leadership abilities. Other valuable skills include project management, data analysis, and adaptability to new technologies.

The engineering industry is diverse and encompasses various sectors, including technology, manufacturing, financial services, and infrastructure development. These industries often require engineers with specialized knowledge, such as mechanical, electrical, or civil engineering expertise. The demand for skilled engineers is high across these sectors, making it an attractive career path for those interested in technical work.

View Job  Cape Town City Centre: Key Accounts Manager

In terms of career progression, engineers can expect opportunities to move into senior roles, lead projects, and take on management positions. Many also pursue further education and training, such as a Master’s degree or certification in a specific field, to advance their careers and stay up-to-date with industry developments.


This information provides general career guidance. Actual salaries and requirements vary by employer.



GO APPLY NOW

Safe & secure application process

Advertisement


Advertisement



Get Similar Job Alerts
Enter your WhatsApp number (e.g., +27740908132)
You'll receive alerts for new jobs in Cape Town City Centre, Cape Town Region


Job Seeker Tip

Always arrive 10-15 minutes early for interviews to show punctuality and reliability.

How to Apply

Click “GO APPLY NOW” to visit the company’s application page.
Follow their instructions carefully.

View Job  Cape Town: HR BUSINESS PARTNER - ZAR 1.2 MILLION pa - DURBANVILLE & GREENPOINT DUAL LOCATION ROLE posted by DO MORE Placements

JVR Jobs connects you with employers – we don’t process applications directly.

Latest Job Opportunities

Johannesburg: Electrical Designer posted by Hire Resolve

Jobs in Central Johannesburg March 29, 2026
Hire Resolve Other Clearing and Forwarding

An exciting opportunity has opened up for a skilled Electrical Designer to join a dynamic and growing engineering team based…

View Job

Cape Town: Sales and FIT Consultant posted by Hire Resolve

Jobs in Cape Peninsula March 29, 2026
Hire Resolve Other Clearing and Forwarding

Are you passionate about travel, adventure, and crafting unforgettable experiences across Africa? Join our client, a proudly African-owned tour operator...

View Job

Johannesburg: Google Ads PPC & Paid Media Specialist (Contract) posted by Hire Resolve

Jobs in Central Johannesburg March 29, 2026
advertising Hire Resolve Media Sales

A marketing consultancy and training company based in South Africa but operating worldwide is seeking a skilled Google Ads PPC…

View Job

Centurion: Customer Success Specialist posted by Hire Resolve

Jobs in Centurion March 29, 2026
Customer service Hire Resolve

An established company that offers advanced geospatial products and services is looking for a Customer Success Specialist to join their…

View Job

Cape Town: Procurement Manager – Solar Industry posted by Hire Resolve

Jobs in Cape Peninsula March 29, 2026
Hire Resolve Warehousing

A leading company in the renewable energy sector is looking for a skilled and experienced Procurement Manager with experience in…

View Job

Kempton Park: Workshop Foreman Coordinator posted by Hire Resolve

Jobs in East Rand March 29, 2026
Hire Resolve Other Clearing and Forwarding

An established food manufacturing group is seeking a highly motivated and detail-oriented Workshop Foreman Coordinator to join their maintenance and...

View Job

Browse Employers

View All

Job Alerts


Share this to someone who needs a job:
Posted in Jobs in Cape Peninsula, Jobs in Cape Town City Centre, Jobs in Cape Town Region

More Jobs in Your Area