What You’ll Be Doing (aka Fun with Data):
- Designing, developing, and implementing data pipelines and ETL processes (think of it as building the Large Hadron Collider of data)
- Collaborating with clients to understand their business requirements (or as I call it, decoding their quantum superpositions of needs)
- Working closely with cross-functional teams, ensuring that everyone’s code plays nice together
- Collecting and integrating data from various sources like databases, APIs, external providers, and even mysterious streaming sources (cue X-Files theme)
- Aggregating unstructured data into a structured format for data warehousing, because chaos is cool in physics, not in data storage
- Optimizing data schemas and ensuring data quality and integrity, because Schrödinger’s cat should be alive OR dead, not both in your dataset
- Processing and analyzing massive datasets (which is just a fancy way of saying, “You’ll be a data wizard”)
- Implementing and understanding Data Architecture, including data in motion, data at rest, and the deep philosophical question: Why do we store so much data?
- Keeping up with the latest trends in data engineering, because being outdated is so dial-up

What You Need to Succeed (aka Your Superpowers):
- Strong proficiency in SQL (obviously)
- Proficiency in at least one programming language: Python, Scala, or Java. (Bonus points if you can argue why one is superior.)
- Solid experience working with relational databases (or at least knowing why they still matter)
- Hands-on experience with cloud-based data platforms like AWS, Azure, or Google Cloud (because everything is in the cloud now, even your grandma’s recipes)
- Expertise in Data Modelling and Database Design
- Experience in designing and implementing efficient ETL pipelines, because data teleportation isn’t a thing yet

Bonus Skills (Not Required, But Will Impress Me):
- Experience with Snowflake and Matillion (because who doesn’t love cool-sounding tools?)
- Knowledge of NoSQL databases like MongoDB or Cassandra (because sometimes SQL just doesn’t cut it)
- Experience with distributed systems like Hadoop and Spark (because Big Data is like the universe: ever-expanding)
- Familiarity with Apache NiFi (it’s not sci-fi, but it’s close)
- Exposure to MapReduce, Hive, Pig, or HBase (if you know these, you’re already cooler than most people I know)
- Understanding of operating systems like UNIX, Linux, and Windows (because knowing one OS is so basic)

Qualifications I’m Looking For (Yes, There’s a Bit of a Nerd Filter):
Essential:
- BSc in Computer Science or Information Technology
Preferred:
- BSc Honours in Computer Science or Information Technology
- BEng / BSc Engineering

Who I’m Looking For (Aka “Are You My Data Jedi?”)
I need someone who:
- Finished their studies within the prescribed time (because efficiency is sexy)
- Scored predominantly 70% and above in their last two years of study (because, like it or not, grades matter in tech)

If this sounds like you and you’re ready to level up your data engineering career, let’s talk. I promise it’ll be more fun than arguing about Star Wars plot holes. Apply today and let’s geek out about data.

Although you may not meet all the requirements needed, please still consider sending through your CV, as we have many roles available and could possibly find another great opportunity for you.

Contact Selandea McGuinness on
Graduate Data Engineer position available in Gauteng, Johannesburg. The job was posted on 2025-02-25 in the Graduate category.