AWS Data Engineer with Hadoop experience

 

Recruiter:

HR Genie

Job Ref:

AWSDATA

Date posted:

Tuesday, July 20, 2021

Location:

Johannesburg, South Africa

Salary:

Market Related


SUMMARY:
Design, build and operationalize large scale enterprise data solutions and applications using one or

POSITION INFO:

Job Description

Responsibilities:

  • Design, build and operationalize large scale enterprise data solutions and applications using one or more of AWS data and analytics services in combination with 3rd parties - Spark, EMR, DynamoDB, RedShift, Kinesis, Lambda, Glue, Snowflake.
  • Analyze, re-architect and re-platform on-premise data warehouses to data platforms on AWS cloud using AWS or 3rd party services.
  • Design and build production data pipelines from ingestion to consumption within a big data architecture, using Java, Python, Scala.
  • Design and implement data engineering, ingestion and curation functions on AWS cloud using AWS native or custom programming.
  • Perform detail assessments of current state data platforms and create an appropriate transition path to AWS cloud.
  • Design, implement and support an analytical data infrastructure providing ad-hoc access to large datasets and computing power.
  • Interface with other technology teams to extract, transform, and load data from a wide variety of data sources using SQL and AWS big data technologies.
  • Creation and support of real-time data pipelines built on AWS technologies including Glue, Redshift/Spectrum, Kinesis, EMR and Athena
  • Continual research of the latest big data and visualization technologies to provide new capabilities and increase efficiency
  • Working closely with team members to drive real-time model implementations for monitoring and alerting of risk systems.
  • Collaborate with other tech teams to implement advanced analytics algorithms that exploit our rich datasets for statistical analysis, prediction, clustering and machine learning
  • Help continually improve ongoing reporting and analysis processes, automating or simplifying self-service support for customers

Qualifications

  • Bachelor''s Degree in Computer Science, Information Technology or other relevant fields
  • Has experience in any of the following AWS Athena and Glue Pyspark, EMR, DynamoDB, Redshift, Kinesis, Lambda, Snowflake
  • Proficient in AWS Redshift, S3, Glue, Athena, DynamoDB, EMR
  • Knowledge of software engineering best practices across the development lifecycle, including agile methodologies, coding standards, code reviews, source management, build processes, testing, and operations

Work Experience

  • Advanced working SQL knowledge and experience working with relational databases, query authoring (SQL) as well as working familiarity with a variety of databases.
  • Experience building/operating highly available, distributed systems of data extraction, ingestion, and processing of large data sets
  • Experience working with distributed systems as it pertains to data storage and computing
  • Experience performing root cause analysis on internal and external data and processes to answer specific business questions and identify opportunities for improvement.
  • Strong analytic skills related to working with unstructured datasets.
  • Build processes supporting data transformation, data structures, meta data, dependency and workload management.
  • A successful history of manipulating, processing and extracting value from large disconnected data sets.
  • Working knowledge of message queuing, stream processing, and highly scalable Big Data, data stores.
  • Strong project management and organizational skills.
  • Experience supporting and working with cross-functional teams in a dynamic environment.
  • Experience in a Data Engineer or similar roles
  • Experience with big data tools is a must: Hadoop, Spark, Kafka, etc.
  • Experience with relational SQL and NoSQL databases, including Postgres and Cassandra.
  • Experience with data pipeline and workflow management tools: Azkaban, Luigi, Airflow, etc.
  • Experience with AWS cloud services: EC2, EMR, RDS, Redshift
  • Experience with stream-processing systems: Storm, Spark-Streaming, etc.
  • Experience with object-oriented/object function scripting languages: Python, Java, C++, Scala, etc


 

NB! This job is now closed. You can apply for other jobs by uploading your CV.



 

 

 

Similar jobs you might be interested in:

AWS Data Engineer (Senior) 2517
Location: Pretoria
Salary: Monthly
aws data engineer
9 days ago


AWS Data Engineer (Expert)
Location: Centurion
Salary:
Join our client as an Expert aws data engineer in South Africa. Design, implement, and optimize Big data Pipelines using aws services. Ensure data integrity, security, and compliance while collaborating with teams to integrate solutions. Bring your expertise in aws technologies and data engineering to drive impactful data solutions.
11 days ago


AWS Data Engineer - 1432
Location: Pretoria
Salary: Hourly
Contract Starts: 01.06.2024Contract Ends: 31.12.2026Location: Midrand/Menlyn/Rosslyn/Home Office rotation
30 days ago


Data Engineer (AWS and Redshift) - Johannesburg – up to R1mil
Location: Johannesburg
Salary: 1000000
data engineer (aws and Redshift) - Johannesburg – up to R1mil
23 days ago


AWS Data Engineers Required - Project Based - Contractual-to start asap-JHB - Hybrid
Location: Midrand
Salary: Market related
aws, data, engineers, Required, Project, Based, Contractual, to, start, asap, JHB, Hybrid
12 days ago


Intermediate Data Engineer - Johannesburg – up to R700k per annum
Location: Johannesburg
Salary: 700000
Intermediate data engineer - Johannesburg – up to R700k per annum
2 days ago


Cloud Engineer - FinOps
Location: Sandton
Salary: R800k - 950k per year
Cloud engineer with relevant experience, qualifications + azure experience
3 days ago


Platform Engineer (Mulesoft Integration)
Location: Johannesburg
Salary: Hourly
We are in search of a PLATFORM engineer with expertise in Mulesoft Integration for one of our banking industry clients, for a 12-month rolling employment contract. The incumbent will be joining the Platforms and engineering Gateway technology tribe responsible for the development, design and run of the Mulesoft API and Integration platform residing on‐premises and aws Cloud.Apply platform e...
5 days ago


Data engineer – Johannesburg – up to R900k
Location: Johannesburg
Salary: 900000
data engineer – Johannesburg – up to R900k
5 days ago


Lead Product Engineer
Location: Johannesburg
Salary: Hourly
We are currently recruiting for a LEAD PRODUCT engineer for one of our clients in the Banking industry, for a 6-month rolling employment contract.
9 days ago


Create a free job alert for AWS Data Engineer with Hadoop experience in Johannesburg

Enter your email address below and we will email you similar jobs when they become available:

You can cancel at any time. We will not spam you.
By giving us your email address your agree to our Terms and Conditions