Course description

"Apache Airflow on Steroids for Data Engineers" is a comprehensive course designed to equip data engineers with the skills to build robust data pipelines using Apache Airflow, Docker, and Spark clusters.

This course takes a hands-on approach, guiding you through the creation of an end-to-end data engineering project. You'll learn to write basic jobs in multiple programming languages, manage job submission to Spark clusters, and process data at scale.

With a focus on practical application, this course covers setting up your environment, building and compiling jobs in Scala and Java, and analyzing computation results in real time. By the end of this course, you'll have a solid understanding of workflow automation and big data analytics, preparing you to tackle complex data engineering challenges.


What will I learn?

  • Solid understanding of workflow automation
  • Better understanding of Spark jobs
  • Building a master-worker architecture on Docker
  • Better understanding of Docker
  • Writing Spark jobs with Scala
  • Writing Spark jobs with Java
  • Writing Spark jobs with Python

Requirements

£14.99

£24.99

Lectures: 4
Skill level: Beginner
Expiry period: Lifetime
Certificate: Yes
