Course description

Embark on a comprehensive journey into the world of modern data engineering.

This tutorial is meticulously crafted to guide you through the integration of platforms like Reddit, Apache Airflow, Celery, Postgres, and Amazon's suite of data services.

Begin by extracting data from Reddit using its API, then set up and orchestrate ETL processes with Apache Airflow and Celery. Dive into efficient data storage with Amazon S3 and leverage the power of AWS Glue for data cataloging and ETL jobs.

The tutorial further delves into querying and transforming data with Amazon Athena, setting up the Redshift Cluster, and understanding the best practices for loading data into Amazon Redshift for insightful analytics.

What will i learn?

Requirements

  • Basic Knowledge of Apache Airflow
  • Basic knowledge of Docker

£14.99

£24.99

Lectures

1

Skill level

Beginner

Expiry period

Lifetime

Certificate

Yes

Related courses