Overview
In this program, you’ll develop the skills and knowledge you need to join the rapidly growing data engineering field. With the help of expert instructors and mentors, you’ll design data models, build data warehouses and data lakes, automate data pipelines, and work with Big Data. These skills are in high demand and companies are facing major shortages of data engineering talent. Upon completing the program, you’ll have the skills you need to become a data engineer.
Data Engineering is the foundation for the new world of Big Data. Enroll now to build production-ready data infrastructure, an essential skill for advancing your data career.
Syllabus
- Data Modeling
- Learn to create relational and NoSQL data models to fit the diverse needs of data consumers. Use ETL to build databases in PostgreSQL and Apache Cassandra.
- Cloud Data Warehouses
- Sharpen your data warehousing skills and deepen your understanding of data infrastructure. Create cloud-based data warehouses on Amazon Web Services (AWS).
- Spark and Data Lakes
- Understand the big data ecosystem and how to use Spark to work with massive datasets. Store big data in a data lake and query it with Spark.
- Data Pipelines with Airflow
- Schedule, automate, and monitor data pipelines using Apache Airflow. Run data quality checks, track data lineage, and work with data pipelines in production.
- Capstone Project
- Combine what you've learned throughout the program to build your own data engineering portfolio project.
Taught by
Amanda Moran, Ben Goldberg, Sameh El-Ansary, Olli Iivonen, David Drummond, Judit Lantos, Juno Lee , Rodrigo G., Andrew M., Stanislav V., Eugenio C., Nitheesha T. and Jitesh S.