PySpark Tutorial

freeCodeCamp
Free Online Course
English
1-2 hours' worth of material
Self-paced

Overview

Learn PySpark, the Python interface for Apache Spark. PySpark is often used for large-scale data processing and machine learning.

Syllabus

PySpark Introduction.
PySpark DataFrame Part 1.
PySpark Handling Missing Values.
PySpark DataFrame Part 2.
PySpark GroupBy and Aggregate Functions.
PySpark MLlib Installation and Implementation.
Introduction To Databricks.
Implementing Linear Regression Using Databricks on a Single Cluster.

Taught by

freeCodeCamp.org