Big data processing tools – Spark core components (16:11)

Topic

Starts 7 June 2025

This video lesson will give you an understanding of:

  • What Spark is and how it is structured
  • How the core components of Spark (SparkContext and Resilient Distributed Datasets, or RDDs) work
  • Which operations Spark provides for working with large datasets.

By the end of this video lesson, you will:

  • Understand the architecture of Spark and its core components
  • Understand how to use SparkContext and different types of RDDs
  • See examples of transformations and actions that can be applied to data stored in RDDs.
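
As a rough preview of the difference between transformations and actions, here is a minimal single-machine sketch in plain Python. It is not Spark's API: the ToyRDD class and everything in it are hypothetical, written only to show that transformations (here map and filter) describe a computation without running it, while actions (here collect, count and reduce) trigger the work. The method names are borrowed from operations that exist on real Spark RDDs, but distribution, partitioning and fault tolerance are left out entirely.

```python
from functools import reduce as _reduce


class ToyRDD:
    """Hypothetical single-machine stand-in for an RDD (not Spark's API)."""

    def __init__(self, source):
        # source is a zero-argument callable that returns a fresh iterator
        self._source = source

    @classmethod
    def parallelize(cls, data):
        # Copy the input once so the dataset can be iterated repeatedly
        items = list(data)
        return cls(lambda: iter(items))

    # Transformations: build a new ToyRDD, compute nothing yet
    def map(self, fn):
        return ToyRDD(lambda: (fn(x) for x in self._source()))

    def filter(self, pred):
        return ToyRDD(lambda: (x for x in self._source() if pred(x)))

    # Actions: walk the chain of transformations and produce a result
    def collect(self):
        return list(self._source())

    def count(self):
        return sum(1 for _ in self._source())

    def reduce(self, fn):
        return _reduce(fn, self._source())


numbers = ToyRDD.parallelize(range(1, 11))

# Two chained transformations; no element has been processed at this point
evens_squared = numbers.filter(lambda x: x % 2 == 0).map(lambda x: x * x)

# Actions force evaluation
result = evens_squared.collect()
total = evens_squared.reduce(lambda a, b: a + b)
n = evens_squared.count()
```

In this sketch every action re-runs the whole chain from the original data, which is a deliberate simplification rather than a description of how Spark schedules work.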

Instructor

Marcos Barreto
Assistant Professor of Data Science, London School of Economics and Political Science