Opleiding: Apache Spark Fundamentals

Get started processing data with Apache Spark and PySpark

With the rise of cloud computing, distributed storage and (big) data processing, many organisations are starting to use Apache Spark for their data processes. Whether it is for data science, data analysis or data engineering, Apache Spark can be the right tool for the job. It is a foundation under Azure Synapse Analytics, Microsoft Fabric and Databricks.
This training aims to walk you through the fundamentals of working with Apache Spark, starting with what it is and how it works. You will then continue to read, transform and write data using PySpark.
Finally, to make sure your code can be safely used in production, there will be an added focus on using development best practices.
What is Spark, where did it come from, why was it created? And how does it work?
Lessons
- History of Apache Spark
- Technical Architecture (Driver, Cluster Manager, Executors)
- RDD and Dataframe
- Pyspark
- Benefits of using Spark
- Running Spark locally

After completing this module, students will be able to:
- Explain how Spark works

To work with data, we first need to retrieve it from wherever it is located. This is done through spark.read.
Lessons
- spark.read
- read options…

Meer...

Nu inschrijven

Informatie aanvragen

€1.610

ex. BTW

Aangeboden door

Info Support

Onderwerp

Apache Spark

Niveau

Duur

2 dagen

Looptijd

14 dagen

Taal

Type product

training

Lesvorm

Klassikaal

Aantal deelnemers

Min: 1

Max: 12

Tijdstip

Overdag

Tijden en locaties

Veenendaal

ma 13 jul. 2026

Veenendaal

do 13 aug. 2026

Keurmerken aanbieder

Microsoft Learning Partner

Cedeo

Cedeo Open

Cedeo Maatwerk

Arne Slot als zondebok: wat organisaties kunnen leren van voetbal

Bert Overbeek

Arne Slot as a scapegoat: Liverpool FC made a major mistake

Bert Overbeek

Van werkvloer tot wereldbeeld (overzicht van de boeken van Bert Overbeek)

Bert Overbeek

Hogere brandstofprijzen en wegtransport: wat zeggen de cijfers?

Walther Ploos van Amstel

Wat betekent Amsterdamse coalitie-akkoord voor stadslogistiek?

Walther Ploos van Amstel

SpaceX gaat naar de beurs. Hoort het aandeel thuis in een logistieke beleggingsportefeuille?

Walther Ploos van Amstel

Beyond Psychological Safety.

Danique Scheepers

Human Capital Governance: wie bewaakt de mens als AI werk gaat verdelen?

Willem E.A.J. Scheepers

Autosapiens en de Verdwijnende Mens in het Proces

Willem E.A.J. Scheepers

JIJ & IK richt zich op oneindige liefde

Gyuri George Vergouw

Management by headcount maakt organisaties vleugellam

Gyuri George Vergouw

Week van de Integriteit: broodnodig of facultatief?

Gyuri George Vergouw

Prestaties van AI worden verkeerd beoordeeld

Leon Dohmen

Multisourcing en de illusie van ontzorgen

Leon Dohmen

De verborgen tijdkillers in IT-projecten

Leon Dohmen

Waar voor je leergeld: Aurora laat de veiligheid zien van loslaten

Bert Overbeek

Kopen uit angst dat je achterop raakt

Bert Overbeek

Nieuwe AI-taalapp belooft van alles

Bert Overbeek

Opleiding: Apache Spark Fundamentals