Spark Batch Processing - Caching, DataFrame, Dataset, SparkSQL, and Bucketing in Iceberg (Day 2 Lab)
Data Engineering Boot Camp V2 Combined Track
Week 4: Batch Pipelines with Apache Spark V2
63 mins
SQL
Data Modeling
ETL/ELT
Apache Spark
Docker
Apache Iceberg
Description
In this Spark lab session, we cover caching, the DataFrame and Dataset APIs, and SparkSQL, then explore how bucketing works in Iceberg for efficient data management.
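As a rough illustration of the bucketing idea, the sketch below assigns values to buckets in plain Python, with no Spark or Iceberg installation required. It assumes the rule described in the Iceberg table spec: hash the value with 32-bit Murmur3 (x86 variant, seed 0), clear the sign bit, and take the result modulo the number of buckets, with integers hashed as 8-byte little-endian longs and strings as UTF-8 bytes. The function names `murmur3_x86_32` and `iceberg_bucket` and the sample user IDs are hypothetical, chosen only for this example; this is not the code used in the lab or in Iceberg itself.

```python
import struct

MASK32 = 0xFFFFFFFF


def _rotl32(x: int, r: int) -> int:
    """Rotate a 32-bit integer left by r bits."""
    return ((x << r) | (x >> (32 - r))) & MASK32


def murmur3_x86_32(data: bytes, seed: int = 0) -> int:
    """Pure-Python 32-bit Murmur3 (x86 variant); returns an unsigned 32-bit int."""
    c1, c2 = 0xCC9E2D51, 0x1B873593
    h = seed & MASK32
    n_blocks = len(data) // 4

    # Body: mix each 4-byte little-endian block into the hash state.
    for i in range(n_blocks):
        (k,) = struct.unpack_from("<I", data, 4 * i)
        k = (k * c1) & MASK32
        k = _rotl32(k, 15)
        k = (k * c2) & MASK32
        h ^= k
        h = _rotl32(h, 13)
        h = (h * 5 + 0xE6546B64) & MASK32

    # Tail: fold in the remaining 1-3 bytes, if any.
    tail = data[4 * n_blocks:]
    if tail:
        k = int.from_bytes(tail, "little")
        k = (k * c1) & MASK32
        k = _rotl32(k, 15)
        k = (k * c2) & MASK32
        h ^= k

    # Finalization: mix in the length and avalanche the bits.
    h ^= len(data)
    h ^= h >> 16
    h = (h * 0x85EBCA6B) & MASK32
    h ^= h >> 13
    h = (h * 0xC2B2AE35) & MASK32
    h ^= h >> 16
    return h


def iceberg_bucket(value, num_buckets: int) -> int:
    """Assign an int or str to one of num_buckets buckets (assumed spec rule)."""
    if isinstance(value, int):
        raw = struct.pack("<q", value)   # ints hashed as 8-byte little-endian longs
    elif isinstance(value, str):
        raw = value.encode("utf-8")      # strings hashed as UTF-8 bytes
    else:
        raise TypeError("this sketch only handles int and str values")
    # Clear the sign bit so the result is non-negative, then take the modulo.
    return (murmur3_x86_32(raw) & 0x7FFFFFFF) % num_buckets


if __name__ == "__main__":
    # Hypothetical user IDs spread across 16 buckets.
    for user_id in (1, 2, 3, 34, 1000):
        print(user_id, "->", iceberg_bucket(user_id, 16))
```

Because the bucket depends only on the value and the bucket count, two tables bucketed the same way on the same key place matching keys in the same bucket number, which is the property that lets a join on that key avoid a full shuffle.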