Mastering Azure Databricks with PySpark: Zero to Job-Ready

Uncategorized
Wishlist Share
Share Course
Page Link
Share On Social Media

About Course

Databricks is one of the most in demand big data tools around. It is a fast, easy, and collaborative Spark based big data analytics service designed for data science, ML and data engineering workflows.

The course is packed with lectures, code-along videos and dedicated challenge sections. This should be more than enough to keep you engaged and learning! As an added bonus you will also have lifetime access to all the lectures… and I have provided detailed notebooks as a downloadable asset, the notebooks will contain step by step documentation with additional resources and links.

I have ensured that the delivery of the course is engaging and concise, the curriculum is extensive yet delivered in an efficient way. The course will provide you with hands-on training utilising a variety of different data sets.

The course is aimed at teaching you PySpark, Spark SQL in Python and the Databricks Lakehouse Architecture.

You will primarily be using Databricks on Microsoft Azure in addition to other services such as Azure Data Lake Storage Gen 2,  Azure Repos and Azure DevOps.

The course will cover a variety of areas including:

  • Set Up and Overview

  • Azure Databricks Notebooks

  • Spark SQL

  • Reading and Writing Data

  • Data Analysis and Transformation with Spark SQL in Python

  • Charts and Dashboards in Databricks Notebooks

  • Databricks Medallion Architecture

  • Accessing Data in Cloud Object Storage

  • Hive Metastore

  • Databases, Tables and Views in Databricks

  • Delta Lake / Databricks Lakehouse Architecture

  • Spark Structured Streaming

  • Delta Live Tables

  • Databricks Jobs

  • Access Control Lists (ACLs)

  • Databricks CLI

  • Source Control with Databricks Repos

  • CI/CD on Databricks

 

By the end of this course, students will:

  • Be confident working with Spark using Python

  • Understand how to use Databricks effectively for data engineering.

  • Build, transform, and analyze large datasets.

  • Be ready for job interviews with real project experience.

Show More

What Will You Learn?

  • Azure Databricks
  • Data Lakehouse
  • Delta Lake
  • Spark SQL
  • PySpark
  • Big Data
  • Real World Scenarios
  • CI/CD on Databricks
  • Source Control with Databricks Repos

Course Content

Welcome & Course Introduction

  • Welcome to the Course
  • What You Will Learn
  • Tools You’ll Need
  • How to Make the Most of This Course

Introduction to Big Data & Spark
Objective: Introduce Spark and its role in big data

Introduction to Azure Databricks
Objective: Setup and explore Databricks UI

PySpark Basics
Objective: Build strong Python + Spark foundations

Spark SQL Basics
Objective: Use SQL within Databricks

Data Transformations with PySpark
Objective: Learn transformation techniques

File Formats & Storage
Objective: Work with different data formats

Data Engineering Best Practices
Objective: Write scalable, production-ready code

Projects (Hands-On End-to-End)
Objective: Apply everything practically

Integration with Azure Services
Objective: Learn basic integrations

Databricks Job Scheduling & Automation
Objective: Production-like automation

Career Guidance & Next Steps

Student Ratings & Reviews

No Review Yet
No Review Yet