All projects

2026

MLB Daily Games ETL Pipeline

This project implements an end-to-end ETL pipeline using Databricks, PySpark, and Delta Lake. The pipeline extracts daily MLB game data from an API,…

Overview

This project implements an end-to-end ETL pipeline using Databricks, PySpark, and Delta Lake. The pipeline extracts daily MLB game data from an API, processes it through multiple layers, and stores it in structured tables ready for analytics. Open-source project by Stanly Fernandez, published on GitHub.

Highlights

  • 1 star on GitHub
  • Primary language: Jupyter
  • Open source — view the code and contribute on GitHub

Built with

  • Jupyter

Discussion (0)

Log in to comment.

No comments yet. Be the first to start the conversation.