2023

Data Engg Gaming Industry Analysis

Overview

This data engineering project analyzes a dataset related to the gaming industry. The dataset is stored in an AWS S3 bucket, which is mounted to a Databricks workspace. In Databricks, the data is loaded into a Spark DataFrame and queried with SparkSQL to extract insights. Open-source project by Rohan Kumariya, published on GitHub.
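
The workflow described above (mount the S3 bucket, load a DataFrame, query it with SparkSQL) might look roughly like the following sketch. It is not taken from the project's notebook. The bucket name, mount point, file name, view name, and column name are all hypothetical, and it assumes the file is a CSV with a header row and that the cluster already has credentials for the bucket (for example through an instance profile).

```python
def s3_source(bucket: str) -> str:
    """Build the s3a:// URI passed to dbutils.fs.mount as the mount source."""
    return f"s3a://{bucket}"


def top_n_query(view: str, group_col: str, n: int) -> str:
    """Build a SparkSQL query that counts rows per group, largest groups first."""
    return (
        f"SELECT {group_col}, COUNT(*) AS row_count "
        f"FROM {view} "
        f"GROUP BY {group_col} "
        f"ORDER BY row_count DESC "
        f"LIMIT {n}"
    )


def analyze(spark, dbutils, bucket, mount_point, file_name, group_col, n=10):
    """Mount the bucket, load the file into a DataFrame, and run one query.

    `spark` and `dbutils` are the objects Databricks predefines in a notebook;
    they are passed in explicitly so the function has no hidden globals.
    """
    # Mount only if this mount point is not already in use.
    already_mounted = any(m.mountPoint == mount_point for m in dbutils.fs.mounts())
    if not already_mounted:
        dbutils.fs.mount(source=s3_source(bucket), mount_point=mount_point)

    # Assumption: the dataset is a CSV file with a header row.
    df = spark.read.csv(f"{mount_point}/{file_name}", header=True, inferSchema=True)

    # Register the DataFrame under a hypothetical view name so SQL can reference it.
    df.createOrReplaceTempView("games")
    return spark.sql(top_n_query("games", group_col, n))
```

In a Databricks notebook this could be called as `analyze(spark, dbutils, "example-gaming-bucket", "/mnt/gaming", "games.csv", "genre").show()`, where every argument after `dbutils` is a placeholder to be replaced with the real bucket, path, and column.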

Highlights

  • Primary language: Jupyter Notebook
  • Open source: view the code and contribute on GitHub

Built with

  • Jupyter
