This project provides a ready-to-use environment for Big Data Analytics using PySpark and JupyterLab, fully containerized via Docker Compose.
π GitHub Repo: BigDataAnalytics
- JupyterLab environment with PySpark pre-installed
- Dockerized setup β no need to install anything locally
- Persistent storage for notebooks and datasets
- Easy to extend with additional Python packages
git clone https://github.com/usmanakhtar/BigDataAnalytics.git
cd BigDataAnalytics