Back to Portfolio
Data Engineering and Machine Learning Spark

Data Engineering and Machine Learning Spark

This project demonstrates how to utilize Apache Spark to convert Parquet file data into a CSV format and subsequently train a Random Forest model.

Apache SparkMachine LearningRandom ForestPython

Detailed information is available on GitHub.