My Practice and project on PySpark
-
Updated
Sep 17, 2021 - Jupyter Notebook
My Practice and project on PySpark
ML models predicting wine varieties based on a wine review texts
Stepic Academy: Big Data for Data Science
PySpark is a Python API for support Python with Spark. Whether it is to perform computations on large datasets or to just analyze them
Implementation of K-means,Bisecting K-means and Decision Tree in PySpark on the Iris Dataset.
PySpark ML Heart and Advertisement Data Analysis
This is a tutorial on how to exploit PySpark's Machine Learning library spark.ml in order to run basic statistical analysis and classical machine learning algorithms.
Worked on diffrent Spark classification and regression algorithms
PySpark House Price Prediction features a PySpark-based Linear Regression model for predicting median house prices. It showcases data preprocessing, model training, and evaluation, yielding an RMSE of around 0.11. The code offers insights into building robust predictive models using PySpark.
Add a description, image, and links to the pyspark-ml topic page so that developers can more easily learn about it.
To associate your repository with the pyspark-ml topic, visit your repo's landing page and select "manage topics."