- Perform Exploratory Data Analysis (EDA), connect to BigQuery, and build a dashboard with Data Studio.
- Accelerate model training.
- Train and compare different models.
- Upload the dataset to BigQuery using the subscription provided by the professor.
- You can find my Exploratory Data Analysis here.
- The visualization of the live Data Studio dashboard is below.
- In this section, to get used to the available models, I tried a few basic ones:
- Linear Regression, Lasso, Ridge, ElasticNet, Random Forest Regression.
- I have shown the training acceleration and accuracy comparison for Linear Regression and Random Forest Regression.
- For this acceleration, I used Intel's oneAPI to speed up the data mining pipeline.
- Linear Regression Training-Acceleration
- Random Forest Regression Training-Acceleration
- As we can see, for linear regression there is no difference in accuracy after using oneAPI for acceleration.
- However, for random forest regression, there is a significant decrease in the model's accuracy after acceleration.
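
The comparison above comes down to timing `fit` and then scoring on held-out data, once without and once with acceleration. Below is a minimal sketch of such a harness, not the code used for the results here. `SimpleLinearRegression` and `benchmark` are hypothetical names introduced for illustration, and the toy data is made up; the harness only assumes that each model exposes `fit` and `score`.

```python
import time


class SimpleLinearRegression:
    """Hypothetical one-feature least-squares model with a fit/score interface."""

    def fit(self, x, y):
        n = len(x)
        mean_x = sum(x) / n
        mean_y = sum(y) / n
        # Closed-form ordinary least squares for a single feature.
        sxx = sum((xi - mean_x) ** 2 for xi in x)
        sxy = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y))
        self.slope = sxy / sxx
        self.intercept = mean_y - self.slope * mean_x
        return self

    def predict(self, x):
        return [self.slope * xi + self.intercept for xi in x]

    def score(self, x, y):
        """Return R^2, the coefficient of determination."""
        mean_y = sum(y) / len(y)
        ss_res = sum((yi - pi) ** 2 for yi, pi in zip(y, self.predict(x)))
        ss_tot = sum((yi - mean_y) ** 2 for yi in y)
        return 1.0 - ss_res / ss_tot


def benchmark(models, x_train, y_train, x_test, y_test):
    """Fit each model and return {name: (fit_seconds, test_score)}."""
    results = {}
    for name, model in models.items():
        start = time.perf_counter()
        model.fit(x_train, y_train)
        elapsed = time.perf_counter() - start
        results[name] = (elapsed, model.score(x_test, y_test))
    return results


# Toy data (illustrative only): y = 2x + 1
x_train = [0.0, 1.0, 2.0, 3.0, 4.0]
y_train = [1.0, 3.0, 5.0, 7.0, 9.0]
x_test = [5.0, 6.0, 7.0]
y_test = [11.0, 13.0, 15.0]

results = benchmark(
    {"baseline": SimpleLinearRegression()}, x_train, y_train, x_test, y_test
)
for name, (seconds, r2) in results.items():
    print(f"{name}: fit in {seconds:.6f}s, R^2 = {r2:.3f}")
```

scikit-learn estimators such as `LinearRegression` and `RandomForestRegressor` expose the same `fit` and `score` methods, so they could be passed to `benchmark` with 2-D feature arrays. If the oneAPI component in use is the Intel Extension for Scikit-learn (an assumption, since these notes do not name it), calling `patch_sklearn()` from `sklearnex` before importing the estimators would give the accelerated run to compare against.
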
| Model | Score |
|---|---|
| XGBoost | 0.974 |
| CatBoost | 0.795 |
| LightGBM | 0.729 |