Adding ML Model for Absenteeism Prediction with Hyperparameter Tuning #227
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
Pull Request for PyVerse 💡
Issue Title : Adding ML model for HyperParameter Tuning including Resources
Closes: #141
I have added a new Jupyter Notebook to the project that demonstrates the process of building and training an ML model for predicting absenteeism time using the UCI dataset. The notebook includes the following steps:
Data loading and preprocessing: Loading the dataset, handling missing values, and performing feature engineering.
Model selection: Choosing a suitable ML algorithm (e.g., Random Forest, XGBoost) for the task.
Hyperparameter tuning: Using techniques like grid search or random search to find optimal hyperparameter values for the selected model.
Model training and evaluation: Training the model with the tuned hyperparameters and evaluating its performance using appropriate metrics (e.g., accuracy, precision, recall, F1-score).
The notebook also includes visualizations to illustrate the impact of hyperparameter tuning on the model's performance.
Type of change ☑️
How Has This Been Tested? ⚙️
I have thoroughly tested the notebook by running it multiple times and verifying that the results are consistent. I have also compared the performance of the model with and without hyperparameter tuning to demonstrate its effectiveness.
Checklist: ☑️