Welcome to the Machine Learning Tutorials repository! This collection of Jupyter notebooks is designed to help you get started with machine learning using Python and Scikit-Learn. Whether you're a beginner or looking to deepen your understanding, these tutorials cover a range of topics from basic concepts to advanced techniques. These notebooks were written by yours truly, David Akman, and are my own work for the most part, except SK5 - Advanced Topics, to which my good friend Chuck Wong contributed. They have been tested with Python 3.11.
This notebook introduces Scikit-Learn, covering its installation, data structures, and basic usage. It includes a simple example to illustrate how to create, train, and evaluate a machine learning model using Scikit-Learn.
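A minimal sketch of that create, train, and evaluate loop, using the built-in Iris dataset (the notebook's own example may use different data and models):

```python
from sklearn.datasets import load_iris
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import train_test_split

# Load a small built-in dataset: 150 iris flowers, 4 features each
X, y = load_iris(return_X_y=True)
X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.25, random_state=42)

# Create, train, and evaluate a model in three steps
model = LogisticRegression(max_iter=200)
model.fit(X_train, y_train)
print(f"Test accuracy: {model.score(X_test, y_test):.3f}")
```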
Focusing on basic machine learning models, this notebook guides users through the process of training and testing models. It explains key concepts like data splitting, model training, and performance evaluation using a linear regression example.
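The split-train-evaluate workflow can be sketched as follows, here with the built-in diabetes dataset (a stand-in for whatever data the notebook uses):

```python
from sklearn.datasets import load_diabetes
from sklearn.linear_model import LinearRegression
from sklearn.metrics import mean_squared_error
from sklearn.model_selection import train_test_split

X, y = load_diabetes(return_X_y=True)

# Hold out 20% of the data so the model is evaluated on samples it never saw
X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.2, random_state=0)

reg = LinearRegression().fit(X_train, y_train)
mse = mean_squared_error(y_test, reg.predict(X_test))
print(f"Test MSE: {mse:.1f}")
```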
Feature selection techniques are critical for improving model performance. This notebook covers various methods such as univariate selection, recursive feature elimination, and tree-based selection, with practical examples demonstrating their implementation.
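Two of those methods, univariate selection and recursive feature elimination, can be sketched as below; the dataset and the choice of 5 features are illustrative assumptions:

```python
from sklearn.datasets import load_breast_cancer
from sklearn.feature_selection import RFE, SelectKBest, f_classif
from sklearn.linear_model import LogisticRegression
from sklearn.preprocessing import StandardScaler

X, y = load_breast_cancer(return_X_y=True)  # 30 features
X = StandardScaler().fit_transform(X)       # scale so the estimator converges quickly

# Univariate selection: keep the 5 features with the highest ANOVA F-score
kbest = SelectKBest(score_func=f_classif, k=5).fit(X, y)

# Recursive feature elimination: repeatedly drop the weakest feature
rfe = RFE(LogisticRegression(max_iter=1000), n_features_to_select=5).fit(X, y)

print("Univariate picks:", kbest.get_support(indices=True))
print("RFE picks:       ", rfe.get_support(indices=True))
```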
This notebook dives into model evaluation metrics for classification and regression models, including accuracy, precision, recall, F1-score, and mean squared error. It also explains cross-validation and its importance in assessing model performance.
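The classification metrics and cross-validation can be sketched together; the dataset and classifier here are placeholders for whatever the notebook uses:

```python
from sklearn.datasets import load_breast_cancer
from sklearn.linear_model import LogisticRegression
from sklearn.metrics import accuracy_score, f1_score, precision_score, recall_score
from sklearn.model_selection import cross_val_score, train_test_split
from sklearn.preprocessing import StandardScaler

X, y = load_breast_cancer(return_X_y=True)
X = StandardScaler().fit_transform(X)
X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=0)

clf = LogisticRegression(max_iter=1000).fit(X_train, y_train)
y_pred = clf.predict(X_test)

print(f"accuracy:  {accuracy_score(y_test, y_pred):.3f}")
print(f"precision: {precision_score(y_test, y_pred):.3f}")
print(f"recall:    {recall_score(y_test, y_pred):.3f}")
print(f"F1:        {f1_score(y_test, y_pred):.3f}")

# 5-fold cross-validation gives a less optimistic estimate than a single split
scores = cross_val_score(clf, X, y, cv=5)
print(f"CV accuracy: {scores.mean():.3f} +/- {scores.std():.3f}")
```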
Hyperparameter tuning is essential for optimising machine learning models. This notebook explores grid search and random search techniques using Scikit-Learn's `GridSearchCV` and `RandomizedSearchCV` classes, showing how to enhance model performance through proper tuning.
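The two search strategies can be contrasted in a short sketch; the SVM estimator and parameter ranges are illustrative choices, not the notebook's exact setup:

```python
from scipy.stats import uniform
from sklearn.datasets import load_iris
from sklearn.model_selection import GridSearchCV, RandomizedSearchCV
from sklearn.svm import SVC

X, y = load_iris(return_X_y=True)

# Grid search: exhaustively try every combination in the grid
grid = GridSearchCV(SVC(), {"C": [0.1, 1, 10], "gamma": ["scale", "auto"]}, cv=5)
grid.fit(X, y)
print("Grid best:", grid.best_params_, f"score={grid.best_score_:.3f}")

# Random search: sample a fixed number of candidates from distributions
rand = RandomizedSearchCV(SVC(), {"C": uniform(0.1, 10)}, n_iter=10, cv=5, random_state=0)
rand.fit(X, y)
print("Random best:", rand.best_params_, f"score={rand.best_score_:.3f}")
```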
Covering advanced machine learning topics, this notebook discusses model pipelines, statistical model comparison, model deployment, and saving models to and loading them from local storage.
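The pipeline and save/load pieces can be sketched together, assuming `joblib` (installed alongside Scikit-Learn) for persistence; the scaler-plus-classifier pipeline is an illustrative example:

```python
import os
import tempfile

import joblib
from sklearn.datasets import load_iris
from sklearn.linear_model import LogisticRegression
from sklearn.pipeline import Pipeline
from sklearn.preprocessing import StandardScaler

X, y = load_iris(return_X_y=True)

# A pipeline chains preprocessing and the model so they persist as one unit
pipe = Pipeline([("scale", StandardScaler()), ("clf", LogisticRegression(max_iter=200))])
pipe.fit(X, y)

# Save the fitted pipeline to local storage, then reload it
path = os.path.join(tempfile.gettempdir(), "model.joblib")
joblib.dump(pipe, path)
restored = joblib.load(path)
print(f"Restored accuracy: {restored.score(X, y):.3f}")
```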
This notebook explains clustering, an unsupervised learning technique, using algorithms including K-means and spectral clustering. Practical examples and visualisations illustrate how these algorithms work and their applications.
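A minimal K-means sketch on synthetic data (the spectral clustering and visualisation parts are left to the notebook):

```python
from sklearn.cluster import KMeans
from sklearn.datasets import make_blobs

# Synthetic 2-D data with 3 well-separated groups
X, _ = make_blobs(n_samples=300, centers=3, random_state=42)

# K-means assigns each point to the nearest of k centroids, then re-fits the centroids
km = KMeans(n_clusters=3, n_init=10, random_state=42).fit(X)
print("Cluster sizes:", [int((km.labels_ == i).sum()) for i in range(3)])
print("Centroids:\n", km.cluster_centers_)
```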
Introducing neural networks, this notebook covers the basics of building and training neural networks using Scikit-Learn and TensorFlow. It includes an example of a simple neural network for classification tasks.
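The Scikit-Learn side of that can be sketched with `MLPClassifier` (the TensorFlow portion is left to the notebook); the digits dataset and single 50-unit hidden layer are illustrative choices:

```python
from sklearn.datasets import load_digits
from sklearn.model_selection import train_test_split
from sklearn.neural_network import MLPClassifier
from sklearn.preprocessing import StandardScaler

X, y = load_digits(return_X_y=True)
X = StandardScaler().fit_transform(X)  # neural networks train better on scaled inputs
X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=0)

# One hidden layer of 50 units; Scikit-Learn handles backpropagation internally
mlp = MLPClassifier(hidden_layer_sizes=(50,), max_iter=300, random_state=0)
mlp.fit(X_train, y_train)
print(f"Test accuracy: {mlp.score(X_test, y_test):.3f}")
```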
LightGBM is a powerful gradient boosting framework. This notebook provides an introduction to LightGBM, demonstrating how to train and evaluate models using this library. Practical examples highlight its efficiency and performance benefits. This notebook also covers hyperparameter fine-tuning using Optuna (best in the business) and feature importance using the popular SHAP module.
Focusing on time series forecasting, this notebook covers essential techniques and models for making predictions based on temporal data using the Facebook Prophet Python module.
A mini tutorial on Linear Discriminant Analysis (LDA) on the Iris Dataset.
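The core of that tutorial fits in a few lines: LDA projects the 4-D Iris data onto at most two discriminant axes and doubles as a classifier (plotting is left to the notebook):

```python
from sklearn.datasets import load_iris
from sklearn.discriminant_analysis import LinearDiscriminantAnalysis

X, y = load_iris(return_X_y=True)

# LDA finds at most (n_classes - 1) axes that best separate the classes
lda = LinearDiscriminantAnalysis(n_components=2).fit(X, y)
X_2d = lda.transform(X)

print("Projected shape:", X_2d.shape)
print(f"Training accuracy: {lda.score(X, y):.3f}")
```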
This case study uses a real-world dataset to predict income status based on various features. It demonstrates the complete machine learning workflow, from data preprocessing and feature selection to model training, evaluation, and interpretation.
This notebook presents a case study on predictive maintenance, showcasing how to predict equipment failures using historical data. It covers data preparation, model building, evaluation, and practical considerations for deployment.
Data preparation is a critical step in the machine learning process. This notebook covers techniques for cleaning, transforming, and preparing data for predictive modelling, ensuring that the dataset is ready for analysis and model building.
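Two of the most common preparation steps, imputing missing values and standardising scales, can be sketched on a tiny made-up array (the notebook works through a fuller pipeline):

```python
import numpy as np
from sklearn.impute import SimpleImputer
from sklearn.preprocessing import StandardScaler

# Toy raw data (age, income) with one missing value, mimicking real-world input
X = np.array([[25, 40_000], [32, 55_000], [np.nan, 61_000], [51, 80_000]], dtype=float)

# Fill missing values with the column median, then standardise to zero mean / unit variance
imputed = SimpleImputer(strategy="median").fit_transform(X)
scaled = StandardScaler().fit_transform(imputed)

print("After imputation:\n", imputed)
print("Column means after scaling:", scaled.mean(axis=0).round(6))
```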
Focusing on decision trees, this notebook explains how to compute information gain, an important concept for understanding tree-based models. It includes detailed examples and step-by-step calculations to illustrate the process.
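The calculation itself is short: information gain is the parent's entropy minus the size-weighted entropy of the child splits. A sketch with a hypothetical binary split (the notebook's worked examples will differ):

```python
import numpy as np

def entropy(labels):
    """Shannon entropy H = -sum(p * log2 p) of a label array."""
    _, counts = np.unique(labels, return_counts=True)
    p = counts / counts.sum()
    return float(-(p * np.log2(p)).sum())

def information_gain(parent, splits):
    """Entropy of the parent minus the size-weighted entropy of the child splits."""
    n = len(parent)
    weighted = sum(len(s) / n * entropy(s) for s in splits)
    return entropy(parent) - weighted

# Example: 14 yes/no labels (9 positive, 5 negative) split by one binary feature
parent = np.array([1] * 9 + [0] * 5)
left, right = parent[:7], parent[7:]  # hypothetical split
print(f"Information gain: {information_gain(parent, [left, right]):.4f}")
```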