Merge pull request #806 from abckhush/main

Pet's Facial Expression Detection
abhisheks008 · Jun 21, 2024 · 0420cd2 · 0420cd2
2 parents 39e42a9 + d9c81e7
commit 0420cd2
Show file tree

Hide file tree

Showing 10 changed files with 89 additions and 0 deletions.
diff --git a/Pet's Facial Expression/Dataset/README.md b/Pet's Facial Expression/Dataset/README.md
@@ -0,0 +1,5 @@
+# **Pet's Facial Expression Recognition**
+
+This dataset contains 1000 face images of various pets, such as dogs, cats, rabbits, hamsters, sheep, horses, and birds. The images capture the diversity of expressions these animals can display, such as happiness, sadness, anger etc. You can apply ML techniques to gain insights into pet emotions and personalities, create fun and creative projects with pet face images, and contribute to pet face recognition research and animal welfare
+
+### Dataset Link : https://www.kaggle.com/datasets/anshtanwar/pets-facial-expression-dataset/data
diff --git a/Pet's Facial Expression/Images/ConfusionMatrix_ViT.png b/Pet's Facial Expression/Images/ConfusionMatrix_ViT.png
diff --git a/Pet's Facial Expression/Images/Dataset.png b/Pet's Facial Expression/Images/Dataset.png
diff --git a/Pet's Facial Expression/Images/EfficientNet_plot.png b/Pet's Facial Expression/Images/EfficientNet_plot.png
diff --git a/Pet's Facial Expression/Images/ResNet50_Plot.png b/Pet's Facial Expression/Images/ResNet50_Plot.png
diff --git a/Pet's Facial Expression/Models/README.md b/Pet's Facial Expression/Models/README.md
@@ -0,0 +1,17 @@
+# **Pet's Facial Expression Detection**
+This project focuses on developing a sophisticated image classification system to identify different facial expressions in pets using deep learning techniques. The system leverages three state-of-the-art convolutional neural network (CNN) architectures: EfficientNet and ResNet50, along with Vision Transformer (ViT) architecture, each known for their robustness and high performance in image recognition tasks.
+
+
+### Models Implemented
+- ViT Transformer: Utilizes the transformer architecture for image classification, providing state-of-the-art accuracy.
+- EfficientNet: A family of models that balance model depth, width, and resolution, achieving high performance with fewer parameters.
+- ResNet50: Employs residual learning to train very deep networks, addressing the vanishing gradient problem and achieving high accuracy.
+
+### Libraries Used
+The following libraries were used to implement the models:
+
+- TensorFlow: An open-source library developed by Google for numerical computation and large-scale machine learning.
+- Keras: A high-level neural networks API, written in Python and capable of running on top of TensorFlow.
+- NumPy: A fundamental package for scientific computing with Python, used for handling arrays and performing mathematical operations.
+- Pandas: A data manipulation and analysis library for Python.
+- Matplotlib: A plotting library for Python and its numerical mathematics extension, NumPy.
diff --git a/Pet's Facial Expression/Models/pet-s-facial-expression-detection-resnet-50.ipynb b/Pet's Facial Expression/Models/pet-s-facial-expression-detection-resnet-50.ipynb
diff --git a/Pet's Facial Expression/Models/pets-facial-expression-efficientnet.ipynb b/Pet's Facial Expression/Models/pets-facial-expression-efficientnet.ipynb
diff --git a/Pet's Facial Expression/Models/pets-facial-expression-vit.ipynb b/Pet's Facial Expression/Models/pets-facial-expression-vit.ipynb
diff --git a/Pet's Facial Expression/README.md b/Pet's Facial Expression/README.md
@@ -0,0 +1,64 @@
+# **Pet's Facial Expression Detection**
+
+### 🎯 Goal
+This project aims to develop an advanced image classification system that can accurately identify the facial expressions of pets using cutting-edge deep learning techniques.
+
+### Purpose
+Understanding pet emotions through their facial expressions can significantly enhance the bond between pets and their owners. This project aims to leverage advanced AI models to detect and classify various facial expressions in pets, aiding in better care and understanding of pets' emotional states.
+
+### 🧵 Dataset
+The dataset used in this project is: https://www.kaggle.com/datasets/anshtanwar/pets-facial-expression-dataset
+
+### 🧾 Description
+This project focuses on developing a sophisticated image classification system to identify different facial expressions in pets using deep learning techniques. The system leverages three state-of-the-art convolutional neural network (CNN) architectures: EfficientNet and ResNet50, and Vision Transformer (ViT) architecture, each known for their robustness and high performance in image recognition tasks.
+
+### 🚀 Models Implemented
+- ViT Transformer: Utilizes the transformer architecture for image classification, providing state-of-the-art accuracy.
+- EfficientNet: A family of models that balance model depth, width, and resolution, achieving high performance with fewer parameters.
+- ResNet50: Employs residual learning to train very deep networks, addressing the vanishing gradient problem and achieving high accuracy.
+
+### 📚 Libraries Needed
+- TensorFlow: For building and training deep learning models.
+- Keras: For simplifying the creation and training of neural networks.
+- NumPy: For numerical computations and array operations.
+- Pandas: For data manipulation and analysis.
+- Matplotlib: For plotting and visualizing data.
+
+### 📊 Exploratory Data Analysis Results
+Exploratory Data Analysis (EDA) involved examining the distribution of the dataset, visualizing sample images, and understanding the different classes of facial expressions. The dataset was split into training and testing sets to evaluate the model's performance effectively.
+
+### 📈 Performance of the Models based on the Accuracy Scores
+
+**ViT Performance**
+
+![ConfusionMatrix_ViT](https://github.com/abckhush/DL-Simplified/assets/127378920/d708a320-014b-43bf-881e-67908057f0e2)
+
+**EficientNet Performance**
+
+![EfficientNet_plot](https://github.com/abckhush/DL-Simplified/assets/127378920/631071c2-91cc-4b96-9114-869e4f55cd96)
+
+**ResNet50 Performance**
+
+![ResNet50_Plot](https://github.com/abckhush/DL-Simplified/assets/127378920/03ef5ddc-e537-4101-aedd-3e5d3c83fe6e)
+
+
+### 📢 Conclusion
+The Vision Transformer (ViT) model successfully classified the facial expressions of pets with high accuracy. The EfficientNet model showed moderate performance, and the ResNet50 model achieved the highest accuracy among the three models. Each model's performance demonstrates the potential of using advanced neural network architectures for image classification tasks in the domain of pet emotion detection.
+
+### Accuracy Results
+The model's performance was evaluated using accuracy and F1 scores.
+1. ViT
+   - Training Accuracy: 80.0%
+   - F1 Score: 0.8
+2. EfficientNet
+   - Training Accuracy: 57.7%
+   - F1 Score: 0.54
+3. ResNet50
+   - Training Accuracy: 97.99%
+
+### Best Fitted Model
+The ResNet50 model proved to be the best-fitted model for this task, achieving the highest accuracy and demonstrating robustness in classification.
+
+## ✒️ Contributor
+- Name: Khushi Kalra
+- Github: https://www.github.com/abckhush