Dataset: Labeled Face in the Wild (LFW) by University of Massachusetts.
Link dataset: https://vis-www.cs.umass.edu/lfw/
1. Data Preprocessing: The dataset consists of labeled facial images, which are preprocessed to enhance quality and consistency. Techniques such as normalization, resizing, and augmentation are applied to improve model performance.
2. CNN Architecture: The model employs a multi-layer CNN to capture spatial hierarchies in the images. Layers include convolutional, pooling, and fully connected layers, optimized for feature extraction and classification.
3. Training and Validation: The model is trained on a diverse dataset of facial images, with a portion reserved for validation to ensure accuracy and generalization. Techniques like dropout and batch normalization are used to prevent overfitting.
4. Performance Evaluation: The system’s performance is evaluated using metrics such as accuracy and Confusion Matrix. These metrics help in fine-tuning the model for optimal results.
5. Applications: This face recognition system can be applied in various domains, including security, authentication, and personalized user experiences.
Data augmentation is a technique used in machine learning to artificially increase the size and diversity of a training dataset by creating modified versions of existing data. This helps improve the performance and robustness of machine learning models, especially when the original dataset is small or imbalanced.
Avoid Overfitting: By testing the model on different subsets of data, cross-validation helps detect overfitting, where the model performs well on training data but poorly on new data.
Better Performance Estimation: It provides a more reliable estimate of the model’s performance compared to a single train-test split.
For more about Cross-Validation: https://scikit-learn.org/stable/modules/cross_validation.html.
To select the best model, the highest value of the average accuracy of the five models produced was taken, so that Fold 3 was selected as the best model.