GitHub - jss415/recommendation-system: Evaluations and Comparisons of Recommendation Systems Using The MovieLens Dataset

Movie Recommendation and Rating Prediction

This project uses the MovieLens dataset to develop and compare various algorithms for movie recommendation and rating prediction. We evaluate the performance of different approaches including Singular Value Decomposition (SVD), content-based filtering, item-to-item K-Nearest Neighbors (KNN), user-based filtering, hybrid methods, and neural filtering. Performance metrics such as Mean Absolute Error (MAE) and Root Mean Square Error (RMSE) are used to assess the effectiveness of each algorithm. Our comprehensive analysis provides insights into the strengths and weaknesses of each method, helping to identify the best strategies for accurate movie recommendations.

Problem Statement

Data Preprocessing:

Create a training dataset and a testing dataset for the experiment. For each user, randomly select 80% of their ratings as the training ratings, and use the remaining 20% as testing ratings. The training ratings from all users consist of the final training dataset, and the testing ratings from all users consist of the final testing dataset. Rating Prediction:
Develop algorithms to predict the ratings in the testing set based on the information in the training dataset.
Evaluate the predictions based on Mean Absolute Error (MAE) and Root Mean Square Error (RMSE).

Item Recommendation:

Construct a recommendation list for each user.
Evaluate the recommendation quality based on precision, recall, F-measure, and Normalized Discounted Cumulative Gain (NDCG).

Data and Exploratory Data Analysis

25 million ratings applied to 62,000 movies by 162,000 users. Released 12/2019 on https://grouplens.org/datasets/movielens/. Below figure shows the distribution of ratings per movie. Of the 2,83,228 users using Movies, around 69,000 people rated more then 100 movies. Most of the movies in the data received less than 10000 ratings.

Results

The study's results show significant differences in performance among various recommendation algorithms evaluated on the MovieLens dataset. Specifically, the content-based approach showed higher RMSE and MAE values compared to collaborative filtering methods, with item-item collaborative filtering outperforming user-user filtering due to the dataset's larger number of movies relative to users. KNN-based filtering fell in between these two approaches. The most successful models were SVD-based and neural collaborative filtering, which achieved the lowest error rates and higher precision and recall metrics. This success highlights the advantage of collaborative filtering. However, on the other hand, the content-based method's reliance only on movie genres limited its effectiveness. The superior performance of SVD and neural models is explained by the fact that these models capture latent factors in user-movie interactions, with the neural model excelling in capturing non-linear relationships. These findings highlight the importance of algorithm choice and dataset characteristics in optimizing recommendation system performance.

Name		Name	Last commit message	Last commit date
Latest commit History 6 Commits
README.md		README.md
SVD.ipynb		SVD.ipynb
content_based_filtering.ipynb		content_based_filtering.ipynb
evaluation_metrics.ipynb		evaluation_metrics.ipynb
exploratory_data_analysis.ipynb		exploratory_data_analysis.ipynb
graph.ipynb		graph.ipynb
hybrid_neural_collaborative_filtering.ipynb		hybrid_neural_collaborative_filtering.ipynb
item_to_item_collaborative_filtering_using_cosine_similarity.ipynb		item_to_item_collaborative_filtering_using_cosine_similarity.ipynb
knn_based_recommendation.ipynb		knn_based_recommendation.ipynb
preprocessing.ipynb		preprocessing.ipynb
report.pdf		report.pdf
user_based_collaborative_filtering.ipynb		user_based_collaborative_filtering.ipynb

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Movie Recommendation and Rating Prediction

Problem Statement

Data and Exploratory Data Analysis

Results

About

Releases

Packages

Languages

jss415/recommendation-system

Folders and files

Latest commit

History

Repository files navigation

Movie Recommendation and Rating Prediction

Problem Statement

Data and Exploratory Data Analysis

Results

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages