Project Overview

This project involves preparing, analyzing, and fine-tuning dataset McAuley-Lab/Amazon-Reviews-2023 for product reviews. (The finetuned model can be downloded from this link) It includes data preparation, model fine-tuning, user interaction for review summarization, and sentiment analysis.
Below are the detailed steps and functionalities:

Data Preparation

Download the Dataset

Run the data_save.py script to download the dataset.
Match the product names in the metadata using the ASIN code.
Remove any unnecessary details to clean the data.

Model Fine-Tuning

The train.py file is used to fine-tune the BART-large-cnn model for generating summaries of product reviews.

User Interaction

Select Product and Get Summarized Reviews

Use the user_input.ipynb notebook to:

Select a product from a provided list.
Obtain a summarized review of the selected product.
View the sentiment analysis of the selected product.

Sentiment Analysis Model Comparison

The Comapring_Roberta_and_Vader_models.ipynb notebook is used to compare two models for sentiment analysis:
Here, the comparison is done for a subset of first 250 data samples from the McAuley-Lab/Amazon-Reviews-2023 dataset.

RoBERTa Model: RoBERTa (Robustly optimized BERT approach) is an advanced transformer-based model designed for natural language understanding. It builds upon the BERT (Bidirectional Encoder Representations from Transformers) architecture
VADER Model: VADER (Valence Aware Dictionary and sEntiment Reasoner) is a rule-based model specifically attuned to sentiment analysis, particularly suited for social media texts. It is designed to be fast and computationally efficient, making it ideal for real-time applications

Both RoBERTa and VADER have their strengths and are suited for different scenarios. RoBERTa, with its transformer-based architecture, excels in deep contextual understanding and complex text analysis. VADER, with its rule-based approach, provides fast and interpretable results but it has limitations to capture wider range of sentiments.

Name		Name	Last commit message	Last commit date
Latest commit History 27 Commits
Amazon_Review__Summarizer_and__Sentiment_Analyzer_asyed12s_oraja2s_mliaqu2s__Poster_final.pdf		Amazon_Review__Summarizer_and__Sentiment_Analyzer_asyed12s_oraja2s_mliaqu2s__Poster_final.pdf
Comapring_Roberta_and_Vader_models.ipynb		Comapring_Roberta_and_Vader_models.ipynb
README.md		README.md
data_save.py		data_save.py
graphs.ipynb		graphs.ipynb
train.py		train.py
user_input.ipynb		user_input.ipynb

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Project Overview

Data Preparation

Download the Dataset

Model Fine-Tuning

User Interaction

Select Product and Get Summarized Reviews

Sentiment Analysis Model Comparison

About

Releases

Packages

Contributors 3

Languages

MairaLiaquat/Amazon-Review-Summarizer-and-Sentiment-Analyser

Folders and files

Latest commit

History

Repository files navigation

Project Overview

Data Preparation

Download the Dataset

Model Fine-Tuning

User Interaction

Select Product and Get Summarized Reviews

Sentiment Analysis Model Comparison

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 3

Languages

Packages