Automatic Review Labelling using BERT

Reviews are essential means of knowing the performance of a product. In this project, I have created a model that predicts the score of a review based on the text. This sentiment analysis model classifies the text into 1 to 5, based on the sentiment behind the review. For example, "Nice product" usually means a score of 5 and “Poor quality” usually means a score of 1.

The model was trained using the Amazon food reviews dataset, which contains around 5 lakh reviews. Since there was a class imbalance, I did undersampling to balance the classes. I used the BERT model and a linear layer at the end. Therefore, for word embedding, I used the BERT tokenizer. The parameters of the BERT model were frozen during the training process to avoid computational complexity. The test accuracy turned out to be 47.4%, much greater than the random case (20%).

Name		Name	Last commit message	Last commit date
Latest commit History 42 Commits
static		static
templates		templates
Procfile		Procfile
README.md		README.md
Sentiment_Analysis_Project.ipynb		Sentiment_Analysis_Project.ipynb
app.py		app.py
modelcode.py		modelcode.py
predictor.py		predictor.py
preprocess.py		preprocess.py
requirements.txt		requirements.txt
runtime.txt		runtime.txt
weight.pth		weight.pth

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Automatic Review Labelling using BERT

Website preview

About

Releases

Packages

Languages

NohanJoemon/Automatic-review-labelling-using-BERT

Folders and files

Latest commit

History

Repository files navigation

Automatic Review Labelling using BERT

Website preview

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages