CMPS 6730 - Sentiment Analysis for Biochem Class Reviews

This project was inspired by my primary major: cell & molecular biology

Most college STEM courses are lecture style and emphasize memorization > understanding. One Tulane professor strives to break this mold, investing energy into creating an accessible classroom environment.

Goal

This biochemistry professor understands their classroom policies may be divisive. To track student opinions, they survey several times a semester. My goal was to perform sentiment analysis on these reviews. Could I train a model to predict how a student feels about biochemistry class given textual feedback?

Through this assignment, I wanted to provide a professor I care about with a meaningful tool they could utilize to improve their classes for all.

Methods

To perform sentiment analysis, I investigated a number of approaches.

First, since I was working with raw survey data, I had to label each review as positive [label = 1] or negative [label = 0] and remove any identifying details. I initially started with survey data the afformentioned professor gave me + some Rate My Professor [RMP] reviews of the target professor [dataset of 53 sentences of reviews]. Later on, I expanded the dataset by supplementing my spreadsheet with RMP reviews of other Tulane professors who teach CELL courses.

Throughout this entire project, I ended up building numerous models for sentiment analysis. These include:

naive bayes
logistic regression [one-hot & GloVe]
HMMs [for POS modeling]
an RNN model

Conclusions

My major takeaway from this project is that working with raw, unlabeled text can be challenging! I had a hard time building models that accurately accomplished the task I planned.

Ultimately, with my RNN model, I was able to achieve a sentiment analysis [binary classification] accuracy of 0.76, providing evidence that my efforts may have been worthwhile.

With more time to work on this assignment, I would hope to work with more data and continue designing models from scratch [+ take the time to tune the hyperparameters correctly]

Name		Name	Last commit message	Last commit date
Latest commit History 16 Commits
docs		docs
nlp		nlp
notebooks		notebooks
report		report
tests		tests
.editorconfig		.editorconfig
.gitignore		.gitignore
.gitmodules		.gitmodules
.travis.yml		.travis.yml
GettingStarted.md		GettingStarted.md
LICENSE		LICENSE
MANIFEST.in		MANIFEST.in
Makefile		Makefile
README.md		README.md
credentials.json		credentials.json
requirements.txt		requirements.txt
requirements_dev.txt		requirements_dev.txt
setup.cfg		setup.cfg
setup.py		setup.py
tox.ini		tox.ini
web.png		web.png

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

CMPS 6730 - Sentiment Analysis for Biochem Class Reviews

Goal

Methods

Conclusions

About

Releases

Packages

Contributors 2

Languages

License

tulane-cmps6730/project-biochem

Folders and files

Latest commit

History

Repository files navigation

CMPS 6730 - Sentiment Analysis for Biochem Class Reviews

Goal

Methods

Conclusions

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

Packages