-
-
Notifications
You must be signed in to change notification settings - Fork 215
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
Merge pull request #586 from SRUJANA2199/main
Emotion recognition in text
- Loading branch information
Showing
8 changed files
with
997 additions
and
0 deletions.
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,108 @@ | ||
EMOTION RECOGNITION IN TEXT | ||
|
||
GOAL | ||
|
||
To develop a model that can analyze text data and classify the emotions expressed (e.g., happiness, sadness, anger). | ||
|
||
|
||
DATASET | ||
|
||
https://www.kaggle.com/datasets/pashupatigupta/emotion-detection-from-text | ||
|
||
DESCRIPTION | ||
|
||
To analyze the dataset of emotion detction from text and build,train the model on the basis of different features and variables. | ||
|
||
Visualization and EDA of different attributes: | ||
|
||
|
||
|
||
|
||
|
||
![sentiment distribution](https://github.com/abhisheks008/ML-Crate/assets/136368774/563e9d32-6c49-40a3-803a-419931a84f3a) | ||
|
||
|
||
|
||
|
||
|
||
|
||
|
||
|
||
|
||
|
||
|
||
|
||
|
||
WHAT I HAD DONE | ||
|
||
1.Load the dataset which contains about 40000 entires. | ||
|
||
2.Checked for missing values and cleaned the data accordingly. | ||
|
||
3.Analyzed the data, found insights and visualized them accordingly. | ||
|
||
4.Plotting distribution graphs to find corelations. | ||
|
||
5.Found detailed insights of different columns with target variable using plotting libraries. | ||
|
||
6.Trained the datasets by different models and saves their accuracies into a dataframe. | ||
|
||
|
||
MODELS USED | ||
|
||
|
||
Random forest classifier as it shows high accuracy,versatility and scalability. | ||
|
||
Gradient booster which is a strong algorithm for classification and regression problems | ||
|
||
XGBClassifier help to improve machine-learning model's accuracy. | ||
|
||
|
||
LIBRARIES NEEDED | ||
|
||
Numpy | ||
|
||
Pandas | ||
|
||
Word cloud | ||
|
||
Matplotlib | ||
|
||
Seaborn | ||
|
||
Scikit-Learn | ||
|
||
Scipy | ||
|
||
Xgboost | ||
|
||
Tensorflow | ||
|
||
Keras | ||
|
||
|
||
VISUALIZATION | ||
|
||
INCLUSION OF IMAGES OF THE VISUALIZATION IS MUST (RESULT OF EDA). | ||
|
||
ACCURACIES | ||
|
||
Random forest classifier Score = 1.0 | ||
|
||
Gradient booster Score = 1.0 | ||
|
||
XGBClassifier Score = 0.25 | ||
|
||
|
||
CONCLUSION | ||
|
||
Random forest classifier and Gradient booste rmodels show promising performance . | ||
|
||
XGBClassifier shows less accuracy | ||
|
||
|
||
YOUR NAME | ||
|
||
SRUJANA | ||
|
||
|
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
772 changes: 772 additions & 0 deletions
772
Emotion Recognition In Text/Model/Emotion-Recognition-in-text.ipynb
Large diffs are not rendered by default.
Oops, something went wrong.
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,108 @@ | ||
EMOTION RECOGNITION IN TEXT | ||
|
||
GOAL | ||
|
||
To develop a model that can analyze text data and classify the emotions expressed (e.g., happiness, sadness, anger). | ||
|
||
|
||
DATASET | ||
|
||
https://www.kaggle.com/datasets/pashupatigupta/emotion-detection-from-text | ||
|
||
DESCRIPTION | ||
|
||
To analyze the dataset of emotion detction from text and build,train the model on the basis of different features and variables. | ||
|
||
Visualization and EDA of different attributes: | ||
|
||
|
||
|
||
|
||
|
||
![sentiment distribution](https://github.com/abhisheks008/ML-Crate/assets/136368774/563e9d32-6c49-40a3-803a-419931a84f3a) | ||
|
||
|
||
|
||
|
||
|
||
|
||
|
||
|
||
|
||
|
||
|
||
|
||
|
||
WHAT I HAD DONE | ||
|
||
1.Load the dataset which contains about 40000 entires. | ||
|
||
2.Checked for missing values and cleaned the data accordingly. | ||
|
||
3.Analyzed the data, found insights and visualized them accordingly. | ||
|
||
4.Plotting distribution graphs to find corelations. | ||
|
||
5.Found detailed insights of different columns with target variable using plotting libraries. | ||
|
||
6.Trained the datasets by different models and saves their accuracies into a dataframe. | ||
|
||
|
||
MODELS USED | ||
|
||
|
||
Random forest classifier as it shows high accuracy,versatility and scalability. | ||
|
||
Gradient booster which is a strong algorithm for classification and regression problems | ||
|
||
XGBClassifier help to improve machine-learning model's accuracy. | ||
|
||
|
||
LIBRARIES NEEDED | ||
|
||
Numpy | ||
|
||
Pandas | ||
|
||
Word cloud | ||
|
||
Matplotlib | ||
|
||
Seaborn | ||
|
||
Scikit-Learn | ||
|
||
Scipy | ||
|
||
Xgboost | ||
|
||
Tensorflow | ||
|
||
Keras | ||
|
||
|
||
VISUALIZATION | ||
|
||
INCLUSION OF IMAGES OF THE VISUALIZATION IS MUST (RESULT OF EDA). | ||
|
||
ACCURACIES | ||
|
||
Random forest classifier Score = 1.0 | ||
|
||
Gradient booster Score = 1.0 | ||
|
||
XGBClassifier Score = 0.25 | ||
|
||
|
||
CONCLUSION | ||
|
||
Random forest classifier and Gradient booste rmodels show promising performance . | ||
|
||
XGBClassifier shows less accuracy | ||
|
||
|
||
YOUR NAME | ||
|
||
SRUJANA | ||
|
||
|
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,9 @@ | ||
numpy==1.19.2 | ||
pandas==1.4.3 | ||
matplotlib==3.7.1 | ||
scikit-learn~=1.0.2 | ||
scipy==1.5.0 | ||
seaborn==0.10.1 | ||
xgboost~=1.5.2 | ||
tensorflow==2.4.1 | ||
keras==2.4.0 |