GitHub - reshmamanoj2408/AI-Project

Sentimental LIAR: Extended Corpus and Deep Learning Models for Fake Claim Classification

Our Sentimental LIAR dataset is a modified and further extended version of the LIAR extension introduced by Kirilin et al. In our dataset, the multi-class labeling of LIAR is converted to a binary annotation by changing half-true, false, barely-true and pants-fire labels to False, and the remaining labels to True. Furthermore, we convert the speaker names to numerical IDs in order to avoid bias with regards to the textual representation of names. The binary-label dataset is then extended by adding sentiments derived using the Google NLP API . Sentiment analysis determines the overall attitude of the text (i.e., whether it is positive or negative), and is quantified by a numerical score. If the sentiment score is positive, then we assign Positive for the sentiment attribute, otherwise Negative is assigned. We also introduced a further extension by adding emotion scores extracted using the IBM NLP API for each claim, which determine the detected level of 6 emotional states namely anger, sadness, disgust, fear and joy. The score for each emotion is between the range of 0 and 1.

Below demonstrates a sample record in Sentimental LIAR for a short claim in the LIAR dataset.

statement="McCain opposed a requirement that the government buy American-made motorcycles. And he said all buy-American provisions were quote ’disgraceful.’ "
subject: federal-budget
speaker id: 2
speaker job: President
state info: Illinois
party affiliation: democrat
sentiment: NEGATIVE
anger: 0.1353
disgust: 0.8253
sad: 0.1419
fear: 0.0157
joy: 0.0236
barely true counts: 70
false counts: 71
half true counts: 160
mostly true counts: 163
pants on fire counts: 9
SEN sentiment score: -0.7

Paper on Arxiv

This repository contains the dataset for this paper: https://arxiv.org/abs/2009.01047

Paper on IEEE

https://ieeexplore.ieee.org/document/9280528

Citation

@inproceedings{upadhayay2020sentimental,
  title={Sentimental LIAR: Extended Corpus and Deep Learning Models for Fake Claim Classification},
  author={Upadhayay, Bibek and Behzadan, Vahid},
  booktitle={2020 IEEE International Conference on Intelligence and Security Informatics (ISI)},
  pages={1--6},
  year={2020},
  organization={IEEE}
}

Name		Name	Last commit message	Last commit date
Latest commit History 17 Commits
Fake Claim Classification using BERT-base with CNN.ipynb		Fake Claim Classification using BERT-base with CNN.ipynb
Mini Project-1.pdf		Mini Project-1.pdf
Proj2_rmanojku.pdf		Proj2_rmanojku.pdf
Proj3_rmanojku.pdf		Proj3_rmanojku.pdf
Proj4_rmanojku.pdf		Proj4_rmanojku.pdf
Project Proposal - AI.pdf		Project Proposal - AI.pdf
README.md		README.md
test_final.csv		test_final.csv
train_final.csv		train_final.csv
valid_final.csv		valid_final.csv

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Sentimental LIAR: Extended Corpus and Deep Learning Models for Fake Claim Classification

Below demonstrates a sample record in Sentimental LIAR for a short claim in the LIAR dataset.

Paper on Arxiv

Paper on IEEE

Citation

About

Releases

Packages

Contributors 3

Languages

reshmamanoj2408/AI-Project

Folders and files

Latest commit

History

Repository files navigation

Sentimental LIAR: Extended Corpus and Deep Learning Models for Fake Claim Classification

Below demonstrates a sample record in Sentimental LIAR for a short claim in the LIAR dataset.

Paper on Arxiv

Paper on IEEE

Citation

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 3

Languages

Packages