Alex Ocampo, 04/30/2023
This is a term project for Data Science for Linguistics 2023 at the University of Pittsburgh.
This project analyzes the Am I the Asshole subreddit to question how we persuade readers and shift blame while telling stories. It focuses on how much agency writers afford themselves and the other parties in their stories.
I generated the data set for this project myself using PMAW, a wrapper for Pushshift, a third-party Reddit API.
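For readers unfamiliar with PMAW, here is a minimal sketch of what a collection step could look like. It is not the code from my notebooks: the helper names `collect_posts`, `save_posts`, and `main`, the output file name `aita_sample.csv`, and the choice of the `id`, `title`, and `selftext` fields are all assumptions made for illustration. It also assumes that `pmaw` is installed and that Pushshift access is available.

```python
import csv

FIELDS = ["id", "title", "selftext"]  # assumed subset of submission fields


def collect_posts(api, subreddit="AmItheAsshole", limit=100):
    """Fetch up to `limit` submissions and keep only the assumed fields."""
    # PMAW's search_submissions returns an iterable of dict-like submissions.
    submissions = api.search_submissions(subreddit=subreddit, limit=limit)
    return [{field: post.get(field, "") for field in FIELDS} for post in submissions]


def save_posts(rows, path):
    """Write the collected rows to a CSV file with a header row."""
    with open(path, "w", newline="", encoding="utf-8") as f:
        writer = csv.DictWriter(f, fieldnames=FIELDS)
        writer.writeheader()
        writer.writerows(rows)


def main():
    # Imported here so the helpers above can be used without pmaw installed.
    from pmaw import PushshiftAPI

    rows = collect_posts(PushshiftAPI(), limit=100)
    save_posts(rows, "aita_sample.csv")  # hypothetical output path
```

Calling `main()` would run a small test pull. Passing the API object in as an argument keeps the helper easy to try out with a stand-in object when Pushshift is unavailable.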
final_report.md
: This is my final write-up for this project.
README.md
: This is the current file, which briefly overviews my repo.
LICENSE.md
: This is the licensing information for this project. This project is licensed under GNU General Public License v3.0.
class_presentation.pdf
: This is a slideshow used to supplement a class presentation about my project.
progress_report.md
: This is a journal used throughout the semester to document major updates to my project.
project_plan.md
: This is the original pitch for this project from the beginning of the semester.
code
: This folder houses the Jupyter Notebooks used to work through my project.
data_collection_test.ipynb
: This houses an initial experiment with PMAW to form my data set.
data_collection.ipynb
: This maintains the full run-through of scraping and reorganizing my data set.
data_analysis.ipynb
: This contains the bulk of my project, where I analyze the data set for linguistic features.
data_analysis_experiments
: This contains attempts to utilize machine learning to explore the data set. The results here are inconclusive.
data
: This folder houses the data set developed for this project. Most of the contents of this folder are ignored on the public repo.
aita_data_sample.csv
: These are the results of the initial test run in data_collection_test.ipynb.
aita_data.csv
: This is the final data set gathered from data_collection.ipynb.
figures
: This folder houses the assets used within my final_report.md.
.gitignore
: This file serves as housekeeping to omit any unnecessary files from the repository, such as internal saves of my data set.
The guestbook for my project can be found here! Thank you for visiting, and I hope you enjoy :D