By Sen Sub Laban, sen.s@pitt.edu | December 15, 2022
This research project analyzes topic-matched conspiratorial and mainstream documents to understand which lexical features may potentially influence the engagement with and consequent spread of narratives across social media.
This project utilized LOCO: The 88-million word language of conspiracy corpus. LOCO is composed of topic-matched conspiracy and mainstream documents harvested from 150 websites. The data is hierarchically structured in JSON files, with each document cross-nested within websites and topics. The documents are also annotated for a rich set of linguistic features and metadata. LOCO is made freely available by its authors, licensed under an open access Creative Commons license, permitting use, sharing, adaptation, distribution and reproduction in any medium or format, making it ideal for many avenues of analysis and research such as this one.
Miani, A., Hills, T. & Bangerter, A. LOCO: The 88-million-word language of conspiracy corpus. Behav Res 54, 1794–1817 (2022). https://doi.org/10.3758/s13428-021-01698-z
-
Final report provides a complete overview of this project, focusing chiefly on analysis, rationale, and conclusions drawn from the data.
-
The data folder contains a step-by-step of the coding process as well as all relevant data objects and images.
-
Presentation.pdf is a copy of the presentation aid utilized for a presentation about this project on December 8, 2022.
-
Progress report details progress made on this project throughout the semester.
-
Project plan was the original plan developed at the start of this project.
-
See the License to understand what you may and may not do with this project.