Graduation Semester and Year
2021
Language
English
Document Type
Dissertation
Degree Name
Doctor of Philosophy in Computer Science
Department
Computer Science and Engineering
First Advisor
Chengkai Li
Abstract
As social media sites have become major channels for the quick dissemination of news, misinformation has become a significant challenge for our society to tackle. Today fact-checking rests primarily on the shoulders of human fact-checkers who laboriously sift through various trustworthy sources, interview subject experts, and check references before reaching a verdict regarding the degree of truthfulness of a factual claim. Compounded with the speed and scale at which misinformation spreads, the demanding process may leave many harmful factual claims unchecked. In the fight to curb the spread of misinformation, researchers from various disciplines have come forward to assist fact-checkers by creating several automated fact-checking tools and apps. In this dissertation, we focus on studying factual claims and make the following contributions to assist the automated fact-checking efforts: (1) Understanding a factual claim and parsing the content of the claim to extract its attributes are challenging. We propose a way to represent claims in a structured format to capture various aspects of claims, such as entities involved, their relationships, quantities, points and intervals in time, comparisons, and aggregate structures. We use semantic frames for the representation of factual claims. We create a set of new semantic frames, a dataset of frame-annotated claims, and a publicly available web-based annotation tool. (2) To verify a factual claim over a relational database, it is necessary to translate it into a SQL query. However, automatically translating claims to SQL queries is hard. We conduct a preliminary investigative study: (a) to reveal challenges in claim translations and (b) to assess the efficacy of applying a state-of-the-art text-to-SQL parser in translation. (3) The problem of unchecked claims is exacerbated on social media. We build ClaimPortal, a web-based platform that enables users to monitor, search, and check English factual claims on Twitter. We further demonstrate a semantic-frame-based model to categorize tweets based on the type of factual claims they promote. (4) One of the critical components in the fact-checking process is automatically assessing the check-worthiness of a piece of information. It is crucial to have a carefully annotated ground-truth dataset that can feed a machine-learning algorithm to predict the check-worthiness of a statement. To bridge this gap, we create a large dataset of claims from all U.S. presidential debates (1960 to 2016) along with the human-annotated check-worthiness label.
Keywords
Modeling factual claims, Fact-checking, Frame semantic
Disciplines
Computer Sciences | Physical Sciences and Mathematics
License
This work is licensed under a Creative Commons Attribution-NonCommercial-Share Alike 4.0 International License.
Recommended Citation
Arslan, Fatma, "MODELING FACTUAL CLAIMS WITH SEMANTIC FRAMES: DEFINITIONS, DATASETS, TOOLS, AND FACT-CHECKING APPLICATIONS" (2021). Computer Science and Engineering Dissertations. 339.
https://mavmatrix.uta.edu/cse_dissertations/339
Comments
Degree granted by The University of Texas at Arlington