Graduation Semester and Year

2019

Language

English

Document Type

Thesis

Degree Name

Master of Science in Computer Science

Department

Computer Science and Engineering

First Advisor

Shirin Nilizadeh

Abstract

Social media has become an empowering agent for individual voices and freedom of expression. Yet it can also serve as a breeding ground for hate speech. According to a Pew Research Center study, 41% of Americans have been personally subjected to harassing behavior online, 66% have witnessed these behaviors directed at others, and 18% have been subjected to particularly severe forms of harassment online, such as physical threats, harassment over a sustained period, sexual harassment, or stalking. Many recent studies have tried to understand online hate speech and its implications, focusing on detecting and characterizing it. One limitation of these works is that they analyze a collection of individual messages without considering the larger conversational context. Our project has two objectives: first, we characterize the impact of hate speech on Twitter conversations, in terms of conversation length and sentiment, as well as user engagement; second, we demonstrate the feasibility of automatically generating hate replies to some tweets using retrieval models. For the first objective, we (1) extracted toxic tweets and their corresponding conversations; (2) defined a toxicity trend score for conversations; and (3) studied the impact of toxic replies on Twitter conversations using statistical methods. For the second objective, we (1) created a knowledge database of toxic tweets and replies; (2) implemented a retrieval model that uses Doc2vec embeddings to identify the top N tweet-reply matches for a given tweet; (3) proposed a ranking algorithm based on Word2vec that identifies the best hate reply for the tweet; and (4) evaluated our approach by implementing alternative approaches and running several studies on Amazon Mechanical Turk.

Keywords

Toxic, Hate speech, Conversation study, Twitter conversations

Disciplines

Computer Sciences | Physical Sciences and Mathematics

Comments

Degree granted by The University of Texas at Arlington
