Graduation Semester and Year




Document Type


Degree Name

Master of Science in Industrial Engineering


Industrial and Manufacturing Systems Engineering

First Advisor

Aera LeBoulluec


Health and social care are areas of concern worldwide nowadays. Chronic diseases such as cancer and social problems such as tobacco consumption are leading risks of deaths in many countries, and preventive efforts are urgently needed to decrease the negative impact that those problems cause. Technology has made available an unprecedent amount of data in the health and social care fields, which scientists are using to achieve a better understanding of many problems that are a burden for the health and social systems globally. Although previous studies have provided approaches to analyze data, more efficient and accurate methods are needed to obtain predictive models with better performance. This thesis employs data science tools to analyze the characteristics of health and social care data and determine the best approaches to improve the accuracy and efficiency of predictive models in the studied fields. Data preprocessing is considered the key action to increase the statistical power of the data and perform valid data analyses in this study. In brief, this thesis present two researches that are focused on enhancing the data analysis by implementing data science techniques to preprocess data.


Missing data, Feature selection


Engineering | Operations Research, Systems Engineering and Industrial Engineering


Degree granted by The University of Texas at Arlington