Power DataMate Tool: Leveraging Logistic Regression Classification for Interactive Data Modeling

Authors

  • Mahmoud Abu Alrub The Electrical and Computer Engineering Department, The University of Michigan – Dearborn, USA https://orcid.org/0000-0002-8916-0259
  • Adnan Shaout The Electrical and Computer Engineering Department, The University of Michigan – Dearborn, USA

DOI:

https://doi.org/10.14738/tecs.122.16892

Keywords:

Data Mining, Data Modeling, Data Classification, Logistic Regression, Primary Key, Foreign Key

Abstract

The demand for efficient predictive modeling techniques has become crucial due to the growing occurrence of binary classification problems in diverse fields. This paper presents a new and innovative software tool “Power DataMate” (PDM) which performs logistic regression (LR) classification as a potent technique for data modeling. PDM is a powerful tool in capturing and analyzing correlations across varied datasets. One objective of PDM is to focus on logistic regression for inquiry based on its capability to represent intricate interactions between predictors and the binary response variable. Another important goal of the new tool is to forecast the likelihood of discovering Primary Keys (PK) and Foreign Keys (FK) within datasets.

The new tool PDM also allows users not only to automatically have their data for a project modeled, but also interactively review and confirm primary keys and features for further data analysis and modeling. While the research entails a comprehensive evaluation of model performance indicators, including accuracy, precision, and recall, results show that the accuracy of PK prediction is 89% and 82% for the FK. Hence, these results are the first of their kind and could be a starting point for further model enhancements and data analytics research, especially for projects which include large data files where PDM end user has the choice to interactively feed the learning algorithm for better outcomes.

Downloads

Published

2024-05-08

How to Cite

Abu Alrub, M., & Shaout, A. (2024). Power DataMate Tool: Leveraging Logistic Regression Classification for Interactive Data Modeling. Transactions on Engineering and Computing Sciences, 12(2), 195–216. https://doi.org/10.14738/tecs.122.16892