Linear Regression in the Spotlight: From Statistical Staple to Misused Tool
Keywords:
Residual, Autocorrelation, Homoscedasticity, Linearity, RegressionAbstract
This article explores the application, assumptions, and frequent misuses of linear regression
analysis in research, particularly within the business, social sciences and medical fields. While
linear regression remains one of the most widely used and accessible statistical tools due to its
simplicity and interpretability, it is often misapplied, especially when researchers overlook the
foundational assumptions required for valid inferences. The paper reviews the key assumptions of
linearity, normality, homoscedasticity, and independence of errors, and discusses the appropriate
use of linear regression in descriptive, predictive, and causal research. Through a critical review
of published studies and an empirical analysis of customer satisfaction data from Kaggle, the
article identifies common violations, including the inappropriate modelling of discrete and ordinal
dependent variables, unjustified covariate adjustments, and misinterpretation of regression
coefficients. Residual plots and diagnostic tests further reveal that linear regression is frequently
applied where it is not suitable, leading to misleading conclusions. The study concludes with
practical recommendations to improve statistical literacy and rigor among researchers,
emphasizing the importance of involving statisticians and aligning statistical instruction with
domain-specific research contexts.
Downloads
Published
How to Cite
Issue
Section
License
Copyright (c) 2025 Peter Chimwanda, Edwin Rupi

This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.