Comparative Evaluation of Residual Diagnostic Measures for Outlier Detection in Linear Regression: An Empirical Study Using Heart Rate Data

Reg No: 410

Authors

DOI:

https://doi.org/10.56450/JEFI.2025.v3i2Suppl.084

Keywords:

Outlier detection linear regression residual diagnostics

Abstract

Outliers in medical datasets can substantially distort regression models, leading to biased estimates and unreliable clinical inferences. This study evaluated residual-based diagnostic methods-including standardized residuals, Cook's Distance, DFFITS, DFBETAS, Covariance Ratio, and standardized scores-for detecting influential observations in a simple linear regression model. Using age as the predictor and heart rate as the outcome, data from 30 individuals were analyzed. The initial model (R² = 0.22) revealed one highly influential outlier (heart rate = 290 at age 42), consistently identified across multiple diagnostics. Removal of this case improved model adequacy, yielding a more stable regression (R² = 0.10) with no further significant outliers detected. Findings highlight that even a single outlier can markedly distort regression-based conclusions, underscoring the importance of employing complementary diagnostic tools to ensure valid statistical inference in medical research.

Downloads

Download data is not yet available.

References

Aggarwal, C. C., & Yu, P. S. (2001). Outlier detection for high dimensional data.Proceedings of the 2001 ACM SIGMOD International Conference on

Management of Data, 37-46. DOI: 10.1145/375663.375668

Barnett, V., & Lewis, T. (1994). Outliers in clinical data: Detection and treatment. Journal of Clinical Epidemiology, 47(8), 947-959.

C. C. Aggarwal, "Outlier detection in graphs and networks" in Outlier analysis, Springer, pp. 369-397, 2017.

David L. Donoho. Miriam Gasko. "Breakdown Properties of Location Estimates Based on Halfspace Depth and Projected Outlyingness." Ann.

Statist. 20 (4) 1803 - 1827, December, 1992. https://doi.org/10.1214/aos/1176348890

Published

2026-04-17


How to Cite

1.
Kirti S. Comparative Evaluation of Residual Diagnostic Measures for Outlier Detection in Linear Regression: An Empirical Study Using Heart Rate Data: Reg No: 410. JEFI [Internet]. 2026 Apr. 17 [cited 2026 Apr. 26];3((2Supp). Available from: https://efi.org.in/journal/index.php/JEFI/article/view/521

Similar Articles

1-10 of 38

You may also start an advanced similarity search for this article.