Comparative Evaluation of Residual Diagnostic Measures for Outlier Detection in Linear Regression: An Empirical Study Using Heart Rate Data
Reg No: 410
DOI:
https://doi.org/10.56450/JEFI.2025.v3i2Suppl.084Keywords:
Outlier detection linear regression residual diagnosticsAbstract
Outliers in medical datasets can substantially distort regression models, leading to biased estimates and unreliable clinical inferences. This study evaluated residual-based diagnostic methods-including standardized residuals, Cook's Distance, DFFITS, DFBETAS, Covariance Ratio, and standardized scores-for detecting influential observations in a simple linear regression model. Using age as the predictor and heart rate as the outcome, data from 30 individuals were analyzed. The initial model (R² = 0.22) revealed one highly influential outlier (heart rate = 290 at age 42), consistently identified across multiple diagnostics. Removal of this case improved model adequacy, yielding a more stable regression (R² = 0.10) with no further significant outliers detected. Findings highlight that even a single outlier can markedly distort regression-based conclusions, underscoring the importance of employing complementary diagnostic tools to ensure valid statistical inference in medical research.
Downloads
References
Aggarwal, C. C., & Yu, P. S. (2001). Outlier detection for high dimensional data.Proceedings of the 2001 ACM SIGMOD International Conference on
Management of Data, 37-46. DOI: 10.1145/375663.375668
Barnett, V., & Lewis, T. (1994). Outliers in clinical data: Detection and treatment. Journal of Clinical Epidemiology, 47(8), 947-959.
C. C. Aggarwal, "Outlier detection in graphs and networks" in Outlier analysis, Springer, pp. 369-397, 2017.
David L. Donoho. Miriam Gasko. "Breakdown Properties of Location Estimates Based on Halfspace Depth and Projected Outlyingness." Ann.
Statist. 20 (4) 1803 - 1827, December, 1992. https://doi.org/10.1214/aos/1176348890
Published
Issue
Section
License
Copyright (c) 2025 Dr Shashank Kirti (Author)

This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.
This means that you are free to:
- Share, copy, and redistribute the article in any medium or format
- Adapt, remix, transform, and build upon the article
as long as you provide appropriate credit to the original work, include the authors' names, article title, journal name, and indicate that the work is licensed under CC BY. Any use of the material should not imply endorsement by the authors or the journal.