Penalized Lasso Methods in Health Data: application to trauma and influenza data of Kerman

سال انتشار: 1398
نوع سند: مقاله ژورنالی
زبان: انگلیسی
مشاهده: 158

فایل این مقاله در 10 صفحه با فرمت PDF قابل دریافت می باشد

استخراج به نرم افزارهای پژوهشی:

لینک ثابت به این مقاله:

شناسه ملی سند علمی:

JR_JKMU-26-6_003

تاریخ نمایه سازی: 19 دی 1401

چکیده مقاله:

Background: Two main issues that challenge model building are number of Events Per Variable and multicollinearity among exploratory variables. Our aim is to review statistical methods that tackle these issues with emphasize on penalized Lasso regression model.  The present study aimed to explain problems of traditional regressions due to small sample size and multi-colinearity in trauma and influenza data and to introduce Lasso regression as the most modern shrinkage method. Methods: Two data sets, corresponded to Events Per Variable of ۱.۵ and ۳.۴, were used. The outcomes of these two data sets were hospitalization due to trauma and hospitalization of patients suffering influenza respectively. In total, four models were developed: classic Cox and logistic regression models, as well as their penalized lasso form. The tuning parameters were selected through ۱۰-fold cross validation. Results: Traditional Cox model was not able to detect significance of any of variables. Lasso Cox model revealed significance of respiratory rate, focused assessment with sonography in trauma, difference between blood sugar on admission and ۳ h after admission, and international normalized ratio. In the second data set, while lasso logistic selected four variables as being significant, classic logistic was able to identify only the importance of one variable. Conclusion: The AIC for lasso models was lower than that for traditional regression models. Lasso method has practical appeal when Events Per Variable is low and multicollinearity exists in the data.

نویسندگان

Abolfazl Hosseinnataj

Department of Biostatistics and Epidemiology, Modeling in Health Research Center, Faculty of Health, Institute for Futures Studies in Health, Kerman University of Medical Sciences, Kerman, Iran

Abbas Bahrampour

Professor, Department of Biostatistics, Physiology Research Center, Institute of Basic and Clinical Physiology Sciences & Modeling in Health Research Center, Faculty of Health, Institute for Futures Studies in Health, Kerman University of

Mohammadreza Baneshi

Professor, Department of Biostatistics and Epidemiology, Modeling in Health Research Center, Faculty of Health, Institute for Futures Studies in Health, Kerman University of Medical Sciences, Kerman, Iran

Farzaneh Zolala

Associate Professor, Department of Biostatistics and Epidemiology, Social Determinants of Health Research Center, Institute for Futures Studies in Health, Kerman University of Medical Sciences, Kerman, Iran

Roya Nikbakht

Department of Biostatistics and Epidemiology, HIV/STI Surveillance Research Center, and WHO Collaborating Centre for HIV Surveillance, Kerman University of Medical Sciences, Kerman, Iran

Mehdi Torabi

Associate Professor, Department of Emergency Medicine, Kerman University of Medical Sciences, Kerman, Iran

Fereshteh Mazidi Sharaf Abadi

Department of Emergency Medicine, Kerman University of Medical Sciences, Kerman, Iran

مراجع و منابع این مقاله:

لیست زیر مراجع و منابع استفاده شده در این مقاله را نمایش می دهد. این مراجع به صورت کاملا ماشینی و بر اساس هوش مصنوعی استخراج شده اند و لذا ممکن است دارای اشکالاتی باشند که به مرور زمان دقت استخراج این محتوا افزایش می یابد. مراجعی که مقالات مربوط به آنها در سیویلیکا نمایه شده و پیدا شده اند، به خود مقاله لینک شده اند :
  • Austin PC, Steyerberg EW. The number of subjects per variable ...
  • Vittinghoff E, McCulloch CE. Relaxing the rule of ten events ...
  • Lin FJ. Solving multicollinearity in the process of fitting regression ...
  • Clarke R, Ressom HW, Wang A, Xuan J, Liu MC, ...
  • Li H, Gui J. Partial Cox regression analysis for high-dimensional ...
  • Pinsky PF, Magder LS. Evaluating the tradeoff between bias and ...
  • Slinker BK, Glantz SA. Multiple regression for physiological data analysis: ...
  • Hammami D, Lee TS, Ouarda TB, Lee J. Predictor selection ...
  • Tibshirani R. Regression shrinkage and selection via the lasso. J ...
  • Tian GL, Tang ML, Fang HB, Tan M. Efficient methods ...
  • Jang DH, Anderson-Cook CM. Influence Plots for LASSO. [cited ?????] ...
  • Meinshausen N, Bühlmann P. High-dimensional graphs and variable selection with ...
  • Benner A, Zucknick M, Hielscher T, Ittrich C, Mansmann U. ...
  • Roberts S, Nowak G. Stabilizing the lasso against cross-validation variability. ...
  • Steyerberg EW, Vickers AJ, Cook NR, Gerds T, Gonen M, ...
  • Murphy TB, Dean N, Raftery AE. Variable selection and updating ...
  • Huang H. Controlling the false discoveries in LASSO. Biometrics ۲۰۱۷; ...
  • Torabi M, Mazidi Sharaf Abadi F, Baneshi MR. Blood sugar ...
  • Isbell C, Cohn SM, Inaba K, O'Keeffe T, De Moya ...
  • Cirocchi R, Grassi V, De Sol A, Renzi C, Parisi ...
  • Froberg L, Helgstrand F, Clausen C, Steinmetz J, Eckardt H. ...
  • Duane TM, Ivatury RR, Dechert T, Brown H, Wolfe LG, ...
  • Ono S, Ono Y, Matsui H, Yasunaga H. Factors associated ...
  • Czaja CA, Miller L, Alden N, Wald HL, Cummings CN, ...
  • Homaira N, Briggs N, Oei JL, Hilder L, Bajuk B, ...
  • Tempia S, Walaza S, Moyes J, Cohen AL, von Mollendorf ...
  • Radchenko P, James GM. Variable Inclusion and shrinkage algorithms. J ...
  • Zou H. The adaptive lasso and its oracle properties. J ...
  • Zou H, Hastie T. Regularization and variable selection via the ...
  • Mallick H, Yi N. Bayesian methods for high dimensional linear ...
  • Reid S, Tibshirani R, Friedman J. A study of error ...
  • Park T, Casella G. The bayesian lasso. J Am Stat ...
  • نمایش کامل مراجع