PT - JOURNAL ARTICLE AU - Levi N. Bonnell AU - Benjamin Littenberg AU - Safwan R. Wshah AU - Gail L. Rose TI - A Machine Learning Approach to Identification of Unhealthy Drinking AID - 10.3122/jabfm.2020.03.190421 DP - 2020 May 01 TA - The Journal of the American Board of Family Medicine PG - 397--406 VI - 33 IP - 3 4099 - http://www.jabfm.org/content/33/3/397.short 4100 - http://www.jabfm.org/content/33/3/397.full SO - J Am Board Fam Med2020 May 01; 33 AB - Introduction: Unhealthy drinking is prevalent in the United States, and yet it is underidentified and undertreated. Identifying unhealthy drinkers can be time-consuming and uncomfortable for primary care providers. An automated rule for identification would focus attention on patients most likely to need care and, therefore, increase efficiency and effectiveness. The objective of this study was to build a clinical prediction tool for unhealthy drinking based on routinely available demographic and laboratory data.Methods: We obtained 38 demographic and laboratory variables from the National Health and Nutrition Examination Survey (1999 to 2016) on 43,545 nationally representative adults who had information on alcohol use available as a reference standard. Logistic regression, support vector machines, k-nearest neighbor, neural networks, decision trees, and random forests were used to build clinical prediction models. The model with the largest area under the receiver operator curve was selected to build the prediction tool.Results: A random forest model with 15 variables produced the largest area under the receiver operator curve (0.78) in the test set. The most influential predictors were age, current smoker, hemoglobin, sex, and high-density lipoprotein. The optimum operating point had a sensitivity of 0.50, specificity of 0.86, positive predictive value of 0.55, and negative predictive value of 0.83. Application of the tool resulted in a much smaller target sample (75% reduced).Conclusion: Using commonly available data, a decision tool can identify a subset of patients who seem to warrant clinical attention for unhealthy drinking, potentially increasing the efficiency and reach of screening.