Abstract
Background: Primary care physicians are positioned to promote early recognition and treatment of men at risk for osteoporosis-related fractures; however, efficient screening strategies are needed. This study was designed to validate the Male Osteoporosis Risk Estimation Score (MORES) for identifying men at increased risk of osteoporosis.
Methods: This was a blinded analysis of the MORES, administered prospectively in a cross-sectional sample of men aged 60 years or older. Participants completed a research questionnaire at an outpatient visit and had a dual-energy X-ray absorptiometry (DXA) scan to assess bone density. Sensitivity, specificity, and area under-the-curve (AUC) were estimated for the MORES. Effectiveness was assessed by the number needed-to-screen (NNS) to prevent one additional major osteoporotic fracture.
Results: A total of 346 men completed the study. The mean age was 70.2 ± 6.9 years; 76% were non-Hispanic white. Fifteen men (4.3%) had osteoporosis of the hip. The operating characteristics were sensitivity 0.80 (95% confidence interval [CI], 0.52–0.96); specificity 0.70 (95% CI, 0.64–0.74), and AUC of 0.82 (95% CI, 0.71–0.92). Screening with the MORES yielded a NNS to prevent one additional major osteoporotic fracture over 10 years with 259 (95% CI, 192–449) compared to 636 for universal screening with a DXA.
Conclusion: This study validated the MORES as an effective and efficient approach to identifying men at increased risk of osteoporosis who may benefit from a diagnostic DXA scan.
Physicians rarely screen men for osteoporosis, even men at high risk because of long-term glucocorticoid use or prior fragility fractures.1⇓⇓⇓⇓⇓–7 Because low bone mineral density (BMD) is the single best predictor of future fragility fractures,8⇓–10 identifying men at increased risk for osteoporosis who should undergo a dual-energy X-ray absorptiometry (DXA) scan may be worthwhile.
Osteoporotic fractures increase in men after 65 years of age,11 an age group that is predicted to nearly double in the United States by 2030.12 Osteoporotic fractures are associated with considerable morbidity and mortality: hip fractures account for the third-highest hospital bed occupancy among men. Only chronic obstructive pulmonary disease (COPD) and myocardial infarction account for more hospital days.13 Although osteoporosis is less common in men compared with women, in-hospital and 1-year mortality rates following a fracture are higher for men.14⇓⇓⇓–18
Once osteoporosis is diagnosed, effective treatments aimed at fracture prevention are available. In women, treatment of osteoporosis with bisphosphonates reduces primary hip fractures by 40% to 50%.19⇓⇓⇓–23 Studies of the efficacy of bisphosphonate therapy for osteoporosis in men are not as robust as those in women; however, several research groups have concluded that the benefits of therapy in men, especially with respect to primary and secondary prevention of vertebral fractures, are similar to those in women.24⇓⇓–27 In men, results of treatment studies with bisphosphonates regarding nonvertebral fractures are inconsistent and may reflect the small number of men in these studies.28⇓⇓⇓–32
Guidelines for osteoporosis screening of men are not uniform. In 2011, the US Preventive Services Task Force issued an I-statement and concluded that, “the current evidence is insufficient to assess the balance of benefits and harms of screening for osteoporosis in men.”33 Canadian guidelines promote universal screening with DXA in men ≥65 years old.34 The National Osteoporosis Foundation35 and American College of Preventive Medicine36 recommend universal DXA screening of men ≥70 years old. In contrast to “hard rules,” the American College of Physicians37 recommends individualized screening decisions based on risk but offers no specific algorithms.
Several screening instruments have been developed and validated in women; however, these instruments are unlikely to be applicable in men. For example, several incorporate estrogen use, which is obviously not applicable to men. All use age, but the age-related risks are likely to differ between the sexes because of differences in age-related changes in bone density between men and women. After peak bone mass, men demonstrate a gradual decline in BMD over time, whereas women experience an accelerated loss of BMD following menopause.38 All algorithms use weight, but weight-related risks are also likely to differ between men and women because of differences in body habitus. Therefore, developing and validating an instrument for men seems prudent.
The Osteoporosis Self-Assessment Tool (OST)39 was originally developed for women but has been adapted for men40 and validated in the Osteoporotic Fractures in Men (MrOS) study; however, the more complex calculations and lack of a clearly defined cutoff value limit its clinical utility.41 Two tools have been developed and validated specifically for men: the Male Osteoporosis Screening Tool (MOST)42 and the Male Osteoporosis Risk Estimation Score (MORES).43 The MOST was developed and internally validated in a cohort of Chinese men in Hong Kong (area under the curve [AUC] of 0.84 in the validation cohort).42 The MOST combines body weight and the quantitative ultrasound index obtained from a heel ultrasound to predict men at risk for osteoporosis and was subsequently validated in the MrOS cohort (AUC of 0.80 for Caucasian men and 0.83 for Chinese men).41 In the MrOS study, the MOST was superior to the OST, which had a statistically significant lower AUC compared with the MOST (AUC of 0.71 for Caucasian men and 0.76 for Chinese men). The major limitation of the MOST is the reliance on a heel ultrasound, which is not readily available in most primary care practices and increases costs associated with screening.
The MORES, the primary focus of this study, was developed and validated in 2995 men, ≥50 years old and representative of the general US population, enrolled in the National Health and Nutrition Examination Survey (NHANES) III.43 Using a weighted scale that includes age, weight, and history of COPD, the MORES identifies men at higher risk of osteoporosis who should undergo a diagnostic DXA scan (AUC, 0.83). The tool is easily incorporated into a clinical encounter and takes less than a minute to administer and score.
The development and evaluation of clinical decision rules are subject to methodological standards44 and progress through several defined stages45 during the process. This study describes the next critical step in the validation and evaluation of the MORES, namely prospective validation in an appropriate clinical setting. The purpose of this study is to validate the MORES in a primary care clinic population as a point-of-care decision tool to identify men at risk for osteoporosis and estimate the number needed-to-screen (NNS) to prevent one additional hip or major osteoporotic fracture over 10 years. The NHANES III cohort used to develop and internally validate the MORES was representative of the general US population at the time (1988 to 1994). Validation in a more current population is important because of variation in clinical characteristics over time and differences between the general population and patients attending primary care practices, who are likely to be less healthy. These potential differences may have unpredictable effects on the performance of a clinical prediction rule such as the MORES.
Methods
Study Design
This was a cross-sectional study of men who attended primary care outpatient clinics for usual care; its aim was to validate use of the MORES as a clinical decision tool to identify men at increased risk of osteoporosis. We enrolled men from November 12, 2008, through November 14, 2011. All participants provided written, informed consent. The Office of Research Subject Protections Institutional Review Board approved this study.
Setting
Participants came from university-based primary care outpatient clinics of the university's department of family medicine and the divisions of general internal medicine and geriatric medicine of the department of internal medicine.
Subjects
Men ≥60 years old were invited to take part in the study. Men were excluded if they had a prior diagnosis of osteoporosis or bone diseases such as Paget disease; were taking bone density conservation agents such as bisphosphonates, calcitonin, or teriparatide; or had a history of bilateral hip replacement surgery. Men also were excluded if they exceeded the weight limit (300 lb) of the DXA scanner.
Procedure
A research assistant interviewed participants and assisted with the completion of an investigator-designed questionnaire that included demographic data, medical history, and the questions contained in the MORES.
After completing the research questionnaire, all participants were asked to have a DXA scan to measure BMD. The DXA scan served as the criterion to classify men as having osteoporosis. For this study, we extended the World Health Organization's definition of osteoporosis46 to include not only a T-score ≤−2.5 for the femoral neck but also a T-score ≤−2.5 for the total hip because each is equally predictive of future fractures.47 The NHANES III cohort of non-Hispanic white women, 20 to 29 years old, served as the reference group.48,49 DXA scans were performed on a Hologic QDR 4500A (Hologic, Inc., Bedford, MA) and a GE Lunar iDXA (GE Healthcare Global Diagnostic Imaging, Pewaukee, WI) scanner. The same technician completed all scans. Standardized conversion formulas furnished by GE Health Care were used to transform all measurements into comparable units (GE Healthcare, personal communication, 2010).
Data were collected prospectively and analyzed in a blinded manner. The MORES was calculated independent of the BMD based on the scoring algorithm in Table 1. Likewise, the results of BMD were obtained independent of the results of the MORES.
Instruments
The MORES is a clinical risk stratification index developed in 2007 and internally validated in 2995 men ≥50 years old from the NHANES III dataset.43 A best-fitted logistic regression model used age, weight, and history of COPD to predict osteoporosis. Each factor retained in the model contributed a significant and independent effect to the risk of osteoporosis. The weights assigned to each factor were transformed from the regression coefficients to yield integer values indicative of the magnitude of effect on the risk of osteoporosis. A score of ≥6 maximally discriminated between men at higher versus lower risk of osteoporosis and yielded a sensitivity of 0.93, a specificity of 0.59, an AUC of 0.83, and a NNS to prevent one additional hip fracture over 10 years of 279. In a subsequent study, the MORES was validated for identifying men at increased risk for lumbar osteoporosis.50 Table 1 summarizes the weighted scoring algorithm.
Power Analysis
We estimated the prevalence of osteoporosis of the hip in men ≥60 years old at 7.0%. A sample of 350 men could then be expected to make available approximately 25 cases of osteoporosis of the hip to validate the MORES.
With a sample size of 25 men with osteoporosis, a 2-sided exact binomial test for a single proportion (α <0.05) would have at least 80% power to detect the difference between the null hypothesis of 0.50 and an alternative hypothesis as low as 0.77 for the sensitivity of the MORES. Calculations were made using nQuery Advisor 7.0.51
Analysis
We used a 2 × 2 contingency table to obtain the operating characteristics of the MORES, using a threshold of ≥6 points as the positive criterion and osteoporosis of the hip as the outcome state. We obtained 95% confidence intervals [CIs] for the sensitivity, specificity, positive predictive value, and negative predictive value using an exact method based on the binomial distribution. We used the binomial test for a single proportion to determine whether the observed sensitivity differed significantly from 0.50 (the null hypothesis). Assuming a nonparametric distribution, we estimated the AUC for the MORES.
To estimate the clinical utility of the MORES, we conducted an impact analysis estimating the NNS to prevent one additional hip fracture and one additional major osteoporotic fracture over 10 years following the methods described by Nelson et al.52 We used the prevalence observed in the current study to estimate the prevalence of osteoporosis and the operating characteristics observed in the current study to estimate the sensitivity and specificity of the MORES. Ten-year fracture rates were derived from estimates afforded by the FRAX calculation tool.53,54 Additional assumptions for the impact analysis were taken from Nelson et al52 and included a 70% rate of compliance with treatment and 0.63 relative risk reduction for hip fracture with treatment. We varied the relative risk reduction with treatment for a major osteoporotic fracture as a weighted average from other published studies.19,20
Results
We enrolled 386 men; however, 40 men did not report for the DXA scan after 3 requests and were dropped from the study. The final study population included the 346 men with an interpretable DXA scan. There were no significant differences between the men with and without a DXA scan with regard to age, weight, race/ethnicity, education, marital status, or history of COPD. The average age of men completing the study was 70.2 ± 7.0 years. Of these men, 76.0% were non-Hispanic white, 11.8% were African American, 10.7% were Hispanic, and 1.5% were Asian or other. The majority of men were high school graduates (87.9%), and most were married or living with someone (70.9%). Fifteen men (4.3%) had osteoporosis of the hip based on the criteria described in the Methods section. See Table 2 for a detailed description of the study population.
The MORES correctly identified 12 of the 15 men with osteoporosis of the hip, yielding a sensitivity of 0.80 (95% CI, 0.52–0.96; P = 0.035, 1-sample binomial test). The observed specificity for the MORES was 0.70 (95% CI, 0.64–0.74). As a result, the 11% of men who screened positive with the MORES were nearly 3 times as likely to have osteoporosis compared with the baseline prevalence of 4.3%. Men who screened negative with the MORES had only a 1% chance of having osteoporosis. The MORES demonstrated an excellent capacity to discriminate between men with and without osteoporosis of the hip based on an AUC of 0.82 (95% CI, 0.71–0.92).55 Table 3 provides a detailed summary comparing the operating characteristics of the MORES derived from the current study with those obtained from the NHANES III cohort.
Table 4 summarizes the adjusted odds ratios for the MORES indicator variables derived from a logistic regression equation predicting the dependent variable, osteoporosis of the hip, from the dependent variables defined by age groups, weight groups, and history of COPD. The magnitude of the adjusted odds ratios are similar to the weights derived from the NHANES III cohort. The regression equation from the MORES predictor variables yielded a well-calibrated model for predicting osteoporosis in the study population (Hosmer-Lemeshow goodness of fit statistic,56 χ2df = 4 = 3.339; P = 0.503).
To estimate the potential clinical utility of the MORES, we conducted an impact analysis to estimate the number of men who would be referred for a diagnostic DXA scan to prevent one additional hip fracture or one additional major osteoporotic fracture. The NNS to prevent one additional hip fracture with treatment over 10 years was 654 (95% CI, 485-1132) compared with 1604 for universal DXA screening. The NNS to prevent one additional major osteoporotic fracture over 10 years ranged from 245 to 276, depending on the effectiveness of treatment in preventing major osteoporotic fractures.19,20 By comparison, the NNS for universal screening to prevent one additional major osteoporotic fracture ranged from 600 to 675. Table 5 provides the details of the impact analyses.
Discussion
The MORES is a simple risk stratification tool that can be used at the point of care and requires <1 minute to administer and score. Men with a score of ≥6 points are considered at risk for osteoporosis and should be referred for a diagnostic DXA scan. In our clinical sample, we found a sensitivity of 0.80 and a specificity of 0.70 for the MORES. Although the estimated sensitivity and specificity varied from the NHANES III cohort, the AUC was similar (0.82 vs. 0.83), which indicates excellent discrimination55 and validates the MORES as a clinical risk stratification tool that can be used in a clinical population to identify men at increased risk for osteoporosis. Using the 2-step approach (positive MORES followed by DXA), 33% of men ≥60 years old would be referred for a DXA scan and 80% of men with osteoporosis would be identified correctly.
The slightly lower prevalence of osteoporosis in the clinical setting compared with the prevalence in the NHANES III data set was unexpected. In fact, a higher prevalence was expected because of the older age of the clinic participants. The lower prevalence may have been affected by 2 factors. The first is related to the debate over which reference standard and anatomic site should be used to diagnosis osteoporosis, which has taken place in the decade after publication of the NHANES III data.48 The development and internal validation of the MORES used T-scores based on sex-specific and race/ethnicity-based reference groups for the total hip. In contrast, the World Health Organization46 recommends that osteoporosis should be determined by comparing BMD measurements of the femoral neck for males and females of all races/ethnicities to 20 to 29 year-old white female norms from the NHANES III. For example, changing the critical value to define osteoporosis from a BMD of ≤0.681g/cm2 for the total hip, based on 20 to 29 year-old white men, to a BMD of ≤0.558 g/cm2 for the femoral neck, based on 20 to 29 year-old white women, results in fewer men being classified as having osteoporosis.48
The second factor that may explain a lower prevalence is a potential study bias toward more robust older men. This study involved a sample of men from one geographic area along the Gulf coast of Texas, with Galveston Island as the major population center. On September 13, 2008, 2 months before the study began, Galveston Island sustained a direct hit from Hurricane Ike and the devastation of a 22-ft storm surge.57 Before landfall, a mandatory evacuation was ordered, which included all patients in skilled nursing facilities, nursing homes, and hospitals. For months afterward, access to medical facilities was limited, and many potentially eligible men (especially elderly men and those with frailties) were displaced. This may have biased the study group toward men who were healthy enough to return during hurricane recovery and restoration efforts and who may have been less likely to have osteoporosis.
The lower sensitivity observed for the MORES in the current study, compared with the NHANES III cohort, is a concern; however, the CIs for the 2 estimates overlap. The sensitivity found in this study may be more a function of the limited number of men with osteoporosis than a limitation of the MORES as risk stratification instrument. The estimated AUC in this study (0.81) is essentially identical to that seen in the NHANES III cohort (0.83) and, in absolute terms, confirms that the MORES is a robust instrument that can be used to differentiate men at higher risk of osteoporosis from those at lower risk.
The impact analysis demonstrates that the MORES provides an efficient approach to screening men for osteoporosis. The overall NNS to prevent one additional hip fracture over 10 years (654) was higher than that in the NHANES III cohort (279). The difference is mostly due to the lower 10-year risk of fracture (0.056) used in the current study, which was derived from the FRAX estimates for our population, compared with the 10-year risk of fracture (0.135) obtained from Kanis et al58 used in the initial study. If the hip fracture rates estimated from FRAX are used to compare the current study to the NHANES III study, then the difference in the NNS is negligible (654 vs. 672), despite the lower sensitivity of the MORES seen in the current study. More importantly, in this study, the NNS to prevent any major osteoporotic fracture, based on FRAX estimates, is much lower (298) than for hip fracture alone.
Our study had limitations. The study was conducted in one geographical area among a relatively small number of men, which limits generalizability. However, this study does add to the evidence that the MORES is a valid instrument.43,50 The small sample size precluded any stratified analyses by age, race/ethnicity, or anatomic site of osteoporosis. Studies of larger samples from other geographic regions are needed to assess further the validity of the MORES and evaluate the operating characteristics of the MORES stratified by age and race/ethnicity.
Conclusion
We developed the MORES to be used by primary care physicians during an ambulatory visit to identify men at increased risk for osteoporosis, the foremost modifiable risk factor for future major fractures. The results of this study provide further evidence that the MORES is a valid and efficient tool that can be used to identify men at increased risk for osteoporosis in a clinical setting. On the basis of the results from this clinical study, 33% of men screened positive with the MORES and would be referred for a diagnostic DXA scan. At least 80% of men with osteoporosis would be identified correctly. While the evidence that treatment of osteoporosis reduces future fracture in men is less well established than in women, we believe the early evidence supports treating men with osteoporosis.24⇓⇓⇓⇓⇓⇓⇓–32 For primary care physicians, identifying and treating men with osteoporosis may be an important step in the prevention of future fractures.
Notes
This article was externally peer reviewed.
Funding: This research was supported by a grant from the Agency for Healthcare Research & Quality (1R03HS017732-01) and funded a portion of the salary of ARC.
Conflict of interest: none declared.
- Received for publication July 11, 2012.
- Revision received March 7, 2013.
- Accepted for publication March 11, 2013.