Abstract
Importance: Hypogonadism is defined by consistently low serum testosterone levels in conjunction with clinical symptoms. Testosterone replacement therapy (TRT) can be used to achieve physiologic levels of testosterone. Testosterone deficiency is associated with increased mortality and poorer health outcomes.
Purpose: To compare rates of mortality, atrial fibrillation (AF), stroke, myocardial infarction (MI), and prostate cancer in hypogonadal men who received TRT versus those who did not.
Methods: The TriNetX database was utilized to access deidentified, retrospective propensity matched EMR data from 57 participating health care organizations between 2005 to 2020. Cohorts included males 40 to 80 years old diagnosed with hypogonadism who were prescribed TRT versus no TRT. Propensity matching was performed to reduce bias and balance confounding factors between the 2 groups. The following 3-year outcomes were analyzed: mortality, AF, stroke, MI, and prostate cancer.
Results: There were 163,456 male patients identified with hypogonadism, and 133,584 were included after propensity matching. There was a lower mortality rate, (3.1% vs 3.6%; RR, 0.886; P < .001), decreased risk of AF (3.6% vs 4.0%; RR 0.900; P < .001), less stroke (1.6% vs 1.8%; RR, 0.898; P < .011), and fewer cases of prostate cancer (1.9% vs 2.9%; RR 0.648; P < .001) for patients on TRT.
Conclusions: Using TRT is associated with moderately lower rates of mortality, atrial fibrillation, stroke, and prostate cancer in hypogonadal men versus no TRT. There is potential for missed cases of stroke, prostate cancer, and cardiovascular disease incidence not captured by the database. As prescriptions of TRT increase, understanding risks and benefits will help guide future practice.
- Cardiovascular Diseases
- Chronic Disease
- Hormone Replacement Therapy
- Hypogonadism
- Lifestyle
- Preventive Medicine
- Primary Health Care
- Reproductive Health
- Testosterone
Introduction
Hypogonadism is failure of the testes to produce physiologic levels of testosterone (T).1 It is defined by consistently low serum testosterone levels in conjunction with clinical symptoms.2 Testosterone levels vary significantly, due to diurnal and seasonal changes as well as between individuals. For this reason, at least 2 measurements of fasting, morning serum T should be acquired 4 weeks apart to establish the diagnosis.3 Multiple guidelines propose different serum T cut offs for male hypogonadism, ranging from 231 ng/dL to 300 ng/dL, so it is important to look at the entire clinical picture.4 Clinical manifestations of testosterone deficiency (TD) present sexually, physically, and psychologically.
The rate of testosterone prescription has increased 3- to 8-fold in the United States since 2008.5,6 However, during the years 2013 and 2014, testosterone replacement therapy prescriptions had a tumultuous decline owing in part to a change in guidelines and mention of a possible increase in suspected cardiovascular risk.7 Since then, after further investigation and an accruing evidence base in favor of decreased negative cardiovascular outcomes, along with favorable insurance and industry sentiments, TRT prescription rates benefited from a resurgence. It is estimated that 10 to 31% of men diagnosed with hypogonadism have a testosterone prescription, equating to 0.8-2.91% of the total population.8
Most longitudinal community-based population studies have demonstrated significant associations between TD and all-cause mortality.9 A meta-analysis of 12 community-based studies demonstrated a 35% increased risk of all-cause mortality and a 25% increase in Cardiovascular Disease (CVD) mortality in association with a 2.18 standard deviation decrease of serum T.10 As such, this study is significant in adding to existing literature to answer the question of whether TRT prescriptions in select males with hypogonadism are safe and possibly protective from a cardiovascular disease and prostate cancer risk standpoint.
Atrial fibrillation (AF) is the most common cardiac arrhythmia worldwide, affecting 2.7 to 6.1 million US individuals in 2010, and prevalence of the dysrhythmia is expected to double in the year 2030.11 Prevalence of AF is higher in men than women and increases with age.12 It has been shown that normalization of T levels by TRT is associated with a significantly lower incidence of AF when compared with non-normalized treated and untreated groups.11 However, other studies noted a modest positive association between the incidence of AF in men and total serum testosterone concentrations.13,14
For individuals over 20 years of age, stroke was the fifth leading cause of death in the United States between 2009 and 2012.15 One prospective observational study with a median follow-up period of 3.5 years consisting of 3,775 males over the age of 70 correlated free testosterone levels at baseline below 222 pmol/L with a higher incidence of stroke in older men when adjusting for confounding variables.16 Another, smaller scale study corroborated these findings by measuring levels of sex hormones including free and total testosterone levels in a broad array of males aged between 35 and 92 years with ischemic strokes and noted a decrease of 18 to 20% in total and free serum testosterone in subjects with strokes when compared with an age-approximated control population.17
Cardiovascular death is the leading cause of death in the United States, with over half of those being from acute myocardial infarction.18 Recent studies have reported a relationship between TD and atherosclerosis, coronary artery disease, and Cardiovascular (CV) events.19 Previously, the cardiovascular safety of TRT in hypogonadal men had not been determined, but the TRAVERSE study shows promising results.14 This randomized, placebo controlled trial enrolled 5,246 clinically hypogonadal men between the ages of 45 to 80 who had preexisting or high risk of CV disease and the TRT group was found to be noninferior to placebo in incidence of major adverse cardiac events.14
TRT was once contraindicated in men with prostate cancer based on the hormonal dependence of prostate cancer.20,21 A systematic review of 44 studies concluded that TRT does not have a significant effect on serum PSA levels.21,22 In addition, studies have found that patients treated for age-related hypogonadism do not develop prostate cancer at a higher incidence than the general population of untreated men.21 This is in balance and contrast to a 2004 case report which mentions a marginal increase in PSA in a male on TRT, which decreased when discontinued; however, the article makes an important acknowledgment that a rising PSA level in the setting of androgen therapy does not necessarily mean the patient has prostate cancer.23
The goal of this investigation is to use the TriNetX database to compare health outcomes in hypogonadal men who received TRT and those who did not to determine the safety of exogenous testosterone.
Methods
TriNetX is a global health care research network that provides access to deidentified, retrospective EMR data. This study utilized the US Collaborative Network consisting of 57 health care organizations (HCOs) and approximately 100 million patients. The participating HCOs are predominantly located in the US and include academic medical centers, specialty physician practices, and community hospitals.
Study Cohort
The platform was used to generate 2 cohorts consisting of men between 40 and 80 years old at the time of their hypogonadism diagnosis, identified using the International Statistical Classification of Diseases, Ninth and Tenth Revisions (ICD-9 and ICD-10) codes E29.1. The first cohort included patients who were prescribed testosterone (T) within 3 months on or after hypogonadism was diagnosed. Inclusion criteria for the second cohort consisted of male patients without a prescription of testosterone before or up to 3 years after the diagnosis of hypogonadism. The second cohort did not receive any prescription of testosterone through any route, including buccal, gel, transdermal, pellets or clomiphene citrate. The 3-year time frame was set for the second cohort to exclude any patients who may have been started on testosterone replacement therapy within the outcome period. Prescription of testosterone was identified using RXNORM:10379. Data were analyzed within the time window of January 1, 2005, through May 1, 2020. Patients were excluded if the index event occurred greater than 20 years before the time of analysis.
Outcomes
For each cohort the following 3-year outcomes were analyzed: mortality, AF, stroke, myocardial infarction (MI), and prostate cancer (PC). Patients with any outcomes before the 3-year period of analysis were excluded to maintain the structural integrity of the cohort study. We evaluated patient diagnoses with hypogonadism from the years 2005 - 2020 and excluded the following 3 years (2020 to 2023) as these were potential follow-up years for outcomes of interest in patients diagnosed in 2018 to 2020. Limitations of the database restrict data collection behind 20 years from the present.
The respective ICD-10 codes used to identify these diagnoses within the cohorts were: I48, I63, I21, and C61. Mortality is identified in the database as a demographic. Mortality data within the TriNetX platform is obtained from EMR data and HCOs, in conjunction with national death registries. There is potential for missed death events when a patient is treated at an HCO not affiliated with the TriNetX network and subsequently experiences a fatal outcome outside of this network. However, this represents only a minor issue, as currently, 94% of Health care Organizations (HCOs) within the TriNetX network are also linked to the US death registries. The search was generated on August 16, 2023.
Propensity Matching
A 1:1 propensity score matching was conducted with linear and logistic regression to limit confounding variables. Linear regression was used for any continuous variables for which propensity scoring was done such as age. Logistic regression was done for those items that were dichotomous. Variables included were age, race/ethnicity, and preexisting conditions including hypertensive diseases (I10-I16), diabetes mellitus (E08-E13), acute kidney failure and chronic kidney disease (N17-N19), overweight and obesity (E66), heart failure (I50), cardiac arrest (I46), ischemic heart diseases (I20-I25), and malignant neoplasm of the bronchus and lung (C34). These preexisting conditions were included in propensity matching because they are known risk factors for the analyzed outcomes. Greedy nearest-neighbor matching was used with a tolerance of 0.1 and a difference between propensity scores less than or equal to 0.1. The order of data rows is randomized in TriNetX to mitigate bias introduced by the nearest-neighbor algorithm. These methods have been previously validated.24
Statistical Analysis
Statistical analysis was completed through the TriNetX platform. Univariate analysis was performed using the measure of association tool with Chi-Square and t-testing to provide comparisons of patients with measured outcomes in each cohort, risk (within cohort), risk difference (between cohorts), risk ratio (RR), accompanied by 95% confidence intervals (CI) with P-values. Statistical significance was set with a 2-sided α of less than 0.05. Patients with outcomes from before the time window or with previous outcome diagnoses were excluded. Comparisons were made before and after propensity matching.
IRB Approval
Utilization of the data from TriNetX does not require UTMB IRB review as this is secondary analysis of deidentified data. The UTMB IRB determined that this project is considered “not human subjects research.”
Results
The queries returned a total of 163,456 patients, before propensity matching for demographics and risk factors from 57 HCOs within the US Collaborative network. Cohort 1 was prescribed testosterone after a diagnosis of hypogonadism. Cohort 2 was not prescribed testosterone before or within 3 years of a hypogonadism diagnosis. The query for cohort 1 included 57 HCOs with 68,448 patients. The query for cohort 2 included 57 HCOs with 95,008 patients. Following propensity matching, there were a total of 133,584 patients diagnosed with hypogonadism. The cohorts were thus well-balanced with 66,792 patients each in the testosterone and no testosterone groups. Outcomes were tabulated 3 years following the initial diagnosis of hypogonadism.
Before propensity matching, there was statistical significance between all demographic groups except Native Hawaiian or Other Pacific Islander. All preexisting risk factor groups before propensity matching were significantly different between cohorts 1 and 2 except for Malignant Neoplasm of Bronchus & Lung. Table 1 presents demographic trends in cohorts 1 and 2 before and after propensity matching. Table 2 presents preexisting risk factor trends in cohorts 1 and 2 before and after propensity matching. After propensity matching, most of the differences between demographic factors and preexisting conditions were eliminated.
Patient Demographics Before and After Propensity Matching
Preexisting Conditions Associated with Mortality Before and After Propensity Matching
Analysis of the 2 cohorts showed statistically significant differences in all outcomes except MI (Table 3 and 4). There was a lower mortality rate, (3.1% vs 3.6%; RR, 0.886; P < .001), decreased risk of AF (3.6% vs 4.0%; RR 0.900; P < .001), decreased risk of stroke (1.6% vs 1.8%; RR, 0.898; P < .011), and fewer cases of prostate cancer (1.9% vs 2.9%; RR 0.648; P < .001) for patients on TRT when compared against patients with hypogonadism who were not prescribed TRT. These results were similar in the nonpropensity-matched groups, with a significantly decreased mortality rate (3.2% vs 3.4%; RR 0.940; P = .023), decreased risk of AF (3.6% vs 4.2%; RR 0.873; P < .001), decreased risk of stroke (1.6% vs 1.7%; RR 0.915; P = .023), and fewer cases of prostate cancer (1.9% vs 3.3%; RR 0.575; P < .001). See Table 5 for a representation of exclusions from each cohort for each outcome; patients were excluded if they had one of the outcomes before the diagnosis of hypogonadism.
Mortality, Cardiovascular, and Prostate Cancer 3-Year Outcomes After Propensity Score Matching
Mortality, Cardiovascular, and Prostate Cancer 3-Year Outcomes Before Propensity Score Matching
Patients Excluded If the Outcome Occurred Before the 3-Year Time Frame Evaluated
Discussion
In this study, the hypogonadal men treated with TRT within 3 months of diagnosis had significantly lower rates of mortality, atrial fibrillation, stroke, and prostate cancer compared with the hypogonadal men who received no testosterone within 3 years of diagnosis following propensity matching. There was a nonsignificant lower rate of myocardial infarction in the treatment cohort. This high-powered study is currently the largest analysis concerning testosterone replacement therapy and mortality, with generalizable findings after propensity matching.
Testosterone status has been linked to the general health of the male population.9 Whether this is an etiology or an association is not entirely known. Low testosterone has also been linked to increased all-cause mortality in disease-specific populations, such as type 2 diabetes mellitus and acute MI.9 The Seattle study retrospectively analyzed 1031 hypogonadal (T < 250 ng/dL) male veterans over the age of 40 and compared survival rates in those who received TRT and those who did not. After propensity score adjustment, they found TRT was associated with a decreased risk of death.25
In their retrospective observational VA cohort study, Sharma et al. analyzed 3 groups of propensity-matched male veterans aged between 40 to 69 with documented low testosterone levels: those who received testosterone replacement with normalization, those who received therapy without normalization, and those who did not receive any testosterone replacement therapy. The study found that all-cause mortality, risk of MI, and stroke were significantly lower in those males who received testosterone replacement therapy with normalization than those whose testosterone levels did not normalize. While their study relied on a different database and cohorts, these findings are supported by our much larger study.26 Fantus et al. showed that hypogonadal men were more likely to have obesity, diabetes mellitus, hypertension, CHF, MI, walking disability, and a higher atherosclerotic cardiovascular disease risk.27 This study also demonstrated an increased incidence of premature mortality associated with untreated hypogonadism.27
In men, the prevalence of AF ranges from 0.2% for those younger than 55 to more than 11% among those over the age of 85.28 In this study, TRT was associated with an approximately 9% risk reduction in atrial fibrillation rates. It is unclear whether low testosterone has a causal relationship to atrial fibrillation, and if it does, it is most likely multifactorial and heterogeneous.13 Evidence supports a relationship between TD and increased risk factors for AF.29 Diabetes mellitus, hypertension, obesity, and metabolic syndrome have been associated with lowered testosterone in men.29 Animal models have demonstrated that modification of systemic hormones can alter atrial electrophysiology: orchiectomized rats showed greater repetitive atrial responses compared with controls.30
A population-based study of 1,482 men found that lower total testosterone levels were significantly related to increased carotid intima-media thickness, an indicator for general atherosclerosis.31 Hypogonadism has also been linked to risk factors associated with cerebrovascular disease, such as blood pressure, increased serum cholesterol, diabetes, AF, obesity, and smoking.17,32 In a study of 167 men with acute ischemic stroke, total and free serum T concentrations were 18% and 20% lower, respectively, than healthy control subjects.17
Three studies published from 2010 to 2014 suggested an association between adverse CV events and exogenous testosterone use.33,34 In March of 2015, the Food and Drug Administration (FDA) changed the labeling for testosterone prescriptions to reflect the possible increased risk of heart attacks and strokes.35 This action may have been in response to the literature written by Vigen et al.,7 a retrospective national cohort study of all males with low testosterone in the VA system who received a coronary angiogram and had serum testosterone levels checked.7 The study had noted a significant increase of death, MI and stroke in the population, and the resulting data were implicated in the premature discontinuation of the Testosterone in Older Men with Mobility Limitations (TOM) trial conducted in older frail men with a high prevalence of cardiovascular diseases.33
Many studies since this label change have shown the opposite association, including our study. Although our query did not find a statistically significant reduction in MI when comparing the TRT group with the control group, the TRT group had slightly lower rates of MI (1.8% vs 1.9%) and a higher-powered patient population could show a significant protective association. The findings in our study also agree with a meta-analysis of 70 studies which demonstrated significantly lower T and higher 17-β estradiol levels in patients with CV disease, even after adjusting for age and body mass index.19,36 The TRAVERSE study further provides clarity on the safety of TRT in men with CV risk factors and established CVD.13 Baillargeon et al. demonstrated that older men treated with IM testosterone did not have an increased risk of MI. Testosterone use was also found to be modestly protective for men with high MI risk.37
Prostate cancer has historically been a feared complication of TRT, due to the androgen dependency of these tumors38; however, our findings demonstrate a 35.2% reduction in PC in the treated group. There is no conclusive evidence that high serum T levels are associated with increased prostate cancer risk.38 Several trials have demonstrated no increase in the risk of prostate cancer in hypogonadal men on TRT versus the general population.39 The saturation theory proposes that prostate growth is extremely sensitive to variations in testosterone at very low concentrations, but becomes insensitive to changes in testosterone concentrations at higher levels.40 This could explain why men with baseline total T less than 250 ng/dL are more likely to have increased prostate specific antigen (PSA) levels after TRT than those with a baseline total T of 250 ng/dL or greater.41
A significant limitation of this study is that it is assumed that the men in Cohort 1 reached physiologic levels of testosterone with TRT. In other words, the numeric testosterone values of men on TRT after prescription was not obtained, and it is unknown whether males who received this therapy remained subtherapeutic, normalized, or were supratherapeutic. If some of the men in Cohort 1 remained hypogonadal or supratherapeutic with treatment, the observed relationship in the variables studied may be skewed. Future studies demonstrating an assessment of the proportion of each arm that reached eugonadal levels after approximately 3 months of prescription following established guidelines would be beneficial to address this.
There is a potential for missed cases of atrial fibrillation, stroke, myocardial infarction and prostate cancer if the patient was treated at an HCO not affiliated with the TriNetX network. Limiting the analyses to men who have at least 2 visits after their hypogonadism diagnoses could reduce the bias for men who seek treatments at non-TriNetX health care organizations and could serve as a sensitivity analysis. Diagnoses of intermediate endpoints between cohorts like total testosterone, BMI, total cholesterol, HgbA1c or new antihypertensives were not available on the database at the time of data collection; the inclusion of such data if it were available could serve to further reduce confounding bias.
Another limitation of this study is that it is retrospective. Retrospective studies can suggest associations between treatment groups and health outcomes, but due to confounding it can be difficult to establish causality. By using propensity matching and analyzing present outcomes after a retrospective review as has been done in this study, it is plausible to draw causal inferences. While propensity matching for several comorbidities was performed, there may be other conditions (for example, smoking history) during data collection that could serve as confounding variables. Unfortunately, such characteristics were not accessed from the database. The corollary to this is that major articles in the literature that have influenced the prescription of TRT have been retrospective in nature. Noting this, further investigation through a prospective randomized controlled study could strengthen the causality between these variables and address a gap in the literature.
Overall, the use of TRT to achieve physiologic testosterone levels is associated with a decrease in mortality, AF, stroke, and prostate cancer. The prevalence of age-related hypogonadism is not known with certainty due to variations in diagnostic criteria between organizations, but it is projected to be up to 25%.42 Normalizing testosterone levels in these men through the use of TRT can improve quality of life and reduce risk factors associated with increased morbidity/mortality. Future studies should aim to measure testosterone levels after TRT and determine threshold levels for both risks and benefits for various outcomes to determine the optimal therapeutic level of testosterone. This data would guide clinicians in determining optimal replacement dosing for patients.
The goal of this study is to evaluate health outcomes in hypogonadal men receiving TRT compared with those who did not. In the largest study to date, TRT in hypogonadal men was associated with significantly lower rates of mortality, atrial fibrillation, stroke, and prostate cancer. Testosterone’s effects on risk factors associated with these outcomes is an important area of study that should be further expanded on in the future. As the rates of testosterone prescribing increase, it is important to investigate the benefits/risks and optimal levels for TRT. This will help guide physicians in their prescribing practices and patient education.43
Appendix 1
Greedy Nearest Neighbor Matching (NNM)
The most common implementation of propensity matching is pair-matching, in which pairs of treated and control participants are formed. There are several common implementations of pair-matching. The most commonly used is greedy nearest neighbor matching (NNM), which we used, in which a treated participant is selected at random and then matched to the control participant whose propensity score is closest to that of the treated participant. The process is described as greedy because at each stage the control is selected who is closest to the currently considered treated participant, even if that untreated participant would serve better as a control for a subsequently treated participant. This process is then repeated until a matched control participant has been selected for each treated participant. This process generally uses matching without replacement, so that once a control participant is matched to a treated participant, that control participant is no longer available for matching to a subsequent treated participant. A refinement to NNM is NNM with a caliper restriction. Using this approach, a control participant is an acceptable match for a treated participant only if the difference in their propensity scores is less than a maximum amount (the caliper width or distance). For technical reasons, one typically matches on the logit of the propensity score and uses a caliper width that is defined as a proportion of the (0.1–0.2) SD of the logit of the propensity score. A crucial step in any study that uses propensity score matching is to assess the degree to which matching on the propensity score resulted in the formation of a matched sample in which the distribution of baseline characteristics is similar between treated and control participants. This assessment is critical as it allows both the researcher and readers to assess whether matching on the estimated propensity score has removed systematic baseline differences between treatment. The use of the standardized difference, which is the difference in means in units of SD, is often used for assessing the similarity of matched treated and control participants. Some authors have suggested that a threshold of 0.10 (or 10%) be used to denote acceptable balance after matching. Once acceptable balance has been achieved, analysts can unblind themselves to the outcome and compare outcomes between treated and control participants in the matched sample. The analyses conducted in the propensity score–matched sample can be similar to those that would be done in an Randomized Controlled Trial (RCT) with a similar outcome.
Notes
This article was externally peer reviewed.
Funding: This research was supported by the Institute for Translational Sciences at the University of Texas Medical Branch, and in part by a Clinical and Translational Science Award (UL1 TR001439) from the National Center for Advancing Translational Sciences at the National Institutes of Health (NIH). The content is solely the responsibility of the authors and does not necessarily represent the official views of the NIH.
Conflict of interest: The authors have no conflicts of interest to disclose.
To see this article online, please go to: http://jabfm.org/content/37/5/816.full.
- Received for publication January 18, 2024.
- Revision received April 2, 2024.
- Accepted for publication April 8, 2024.