Abstract
Purpose: To understand the ability of trigger tools to detect preventable adverse events (pAEs) in the primary care outpatient setting using the Institute for Healthcare Improvement's (IHI) Outpatient Adverse Event Trigger Tool (IHI Tool).
Methods: The OVID MEDLINE and OVID MEDLINE In-process and non-Indexed citations databases were queried using controlled vocabulary and Medical Subject Headings related to the concepts “primary care” and “adverse events.” Included articles were conducted in the outpatient setting, used at least 1 of the triggers identified in the IHI Tool, and identified pAEs of any type. Articles were selected for inclusion based first on assessment of titles then abstracts by 2 trained reviewers independently, followed by full text review by 2 authors.
Results: Our search identified 6435 unique articles, and we included 15 in our review. The most common studied trigger was laboratory abnormalities. The most common pAEs were medication errors followed by unplanned hospitalizations. The effectiveness of triggers in identifying AEs varied widely.
Conclusion: There is insufficient data on the IHI Tool and its use to identify pAEs in the general real-world outpatient setting. Health care providers of the primary care setting may benefit from better trigger tools and other methods to help them detect pAEs. More research is needed to further evaluate the effectiveness of trigger tools to reduce barriers of cost and time and improve patient safety.
- Ambulatory Care
- Family Physicians
- Medical Subject Headings
- Medication Errors
- MEDLINE
- NHS
- Patient Safety
- Primary Health Care
Since the 1999 landmark report by the Institute of Medicine (IOM), To Err is Human1, patient safety has become a priority in health care systems.2 Although, most work has focused on acute, inpatient settings3,4, a much larger number of patients seek care in the outpatient setting.5,6 Therefore, identifying preventable adverse events (pAEs) in the outpatient setting is important.
Medical errors are mistakes that may or may not cause harm (eg, a minor error in dosing a medication). Not all medical errors are adverse events (AEs). An AE is harm caused from medical care and not the disease process itself (eg, a rash in response to an antibiotic vs a rash in response to an infection). A pAE is an AE or harm to a patient that is the “result of care that fell below the standard expected of physicians in their community”7 or “avoidable by any means currently available unless that means was not considered standard care.”8 (Figure 1). Voluntary reporting and chart review are common, traditionally accepted methods used to identify medical errors and pAEs in the hospital setting.9,10 These methods do not translate easily to the outpatient setting for several reasons. First, patients are not under constant observation in the outpatient setting. In fact, the provider-patient contact time is relatively limited. In addition, it is well documented that voluntary reporting vastly underestimates the rate of medical errors.11,12 Finally, extensive chart review is not only costly, subjective, and unsustainable but also extremely time consuming and inefficient.13⇓–15 One method for streamlining error identification of “high-risk” charts that should be reviewed for errors and AEs relies uses a collection high-risk situation or events called “triggers.”16,17 Several tools have been proposed18⇓–20, but one of the most recognized is that from the Institute for Healthcare Improvement (IHI).19 This IHI Outpatient Adverse Event Trigger Tool (IHI Tool) was developed by experts at the IHI using malpractice claims data and outlines eleven triggers to identify patient records at risk for an AE.19 The list of triggers can be found in Table 1. This tool was chosen because it has been previously described as particularly relevant to the outpatient and primary care setting and it represents an established list of predefined triggers.20
The tool developed by the IHI has been reportedly validated in outpatients to detect any AEs, preventable or not.19 Although there are reasons why assessing preventability can be problematic21, pAEs and medical errors are opportunities for many safety interventions22 and are often the most easily accepted improvement opportunities by providers due to their intuitive appeal as targets for improvement. We identified 5 systematic reviews that focused on safety events in the outpatient setting.15,23⇓⇓–26 However, none of these focused on the accuracy of tools in identifying pAEs.
Therefore, the goal of this systematic review was to determine the accuracy of each of the components of the IHI Tool, alone or in combination, in identifying pAEs in the outpatient primary care setting.
Methods
This review was conducted by following the Preferred Reporting for Systematic Reviews and Meta-Analyses (PRISMA) Guidelines.27 (online Appendix 1) A master's prepared medical librarian (BH) conducted a comprehensive literature search for English-language articles published on the use of a trigger tool to identify AEs in the outpatient setting. Dates searched were 1946 to Present in Ovid MEDLINE and Ovid MEDLINE In-Process & Other Non-Indexed Citations.
An initial search was run in December 2015. Following discussions among team members, the search was modified several times, with the final search strategy finalized and performed in February 2016. The search was updated in February 2017. We chose relevant controlled vocabulary (Medical Subject Headings and keywords) to capture the concepts of the outpatient setting and AEs. The search for these 2 concepts yielded 6435 articles. (Full search strategy available in online Appendix 2).
All article titles and abstracts were independently reviewed for inclusion by at least 2 trained reviewers (JD, NH, KB). If either reviewer selected a reference, they ordered the full text for further review. Using this strategy, 158 articles were obtained. The percent agreement on initial independent selection of articles for further review was 97.4%. Interrater reliability using Cohen's Kappa was κ = 0.34 (95% CI, 0.26 to 0.41). To identify other relevant articles, the reference sections of all included articles were checked by one of the authors (JD or NH). As we only included primary research studies in our systematic review, we checked reference sections of systematic reviews identified during the search for potentially relevant primary studies.
Inclusion and Exclusion Criteria
At the outset, we developed a comprehensive systematic review protocol, including operational definitions, inclusion and exclusion criteria, and search strategy details. Our operational definition of a trigger was “a signal for detecting likely AEs.”28 Figure 1 illustrates the components of patient safety that factor into deliverables of medical care. A priori, we determined that although the IHI Outpatient Trigger Tool was designed to detect any AEs, the focus of our systematic review would be on pAEs to have a more clinically relevant impact. Because the reporting and assessment of AEs is highly variable, we determined that pAEs detected by any means in the outpatient setting would be eligible for inclusion. Preventability was assessed using the original description in the article and whether the study assessed for preventability.
Articles meeting all 3 of the following criteria were eligible for review: research that used any of the triggers identified in the IHI's Tool19, assessed pAEs by any means, occurred primarily in the primary care outpatient setting. There were a number of exclusion criteria, including articles that were non-English language, did not use outpatient data, did not assess preventability of AEs, greater than 50% pediatric patients, specialty-focused, fewer than 10 patients or case reports, or no primary data (reviews, systematic reviews, editorials, newsletters).
Abstraction Process
The team used an iterative process to develop and pilot test an abstraction form designed to confirm final eligibility for full review, assess article characteristics, and extract data relevant to the study. Each article was independently abstracted by 2 trained reviewers (JD and NH). These 2 reviewers along with 2 authors who were not involved in the abstraction process (HBF and MS) discussed and combined the 2 summaries into a final version. All abstraction disagreements were minor and were resolved during discussions among all 4 reviewers. More than half of these were typographic or human error on the part of the reviewer. On rereading the article, a clear “correct” answer was found and agreed on.
Quality Assessment
Systematic review guidelines recommend the use of a measure of risk of bias within studies.27 Heterogeneity among studies, including disparate methods and training of reviewers has been cited as a reason to forego quality assessment altogether.28 In addition, there is no accepted standard for quality assessment of patient safety studies of this type. We used a tool from the National Heart, Lung, and Blood Institute (NHLBI) to assess the quality of the articles included in our review.29 It should be noted, though, that this tool is for use with observational studies in general, and not patient safety articles.29 We therefore modified the tool and categorized the trigger as the exposure and the pAE as the outcome. As recommended by the developers of the tool, the results were not tallied, but the interpretation of the results and final categorizations of articles was based on the perceived overall risk of bias. According to statements from the Standards for Reporting of Diagnostic Accuracy Studies (STARD), scales that numerically summarize multiple components into a single number are misleading and unhelpful.30 Thus, the authors applied the NHLBI Quality Assessment Tool and the STARD tool to our selected group of 15 articles to help characterize quality.29,30 To make comparisons between the 2 tools comparable, the STARD checklist was transformed into a rating of good, fair, and poor using a cutoff of >75% of applicable checklist items as a cutoff for good, and less than 60% for poor. No outliers were identified and all 15 were included in our review. This was done independently by 2 authors who met and agreed on categorization by consensus with little disagreement. Given the heterogeneity expected, it was determined that a funnel plot or other assessment of publication bias would be impossible.
Results
Our initial search strategy identified 6,435 articles (Figure 2). After our initial review of titles and abstracts, we requested 158 full-text articles. Of these full-text articles, 10 fulfilled inclusion and exclusion criteria and are included in this review. One review article was published in English but cited a study originally published in French.31 We did obtain the original French study32 and had it translated by a native French speaker to ensure the data in the English version was an accurate representation of the original study. In addition, reviews of reference sections of included articles and relevant systematic reviews identified 4 new articles17,33⇓⇓–36 for inclusion. Therefore, 15 studies with a total of 278,212 patients and 126,197 incidents of positive triggers were included in this review.17,20,31,33⇓⇓⇓⇓⇓⇓⇓⇓⇓⇓⇓–45
Of the 15 included studies, 9 were conducted in the United States, 3 in Scotland, and 1 each in Brazil, France, and Australia (Table 2). Eight studies were published since 2010 and 2 were published before 2000. Most were retrospective, but 3 were prospective. Most studies involved only 1 site, but the number ranged from 1 to 40.
Adverse Events
Only 1 study evaluated a control group, using a case control methodology and a random selection of patients without a trigger.38 Therefore, the quantification of the accuracy of the triggers is limited to the positive predictive value (PPV). The value ranged widely from 0.05% for drug laboratory rules39 to 92% for hospitalization for new rectal cancer.42 The PPV of polypharmacy ranged from 19.5% to 29.3% (depending on the number of medications used to define polypharmacy), for laboratory abnormalities ranged from 0.05%39 to 30.6%36, for hospital admissions ranged from 1.8%33 to 9%37, and for combined tools ranged from 1.8%33 to 24%.36 International normalized ratio (INR), in general, had a particularly high detection rate of adverse drug event (ADEs), as high as 96% in 1 study.38
The overall detection of AEs in the samples studied varied widely, from 1.4%39 to 14.6%38 (Table 2). In the 2 studies that examined fixable (ameliorable) events, all the AEs that were not preventable were ameliorable.38,44 The most commonly studied pAEs were medication related (preventable adverse drug events).33,36,38⇓⇓–41,44 Preventable or unplanned hospital admission was the second most commonly studied type of pAE.42,43,45 Three studies looked at any type of pAE identified in the chart.20,31,34,35,37
The most commonly used method to identify pAEs was chart review.20,31,33⇓⇓⇓⇓⇓⇓⇓⇓⇓⇓–44 Four studies used physicians only as reviewers35,37,38,40, 3 used a combination of physicians and pharmacists.33,36,41 1 used nurses34, 1 used pharmacists44, and 1 used a combination of physicians and nurses20, 5 studies used more than 1 reviewer and all these used at least 2 independent reviewers.20,33,37⇓–39 Interrater reliability of chart review was assessed in 4 studies33,37,40,44 and ranged from a Cohen's kappa of 0.3337 to 0.89.44 Bigby et al37 showed an intrarater reliability from 0.34 to .75 (P < .05). All 3 studies showed that interrater agreement on presence of an AE was higher than agreement on preventability/causation. Besides chart review, the other methods to identify pAEs included information from a research database43 and public data for the National Health System.45
Triggers
The incidence of triggers in a random cohort of charts ranged from 45.6% (507/1111)42 to 1 study that had 1342 triggers in 622 episodes of care38 (Table 2). Several studies cited the same patients having multiple triggers within a given period of review. Honigman et al39 noted that “The relationship between computer-identified incidents and ADEs was often ‘many to 1′.” Across all studies, most of the triggers were identified by either computerized notification (8/14; 57.1%) or manual record review (6/14; 42.9%). One study also interviewed patients, in addition to manually obtaining lab values from the primary care physician.44
Several triggers in the IHI trigger tool were not used in any of the included studies: nursing home placement, surgical procedure, physician change, and complaint letter. Several studies assessed more than 1 of the triggers at the same time, but none of the studies used all of them. The most IHI triggers used in a study was 5. Eight studies17,20,33,36,39,40,42,44 paired triggers from the IHI tool with other triggers, such as free text searches of charts33,39,40, new allergy or “high-priority” codes20,35,40, ICD-9 codes39,40, antidote administration33,36, symptom review44, and incident reporting.33
Triggers that were studied individually included lab abnormality38⇓–40,44, polypharmacy43,45, and hospital admission.31,37,42 The definition of polypharmacy provided by the IHI tool is >5 medications; however, studies in this review used numbers from 4 to 12.43,45 Lab abnormalities evaluated across all studies included international normalized ratio (INR),33,34,36,38⇓⇓–41 creatinine17,33⇓⇓–36,38⇓⇓–41, hemoglobin17,33⇓⇓–36,40, potassium,33,36,39,40 aminotransferase enzymes33,35,37, eosinophilia33,35,39, platelet count33,36,38, toxic drug levels33,36,40, bilirubin36,40, blood urea nitrogen36,38, hemoglobin A1c33,36, thyroid-stimulating hormone (TSH)36,38, positive C. difficile33,36, and white blood cell count.40 Brenner et al37 found no patients with an undetectable TSH while on levothyroxine and Singh et al17 found that toxic drug levels, hemoglobin, platelet count, eosinophilia, potassium, bilirubin, alkaline phosphatase, hemoglobin A1c, and C. difficile had incredibly low PPVs.
Related Findings
Several studies had findings that were related to our research question and would be interesting to those interested in studying triggers further. Elderly patients were often the focus of many studies17,33⇓⇓–36,43,44, and 4 studies found that older patients were more likely to have pAEs.20,37,43,45 One study that utilized 2 sites found that while implementation at 2 sites were successful, there were variable rates in success of using abnormal laboratory values in identifying pAEs across 2 sites.36 Bigby et al36 found that 9% of emergency admissions were preventable: 6.8% due to a combination of iatrogenic and patient factors, and 2.2% due exclusively to patient factors, for example, noncompliance. Brenner et al37 noted “that most ADEs [not only preventable ones] occurred during the self-management and monitoring stages of medication use, rather than being prescribing or dispensing errors”; however, Singh et al17 noted that preventable ADEs “most commonly originated during the prescribing or administration of medications.” Payne et al44 found that number of conditions/diagnoses was more predictive of preventable hospitalization than number of medications.
Of the studies, 4 reported average time to collect data20,34,36,42, and none reported cost involved in detecting either the triggers or both the pAEs and the triggers. In 1 study by Macnee et al41, the initial time investment from information technology to establish a computerized tool to detect hospital admissions for cancer ranged from “a few hours to 2 days” across 5 clinical sites. The maximum time reported to review the outpatient records for pAEs of about 20 minutes per record.35 One study reported that the “duration of time taken was ok,”17 while another reported, “…the low PPVs suggest that extensive investigator time will be required to use any of the sources investigated.”33
Quality Assessment
Overall, quality assessment showed a moderate risk of bias. Using the NHLBI tool, we rated 4 articles (27%) as “Good,”20,38,40,45 6 (40%) as “Fair”33,35,36,39,43,44, and 5 (33%) as “Poor.”17,31,37,41,42 Using the STARD tool, we rated 4 articles (27%) as “good”33,36,38,40, 6 articles (40%) as “fair”20,35,36,39,42⇓–44, and 5 articles (33%) as “poor.”31,37,41,43,45 Percent agreement among reviewers was 84%. Complete details are available from the authors on request.
Discussion
An ideal trigger tool would efficiently detect, monitor, and measure harm and identify AEs. As studies have shown, trigger tools can identify a number of different levels of harm and are tailored to health care organizations and health professionals that can effectively use the tools in conjunction with other patient safety interventions. The IHI Global Trigger tool is among the more developed, recommended tools for health organizations. The tool was designed to help health organizations implement quality improvement approaches and complement other information sources regarding potential patient harm. Moreover, the ideal tool (or system of tools) would be able to detect AEs in advance thereby preventing harm.
Summary of Findings
Our study identified 15 articles that assessed the accuracy of any of the triggers in the IHI Tool in identifying pAEs in the general outpatient setting. The studies had a moderate risk of bias according to the NHLBI tool. The outcome measurements were remarkably heterogeneous and precluded our ability to quantitatively compare the studies. Our data suggest that accurate trigger tools remain elusive, primarily because of the high number of false positives detected with current tools. Certain items from the IHI tool had no studies on their accuracy in the primary care setting and deserve specific further research: nursing home placement, surgical procedure, change of physician, and patient complaint letter.
In another recent systematic review, the IHI Global Trigger Tool had only 4 studies conducted in the outpatient setting.21 Despite this, the role of outpatient medical care in patient safety is being increasingly recognized as equally if not more important than that of inpatient medicine.3,4 A different approach to identifying AEs in the outpatient setting is needed. Inpatient care can use a single admission as representative of an entire episode, whereas outpatient care is a time-limited representation of a continuum of care, much of which is unobserved. Identifying AEs in the outpatient setting will require a combination of robust identification mechanisms such as trigger tools, scoring systems, chart reviews, and perhaps patient reporting.
Comparison of Other Review Studies
Robust outpatient safety programs will rest on such mechanisms but will need to address the barriers of time and cost.45 Very few studies included in our study evaluated time or cost of implementing triggers. A successful trigger tool must add value and efficiency in identifying patients or charts that are a high risk of AEs. Preventable AEs are those that are currently actionable for improvement among family medicine clinicians. A recent systematic review summarized the rate of AEs in the outpatient setting and identified an overall median AE rate of 4% (range, <1% to 24%) in over 100 primary studies.25 A quality trigger tool should identify a significantly higher percentage of AEs than those that could be identified by random chart review. In addition, random chart review would undoubtedly miss relevant and preventable AEs. In our included studies, conclusions on the use of trigger tools for identifying pAEs varied. For example, DeWet et al20 stated that “the tool may have greater utility as a research rather than an audit technique”; however, Macnee et al41 stated that their approach “may be 1 of the most effective, low-cost methods of identifying critical occurrences in ambulatory care.” Indeed, further work on improving the identification of AEs, including pAEs, is desperately needed. A systemic review by Hatoun et al23 notes that “…trigger events are used increasingly in the inpatient setting, and although others now exist for ambulatory surgery, few exist for ambulatory primary care.” To better understand the value of trigger tools, it will be important for future research to use control groups (like voluntary reported and random chart review) and to evaluate the time and cost necessary to identify triggers. Some combination of triggers and chart review and patient input in a standardized, multi-step process may also improve the accuracy of identification of pAEs.
Implications
This study was limited to the identification of pAEs to be more relevant to real practitioners looking to improve the safety of real practice. However, the concept of preventability is not well defined, let alone practical, for research purposes. There is often confusion among both health-care practitioners and researchers about preventability. The determination of preventability and the value of pAEs versus AEs as a quality measure is a hotly debated topic in patient safety.47 Proponents of preventability assessment argue that the area for the greatest current potential improvement is in pAEs and that nonpreventable AEs are of little value. They also argue that the use of overall AEs as a measure of safety seems to overestimate the number of safety events that are preventable versus those that are inherent (as in the IOM reports).1,48,49 pAEs are also the easiest errors for those who are unfamiliar with patient safety terminology to identify as areas for improvement. Those against the use of preventability measures cite that its assessment is unreliable50⇓–52, the definition of preventable changes as technology and research evolve, most AEs are preventable53, and determinations of preventability fail to take into account the existing disease and surrounding circumstances.48 To that end, our results do support the notion that reliability of preventability assessment is lower than that of AE identification, although the interrater reliability for both varied widely.
Limitations
This review has limitations. First, we chose to focus our search on the triggers listed in the IHI Tool rather than a more comprehensive list. Various definitions of triggers and various methods of identifying pAEs limits the conclusions that can be drawn from this analysis, given that results could have been missed or misclassified. The possibility of missing potentially relevant articles always exists, although we used a robust search strategy, multiple databases, an a priori protocol, and a trained medical librarian to attempt to mitigate this, although publication bias is likely to be especially important as much of this work may be conducted under the banner of quality improvement. In addition, the quality, limited reporting, and heterogeneity of the original studies precluded formal meta-analysis and limited quality assessment. Finally, no minimum standards of effectiveness for trigger tools are established. Our authors agreed that a tool should identify more events (perhaps twice as many) as random chart review. Yet feasibility, a more subjective measure, matters a great deal and feasibility would depend on who was trying to use the tool and for what purpose.
Conclusion
Given the limited data available, it is premature to endorse the universal use of the IHI Tool to identify outpatient pAEs. This work highlights a starting point for future research on the topic of patient safety in the outpatient primary care setting. Greater emphasis on patient safety research will help direct health care providers to consider the use of triggers to reduce barriers of cost and time associated with a large number of random chart reviews. An increased focus on identification of pAEs in the outpatient setting is urgently needed to improve patient safety.
Acknowledgments
The authors acknowledge the work of Kameron Brown for his help in obtaining and screening articles for this study.
Notes
This article was externally peer reviewed.
Funding: Work supported by an Institutional Development Award (IDeA) from the National Institute of General Medical Sciences of the National Institutes of Health under grant number U54-GM104941 (PI: Binder-Macleod).
Conflict of interest: JD, HBF, and MS have received an honorarium from the American Academy of Family Physicians to write a review on patient safety.
To see this article online, please go to: http://jabfm.org/content/31/1/113.full.
- Received for publication June 26, 2017.
- Revision received September 14, 2017.
- Accepted for publication September 19, 2017.