Tuesday, November 5, 2024

Accuracy of alvarado score and AUS in acute appendicitis | OAEM

Must read

Introduction

The estimated lifetime risk of acute appendicitis is 7–8% which renders it one of the most commonly encountered surgical emergencies and the most common indication of emergent surgery worldwide.1 Notwithstanding its high incidence, acute appendicitis poses a diagnostic challenge with the symptomatic overlap with other causes of acute abdomen. Moreover, the limited sensitivity and specificity of the clinical findings further complicate the diagnostic accuracy of acute appendicitis, which requires early recognition as the delayed diagnosis has been linked to increased mortality and morbidity, particularly among children and older individuals.2,3

To aid in the diagnosis, several scoring systems have been developed to estimate the probability of acute appendicitis. These include appendicitis inflammatory response (AIR), Raja Isteri Pengiran Anak Saleha Appendicitis (RIPASA), or Alvarado’s score.4 Despite the limitations that entailed these scoring systems, they have been shown to decrease imaging, hospitalizations, and false-positive appendectomies.5 Alvarado’s score remains the most commonly used,4 and was implemented in the current study.

Prior to the implementation of imaging, post-operative negative appendectomies had a prevalence exceeding 20%, which was deemed acceptable to minimize perforation risk.6–8 Several reports have shown higher diagnostic accuracy with modalities including abdominal ultrasonography (US), computed tomography (CT), and magnetic resonance imaging (MRI) in evaluating acute appendicitis.9,10 Nevertheless, the use of CT or MRI might be limited by availability, affordability, or contrast exposure.11 Abdominal US has been proposed as a method for detecting acute appendicitis. The lack of radiation exposure, widespread availability, and cost-effectiveness are significant advantages of US over other modalities.12 However, the diagnostic performance of abdominal US shows variable outcomes across diverse studies, with sensitivity ranging from 44% to 100% and specificity from 47% to 99% in diagnosing acute appendicitis.13 This variability is partly attributed to factors such as operator proficiency, patient adiposity, and the presence of intestinal gaseous contents.13 While delayed recognition or missed diagnosis of acute appendicitis can lead to suboptimal patient care, higher complication rates, and legal claims, overdiagnosis can risk patients undergoing unnecessary surgical intervention. This study aims to investigate the diagnostic accuracy of abdominal US and the Alvarado score in diagnosing acute appendicitis in a resource-limited setting to reduce negative appendectomy rates.

Materials and Methods

Study Design and Inclusion Criteria

From January 2021 to July 2022, a retrospective cross-sectional study was conducted at Al-Thawra Hospital in Yemen on individuals (greater than 13 years) who arrived at the emergency room with acute abdominal pain and suspected acute appendicitis based on clinical suspicion and/or abdominal US. Among 1021 cases who visited our emergency department with chief complaints of abdominal pain, 171 cases were suspected of appendicitis based on clinical suspicion and/or abdominal US and were included in our analysis. Ethical approval for the study was obtained from the ethics committees of IBB University (ID: IBBUNI.AC.YEM.2022.119), and we conducted the study following the Helsinki Declaration.

Exclusion Criteria

Patients with no documented Alvarado score or who did not have an abdominal ultrasound, pregnant patients, patients aged less than 13 years, patients taking nonsteroidal anti-inflammatory drugs, and those who had been treated recently for urinary tract infection or pelvic inflammatory disease were excluded.

Study Protocol and Outcome

The patients’ demographics, including age, gender, residency, body mass index, past medical history, physical examination findings, and laboratory data (complete blood count, blood sugar, viral markers, liver function test, and renal function test), were collected along with corresponding Alvarado’s scores (Table 1). The duration of symptoms was calculated from the onset of symptoms to hospitalization. Regarding the abdominal US, experienced radiologists performed all abdominal ultrasonography utilizing a graded compression approach and a 5.0-MHz linear array transducer. Positive US results for appendicitis were defined as the presence of an enlarged non-compressible appendix with an outer wall diameter greater than 6 mm, a complex mass, or an appendicolith. Reports lacking these suggestive findings or including acute appendicitis solely as a potential differential diagnosis without a higher probability were deemed negative for appendicitis. These definitions were implemented from prior studies.14,15 Among patients who underwent appendectomy, a histopathological examination of acute appendicitis was considered confirmatory, and it was associated with individual Alvarado’s scores and US findings. Patients who did not undergo surgery, based on low Alvarado score probability or lack of suggestive abdominal US findings were followed for one week and 1 month after their discharge from the hospital.15

Table 1 Alvarado Score

Statistical Analysis

The categorical variables were described using frequencies and percentages. For continuous variables, the mean ±SD was determined. Independent t-tests were conducted for normal data, whereas the Mann–Whitney test was employed for non-normal data. The sensitivity, specificity, accuracy, positive predictive value (PPV), and negative predictive value (NPV) of US and Alvarado’s score in determining acute appendicitis were analyzed using respective formulas and the area under the receiver operating characteristic (ROC) curve, and the appropriate cut-off for Alvarado’s scores was presented separately. A p-value

Results

Among 1021 patients who presented with acute abdomen, 171 cases were provisionally diagnosed as acute appendicitis. Of these, 135 were identified by Abdominal US as positive for appendicitis, while 36 were deemed negative (2 of these were incorrectly classified as negative by Abdominal US). A total of 137 patients underwent open appendectomy, with 130 (94.9%) having positive intraoperative and histopathological Results, and 7 (5.1%) being diagnosed as negative. The main age of patients was 24.9 ±11.5 (range 18–72 years). Most of the cases were male (71.9%) and live in rural areas (52.6%). Smoking and khat chewing were reported in 30 (17.5%) and 122 (71.3%) of the cases respectively (Table 2).

Table 2 Demographic and Clinical Information of Participants

Alvarado Score Result

The mean Alvarado’s score was 6.9 ±2.4, with 57.9% of patients having Alvarado’s score of ≥7. According to a cutoff point of 6, the sensitivity of Alvarado’s score in detecting appendicitis was 94.62%, with a specificity of 87.80%. The PPV, NPV, and Area under the curve (AUC) were determined to be 96.09%, 83.72%, and 0.938, CI: 0.954 to 0.998, respectively (recommended for “ruling in” appendicitis and progression to surgery). According to a cutoff point of 5, the sensitivity of Alvarado’s score in detecting appendicitis was 98.46%, with a specificity of 82.93% (ruling out’ admission for appendicitis). The PPV and NPV were determined to be 94.81%, 94.44%, and 0.938, respectively.

The ROC curve for Alvarado’s score showed significant AUC values for detecting acute appendicitis and was statistically significant [95% Confidence interval (CI): 0.891 to 0.969; pFigure 1A).

Figure 1 Sensitivity, Specificity, and AUC of Alvarado Score in subfigure; the Area under the curve (AUC) for the ROC curve of Alvarado Score was 0.985 with a 95% confidence interval (CI), 0.954 to 0.998, and was statistically significant (p A) and abdominal ultrasound; The Area under the curve (AUC) for the ROC curve of Abdominal US was 0.907 with a 95% CI, 0.853 to 0.946, and was statistically significant (p B).

Ultrasound Results

Based on the pre-specified criteria, 135 patients were classified by abdominal US as positive for appendicitis, and 36 patients were considered negative (of them 2 cases were incorrectly negative by abdominal US). Open appendectomy was performed in 137 patients, of which 130 (94.9%) had positive intraoperative findings and histopathological results while 7 (5.1%) patients were diagnosed negative (Table 3). The sensitivity of abdominal US in detecting appendicitis was 98.46%, with a specificity of 82.93%. The PPV, NPV, and AUC were determined to be 94.81%, 94.44%, and 0.907, respectively. The ROC curve for the abdominal US showed significant AUC values for detecting acute appendicitis and was statistically significant (95% CI: 0.853 to 0.946; p Figure 1B).

Table 3 Total Diagnosed Cases by Abdominal Ultrasound

Discussion

The reliance on patients’ clinical assessment for diagnosing acute appendicitis may result in delayed recognition or intervention.16 Classic symptoms, such as periumbilical pain with migration or shifting to the right lower quadrant, are reported only in about 50% of patients. These symptoms have similar or even lower sensitivity and specificity compared to several historic physical signs of appendicitis.6,7 Furthermore, the practice of mandatory surgical exploration for suspected appendicitis has led to a heightened incidence of negative appendectomies, with rates considered acceptable up to 25% in the general population and up to 50% during pregnancy.6,7 Nevertheless, retrospective reports have shown increased morbidity and mortality with negative appendectomies, making such practices less favorable.17,18

In this study, we evaluated the role of abdominal US and Alvarado score in establishing the diagnosis of acute appendicitis. This approach was chosen due to the affordability and availability of US in our setting. Notably, abdominal US showed a high sensitivity of 98.46% in detecting appendicitis, with a specificity of 82.93%. Its PPV, NPV, and AUC were 94.81%, 94.44%, and 0.907. There is a notable variation in the sensitivity and specificity of abdominal US reported in the literature. For instance, Hosseini et al reported a sensitivity and specificity of 58% and 68%, respectively.16 Conversely, Abu-Yousef et al reported higher sensitivity (80%) and specificity (95%).14 These findings, when taken together, might indicate methodological differences attributed to study settings and radiological expertise. While pooling studies were thought to provide more precise findings, there is notable heterogeneity in the literature that might limit this effect. For instance, a meta-analysis by Orr et al showed a sensitivity and specificity of 84.7% and 92.1%, respectively. Nevertheless, US performance was noted to be related to the pre-test probability of acute appendicitis, with a high PPV among patients with a high likelihood of having acute appendicitis. The authors concluded that US should not be used to exclude appendicitis given the poor NPV.10 Conversely, a recent meta-analysis that included 21 studies reported a pooled sensitivity and specificity of 81% and 87%, respectively.19 While the subgroup analysis was similar, there was noted heterogeneity and a risk for publication bias.

Understandably, the clinician’s expertise might be a significant factor in the study outcome, which might be difficult to control with cases that require urgent evaluation as acute appendicitis. This was particularly evident in Hosseini et al and Pinto et al which showed lower US sensitivity and specificity based on the operator performance.13,16

Scoring systems for acute appendicitis are of long use and have been utilized for diagnostic and risk stratification purposes. Several scores have been introduced, including appendicitis inflammatory response (AIR), and Raja Isteri Pengiran Anak Saleha Appendicitis (RIPASA).4 Nevertheless, the Alvarado Score is the most extensively studied and utilized in clinical practice. This score is primarily based on symptoms, clinical finding, and laboratory results, offering a considerably sensitive method for identifying individuals with probable acute appendicitis.20 In this study, Alvarado’s score has a sensitivity of 94.62% in detecting appendicitis, with a specificity of 87.80%. Its PPV, NPV, and AUC values are 96.09%, 83.72%, and 0.938, respectively. The ROC curve for Alvarado’s score shows significant AUC values for detecting acute appendicitis. Our inclusion of Alvarado’s score in this study was based on the remarkably high sensitivity and specificity reported in the literature.21 Indeed, consensus from international guidelines has recommended the use of Alvarado’s score in the diagnostic evaluation of acute appendicitis.12,22 A meta-analysis by Gupta et al demonstrated a high predictability of the Alvarado score in acute appendicitis.23 Our findings align with the study by Kanumba et al, which reported Alvarado score sensitivity, specificity, PPV, NPV, and accuracy of 94.1%, 90.4%, 95.2%, and 88.4%, respectively.24 In the study by Memon et al, the Alvarado scoring system exhibited high sensitivity and specificity at 93.5% and 80.6%, with PPV and NPV at 92.3% and 83.3%, respectively, and an accuracy of 89.8%.25 Notwithstanding the benefits of the Alvarado score in limiting the overutilization of imaging modalities, especially in limited-resource settings, these remarkable findings need to be approached with caution. There is noticeable variability in the cutoff used to indicate significant sensitivity and specificity of the Alvarado score.26 Furthermore, there is a low reproducibility rate of several parameters used in the Alvarado score that might limit the certainty of the evaluation.4 Nevertheless, the role of the Alvarado score might be significant in ruling out the diagnosis of acute appendicitis with a low cutoff, a conclusion endorsed in a meta-analysis by Ohle et al.27 The authors, however, reported an overestimation of the Alvarado score among female patients. In a recent clinical trial by Noori et al, the significance of high clinical scores in warranting surgical intervention was underscored.28 Interestingly, clinical scores performance exceeded the performance of the US in our study.

While the Alvarado score can provide substantial diagnostic value, it is crucial to recognize its limitations. For instance, several reports have shown suboptimal performance in the pediatric age group. In a recent meta-analysis of 26 studies involving 5985 children from 11 countries, the Alvarado score had a combined sensitivity of 76.0% and a combined specificity of 71.0% for diagnosing acute appendicitis in children, while the modified Alvarado score had a combined sensitivity of 87.0% and a combined specificity of 47.0%.29 Additionally, the Alvarado score has been noted to overestimate the probability of appendicitis in females,27 and poorly identify complicated appendicitis in the elderly population.30 Further studies are needed to address these drawbacks and develop a possible calibration that accounts for patients’ gender and age.

This study has several limitations attributed to its retrospective design and the small sample size, which may introduce confounding biases. In addition, the lack of a control group limits the generalizability of our findings. Furthermore, the reliance on medical records may further introduce the risk of selection and misclassification biases, which influence documentation quality.

Conclusion

Alvarado’s score and abdominal ultrasound exhibited high sensitivity and specificity in diagnosing acute appendicitis. The substantial accuracy and efficacy of both the Alvarado score and abdominal ultrasound support their utilization as primary investigative tools in resource-limited settings. This approach can help avoid unnecessary appendectomies and minimize the financial burden on patients.

Data Sharing Statement

The datasets used and analyzed during the current study are available from the corresponding author (Faisal Ahmed) upon reasonable request.

Ethics Approval and Consent to Participate

The Ibb University’s ethics committee approved the study protocol and all patients provided informed consent before enrolment and the study complies with the Declaration of Helsinki.

Consent for Publication

All authors reviewed the manuscript and approved its submission.

Author Contributions

All authors made a significant contribution to the work reported, whether that is in the conception, study design, execution, acquisition of data, analysis, and interpretation, or all these areas; took part in drafting, revising, or critically reviewing the article; gave final approval of the version to be published; have agreed on the journal to which the article has been submitted; and agree to be accountable for all aspects of the work.

Disclosure

The authors declare that they have no competing interests in this work.

References

1. Dagne H, Abebaw TA. Characteristics of patients presented with complicated appendicitis in adama, Ethiopia: a cross-sectional study. Open Access Emerg Med. 2022;14:573–580. doi:10.2147/OAEM.S383550

2. Kim K, Rhee JE, Lee CC, et al. Impact of helical computed tomography in clinically evident appendicitis. Emerg Med J. 2008;25(8):477–481. doi:10.1136/emj.2006.044552

3. Giljaca V, Nadarevic T, Poropat G, Nadarevic VS, Stimac D. Diagnostic accuracy of abdominal ultrasound for diagnosis of acute appendicitis: systematic review and meta-analysis. World J Surg. 2017;41(3):693–700. doi:10.1007/s00268-016-3792-7

4. Kharbanda AB. Appendicitis: do clinical scores matter? Ann Emergency Med. 2014;64(4):373–375. doi:10.1016/j.annemergmed.2014.05.002

5. Podda M, Pisanu A, Sartelli M, et al. Diagnosis of acute appendicitis based on clinical scores: is it a myth or reality? Acta Biomed. 2021;92(4):e2021231. doi:10.23750/abm.v92i4.11666

6. Noureldin K, Hatim Ali AA, Issa M, Shah H, Ayantunde B, Ayantunde A. Negative appendicectomy rate: incidence and predictors. Cureus. 2022;14(1):e21489. doi:10.7759/cureus.21489

7. Hatem F, Baig H, Khaldas F, Lucocq J. Negative appendicectomy rates in females of childbearing age: a retrospective analysis and literature review. Cureus. 2022;14(7):e27412. doi:10.7759/cureus.27412

8. Flum DR, Morris A, Koepsell T, Dellinger EP. Has misdiagnosis of appendicitis decreased over time? A population-based analysis. JAMA. 2001;286(14):1748–1753. doi:10.1001/jama.286.14.1748

9. Lietzén E, Salminen P, Rinta-Kiikka I, et al. The accuracy of the computed tomography diagnosis of acute appendicitis: does the experience of the radiologist matter? Scandinavian J Surg. 2018;107(1):43–47. doi:10.1177/1457496917731189

10. Orr RK, Porter D, Hartman D. Ultrasonography to evaluate adults for appendicitis: decision making based on meta-analysis and probabilistic reasoning. Acad Emerg Med. 1995;2(7):644–650. doi:10.1111/j.1553-2712.1995.tb03606.x

11. D’Souza N, Hicks G, Beable R, Higginson A, Rud B. Magnetic resonance imaging (MRI) for diagnosis of acute appendicitis. Cochrane Database Syst Rev. 2021;12(12):Cd012028. doi:10.1002/14651858.CD012028.pub2

12. Gorter RR, Eker HH, Gorter-Stam MA, et al. Diagnosis and management of acute appendicitis. EAES consensus development conference 2015. Surg Endosc. 2016;30(11):4668–4690. doi:10.1007/s00464-016-5245-7

13. Pinto F, Pinto A, Russo A, et al. Accuracy of ultrasonography in the diagnosis of acute appendicitis in adult patients: review of the literature. Crit Ultrasound J. 2013;5(Suppl 1):S2. doi:10.1186/2036-7902-5-S1-S2

14. Abu-Yousef MM, Bleicher JJ, Maher JW, Urdaneta LF, Franken EA, Metcalf AM. High-resolution sonography of acute appendicitis. AJR Am J Roentgenol. 1987;149(1):53–58. doi:10.2214/ajr.149.1.53

15. Bhardwaj P, Behura A, Mohapatra I, et al. Assessment of the Diagnostic Reliability of Modified Alvarado Scores and Abdominal Ultrasonography in Acute Appendicitis. Cureus. 2023;15(5):e38991. doi:10.7759/cureus.38991

16. Hosseini A, Omidian J, Nazarzadeh R. Investigating diagnostic value of ultrasonography in acute appendicitis. Adv Biomed Res. 2018;7(1):113. doi:10.4103/abr.abr_79_18

17. Tamini N, Santurro L, Chiappetta MF, et al. Morbidity after negative appendectomy: a single-centre experience on 627 cases. Eur J Trauma Emerg Surg. 2020;46(4):859–864. doi:10.1007/s00068-019-01138-w

18. Gelpke K, Hamminga JTH, van Bastelaar JJ, et al. Reducing the negative appendectomy rate with the laparoscopic appendicitis score; a multicenter prospective cohort and validation study. Int j Surg. 2020;79:257–264. doi:10.1016/j.ijsu.2020.04.041

19. Cho SU, Oh SK. Accuracy of ultrasound for the diagnosis of acute appendicitis in the emergency department: a systematic review. Medicine. 2023;102(13):e33397. doi:10.1097/MD.0000000000033397

20. Bouali M, El Berni Y, Moufakkir A, et al. Value of Alvarado scoring system in diagnosis of acute appendicitis. Ann Med Surg Lond. 2022;77:103642. doi:10.1016/j.amsu.2022.103642

21. Karami MY, Niakan H, Zadebagheri N, Mardani P, Shayan Z, Deilami I. Which one is better? Comparison of the acute inflammatory response, raja isteri Pengiran anak saleha appendicitis and Alvarado scoring systems. Ann Coloproctol. 2017;33(6):227–231. doi:10.3393/ac.2017.33.6.227

22. Howell JM, Eddy OL, Lukens TW, Thiessen ME, Weingart SD, Decker WW. Clinical policy: critical issues in the evaluation and management of emergency department patients with suspected appendicitis. Ann Emerg Med. 2010;55(1):71–116. doi:10.1016/j.annemergmed.2009.10.004

23. Gupta S, Kolli VS, Da Costa K, Javed S, Ammar A, Rasheed A. A systematic review and meta-regression for validation of the Alvarado score as a tool for predicting acute appendicitis. Ann Med Surg. 2023;85(2):111–121. doi:10.1097/MS9.0000000000000238

24. Kanumba ES, Mabula JB, Rambau P, Chalya PL. Modified Alvarado scoring system as a diagnostic tool for acute appendicitis at Bugando Medical Centre, Mwanza, Tanzania. BMC Surg. 2011;11(1):4. doi:10.1186/1471-2482-11-4

25. Memon ZA, Irfan S, Fatima K, Iqbal MS, Sami W. Acute appendicitis: diagnostic accuracy of Alvarado scoring system. Asian J Surg. 2013;36(4):144–149. doi:10.1016/j.asjsur.2013.04.004

26. Ebell MH, Shinholser J. What are the most clinically useful cutoffs for the Alvarado and pediatric appendicitis scores? A systematic review. Ann Emerg Med. 2014;64(4):365–72.e2. doi:10.1016/j.annemergmed.2014.02.025

27. Ohle R, O’Reilly F, O’Brien KK, Fahey T, Dimitrov BD. The Alvarado score for predicting acute appendicitis: a systematic review. BMC Med. 2011;9(1):139. doi:10.1186/1741-7015-9-139

28. Noori IF, Jabbar AS, Noori AF. Clinical scores (Alvarado and AIR scores) versus imaging (ultrasound and CT scan) in the diagnosis of equivocal cases of acute appendicitis: a randomized controlled study. Ann Med Surg. 2023;85(4):676–683. doi:10.1097/MS9.0000000000000270

29. Bai S, Hu S, Zhang Y, Guo S, Zhu R, Zeng J. The VALUE OF THE ALVARADO SCORE FOR THE DIAGNOSIS OF ACUTE APPENDICITIS IN CHILDREN: A SYSTEMATIC REVIEW AND META-ANALYSis. J Pediatr Surg. 2023;58(10):1886–1892. doi:10.1016/j.jpedsurg.2023.02.060

30. Deiters A, Drozd A, Parikh P, Markert R, Shim JK. Use of the Alvarado Score in Elderly Patients with Complicated and Uncomplicated Appendicitis. Am Surg. 2019;85(4):397–402. doi:10.1177/000313481908500431

Latest article