Secondary Logo

Journal Logo


Prognostic capabilities and agreement of three different scores in diagnosing appendicitis in children from a developing setting

Pinzón-Redondo, Hernando; Zarate-Vergara, Andrea; Barrios-Redondo, Katherine; Muñoz, Cesar; Guzmán, Ángel; Morales-Payares, Dorys; Alvis-Guzmán, Nelson; Paternina-Caicedo, Ángel

Author Information
Annals of Pediatric Surgery: January 2016 - Volume 12 - Issue 1 - p 5-9
doi: 10.1097/01.XPS.0000476033.40918.0f
  • Free



Appendicitis is the most common cause of abdominal surgery in children, with an estimated annual incidence rate of 26 appendectomies per 100 000 population 1. The diagnosis of appendicitis continues to be mostly based on clinical symptomatology and natural history of the disease. Several scores have been designed and validated in children to aid in management decisions for patients with appendicitis. The Pediatric Appendicitis Score (PAS) 2, the Alvarado score, the modified Alvarado score 3, the Kharbanda score 4, the Lintula score 5, and the Van den Broek scores 6 were designed for use in pediatric population, but validation has provided mixed results of their diagnostic utility 7.

The validation of all these scores had been made previously in developed settings, proving useful in some instances for decision making in children with appendicitis. However, in developing settings, where case mix and comorbidities or baseline characteristics can be different, some variables may impact the diagnostic usefulness of these scores. In children from developing countries, for example, stunting and age of presentation may impact symptom presentation, delay attention and treatment, and affect the overall score performance. In Colombia, for example, an upper-middle income country in South America, stunting frequency in children is 15% 8.

The diagnostic utility of PAS, the modified Alvarado score, and the Alvarado score, the most used and researched scores for diagnosing appendicitis, has been tested infrequently in a low-income and middle-income setting population. We therefore carried out a cohort study with the aim of testing the diagnostic capacity of these scores in diagnosing appendicitis in children from a developing country center from Colombia.


A retrospective cohort study was designed and carried out to test the diagnostic capacity of the PAS and the Alvarado score in diagnosing appendicitis in children from Cartagena (Colombia). The setting of the present assessment is Hospital Infantil Napoleón Franco Pareja (HINFP), a reference pediatric center in a city with approximately one million inhabitants.

Children presenting to the emergency department at HINFP in 2013 (1 January to 31 December), with unspecified abdominal pain suggestive of appendicitis as the main complaint, under 18 years of age and of any sex, were included in the present study.

Variable definitions

Population characteristics

Variables collected for this study were age, weight, sex, rural residence, key vital signs at admission (pulse, respiratory rate, and temperature), white blood cell count, percentage of neutrophils, hospitalization duration, and ICU stay.

Gold standard

True disease was defined as histopathology from appendectomy suggestive of appendicitis. The gold standard also included ruling out appendicitis at the time of discharge after a complete follow-up.

Pediatric Appendicitis Score

The PAS is an eight-item pediatric score for predicting appendicitis 2. The full list of symptomatology of this score is listed in Table 1.

Table 1
Table 1:
Scoring systems for evaluating appendicitis, as reported in the literature

Alvarado score

Alvarado 9 designed an eight-item pediatric score for predicting appendicitis. The full list of signs and symptoms is also listed in Table 1.

Modified Alvarado score

This score only differs from the Alvarado score in the absence of neutrophilia as a predictor of appendicitis 3.

Data collection and procedures

Health personnel in the emergency department at our institution follows specific protocols to assess patients with suspected appendicitis. At first contact with physicians, the health personnel retrospectively collected, if present, information on the following: cough, percussion, heel tapping tenderness at interquartile range (IQR)/rebound pain; anorexia; migration of pain to IQR; nausea/vomit; IQR tenderness on light palpation; white blood cell count; neutrophil count; and temperature. All data were retrospectively collected at emergency admission, including the required laboratory data, from electronic health records.

Data analysis

All analysis assumed a P-value less than 0.05 as statistically significant, and were carried out using Stata (Stata v.13; StataCorp, College Station, Texas, USA).

Categorical variables were reported as percentages. Continuous variables were reported as mean or median, depending on variable normality. Dispersion measures for continuous variables were SDs, and 25th and 75th percentiles IQR, depending on the normality or non-normality of the variable, respectively. Normality was assessed using the Shapiro–Wilk test for each continuous variable.

Analyses of categorical variables were performed with the χ2 or Fisher exact test, when appropriate. Analysis of variance or the Kruskal–Wallis rank test were used for analysis of continuous variables, according to the parameter distribution.

The sensitivity and specificity of each score value in diagnosing appendicitis were estimated through the receiver operative characteristics curve results, and 95% confidence intervals (CI) were estimated. Area under the receiver operative characteristic curve (AUC) with 95% CI was also reported. We excluded patients with missing data in any variable of the analysis.

Sensitivity analysis

To account for differences in performance related to age, a sensitivity analysis was performed including only patients older than 5 years of age. AUC with 95% CI and each score value were reported.


Patient characteristics

A total of 236 patients were admitted to HINFP with abdominal pain presumptive of appendicitis. Of them, 49 patients (20.8%) had missing data, and hence were excluded from the study. The median age of the cohort sample was 11.58 years (IQR, 8.33–13.61), and 22 of 187 (11.8%) patients were under 5 years of age. The median hospitalization time was 4 days (IQR, 3–6); six (3.2%) patients were admitted to the ICU during their first hospitalization, for a median length of ICU stay of 6 days (IQR, 4–8). The median stay in the ICU was not different in patients with appendicitis when compared with nonappendicitis patients (P=0.164).

A more comprehensive overview of sample characteristics is shown in Table 2. According to our analysis, none of the vital signs recorded at admission showed differences among appendicitis versus nonappendicitis patients (pulse, respiratory rate, and temperature).

Table 2
Table 2:
Sample characteristics of patients with suspected appendicitis at emergency admission in HINFP (Cartagena, Colombia)

Prognostic capabilities of the scores

With regard to the symptomatology and parameters of the three scores, only neutrophilia was associated with appendicitis in our sample (P=0.008). Below, a detailed overview of the prognostic ability of the scores is given. AUC was not statistically different in the three scores assessed (P=0.549) (Table 3).

Table 3
Table 3:
Signs and symptoms used in Pediatric Appendicitis Score, Alvarado score, and modified Alvarado score to diagnose appendicitis

Receiver operator curve of the Alvarado score, the modified Alvarado score, and the PAS in our setting is shown in Fig. 1. The percentage of patients per score-value in each score for patients with and without appendicitis in the sample is listed in Fig. 2, and its sensitivity and specificity in Table 4.

Fig. 1
Fig. 1:
Receiver operator curve of Alvarado score, Modified Alvarado score, and PAS in patients with suspected appendicitis at emergency admission in HINFP (Cartagena, Colombia).
Fig. 2
Fig. 2:
Percentage of patients per score-value in the Alvarado score, Modified Alvarado score, and PAS in patients with and without appendicitis in the sample.
Table 4
Table 4:
Diagnostic utility of the Alvarado score, modified Alvarado score, and Pediatric Appendicitis Score in our sample of pediatric patients, and children above 5 years of age

Pediatric Appendicitis Score

Twelve (54.6%) children without appendicitis had an Alvarado score less than 6, and 117 (71.3%) appendicitis patients had a score of 6 or greater (P=0.109). The median PAS score was 6 (IQR, 4–7) in patients without appendicitis versus 7 (IQR, 5–7) in patients with appendicitis (P=0.050). AUC was 0.628 (95% CI, 0.495–0.763).

Alvarado score

Seven (31.8%) children without appendicitis had an Alvarado score less than 7, and 93 (50.0%) appendicitis patients had a score of 7 or greater (P=0.023). The median Alvarado score in patients with appendicitis was 7 (IQR, 6–8), compared with 6 (IQR, 4–7) in patients without appendicitis (P=0.030). AUC was 0.642 (95% CI, 0.514–0.770).

Modified Alvarado score

Five (22.7%) children without appendicitis had an Alvarado score less than 7, and 55 (33.5%) appendicitis patients had a score of 7 or greater (P=0.308). The score was statistically larger (P=0.90) in patients without appendicitis (median: 5; IQR, 4–6) versus appendicitis patients (median: 6; IQR, 5–7). AUC was 0.611 (95% CI, 0.471–0.751).

Sensitivity analysis in children above 5 years of age

AUC of the three evaluated scores was very low in children above 5 years of age, and not statistically different (P=0.061). The PAS had an AUC of 0.556 (95% CI, 0.394–0.717); the Alvarado score had an AUC of 0.579 (95% CI, 0.421–0.737), and the modified Alvarado score had an AUC of 0.528 (95% CI, 0.353–0.702) in children above 5 years of age.


To our knowledge, this is one of the largest study assessing scores to diagnose appendicitis in a low-income or middle-income pediatric setting 10. In our developing setting, as other studies have shown, the scores have a relatively poor performance of the overall score.

Several studies have shown evidence that Alvarado score and the PAS 11 do not have an adequate overall accuracy for diagnosing appendicitis. This means that a binary yes/no classification is inadequate for the diagnosis. However, both scores and the modified Alvarado score may be used for risk stratification. This means that stratification in low, intermediate, and high appendicitis risk is likely to be more useful in the clinical setting. A value of 3 or less in the Alvarado and the modified Alvarado score, and a score of 2 or less in the PAS have a sensitivity of 96–97% in diagnosing appendicitis, and a value in any score of 8 or greater has a specificity between 81 and 86% for the three scores. These values, given results from our study, would prove useful in an urban developing setting around the world.

A recent meta-analysis showed in pediatric population with the Alvarado Score a sensitivity of 0.99 (95% CI, 0.83–1.00) for a cutoff value of 5, and a sensitivity of 0.99 (95% CI, 0.83–1.00) for a value of 7. Specificity was 0.81 (95% CI, 0.76–0.85) for that study at a cutoff value of 5, and 0.76 (95% CI, 0.55–0.89) for 7 10. These diagnostic values are in disagreement with the results from our study; for a value of 5 or greater, the Alvarado score had a sensitivity of 89% and specificity of 27%.

Children are a special population for the scores currently available to predict appendicitis. Current scores attempt to use clinical symptomatology to make an accurate diagnosis of this disease. However, because of unspecific clinical findings such as age, socioeconomic status, or comorbidities that may affect poor children (i.e. malnutrition), and/or the possibility of intrinsic differences with adult clinical presentation of appendicitis, the PAS was designed by Samuel in 2002 2 to overcome these shortcomings. The Alvarado and modified Alvarado scoring systems did not seem to be as accurate in diagnosing appendicitis as PAS. The sensitivity of PAS was 100% in their initial validation study, with a specificity of 92% with a cutoff value of 6 or greater.

This study has several limitations. First, the retrospective design could increase information bias. To counteract this, our data collection was carried out by highly trained personnel, and supervised by pediatric specialists who care for patients at HINFP, which use specific protocols. Moreover, selection bias may have been an issue because we retrospectively collected patients without appendectomy 7,10.

In summary, stratification of risk according to prespecified values of the Alvarado score, the modified Alvarado score, and the PAS can be useful in the pediatric clinical setting, if properly performed in the context of evidence-based medicine. No classification of yes/no according to these scores could properly diagnose appendicitis. The group with intermediate risk for appendicitis could benefit with additional diagnostic measures (i.e. ultrasonography or computed tomography scan) in the pediatric population.


Conflicts of interest

There are no conflicts of interest.


1. Addiss DG, Shaffer N, Fowler BS, Tauxe RV. The epidemiology of appendicitis and appendectomy in the United States. Am J Epidemiol 1990; 132:910–925.
2. Samuel M. Pediatric Appendicitis Score. J Pediatr Surg 2002; 37:877–881.
3. Macklin CP, Radcliffe GS, Merei JM, Stringer MD. A prospective evaluation of the modified Alvarado score for acute appendicitis in children. Ann R Coll Surg Engl 1997; 79:203–205.
4. Kharbanda AB, Taylor GA, Fishman SJ, Bachur RG. A clinical decision rule to identify children at low risk for appendicitis. Pediatrics 2005; 116:709–716.
5. Lintula H, Pesonen E, Kokki H, Vanamo K, Eskelinen M. A diagnostic score for children with suspected appendicitis. Langenbecks Arch Surg 2005; 390:164–170.
6. Van den Broek WT, van der Ende ED, Bijnen AB, Breslau PJ, Gouma DJ. Which children could benefit from additional diagnostic tools in case of suspected appendicitis? J Pediatr Surg 2004; 39:570–574.
7. Kulik DM, Uleryk EM, Maguire JL. Does this child have appendicitis? A systematic review of clinical prediction rules for children with acute abdominal pain. J Clin Epidemiol 2013; 66:95–104.
8. Larrea C, Freire W. Social inequality and child malnutrition in four Andean countries. Rev Panam Salud Publica 2002; 11 (5–6):356–364.
9. Alvarado A. A practical score for the early diagnosis of acute appendicitis. Ann Emerg Med 1986; 15:557–564.
10. Ohle R, O'Reilly F, O'Brien KK, Fahey T, Dimitrov BD. The Alvarado score for predicting acute appendicitis: a systematic review. BMC Med 2011; 9:139.
11. Schneider C, Kharbanda A, Bachur R. Evaluating appendicitis scoring systems using a prospective pediatric cohort. Ann Emerg Med 2007; 49:778–784.
© 2016 Annals of Pediatric Surgery