American Journal of Respiratory and Critical Care Medicine

The reliability of spirometry is dependent on strict quality control. We examined whether quality control criteria recommended for adults could be applied to children aged 2–5 years. Forty-two children with cystic fibrosis and 37 healthy children attempted spirometry during their first visit to our laboratory. Whereas 59 children (75%) were able to produce a technically satisfactory forced expiration lasting 0.5 second, only 46 (58%) could produce an expiration lasting 1 second, with the youngest children having the most difficulty. Start of test criteria for adults were inappropriate for this age group, with only 16 of 59 children producing a volume of back extrapolation as a proportion of forced vital capacity of less than 5%, whereas all but 4 could produce a volume of back extrapolation of 80 ml or less. More than 90% of children were able to produce a second forced vital capacity and a second forced expired volume in 0.75 second within 10% of their highest. Errors in the spirometry software resulted in inaccurate reporting of expiratory duration and inappropriate timed expired volumes in some children. We describe recommendations for modified start of test and repeatability criteria for this age group, and for improvements in software to facilitate better quality control.

Spirometry is the most frequently used method for measuring lung function. The reliability of spirometry depends on standardized methodology, particularly regarding how quickly the subject increases flow at the beginning of the expiration; sustained effort throughout the expiration; duration of the expiratory maneuver; and repeatability. Detailed criteria for data collection and interpretation have been published by the American Thoracic Society (ATS) and by the European Respiratory Society (ERS) (1, 2). Application of these criteria when collecting and interpreting spirometry data is considered mandatory in adult pulmonary function laboratories.

Spirometry is not only employed in adults; it is also commonly performed in pediatric pulmonary function laboratories. Spirometry was previously limited to school-age children (those aged 6 to 16 years), but reports have confirmed that preschool children (those aged 2 to 5 years) are also able to attempt these maneuvers (310). Both the ATS and ERS guidelines were written for adult patients, and Arets and colleagues have already demonstrated that many school-age children have difficulty meeting some of the quality control criteria (11). Reports of spirometry in preschool children have focused on feasibility (35), use of incentive devices (5), and recording of reference values in healthy populations (3, 4) rather than on the issue of quality control. We hypothesized that children aged 2 to 5 years would find it impossible to meet some of the quality control criteria for spirometry, but that alternative criteria may be feasible.

The primary aims of this study were to analyze results from spirometry performed by preschool children against quality control criteria recommended for such measures in adults, to identify which of these criteria are easily achieved by preschool children and which are not, to determine whether success in meeting these criteria is affected by the presence of lung disease or by subject age, and to identify possible modifications that could be applicable in preschool children. The secondary aim of this study was to compare results obtained from healthy children in our laboratory with those previously reported from other centers, and to assess whether reference ranges published by those centers apply to our population.

A detailed description of the study population and methods, including examples of unacceptable recordings, is presented in the online supplement.

All children were at least 2, and less than 6, years of age at the time of testing. Children with cystic fibrosis (CF) were recruited from centers participating in the London Cystic Fibrosis Collaboration, a longitudinal study of airway function in CF (12). Healthy control children were recruited from the community. All tests were performed between September 2000 and March 2003, at the Institute of Child Health (London, UK). All children were participating in a study comparing different methods of measuring airway function, and performed multiple-breath inert gas washout, specific resistance measurement by plethysmography, and spirometry, in that order. Data reported in this article are limited to spirometric measurements obtained at the child's first laboratory visit.

The study was approved by the Research Ethics Committee of the Institute of Child Health and Great Ormond Street Hospital for Children (London, UK). Written informed consent was obtained from the parents of all children entering the study.

Spirometry was performed with a Jaeger MasterScope spirometer (VIASYS Healthcare, Höchberg, Germany). The first expirations were encouraged by means of the Jaeger “Candle” incentive program, which promotes a rapid expiration. When the child had mastered rapid expiration the program was changed to display an incentive that encourages prolongation of that rapid expiration.

All volume–time and flow–volume curves were visually inspected by at least two of the authors. Recordings were passed if they met the following criteria on visual inspection: the flow–volume trace showed a rapid rise to peak flow, and a smooth descending limb, with no evidence of cough or glottic closure; and the volume–time trace was approaching a horizontal plateau (Figure 1)

. Duration of expiration was recorded and the tracing was accepted if it was 0.5 second or more. Results were reported only if two or more technically acceptable curves were obtained.

The Jaeger MasterScope program allows only five maneuvers to be recorded per test file. Additional test files were therefore created as necessary, and a maximum of 25 maneuvers was recorded. Recording was stopped earlier than this if the investigator assessed that maximal results had been obtained, or if the child was showing signs of fatigue or restlessness. Spirometry parameters were obtained by first transferring all data from the Jaeger MasterScope system to a separate PC workstation, and then exporting satisfactory results into an Access relational database (Microsoft, Redmond, WA), using a custom-written export program. This procedure was necessary because standard Jaeger software does not allow (1) reporting of all necessary parameters, or (2) data to be excluded but saved for future reanalysis. The following parameters were reported from each maneuver: FVC, FEV1, FEV0.75, FEV0.5, FEF25–75, duration of maneuver producing the best FVC (forced expiratory time [FET]), back-extrapolated volume (Vbe), and Vbe/FVC.

It was discovered that the Jaeger MasterScope software occasionally reported timed expired volumes inappropriately, so that some children appeared to have an FEV1 equal to FEV0.75 or an FEV0.75 equal to FEV0.5. The reason for this fault is that the Jaeger system continues recording until an inspired volume of 40 ml has been recorded. The software then identifies the point at which inspiratory flow began, and defines this point as the end of expiration (J. Reinstaedtler, personal communication). When children kept their mouth on the mouthpiece at the end of the expiration, the FET recording was correctly terminated at the beginning of the next inspiration. Some children, particularly the younger ones, removed their mouth from the mouthpiece as soon as they had completed their expiration. In this case, the program continued measuring FET until the investigator manually terminated the recording. On visual inspection of the traces we discovered that some of the FET results were overestimated.

Therefore, verification of the data was undertaken. First, all reports in which FET was less than 1 second, and/or in which FEV0.75 or FEV1 was equal to FVC, were identified. These curves were then visually inspected and timed volumes that had been inappropriately reported were excluded. By this method six children who had produced satisfactory FEV0.5, but from whom FEV0.75 should not have been reported, and seven children who had produced satisfactory FEV0.5 and FEV0.75, but from whom FEV1 should not have been reported, were identified.

Statistical Analysis

The CF and control populations were compared for age, weight, and height. Success in producing FVC, FEV1, FEV0.75, FEV0.5, and FEF25–75 was compared by age group, and by diagnosis. FET was plotted against age and height. Start of test was examined by plotting Vbe (as absolute, and as %FVC) against age. Repeatability was examined by calculating the difference between the two highest FEV0.75 readings (ΔFEV0.75), and between the two highest FVC readings (ΔFVC). Timed expired volumes as fractions of FVC (FEV0.5/FVC, FEV0.75/FVC, and FEV1/FVC) were calculated. Vbe, Vbe/FVC, ΔFEV0.75, ΔFVC, FEV0.5/FVC, FEV0.75/FVC, and FEV1/FVC were all compared by age group and by diagnosis. Data obtained from healthy children were compared with published reference values.

Summary statistics are presented as mean and standard deviation (SD) if normally distributed, and as median and interquartile range (IQR) if nonnormally distributed. Proportions were compared by χ2 test. t tests or Mann–Whitney tests were employed for comparison of two groups; Analysis of variance and post hoc analysis by Tukey's honestly significant difference (HSD) were used to compare three or more groups. When interaction between several factors was suspected this was investigated by multiple regression. z scores for FVC, FEV1, and FEF25–75 were calculated from reference equations derived from 184 healthy white preschool children tested in Indianapolis, Indiana (3). z scores for FVC, FEV0.5, and FEV1 were calculated from reference equations derived from 603 preschool children tested in Oslo, Norway (4). For all analyses a p value of 0.05 was regarded as significant.

Forty-two children with CF and 37 healthy children were recruited. Mean (SD) age of the children with CF was 4.14 (0.90) years. Mean age of the control children was 4.14 (0.85) years. Although matched for age, the children with CF were significantly lighter and shorter than the control children (Table 1)

TABLE 1. Characteristics of study population, summarized by diagnosis




Cystic Fibrosis

Healthy Control

95% CI for
 Difference
 (CF–HC)
n4237
Male, %*4560−36, 8
White, %*88700, 36 
Age, yr4.14 (0.90)4.14 (0.85)−0.39, 0.40
Height, cm100.9 (7.3)104.2 (7.9)−6.6, 0.1
Weight, kg16.7 (2.9)18.6 (3.8)−3.4, −0.4
Height, z score−0.42 (1.13)0.26 (1.23)−1.22, −0.13
Weight, z score
−0.01 (1.11)
0.71 (1.23)
−1.26, −0.18

* Comparison between groups by χ2 test.

p < 0.05.

Definition of abbreviations: CF = cystic fibrosis; CI = confidence interval; HC = healthy control.

Results presented are means (SD) unless otherwise stated.

.

Success Rates

Seven children were unwilling to attempt spirometry. The median number of spirometry maneuvers attempted by the remaining 72 children was 12 (range, 6–22). Success in producing a reportable FVC, FEV0.5, FEV0.75, FEV1, and FEF25–75 is presented in Table 2

TABLE 2. Success rates in obtaining forced expiratory parameters






Age

All
 Children (n = 79)
Healthy Control Subjects
 (n = 37)
Cystic
 Fibrosis
 (n = 42)
2 to < 4 yr (n = 39)
4 to < 5 yr (n = 25)
5 to < 6 yr (n = 15)
FVC59 (75)29 (78)30 (71)25 (64)21 (84)13 (87)
FEV0.559 (75)29 (78)30 (71)25 (64)21 (84)13 (87)
FEV0.7553 (67)25 (68)28 (67)20 (51)20 (80)13 (87)
FEV146 (58)20 (54)26 (62)16 (41)18 (72)12 (80)
FEF25–75
59 (75)
29 (78)
30 (71)
25 (64)
21 (84)
13 (87)

Results are expressed as n (%), summarized by age group and diagnosis. As only six of the children were less than 3 years of age, children aged 2 to less than 4 years were analyzed together.

. Fifty-nine children were able to produce acceptable spirometry loops (Figure 1). Nine of these children could produce only 2 acceptable curves, and the remaining 50 children produced between 3 and 9 acceptable loops each (median, 4).

There was no significant difference between the CF and control groups in success rate for any of the parameters. Children aged 2 to less than 4 years were significantly less likely to produce FEV0.75 or FEV1 than children aged 4 to less than 5 years or children aged 5 to less than 6 years. No other differences in success rates by age group were seen.

Ability to meet quality control criteria was assessed for the 59 children who produced acceptable results.

Start of Test Criteria

Values for back-extrapolated volume (Vbe) and Vbe/FVC are summarized in Table 3

TABLE 3. Start of test criteria and repeatability criteria






Age

All Children
Healthy
 Control
 Subjects
Cystic Fibrosis
2 to < 4 yr
4 to < 5 yr
5 to < 6 yr
VBE, ml*65 (17)63 (20)    67 (14)64 (16)67 (12)    66 (25)
VBE/FVC, %*7.2 (2.8)6.4 (2.3)  7.9 (3.0)7.7 (2.7)7.2 (2.8)    5.9 (2.6)
FET, s1.5 (1.1, 2.2)1.4 (1.1, 1.8)1.5 (1.2, 2.2)1.3 (1.1, 2.1)1.6 (1.1, 2.2)1.6 (1.1, 2.4)
ΔFVC, ml34 (16, 63)34 (14, 77)33 (16, 58)34 (14, 63)31 (18, 73)36 (11, 66)
ΔFVC, %3.5 (1.7, 7.0)4.2 (1.6, 7.6)3.1 (1.7, 6.5)4.3 (1.5, 7.6)3.3 (1.8, 7.1)2.9 (1.5, 5.5)
ΔFEV0.75, ml32 (11, 53)38 (15, 62)23 (10, 44)23 (11, 40)36 (9, 59)34 (14, 73)
ΔFEV0.75, %
3.4 (1.4, 5.2)
4.0 (1.5, 5.6)
3.1 (1.4, 5.1)
3.2 (2.3, 4.3)
3.9 (1.0, 6.2)
3.4 (1.3, 6.5)

* Mean (SD).

Median (interquartile range).

Definition of abbreviations: FET = duration of maneuver producing the best FVC; ΔFEV0.75 = difference between best and second best FEV0.75; ΔFVC = difference between best and second best FVC; VBE = back-extrapolated volume.

Results are summarized by age group and diagnosis.

. Mean (SD) Vbe for all children was 65 (17) ml; Vbe/FVC was 7.2 (2.8)%. There was no relationship between Vbe and height (Figure 2A) , or between Vbe and age, and all but 4 of 59 children (7%) produced a Vbe less than or equal to 80 ml. Figure 2B demonstrates the relationship between Vbe/FVC and height (r2 = 0.27, p < 0.0005). Although younger children tended to have higher Vbe/FVC values this difference was not significant. Only 16 of 59 children (27%) were able to produce a Vbe/FVC value less than 5%. Seven children (12%) produced a Vbe/FVC greater than 10%, of whom 2 produced a Vbe/FVC greater than 12.5%. There was no relationship between diagnosis and Vbe. Children with CF had a significantly higher Vbe/FVC, but this was no longer significant after correction for height. Curves with Vbe greater than 80 ml and curves with Vbe/FVC greater than 12.5% were reexamined, and compared with curves that had been excluded on visual inspection.

Duration of Maneuver

FET, summarized by age group and diagnosis, is presented in Table 3. Twelve of the 59 children produced a FET less than 1 second. These results must be interpreted with caution.

For the majority of children FEV1/FVC was greater than 90%, irrespective of age group or diagnosis. By contrast, for the majority of children FEV0.5/FVC was less than 90%. In healthy children there was a significant negative relationship between FEV0.5/FVC and height (r2 = 0.22, p = 0.009) and between FEV0.75/FVC and height (r2 = 0.27, p = 0.008), but no relationship between FEV1/FVC and height. There was no relationship between FEV/FVC parameters and height in the CF population.

Within-Occasion Repeatability

Differences between the two highest values of FVC (ΔFVC) and FEV0.75 (ΔFEV0.75) are displayed in Table 3. The population median for all age groups was less than 4% for ΔFEV0.75, and less than 5% for ΔFVC, with no significant difference for either parameter by age group. Twenty-four children (41%) had ΔFVC greater than 5%, of whom 5 (9%) had ΔFVC greater than 10%. Fifteen children (28%) had ΔFEV0.75 greater than 5%, of whom 3 (6%) had ΔFEV0.75 greater than 10%.

Comparison with Published Reference Data

Figure 3

presents FVC, FEV1, and FEF25–75 results from Indianapolis and the Institute of Child Health. Results are presented for healthy children of all ethnic groups, and separately for all white children. When the data for white children alone were plotted there was good concordance between results obtained in the two centers. z scores for spirometry parameters were calculated for the healthy children. Of the 29 children who produced reportable results, 18 were white, 3 were black, and 8 were of mixed race. Summary results are presented for the whole control population, and for the white children (Table 4)

TABLE 4. Z scores for spirometry parameters in healthy children




All Healthy Children (n = 29)

Healthy White Children (n = 18)

t Test (95% CI)*
FVC z score (Oslo)0.18 (0.70)0.43 (0.52)0.17, 0.67
FVC z score (Indianapolis)−0.76 (1.21)−0.30 (0.79)−0.70, 0.09 
FEV0.5 z score (Oslo)−0.11 (0.84)0.08 (0.77)−0.30, 0.46 
FEV1 z score (Oslo)0.17 (0.66)0.31 (0.47)0.02, 0.59
FEV1 z score (Indianapolis)0.13 (1.1)0.37 (0.71)−0.06, 0.80 
FEF25–75 z score (Indianapolis)
−0.35 (1.44)
−0.29 (1.30)
−0.94, 0.36

* Comparison between z scores obtained in healthy white children versus zero, by one-sided t test.

p < 0.005.

p < 0.05.

Results are presented as means (SD), for all healthy children, and for healthy white children. z scores are calculated from reference equations derived from data collected from healthy children in Indianapolis (3) and Oslo (4).

.

Additional results are presented in the online supplement. The online supplement includes traces that were excluded on visual inspection, traces from maneuvers that had high Vbe or Vbe/FVC, data on ethnic background, analysis of timed expired volumes as proportions of FVC by age and diagnosis, plots of ΔFEV0.75 and ΔFVC against age, and plots of z scores for our healthy population (3, 4).

In this study results of spirometry attempts in seventy-nine 2- to 5-year-old children were analyzed in detail, to determine whether the results met published quality control criteria for adult spirometry. All flow–volume and volume–time traces were first visually inspected and passed or rejected. Forced expiratory volumes and flows obtained in healthy children were similar to those reported by other groups. The criteria tested were start of test, assessed by back extrapolation; duration of expiration, and the timed expired volumes that are influenced by this; and repeatability. For all these parameters modifications to quality control criteria appear necessary for application to preschool children.

Visual Inspection

Visual inspection of the flow–volume curve and the volume–time curve is a mandatory first step for quality control of spirometry. Some of the faults that require results to be excluded are self-evident: these include cough or glottic closure, double expiration (seen as a double peak on the flow–volume curve), or failure to produce an adequate peak flow. Previous studies of spirometry in this age group have used similar criteria for accepting or discarding loops (35).

Evaluation of end-of-test by visual inspection is more difficult. In older children and adults the volume–time trace asymptotes to horizontal toward the end of expiration (1). In preschool children, particularly the youngest, this asymptote is frequently not seen, even in children who appear to have produced a complete expiration (Figure 1). The abrupt cessation of expiration in very young children complicates assessment of end-of-test, as curves such as that shown in Figure 1 should not be excluded inappropriately. Eigen and colleagues (from the Indianapolis group) have suggested excluding curves when expiratory flow abruptly ceases from a point greater than 25% of peak flow (3); Marostica and colleagues (also from the Indianapolis group) have proposed excluding curves when expiratory flow abruptly ceases from a point greater than 10% of peak flow (9); while Vilozni and colleagues have suggested excluding curves when cessation occurs abruptly from a flow of 300 ml · second–1 or more (5). All suggestions have merit, particularly if consensus could be reached, but many spirometry software systems do not provide such information automatically. Our current study design, in which all curves are inspected by at least two experienced researchers, and “borderline” curves by at least three, is not possible in daily practice. We therefore suggest that manufacturers should modify software so that information on change in rate of lung emptying toward the end of expiration is easily available, with automatic identification and display of critical cutoff points as suggested above. In those curves where incomplete expiration is identified, timed expired volumes and peak flows may still be reportable (13), even though FVC and forced expiratory flows cannot be reported from such maneuvers.

Quantitative Start of Test Criteria

ATS guidelines recommend that start of test be assessed quantitatively by calculating the Vbe, and that this volume should be no greater than 5% of FVC, or an absolute volume of 150 ml, whichever is the greater (1). ERS criteria use 5% of FVC, or absolute volume of 100 ml as the cutoff (2). Although the majority of school-age children can achieve the 5% criterion (11), this does not appear to be the case among preschool children. In our study only 16 (27%) children produced a Vbe/FVC of less than 5%, suggesting that this cutoff is too strict, whereas a cutoff of 150 ml is too high to be of any value in a preschool population. In Figure 2B we have marked the four children who produced a Vbe greater than 80 ml; all these children had Vbe/FVC less than 12.5%.

All the curves included in this analysis had been passed visually as showing acceptable rise to peak flow. After reinspection of curves with high Vbe or Vbe/FVC, we concluded that our initial visual inspection had included and excluded curves correctly. We therefore suggest that expirations with Vbe greater than 80 ml or Vbe/FVC greater than 12.5% should be visually reinspected to ensure acceptable rise to peak flow, but should not necessarily be excluded.

Quantitative End-of-Test Criteria and Timed Expired Volumes

ATS criteria recommend that expiration continue until there is a clear plateau on the volume–time trace, and that FET should be at least 6 seconds, or that there should be no volume change for 1 second. Although these ATS recommendations state that a shorter exhalation time is acceptable in children they do not specify what this time should be. ERS criteria do not quantify duration of expiration for quality control. A previous study has demonstrated that a 6-second forced expiration is impossible even for school-age children to achieve (11), and we intended to examine this in our preschool population. Unfortunately, the FET results obtained from the Jaeger system were found to be unreliable.

Duration of expiration determines which time-limited volumes are reportable. If FET is less than 1 second, then FEV1 cannot be reported from that expiration. Success in producing FEV1 was dependent on age in this study, with only 41% of children younger than 4 years able to produce a FEV1, compared with 80% of those more than 5 years of age. There was no difference in the success rates between the healthy and CF groups. Previous researchers have reported similar findings. Eigen and coworkers reported results only from children who were able to expire for at least 1 second, but still recorded a creditable success rate of 83% in a slightly older population than ours (mean [SD] age, 5.0 [0.9] years) (3). Crenesse and coworkers reported results in 473 children aged 3 to 5 years. Seventy-five percent (355) were able to produce at least 1 acceptable forced expiratory maneuver. Of these, only 75% of children 3 to less than 4 years old, 73% of children 4 to less than 5 years old, and 87% of children 5 to less than 6 years old were able to produce an FET of greater than 1 second (10). Nystad and colleagues reported results from 630 children aged 3–6 years, and found that 10% were unable to expire for 1 second (4). It is therefore recommended that spirometry software should calculate FEV0.5 and FEV0.75, and that all laboratories should analyze and report both these parameters in addition to FEV1 for children aged under 6 years, to allow comparison with others of similar age. For many 2- to 4-year-old children, only the shorter timed volumes will be reportable. The Jaeger software currently allows timed expiratory volumes to be reported incorrectly. Operators should ensure that their own software does not contain similar errors. Any timed volume that is reported as equal to FVC should be considered potentially erroneous.

Even when an FEV1 is attainable, the clinical value of this parameter in this age group is questionable. Compared with older children and adults, infants and preschool children have large airways relative to their lung volume, and therefore empty their lungs more rapidly. As a result, the FEV1/FVC ratio is more than 90% in the majority of preschool children, including those aged 5 to 6 years, and regardless of whether these children have lung disease. Reporting of FEV0.4 or FEV0.5 in infants is now accepted (13), and assessment of FEV0.5 or FEV0.75 may be more clinically relevant than FEV1 in the preschool age group. Further investigation of the discriminative ability of different spirometric parameters is beyond the scope of the present study, but is an important area for future research.

Repeatability

ATS criteria recommend that the difference between the two highest values of FVC and FEV1 should ideally be less than 200 ml, whereas ERS criteria stipulate that such differences should be less than 100 ml, or 5% of the best effort, whichever is the greater. In the present study, the repeatability of FVC and FEV0.75 were assessed. All but five children had ΔFVC less than 100 ml, and all had a ΔFEV0.75 less than 100 ml. However, as FVC and FEV0.75 are much lower in this age group, use of absolute values as measures of repeatability may not be appropriate. Arets and colleagues (11) have previously reported that 88% of school-age children can produce a second FVC within 5% of their highest, and 87% can produce a second FEV1 within 5% of their highest. In comparison, only 59% of our population were able to produce a second FVC and 72% a second FEV0.75 within 5% of their highest. However, for both parameters, almost all were able to produce a second effort within 10% of their highest. These figures are similar to those previously reported by Nystad and colleagues (4). We therefore suggest that either an absolute value of 100 ml, or 10% of best effort, would be achievable repeatability targets for FVC and FEV0.75 in this age group. We echo the ATS recommendation for adults that failure to meet repeatability criteria does not necessarily invalidate the maneuver (11).

Comparison with Previously Published Data

To test whether results in our healthy children were similar to those reported by other groups we plotted our data alongside those previously reported by Eigen and colleagues, and also calculated z scores from published reference equations. The population measured in the current study had a higher proportion of nonwhite children (11 of 29 successful children, or 38%) than that reported by Eigen and colleagues (30 of 214, or 14%) (3). This may explain why concordance between the two populations is better for white children only than when data from children of all ethnic groups are combined. From data presented in Figure 3 and Table 4, it appears that nonwhite children have lower FVC and lower timed expired volumes than white children of the same height. Small numbers and the heterogeneity of our nonwhite population prevent further conclusions being drawn from the data currently available. However, this phenomenon warrants further investigation.

Ethnic group was not specified by the Oslo group, but Oslo has a very small nonwhite population. We therefore calculated summary statistics for our entire healthy population and separately for our healthy white children. z scores for our control population should be zero, with a standard deviation of 1, for all parameters, provided that our equipment and methodology are similar to that used by previous groups, and that our control population is similar to those from whom the reference equations were derived (14). For our white population, this was true for FEV0.5 z scores calculated from Oslo data, for FVC and FEF25–75 z scores calculated from Indianapolis data, and arguably true for FEV1 z scores calculated from Indianapolis data. Our FVC and FEV1 results did not fit the reference equations reported by the Oslo group.

Methodologic Issues and Strengths and Limitations

In this study we tested children in the seated position whenever possible, and asked them to wear nose clips. Very young children may rebel against such instructions and we believe in such cases that these criteria should be relaxed if this is the only means of obtaining results. ATS guidelines suggest that subjects can be tested either seated or standing, but that this should be reported with the data (1). A report from school-age children who were tested with and without nose clips found no systematic effect on FEV1 or FVC (15).

Incentive software can be distracting for older children (16), but in our population we found it helpful. We used the Jaeger incentive program (which is routinely available with Jaeger spirometry software) for all children. Vilozni and colleagues have reported that only 2% of 3- to 6-year-old children are able to produce an FEV1 with the Jaeger incentive software, as compared with 49% of children who could produce an FEV1 with their own SpiroGame incentive software (5). We note that in Vilozni's study only the Jaeger “Candle” incentive (which encourages rapid expiration) was used, and that, unlike in our study, the investigators did not move on to alternative incentives that encourage prolongation of expiration. We believe this methodologic difference explains the marked discrepancy between our results (in a younger population) and those previously reported by Vilozni's group.

On a related matter, children in the current study performed a maximum of 22 expiratory maneuvers, rather than the 8 recommended by the ATS (1). The reason for this is that many younger children initially produced incomplete expirations, particularly when using the “Candle” incentive. The Jaeger system does not allow easy recording of repeated maneuvers, and does not allow borderline or excluded traces to be stored for later reanalysis. Overcoming these hurdles required considerable effort on our part, including the use of custom-written software, and we consider this a strength of our study.

The most notable weakness of our study is that children were tested toward the end of a 2-hour laboratory visit. It is likely that some children were fatigued by the time they performed spirometry, and success rates may have been adversely affected. It is unlikely that other parameters would have been influenced by this methodologic issue. By contrast, one factor that may have increased our success rate was our use of a specially adapted preschool laboratory, with data collection performed by trained operators. This setup is not always available in the clinical setting.

Conclusions and Suggestions for Future Work

There is now no doubt that spirometry in preschool children is feasible. Although some failure is inevitable, results can be obtained in 70 to 80% of these subjects. It remains to be proven whether measures obtained at this age are sensitive enough to influence clinical or research practice, but we suggest that establishing standards for quality control in the preschool age group, similar to those established for infant lung function testing (17), is now essential. The ATS and ERS have formed a working party to address this issue, and it is hoped that the data presented here will contribute to their effort. From our data we would suggest the following:

  1. All curves must be visually inspected, but this process could be speeded and facilitated by modifications to software.

  2. Start of test can be quantitatively assessed as in adults, but results greater than 80 ml for Vbe or 12.5% for Vbe/FVC should be indications for visual reinspection of the flow–volume trace, rather than automatic exclusion.

  3. FEVt should be reported only if FET for that expiration is greater than t.

  4. In all preschool children both FEV0.75 and FEV0.5 should be reported in addition to FEV1.

  5. Repeatability can be assessed as for adults, but criteria of 100 ml and 10% of best effort for ΔFVC and ΔFEVt may be more appropriate than the criteria applied to adults.

In addition to these recommendations, we caution users that current spirometry software may result in erroneous reporting of FET and timed expired volumes. We have contacted manufacturers with requests to amend software to allow quantitative assessment of end-of-test, and to allow recording and reanalysis of multiple maneuvers, including those initially assessed as inadequate or borderline. These software amendments should allow data to be excluded from reports or summaries, while still being retained on databases to allow future reanalysis.

Before incorporating such recommendations into guidelines it will be necessary for other groups to assess these criteria in their own populations. Given the portability of spirometry equipment, these criteria should ideally be tested in both specialized laboratories and in general clinics or health centers. Accurate application and interpretation of spirometric assessment in the preschool age group has considerable potential benefit, both for clinical practice and research. This benefit justifies the effort required for standardization.

The authors thank the children and their families for taking part in this study. The authors also thank Dr. Rod Lane and Mr. Aidan Laverty (Respiratory Function Laboratory, Great Ormond Street Hospital for Children) and Dr. Sooky Lum (Portex Unit, Institute of Child Health) for comments on the final draft. The authors particularly acknowledge the assistance of Ms. Gail Slade, who assisted with some of the measurements, but who sadly passed away before this manuscript was written.

1. American Thoracic Society. Standardization of spirometry: 1994 update. Am J Respir Crit Care Med 1995;152:1107–1136.
2. Quanjer PH, Tammeling GJ, Cotes JE, Pedersen OF, Peslin R, Yernault JC. Lung volumes and forced ventilatory flows: report of Working Party on Standardization of Lung Function Tests, European Community for Steel and Coal. Official statement of the European Respiratory Society. Eur Respir J Suppl 1993;16:5s–40s.
3. Eigen H, Bieler H, Grant D, Christoph K, Terrill D, Heilman DK, Ambrosius WT, Tepper RS. Spirometric pulmonary function in healthy preschool children. Am J Respir Crit Care Med 2001;163:619–623.
4. Nystad W, Samuelsen SO, Nafstad P, Edvardsen E, Stensrud T, Jaakkola JJ. Feasibility of measuring lung function in preschool children. Thorax 2002;57:1021–1027.
5. Vilozni D, Barker M, Jellouschek H, Heimann G, Blau H. An interactive computer-animated system (SpiroGame) facilities spirometry in preschool children. Am J Respir Crit Care Med 2001;164:2200–2205.
6. Jones MH, Davis SD, Grant D, Christoph K, Kisling J, Tepper RS. Forced expiratory maneuvers in very young children: assessment of flow limitation. Am J Respir Crit Care Med 1999;159:791–795.
7. Zapletal A, Chalupova J. Forced expiratory parameters in healthy preschool children (3–6 years of age). Pediatr Pulmonol 2003;35:200–207.
8. Kanengiser S, Dozor AJ. Forced expiratory maneuvers in children aged 3 to 5 years. Pediatr Pulmonol 1994;18:144–149.
9. Marostica PJ, Weist AD, Eigen H, Angelicchio C, Christoph K, Savage J, Grant D, Tepper RS. Spirometry in 3 to 6 year old children with cystic fibrosis. Am J Respir Crit Care Med 2002;166:67–71.
10. Crenesse D, Berlioz M, Bourrier T, Albertini M. Spirometry in children aged 3 to 5 years: reliability of forced expiratory maneuvers. Pediatr Pulmonol 2001;32:56–61.
11. Arets HG, Brackel HJ, van der Ent CK. Forced expiratory manoeuvres in children: do they meet ATS and ERS criteria for spirometry? Eur Respir J 2001;18:655–660.
12. Ranganathan S, Dezateux CA, Bush A, Carr SB, Castle R, Madge SL, Price JF, Stroobant J, Wade AM, Wallis CE, et al. Airway function in infants newly diagnosed with cystic fibrosis. Lancet 2001;358:1964–1965.
13. Ranganathan SC, Hoo AF, Lum SY, Goetz I, Castle RA, Stocks J. Exploring the relationship between forced maximal flow at functional residual capacity and parameters of forced expiration from raised lung volume in healthy infants. Pediatr Pulmonol 2002;33:419–428.
14. Stocks J, Quanjer PH. Reference values for residual volume, functional residual capacity and total lung capacity: ATS Workshop on Lung Volume Measurements. Official statement of the European Respiratory Society. Eur Respir J 1995;8:492–506.
15. Chavasse R, Johnson P, Francis J, Balfour-Lynn I, Rosenthal M, Bush A. To clip or not to clip? Noseclips for spirometry. Eur Respir J 2003;21:876–878.
16. Gracchi V, Boel M, van der Laag J, van der Ent CK. Spirometry in young children: should computer-animation programs be used during testing? Eur Respir J 2003;21:872–875.
17. Stocks J, Sly P, Morris MG, Frey U. Standards for infant respiratory function testing: what(ever) next? Eur Respir J 2000;16:581–584.
Correspondence and requests for reprints should be addressed to Paul Aurora, M.R.C.P., Portex Respiratory Unit, Institute of Child Health, 30 Guilford Street, London WC1N 1EH, UK. E-mail:

Related

No related items
American Journal of Respiratory and Critical Care Medicine
169
10

Click to see any corrections or updates and to confirm this is the authentic version of record