Flexible Meta-Regression to Assess the Shape of the Benzene–Leukemia Exposure–Response Curve
Jelle Vlaanderen1, Lützen Portengen1, Nathaniel Rothman2, Qing Lan2, Hans Kromhout1, Roel Vermeulen1
1 Institute for Risk Assessment Sciences, Utrecht University, Utrecht, the Netherlands, 2 Division of Cancer Epidemiology and Genetics, National Cancer Institute, Department of Health and Human Services, National Institutes of Health, Bethesda, Maryland, USA
Environ Health Perspect 118:526-532 (2010). http://dx.doi.org/10.1289/ehp.0901127 [online 18 November 2009]
Background: Previous evaluations of the shape of the benzene–leukemia exposure–response curve (ERC) were based on a single set or on small sets of human occupational studies. Integrating evidence from all available studies that are of sufficient quality combined with flexible meta-regression models is likely to provide better insight into the functional relation between benzene exposure and risk of leukemia.
Objectives: We used natural splines in a flexible meta-regression method to assess the shape of the benzene–leukemia ERC.
Methods: We fitted meta-regression models to 30 aggregated risk estimates extracted from nine human observational studies and performed sensitivity analyses to assess the impact of a priori assessed study characteristics on the predicted ERC.
Results: The natural spline showed a supralinear shape at cumulative exposures less than 100 ppm-years, although this model fitted the data only marginally better than a linear model (p = 0.06). Stratification based on study design and jackknifing indicated that the cohort studies had a considerable impact on the shape of the ERC at high exposure levels (> 100 ppm-years) but that predicted risks for the low exposure range (< 50 ppm-years) were robust.
Conclusions: Although limited by the small number of studies and the large heterogeneity between studies, the inclusion of all studies of sufficient quality combined with a flexible meta-regression method provides the most comprehensive evaluation of the benzene–leukemia ERC to date. The natural spline based on all data indicates a significantly increased risk of leukemia [relative risk (RR) = 1.14; 95% confidence interval (CI), 1.04–1.26] at an exposure level as low as 10 ppm-years.
Key words: benzene, epidemiology, leukemia, meta-regression, quantitative risk assessment
Address correspondence to J. Vlaanderen, Institute for Risk Assessment Sciences (IRAS), Division Environmental Epidemiology, Jenalaan 18d, 3584 CK, Utrecht, the Netherlands. Telephone: 31-30-2539681. Fax: 31-30-2539499. E-mail: J.J.Vlaanderen@uu.nl
This work was performed as part of the work package “integrated risk assessment” of the Environmental Cancer Risk, Nutrition and Individual Susceptibility Network of Excellence, operating within the European Union 6th Framework Program, Priority 5: “Food Quality and Safety” (FOOD-CT-2005-513943).
The authors declare they have no competing financial interests.
We thank V. Bagnardi for providing his macro “Flexible.” The macro used in this study is available from J.V. on request.
Received 25 June 2009; accepted 18 November 2009; online 18 November 2009.
Although broad consensus exists in the scientific community that benzene is a leukemogen, there is considerable uncertainty regarding the actual shape of the exposure–response curve (ERC) (Crump 1996; Paustenbach et al. 1993). Most of the current epidemiologic evidence for an increased leukemia risk stems from studies among workers exposed to relatively high levels of benzene [McDonald 2001; U.S. Environmental Protection Agency (EPA) 1998]. The risks of leukemia at lower benzene exposures, however, remain largely unclear. Importantly, exposure to benzene in occupational settings has dropped considerably during the last three decades (Capleton and Levy 2005). Furthermore, a large proportion of the general population is exposed to low levels of benzene (< 0.2 ppm) through car exhaust, cigarette smoke, and other sources (Johnson et al. 2007). Therefore, the primary interest of current benzene risk assessment is in risks associated with exposure to low levels of benzene.
In existing risk assessments of benzene, the ERC was assessed based on evidence from one “best” study or, alternatively, from a limited set of “best” studies (McDonald 2001; U.S. EPA 1998). These investigations were mostly conducted among relatively highly exposed workers, so the derived ERC might not be directly applicable to workers exposed to lower levels or to the general population. Furthermore, these risk assessments used linear models to describe the ERC, but increasing evidence from molecular epidemiologic studies of workers exposed to a wide range of benzene levels indicates that the shape of the ERC for benzene and its toxic effects may be nonlinear. This hypothesis is based on the observation that the dose-related production of urinary metabolites of benzene, which include the toxic metabolites muconic acid and hydroquinone and the less toxic metabolites phenol and catechol, decreases with increasing benzene exposure (Kim et al. 2006a, 2006b; Rothman et al. 1998). Furthermore, some evidence shows that benzene metabolism favors the production of the toxic metabolites hydroquinone and muconic acid at low exposures (Kim et al. 2006a). This is especially important because hydroquinone is the precursor of 1,4-benzoquinone, which is generally regarded as the most hematotoxic metabolite of benzene (Kim et al. 2006a). The nonlinear production of benzene’s toxic metabolites would have important consequences for risk assessment because one would expect this to result in a nonlinear relationship between benzene exposure and health outcomes as well. Indeed, in a study that looked at the shape of the dose–response curve of benzene-related hematologic effects, a sharper drop in the peripheral white blood cell count was observed at lower levels of exposure (< 1 ppm) than at higher levels (Lan et al. 2004, 2006).
To explore the shape of the benzene–leukemia ERC, we performed flexible meta-regressions on a set of studies that reported results from quantitative exposure–response analysis or benzene and leukemia. For our analyses, we used a modified version of the approach proposed by Bagnardi et al. (2004) that was applied to studies on alcohol and mortality and on silica and lung cancer (Lacasse et al. 2009). It consists of fitting a set of regression models that includes (flexible) regression splines and linear models to aggregated data, adjusting for the expected correlation of estimated (relative) risks within studies. An improvement of this approach over existing meta-regression methods is that the use of regression splines eliminates the need to make strict a priori assumptions regarding the shape of the ERC, which allows for a more objective evaluation of its actual shape.
Materials and Methods
Identification of studies and evaluation of study quality. Publications eligible for the meta-regression were identified by a PubMed search that included the MESH key words “benzene,” “humans,” and “leukemia” in combination with either “cohort studies” or “case–control studies.” Other publications were added by following references included in a literature review by Schnatter et al. (2005) that was identified in the original PubMed search and in regulatory risk assessments by the Canadian Centre for Occupational Health and Safety, the U.S. National Institute for Occupational Safety and Health, the U.S. Agency for Toxic Substances and Disease Registry, and the U.S. EPA (Agency for Toxic Substances and Disease Registry 2007; U.S. National Institute for Occupational Safety and Health 1976; Penney 1995; U.S. EPA 1998). The quality of the 11 studies that reported results from quantitative exposure–response analysis for benzene and leukemia (mortality or incidence) was evaluated using a previously developed evaluation framework (Vlaanderen et al. 2008). The first tier of the framework consisted of six criteria that are related to crucial aspects of the quality of the design, the quality of conduct, and the quality of the reporting of human observational studies [see Supplemental Material (doi:10.1289/ehp.0901127)]. A study was excluded from the meta-regression if it did not meet all of the six criteria. Nine studies were of sufficient quality to be included in the meta-regression (Table 1). Two studies that reported results from quantitative exposure–response analysis were excluded: One expressed exposure in undefined units (Guénel et al. 2002), and the other provided insufficient details that resulted in a lack of insight regarding the decisions made in the statistical analysis (Collins et al. 2003).
Extraction of data from the incorporated studies. A database was constructed based on published data available for the studies incorporated in the meta-regression. We extracted only risk estimates reported for cumulative exposure to benzene (expressed in ppm-years or ppm-months). The database contained the following fields: study identifier, study design, exposure category, risk estimate, confidence interval for the risk estimate, and number of cases and controls for each exposure category (nested case–control studies) or the size of the study population for the exposure category (cohort studies). Three different epidemiologic study designs contributed to the current meta-regression: the nested case–control design (n = 3), the cohort design with an internal reference population (n = 1), and the cohort design with an external reference population (n = 5; Table 1). Reported odds ratios (ORs), relative risks (RRs), and standardized mortality ratios (SMRs) were combined and interpreted as estimates of the RR for the purpose of this meta-regression (McElvenny et al. 2004). The studies selected for the meta-regression were also different with regard to the definition of the reference population that was used. The cohort studies assumed “background (environmental) exposure” in their reference populations (Bloemen et al. 2004; Costantini et al. 2003; Hayes et al. 1997; Rinsky et al. 2002; Swaen et al. 2005; Wong 1987). Typical daily environmental exposure to benzene can range up to 0.2 ppm, which, over a 70-year life span, accumulates to a maximum of 14 ppm-years of cumulative exposure (Johnson et al. 2007). In the nested case–control studies, individuals in the lowest exposure category were used as the reference population (ranging from < 0.17 ppm-years to < 1 ppm-years occupational exposure) (Glass et al. 2003; Rushton and Romaniuk 1997; Schnatter et al. 1996).
Preparation of the data extracted from the publications. Three steps were necessary to prepare the extracted data for use in the meta-regression models. In the first step, we assigned a specific cumulative exposure estimate to each risk estimate. It is common practice to report only the boundaries of the exposure categories used in an exposure–response analysis, and this was the case for all included studies except the Swaen study, which reported an average mean exposure (Greenland and Longnecker 1992; Swaen et al. 2005). To estimate the mean exposure for the assigned exposure categories, we assumed a log-normal distribution for the cumulative exposure for each study. Any data providing information on the exposure distribution within a study were collected from the publication (i.e., the number of person-years per exposure category, the number of controls per exposure category, or the number of expected cases per exposure category). Maximum likelihood estimation was used to fit a probability density function (PDF) to the available data. Using the PDF, we assigned an average cumulative exposure to each exposure category based on its respective boundaries. To avoid unreasonably high estimates of the average exposure in the highest exposure group, we truncated the exposure distribution at the maximum reported cumulative exposure level. This method of assigning specific cumulative exposures is similar to the approach that was proposed by Hartemink et al. (2006).
In the second step, we estimated the variance of each specific risk estimate to allow weighting based on the precision of the risk estimates in the meta-regression (DerSimonian and Laird 1986). The variance of RRs and ORs was estimated using the reported confidence intervals following a method discussed by Rothman et al. (1982). Estimated variances for studies that reported asymmetrical confidence intervals on the log scale were based on the upper confidence limit only (Boice and McLaughlin 2001; Wartenberg et al. 2000).
In the third step, we estimated the covariance between the different risk estimates within a study by applying the approach advocated by Shi and Copas (2004). This approach is necessary because risk estimates of a study based on a common internal reference group will be correlated. Ignoring this correlation in the meta-regression underestimates the variance of the risk estimates from the study that results in an overestimation of its weight (Bagnardi et al. 2004; Greenland and Longnecker 1992). The Canada-Petrol study (Schnatter et al. 1996) lacked the information necessary to estimate the covariance matrix; therefore, uncorrected variances were used for this study. For studies that reported SMRs, we did not estimate covariance because SMRs within a single study can be assumed to be largely independent when the expected number of deaths used to calculate the SMRs is based on a sufficiently large population.
Application of the (regression) models to describe the exposure–response relation. Natural spline models (with knots at the 20th, 50th, and 80th percentiles) as well as linear models were fitted to the data to investigate the shape of the exposure–response relation. To improve the statistical properties of the regression models, we fitted all models to the natural logarithm of the reported risk estimates (Berlin et al. 1993). Regression models were fitted to the data using a modified version of a macro developed by Bagnardi et al. (2004). All regression models allowed for (random) study-specific intercepts and exposure effects to accommodate potential between-study heterogeneity (DerSimonian and Laird 1986). Model deviance was used to compare goodness of fit between (nested) models.
Sensitivity analyses. For two cohort studies Pliofilm (Rinsky et al. 2002) and Dow (Bloemen et al. 2004) multiple updates were available (Table 1). These updates represent multiple reports on the same cohort. In the most recent report, researchers had the opportunity to follow the participants for the longest time since the start of the follow-up period.For these cohorts, the most recent update was included in the meta-regression. To assess the impact of varying follow-up times in our analysis, we also conducted the meta-regression with risk estimates that were abstracted from earlier updates of the Pliofilm and Dow cohorts (Bond et al. 1986; Rinsky et al. 1987). Substitution of the risk estimates did not have a substantial effect on the shape of the predicted ERC or on model fit to the data (data not shown). Sensitivity of the predicted ERC to the inclusion of specific studies was assessed with a jackknifing analysis, excluding one study at a time before (re)predicting the exposure–response relation. In addition, we analyzed the cohort studies (including those with an external reference group as well as the single study with an internal reference group) and nested case–control studies separately and compared their ERC predictions. To allow flexible prediction of the ERC, all sensitivity analyses were done using natural splines.
Prediction of risk estimates. Benzene–leukemia ERC RRs were estimated for four plausible scenarios at three different levels of cumulative exposure (10, 20, and 40 ppm-years, corresponding to 0.25, 0.5, and 1 ppm intensity of exposure over a tenure of 40 years). We used fitted regression models to predict the risk estimates with associated confidence intervals. In addition, corrected risk estimates and confidence intervals were calculated by subtracting the intercept at zero exposure predicted by the regression model from the risk estimates.
Software. Average exposure levels for reported exposure categories were estimated using R, version 2.7 (R Core Development Group, Vienna, Austria). All other statistical analyses were performed using SAS software for Windows (version 9.1, SAS Institute Inc., Cary, NC, USA).
Nine studies had sufficient quality to be included in the meta-regression. All included studies were performed in the occupational setting. Together, these studies provided 30 risk estimates over a range of 0.32–554.3 (assigned) ppm-years (Figure 1A). Nineteen (63%) of the risk estimates were assigned a cumulative exposure < 50 ppm-years (Figure 1B). Most of the risk estimates for the lower exposure range were provided by nested case–control studies. The differences in exposure levels between studies can be largely attributed to the different industries in which the studies were performed. The nested case–control studies were all performed in the petroleum industry, whereas the cohort studies were performed in the chemical industry (Pliofilm, Dow, Wong, and Swaen studies), in a shoe factory (Costantini study), or a wide range of different industries [Chinese Academy of Preventive Medicine–National Cancer Institute (CAPM-NCI) study] (Hayes et al. 1997; Table 1).
Scatter plot of the risk estimates extracted from the nine studies included in the meta-regression, based on the assigned average cumulative exposure: Full range of cumulative exposures (A) and cumulative exposures < 50 ppm-years (B). AHW, Australian Health Watch.
Predictions of the ERC based on a natural spline model and a linear model are presented in Figure 2. The lower deviance of the natural spline (deviance = 25.84, 27 df) compared with that of the linear model (deviance = 29.25, 28 df) suggests a slightly better fit [chi square test (1 df), p = 0.06]. The natural spline model also indicates a strong supralinear shape of the ERC in the low-exposure region, resulting in a considerable lower intercept than the linear model (RR = 1.33 vs. 1.65).
Predicted ERC using all risk estimates from the nine included studies based on a natural spline and linear regression model. Plot 1 is the predicted ERC based on a linear model. Plot 2 is the predicted ERC based on a natural spline model (knots are located at 2.9, 22.7, and 125.5 ppm-years). Dashed lines represent the 95% CIs of the predictions. Rug plot indicates the distribution of the estimated cumulative exposure for each risk estimate included in the analyses.
Results from a jackknifing analysis (Figure 3) suggest that the Pliofilm and the CAPM-NCI studies were particularly influential for the (high-exposure region of the) predicted ERC. Exclusion of the Pliofilm study from the meta-regression resulted in a strong reduction of risks predicted for cumulative exposures > 100 ppm-years, whereas exclusion of the CAPM-NCI study had the opposite effect. Exclusion of other studies had little impact on the predicted ERC.
Sensitivity analysis on the prediction of the ERC based on a natural spline. Graph represents nine plots of the predicted ERC based on all studies minus one. The plots are identified by the study that was excluded: 1, CAPM-NCI; 2, Dow; 3, Costantini; 5, US-Chemical (Wong et al. 1987); 6, Swaen; 7, Canada-Petrol; 8, AHW; 9, UK-Petrol; and 10, Pliofilm. Plot 4 is the predicted ERC based on all available studies (blue line). Abbreviation: AHW, Australian Health Watch.
Stratified analyses showed that study design had a considerable impact on the predicted ERC (Figure 4). The ERC based on the cohort studies had a similar shape compared with the ERC based on the full data. However, the supralinear shape was somewhat less pronounced and the predicted intercept slightly lower (RR = 1.13 vs. 1.33). The deviance of the natural spline model fitted to the cohort studies was smaller than the deviance of the corresponding linear model [deviance = 8.43 and 11.97 respectively; chi square test (1 df), p = 0.06]. The analysis based on the three nested case–control studies resulted in extremely wide confidence intervals around the predicted ERC and was essentially uninformative (Figure 4).
Predicted ERC stratified by study design based on a natural spline. Plot 1 is the predicted ERC based on only the nested case–control studies (knots are located at 1.0, 2.9, and 18.1 ppm-years). Plot 2 is the predicted ERC based on only the cohort studies (knots are located at 13.1, 67.1, and 277.6 ppm-years). Dashed lines represent the 95% CIs of the predictions.
We predicted the RRs for leukemia for three cumulative exposure levels (10, 20, 40 ppm-years) based on four different modeling scenarios for the shape of the benzene leukemia ERC (Table 2). The four scenarios were a natural spline with intercept fitted to all studies (scenario A), a natural spline without intercept fitted to all studies (scenario B), a natural spline with intercept fitted to the cohort studies (scenario C), and a linear model without intercept fitted to all studies (scenario D1) or only the cohort studies (scenario D2; Figure 5). Scenario A predicted the highest RRs (1.52, 1.73, and 2.11 for cumulative exposures of 10, 20, and 40 ppm-years respectively), although these dropped considerably (RR = 1.14, 1.29, and 1.59) after correction for the predicted intercept. RRs predicted in scenario B (RR = 1.22, 1.46, and 1.96) were somewhat lower than the uncorrected RRs from scenario A. Predictions using data from the cohort studies only (scenario C) were also lower than those predicted by scenario A (all studies) (RR = 1.25, 1.38, 1.67), although these differences largely disappeared after we corrected for the intercept. Finally, predictions of the RRs based on scenario D1 (RR = 1.04, 1.09, 1.19) and D2 (RR = 1.05, 1.10, 1.20) were very similar and considerably lower than the predictions based on the other scenarios.
Interpretation of the predicted ERC. We presented estimates of the benzene–leukemia ERC based on predictions from two regression models. Both the natural spline and linear regression models indicated a positive relation between cumulative exposure to benzene and leukemia risk, although risk appeared to increase more strongly at low exposures in the natural spline model. This supralinear shape of the natural spline model at low exposures is consistent with the increasing evidence that saturable metabolism plays an important role in the low-dose carcinogenicity of benzene (Kim et al. 2006a, 2006b; Lan et al. 2006; Rappaport et al. 2002a, 2002b, 2005, 2009; Rothman et al. 1998).
Alternative explanations for the nonlinear relation between (inhalatory) benzene exposure and leukemia that we found are depletion of susceptible individuals at high benzene exposure levels and bias due to attenuation of the exposure–response relation within or between studies (Stayner et al. 2003). Attenuation of exposure–response relations is commonly observed in occupational studies and can be caused by several factors, including the healthy worker survivor effect, high disease background rates, exposure measurement error, and confounding and effect modification (Stayner et al. 2003). However, not all of these factors are equally likely to have played a role in occupational studies on benzene and leukemia. Confounding should be considered a potential factor that might have introduced attenuation, but none of the included studies demonstrated (or was able to demonstrate) a confounding effect from potential confounders such as ionizing radiation, smoking, and family history of leukemia (Zeeb and Blettner 1998). In addition, it is unlikely that these factors could have caused serious distortion of the study findings considering the general lack of association with exposure to benzene across the assessed industries (ionizing radiation), weak association with leukemia (smoking), and rare occurrence (family history of leukemia) (Zeeb and Blettner 1998).
Factors that contributed to the heterogeneity observed between studies were differences in study design and exposure assessment. Although all studies were comparable regarding the context of exposure (occupational exposure), considerable differences exist in the geographical location, type of industry, and intensity and frequency of exposure to benzene (Table 1). Differences in study design resulted in different types of risk estimates that were reported: RRs, ORs, and SMRs (Table 1). However, ORs and SMRs can be interpreted as reasonable approximations of the RR when the disease is rare, and these measures have been pooled with RRs for meta-analysis in previous analyses (McElvenny et al. 2004; Steinmaus et al. 2008). According to our evaluation, the quality of exposure assessment was sufficient in all included studies (Vlaanderen et al. 2008). However, systematic differences in exposure assessment strategies between studies might have contributed to the between-study heterogeneity. Because all included studies assessed exposure retrospectively based on a relatively limited set of exposure measurements, exposure estimation in these studies was based partly on decision rules to extrapolate exposure measurements to time periods and exposure circumstances for which no measurements were available (Bloemen et al. 2004; Costantini et al. 2003; Glass et al. 2003; Hayes et al. 1997; Rinsky et al. 2002; Rushton and Romaniuk 1997; Schnatter et al. 1996; Swaen et al. 2005; Wong 1987). The significant amount of expert judgment that goes into these decision rules makes it conceivable that systematic differences in exposure assessment may exist between studies. This situation is illustrated by the exposure assessment for the Pliofilm cohort where three groups of authors have published three different sets of exposure estimates, based on the same exposure measurement data (Crump 1996; Paustenbach et al. 1992; Rinsky et al. 1987, 2002). In contrast, the three nested case–control studies attempted to limit systematic error in exposure assessment by applying similar exposure assessment approaches (Glass et al. 2003; Rushton and Romaniuk 1997; Schnatter et al. 1996).
We tried to assess the potential impact of between-study heterogeneity in the sensitivity analyses. Visual inspection of the results from a jackknifing analysis (Figure 3) showed that two studies that provided risk estimates for the highest assigned cumulative exposures had a considerable impact on the ERC for the higher exposure range. The impact on the lower exposure range was less pronounced. Exclusion of the Australian Health Watch (AHW; Atkinson et al. 2001; Glass et al. 2003) study, which reported relatively high risks for the low-exposure range (Figure 1), had little impact on the shape of the ERC. Results from the sensitivity analysis stratified by study design indicated considerable differences between the ERC based on the nested case–control studies from the petroleum industry and the ERC based on the cohort studies. The shape of the ERC based on the cohort studies only was similar to the shape of the ERC based on all studies. Although the shape of the ERC for the nested case–control studies could be estimated only very imprecisely, the results indicated that these studies were largely responsible for the rather high intercepts that were predicted for the ERCs based on the full data.
Assessing publication bias in a flexible meta-regression is complicated because no standard statistical approaches are available to deal with the correlated effect estimates and the nonlinear exposure–response relations. However, in our opinion it is unlikely that studies with quantitative benzene exposure estimates would not have reported risk estimates for leukemia even if these had been negative because this is one of the major cancer outcomes associated with the exposure. Also, considering the large effort that is required to generate quantitative benzene exposure estimates, it is improbable that any study that had quantitative exposure estimates available would not have been published at all.
Considering that risk estimates in the included studies were calculated in reference to populations with assumed no or negligible occupational exposure to benzene, one might have expected a predicted marginal intercept (ln RR) of approximately 0 (RR = 1) at 0 ppm-years. Although intercepts above 0 are frequently observed in exposure–response studies based on epidemiologic data, most other meta-regression studies have avoided the issue by forcing their regression models to fit through the origin (Bagnardi et al. 2004; Greenland and Longnecker 1992). We did include intercepts in our regression models to attain the best possible fit to the data, which resulted in intercepts of RRs of 1.33 and 1.65 in the natural spline and linear regression models, respectively. An explanation for these nonzero intercepts may be the lack of risk estimates for very low exposure levels (< 0.32 ppm-year). Risk estimates at slightly higher, but still very low, exposure levels (ppm-years) already indicate a strong increase in risk. A natural spline, being linear in its tails, may be unable to track the curvature of the ERC at these low levels, resulting in a nonzero intercept. The same reasoning would apply to intercepts from the linear model, although the effect may be even more extreme. However we cannot exclude the effect of lower-than-expected leukemia risk in the reference populations or (conversely) higher nonbenzene-related leukemia risk in the exposed populations. Finally, attenuation of the ERC due to random and systematic error in the exposure assessment might also have forced the intercept up (Berman and Crump 2008; Stayner et al. 2003).
Implications of the findings for quantitative risk assessment (QRA). To facilitate the use of our meta-regression results in QRA, we provided the benzene leukemia ERC for three plausible scenarios (Figure 5, Table 2). In scenario A, the natural spline model was used, which fitted the data slightly better than a linear model. However, the estimate of leukemia risk at 0 ppm-years (the intercept) for this model was much lower than that for the linear model. Because application of these models for risk assessment purposes will most likely entail subtraction of the intercept from all predictions [effectively lowering the predicted (increased) risks for benzene at each exposure level], we favor scenario A (Figure 5A) for risk prediction over the alternatives (Table 2). If one believes that the intercept is due to the natural spline failing to track the shape of the ERC at very low exposures, one may prefer predictions from a spline model without an intercept, thus forcing the predicted ERC through the origin (scenario B, Figure 5B). This model fitted the data only slightly worse than scenario A and may therefore be considered a plausible alternative [chi-square test (1 df), p = 0.11]. If one believes that the nested case–control studies should not be used for prediction of the ERC, scenario C (Figure 5C) could be used, which was based on a natural spline model with an intercept fitted to data from the cohort studies only. This scenario resulted in slightly lower predicted risks for exposures < 100 ppm-years. Finally, scenarios D1 (all studies, Figure 5D) and D2 (cohort studies only) were based on linear models without an intercept and are therefore similar in spirit to models commonly used in QRA (McDonald 2001; U.S. EPA 1998) (Figure 5). Clearly, these models fitted the data clearly worse than did the relevant alternative models in scenario A [chi-square test (2 df), p = 0.002] and scenario C chi-square test (2 df), p = 0.02].
To compare predictions from scenarios A–D with predictions from existing QRAs, we estimated RRs for three cumulative exposures in the low exposure range (10, 20, and 40 ppm-years; Table 2). This showed that risk estimates from our models at these exposures are very similar to those based on the (multiplicative) models used by the U.S. EPA and the California Environmental Protection Agency in their QRA of benzene (McDonald 2001; U.S. EPA 1998). However, although these QRAs were based on data from either the Pliofilm study or the CAPM-NCI study, our approach allowed us to use all available epidemiologic evidence to date and should therefore be more robust. In addition, our approach allowed for a nonlinear shape of the ERC to be used in QRA, which appears to be more appropriate. It is important to note that all analyses were performed on the overarching disease outcome “leukemia.” Slight differences in the definition of this disease existed between studies (Table 1). Unfortunately, analyses for specific subtypes of leukemia will be hampered by a lack of data.
Flexible meta-regression of the aggregated risk estimates from a set of occupational human observational studies offers an efficient approach to acquiring more insight in the functional relation between exposure to benzene and leukemia. The flexible meta-regression model predicted a supralinear shape of the ERC. Although the limited number of available studies and the large heterogeneity between studies were considerable limitations, sensitivity analyses demonstrated that results were not strongly affected. Our application of a flexible meta-regression method provides the most comprehensive evaluation of the benzene–leukemia ERC to date.
- Atkinson S, Coppock J, Fritschi L, Glass DC, Gibbons C, Gray CN, et al. 2001. Monash University and Deakin University.. Lympho-haematopoietic Cancer and Exposure to Benzene in the Australian Petroleum Industry. Technical Report and Appendices. Available: http://www.aip.com.au/pdf/health/case_study.pdf [accessed 2 February 2010]
- ATSDR (Agency for Toxic Substances and Disease Registry) 2007. Toxicological Profile for Benzene. Available: http://www.atsdr.cdc.gov/toxprofiles/tp3.html [accessed 2 February 2010]
- Bagnardi V, Zambon A, Quatto P, Corrao G.. 2004. Flexible meta-regression functions for modeling aggregate dose-response data, with an application to alcohol and mortality. Am J Epidemiol 159(11):1077–1086.
- Berlin JA, Longnecker MP, Greenland S. 1993. Meta-analysis of epidemiologic dose-response data. Epidemiology 4(3):218–228.
- Berman DW, Crump KS. 2008. Update of potency factors for asbestos-related lung cancer and mesothelioma. Crit Rev Toxicol 38: ( suppl 1):1–47.
- Bloemen LJ, Youk A, Bradley TD, Bodner KM, Marsh G. 2004. Lymphohaematopoietic cancer risk among chemical workers exposed to benzene. Occup Environ Med 61(3):270–274.
- Boice JD Jr, McLaughlin JK. 2001. Errors in TCE analysis. Environ Health Perspect 109:A108–A109.
- Bond GG, McLaren EA, Baldwin CL, Cook RR. 1986. An update of mortality among chemical workers exposed to benzene. Br J Ind Med 43(10):685–691.
- Capleton AC, Levy LS. 2005. An overview of occupational benzene exposures and occupational exposure limits in Europe and North America. Chem Biol Interact 153–154:43–53.
- Collins JJ, Ireland B, Buckley CF, Shepperly D. 2003. Lymphohaematopoeitic cancer mortality among workers with benzene exposure. Occup Environ Med 60(9):676–679.
- Costantini A, Quinn M, Consonni D.. 2003. Exposure to benzene and risk of leukemia among shoe factory workers. Scand J Work Environ Health 29(1):51–59.
- Crump KS. 1996. Risk of benzene-induced leukemia predicted from the Pliofilm cohort. Environ Health Perspect 104: ( suppl 6):1437–1441.
- DerSimonian R, Laird N.. 1986. Meta-analysis in clinical trials. Control Clin Trials 7(3):177–188.
- Glass DC, Gray CN, Jolley DJ, Gibbons C, Sim MR, Fritschi L, et al. 2003. Leukemia risk associated with low-level benzene exposure. Epidemiology 14(5):569–577.
- Greenland S, Longnecker MP. 1992. Methods for trend estimation from summarized dose-response data, with applications to meta-analysis. Am J Epidemiol 135(11):1301–1309.
- Guénel P, Imbernon E, Chevalier A, Crinquand-Calastreng A, Goldberg M.. 2002. Leukemia in relation to occupational exposures to benzene and other agents: a case-control study nested in a cohort of gas and electric utility workers. Am J Ind Med 42(2):87–97.
- Hartemink N, Boshuizen HC, Nagelkerke NJ, Jacobs MA, van Houwelingen HC. 2006. Combining risk estimates from observational studies with different exposure cutpoints: a meta-analysis on body mass index and diabetes type 2. Am J Epidemiol 163(11):1042–1052.
- Hayes RB, Yin SN, Dosemeci M, Li GL, Wacholder S, Travis LB, et al. 1997. Benzene and the dose-related incidence of hematologic neoplasms in China. Chinese Academy of Preventive Medicine–National Cancer Institute Benzene Study Group. J Natl Cancer Inst 89(14):1065–1071.
- Johnson ES, Langard S, Lin YS. 2007. A critique of benzene exposure in the general population. Sci Total Environ 374(2-3):183–198.
- Kim S, Vermeulen R, Waidyanatha S, Johnson BA, Lan Q, Rothman N, et al. 2006. . Using urinary biomarkers to elucidate dose-related patterns of human benzene metabolism. Carcinogenesis 27(4):772–781.
- Kim S, Vermeulen R, Waidyanatha S, Johnson BA, Lan Q, Smith MT, et al. 2006. . Modeling human metabolism of benzene following occupational and environmental exposures. Cancer Epidemiol Biomarkers Prev 15(11):2246–2252.
- Lacasse Y, Martin S, Gagné D, Lakhal L.. 2009. Dose–response meta-analysis of silica and lung cancer. Cancer Causes Control 20(6):925–933.
- Lan Q, Vermeulen R, Zhang L, Li G, Rosenberg PS, Alter BP, et al. 2006. Benzene exposure and hematotoxicity. Science 312(5776):998–999.; doi: 10.1126/science.312.5776.998b
- Lan Q, Zhang L, Li G, Vermeulen R, Weinberg RS, Dosemeci M, et al. 2004. Hematotoxicity in workers exposed to low levels of benzene. Science 306(5702):1774–1776.
- McDonald TA. 2001. Public Health Goal for Benzene in Drinking Water. Sacramento:California Environmental Protection Agency, Office of Environmental Health Hazard Assessment.
- McElvenny DM, Armstrong BG, Järup L, Higgins JP. 2004. Meta-analysis in occupational epidemiology: a review of practice. Occup Med (Lond) 54(5):336–344.
- NIOSH (National Institute for Occupational Safety and Health) 1976. Revised Recommendation for an Occupational Exposure Standard for Benzene. Available: http://www.cdc.gov/niosh/76-benz.html [accessed 2 February 2010]
- Paustenbach DJ, Bass RD, Price P. 1993. Benzene toxicity and risk assessment, 1972–1992: implications for future regulation. Environ Health Perspect 101: suppl 6):177–200.
- Paustenbach DJ, Price PS, Ollison W, Blank C, Jernigan JD, Bass RD, et al. 1992. Reevaluation of benzene exposure for the Pliofilm (rubberworker) cohort (1936–1976). J Toxicol Environ Health 36(3):177–231.
- Penney J. 1995. Toronto, Canada:Canadian Centre for Occupational Health and Safety. Report to the Occupational Disease Panel (Industrial Disease Standards Panel) on Occupational Exposure to Benzene and Leukaemia. Available: http://www.canoshweb.org/odp/html/rp7.htm [accessed 2 February 2010]
- Rappaport SM, Kim S, Lan Q, Vermeulen R, Waidyanatha S, Zhang L, et al. 2009. Humans activate benzene via two metabolic pathways. Environ Health Perspect 117:946–952.
- Rappaport SM, Waidyanatha S, Qu Q, Shore R, Jin X, Cohen B, et al. 2002. . Albumin adducts of benzene oxide and 1,4-benzoquinone as measures of human benzene metabolism. Cancer Res 62(5):1330–1337.
- Rappaport SM, Waidyanatha S, Yeowell-O’Connell K, Rothman N, Smith MT, Zhang L, et al. 2005. Protein adducts as biomarkers of human benzene metabolism. Chem Biol Interact 153-154:103–109.
- Rappaport SM, Yeowell-O’Connell K, Smith MT, Dosemeci M, Hayes RB, Zhang L, et al. 2002. . Non-linear production of benzene oxide-albumin adducts with human exposure to benzene. J Chromatogr B Analyt Technol Biomed Life Sci 778(1-2):367–374.
- Rinsky RA, Hornung RW, Silver SR, Tseng CY. 2002. Benzene exposure and hematopoietic mortality: a long-term epidemiologic risk assessment. Am J Ind Med 42(6):474–480.
- Rinsky RA, Smith AB, Hornung R, Filloon TG, Young RJ, Okun AH, et al. 1987. Benzene and leukemia. An epidemiologic risk assessment. N Engl J Med 316(17):1044–1050.
- Rothman KJ, Boice JD, Austin H. 1982. Epidemiologic Analysis with a Programmable Calculator. Boston, MA:Epidemiology Resources.
- Rothman N, Bechtold WE, Yin SN, Dosemeci M, Li GL, Wang YZ, et al. 1998. Urinary excretion of phenol, catechol, hydroquinone, and muconic acid by workers occupationally exposed to benzene. Occup Environ Med 55(10):705–711.
- Rushton L, Romaniuk H.. 1997. A case-control study to investigate the risk of leukaemia associated with exposure to benzene in petroleum marketing and distribution workers in the United Kingdom. Occup Environ Med 54(3):152–166.
- Schnatter AR, Armstrong TW, Nicolich MJ, Thompson FS, Katz AM, Huebner WW, et al. 1996. Lymphohaematopoietic malignancies and quantitative estimates of exposure to benzene in Canadian petroleum distribution workers. Occup Environ Med 53(11):773–781.
- Schnatter AR, Rosamilia K, Wojcik NC. 2005. Review of the literature on benzene exposure and leukemia subtypes. Chem Biol Interact 153-154:9–21.
- Shi JQ, Copas JB. 2004. Meta-analysis for trend estimation. Stat Med 23(1):3–19, 159–162.
- Stayner L, Steenland K, Dosemeci M, Hertz-Picciotto I.. 2003. Attenuation of exposure-response curves in occupational cohort studies at high exposure levels. Scand J Work Environ Health 29(4):317–324.
- Steinmaus C, Smith AH, Jones RM, Smith MT. 2008. Meta-analysis of benzene exposure and non-Hodgkin lymphoma: biases could mask an important association. Occup Environ Med 65(6):371–378.
- Swaen GM, Scheffers T, de Cock J, Slangen J, Drooge H. 2005. Leukemia risk in caprolactam workers exposed to benzene. Ann Epidemiol 15(1):21–28.
- U.S. Environmental Protection Agency 1998. Carcinogenic Effects of Benzene: An Update. EPA/600/P-97/001F. Washington, DC:U.S. Environmental Protection Agency.
- Vlaanderen J, Vermeulen R, Heederik D, Kromhout H.. 2008. Guidelines to evaluate human observational studies for quantitative risk assessment. Environ Health Perspect 116(12):1700–1705.
- Wartenberg D, Reyner D, Scott CS. 2000. Trichloroethylene and cancer: epidemiologic evidence. Environ Health Perspect 108: suppl 2):161–176.
- Wong O.. 1987. An industry wide mortality study of chemical workers occupationally exposed to benzene. II. Dose response analyses. Br J Ind Med 44(6):382–395.
- WHO 1967. nternational Classification of Diseases (ICD), Eighth revision. Geneva:World Health Organization.
- WHO 1977. International Classification of Diseases (ICD), Ninth revision. Geneva:World Health Organization.
- Zeeb H, Blettner M.. 1998. Adult leukaemia: what is the role of currently known risk factors? Radiat Environ Biophys 36(4):217–228.
Introducing Children’s Health Collection 2014
EHP’s fifth annual Children’s Health Collection is now available. The collection comprises abstracts of all relevant articles published in EHP from October 2013 through September 2014: peer-reviewed research articles, news features, Science Selections, and editorials.
ISEE 2014 Abstracts Now Available
EHP is pleased to present the abstracts for the 26th annual conference of the International Society for Environmental Epidemiology (ISEE), From Local to Global: Advancing Science for Policy in Environmental Health, held 24–28 August 2014 in Seattle, Washington.
Sign Up to Receive E-mail Alerts
Recent Advance Publications
Variation in Urinary Flow Rates According to Demographic Characteristics and Body Mass Index in NHANES: Potential Confounding of Associations between Health Outcomes and Urinary Biomarker Concentrations
Air Pollution and Atherosclerosis: A Cross-Sectional Analysis of Four European Cohort Studies in the ESCAPE Study
Metro Nature, Environmental Health, and Economic Value
Association between Ambient Air Pollution and Diabetes Mellitus in Europe and North America: Systematic Review and Meta-Analysis
Environmental, Dietary, Maternal, and Fetal Predictors of Bulky DNA Adducts in Cord Blood: A European Mother–Child Study (NewGeneris)