Although the link between particulate matter (PM) and mortality has been investigated for some time, the interpretation of this connection remains controversial. Early cross-
sectional studies (
1-3) found that PM had a significant association with mortality rates, measured as total suspended particles (TSP), sulfates, or other particle size distributions [including particulate matter

10 µm in aerodynamic diameter (PM
10)]. These studies were bolstered by more recent time-series studies, many of which found statistically significant increases in all-age and all-cause mortality associated with increases in ambient PM concentrations (
4-24). However, some recent studies found that the PM-mortality relationship was statistically insignificant at the 95% confidence level (
25-28), whereas others found both significant and insignificant relationships, depending on the study setting (
29-32). In addition, some reanalyses of previous studies argued that reported positive findings could be explained by correlated gaseous pollutants, weather, season, or the analytical model used (
33-35). Studies that considered multiple particle sizes have come to different conclusions about which size range is largely responsible for increased mortality, with recent evidence on the role of fine particles (
31) and on stronger relationships with PM
10 (
29).
Although there is some toxicologic evidence that supports the role of PM in human mortality (36-38), the mechanisms of action are not yet well understood, placing epidemiologic evidence at the center of the debate. The variability among epidemiologic findings and their interpretations played a major role in the contentious debate (39) over the revised National Ambient Air Quality Standard for PM (40).
Because the epidemiologic studies differ in a number of ways, the variability in findings could be a function of site-specific differences, analytical decisions, or simply random variation. In these studies, PM concentrations were measured in a number of ways, with conversion between measurements impeded by geographic and temporal variability in particle size distributions. These studies were set in different geographic regions, with a wide range of ambient pollution concentrations and correlations between PM and gaseous pollutants. The study sites differed in their demographic compositions, residential environments, and baseline mortality rates and patterns. The studies used different statistical models, including different lag times and averaging times, controls for confounding pollutants, and consideration of climate and season. Finally, the studies were conducted by a limited number of research groups, suggesting potential correlations related to analytical methods and choices.
In this study, we focused on whether any of these characteristics can explain some of the differences in effect estimates, and we attempted to determine the magnitude of the independent relationship between PM10 and mortality (both at specific sites and averaged across sites). Previous reviews and meta-analyses have not adequately answered these questions because of analytical limitations. Many review articles have relied on qualitative descriptions of pros and cons for a subset of studies, discussing the credibility of the evidence related to potential confounding by climate or correlated pollutants. Although these analyses are valuable, the lack of a quantitative base leads these reviews to very different conclusions depending on the studies chosen and the points argued; some authors conclude that the existing epidemiologic evidence clearly shows a causal relationship (41-44), but others feel that this relationship is spurious (45,46).
Past meta-analyses have used methods such as averages of central estimates (47) and variance-weighted averages of percentage increases in mortality per unit of pollution (48) or of PM10-mortality elasticities (49). All of these models implicitly assume fixed effects, in which each effect estimate ßi is a random sample from a single underlying distribution N(µi,
i2). The U.S. Environmental Protection Agency (EPA) criteria document (50) used random effects models to estimate PM mortality, assuming that each ßi is drawn from N(µi,
i2), where the µi are random values drawn from N(µ,
2), accounting for between-study variability. This is an appealing concept because of the expected heterogeneity among sampling sites.
Although random effects models quantify the amount of residual variance that can be explained by study-specific factors, they cannot determine what these factors are or how they would influence the effect estimates. Because of the number of potential confounders and effect modifiers for PM mortality, these factors must be quantitatively evaluated before determining the true relationship between PM and mortality.
To evaluate these factors, we conducted a screening analysis by applying random effects models to stratifications of study estimates. We evaluated confounders in a multivariate context using empirical Bayes (EB) meta-analysis, which considers mixed effects in a two-stage hierarchical linear model, decomposing within-study and between-study variability. We used this model because of its ability to incorporate both specified study characteristic differences and random effects, and because it can provide posterior site-
specific estimates using information from all studies. We can use these estimates to determine the mortality impacts directly attributable to PM, to pinpoint confounding
variables, and to estimate the expected effects in new settings. These findings would be useful for externality assessment or benefit-cost analysis of PM remediation; if effect estimates vary across populations, this information could be incorporated with demographics and emission profiles for use in site-specific impact analyses.
We gathered time-series studies for this analysis from the EPA criteria document (50), previously published meta-analyses or review articles, and Medline (National Library of Medicine, Bethesda, MD), and Current Contents (Institute for Scientific Information, Philadelphia, PA) searches for studies including the key words "mortality" and "particle" or "particulate matter." We excluded studies if they did not contain basic population and concentration data or if they lacked gravimetric measures of PM (although studies that conducted on-site calibration between optical measures and gravimetric measures were included). We also excluded studies that only calculated elasticities without considering absolute concentrations, that did not report sampling variability, that did not report single-pollutant estimates, or that focused only on respiratory or elderly mortality. Finally, we excluded studies if the same authors reanalyzed the data in an updated article. Analyses of the same site and time period by different authors are all included, with the potential effects of double-counting considered in the analysis.
For our models, we assumed that these studies were independent samples from a random distribution of the conceivable population of studies. Clearly, given potential overweighting of sites and multiple studies conducted by the same authors using the same methodology, this assumption was difficult to support. To determine whether our results were sensitive to study selection, we conducted analyses with different subsets of studies in the sensitivity analysis. We also assessed the assumption of random selection by comparing the characteristics of our sample with the population at large.
In total, we selected 29 estimates of the PM-mortality link from 21 published studies. Of these 29 estimates, 14 were from PM10 analyses in the United States, 4 were from PM10 analyses outside the United States, 5 were from TSP analyses in the United States, and 6 were from TSP analyses outside the United States. We converted all of the TSP estimates to PM10 using reported values [0.50; (18)] or a default value of 0.55. Because of errors associated with this conversion, we considered PM10 and TSP studies both separately and in conjunction.
With these 29 estimates, we used reported effect estimates that did not analytically control for correlated gaseous pollutants. This was mandated by the fact that only 11 of the 29 single-pollutant estimates were accompanied by multivariate estimates (many authors controlled for confounding by choosing sites with minimal concentrations of key pollutants or by studying multiple sites with a range of pollution patterns). These effect estimates also allowed us to use the EB model to estimate confounding by gaseous pollutants. When studies gave multiple effect estimates, we chose the estimate that was presented by the authors as most reasonable considering all short- and long-term time trends, as well as climate and seasonal factors.
We considered three basic categories of predictors that might act as confounders or effect modifiers for biologic or physical reasons: pollution-related variables, demographic/
site characteristics, and analytical factors (Table 1). In the first category, we derived ambient PM concentrations from the study text to account for possible dose-response nonlinearities. In addition, we wanted to estimate the relationship between PM concentrations and concentrations of gaseous pollutants, which may be independently linked with premature mortality [ozone (O3), sulfur dioxide (SO2), nitrogen dioxide (NO2), and carbon monoxide (CO)]. To estimate these relationships, we gathered daily average gaseous pollutant and PM concentrations for the relevant time periods from the Aerometric Information Retrieval System (AIRS; U.S. EPA, Washington, DC) and from the Harvard Six Cities Study (51). With these data, we ran univariate linear regressions with PM as the independent variable to estimate the magnitude of gaseous pollutant concentration change associated with a 1-µg/m3 increase in PM concentrations. These regression coefficients are more informative than correlations because they account for concentration magnitudes as well as relationships between pollutants. We also estimated ratios of particulate matter
2.5 µm in aerodynamic diameter (PM2.5)/PM10 from study texts (29,31), AIRS data covering the study sites and time periods, the EPA criteria document (50), or by using data collected from nearby sites.
We extracted baseline mortality rates from the studies. We also used data on the percentage of the population older than 65 years of age and the percentage of the population in poverty from the 1990 U.S. Census (U.S. Census Bureau, Suitland, MD). The census data were used to incorporate factors such as age-dependent mortality and accessibility of health care. Because personal exposures can be affected by indoor air quality and air exchange rates, we included multiple housing characteristics in the analysis (prevalence of central air conditioning, gas stoves, and warm air furnaces). These characteristics were taken from the American Housing Survey conducted by the U.S. Census Bureau, using data from the nearest metropolitan area for the relevant study years. We estimated heating and cooling degree days for all sites from the International Station Meteorological Climate Summary (52) to approximate differences in climate. Averaging time and lag time were extracted from each study to incorporate the influence that analytical decisions can have on the PM-mortality relationship.
Analytical methodology. In this paper, we focused our analysis on standard random effects and EB models. For both models, the derivations we present are only meant to provide familiarity with the underlying assumptions and terminology. More comprehensive model derivation can be found in papers by DerSimionian and Laird (53) and Raudenbush and Bryk (54).
We determined pooled central estimates using the random effects model (RE) derived by DerSimionian and Laird (53). This model assumes that ßi (the reported effect from study i) is comprised of a true effect µi with a sampling error ei that is N(0, si2) for all i = 1, ..., n. The true effect µi is decomposed into the mean population effect µ and a between-study variability term
i that is N(0,
2).
We assessed homogeneity of the studies with Cochran's Q-statistic, defined as
Q =
wi(ßi - ß*)2 [1]
where wi is 1/si2 and ß* is the weighted average of the effect estimates, weighted by wi. Under the null hypothesis of homogeneity, Q is approximately a
2 statistic with n - 1 degrees of freedom. Given this calculation,
2 can be estimated as
[2]
Finally, we define wi* as 1/(si2 +
2), and the estimate of µ is the weighted average of the effect estimates, weighted by wi*. This methodology is an extension of simple variance weighting, incorporating study heterogeneity.
We incorporated study characteristics using a mixed effects EB model derived by Raudenbush and Bryk (54). With all variables defined as above, µi is assumed to be a function of both known study characteristics and random error. Thus, rather than defining µi = µ +
i, we define µi = Wi´
+
i, where Wi is a (q
1) vector of characteristics varying by study, and
is a (q
1) vector of coefficients estimated to describe between-study variability. In this model,
2 is determined by maximum likelihood methods, where the log of the likelihood function is proportional to:
-
log(si2 +
2) - log|
(si2 +
2)-1WiWi´|
-
(si2 +
2)-1(ßi - Wi´
*)2 [3]
In Equation 3,
* is the maximum likelihood estimate for the vector of derived coefficients, defined as (
iWiWi´)-1
iWißi, where
i is
2/(si2 +
2).
Given these iteratively derived estimates, the maximum likelihood estimate for µi is the weighted average of the reported estimate ßi and the between-study variability estimate Wi´
*, where the weight on ßi is
2/(
2 + si2) and the weight on Wi´
* is si2/(
2 + si2). As explained by Raudenbush and Bryk (54), this allows data drawn from completely homogeneous populations to be described by study characteristics, whereas data drawn with no knowledge of other studies are best described by the prior effect estimates. For other scenarios, a weighted average of these values that minimizes squared-error loss is most appropriate. Thus, EB provides posterior estimates for each study, in contrast to RE, in which the sole output is the pooled effect estimate.
Summary of study findings. The effect estimates from all of the studies are presented in Table 2 as the percentage change in daily mortality associated with a 10-µg/m3 increase in PM10 concentrations. The central effect estimates ranged from a low of -0.5% to a high of 2.6%. For these studies, the pooled random effects estimate is 0.73% [95% confidence interval (CI), 0.59-0.87%], with significant heterogeneity in effect estimates (p < 0.01). The pooled estimates were similar when the studies were stratified by location and particle measure, with estimates of 0.63 and 0.83% for PM10 and TSP, respectively, and 0.70 and 0.80% for U.S. and non-U.S. studies, respectively. Stratifying across both dimensions, we found pooled estimates of 0.67% for PM10/U.S., 0.57% for PM10/non-U.S., 0.77% for TSP/U.S., and 0.93% for TSP/non-U.S. studies. Because of the relative stability of estimates from studies within the United States and the uniformity of data sources, we focused our analytical efforts on these 19 study estimates. Thus, our central random effects estimate was 0.70% (CI, 0.54-0.86%), with marginal heterogeneity in effect estimates (p = 0.1). In comparison, previous meta-analyses have estimated pooled values of 1.0% (47,48,55) and 0.5-1.0% (50), using different subsets of studies.
For these 19 effect estimates, the relationship between daily average PM10 concentrations and gaseous pollutant concentrations varied widely (Table 3). There appeared to be some geographic variability, with higher SO2 coefficients in the East Coast and Rust Belt and higher O3 coefficients in the Midwest. If any of these pollutants were independently related to increased mortality, this would imply differences in the PM10-mortality relationship by site. The relationships between coefficients are similar if high-hour gaseous pollutant concentrations are considered in lieu of daily average concentrations.
Figure 1 shows the relationships between effect estimates and potential confounders/
effect modifiers. To better understand the univariate relationships, we stratified the 19 effect estimates across some of these variables. This stratification shows a strong relationship between the PM2.5/PM10 ratio and PM10 mortality rates, with weaker relationships for a number of other variables (Table 4). However, some of these relationships are counterintuitive, which may be a result of confounding due to strong correlations between predictors (Table 5).
Figure 1. Univariate relationship between the PM10-mortality effect estimate and predictive variables for all 19 U.S. studies. Abbreviations: AC, air conditioning; avg, average. The graph for the PM2.5/PM10 ratio is indicated by the arrows.
EB model. If we apply the EB model to our 19 U.S. study estimates, a model without predictors yields an estimated grand mean of 0.70% (CI, 0.54-0.85%), similar to the random effects estimate. To add predictors to the model from a large set of potential candidates, given a small number of studies and correlated predictors, we followed the methodology of Bryk and Raudenbush (56). We used the "t-to-enter" statistic, which was based on a simple linear regression of the predictor in question on the EB residuals. Like forward regression, predictors were entered one at a time, using a threshold of t = 1.
For these studies, the predictor with the greatest "t-to-enter" was the warm air furnace prevalence (t = 1.43). Adding this predictor to the model decreased
2 from 0.032 to 0.020, indicating that unexplained heterogeneity remained. Additional variables were added, yielding six other significant predictors, which entered the model in the order that they are presented in Table 6. The PM2.5/PM10 ratio was the most statistically significant covariate in the final model. With these predictors,
2 was reduced to 0.00006, indicating that much of the between-study heterogeneity was explained. The value of
2 was slightly lower if only the first three predictors were included in the model. Interaction terms (e.g., between air conditioning prevalence and cooling degree days) were tested and were not significant.
To assess confounding with gaseous pollutants other than SO2, despite the lack of statistical significance, we also generated an EB model with all of the significant predictors and gaseous pollutant regression coefficients. As shown in Table 6, these coefficients are statistically insignificant and only the O3 coefficient has any explanatory power.
We can use our EB model to make posterior estimates for these 19 studies, and we can also estimate the degree to which the association between mortality and PM10 is influenced by correlated gaseous pollutants. Using the optimum model, the posterior EB estimates for the 19 studies have reduced heterogeneity, with values largely ranging between 0.6 and 1.0%. The heterogeneity is further reduced if the model is limited to three predictors. To estimate average confounding by gaseous pollutants, we used the EB model with all of the gaseous pollutant terms forced into the model. When we used this model with the mean population characteristics for all predictors, the average mortality effect was 0.7%. If we control for correlated pollutants by setting the gaseous pollutant regression coefficients to zero, our estimate is reduced to 0.2%. Because forcing insignificant terms into the model may result in overmodeling, we also estimated confounding using our optimum model, which only considered confounding from SO2. Using this model, the effect for a site with mean study characteristics is reduced from 0.7 to 0.4%.
Sensitivity analysis. Because of the small number of study estimates and uncertainties in many dimensions, a detailed sensitivity analysis is needed before conclusions can be made about potential causal predictors of the PM10-mortality relationship. We tested the sensitivity of our findings to study selection and model selection.
First, to test whether the inclusion of studies measuring TSP induced estimation errors or changed the variables entering the model, we considered only the 14 PM10/U.S. studies. For these studies, we found that the PM2.5/PM10 ratio entered the EB model first, followed by heating degree days, central air prevalence, gas stove prevalence, the O3 regression coefficient, and averaging time. For the terms in common, all of the coefficients were similar in magnitude and direction to the optimum model. When all of the gaseous pollutants were forced into the model, the O3 and SO2 regression coefficients remained the most statistically significant gaseous pollutant terms.
We also expanded the analysis to include all 29 study estimates. Because we only have data for a subset of predictors for all studies, we could not make direct comparisons with our 19-study model. We considered the subset of predictors that included ambient PM10 concentration, averaging time and lag time, percent of the population older than 65 years of age, baseline mortality rate, heating and cooling degree days, and dummy variables for PM10/TSP and U.S./non-U.S. studies. Of these predictors, only baseline mortality rate entered the EB model, with significant unexplained heterogeneity (p = 0.005). The grand mean estimate was similar to that of the 19-study model (0.72%).
We also tested the sensitivity of our findings to the decision to include multiple studies of the same location, to avoid the potential effects of double counting. We reduced the sample size to 13, with 48 potential study combinations, when we considered only one study per city. The PM2.5/PM10 ratio was the only predictor that entered the EB model for all combinations of studies. In addition, gas stove prevalence, the SO2 and O3 regression coefficients, warm air furnace prevalence, heating degree days, and percent of the population older than 65 years of age appeared in at least two-thirds of study combinations (in descending order of frequency). The PM2.5/PM10 ratio was the first predictor that entered the regression for 56% of the study combinations.
Finally, we tested the sensitivity of our findings to the choice of an EB model for the inclusion of study characteristics. We applied a random effects regression model (57) to the 19 U.S. study estimates, which estimated a grand mean value of 0.71% (CI, 0.54-0.87%), which was nearly identical to EB. When all predictors from the optimum model were included, the coefficients and standard errors were nearly identical to the EB model, with similar posterior effect estimates.
Clearly, interpretation of the above findings is the most difficult portion of this analysis. Of all of the variables tested, the PM2.5/
PM10 ratio appears to be the strongest predictor of the relationship between PM10 and mortality. The positive coefficient for the PM2.5/PM10 ratio provides additional evidence of the role of fine particles in increased mortality rates.
Although not as robust, other variables entering the model could have some interesting implications if proven valid. The significance of the SO2 regression coefficient (and the O3 regression coefficient in many subanalyses) might imply an independent link with mortality for these pollutants, with the SO2 term potentially linked to sulfate particles as well as gaseous pollutant effects. The negative coefficient for heating degree days could imply a greater mortality effect in warmer climates, possibly related to more rapid conversion of NO2 and SO2 to fine particles, heat-related phenomena, or activity patterns. The negative coefficient for warm air furnace prevalence could be related in part to its correlation with central air conditioning, with the two components together influencing the penetration of particles from the outdoors.
The negative coefficient for gas stove prevalence is puzzling because there is no evidence of beneficial health effects from increased exposure to combustion pollutants related to gas stoves. This coefficient may be a proxy for terms that are strongly correlated with gas stove presence. For example, gas stove prevalence is higher in cities with fewer cooling degree days; therefore, a negative coefficient would imply a positive relationship with cooling degree days (supporting the greater effects in warmer climates). Similarly, the negative coefficient for the percentage of elderly is counterintuitive, but may be related to a strong positive correlation with terms such as the PM2.5/PM10 ratio and the SO2 regression coefficient.
Although these explanations are plausible, there are a number of barriers that made it difficult to reach definitive conclusions. The model findings are often dependent on a few studies, leading different variables to enter the model with different subsets of studies. Because the PM2.5/PM10 ratio is the only variable that is significant across nearly all of the study combinations, the validity of other terms can be questioned. The magnitude of the EB coefficient, even for the PM2.5/PM10 ratio, depends on the variables included in the model, given a limited number of studies and high correlations between predictors.
Although multivariate analyses are needed to help distinguish among numerous potential predictors, the lack of independence can pose problems, particularly with a large number of predictors. If we use only three predictors in the EB model, the coefficient for the PM2.5/PM10 ratio is reduced from 8.7 to 2.3, demonstrating this influence. Similarly, findings such as the negative coefficients for the percentage of elderly or for gas stove prevalence may be a function of these correlations. These problems are exacerbated by our inability to conduct a complete analysis on all 29 study estimates.
We did not include all analytical differences in our model; study authors used analytical models and methods of controlling for weather that differed in a number of ways which could not be captured quantitatively. Because analytical methods to deal with climate have been targeted as crucial in understanding the true PM effect, this omission is a limitation of our analysis. Analytical methodology could also significantly influence the correlations among study findings because of the dependence of model selection on the author and the time period when the study was conducted.
In addition, some of the predictors that we included in the model may not measure the desired dimensions or may not represent the actual characteristics of the site. Heating and cooling degree days are crude proxies for climate, as weather patterns related to humidity or temperature extremes might be more likely to influence mortality rates. Our housing characteristic data were drawn from the nearest metropolitan area, which may not properly represent nonurban settings. The gaseous pollutant coefficients may not capture the complete relationship between pollutants, particularly if the dose-response relationships are nonlinear or have thresholds, or if peak exposures are more important than daily averages. To test the latter premise, we ran the EB model with high-hour gaseous pollutant concentrations rather than daily average concentrations, and the findings were similar. In general, there may be differences between concentration patterns and exposure patterns, particularly if large populations are represented by a small number of monitors.
Because of the high correlations between predictors, it is difficult to attribute causality to any one variable. Many of the patterns in both pollution concentrations and demographics can be related to geographic region, which may also influence PM10 composition in ways related to mortality effects. As detailed in the EPA criteria document (58), each city has a distinctive particle size distribution and chemical composition based on local sources and meteorologic patterns. For example, organics contribute a substantial fraction of fine particles in Los Angeles, California, whereas sulfates dominate in the Northeast. These complex chemical differences may contribute to differences in mortality impacts, and are not captured by our meta-analysis.
To test the potential effects of chemical composition, we examined sulfate concentrations. Because SO4 concentration data are not available for all 19 U.S. estimates, we used data from the 10 estimates derived from the Six Cities studies (6,29,31,35).For these 10 estimates, the SO4/PM10 ratio is highly correlated with both the mortality effect estimate (r = 0.84) and with the PM2.5/PM10 ratio (r = 0.70). If we conducted an EB regression on these 10 estimates, including both ratios, the SO4/PM10 ratio would enter the equation and the PM2.5/PM10 ratio would not. Because of the high correlation and small sample size, it is difficult to draw conclusions from this analysis, but this demonstrates that constituent components of fine particles may act as stronger predictors of mortality.
It is also possible that we omitted relevant predictors that could explain the significance of the PM2.5/PM10 ratio. We did not include smoking prevalence and other lifestyle-related predictors (e.g., percentage of the population that is overweight or which leads a sedentary lifestyle) in our primary analysis because data were not available at the city or county level. We tested statewide average values for these variables, but none entered the optimum model, likely because of the variability in behaviors within states. To determine whether these omitted variables could plausibly explain our findings, we replicated an earlier experiment for cigarette smoking (59).
Smoking would need to be highly correlated with the PM2.5/PM10 ratio with sufficient variability among sites to explain the entire PM2.5/PM10 ratio effect. Using the optimum model (Table 6) and assuming ambient Los Angeles pollution conditions, the cities with minimum and maximum PM2.5/PM10 ratios (Los Angeles, 0.47; and Knoxville, TN, 0.70, respectively) would have a mortality difference of 2.5
10-6/person/day if all other parameters were constant. Using the rate of 2 million cigarettes per death, as reported by Wilson et al. (59), this implies a consumption difference of approximately 5 cigarettes/person/day. This difference would be found if smokers in Knoxville smoked approximately 1 pack/day more than smokers in Los Angeles, or if the prevalence were 25% in Los Angeles and 46% in Knoxville with average consumption per smoker. Even if we use the lower PM2.5/PM10 coefficient associated with the three-parameter model, the necessary smoking differences are significant, and it is highly unlikely that differences of this magnitude are systematically correlated with PM2.5/PM10 ratios across multiple cities.
Even if the above barriers could be discounted, we also must deal with the issue of study selection bias, which is a problem in all meta-analyses. We tried to avoid selection bias by choosing all of the available studies that fulfilled our criteria, including multiple analyses of the same city when differences occurred. However, there is no meaningful way to avoid a possible "file drawer"
phenomenon in which studies finding no significant relationship between PM10 and mortality may not have been published or submitted for publication. Although this phenomenon could potentially influence our EB predictors, a "file drawer" bias would not greatly affect the general findings of a significant PM-mortality link. Given the body of largely positive evidence, it would take 120 unpublished studies with central estimates of -0.1% and the median variance of our 29 study estimates, or 57 studies with these central estimates and the minimum variance of our study estimates, to yield a statistically insignificant grand mean estimate.
Selection bias could also arise if the sample cities in the analysis were not representative of the true family of conceivable studies. If a subset of cities that were more or less prone to PM health effects were chosen for epidemiologic analyses, this would undermine the generalizability of the analysis and the formulated models. For the 19 U.S. estimates, the percentages of the population younger than 65 years of age and below the poverty level are comparable to the U.S. average. However, many of these studies were conducted in cities in the eastern half of the United States, which may not represent the population at large. Nevertheless, the cities chosen have a wide range of ambient PM10 and gaseous pollutant concentrations and represent both rural and urban locations.
One of our study objectives was to determine the degree to which correlated gaseous pollutants confound the PM-mortality relationship. From our primary regression, SO2 may act as a confounder of PM10 mortality, with some evidence of confounding from O3 in the sensitivity analyses. One way to assess whether the EB model reasonably depicted the influence of confounding pollutants was to look at the six U.S. studies that analytically controlled for correlated gaseous pollutants. We compared the reported multipollutant estimates with posterior estimates from the EB model, forcing the pollutants controlled in each study into the model. The univariate and multivariate effect estimates were similar for all but the Philadelphia studies [Table 7; (7,21)]. The errors for the Philadelphia data may be a function of overmodeling, uncaptured site-specific characteristics, or errors in the SO2-PM regression coefficients because of problems associated with the TSP/PM10 conversion. These errors did not occur when the three-parameter EB model was used, although there was more error associated with the univariate estimates.
Stratified random effects models and EB meta-analysis can provide valuable information both about the set of existing studies and about directions for future investigations. Many of the significant EB predictors are related either to the study location or to characteristics of the housing stock. To help unravel the effects of these correlated predictors, we recommend that future epidemiologic studies target sites that have a limited number of potential confounders or effect modifiers or contain a different mix of predictors than previously considered. For example, Seattle, Washington, has minimal central air and gas stove prevalence in a moderate climate, whereas Phoenix, Arizona, has significant central air prevalence in a warm climate with low sulfate concentrations (58). In general, a more comprehensive geographic spread and the consideration of urban and rural settings will help to determine causal predictors by reducing the correlations among variables. Future studies should consider the PM-
mortality literature as a whole and should choose sites that consider variability across dimensions other than air pollution.
EB models and their posterior estimates can also show which studies might be outliers and in need of further analysis, and can be used to estimate the findings of new studies if site-specific information is known. As the number of epidemiologic studies on air pollution mortality increases, the statistical power will improve and will allow for the evaluation of more predictors. In particular, additional studies to consider both PM2.5 and PM10, as well as studies addressing the issues of gaseous pollutant confounding either analytically or by exclusion, will help update our initial estimates and verify whether the derived relationships are accurate. Once more comprehensive models for PM mortality have been created, they could be used to help target geographic regions where PM reductions could have greater impacts.
We applied an EB meta-analysis model to the time-series PM-mortality literature to determine whether variability in effect estimates can be explained by simple study-
specific factors. We estimated that mortality rates increased by approximately 0.7% for a 10-µg/m3 increase in PM10 concentrations. Our model finds compelling evidence that the PM10-mortality relationship is stronger in locations with higher PM2.5/PM10 ratios, supporting the hypothesized role of fine particles. The significance of housing characteristics, climate, and correlated SO2 and O3 demonstrate that a number of factors have a measurable influence on the magnitude of the PM-mortality relationship. EB analysis of the PM10-mortality literature is recommended on an ongoing basis, to better determine factors that contribute to heterogeneity and causal determinants of increased mortality.