Toxicogenomics in risk assessment: an overview of an HESI collaborative research program.

The value of genomic approaches in hypothesis generation is being realized as a tool for understanding toxicity and consequently contributing to an assessment of drug and chemical safety. In 1999 the membership of the International Life Sciences Institute Health and Environmental Sciences Institute formed a committee to develop a collaborative scientific program to address issues, challenges, and opportunities afforded by the emerging field of toxicogenomics. Experts and advisors from academia and government laboratories participate on the committee, along with approximately 30 corporate member organizations from the pharmaceutical, agrochemical, chemical, and consumer products industries. The committee has designed, conducted, and analyzed numerous toxicogenomic experiments within the broad fields of hepatotoxicity, nephrotoxicity, and genotoxicity. The considerable body of data generated by these programs has been instrumental in increasing understanding of sources of biological and technical variability in the alignment of toxicant-induced transcription changes with the accepted mechanism of action of these agents and the challenges in the consistent analysis and sharing of the voluminous data sets generated by these approaches. Recognizing the importance of standardized microarray data formats and public repository databases as the mechanism by which microarray data can be compared and interpreted by the scientific community, the committee has partnered with the European Bioinformatics Institute to develop a database to house the data generated by its collaborative research.

Gene transcription lies at the beginning of a response of a cell to a xenobiotic. Thus, a transcriptional response can give a preliminary indication of the biochemical or biological mechanism being affected by a xenobiotic, and gene expression data can provide starting points in a toxicological examination. The use of genomics technologies, particularly gene arrays, as tools for identifying profiles of gene expression associated with particular compounds and/or toxicities has shown increasing merit. If a good correlation exists between gene expression and a toxic mechanism, then the genomic data may provide supportive evidence for that mechanism (Chevalier and Roberts 2001;Harries et al. 2001;Lord et al. 2001). Even when the mechanism is unknown, genomic data can help identify components (i.e., proteins or enzymes) of pathways that may be involved in the biological process under study (Crosby et al. 2000). Developing databases of expression profiles for a wide variety of toxic compounds and toxic models makes it possible to create statistical and computational methods that can indicate the toxic potential of a drug or chemical from the pattern of gene expression changes it elicits in in vitro (Brooks and Pennie 2001;Burczynski et al. 2000;Waring et al. 2001) or in vivo systems (Hamadeh et al. 2002).
Over the past several years there has been considerable investment by chemical and pharmaceutical companies, government agencies, and technology providers in the application of gene array-based approaches in chemical and drug development. The value of genomic approaches to generate hypotheses is being realized for understanding toxicity and consequently contributing to an evaluation of drug and chemical safety both in predictive toxicology and in mechanism-based risk assessment.
As the field of toxicogenomics emerged in the late 1990s, it became clear there was a need to establish a body of available knowledge to serve as a foundation for applying the data generated by gene array methodologies to risk assessment. To this end, in 1999 the membership of the nonprofit scientific research organization, the International Life Sciences Institute (ILSI) Health and Environmental Sciences Institute (HESI) initiated a collaborative scientific program-the HESI Committee on the Application of Genomics to Mechanism-Based Risk Assessment-to address issues, challenges, and opportunities afforded by toxicogenomics (Robinson et al. 2003). The articles in this minimonograph include overviews of the design and objectives of this experimental program (Kramer et al. 2004;Mattes et al. 2004;Newton et al. 2004;Ulrich et al. 2004) as well as a series of technical articles detailing data generated and analyzed as part of the HESI genomics research programs Baker et al. 2004;Goodsaid et al. 2004;Mattes 2004;Rosenzweig et al. 2004;Thompson et al. 2004;Waring et al. 2004). Additional manuscripts detailing the experimental findings of the HESI Genotoxicity Working Group will be published in the journal Mutation Research in March 2004. The complete data set is currently being submitted to ArrayExpress (European Bioinformatics Institute, Hinxton, UK; http://www.ebi.ac.uk/arrayexpress) and will be available for public download by second quarter 2004. Accession numbers referencing this data set will be available on the HESI website (http://hesi.ilsi.org/index.cfm? pubentityid=120).
Toxicogenomics has progressed considerably during the course of the HESI Genomics Committee's research program (1999)(2000)(2001)(2002)(2003). Publications on the subject evolved from illustrating the theoretical promise of the technologies (Burchiel et al. 2001;Fielden and Zacharewski 2001;Nuwaysir et al. 1999;Simmons and Portier 2002;Smith 2001;Storck et al. 2002;Tennant 2002;Ulrich and Friend 2002;Waring and Halbert 2002) to illustrating the practical use of the technologies in toxicology (Brooks and Pennie 2001;Hamadeh et al. 2002;Lord et al. 2001;Waring et al. 2001). Many of the initial concerns about the practical use of toxicogenomics (e.g., oversensitivity, lack of comparability, availability of analytical tools) have been addressed, The value of genomic approaches in hypothesis generation is being realized as a tool for understanding toxicity and consequently contributing to an assessment of drug and chemical safety. In 1999 the membership of the International Life Sciences Institute Health and Environmental Sciences Institute formed a committee to develop a collaborative scientific program to address issues, challenges, and opportunities afforded by the emerging field of toxicogenomics. Experts and advisors from academia and government laboratories participate on the committee, along with approximately 30 corporate member organizations from the pharmaceutical, agrochemical, chemical, and consumer products industries. The committee has designed, conducted, and analyzed numerous toxicogenomic experiments within the broad fields of hepatotoxicity, nephrotoxicity, and genotoxicity. The considerable body of data generated by these programs has been instrumental in increasing understanding of sources of biological and technical variability in the alignment of toxicant-induced transcription changes with the accepted mechanism of action of these agents and the challenges in the consistent analysis and sharing of the voluminous data sets generated by these approaches. Recognizing the importance of standardized microarray data formats and public repository databases as the mechanism by which microarray data can be compared and interpreted by the scientific community, the committee has partnered with the European Bioinformatics Institute to develop a database to house the data generated by its collaborative research. although several issues still merit resolution before such data are used fully in the risk assessment process.

The HESI Collaborative Research Program
The HESI genomics research program was executed via a multinational team of scientists from academic and government laboratories, along with scientists from more than 30 corporate member organizations from the pharmaceutical, agrochemical, chemical, and consumer products industries (Table 1). Participation in the activities of the committee has afforded the collaborators an unprecedented opportunity to share experiences, best-operating practices, and actual data from a wider cross-section of commercially available and proprietary platforms, protocols, instrumentation, and analysis methods than they would have had access to individually.
Since its inception in 1999, the committee has conducted and analyzed toxicogenomics experiments within the broad fields of hepatotoxicity, nephrotoxicity, and genotoxicity. These experiments were designed a) to evaluate responses to prototypical toxicants to determine if known mechanisms of toxicity could be associated with characteristic gene expression profiles, b) to identify technological and biological sources of variability associated with toxicogenomic experimental protocols, and c) to ultimately evaluate the utility of this technology for risk assessment applications.
The experience highlighted the challenges inherent in the analysis and sharing of the data sets generated by these approaches. As reported in this mini-monograph, the experimental program has characterized the biological and technical sources of variability such as the isolation and labeling of mRNA samples, the detection hardware settings, the analysis software threshold settings, the microarray lot number, and the differences in gene coverage and probe annotation across different technical platforms. Nevertheless, in support for toxicogenomics as a valuable tool in assessing toxicity, the experimental programs have shown that a) patterns of gene expression relating to biological pathways are robust enough to allow insight into mechanisms of toxicity; b) gene expression data can provide meaningful information on the physical location of the toxicity; c ) dose-dependent changes can be observed; and d) concerns about oversensitivity of the technology may be unfounded (particularly when compared with existing in vitro assays for direct-and indirect-acting genotoxicants).

Toxicogenomic Data Storage and Exchange
As part of its mission, the HESI Genomics Committee also recognized the importance of standardized microarray data formats and public repository databases for better comparison and interpretation by the broader scientific community. Because of this, the committee has partnered with the European Bioinformatics Institute, to develop a database (Tox-MIAMExpress; www.ebi.ac.uk/ tox-miamexpress/) based on ArrayExpress database structure (Brazma et al. 2003) and MIAME (Minimum Information About a Microarray Experiment) data format standards (Brazma et al. 2001). The database will be consistently annotated and integrated with other relevant information (e.g., histopathology, clinical chemistry, gross observations), employ standard controlled vocabulary, and be supported by a query and data analysis interface. This database will house all the data generated by the HESI genomics consortium as part of the project described herein and will be made available to the public in early 2004. See Mattes et al. (2004) in this issue for more information.

Toxicogenomics and Risk Assessment
As the investments made in the application of genomic technologies mature, there is a determined effort to bring the full force of the technology into risk assessment. The committee's practical experimental experience has provided a valuable substrate for discussions on the technical and logistical challenges associated with the use of such data in the regulatory environment. A workshop with invited academic and regulatory participation (representing North America, Japan, and Europe) was held 5-6 June 2003 in Fairfax, Virginia, to discuss the findings of the committee's working groups (Pettit 2003). The participants were able to exploit these findings to form opinions on the implementation and interpretation of genomic microarray data in the risk assessment process.
Scientists from government agencies are encouraging greater input from both the academic and private sectors in the development of data-quality standards and commonly accepted analysis methods. The committee's work can help answer the many questions related to the use of toxicogenomic data in regulatory submissions or other risk evaluations. For example, what depth of analysis (and in what format) of gene expression changes is required for scientifically meaningful risk assessment? Genomic microarrays used in the pharmaceutical industry typically are limited to early-stage predictive assays and are not used for advanced mechanistic analysis of compounds in later stages of assessment. However, it is becoming clear that microarray data results should be placed in an appropriate biological context (i.e., with other biological end points) for researchers to understand mechanisms underlying toxicity. The relevance of single gene expression changes in the absence of pathway-level gene expression data cannot be assured.
Raw data from publicly referenced experiments needs to be available via public toxicogenomic databases [for example, Tox-MIAMExpress or the Chemical Effects in Biological Systems (CEBS) being developed by the National Center for  Challenges inherent in populating and using these resources include limitations in the availability of sequence information about probes on microarray platforms, use of nonstandard ontologies for toxicology end points, and relatively few public submissions of genomic data. However, as reported in this mini-monograph, several public consortia and organizations (including HESI and the NCT) are actively engaging the toxicological community in addressing these challenges.

Conclusion
Genomics, and more specifically toxicogenomics, can no longer be regarded as a new technology. The technologies are maturing to the extent that we now have considerable experience in their use. As evidenced in the research conducted by the HESI Genomics Committee and presented in this mini-monograph, it is clear that transcriptional profiles can discriminate between classes of compounds and some toxicities. These very preliminary data can provide users with valuable information for mechanistic evaluation and may even facilitate commercial decisions concerning the compounds to be developed and how to develop them. This committee's collective experience presented in this mini-monograph also provides awareness of the limits of sensitivity and reproducibility of the methods and an understanding of how transcriptomic data can be interpreted in the context of the pathology and other biological data from a toxicology study. As discussed in this mini-monograph, the experimental work and ongoing interactions of the participants on the HESI Genomics Committee represent a unique opportunity for the integration and distillation of this collective experience for the benefit of the regulators and regulated industries as well as for the toxicology community as a whole.