New Microbial Metagenome Data Analysis System Producing Results

An experimental metagenomics data management and analysis system co-developed by the Biological Data Management and Technology Center (BDMTC) at Berkeley Lab with the Genome Biology Program and Microbial Ecology Program at DOE’s Joint Genome Institute and released earlier this year is also already helping produce scientific discoveries. Called IMG/M, the system extends the Integrated Microbial Genomes (IMG) system with the ability to integrate and analyze metagenome data, and has provided immediate support for metagenomics studies at JGI. “IMG/M is the first publicly available metagenome data management and analysis system —the first of its kind,” said Victor Markowitz, head of BDMTC. “Most research in microbial genome analysis focuses on individual organisms,” said Nikos Kyrpides, head of JGI’s Genome Biology Program. “The application of high throughput sequencing to environmental samples has revealed a new universe of microbial community genomes including mostly microorganisms unknown to science. Microbial community genome analysis, also known as metagenomics, focuses on how the entire community functions together”. Thus far, IMG/M has been used by Phil Hugenholtz, head of the Microbial Ecology Program, and his colleagues at JGI for completing the analysis of Enhanced Biological Phosphorus Removing sludge communities and for studying the metagenomes of several key microbial communities recently sequenced by DOE JGI, including the lignocellulose-hydrolyzing communities in termite hindguts. “IMG/M has proven to be an extremely useful resource and tool for analyzing our metagenomic data,” said Jared R. Leadbetter, associate professor of environmental microbiology at the California Institute of Technology and collaborator on the termite hindgut microbial community for bioenergy project. “Such datasets are large, complex, and potentially unwieldy. Importantly, IMG/M is more than just an excellent tool to analyze data. The manner in which the results of that analysis are organized and made accessible through a user-friendly interface allows the researcher to rapidly move in a number of different intellectual directions. As a result, the user becomes better educated with and gets a real ‘feel’ for the data in a manner that would not otherwise be possible on such short timescales.” IMG/M was presented by Markowitz at the recent Keystone Symposium on Microbial Community Genomics in Animals and the Environment, organized by DOE JGI Director Edward M. Rubin and Edward F. DeLong, professor in the Division of Biological Engineering & Department of Civil and Environmental Engineering at the Massachusetts Institute of Technology. “IMG/M provides an intuitive interface, and nice complement to IMG for comparing gene content and phylogenetic profiles of microbial genomes, and relating them to the large microbial community datasets now accumulating,” DeLong said. “These are great and sorely needed data-exploration tools.” IMG/M was demonstrated at a workshop on April 1, as part of the DOE JGI First Annual User Meeting. Additionally, BDMTC staff members will be presenting the project this summer at workshops and symposia in England, Brazil and Austria. “We have an experimental system which has proved to be immediately useful in helping scientists to conduct their studies effectively,” Markowitz said. IMG/M is accessible to the public at its Web site. More information about the The Biological Data Management and Technology Center is at its Web site. Berkeley Lab is a U.S. Department of Energy national laboratory located in Berkeley, California. It conducts unclassified scientific research and is managed by the University of California. Learn more at its Web site. Berkeley Lab’s Computing Sciences organization is well known for managing large-scale computational and networking facilities such as the U.S. Department of Energy’s National Energy Research Scientific Computing Center (NERSC) and Energy Sciences Network (ESnet) for national user communities with thousands of scientific users.