INDUSTRY
The Bandwidth Challenge: Unleashing the Power of the Network
The National Center for Data Mining (NCDM) at UIC and the Open Cloud Consortium were awarded the 2008 Supercomputing Bandwidth Challenge Award at SC08 in Austin. Their entry was titled "Towards Global Scale Cloud Computing: Using Sector and Sphere on the Open Cloud Testbed" and was led by Dr. Yunhong Gu of the University of Illinois at Chicago and Dr. Robert Grossman of the University of Illinois at Chicago and Open Data Group. Although cloud computing is common today, processing data by clouds today is almost always done within a single data center due to the technical challenges processing data across multiple data centers. The team today demonstrated technology for the first time that enables cloud computing to utilize high performance networks and spread cloud computing across data centers to create wide area clouds. The technology that makes this possible is the open source Sector storage cloud and Sphere compute cloud developed by the NCDM. NCDM used the Open Cloud Testbed, which is a testbed managed by the Open Cloud Consortium for this challenge. The Open Cloud Consortium develops standards for computing within clouds and frameworks for interoperating between clouds. "A whole new generation of cloud computing is now possible using the open source Sector storage cloud and the Sphere computing cloud and standards developed by the Open Cloud Consortium. For the first time, developing applications that span multiple distributed clouds is now possible," according to Robert Grossman. According to Joe Mambretti, Director of the International Center of Advanced Internet Research at Northwestern University and Co-Director of the Open Cloud Testbed, "These innovative technologies provide unique capabilities that will enable new generations of applications based on extremely large scale data streams." During the Bandwidth Challenge at SC08, the team demonstrated three applications that used the Sector/Sphere cloud. The application transported bioinformatics data using Sector from the conference floor in Austin to Kitakyushu in Japan at over 8 Gb/s. The second application demonstrated was Creditstone, which is a benchmark for financial services applications. The Sector/Sphere implementation of Creditstone processed about 53.5 billion synthetic credit card transaction records in less than 1 hour. The third application was TeraSort, which sorted 1 terabyte of data within 30 minutes. The average data moving rate was about 4.8Gb/s in the Open Cloud Testbed, with a peak speed reaching 10Gb/s. One of the key achievements of the Sector and Sphere software is that it is very easy to use. For example, the TeraSort code only requires about 50 lines of C++ code. This is critical, as it allows researchers to use their time to focus on research problems, rather than spending time dealing with distributed programming. According to Yunhong Gu, "Sphere is a new software system that supports simplified distributed data processing application development. In contrast to traditional distributed computing methods such as MPI, Sphere allows users to write distributed applications with a few lines of code and without knowing the details of the underlying hardware."