Cornell CAC offers data analysis training class

On October 11-12, 2010, members of the Cornell University Center for Advanced Computing (CAC) will present a National Science Foundation-sponsored training class, "Data Analysis on Ranger" on the campus of Cornell University, Ithaca, NY.

The two-day workshop will focus on data analysis on Ranger, the 3,936 node, 579 peak teraflops Linux Sun cluster located at the Texas Advanced Computing Center, although the concepts of this workshop will readily transfer to other high-performance computing platforms. There is no fee for this workshop.

The class will include lectures, labs, and discussions on:

  • Data formats, transfer, movement, and storage
  • Data analysis with R, Python, and MATLAB
  • MapReduce with Hadoop
  • Visualization
  • Optimization
  • Scientific workflows and provenance

To register for the workshop, please visit http://portal.teragrid.org/training. Questions regarding the workshop may be directed to the TeraGrid User Portal at http://portal.teragrid.org/consulting.

The Cornell University Center for Advanced Computing (CAC) receives support from Cornell University, the National Science Foundation, and other leading public agencies, foundations, and corporations. For more information, please visit http://www.cac.cornell.edu.

The Ranger supercomputer is funded through the NSF Office of Cyberinfrastructure "Path to Petascale" program. The system is collaboration among the Texas Advanced Computing Center (TACC), The University of Texas at Austin’s Institute for Computational Engineering and Science (ICES), Sun Microsystems, Advanced Micro Devices, Arizona State University, and Cornell University. For details on Ranger, please visit http://www.tacc.utexas.edu/resources/hpc/.

Ranger is a key resource of the NSF TeraGrid ( http://www.teragrid.org ), a nationwide network of academic HPC centers, sponsored by the NSF Office of Cyberinfrastructure, which provides scientists and researchers access to large-scale computing power and resources. TeraGrid is a partnership of people, resources and services that enables discovery in U.S. science and engineering by providing researchers with access to large-scale computing, networking, data-analysis and visualization resources and expertise.