Sunday, Half Day, PM
Room A209

Title: Data Grids: Drivers, Technologies, Opportunities

Level: 20% Introductory | 40% Intermediate | 40% Advanced

Presenters: Ann Chervenak , USC Information Sciences Institute; Michael Wilde, Argonne National Laboratory


In numerous scientific, engineering, and business disciplines, terabyte- and petabyte-scale data collections are emerging as critical resources. These data sets must be shared by large communities of users that pool their resources from a large number of institutions. This 2-part tutorial shows how to design and implement new information infrastructures called "Data Grids" to access and analyze the enormous distributed datasets employed by these communities. Part 1 surveys the current body of data grid concepts and techniques. It details the goals, requirements, and architectures of both deployed and proposed data grids. Examples will be drawn from case studies and detailed requirements analyses from physics, climate science, and engineering communities. Part 2 presents data grid implementation tools and techniques. We start by examining how to use Grid-enabled data transport and file replication components in application environments. We then focus on Grid-enabling applications directly with Data Grid toolkit components, and conclude with illustrations of integrating components of the Globus Toolkit for security, policy management, and resource monitoring with data management capabilities.

Related Tutorial: S9 - The Emerging Grid: Introduction, Tools, Applications