Humans are generating, sensing, and harvesting massive amounts of digital data, and many of these unprecedentedly large data sets will be archived in their entirety. The familiar notions of sequential or random access files no longer apply in the cloud. Instead developers will write code that mines this mass of unstructured data, extracts what is of interest, and then inserts the resulting data subset into a relational database or other structured data store where it will be analyzed and visualized. In a data-intensive world where the sheer volume of data demands new approaches and techniques, the inclination is to move the computation to the data, a basic theme underlying this course. Called the "fourth paradigm" (after theory, experiment, and computation), data-intensive computing is poised to transform scientific research. Students will learn about the notion of "data at rest" and its impact on data movement and computation, the role of cloud infrastructure in data-intensive computing, and the need for semantic metadata, preservation, and curation of digital data. Participants will get hands-on programming experience with data-intensive computing languages such as MapReduce.


  • Computer Science > General

Education Levels:

  • Grade 1
  • Grade 2
  • Grade 3
  • Grade 4
  • Grade 5
  • Grade 6
  • Grade 7
  • Grade 8
  • Grade 9
  • Grade 10
  • Grade 11
  • Grade 12


Informal Education,Biology,Higher Education,NSDL,Undergraduate (Upper Division),Computational Science,oai:nsdl.org:2200/20110907122822198T,Graduate/Professional,NSDL_SetSpec_ncs-NSDL-COLLECTION-000-003-112-055,Computer Science,Vocational/Professional Development Education,Life Science,Astronomy,Computing and Information,Space Science



Access Privileges:

Public - Available to anyone

License Deed:

Creative Commons Attribution Non-Commercial Share Alike


This resource has not yet been aligned.
Curriki Rating
'NR' - This resource has not been rated
'NR' - This resource has not been rated

This resource has not yet been reviewed.

Not Rated Yet.

Non-profit Tax ID # 203478467