 Data-Intensive Computing
 The Fourth Paradigm
 History of eScience
 Big Data in Science
 Introduction to Databases
 Relational databases, ACID
 Indexing
 Introduction to SQL
 User-defined functions
 Hardware architectures
 Storage hierarchy
 Nature of low-level I/O
 Redundant storage, RAID, erasure codes
 Networking issues
 Balanced systems, Amdahl's Laws
 Cloud computing vs Beowulf
 Elementary Statistics
 Distributions
 Expectation values, moments
 Central limit theorem
 Linear regression
 Principal component analysis
 Random forests
 Data transformations
 Fourier transforms
 Wavelets
 Random projections
 Data structures
 Trees
 k-d trees
 Quadtrees and octrees, space-filling curves
 Hashing
 Hash functions
 Locality-sensitive hashing
 Bloom filters
 Graphs
 Representation of graphs
 Properties of graphs
 Laplacian, eigenvalues
 Graphs as spring networks
 Sorting and Searching
 Quicksort
 Queues
 Mergesort
 Data streams, streaming algorithms
 Data visualization
