Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.

...

1/30PPT1, PPT2
 Lecture NameDateSlidesNotes
1

Introduction to the course

1/17/2015PPT 
2

Course outline and student presentation assignment

1/23/2015PPT 
3

Big Data: Applications & Platform Architectures

1/30/2015PPT 
4

Big Data and Analytics Systems: Computer System Architecture

2/6/2015PPT 
5More Applications of Big Data4/24/2015  

...

Schedule for Student Presentations

Note: The slides and related material for each topic will be provided. Hence, do not hesitate about the workload if you like a topic not in your domain.

 
 
Topic
Student  Name
Date
Presentation
1

Distributed file systems and map-reduce as a tool for creating parallel algorithms that succeed on very large amounts of data

Yiting Xia

2/613/2015

PPT
2

Similarity search, including the key techniques of min-hashing and locality sensitive hashing

 Deepak Majeti2/1320/2015 PDF
3

Data-stream processing and specialized algorithms for dealing with data that arrives so fast it must be processed immediately or lost

Wei-Cheng Xiao2/2027/2015 PPTX
4

Anomaly Detection Frequent-itemset mining, including association rules, market-baskets, the A-Priori Algorithm and its improvements

Deepak Majeti3/13/2015PPT
5 Omid Pouya23/2720/2015 
5

Frequent-itemset mining, including association rules, market-baskets, the A-Priori Algorithm and its improvements

 2/6/2015 
PPT1, PPT2
66

Algorithms for clustering very large, high-dimensional datasets

 Simbarashe Dzinamarira3/1327/2015 
7

Two key problems for Web applications: managing advertising and recommendation systems

 Lei Tang34/2010/2015 
8

Algorithms for analyzing and mining the structure of very large graphs, especially social-network graphs

Shangyu Luo 34/2717/2015 
9 

Techniques for obtaining the important properties of a large dataset by dimensionality reduction, including singular-value decomposition

and latent semantic indexing

 Zhipeng Wang4/324/2015 
10 

Machine-learning algorithms that can be applied to very large data, such as perceptrons, support-vector machines, and gradient descent

 Zhipeng Wang4/1030/2015 

...

Resources Resources

Text Book:  Mining of Massive Datasets, Jure Leskovec, Anand Rajaraman, Jeffrey D. Ullman

...