Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.

COMP 322: Fundamentals of Parallel Programming (Spring

...

2022)

 

InstructorInstructors:

Mackale Joyner, DH 2063

Zoran Budimlić, DH 3003

TAs:Elian Ahmar, Timothy Goh, Kelly Park, Tucker ReinhardtAdrienne Li, Austin Hushower, Claire Xu, Diep Hoang, Hunena Badat, Maki Yu, Mantej Singh, Minh Vu, Thanh Vu, Robert Walsh, Frederick Wang, Xincheng Wang, Rose Zhang, Victor Song, Yidi Wang   
Admin Assistant:Annepha Hurlock, annepha@rice.edu , DH 3122, 713-348-5186 

 

Piazza site:

https://piazza.com/rice/spring2021spring2022/comp322 (Piazza is the preferred medium for all course communications)

Cross-listing:

ELEC 323

Lecture location:

Fully OnlineHerzstein Amphitheater (online 1st 2 weeks)

Lecture times:

MWF 1:30pm 00pm - 21:25pm50pm

Lab locations:

Fully OnlineKeck 100 (online 1st 2 weeks)

Lab times:

Tu 1Mon  3:30pm 00pm - 23:25pm 50pm (TV, MS, TG, RWAustin, Claire)

Th Wed 4:50pm 30pm - 5:45pm (XW, TR, KP, YW, FW, EA20pm (Hunena, Mantej, Yidi, Victor, Rose, Adrienne, Diep, Maki)

Course Syllabus

A summary PDF file containing the course syllabus for the course can be found here.  Much of the syllabus information is also included below in this course web site, along with some additional details that are not included in the syllabus.

...

The desired learning outcomes fall into three major areas (course modules):

1) Parallelism: functional programming, Java streams, creation and coordination of parallelism (async, finish), abstract performance metrics (work, critical paths), Amdahl's Law, weak vs. strong scaling, data races and determinism, data race avoidance (immutability, futures, accumulators, dataflow), deadlock avoidance, abstract vs. real performance (granularity, scalability), collective & point-to-point synchronization (phasers, barriers), parallel algorithms, systolic algorithms.

...

3) Locality & Distribution: memory hierarchies, locality, cache affinity, data movement, message-passing (MPI), communication overheads (bandwidth, latency), MapReduce, accelerators, GPGPUs, CUDA, OpenCL., MapReduce

To achieve these learning outcomes, each class period will include time for both instructor lectures and in-class exercises based on assigned reading and videos.  The lab exercises will be used to help students gain hands-on programming experience with the concepts introduced in the lectures.

To ensure that students gain a strong knowledge of parallel programming foundations, the classes and homeworks homework will place equal emphasis on both theory and practice. The programming component of the course will mostly use the  Habanero-Java Library (HJ-lib)  pedagogic extension to the Java language developed in the  Habanero Extreme Scale Software Research project  at Rice University.  The course will also introduce you to real-world parallel programming models including Java Concurrency, MapReduce, MPI, OpenCL and CUDA. An important goal is that, at the end of COMP 322, you should feel comfortable programming in any parallel language for which you are familiar with the underlying sequential language (Java or C). Any parallel programming primitives that you encounter in the future should be easily recognizable based on the fundamentals studied in COMP 322.

...

There are no required textbooks for the class. Instead, lecture handouts are provided for each module as follows.  You are expected to read the relevant sections in each lecture handout before coming to the lecture.  We will also provide a number of references in the slides and handouts.The links to the latest versions of the lecture handouts are included below:

  • Module 1 handout (Parallelism)
  • Module 2 handout handout (Concurrency)

There

...

There are also a few optional textbooks that we will draw from during the course.  You are encouraged to get copies of any or all of these books.  They will serve as useful references both during and after this course:

 

Finally, here are some additional resources that may be helpful for you:

...

Lecture Schedule

 

 

Topic 1.1 Lecture, Topic 1.1 Demonstration Homework 1WedFeb 03 Future Tasks, Functional Parallelism ("Back to the Future")Topic 2.1 Lecture, Topic 2.1 Demonstration   Feb 08 Map Reduce4 4    Lecture 9: Java’s Fork/Join LibraryNo class (weather) Quiz for Unit 26 WedMar 03 13: Parallelism in Java Streams, Parallel Prefix Sums  37 37 Demonstration (includes one intermediate checkpoint)  FriMar 05 14: Iterative Averaging Revisited, SPMD patternQuiz for Unit 3Quiz for Unit 5 24Homework 3, Checkpoint-1   31  24: Java Threads, Java synchronized statement1 2  Apr 02 25: Java Threads, Java synchronized statement (contd), wait/notify 7 7 Lecturelec25

 

 lec33Fri 23 34: Algorithms based on Parallel Prefix (Scan) operationsQuiz for Unit 8Mon 26 35: TBD

Week

Day

Date (20212022)

Lecture

Assigned Reading

Assigned Videos (see Canvas site for video links)

In-class Worksheets

Slides

Work Assigned

Work Due

 Worksheet Solutions 

1

Mon

Jan 2510

Lecture 1: Task Creation and Termination (Async, Finish)

Module 1: Section 1.1

Introduction

 

 

worksheet1lec1-slides  worksheet1lec1-slides

 

 

 
WS1-solution 

 

Wed

Jan 2712

Lecture 2:  Computation Graphs, Ideal Parallelism

Module 1: Sections 1.2, 1.3Topic 1.2 Lecture, Topic 1.2 Demonstration, Topic 1.3 Lecture, Topic 1.3 Demonstrationworksheet2lec2-slides

Functional Programming

GList.java worksheet2lec02-slides

 

 

WS2-solution 
 FriJan 2914Lecture 3: Abstract Performance Metrics, Multiprocessor SchedulingModule 1: Section 1.4Topic 1.4 Lecture, Topic 1.4 Demonstrationworksheet3 Higher order functions  worksheet3 lec3-slides   lec3-slides

 

  WS3-solution 

2

Mon

Jan 17

No class: MLK

        

 

Wed

Jan 19

Lecture 4: Lazy Computation

LazyList.java

Lazy.java

 

Feb 01

Lecture 4: Parallel Speedup and Amdahl's Law

Module 1: Section 1.5Topic 1.5 Lecture, Topic 1.5 Demonstrationworksheet4lec4-slidesQuiz for Unit 1   WS4-solution 

 

Fri

Jan 21

Lecture 5:

Java Streams

  Module 1: Section 2.1worksheet5lec5-slidesHomework 1 WS5-solution 
3FriMonFeb 05Jan 24

Lecture 6:   Finish Accumulators Map Reduce with Java Streams

Module 1: Section 2.34Topic 2.3 4 Lecture, Topic 2.3 4 Demonstration  worksheet6lec6-slides

 

 Quiz for Unit 1WS6-solution 

 

3Mon

Wed

Jan 26

Lecture 7:

Futures

Module 1: Section 2.41Topic 2.1 Lecture , Topic 2.1 Demonstrationworksheet7lec7-slides

 

 WS7-solution 

 

WedFri

Feb 10Jan 28

Lecture 8: Data Races, Functional & Structural Determinism  Computation Graphs, Ideal Parallelism

Module 1: Section Sections 1.2.5, 21.63Topic 1.2 .5 Lecture, Topic 1.2 .5 Demonstration, Topic 21.6 3 Lecture, Topic 21.6 3 Demonstration   worksheet8lec8-slides 

Homework 2

 WS8-solutionHomework 1  

4

Mon

 

Fri

Feb 12

Jan 31 Lecture 9: Async, Finish, Data-Driven Tasks 

Module 1: Section 1.1, 4.5

 

Topic

2

1.

7

1 Lecture, Topic

2

1.

7

1 Demonstration, Topic

2

4.

8

5 Lecture, Topic

2

4.

8

5 Demonstration

worksheet9

lec9-slidesslides Quiz for Unit 2   

comp322-s21-lab6.pdf4

Mon

 

Feb 15WS9-solution 
 WedFeb 02Lecture 10: Event-based programming model

 

  worksheet10 lec10-slides  WS10-solution 
 WedFriFeb 17Spring "Sprinkle" Day (no class)04Lecture 11: GUI programming as an example of event-based,
futures/callbacks in GUI programming
 
   worksheet11    
 FriFeb 19No class (weather)        
lec11-slidesHomework 2Homework 1WS11-solution 
5

Mon

Feb 2207

Lecture 10: Loop-Level Parallelism, Parallel Matrix Multiplication12: Scheduling/executing computation graphs
Abstract performance metrics
Module 1: Sections 3.Section 1, 3.24Topic 31.1 4 Lecture , Topic 3. 1 Demonstration ,  Topic 3.2 Lecture,  Topic 3.2 .4 Demonstrationworksheet10worksheet12lec10lec12-slides   WS12-solution 

 

Wed

Feb 2409

Lecture 11: Iteration Grouping (Chunking), Barrier Synchronization 13: Parallel Speedup, Critical Path, Amdahl's Law

Module 1: Sections 3.3, 3.4Section 1.5

Topic 31.3 5 Lecture , Topic 3.3 Demonstration, Topic 3.4 Lecture  ,   Topic 3.4 Demonstration

worksheet11

1.5 Demonstration

worksheet13lec13lec11-slides   WS13-solution 

 

Fri

Feb 26

 Lecture 12: Data-Driven Tasks 

 

Module 1: Sections 4.5

Topic 4.5 Lecture   Topic 4.5 Demonstration

worksheet12lec12-slides 

11

No class: Spring Recess

 

        
6

Mon

Feb 14

Lecture 14: Accumulation and reduction. Finish accumulators

Module 1: Section 2.3Topic 2.3 Lecture   Topic 2.3 Demonstrationworksheet14lec14-slides  WS14-solution 

 

Wed

Mon

Mar 01

Feb 16

Lecture 15: Recursive Task Parallelism  

 Spring "Sprinkle" Day (no class)  worksheet15 lec15-slides

 

 

 WS15-solution 
 FriFeb 18

Lecture

16: Data Races, Functional & Structural Determinism

Module 1: Sections 3.72.5, 2.6Topic 2.5 Lecture ,  Topic 2.5 Demonstration,  Topic 2.6 Lecture,  Topic 2.6 Demonstrationworksheet16 lec16worksheet13lec13-slidesHomework 3Homework 2WS16-solution  

7

Mon

Feb 21

Lecture

Module 1: Sections 3.5, 3.6Topic 3.5 Lecture , Topic 3.5 Demonstration , Topic 3.6 Lecture,   Topic 3.6 Demonstrationworksheet14 lec14-slides

17: Midterm Review

   lec17-slides    

7 

MonWed

Mar 08Feb 23

Lecture 15:  Point-to-point Synchronization with Phasers

Module 1: Section 4.2, 4.3Topic 4.2 Lecture ,   Topic 4.2 Demonstration, Topic 4.3 Lecture,  Topic 4.3 Demonstrationworksheet15

18: Limitations of Functional parallelism.
Abstract vs. real performance. Cutoff Strategy

  worksheet18lec18-slides  WS18-solution 

 

Fri

Feb 25 

Lecture 19: Fork/Join programming model. OS Threads. Scheduler Pattern 

 Topic 2.7 Lecture, Topic 2.7 Demonstration, Topic 2.8 Lecture, Topic 2.8 Demonstration, worksheet19lec19lec15-slides   WS19-solution 

 8

WedMon

Mar 10Feb 28

Lecture 16: Midterm Review

   

20: Confinement & Monitor Pattern. Critical sections
Global lock

Module 2: Sections 5.1, 5.2, 5.6 Topic 5.1 Lecture, Topic 5.1 Demonstration, Topic 5.2 Lecture, Topic 5.2 Demonstration, Topic 5.6 Lecture, Topic 5.6 Demonstrationworksheet20lec20-slides      lec16-slides   WS20-solution 

 

FriWed

Mar

12 

02

Lecture 17: Pipeline Parallelism, Signal Statement, Fuzzy Barriers21:  Atomic variables, Synchronized statements

Module 2: Sections 5

Module 1: Sections 4

.4,

4

7.

1

2

Topic 45.4 Lecture,   Topic 45.4 Demonstration, Topic 4.1 Lecture,  Topic 4.1 Demonstrationworksheet177.2 Lectureworksheet21lec21lec17-slides  WS21-solution 

 

8

MonFri

Mar 1504

Lecture 18: Abstract vs. Real Performance22: Parallel Spanning Tree, other graph algorithms 

  worksheet18worksheet22lec18lec22-slides   Homework Quiz for Unit 4Quiz for Unit

Homework 3

 
WS22-solution 

 9

WedMon

Mar 1707

Lecture 23: Java Threads and LocksLecture 19: Critical Sections, Isolated construct (start of Module 2)

Module 2: Sections 57.1, 5.2, 5.6, 7.3

Topic

5

7.1 Lecture, Topic

5.1 Demonstration, Topic 5.2 Lecture, Topic 5.2 Demonstration, Topic 5.6 Lecture, Topic 5.6 Demonstration
worksheet19

7.3 Lecture

worksheet23 lec23lec19-slides  

 

 
WS23-solution 

 

FriWed

Mar 1909

Lecture 20: Parallel Spanning Tree algorithm, Atomic variables 24: Java Locks - Soundness and progress guarantees  

Module 2: Sections 5.3, 5.4, 57.5Topic 5.3 Demonstration, Topic 5.4 Lecture, Topic 5.4 Demonstration, Topic 5.5 Lecture, Topic 5.5 Demonstrationworksheet20lec20-slides 7.5 Lecture worksheet24 lec24-slides 

 

WS24-solutionQuiz for Unit 4 

 9

Fri

Mon

Mar 2211Lecture 21: Actors

 Lecture 25: Dining Philosophers Problem  Module 2: 67.1, 6.2Topic 7.6 .1 Lecture ,   Topic 6.1 Demonstration ,   Topic 6.2 Lecture, Topic 6.2 Demonstrationworksheet21 lec21-slides Lectureworksheet25lec25-slides 

 

WS25-solution 
 

Mon

Mar 14

No class: Spring Break

     

 

  
 WedMar 16

Lecture 22: Actors (contd)

Module 2: 6.3, 6.4Topic 6.3 Lecture, Topic 6.3 Demonstration, Topic 6.4 Lecture , Topic 6.4 Demonstrationworksheet22 lec22-slides No class: Spring Break    

 

   

 

Fri

Mar 2618

Spring "Sprinkle" Day (no class)

 

No class: Spring Break

     

 

  

10

Mon

Mar 29

Lecture 23: Actors (contd)

Module 2: 6.5, 6.6Topic 6.5 Lecture, Topic 6.5 Demonstration, Topic 6.6 Lecture, Topic 6.6 Demonstrationworksheet23

Mon

Mar 21

Lecture 26: N-Body problem, applications and implementations 

  worksheet26lec26lec23-slides  

Quiz for Unit 5

 WS26-solution 

 

Wed

Mar

23

Lecture

27: Read-Write Locks, Linearizability of Concurrent Objects

Module 2: 7.13, 7.24Topic 7.3 Lecture, Topic 7.4 Lectureworksheet24worksheet27lec24lec27-slides Quiz for Unit 6

 

 WS27-solution 

 

Fri

Mar 25

Lecture

28: Message-Passing programming model with Actors

Module 2: 76.1, 76.2Topic 6.1 Lecture, Topic 6.1 Demonstration,   Topic 6.2 Lecture, Topic 6.2 Demonstrationworksheet25worksheet28lec28-slides

 

 

 

WS28-solution 

11

MonApr

05Mar 28

 

Wed

Apr 07

Lecture 26: Java Locks

Module 2: 7.3Topic 7.3 Lecture worksheet26lec26-slides Homework 4 (includes one intermediate checkpoint)Homework 3 (all)  

Lecture 27: Linearizability of Concurrent Objects29: Active Object Pattern. Combining Actors with task parallelism 

Module 2: 76.3, 6.4Topic 76.3 Lecture, Topic 6.3 Demonstration,   Topic 6.4 Lectureworksheet27, Topic 6.4 Demonstrationworksheet29lec29lec27-slides

 

 

 
WS29-solution 

 

Fri

Apr 09

Lecture 28: Safety and Liveness Properties, Java Synchronizers, Dining Philosophers Problem

 Topic 7.5 Lecture, Topic 7.6 Lectureworksheet28lec28-slides

Quiz for Unit 7

Wed

Mar 30

Lecture 30: Task Affinity and locality. Memory hierarchy 

  worksheet30lec30-slides

 

 WS30-solution 

 12

Fri

Mon

Apr 1201

Lecture 29: Message Passing Interface (MPI), (start of Module 3)

 Topic 8.1 Lecture, Topic 8.2 Lecture, Topic 8.3 Lectureworksheet29lec29-slides

 

Quiz for Unit 6

  

31: Data-Parallel Programming model. Loop-Level Parallelism, Loop Chunking

Module 1: Sections 3.1, 3.2, 3.3Topic 3.1 Lecture, Topic 3.1 Demonstration , Topic 3.2 Lecture,  Topic 3.2 Demonstration, Topic 3.3 Lecture,  Topic 3.3 Demonstrationworksheet31lec31-slidesHomework 5

Homework 4

WS31-solution 

12

Mon

Apr 04

Lecture 32: Barrier Synchronization with PhasersModule 1: Section 3.4Topic 3.4 Lecture,  Topic 3.4 Demonstrationworksheet32lec32

 

Wed

Apr 14

Lecture 30: Message Passing Interface (MPI, contd)

 Topic 8.4 Lectureworksheet30lec30-slides

 

 

WS32-solution 

 

Wed

 

Fri

Apr 16

Lecture 31: Message Passing Interface (MPI, contd)

  Topic 8.5 Lecture, Topic 8 Demonstration Videoworksheet31lec31-slidesQuiz for Unit 8

Apr 06

Lecture 33:  Stencil computation. Point-to-point Synchronization with Phasers

Module 1: Section 4.2, 4.3

Topic 4.2 Lecture, Topic 4.2 Demonstration, Topic 4.3 Lecture,  Topic 4.3 Demonstration

worksheet33lec33-slides

 

 WS33-solutionQuiz for Unit 7 

 

13

MonFri

Apr 1908

Lecture 32: Task Affinity with Places

   lec32-slides

 

Homework 4 Checkpoint-1

  

34: Fuzzy Barriers with Phasers

Module 1: Section 4.1Topic 4.1 Lecture, Topic 4.1 Demonstrationworksheet34lec34-slides 

 

WS34-solution 

13

Mon

Apr 11

Lecture 35

 

Wed

Apr 21

Lecture 33: Eureka-style Speculative Task Parallelism 

 

 worksheet35lec35-slides

 

 

 
WS35-solution 
 WedApr 13Lecture 36: Scan Pattern. Parallel Prefix Sum 

 

 
worksheet36lec34lec36-slides  WS36-solution 
 

14

FriApr 15Lecture 37: Parallel Prefix Sum applications   worksheet37lec35lec37-slides     
14WedMonApr 2818Lecture 36: TBD38: Overview of other models and frameworks   lec36lec38-slides Homework 4 (all)   
 FriWedApr 3020Lecture 3739: Course Review (Lectures 19-3438)   lec37lec39-slides    

Lab Schedule

 FriApr 22Lecture 40: Course Review (Lectures 19-38)   lec40-slides Homework 5  

Lab Schedule

Isolated Statement and Atomic Variables-Apache Spark  Java's ForkJoin Framework

Lab #

Date (2022)

Topic

Handouts

Examples

1

Jan 10

Infrastructure setup

lab0-handout

lab1-handout

 
2Jan 17Functional Programminglab2-handout 

3

Jan 24

Java Streams

lab3-handout
 
4Jan 31Futureslab4-handout 

5

Feb 07

Data-Driven Tasks

lab5-handout 
6

Feb 14

Async / Finish

Lab #

Date (2021)

Topic

Handouts

Examples

0 Infrastructure Setuplab0-handout 

1

Jan 26

Async-Finish Parallel Programming with abstract metrics

lab1-handout
 
-Feb 02No lab this week  

2

Feb 09

Futures

lab2-handout
 
-Feb 16No lab this week (classes cancelled)  

3

Feb 23

Cutoff Strategy and Real World Performance

lab3-handout  
4

Mar 02

DDFs

lab4-handout  
-

Mar 09

No lab this week (Midterm exam)

  
-Mar 16No lab this week (Spring "Sprinkle" Day)  
5Mar 23Loop-level Parallelismlab5-handoutlab5-intro

6

Mar 30

lab6-handout 
-Apr 06

Feb 21

No lab this week (

Spring "Sprinkle" Day

Midterm)

  
7Apr 13Feb 28Recursive Task Cutoff StrategyJava Threads, Java Lockslab7-handout 
8

Apr 20

Mar 07Java ThreadsActorslab8-handout 

-

Mar 14

No lab this week (Spring Break)

  
9Mar 21Concurrent Listslab9-handout 
10Mar 28Actorslab10-handout

Message Passing Interface (MPI)

   
11

Apr 04

Loop Parallelism

lab11-handout 

-

Apr 11

No lab this weekEureka-style Speculative Task Parallelism

  

-

 

Apr 18

No lab this week

  

Grading, Honor Code Policy, Processes and Procedures

...

Labs must be submitted by the following Monday Wednesday at 114:59pm30pm.  Labs must be checked off by a TA.

Worksheets should be completed by the deadline listed in Canvas before the start of the following class (for full credit) so that solutions to the worksheets can be discussed in the next class.

...