Mackale Joyner, DH 2063
Zoran Budimlić, DH 1038
|TAs:||Adrienne Li, Austin Hushower, Claire Xu, Diep Hoang, Hunena Badat, Maki Yu, Mantej Singh, Rose Zhang, Victor Song, Yidi Wang|
|Admin Assistant:||Annepha Hurlock, email@example.com, DH 3122, 713-348-5186|
https://piazza.com/rice/spring2022/comp322 (Piazza is the preferred medium for all course communications)
Herzstein Amphitheater (online 1st 2 weeks)
MWF 1:00pm - 1:50pm
Keck 100 (online 1st 2 weeks)
Mon 3:00pm - 3:50pm
Wed 4:30pm - 5:20pm
A summary PDF of the course syllabus can be found here. Much of the syllabus information is also included below on this course web site, along with some additional details that are not in the syllabus.
The primary goal of COMP 322 is to introduce you to the fundamentals of parallel programming and parallel algorithms, by following a pedagogic approach that exposes you to the intellectual challenges in parallel software without enmeshing you in the jargon and lower-level details of today's parallel systems. A strong grasp of the course fundamentals will enable you to quickly pick up any specific parallel programming system that you may encounter in the future, and also prepare you for studying advanced topics related to parallelism and concurrency in courses such as COMP 422.
The desired learning outcomes fall into three major areas (course modules):
1) Parallelism: creation and coordination of parallelism (async, finish), abstract performance metrics (work, critical paths), Amdahl's Law, weak vs. strong scaling, data races and determinism, data race avoidance (immutability, futures, accumulators, dataflow), deadlock avoidance, abstract vs. real performance (granularity, scalability), collective & point-to-point synchronization (phasers, barriers), parallel algorithms, systolic algorithms.
2) Concurrency: critical sections, atomicity, isolation, high level data races, nondeterminism, linearizability, liveness/progress guarantees, actors, request-response parallelism, Java Concurrency, locks, condition variables, semaphores, memory consistency models.
3) Locality & Distribution: memory hierarchies, locality, cache affinity, data movement, message-passing (MPI), communication overheads (bandwidth, latency), MapReduce, accelerators, GPGPUs, CUDA, OpenCL.
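To make two of the abstract metrics listed under module 1 concrete, here is a minimal sketch in plain Java. It assumes the usual statement of Amdahl's Law: if a fraction q of a program's work is inherently sequential, the speedup on P processors is at most 1 / (q + (1 - q) / P). It also computes ideal parallelism as total work divided by critical path length. The class and method names are hypothetical and are not part of HJ-lib or the course handouts.

```java
// Illustrative sketch only: AmdahlSketch and its method names are hypothetical,
// not part of HJ-lib or any course handout.
final class AmdahlSketch {

    // Upper bound on speedup when a fraction `sequentialFraction` of the work
    // cannot be parallelized and the rest is spread evenly over `processors`.
    static double speedupBound(double sequentialFraction, int processors) {
        if (sequentialFraction < 0.0 || sequentialFraction > 1.0 || processors < 1) {
            throw new IllegalArgumentException("need 0 <= fraction <= 1 and processors >= 1");
        }
        return 1.0 / (sequentialFraction + (1.0 - sequentialFraction) / processors);
    }

    // Ideal parallelism of a computation graph: total work divided by the
    // critical path length, both measured in the same abstract units.
    static double idealParallelism(long work, long criticalPathLength) {
        if (work < 0 || criticalPathLength <= 0) {
            throw new IllegalArgumentException("need work >= 0 and criticalPathLength > 0");
        }
        return (double) work / criticalPathLength;
    }

    public static void main(String[] args) {
        // With 10% sequential work, the bound approaches but never reaches 10.
        for (int p : new int[] {1, 2, 4, 16, 1024}) {
            System.out.printf("P = %4d  speedup bound = %.2f%n", p, speedupBound(0.1, p));
        }
        System.out.println("ideal parallelism = " + idealParallelism(100, 20));
    }
}
```

The loop in `main` shows why the sequential fraction matters more than the processor count: adding processors shrinks only the parallel term of the denominator.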
To achieve these learning outcomes, each class period will include time for both instructor lectures and in-class exercises based on assigned reading and videos. The lab exercises will be used to help students gain hands-on programming experience with the concepts introduced in the lectures.
To ensure that students gain a strong knowledge of parallel programming foundations, the classes and homeworks will place equal emphasis on both theory and practice. The programming component of the course will mostly use the Habanero-Java Library (HJ-lib) pedagogic extension to the Java language developed in the Habanero Extreme Scale Software Research project at Rice University. The course will also introduce you to real-world parallel programming models including Java Concurrency, MapReduce, MPI, OpenCL and CUDA. An important goal is that, at the end of COMP 322, you should feel comfortable programming in any parallel language for which you are familiar with the underlying sequential language (Java or C). Any parallel programming primitives that you encounter in the future should be easily recognizable based on the fundamentals studied in COMP 322.
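As a flavor of the kind of task-parallel code the course works toward, the following is a minimal sketch that sums an array by recursively splitting it into two halves. It uses the standard `java.util.concurrent` fork/join framework rather than HJ-lib, so it should not be read as the course's own API; `fork` plays a role loosely similar to creating an asynchronous task and `join` to waiting for it. The class name `ArraySumSketch` and the `THRESHOLD` cutoff value are assumptions chosen for illustration.

```java
import java.util.concurrent.ForkJoinPool;
import java.util.concurrent.RecursiveTask;

// Illustrative sketch only: uses the standard Java fork/join framework,
// not HJ-lib. ArraySumSketch and THRESHOLD are hypothetical names.
final class ArraySumSketch extends RecursiveTask<Long> {
    private static final int THRESHOLD = 1_000; // at or below this size, sum sequentially

    private final int[] data;
    private final int lo; // inclusive
    private final int hi; // exclusive

    ArraySumSketch(int[] data, int lo, int hi) {
        this.data = data;
        this.lo = lo;
        this.hi = hi;
    }

    @Override
    protected Long compute() {
        if (hi - lo <= THRESHOLD) {
            // Small range: creating more tasks would cost more than it saves.
            long sum = 0;
            for (int i = lo; i < hi; i++) {
                sum += data[i];
            }
            return sum;
        }
        int mid = lo + (hi - lo) / 2;
        ArraySumSketch left = new ArraySumSketch(data, lo, mid);
        ArraySumSketch right = new ArraySumSketch(data, mid, hi);
        left.fork();                      // schedule the left half to run asynchronously
        long rightSum = right.compute();  // compute the right half in the current task
        return left.join() + rightSum;    // wait for the left half, then combine
    }

    static long parallelSum(int[] data) {
        return ForkJoinPool.commonPool().invoke(new ArraySumSketch(data, 0, data.length));
    }

    public static void main(String[] args) {
        int[] data = new int[1_000_000];
        for (int i = 0; i < data.length; i++) {
            data[i] = i % 10;
        }
        System.out.println("sum = " + parallelSum(data));
    }
}
```

The two halves touch disjoint index ranges and only read the array, so the tasks do not race with each other. The sequential cutoff is a tuning choice, and a different value may perform better on a given machine.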
The prerequisite course requirements are COMP 182 and COMP 215. COMP 322 should be accessible to anyone familiar with the foundations of sequential algorithms and data structures, and with basic Java programming. COMP 321 is also recommended as a co-requisite.
There are no required textbooks for the class. Instead, lecture handouts are provided for each module as follows. You are expected to read the relevant sections in each lecture handout before coming to the lecture. We will also provide a number of references in the slides and handouts. The links to the latest versions of the lecture handouts are included below:
There are also a few optional textbooks that we will draw from during the course. You are encouraged to get copies of any or all of these books. They will serve as useful references both during and after this course:
Finally, here are some additional resources that may be helpful for you:
Assigned Videos (see Canvas site for video links)
Lecture 1: Task Creation and Termination (Async, Finish)
|Module 1: Section 1.1||worksheet1||lec1-slides|
Lecture 2: Computation Graphs, Ideal Parallelism
|Module 1: Sections 1.2, 1.3||Topic 1.2 Lecture, Topic 1.2 Demonstration, Topic 1.3 Lecture, Topic 1.3 Demonstration||worksheet2||lec2-slides|
|Fri||Jan 14||Lecture 3: Abstract Performance Metrics, Multiprocessor Scheduling||Module 1: Section 1.4||Topic 1.4 Lecture, Topic 1.4 Demonstration||worksheet3||lec3-slides|
Lecture 4: Parallel Speedup and Amdahl's Law
|Module 1: Section 1.5||Topic 1.5 Lecture, Topic 1.5 Demonstration||worksheet4||lec4-slides||Quiz for Unit 1|
|Lecture 5: Future Tasks, Functional Parallelism ("Back to the Future")||Module 1: Section 2.1||Topic 2.1 Lecture, Topic 2.1 Demonstration||worksheet5||lec5-slides|
Lecture 6: Finish Accumulators
|Module 1: Section 2.3||Topic 2.3 Lecture, Topic 2.3 Demonstration||worksheet6||lec6-slides||Quiz for Unit 1|
Lecture 7: Map Reduce
|Module 1: Section 2.4||Topic 2.4 Lecture, Topic 2.4 Demonstration||worksheet7||lec7-slides|
Lecture 8: Computation Graphs, Ideal Parallelism
|Module 1: Sections 1.2, 1.3||Topic 1.2 Lecture, Topic 1.2 Demonstration, Topic 1.3 Lecture, Topic 1.3 Demonstration||worksheet8||lec8-slides|
Lecture 9: Java’s Fork/Join Library
|Topic 2.7 Lecture, Topic 2.7 Demonstration, Topic 2.8 Lecture, Topic 2.8 Demonstration||worksheet9||lec9-slides||Quiz for Unit 2|
|Jan 31||Lecture 9: Data-Driven Tasks|
Module 1: Section 4.5
Topic 4.5 Lecture, Topic 4.5 Demonstration
|Wed||Feb 02||Lecture 10: Event-based programming model|
|Fri||Feb 04||Lecture 11: GUI programming as an example of event-based,|
futures/callbacks in GUI programming
|Lecture 12: Scheduling/executing computation graphs|
Abstract performance metrics
|Module 1: Section 1.4||Topic 1.4 Lecture, Topic 1.4 Demonstration||worksheet12||lec12-slides|
Lecture 13: Lightweight task parallelism. Finish/async
|Module 1: Section 1.1|
Topic 1.1 Lecture, Topic 1.1 Demonstration
|No class: Spring Recess|
|Quiz for Unit 2|
Lecture 14: Parallel Speedup, Critical Path, Amdahl's Law
|Module 1: Section 1.5||Topic 1.5 Lecture, Topic 1.5 Demonstration||worksheet14||lec14-slides|
Lecture 15: Recursive Task Parallelism
Homework 3 (includes one intermediate checkpoint)
Lecture 16: Accumulation and reduction. Finish accumulators
|Module 1: Section 2.3||Topic 2.3 Lecture, Topic 2.3 Demonstration||lec16-slides||Quiz for Unit 3|
Lecture 17: Midterm Review
Lecture 18: Limitations of Functional parallelism.
Lecture 19: Data Races, Functional & Structural Determinism
|Module 1: Sections 2.5, 2.6||Topic 2.5 Lecture, Topic 2.5 Demonstration, Topic 2.6 Lecture, Topic 2.6 Demonstration||worksheet19||lec19-slides|
Lecture 20: Confinement & Monitor Pattern. Critical sections
|Module 2: Sections 5.1, 5.2, 5.6||Topic 5.1 Lecture, Topic 5.1 Demonstration, Topic 5.2 Lecture, Topic 5.2 Demonstration, Topic 5.6 Lecture, Topic 5.6 Demonstration||worksheet20||lec20-slides||Quiz for Unit 4||Quiz for Unit 3|
Lecture 21: N-Body problem, applications and implementations
Lecture 22: Fork/Join programming model. OS Threads. Scheduler Pattern
|Module 2: Sections 2.7, 2.8||Topic 2.7 Lecture, Topic 2.7 Demonstration, Topic 2.8 Lecture, Topic 2.8 Demonstration||worksheet22||lec22-slides|
Quiz for Unit 4
Lecture 23: Locks, Atomic variables
|Module 2: 7.3|
Topic 7.3 Lecture
|worksheet23||lec23-slides||Quiz for Unit 5|
Lecture 24: Parallel Spanning Tree, other graph algorithms
Homework 3, Checkpoint-1
|Lecture 25: Linearizability of Concurrent Objects||Module 2: 7.4||Topic 7.4 Lecture||worksheet25||lec25-slides||Quiz for Unit 6|
Quiz for Unit 5
No class: Spring Break
|Wed||Mar 16||No class: Spring Break|
No class: Spring Break
Lecture 26: Java Locks - Soundness and progress guarantees
|Module 2: 7.5||Topic 7.5 Lecture||worksheet26||lec26-slides||Homework 4 (includes one intermediate checkpoint)||Homework 3 (all)|
Lecture 27: Dining Philosophers Problem
|Module 2: 7.6||Topic 7.4 Lecture, Topic 7.6 Lecture||worksheet27|
Lecture 28: Read-Write Pattern. Read-Write Locks. Fairness & starvation
|Module 2: 7.3, 7.5||Topic 7.3 Lecture, Topic 7.5 Lecture||worksheet28|
Quiz for Unit 7
Lecture 29: Task Affinity and locality. Memory hierarchy
Quiz for Unit 6
Lecture 30: Reactor Pattern. Web servers
Lecture 31: Scan Pattern. Parallel Prefix Sum, uses and algorithms
|worksheet31||lec31-slides||Quiz for Unit 8|
Quiz for Unit 7
|Lecture 32: Data-Parallel Programming model. Loop-Level Parallelism, Loop Chunking||Module 1: Sections 3.1, 3.2, 3.3||Topic 3.1 Lecture, Topic 3.1 Demonstration, Topic 3.2 Lecture, Topic 3.2 Demonstration, Topic 3.3 Lecture, Topic 3.3 Demonstration||worksheet32||lec32-slides|
Homework 4 Checkpoint-1
Lecture 33: Barrier Synchronization with phasers
|Module 1: Section 3.4|
Topic 3.4 Lecture, Topic 3.4 Demonstration
Lecture 34: Stencil computation. Point-to-point Synchronization with Phasers
|Module 1: Sections 4.2, 4.3||Topic 4.2 Lecture, Topic 4.2 Demonstration, Topic 4.3 Lecture, Topic 4.3 Demonstration||worksheet34||lec34-slides|
Quiz for Unit 8
|Lecture 35: Message-Passing programming model with Actors||Module 2: 6.1, 6.2|
Topic 6.1 Lecture, Topic 6.1 Demonstration, Topic 6.2 Lecture, Topic 6.2 Demonstration
|Wed||Apr 13||Lecture 36: Active Object Pattern. Combining Actors with task parallelism||Module 2: 6.3, 6.4|
Topic 6.3 Lecture, Topic 6.3 Demonstration, Topic 6.4 Lecture, Topic 6.4 Demonstration
|worksheet36||lec36-slides||Homework 4 (all)|
|Fri||Apr 15||Lecture 37: Eureka-style Speculative Task Parallelism||worksheet37||lec37-slides|
|14||Mon||Apr 18||Lecture 38: Overview of other models and frameworks||lec38-slides|
|Wed||Apr 20||Lecture 39: Course Review (Lectures 19-38)||lec39-slides|
|Fri||Apr 22||Lecture 40: Course Review (Lectures 19-38)||lec40-slides|
Async-Finish Parallel Programming with abstract metrics
Cutoff Strategy and Real World Performance
No lab this week (Midterm exam)
|5||Mar 07||Loop-level Parallelism||lab5-handout||lab5-intro|
Isolated Statement and Atomic Variables
|7||Mar 28||Java Threads, Java Locks||lab7-handout|
Message Passing Interface (MPI)
Eureka-style Speculative Task Parallelism
Java's ForkJoin Framework
Grading will be based on your performance on four homework assignments (weighted 40% in all), two exams (weighted 40% in all), lab exercises (weighted 10% in all), online quizzes (weighted 5% in all), and in-class worksheets (weighted 5% in all).
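The weighting above can be read as a simple weighted sum. The sketch below is only an illustration of that arithmetic, assuming each category has already been reduced to a single percentage between 0 and 100; it does not cover how individual items within a category are combined or how scores map to letter grades. The class and parameter names are hypothetical.

```java
// Illustrative sketch only: GradeSketch and its parameter names are hypothetical.
// Only the weights are taken from the grading policy stated above.
final class GradeSketch {
    static final double HOMEWORK_WEIGHT = 0.40;
    static final double EXAM_WEIGHT = 0.40;
    static final double LAB_WEIGHT = 0.10;
    static final double QUIZ_WEIGHT = 0.05;
    static final double WORKSHEET_WEIGHT = 0.05;

    // Each argument is the percentage (0 to 100) earned in that category.
    static double courseScore(double homework, double exams, double labs,
                              double quizzes, double worksheets) {
        return HOMEWORK_WEIGHT * homework
                + EXAM_WEIGHT * exams
                + LAB_WEIGHT * labs
                + QUIZ_WEIGHT * quizzes
                + WORKSHEET_WEIGHT * worksheets;
    }

    public static void main(String[] args) {
        // Example inputs are made up for illustration.
        double score = courseScore(90, 80, 100, 100, 100);
        System.out.printf("overall score = %.1f%n", score);
    }
}
```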
The purpose of the homework is to give you practice in solving problems that deepen your understanding of concepts introduced in class. Homework is due on the dates and times specified in the course schedule. No late submissions (other than those using slip days mentioned below) will be accepted.
The slip day policy for COMP 322 is similar to that of COMP 321. All students will be given 3 slip days to use throughout the semester. When you use a slip day, you will receive up to 24 additional hours to complete the assignment. You may use these slip days in any way you see fit (3 days on one assignment, 1 day each on 3 assignments, etc.). Slip days will be tracked using the README.md file. Other than slip days, no extensions will be given except in exceptional circumstances (such as severe illness; having too much other work does not qualify). Such extensions must be requested and approved by the instructor (via e-mail, phone, or in person) before the due date for the assignment. Last-minute requests are likely to be denied.
Labs must be submitted by the following Monday at 11:59pm. Labs must be checked off by a TA.
Worksheets should be completed in Canvas before the start of the following class (for full credit) so that solutions to the worksheets can be discussed in the next class.
You will be expected to follow the Honor Code in all homework and exams. The following policies will apply to different work products in the course:
For grade disputes, please send an email to the course instructors within 7 days of receiving your grade. The email subject should include COMP 322 and the assignment. Please provide enough information in the email so that the instructor does not need to perform a checkout of your code.
Students with disabilities are encouraged to contact me during the first two weeks of class regarding any special needs. Students with disabilities should also contact Disabled Student Services in the Ley Student Center and the Rice Disability Support Services.