You are viewing an old version of this page. View the current version.

Compare with Current View Page History

« Previous Version 35 Next »

COMP 515: Advanced Compilation for Vector and Parallel Processors (Fall 2013)

Instructor:

Prof. K.V. Palem, DH 3096; Prof. Vivek Sarkar, DH 3131 (office hours by appointment)

Assistant:

Sherry Nassar, sherry.nassar@rice.edu, DH 3140, 713-348-2031

Lectures:

Keck Hall (KH) 107, TTh 10:50am-12:05pm

Introduction

Are you curious about what makes a parallelizing compiler tick, and what program properties enable or prevent software from executing efficiently on vector and parallel computers?

This question has taken on new importance with the ubiquity of multicore processors, and the increased interest in heterogeneous accelerators such as GPGPUs and FPGAs. To answer this question, this course will cover the foundations of advanced compilation techniques for vector and parallel computer systems, including the analysis of program dependence, program transformations to enhance parallelism, compiler management of the memory hierarchy, and inter-procedural data flow analysis. At the conclusion of the course, the students should be knowledgeable about the strengths and limitations of state-of-the-art compilers, both from the viewpoint of the compiler user as well as of the compiler designer. The techniques taught in the course should be relevant to anyone interested in enabling software to execute efficiently on vector and parallel computers, whether by hand or through the use of compilers.

Required Prerequisite: COMP 412 or equivalent

Lectures

  1. Lecture 1 (08/27/2013, Profs. Palem & Sarkar): Compiler Challenges for High Performance Architectures (Chapter 1) 
  2. Lecture 2 (08/29/2013, Prof. Palem): Dependence: Theory and Practice (Chapter 2) 
  3. Lecture 3 (09/03/2013, Prof. Palem): Dependence: Theory and Practice (Chapter 2, contd)
  4. Lecture 4 (09/05/2013, Prof. Palem): Dependence Testing (Chapter 3) 
  5. Lecture 5 (09/10/2013, Prof. Sarkar): Dependence Testing (Chapter 3, contd) 
  6. Lecture 6 (09/12/2013, Prof. Sarkar): Dependence Testing & Preliminary Transformations (Chapter 3, contd, Chapter 4)
  7. Lecture 7 (09/17/2013, Prof. Sarkar): Preliminary Transformations & Enhancing Fine-Grained Parallelis (Chapter 4, contd, Chapter 5)
  8. Lecture 8 (09/19/2013, Prof. Sarkar): Vectorization Algorithm & Enhancing Fine-Grained Parallelism (Chapter 2, contd, Chapter 5, contd)
  9. Lecture 9 (09/24/2013, Prof. Sarkar):  Enhancing Fine-Grained Parallelism (Chapter 5, contd)
  10. Lecture 10 (09/26/2013, Prof. Sarkar):  Enhancing Fine-Grained Parallelism, Coarse-Grained Parallelism (Chapter 5, contd, Chapter 6)
  11. Lecture 11 (10/01/2013, Prof. Sarkar):  Coarse-Grained Parallelism (Chapter 6, contd)
  12. Lecture 12 (10/03/2013, Prof. Sarkar):  Coarse-Grained Parallelism (Chapter 6, contd), Control Dependence (Chapter 7)
  13. Lecture 13 (10/08/2013):  Midterm Review
  14. Lecture 14 (10/10/2013, Prof. Sarkar): Unimodular Transformations
  15. Lecture 15 (10/17/2013, Prof. Sarkar): Polyhedral Compilation Framework (Guest lecture by Dr. Louis-Noel Pouchet)
  16. Lecture 16 (10/22/2013, Prof. Palem): Control Dependence (Chapter 7, contd)
  17. Lecture 17 (10/24/2013, Prof. Palem):  Control Dependence (Chapter 7, contd), Compiler Improvement of Register Usage (Chapter 8)
  18. Lecture 18 (10/29/2013, Prof. Palem):  Compiler Improvement of Register Usage (Chapter 8, contd)
  19. Lecture 19 (10/31/2013, Prof. Palem): Compiler Improvement of Register Usage (Chapter 8, contd), Managing Cache (Chapter 9)
  20. Lecture 20 (11/05/2013, Prof. Palem):  Managing Cache (Chapter 9)
  21. Lecture 21 (11/07/2013, Prof. Palem):  Case study: Automatic Selection of High-Order Transformations in the IBM XL Fortran Compiler
  22. Lecture 22 (11/12/2013, Prof. Palem):  Case study: Automatic Selection of High-Order Transformations in the IBM XL Fortran Compiler (contd.)
  23. Lecture 23 (11/14/2013, Prof. Palem):  Interprocedural Analysis and Optimization (Chapter 11)
  24. Lecture 24 (11/19/2013, Prof. Palem):  Compiling Array Assignments (Chapter 13)
  25. Lecture 25 (11/21/2013, Prof. Palem):  End-Semester Summary (Chapters 7, 8, 9, 11, 13)
  26. Lecture 26 (11/26/2013): Student presentations
  27. Lecture 27 (12/03/2013): Student presentations
  28. Lecture 28 (12/05/2013): Student presentations

References

  1. Course text: Optimizing Compilers for Modern Architectures, Randy Allen & Ken Kennedy 
  2. A General Framework for Iteration-Reordering Loop Transformations. Vivek Sarkar  and Radhika Thekkath. Proceedings of the ACM SIGPLAN ’92 Conference on  Programming Language Design and Implementation (PLDI), San Francisco, California,  pages 175-187, June 1992.
  3. Optimal Weighted Loop Fusion for Parallel Programs. Nimrod Megiddo and Vivek  Sarkar. Proceedings of the Ninth Annual ACM Symposium on Parallel Algorithms  and Architecture (SPAA), Newport, Rhode Island, June, 1997, pages 282-291.
  4. Array SSA form and its use in Parallelization. Kathleen Knobe and Vivek Sarkar.  Proceedings of the 25th ACM SIGPLAN-SIGACT Symposium on Principles of  Programming Languages, San Diego, California, January 1998.
  5. Automatic Selection of High Order Transformations in the IBM XL Fortran Compilers.  Vivek Sarkar. IBM Journal of Research and Development, 41(3), May 1997.

Past offerings of COMP 515

  • No labels