Stanford PPL Stanford Engineering


Filter by Project

Publications


Simplifying Scalable Graph Processing with a Domain-Specific Language
Sungpack Hong, Semih Salihoglu, Jennifer Widom, and Kunle Olukotun
CGO'14: International Symposium on Code Generation and Optimization, February 2014.
Paper PDF

On Fast Parallel Detection of Strongly Connected Components (SCC) in Small-World Graphs
Sungpack Hong, Nicole C. Rodia, and Kunle Olukotun
SC'13: International Conference for High Performance Computing, Networking, Storage, and Analysis, November 2013.
Paper PDF | Code | Slides

Forge: Generating a High Performance DSL Implementation from a Declarative Speciļ¬cation
Arvind K. Sujeeth, Austin Gibbons, Kevin J. Brown, HyoukJoong Lee, Tiark Rompf, Martin Odersky, and Kunle Olukotun
GPCE'13: 12th International Conference on Generative Programming: Concepts & Experiences, October 2013.
Paper PDF

Composition and Reuse with Compiled Domain-Specific Languages
Arvind K. Sujeeth, Tiark Rompf, Kevin J. Brown, HyoukJoong Lee, Hassan Chafi, Victoria Popic, Michael Wu, Aleksander Prokopec, Vojin Jovanovic, Martin Odersky, and Kunle Olukotun
ECOOP'13: European Conference on Object-Oriented Programming, July 2013.
Paper PDF

On Fast Parallel Detection of Strongly Connected Components (SCC) in Small-World Graphs
Sungpack Hong, Nicole C. Rodia, Kunle Olukotun
Technical Report, March 2013.
Paper PDF

Optimizing Data Structures in High-Level Programs: New Directions for Extensible Compilers based on Staging
Tiark Rompf, Arvind K. Sujeeth, Nada Amin, Kevin J. Brown, Vojin Jovanovic, HyoukJoong Lee, Manohar Jonnalagedda, Kunle Olukotun, Martin Odersky
POPL'13: 40th ACM SIGPLAN-SIGACT Symposium on Principles of Programming Languages, January 2013.
Paper PDF | Slides

A Case of System-level Hardware/Software Co-design and Co-verification of a Commodity Multi-Processor System with Custom Hardware
Sungpack Hong, Tayo Oguntebi, Jared Casper, Nathan Bronson, Christos Kozyrakis and Kunle Olukotun
CODES+ISSS'12: 17th International Conference on Hardware/Software Codesign and System Synthesis, Oct 2012.
Paper PDF | Slides

Green-Marl: A DSL for Easy and Efficient Graph Analysis
Sungpack Hong, Hassan Chafi, Eric Sedlar, and Kunle Olukotun
ASPLOS '12: 17th International Conference on Architectural Support for Programming Languages and Operating Systems, March 2012.
Paper PDF | Slides

SCD: A Scalable Coherence Directory with Flexible Sharer Set Encoding
Daniel Sanchez and Christos Kozyrakis
HPCA '12: 18th international Symposium on High Performance Computer Architecture, February 2012.
Paper PDF | Slides

Liszt: A Domain Specific Language for Building Portable Mesh-based PDE Solvers
Zachary DeVito, Niels Joubert, Francisco Palacios, Stephen Oakley, Montserrat Medina, Mike Barrientos, Erich Elsen, Frank Ham, Alex Aiken, Karthik Duraisamy, Eric Darve, Juan Alonso, Pat Hanrahan
SC '11: Proceedings of the 2011 ACM/IEEE International Conference for High Performance Computing, Networking, Storage and Analysis, November 2011.
Paper PDF | Slides

CudaDMA: Optimizing GPU Memory Bandwidth via Warp Specialization
Michael Bauer, Henry Cook, and Brucek Khailany
SC '11: Proceedings of the 2011 ACM/IEEE International Conference for High Performance Computing, Networking, Storage and Analysis, November 2011.
Paper PDF | Slides

A Heterogeneous Parallel Framework for Domain-Specific Languages
Kevin J. Brown, Arvind K. Sujeeth, HyoukJoong Lee, Tiark Rompf, Hassan Chafi, Martin Odersky, Kunle Olukotun
PACT '11: 20th International Conference on Parallel Architectures and Compilation Techniques, October 2011.
Paper PDF | Slides

Efficient Parallel Graph Exploration on Multi-Core CPU and GPU
Sungpack Hong, Tayo Oguntebi, Kunle Olukotun
PACT '11: 20th International Conference on Parallel Architectures and Compilation Techniques, October 2011.
Paper PDF | Slides

Dynamic Fine-Grain Scheduling of Pipeline Parallelism
Daniel Sanchez, David Lo, Richard M. Yoo, Jeremy Sugerman, Christos Kozyrakis
PACT '11: 20th International Conference on Parallel Architectures and Compilation Techniques, October 2011.
Paper PDF | Slides

Building-Blocks for Performance Oriented DSLs
Tiark Rompf, Arvind K. Sujeeth, HyoukJoong Lee, Kevin J. Brown, Hassan Chafi, Martin Odersky, Kunle Olukotun
DSL '11: IFIP Working Conference on Domain-Specific Languages, September 2011.
Paper PDF | Slides

Implementing Domain-Specific Languages for Heterogeneous Parallel Computing
HyoukJoong Lee, Kevin J. Brown, Arvind K. Sujeeth, Hassan Chafi, Tiark Rompf, Martin Odersky, Kunle Olukotun
IEEE Micro: Special Issue on CPU, GPU, and Hybrid Computing, September/October 2011.
Paper PDF

Vantage: Scalable and Efficient Fine-Grain Cache Partitioning
Daniel Sanchez, Christos Kozyrakis
ISCA '11: Proceedings of the 38th Intl. Symposium on Computer Architecture, June 2011.
Paper PDF | Slides

OptiML: An Implicitly Parallel Domain-Specific Language for Machine Learning
Arvind K. Sujeeth, HyoukJoong Lee, Kevin J. Brown, Tiark Rompf, Hassan Chafi, Michael Wu, Anand R. Atreya, Martin Odersky, Kunle Olukotun
ICML '11: Proceedings of the 28th Intl. Conference on Machine Learning, June 2011.
Paper PDF | Slides

Hardware Acceleration of Transactional Memory on Commodity Systems
Jared Casper, Tayo Oguntebi, Sungpack Hong, Nathan G. Bronson, Christos Kozyrakis, Kunle Olukotun
ASPLOS '11: Proceedings of the 16th Intl. Conference on Architectural Support for Programming Languages and Operating Systems, March 2011.
Paper PDF | Slides

A Domain-Specific Approach to Heterogeneous Parallelism
Hassan Chafi, Arvind K. Sujeeth, Kevin J. Brown, HyoukJoong Lee, Anand R. Atreya and Kunle Olukotun
PPoPP '11: Proceedings of the 16th Annual Symposium on Principles and Practice of Parallel Programming, February 2011.
Paper PDF | Slides

Accelerating CUDA Graph Algorithms at Maximum Warp
Sungpack Hong, Sang Kyun Kim, Tayo Oguntebi, Kunle Olukotun
PPoPP '11: Proceedings of the 16th Annual Symposium on Principles and Practice of Parallel Programming, February 2011.
Paper PDF | Slides

Programming the Memory Hierarchy Revisited: Supporting Irregular Parallelism in Sequoia
Michael Bauer, John Clark, Eric Schkufza, Alex Aiken
PPoPP '11: Proceedings of the 16th Annual Symposium on Principles and Practice of Parallel Programming, February 2011.
Paper PDF

The ZCache: Decoupling Ways and Associativity
Daniel Sanchez, Christos Kozyrakis
Micro '10: Proceedings of the 43rd Intl. Symposium on Microarchitecture, December 2010.
Paper PDF | Slides

EigenBench: A Simple Exploration Tool for Orthogonal TM Characterisitics
Sungpack Hong, Tayo Oguntebi, Jared Casper, Nathan Bronson, Christos Koyrakis, Kunle Olukotun
IISWC '10: Proceedings of the IEEE International Symposium on Workload Characteristics, December 2010. (best paper award)
Paper PDF | Slides

Language Virtualization for Heterogeneous Parallel Computing
Hassan Chafi, Zach DeVito, Adriaan Moors, Tiark Rompf, Arvind K. Sujeeth, Pat Hanrahan, Martin Odersky, Kunle Olukotun
Onward! '10: Proceedings of the ACM International Conference on Object Oriented Programming Systems Languages and Applications, October 2010.
Paper PDF | Slides

Transactional Predication: High-Performance Concurrent Sets and Maps for STM
Nathan G. Bronson, Jared Casper, Hassan Chafi, Kunle Olukotun
PODC '10: Proceedings of the 29th Annual ACM Conference on Principles of Distributed Computing, July 2010.
Paper PDF | Slides

Understanding Sources of Inefficiency in General-Purpose Chips
Rehan Hameed, Wajahat Qadeer, Megan Wachs, Omid Azizi, Alex Solomatnikov, Benjamin C. Lee, Stephen Richardson, Christos Kozyrakis, Mark Horowitz
ISCA '10: Proceedings of the 37th Intl. Symposium on Computer Architecture, June 2010.
Paper PDF | Slides

Making Nested Parallel Transactions Practical using Lightweight Hardware Support
Woongki Baek, Nathan Bronson, Christos Kozyrakis, Kunle Olukotun
ICS '10: Proceedings of the 24th Intl. Conference on Supercomputing, June 2010.
Paper PDF | Slides

Implementing and Evaluating Nested Parallel Transactions in Software Transactional Memory
Woongki Baek, Nathan Bronson, Christos Kozyrakis, Kunle Olukotun
SPAA '10: Proceedings of the 22nd ACM Symposium on Parallelism in Algorithms and Architectures, June 2010.
Paper PDF | Slides

FARM: A Prototyping Environment for Tightly-Coupled, Heterogeneous Architectures
Tayo Oguntebi, Sungpack Hong, Jared Casper, Nathan Bronson, Christos Kozyrakis, Kunle Olukotun
FCCM '10: The 18th Annual International IEEE Symposium on Field-Programmable Custom Computing Machines, May 2010.
Paper PDF | Slides

A Large-scale Architecture for Restricted Boltzmann Machines
Sang Kyun Kim, Peter L. McMahon, Kunle Olukotun
FCCM '10: Proceedings of the IEEE Symposium on Field-Programmable Custom Computing Machines, May 2010.
Paper PDF

Evaluating Bufferless Flow Control for On-Chip Networks
George Michelogiannakis, Daniel Sanchez, William J. Dally, Christos Kozyrakis
NOCS '10: Proceedings of the 4th ACM/IEEE International Symposium on Networks-on-Chip, May 2010.
Paper PDF | Slides

CCSTM: A Library-Based STM for Scala
Nathan G. Bronson, Hassan Chafi, Kunle Olukotun
The First Annual Scala Workshop at Scala Days 2010, April 2010.
Paper PDF | Slides

An Analysis of On-Chip Interconnection Networks for Large-Scale Chip Multiprocessors
Daniel Sanchez, George Michelogiannakis, Christos Kozyrakis.
TACO: ACM Transactions on Architecture and Code Optimization, vol. 7, no. 1, April 2010.
Paper PDF

Flexible Architectural Support for Fine-Grain Scheduling
Daniel Sanchez, Richard Yoo, Christos Kozyrakis
ASPLOS '10: Proceedings of the 15th Intl. Conference on Architectural Support for Programming Languages and Operating Systems, March 2010.
Paper PDF

Implementing and Evaluating a Model Checker for Transactional Memory Systems
Woongki Baek, Nathan G. Bronson, Christos Kozyrakis, and Kunle Olukotun
ICECCS '10: Proceedings of the 15th IEEE International Conference on Engineering of Complex Computer Systems, March 2010.
Paper PDF

A Practical Concurrent Binary Search Tree.
Nathan G. Bronson, Jared Casper, Hassan Chafi, and Kunle Olukotun
PPoPP '10: Proceedings of the 15th Annual Symposium on Principles and Practice of Parallel Programming, January 2010.
Paper PDF | Slides

Phoenix Rebirth: Scalable MapReduce on a Large-Scale Shared-Memory System
Richard M. Yoo, Anthony Romano, and Christos Kozyrakis
IISWC '09: Proceedings of the 2009 IEEE International Symposium on Workload Characterization, October 2009.
Paper PDF

A Highly Scalable Restricted Boltzmann Machine FPGA Implementation
Sang Kyun Kim, Lawrence C. McAfee, Peter L. McMahon, Kunle Olukotun
FPL '09: Proceedings of the IEEE Conference on Field Programmable Logic and Applications, September 2009.
Paper PDF

Feedback-Directed Barrier Optimization in a Strongly Isolated STM
Nathan G. Bronson, Christos Kozyrakis, and Kunle Olukotun
POPL '09: Proceedings of the 36th Annual ACM SIGPLAN-SIGACT Symposium on Principals of Programming Languages, January 2009.
Paper PDF | Slides

GRAMPS: A Programming Model for Graphics Pipelines
Jeremy Sugerman, Kayvon Fatahalian, Solomon Boulos, Kurt Akeley, and Pat Hanrahan
ACM Transactions on Graphics, January 2009.
Paper PDF