Travelled to:
1 × Hungary
1 × Ireland
1 × Japan
1 × United Kingdom
2 × France
21 × USA
3 × Canada
Collaborated with:
W.Thies Q.Zhao ∅ D.Bruening J.Ansel M.S.Lam M.Stephenson W.Wong M.Karczmarek C.Mendis R.M.Rabbah M.Olszewski F.Kjolstad S.Chou S.Kamil J.Bosboom S.Larsen J.Ragan-Kelley S.Rajadurai J.Babb J.Sermulins M.I.Gordon A.A.Lamb U.O'Reilly L.Rudolph D.Lugato W.Lee R.Baghdadi T.Garnett J.M.Anderson D.E.Maydan Y.L.Wong C.P.Chan A.Edelman S.Paris P.M.Phothilimthana M.B.Taylor A.Agarwal M.C.Martin F.Vivien J.Sheldon A.Jain P.Jain D.Koh Ariya Shajii Ibrahim Numanagic B.Berger Y.Ding K.Veeramachaneni X.Shen C.Barnes A.Adams F.Durand Yunming Zhang Mengjiao Yang J.Shun R.Barua M.Frank D.Srikrishna V.Sarkar K.Wu J.Lin A.S.Meli C.Leger J.Wong H.Hoffmann D.Maze J.H.Perkins S.Kim J.Bachrach M.Carbin C.Pacheco F.Sherwood S.Sidiroglou G.Sullivan Y.Zibin M.D.Ernst M.C.Rinard
Talks about:
compil (12) stream (11) program (8) optim (8) architectur (5) parallel (5) perform (5) languag (5) algebra (4) tensor (4)
Person: Saman P. Amarasinghe
DBLP: Amarasinghe:Saman_P=
Facilitated 3 volumes:
Contributed to:
Wrote 44 papers:
- PLDI-2015-DingAVSOA #algorithm
- Autotuning algorithmic choice for input sensitivity (YD, JA, KV, XS, UMO, SPA), pp. 379–390.
- PLDI-2015-MendisBWKRPZA #domain-specific language #kernel #named
- Helium: lifting high-performance stencil kernels from stripped x86 binaries to halide DSL code (CM, JB, KW, SK, JRK, SP, QZ, SPA), pp. 391–402.
- OOPSLA-2014-BosboomRWA #compilation #named #programming
- StreamJIT: a commensal compiler for high-performance stream programming (JB, SR, WFW, SPA), pp. 177–195.
- ASPLOS-2013-PhothilimthanaARA #architecture #performance
- Portable performance on heterogeneous architectures (PMP, JA, JRK, SPA), pp. 431–444.
- PLDI-2013-Ragan-KelleyBAPDA #compilation #image #locality #named #optimisation #parallel #pipes and filters
- Halide: a language and compiler for optimizing parallelism, locality, and recomputation in image processing pipelines (JRK, CB, AA, SP, FD, SPA), pp. 519–530.
- ASPLOS-2012-OlszewskiZKAA #analysis #named
- Aikido: accelerating shared data dynamic analyses (MO, QZ, DK, JA, SPA), pp. 173–184.
- CGO-2011-AnselWCOEA #algorithm #compilation
- Language and compiler support for auto-tuning variable-accuracy algorithms (JA, YLW, CPC, MO, AE, SPA), pp. 85–96.
- CGO-2010-ZhaoBA #memory management #named #performance #scalability
- Umbra: efficient and scalable memory shadowing (QZ, DB, SPA), pp. 22–31.
- ISMM-2010-ZhaoBA #architecture #memory management #performance
- Efficient memory shadowing for 64-bit architectures (QZ, DB, SPA), pp. 93–102.
- ASPLOS-2009-OlszewskiAA #multi #named #performance #thread
- Kendo: efficient deterministic multithreading in software (MO, JA, SPA), pp. 97–108.
- PLDI-2009-AnselCWOZEA #algorithm #compilation #named
- PetaBricks: a language and compiler for algorithmic choice (JA, CPC, YLW, MO, QZ, AE, SPA), pp. 38–49.
- SOSP-2009-PerkinsKLABCPSSSWZER #automation #fault
- Automatically patching errors in deployed software (JHP, SK, SL, SPA, JB, MC, CP, FS, SS, GS, WFW, YZ, MDE, MCR), pp. 87–102.
- CC-2008-ZhaoRARW #debugging #how #performance #using
- How to Do a Million Watchpoints: Efficient Debugging Using Dynamic Instrumentation (QZ, RMR, SPA, LR, WFW), pp. 147–162.
- CGO-2007-ZhaoRARW #memory management #ubiquitous
- Ubiquitous Memory Introspection (QZ, RMR, SPA, LR, WFW), pp. 299–311.
- ASPLOS-2006-GordonTA #parallel #pipes and filters #source code
- Exploiting coarse-grained task, data, and pipeline parallelism in stream programs (MIG, WT, SPA), pp. 151–162.
- CGO-2005-Amarasinghe #compilation #multi #question
- Multicores from the Compiler’s Perspective: A Blessing or a Curse? (SPA), p. 137.
- CGO-2005-BrueningA #bound #capacity #consistency #maintenance
- Maintaining Consistency and Bounding Capacity of Software Code Caches (DB, SPA), pp. 74–85.
- CGO-2005-StephensonA #classification #predict #using
- Predicting Unroll Factors Using Supervised Classification (MS, SPA), pp. 123–134.
- LCTES-2005-SermulinsTRA #optimisation #source code
- Cache aware optimization of stream programs (JS, WT, RMR, SPA), pp. 115–126.
- PPoPP-2005-ThiesKSRA #distributed #source code
- Teleport messaging for distributed stream programs (WT, MK, JS, RMR, SPA), pp. 224–235.
- CGO-2003-BrueningGA #adaptation #framework #optimisation
- An Infrastructure for Adaptive Dynamic Optimization (DB, TG, SPA), pp. 265–275.
- HPCA-2003-TaylorLAA #architecture #network
- Scalar Operand Networks: On-Chip Interconnect for ILP in Partitioned Architecture (MBT, WL, SPA, AA), pp. 341–353.
- LCTES-2003-KarczmarekTA #scheduling #source code
- Phased scheduling of stream programs (MK, WT, SPA), pp. 103–112.
- PLDI-2003-LambTA #analysis #linear #optimisation #source code
- Linear analysis and optimization of stream programs (AAL, WT, SPA), pp. 12–25.
- PLDI-2003-StephensonAMO #compilation #heuristic #machine learning #optimisation
- Meta optimization: improving compiler heuristics with machine learning (MS, SPA, MCM, UMO), pp. 77–90.
- ASIA-PEPM-2002-Amarasinghe #architecture #compilation
- Defying the speed of light: : a spatially-aware compiler for wire-exposed architectures (SPA), p. 70.
- ASPLOS-2002-GordonTKLMLLWHMA #architecture #compilation
- A stream compiler for communication-exposed architectures (MIG, WT, MK, JL, ASM, AAL, CL, JW, HH, DM, SPA), pp. 291–303.
- CC-2002-ThiesKA #named #streaming
- StreamIt: A Language for Streaming Applications (WT, MK, SPA), pp. 179–196.
- PLDI-2001-ThiesVSA #framework #optimisation
- A Unified Framework for Schedule and Storage Optimization (WT, FV, JS, SPA), pp. 232–242.
- PLDI-2000-LarsenA #parallel #set
- Exploiting superword level parallelism with multimedia instruction sets (SL, SPA), pp. 145–156.
- PLDI-2000-StephensonBA #analysis #compilation
- Bitwidth analysis with application to silicon compilation (MS, JB, SPA), pp. 108–120.
- ASPLOS-1998-LeeBFSBSA #parallel #scheduling
- Space-Time Scheduling of Instruction-Level Parallelism on a Raw Machine (WL, RB, MF, DS, JB, VS, SPA), pp. 46–57.
- PPoPP-1995-AndersonAL #multi
- Data and Computation Transformations for Multiprocessors (JAMA, SPA, MSL), pp. 166–178.
- PLDI-1993-AmarasingheL #code generation #communication #distributed #memory management #optimisation
- Communication Optimization and Code Generation for Distributed Memory Machines (SPA, MSL), pp. 126–138.
- POPL-1993-MaydanAL #analysis #array #data flow
- Array Data-Flow Analysis and its Use in Array Privatization (DEM, SPA, MSL), pp. 2–15.
- ASE-2017-KjolstadCLKA #algebra #kernel #named
- taco: a tool to generate tensor algebra kernels (FK, SC, DL, SK, SPA), pp. 943–948.
- CC-2019-Amarasinghe #algebra #compilation
- The sparse tensor algebra compiler (keynote) (SPA), p. 1.
- CC-2019-MendisJJA #named
- Revec: program rejuvenation through revectorization (CM, AJ, PJ, SPA), pp. 29–41.
- OOPSLA-2017-KjolstadKCLA #algebra #compilation
- The tensor algebra compiler (FK, SK, SC, DL, SPA), p. 29.
- OOPSLA-2018-ChouKA #abstraction #algebra #compilation
- Format abstraction for sparse tensor algebra compilers (SC, FK, SPA), p. 30.
- OOPSLA-2018-MendisA #framework #named #parallel
- goSLP: globally optimized superword level parallelism framework (CM, SPA), p. 28.
- OOPSLA-2018-ZhangYBKSA #domain-specific language #graph #named
- GraphIt: a high-performance graph DSL (YZ, MY, RB, SK, JS, SPA), p. 30.
- OOPSLA-2019-ShajiiNBBA #named
- Seq: a high-performance language for bioinformatics (AS, IN, RB, BB, SPA), p. 29.
- ASPLOS-2018-RajaduraiBWA #configuration management #named #source code
- Gloss: Seamless Live Reconfiguration and Reoptimization of Stream Programs (SR, JB, WFW, SPA), pp. 98–112.