R. Govindarajan, David A. Padua, Mary W. Hall
Proceedings of the 15th Symposium on Principles and Practice of Parallel Programming
PPoPP, 2010.
@proceedings{PPoPP-2010, acmid = "1693453", address = "Bangalore, India", editor = "R. Govindarajan and David A. Padua and Mary W. Hall", isbn = "978-1-60558-877-3", publisher = "{ACM}", title = "{Proceedings of the 15th Symposium on Principles and Practice of Parallel Programming}", year = 2010, }
Contents (49 items)
- PPoPP-2010-Agerwala #challenge
- Exascale computing: the challenges and opportunities in the next decade (TA), pp. 1–2.
- PPoPP-2010-Mendez-LojoNPSHKBP #optimisation #source code
- Structure-driven optimizations for amorphous data-parallel programs (MML, DN, DP, XS, MAH, MK, MB, KP), pp. 3–14.
- PPoPP-2010-CoonsBM #concurrent #effectiveness #library #named #testing
- GAMBIT: effective unit testing for concurrency libraries (KEC, SB, MM), pp. 15–24.
- PPoPP-2010-LeeP #calculus #parallel
- Featherweight X10: a core calculus for async-finish parallelism (JKL, JP), pp. 25–36.
- PPoPP-2010-MannarswamyCRS #compilation #memory management #performance #transaction
- Compiler aided selective lock assignment for improving the performance of software transactional memory (SM, DRC, KR, SS), pp. 37–46.
- PPoPP-2010-RossbachHW #programming #question #transaction
- Is transactional programming actually easier? (CJR, OSH, EW), pp. 47–56.
- PPoPP-2010-ZyulkyarovHUCV #debugging #memory management #source code #transaction
- Debugging programs that use atomic blocks and transactional memory (FZ, TH, OSÜ, AC, MV), pp. 57–66.
- PPoPP-2010-DalessandroSS #named
- NOrec: streamlining STM by abolishing ownership records (LD, MFS, MLS), pp. 67–78.
- PPoPP-2010-MaldonadoMFSHFLM #memory management #scheduling #transaction
- Scheduling support for transactional memory contention management (WM, PM, PF, AS, DH, AF, JLL, GM), pp. 79–90.
- PPoPP-2010-BarretoDFGK #memory management #parallel #transaction
- Leveraging parallel nesting in transactional memory (JB, AD, PF, RG, MK), pp. 91–100.
- PPoPP-2010-TorrellasGMOS #challenge
- Extreme scale computing: challenges and opportunities (JT, BG, JHM, KO, VS), pp. 101–102.
- PPoPP-2010-Arvind #hardware #question
- Is hardware innovation over? (A), pp. 103–104.
- PPoPP-2010-BaghsorkhiDPGH #adaptation #architecture #gpu #modelling #performance
- An adaptive performance modeling tool for GPU architectures (SSB, MD, SJP, WDG, WmWH), pp. 105–114.
- PPoPP-2010-ChoiSV #modelling #multi
- Model-driven autotuning of sparse matrix-vector multiply on GPUs (JC, AS, RWV), pp. 115–126.
- PPoPP-2010-ZhangCO #gpu #performance
- Fast tridiagonal solvers on the GPU (YZ, JC, JDO), pp. 127–136.
- PPoPP-2010-SandesM #comparison #gpu #named #sequence #using
- CUDAlign: using GPU to accelerate the comparison of megabase genomic sequences (EFdOS, ACMAdM), pp. 137–146.
- PPoPP-2010-HofmeyrIB
- Load balancing on speed (SAH, CI, FB), pp. 147–158.
- PPoPP-2010-HoeflerSL #communication #protocol #scalability
- Scalable communication protocols for dynamic sparse data exchange (TH, CS, AL), pp. 159–168.
- PPoPP-2010-RomeinBMN #analysis #implementation #performance
- The LOFAR correlator: implementation and performance analysis (JWR, PCB, JDM, RvN), pp. 169–178.
- PPoPP-2010-TzannesCBV #adaptation #lazy evaluation #runtime
- Lazy binary-splitting: a run-time adaptive work-stealing scheduler (AT, GCC, RB, UV), pp. 179–190.
- PPoPP-2010-RadojkovicCVPCNV #concurrent #network #parallel #thread
- Thread to strand binding of parallel network applications in massive multi-threaded systems (PR, VC, JV, AP, FJC, MN, MV), pp. 191–202.
- PPoPP-2010-ZhangJS #matter #parallel #performance #question #source code #thread
- Does cache sharing on modern CMP matter to the performance of contemporary multithreaded programs? (EZZ, YJ, XS), pp. 203–212.
- PPoPP-2010-LiuL #algorithm #locality #parallel
- Improving parallelism and locality with asynchronous algorithms (LL, ZL), pp. 213–222.
- PPoPP-2010-CastaldoW #parallel #scalability #using
- Scaling LAPACK panel operations using parallel cache assignment (AMC, RCW), pp. 223–232.
- PPoPP-2010-SutherlandS #composition #concurrent #thread
- Composable thread coloring (DFS, WLS), pp. 233–244.
- PPoPP-2010-AgrawalLS #parallel #programming
- Helper locks for fork-join parallel programming (KA, CEL, JS), pp. 245–256.
- PPoPP-2010-BronsonCCO #concurrent
- A practical concurrent binary search tree (NGB, JC, HC, KO), pp. 257–268.
- PPoPP-2010-TallentMP #parallel #thread
- Analyzing lock contention in multithreaded applications (NRT, JMMC, AP), pp. 269–280.
- PPoPP-2010-UpadhyayaMP #data type #generative #performance #using
- Using data structure knowledge for efficient lock generation and strong atomicity (GU, SPM, VSP), pp. 281–292.
- PPoPP-2010-AliMP #algorithm #communication #modelling
- Modeling advanced collective communication algorithms on cell-based systems (QA, SPM, VSP), pp. 293–304.
- PPoPP-2010-ZhaiCZ #named #parallel #performance #predict #scalability #using
- PHANTOM: predicting performance of parallel applications on large-scale parallel machines using a single node (JZ, WC, WZ), pp. 305–314.
- PPoPP-2010-AleenSP #execution #predict #streaming
- Input-driven dynamic execution prediction of streaming applications (FA, MS, SP), pp. 315–324.
- PPoPP-2010-LupeiSPMBKA #game studies #memory management #parallel #scalability #towards #transaction #using
- Towards scalable and transparent parallelization of multiplayer games using transactional memory support (DL, BS, DP, MM, MB, WK, CA), pp. 325–326.
- PPoPP-2010-PerarnauH #cpu #generative #named
- KRASH: reproducible CPU load generation on many cores machines (SP, GH), pp. 327–328.
- PPoPP-2010-MuralidharaKR #clustering #parallel #thread
- Intra-application shared cache partitioning for multithreaded applications (SPM, MTK, PR), pp. 329–330.
- PPoPP-2010-DashD #distributed #memory management #transaction
- Symbolic prefetching in transactional distributed shared memory (AD, BD), pp. 331–332.
- PPoPP-2010-Chakrabarti #abstraction #analysis #effectiveness #performance #source code
- New abstractions for effective performance analysis of STM programs (DRC), pp. 333–334.
- PPoPP-2010-ZhangDGKBF #parallel
- Continuous speculative program parallelization in software (CZ, CD, XG, KK, TB, XF), pp. 335–336.
- PPoPP-2010-MarjanovicLAV #communication #effectiveness #hybrid
- Effective communication and computation overlap with hybrid MPI/SMPSs (VM, JL, EA, MV), pp. 337–338.
- PPoPP-2010-CedermanT #composition #concurrent
- Supporting lock-free composition of concurrent data objects (DC, PT), pp. 339–340.
- PPoPP-2010-GuoZCS #adaptation #manycore #named #scalability
- SLAW: a scalable locality-aware adaptive work-stealing scheduler for multi-core systems (YG, YZ, VC, VS), pp. 341–342.
- PPoPP-2010-YangXKZ #compilation #optimisation #source code
- An optimizing compiler for GPGPU programs with input-data sharing (YY, PX, JK, HZ), pp. 343–344.
- PPoPP-2010-ChandramowlishwaranKV #algebra #concurrent #linear #parallel #programming
- Applying the concurrent collections programming model to asynchronous parallel dense linear algebra (AC, KK, RWV), pp. 345–346.
- PPoPP-2010-HoffmannESMA #health #performance
- Application heartbeats for software performance and health (HH, JE, MDS, JEM, AA), pp. 347–348.
- PPoPP-2010-PorterW #memory management #modelling #performance #transaction
- Modeling transactional memory workload performance (DEP, EW), pp. 349–350.
- PPoPP-2010-CarterGG #library
- The pilot library for novice MPI programmers (JDC, WBG, GG), pp. 351–352.
- PPoPP-2010-JangMSDK #architecture #data transformation #parallel #thread
- Data transformations enabling loop vectorization on multithreaded data parallel architectures (BJ, PM, DS, RD, DRK), pp. 353–354.
- PPoPP-2010-BuehrerPT #distributed
- A distributed placement service for graph-structured and tree-structured data (GB, SP, ST), pp. 355–356.
- PPoPP-2010-LiGKQ #source code #verification
- A symbolic verifier for CUDA programs (GL, GG, RMK, DQ), pp. 357–358.
14 ×#parallel
10 ×#performance
8 ×#transaction
7 ×#memory management
6 ×#concurrent
6 ×#named
6 ×#source code
6 ×#thread
5 ×#scalability
5 ×#using
10 ×#performance
8 ×#transaction
7 ×#memory management
6 ×#concurrent
6 ×#named
6 ×#source code
6 ×#thread
5 ×#scalability
5 ×#using