Alex Nicolau, Xiaowei Shen, Saman P. Amarasinghe, Richard W. Vuduc
Proceedings of the 18th Symposium on Principles and Practice of Parallel Programming
PPoPP, 2013.
@proceedings{PPoPP-2013,
	acmid = "2442516",
	address = "Shenzhen, China",
	editor = "Alex Nicolau and Xiaowei Shen and Saman P. Amarasinghe and Richard W. Vuduc",
	isbn = "978-1-4503-1922-5",
	publisher = "{ACM}",
	title = "{Proceedings of the 18th Symposium on Principles and Practice of Parallel Programming}",
	year = 2013,
}
Contents (45 items)
- PPoPP-2013-YangXFGLXLSYZ #algorithm #cpu #gpu #simulation
- A peta-scalable CPU-GPU algorithm for global atmospheric simulations (CY, WX, HF, LG, LL, YX, YL, JS, GY, WZ), pp. 1–12.
- PPoPP-2013-LifflanderMK #detection #fault tolerance #protocol #termination
- Adoption protocols for fanout-optimal fault-tolerant termination detection (JL, PM, LVK), pp. 13–22.
- PPoPP-2013-YukiFRS #analysis #array #data flow #source code
- Array dataflow analysis for polyhedral X10 programs (TY, PF, SVR, VS), pp. 23–34.
- PPoPP-2013-PrountzosP #algorithm #implementation
- Betweenness centrality: algorithms and implementations (DP, KP), pp. 35–46.
- PPoPP-2013-XiangS #compilation #concurrent #data type #performance
- Compiler aided manual speculation for high performance concurrent data structures (LX, MLS), pp. 47–56.
- PPoPP-2013-WuZZJS #algorithm #analysis #complexity #design #gpu #memory management
- Complexity analysis and algorithm design for reorganizing data to minimize non-coalesced memory accesses on GPU (BW, ZZ, EZZ, YJ, XS), pp. 57–68.
- PPoPP-2013-LePCN #memory management #modelling #performance
- Correct and efficient work-stealing for weak memory models (NML, AP, AC, FZN), pp. 69–80.
- PPoPP-2013-BergstromFRRRS #parallel
- Data-only flattening for nested data parallelism (LB, MF, MR, JHR, SR, AS), pp. 81–92.
- PPoPP-2013-MorozovW #distributed
- Distributed merge trees (DM, GHW), pp. 93–102.
- PPoPP-2013-MorrisonA #concurrent #performance
- Fast concurrent queues for x86 processors (AM, YA), pp. 103–112.
- PPoPP-2013-WamhoffFFRM #concurrent #memory management #named #performance #thread #transaction
- FastLane: improving performance of software transactional memory for low thread counts (JTW, CF, PF, ER, GM), pp. 113–122.
- PPoPP-2013-BartheCKGM #relational #synthesis #verification
- From relational verification to SIMD loop synthesis (GB, JMC, SG, CK, MM), pp. 123–134.
- PPoPP-2013-ShunB #framework #graph #lightweight #memory management #named
- Ligra: a lightweight graph processing framework for shared memory (JS, GEB), pp. 135–146.
- PPoPP-2013-NasreBP #algorithm
- Morph algorithms on GPUs (RN, MB, KP), pp. 147–156.
- PPoPP-2013-CalciuDLLMS
- NUMA-aware reader-writer locks (IC, DD, YL, VL, VJM, NS), pp. 157–166.
- PPoPP-2013-Chen #algorithm #detection #fault #fault tolerance #named #online
- Online-ABFT: an online algorithm based fault tolerance scheme for soft error detection in iterative methods (ZC), pp. 167–176.
- PPoPP-2013-FriedleyHBLM #distributed #manycore #memory management #performance #programming
- Ownership passing: efficient distributed memory programming on multi-core systems (AF, TH, GB, AL, CCM), pp. 177–186.
- PPoPP-2013-MeyerovichTAB #attribute grammar #parallel #synthesis
- Parallel schedule synthesis for attribute grammars (LAM, MET, EA, RB), pp. 187–196.
- PPoPP-2013-DeoK #array #gpu #parallel
- Parallel suffix array and least common prefix for the GPU (MD, SK), pp. 197–206.
- PPoPP-2013-ChenC #parallel #scalability
- Scalable deterministic replay in a parallel full-system emulator (YC, HC), pp. 207–218.
- PPoPP-2013-AcarCR #parallel #scheduling #source code
- Scheduling parallel programs by work stealing with private deques (UAA, AC, MR), pp. 219–228.
- PPoPP-2013-YanLZ #algorithm #named #performance
- StreamScan: fast scan algorithms for GPUs without global barrier synchronization (SY, GL, YZ), pp. 229–238.
- PPoPP-2013-HeumannAW #concurrent
- The tasks with effects model for safe concurrency (SH, VSA, SW), pp. 239–250.
- PPoPP-2013-BonettaBP #javascript #named #parallel
- TigerQuoll: parallel event-based JavaScript (DB, WB, CP), pp. 251–260.
- PPoPP-2013-DiceLLLM #algorithm #hardware #memory management #transaction #using
- Using hardware transactional memory to correct and simplify a readers-writer lock algorithm (DD, YL, YL, VL, MM), pp. 261–270.
- PPoPP-2013-CascavalFMPRRWB #manycore #mobile #named #parallel #web
- ZOOMM: a parallel web browser engine for multicore mobile devices (CC, SF, PMO, WP, MR, BR, MW, VB), pp. 271–280.
- PPoPP-2013-GrassoKCF #automation #clustering #parallel #problem
- Automatic problem size sensitive task partitioning on heterogeneous parallel systems (IG, KK, BC, TF), pp. 281–282.
- PPoPP-2013-LiuDJK #architecture #layout #optimisation
- Data layout optimization for GPGPU architectures (JL, WD, OJ, MTK), pp. 283–284.
- PPoPP-2013-PadmanabhanCC #composition #streaming
- Decomposition techniques for optimal design-space exploration of streaming applications (SP, YC, RDC), pp. 285–286.
- PPoPP-2013-YuB #automaton #performance #regular expression
- Exploring different automata representations for efficient regular expression matching on GPUs (XY, MB), pp. 287–288.
- PPoPP-2013-EdmondsWL #algorithm #graph #using
- Expressing graph algorithms using generalized active messages (NE, JW, AL), pp. 289–290.
- PPoPP-2013-LuM #migration #parallel
- Multi-level parallel computing of reverse time migration for seismic imaging on Blue Gene/Q (LL, KAM), pp. 291–292.
- PPoPP-2013-ParkST #parallel #programming
- Parallel programming with big operators (CP, GLSJ, JBT), pp. 293–294.
- PPoPP-2013-AfekLM #hardware #programming
- Programming with hardware lock elision (YA, AL, AM), pp. 295–296.
- PPoPP-2013-LuZWZ0 #multi #named #performance #thread
- RaceFree: an efficient multi-threading model for determinism (KL, XZ, XW, WZ, GL), pp. 297–298.
- PPoPP-2013-ShunBFG
- Reducing contention through priority updates (JS, GEB, JTF, PBG), pp. 299–300.
- PPoPP-2013-DiamosWWLY #algorithm #multi #relational
- Relational algorithms for multi-bulk-synchronous processors (GFD, HW, JW, AL, SY), pp. 301–302.
- PPoPP-2013-CarvalhoC #memory management #runtime #transaction
- Runtime elision of transactional barriers for captured memory (FMC, JPC), pp. 303–304.
- PPoPP-2013-ParkSI #concurrent #detection #scalability #source code
- Scalable data race detection for partitioned global address space programs (CSP, KS, CI), pp. 305–306.
- PPoPP-2013-DiceLM #scalability #statistics
- Scalable statistics counters (DD, YL, MM), pp. 307–308.
- PPoPP-2013-WozniakAWKLF #data flow #named #programming #scalability
- Swift/T: scalable data flow programming for many-task applications (JMW, TGA, MW, DSK, ELL, ITF), pp. 309–310.
- PPoPP-2013-CaiZWC #detection #named #parallel #source code #thread
- TeamWork: synchronizing threads globally to detect real deadlocks for multithreaded programs (YC, KZ, SW, WKC), pp. 311–312.
- PPoPP-2013-DiouriGLC #energy #fault tolerance #protocol #towards
- Towards an energy estimator for fault tolerance protocols (MeMD, OG, LL, FC), pp. 313–314.
- PPoPP-2013-0003CTT #configuration management #scheduling
- Work-stealing with configurable scheduling strategies (MW, DC, JLT, PT), pp. 315–316.
- PPoPP-2013-ZhouKB #debugging #effectiveness #named #scalability
- WuKong: effective diagnosis of bugs at large system scales (BZ, MK, SB), pp. 317–318.
11 ×#parallel
10 ×#named
9 ×#algorithm
8 ×#performance
7 ×#memory management
5 ×#concurrent
5 ×#scalability
4 ×#detection
4 ×#programming
4 ×#source code