Kunle Olukotun, Aaron Smith, Robert Hundt, Jason Mars
Proceedings of the 13th International Symposium on Code Generation and Optimization
CGO, 2015.
@proceedings{CGO-2015, acmid = "2738600", address = "San Francisco, California, USA", editor = "Kunle Olukotun and Aaron Smith and Robert Hundt and Jason Mars", ee = "http://www.computer.org/csdl/proceedings/cgo/2015/8161/00/index.html", isbn = "978-1-4799-8161-8", publisher = "{IEEE Computer Society}", title = "{Proceedings of the 13th International Symposium on Code Generation and Optimization}", year = 2015, }
Event page: http://cgo.org/cgo2015/
Contents (24 items)
- CGO-2015-JiaoLHM #concurrent #energy #execution #kernel
- Improving GPGPU energy-efficiency through concurrent kernel execution and DVFS (QJ, ML, HPH, TM), pp. 1–11.
- CGO-2015-FauziaPS #memory management
- Characterizing and enhancing global memory data coalescing on GPUs (NF, LNP, PS), pp. 12–22.
- CGO-2015-LiYLZ #automation #gpu #memory management
- Automatic data placement into GPU on-chip memory resources (CL, YY, ZL, HZ), pp. 23–33.
- CGO-2015-DeweyKH #interpreter #javascript #parallel
- A parallel abstract interpreter for JavaScript (KD, VK, BH), pp. 34–45.
- CGO-2015-StepanovS #c++ #detection #memory management #named #performance
- MemorySanitizer: fast detector of uninitialized memory use in C++ (ES, KS), pp. 46–55.
- CGO-2015-ZhengLHWJ #approach #debugging #manycore #on the #performance
- On performance debugging of unnecessary lock contentions on multicore processors: a replay-based approach (LZ, XL, BH, SW, HJ), pp. 56–67.
- CGO-2015-HawkinsDBZ #optimisation
- Optimizing binary translation of dynamically generated code (BH, BD, DB, QZ), pp. 68–78.
- CGO-2015-ArthurMDA #control flow
- Getting in control of your control flow with control-data isolation (WA, BM, RD, TMA), pp. 79–90.
- CGO-2015-SrinivasDK
- Reactive tiling (JS, WD, MTK), pp. 91–102.
- CGO-2015-RohouSS #branch #performance #predict #trust
- Branch prediction and the performance of interpreters: don’t trust folklore (ER, BNS, AS), pp. 103–114.
- CGO-2015-PallisterEH #embedded #energy #optimisation #trade-off
- Optimizing the flash-RAM energy trade-off in deeply embedded systems (JP, KE, SJH), pp. 115–124.
- CGO-2015-McAfeeO #framework #generative #learning #multi #named
- EMEURO: a framework for generating multi-purpose accelerators via deep learning (LCM, KO), pp. 125–135.
- CGO-2015-TangZLLHLG #multi #optimisation
- Optimizing and auto-tuning scale-free sparse matrix-vector multiplication on Intel Xeon Phi (WTT, RZ, ML, YL, HPH, XL, RSMG), pp. 136–145.
- CGO-2015-LuciaC #concurrent #source code
- Data provenance tracking for concurrent programs (BL, LC), pp. 146–156.
- CGO-2015-ShresthaGMMF #concurrent #locality
- Locality aware concurrent start for stencil applications (SS, GRG, JM, AM, JF), pp. 157–166.
- CGO-2015-HasabnisQS #architecture #code generation #correctness #specification
- Checking correctness of code generator architecture specifications (NH, RQ, RS), pp. 167–178.
- CGO-2015-OhM #web
- Snapshot-based loading-time acceleration for web applications (JO, SMM), pp. 179–189.
- CGO-2015-PorpodasMJ #automation #named
- PSLP: padded SLP automatic vectorization (VP, AM, TMJ), pp. 190–201.
- CGO-2015-LeissaKH #graph #higher-order #representation
- A graph-based higher-order intermediate representation (RL, MK, SH), pp. 202–212.
- CGO-2015-OanceaR #analysis #induction #scalability
- Scalable conditional induction variables (CIV) analysis (CEO, LR), pp. 213–224.
- CGO-2015-NagarajG #analysis #approximate #mining #pointer #using
- Approximating flow-sensitive pointer analysis using frequent itemset mining (VN, RG), pp. 225–234.
- CGO-2015-CampanoniHWB #named #parallel #semantics
- HELIX-UP: relaxing program semantics to unleash parallelization (SC, GHH, GYW, DMB), pp. 235–245.
- CGO-2015-ZhangGCCH #named #performance
- HERMES: a fast cross-ISA binary translator with post-optimization (XZ, QG, YC, TC, WH), pp. 246–256.
- CGO-2015-KimHSLH #architecture #concurrent #cpu #modelling #programming #scheduling #thread
- Locality-centric thread scheduling for bulk-synchronous programming models on CPU architectures (HSK, IEH, JAS, SSL, WmWH), pp. 257–268.
5 ×#named
4 ×#concurrent
4 ×#performance
3 ×#memory management
3 ×#optimisation
2 ×#analysis
2 ×#architecture
2 ×#automation
2 ×#energy
2 ×#multi
4 ×#concurrent
4 ×#performance
3 ×#memory management
3 ×#optimisation
2 ×#analysis
2 ×#architecture
2 ×#automation
2 ×#energy
2 ×#multi