Tag #cpu
75 papers:
- CIKM-2016-Wang0PB #algorithm #detection #online #self
- A Self-Learning and Online Algorithm for Time Series Anomaly Detection, with Application in CPU Manufacturing (XW0, JL0, NP, MB), pp. 1823–1832.
- CC-2016-MajetiMBS #architecture #automation #generative #gpu #kernel #layout
- Automatic data layout generation and kernel mapping for CPU+GPU architectures (DM, KSM, RB, VS), pp. 240–250.
- CGO-2016-BarikFLHS #approach #black box #energy #gpu #scheduling
- A black-box approach to energy-aware scheduling on integrated CPU-GPU systems (RB, NF, BTL, CH, TS), pp. 70–81.
- SIGIR-2015-CatenaMT #power management #web
- Load-sensitive CPU Power Management for Web Search Engines (MC, CM, NT), pp. 751–754.
- ICSE-v2-2015-Salgado #behaviour #gpu #interactive #kernel #profiling
- Profiling Kernels Behavior to Improve CPU / GPU Interactions (RS), pp. 754–756.
- SAC-2015-JoCKBO #algorithm #approach #collaboration #data-driven #on the
- On running data-intensive algorithms with intelligent SSD and host CPU: a collaborative approach (YYJ, SC, SWK, DHB, HO), pp. 2060–2065.
- CGO-2015-KimHSLH #architecture #concurrent #modelling #programming #scheduling #thread
- Locality-centric thread scheduling for bulk-synchronous programming models on CPU architectures (HSK, IEH, JAS, SSL, WmWH), pp. 257–268.
- DAC-2015-HanF #analysis #approach #gpu #graph #scalability
- Transient-simulation guided graph sparsification approach to scalable harmonic balance (HB) analysis of post-layout RF circuits leveraging heterogeneous CPU-GPU computing systems (LH, ZF), p. 6.
- DAC-2015-KadjoAKG #approach #energy #gpu #mobile #performance #platform
- A control-theoretic approach for energy efficient CPU-GPU subsystem in mobile platforms (DK, RA, MK, PVG), p. 6.
- DATE-2015-BruggerVWTK #hybrid
- Reverse longstaff-schwartz american option pricing on hybrid CPU/FPGA systems (CB, JAV, NW, ST, RK), pp. 1599–1602.
- HPCA-2015-AroraMPJT #behaviour #benchmark #comprehension #gpu #metric #power management
- Understanding idle behavior and power gating mechanisms in the context of modern benchmarks on CPU-GPU Integrated systems (MA, SM, IP, NJ, DMT), pp. 366–377.
- HPDC-2015-XiaoCHZ #gpu #monte carlo
- Monte Carlo Based Ray Tracing in CPU-GPU Heterogeneous Systems and Applications in Radiation Therapy (KX, DZC, XSH, BZ), pp. 247–258.
- PPoPP-2015-PiaoKOLKKL #adaptation #framework #gpu #javascript #named
- JAWS: a JavaScript framework for adaptive CPU-GPU work sharing (XP, CK, YO, HL, JK, HK, JWL), pp. 251–252.
- SOSP-2015-AmitTSAS #validation
- Virtual CPU validation (NA, DT, AS, AA, ES), pp. 311–327.
- VLDB-2015-HeZH14 #architecture #gpu #query
- In-Cache Query Co-Processing on Coupled CPU-GPU Architectures (JH, SZ, BH), pp. 329–340.
- ASPLOS-2014-PichaiHB #architecture #design #memory management
- Architectural support for address translation on GPUs: designing memory management units for CPU/GPUs with unified address spaces (BP, LH, AB), pp. 743–758.
- DAC-2014-KoKYKH #gpu #platform #simulation
- Hardware-in-the-loop Simulation for CPU/GPU Heterogeneous Platforms (YK, TK, YY, MK, SH), p. 6.
- DAC-2014-PathaniaJPM #3d #game studies #gpu #mobile #power management
- Integrated CPU-GPU Power Management for 3D Mobile Games (AP, QJ, AP, TM), p. 6.
- PDP-2014-BoobGP #automation #gpu #parallel #performance
- Automated Instantiation of Heterogeneous Fast Flow CPU/GPU Parallel Pattern Applications in Clouds (SB, HGV, AMP), pp. 162–169.
- PDP-2014-GargH #gpu #library #multi
- A Portable and High-Performance General Matrix-Multiply (GEMM) Library for GPUs and Single-Chip CPU/GPU Systems (RG, LJH), pp. 672–680.
- PDP-2014-GschwandtnerKMPF #energy #modelling
- Modeling CPU Energy Consumption of HPC Applications on the IBM POWER7 (PG, MK, BM, DP, TF), pp. 536–543.
- ISSTA-2014-NejatiB #constraints #identification #trade-off #using
- Identifying optimal trade-offs between CPU time usage and temporal constraints using search (SN, LCB), pp. 351–361.
- VLDB-2013-HeLH #architecture #gpu
- Revisiting Co-Processing for Hash Joins on the Coupled CPU-GPU Architecture (JH, ML, BH), pp. 889–900.
- VLDB-2013-ZhangHHL #architecture #gpu #named #parallel #performance #query #towards
- OmniDB: Towards Portable and Efficient Query Processing on Parallel CPU/GPU Architectures (SZ, JH, BH, ML), pp. 1374–1377.
- VLDB-2014-DasNLS13 #as a service #multitenancy #performance #relational
- CPU Sharing Techniques for Performance Isolation in Multitenant Relational Database-as-a-Service (SD, VRN, FL, MS), pp. 37–48.
- ASE-2013-NejatiABHBC #embedded #risk management
- Minimizing CPU time shortage risks in integrated embedded software (SN, MA, LCB, JH, JB, YC), pp. 529–539.
- DATE-2013-ZakharenkoAM #gpu #performance #using
- Characterizing the performance benefits of fused CPU/GPU systems using FusionSim (VZ, TMA, AM), pp. 685–688.
- HPCA-2013-LustigM #fine-grained #gpu #latency
- Reducing GPU offload latency via fine-grained CPU-GPU synchronization (DL, MM), pp. 354–365.
- PPoPP-2013-YangXFGLXLSYZ #algorithm #gpu #simulation
- A peta-scalable CPU-GPU algorithm for global atmospheric simulations (CY, WX, HF, LG, LL, YX, YL, JS, GY, WZ), pp. 1–12.
- VLDB-2012-WangHLWZS #gpu #hybrid #image
- Accelerating Pathology Image Data Cross-Comparison on CPU-GPU Hybrid Systems (KW, YH, RL, FW, XZ, JHS), pp. 1543–1554.
- ITiCSE-2012-Black #design #education #hardware
- A hardware simulator for teaching CPU design (MB), p. 380.
- MoDELS-2012-NejatiASB #analysis #embedded #modelling #safety #testing
- Modeling and Analysis of CPU Usage in Safety-Critical Embedded Systems to Support Stress Testing (SN, SDA, MS, LCB), pp. 759–775.
- CGO-2012-JablinJPLA #architecture #gpu
- Dynamically managed data for CPU-GPU architectures (TBJ, JAJ, PP, FL, DIA), pp. 165–174.
- DAC-2012-JeongESP #gpu #memory management
- A QoS-aware memory controller for dynamically balancing GPU and CPU bandwidth use in an MPSoC (MKJ, ME, CS, NCP), pp. 850–855.
- DAC-2012-KimLCKWYL #gpu #hybrid #in memory #memory management
- Hybrid DRAM/PRAM-based main memory for single-chip CPU/GPU (DK, SL, JC, DK, DHW, SY, SL), pp. 888–896.
- HPCA-2012-AyoubNR #energy #memory management #named
- JETC: Joint energy thermal and cooling management for memory and CPU subsystems in servers (RZA, RN, TR), pp. 299–310.
- HPCA-2012-LeeK #architecture #gpu #named #policy
- TAP: A TLP-aware cache management policy for a CPU-GPU heterogeneous architecture (JL, HK), pp. 91–102.
- HPCA-2012-YangXMZ #architecture #gpu
- CPU-assisted GPGPU on fused CPU-GPU architectures (YY, PX, MM, HZ), pp. 103–114.
- HPDC-2012-XuGRKKX #named #scheduling #slicing #virtual machine
- vSlicer: latency-aware virtual machine scheduling via differentiated-frequency CPU slicing (CX, SG, PNR, AK, RRK, DX), pp. 3–14.
- OSDI-2012-BelayBMTMK #named
- Dune: Safe User-level Access to Privileged CPU Features (AB, AB, AJM, DT, DM, CK), pp. 335–348.
- PDP-2012-BoukedjarLB #bound #branch #gpu #parallel
- Parallel Branch and Bound on a CPU-GPU System (AB, MEL, DEB), pp. 392–398.
- PDP-2012-SpigaG #gpu #hybrid #library #migration #named #quantum
- phiGEMM: A CPU-GPU Library for Porting Quantum ESPRESSO on Hybrid Systems (FS, IG), pp. 368–375.
- PPoPP-2012-KimSLNJL #clustering #gpu #programming
- OpenCL as a unified programming model for heterogeneous CPU/GPU clusters (JK, SS, JL, JN, GJ, JL), pp. 299–300.
- PLDI-2011-JablinPJJBA #automation #communication #gpu #optimisation
- Automatic CPU-GPU communication management and optimization (TBJ, PP, JAJ, NPJ, SRB, DIA), pp. 142–151.
- DAC-2011-ZhuDC #architecture #gpu #named
- Hermes: an integrated CPU/GPU microarchitecture for IP routing (YZ, YD, YC), pp. 1044–1049.
- HPDC-2011-LiLTCZ #3d #experience #gpu #re-engineering
- Experience of parallelizing cryo-EM 3D reconstruction on a CPU-GPU heterogeneous system (LL, XL, GT, MC, PZ), pp. 195–204.
- PDP-2011-BarlasHJ #approach #case study #design #encryption #gpu #parallel
- An Analytical Approach to the Design of Parallel Block Cipher Encryption/Decryption: A CPU/GPU Case Study (GDB, AH, YAJ), pp. 247–251.
- DATE-2010-ZhuSJ #configuration management #design #performance #streaming
- Pareto efficient design for reconfigurable streaming applications on CPU/FPGAs (JZ, IS, AJ), pp. 1035–1040.
- HPCA-2010-TangBHC #architecture #performance #using
- DMA cache: Using on-chip storage to architecturally separate I/O data from CPU data for improving I/O performance (DT, YB, WH, MC), pp. 1–12.
- PPoPP-2010-PerarnauH #generative #named
- KRASH: reproducible CPU load generation on many cores machines (SP, GH), pp. 327–328.
- DATE-2009-ZhuSJ #architecture #hybrid #realtime #scheduling #streaming
- Buffer minimization of real-time streaming applications scheduling on hybrid CPU/FPGA architectures (JZ, IS, AJ), pp. 1506–1511.
- PDP-2009-Barker #manycore #memory management
- Realities of Multi-Core CPU Chips and Memory Contention (DPB), pp. 446–453.
- ISSTA-2009-MartignoniPRB #testing
- Testing CPU emulators (LM, RP, GFR, DB), pp. 261–272.
- VLDB-2008-ChhuganiNLMHCBKD #architecture #implementation #manycore #performance #sorting
- Efficient implementation of sorting on multi-core SIMD CPU architecture (JC, ADN, VWL, WM, MH, YKC, AB, SK, PD), pp. 1313–1324.
- SAC-2008-SchobelP #case study #clustering #kernel #research #scheduling #using
- Kernel-mode scheduling server for CPU partitioning: a case study using the Windows research kernel (MS, AP), pp. 1700–1704.
- HPCA-2008-DiaoS #predict #process
- Prediction of CPU idle-busy activity pattern (QD, JJS), pp. 27–36.
- LCTES-2007-AbouGhazalehFRXLCMM #machine learning #scalability #using
- Integrated CPU and l2 cache voltage scaling using machine learning (NA, APF, CR, RX, FL, BRC, DM, RGM), pp. 41–50.
- PPoPP-2007-KejariwalTGLKBNVP #analysis #concurrent #performance #specification #thread #using
- Tight analysis of the performance potential of thread speculation using spec CPU 2006 (AK, XT, MG, WL, SK, UB, AN, AVV, CDP), pp. 215–225.
- QAPL-2005-BinderH06 #bytecode #metric #using
- Using Bytecode Instruction Counting as Portable CPU Consumption Metric (WB, JH), pp. 57–77.
- PEPM-2004-HulaasB #java #program transformation
- Program transformations for portable CPU accounting and control in Java (JH, WB), pp. 169–177.
- PLDI-2003-HsuK #algorithm #compilation #design #energy #evaluation #implementation #reduction
- The design, implementation, and evaluation of a compiler algorithm for CPU energy reduction (CHH, UK), pp. 38–48.
- SOSP-2003-YuanN #energy #mobile #multi #realtime #scheduling
- Energy-efficient soft real-time CPU scheduling for mobile multimedia systems (WY, KN), pp. 149–163.
- DAC-2002-RichardsonHHZSL
- The iCOREtm 520 MHz synthesizable CPU core (NR, LBH, RH, TZ, NS, JL), pp. 640–645.
- DATE-2001-TagoHINSY
- CPU for PlayStation 2 (HT, KH, NI, MN, MS, YY), p. 696.
- VLDB-2000-ManegoldBK #memory management #optimisation #what
- What Happens During a Join? Dissecting CPU and Memory Optimization Effects (SM, PAB, MLK), pp. 339–350.
- OSDI-2000-ChandraAGS #algorithm #multi #scheduling #symmetry
- Surplus Fair Scheduling: A Proportional-Share CPU Scheduling Algorithm for Symmetric Multiprocessors (AC, MA, PG, PJS), pp. 45–58.
- HPDC-1999-WolskiSH #grid #predict
- Predicting the CPU Availability of Time-shared Unix Systems on the Computational Grid (RW, NTS, JH), pp. 105–112.
- SOSP-1997-JonesRR #constraints #independence #performance #predict #process #scheduling
- CPU Reservations and Time Constraints: Efficient, Predictable Scheduling of Independent Activities (MBJ, DR, MCR), pp. 198–211.
- DAC-1996-DesaiCJ #network #performance
- Sizing of Clock Distribution Networks for High Performance CPU Chips (MPD, RC, JJ), pp. 389–394.
- DAC-1996-DesaiY #design #simulation #using #verification
- A Systematic Technique for Verifying Critical Path Delays in a 300MHz Alpha CPU Design Using Circuit Simulation (MPD, YTY), pp. 125–130.
- HPDC-1996-NurmiBGLP #automation #distributed #network
- Automatic Management of CPU and I/O Bottlenecks in Distributed Applications on ATM Networks (MAN, WEB, RNG, KCL, MDP), pp. 481–489.
- TRI-Ada-1994-GreeneL #embedded #migration
- Embedded CPU Target Migration, Doing More With Less (RG, GL), pp. 429–436.
- EDAC-1994-NguyenTDTV #logic #synthesis #verification
- Logic Synthesis and Verification of the CPU and Caches of a Mainframe System (HNN, JPT, LD, MT, PV), pp. 60–64.
- SIGMOD-1987-GrayP #memory management
- The 5 Minute Rule for Trading Memory for Disk Accesses and The 10 Byte Rule for Trading Memory for CPU Time (JG, GRP), pp. 395–398.
- DAC-1982-KangKL #adaptation #design #evolution #layout #logic #matrix #random
- Gate matrix layout of random control logic in a 32-bit CMOS CPU chip adaptable to evolving logic design (SMK, RHK, HFSL), pp. 170–174.