BibSLEIGH
BibSLEIGH corpus
BibSLEIGH tags
BibSLEIGH bundles
BibSLEIGH people
CC-BY
Open Knowledge
XHTML 1.0 W3C Rec
CSS 2.1 W3C CanRec
email twitter
Used together with:
architectur (13)
parallel (12)
code (8)
instruct (8)
vector (8)

Stem simd$ (all stems)

58 papers:

DATEDATE-2015-FuWH #code generation
Improving SIMD code generation in QEMU (SYF, JJW, WCH), pp. 1233–1236.
SIGMODSIGMOD-2015-PolychroniouRR #database #in memory
Rethinking SIMD Vectorization for In-Memory Databases (OP, AR, KAR), pp. 1493–1508.
VLDBVLDB-2015-InoueT #algorithm #array #sorting
SIMD- and Cache-Friendly Algorithm for Sorting an Array of Structures (HI, KT), pp. 1274–1285.
DACDAC-2014-WaeijenSCH #reduction
Reduction Operator for Wide-SIMDs Reconsidered (LW, DS, HC, YH), p. 6.
DATEDATE-2014-BoettcherAEGR #architecture
Advanced SIMD: Extending the reach of contemporary SIMD architectures (MB, BMAH, ME, GG, AR), pp. 1–4.
DATEDATE-2014-KimH #automation #generative #parallel
Automatic generation of custom SIMD instructions for Superword Level Parallelism (TK, YH), pp. 1–6.
VLDBVLDB-2015-InoueOT14 #branch #performance #predict #set
Faster Set Intersection with SIMD instructions by Reducing Branch Mispredictions (HI, MO, KT), pp. 293–304.
PLDIPLDI-2013-KongVSFPS #code generation
When polyhedral transformations meet SIMD code generation (MK, RV, KS, FF, LNP, PS), pp. 127–138.
ICFPICFP-2013-PetersenOG #automation #haskell
Automatic SIMD vectorization for Haskell (LP, DAO, NG), pp. 25–36.
CGOCGO-2013-RenALMPS #data type #parallel
SIMD parallelization of applications that traverse irregular data structures (BR, GA, JRL, TM, TP, WS), p. 10.
HPCAHPCA-2013-WangCWMZLN #architecture #execution #parallel
A multiple SIMD, multiple data (MSMD) architecture: Parallel execution of dynamic and static SIMD fragments (YW, SC, JW, JM, KZ, WL, XN), pp. 603–614.
PPoPPPPoPP-2013-BartheCKGM #relational #synthesis #verification
From relational verification to SIMD loop synthesis (GB, JMC, SG, CK, MM), pp. 123–134.
DACDAC-2012-SeoDWPCMBM #architecture #process
Process variation in near-threshold wide SIMD architectures (SS, RGD, MW, YP, CC, SAM, DB, TNM), pp. 980–987.
ASPLOSASPLOS-2012-ParkSPCM #architecture #performance
SIMD defragmenter: efficient ILP realization on data-parallel architectures (YP, SS, HP, HKC, SAM), pp. 363–374.
PPoPPPPoPP-2012-KimH #code generation #kernel #performance
Efficient SIMD code generation for irregular kernels (SK, HH), pp. 55–64.
PPoPPPPoPP-2012-LeissaHW #programming
Extending a C-like language for portable SIMD programming (RL, SH, IW), pp. 65–74.
DACDAC-2011-ZhaoF #3d #gpu #parallel #performance
Fast multipole method on GPU: tackling 3-D capacitance extraction on massively parallel SIMD platforms (XZ, ZF), pp. 558–563.
DATEDATE-2011-MichelFP #embedded #simulation
Speeding-up SIMD instructions dynamic binary translation in embedded processor simulation (LM, NF, FP), pp. 277–280.
DATEDATE-2011-WohSDKSBM #power management
Low power interconnects for SIMD computers (MW, SS, RGD, DK, DS, DB, TNM), pp. 600–605.
CIKMCIKM-2011-StepanovGREO
SIMD-based decoding of posting lists (AAS, ARG, DER, RJE, PSO), pp. 317–326.
CCCC-2011-HenrettySPFRS #architecture #layout
Data Layout Transformation for Stencil Computations on Short-Vector SIMD Architectures (TH, KS, LNP, FF, JR, PS), pp. 225–245.
CGOCGO-2011-NuzmanDRRWYCZ
Vapor SIMD: Auto-vectorize once, run everywhere (DN, SD, ER, IR, KW, DY, AC, AZ), pp. 151–160.
DACDAC-2010-HePKYALC #energy #named #throughput
Xetal-Pro: an ultra-low energy and high throughput SIMD processor (YH, YP, RPK, ZY, AAA, SML, HC), pp. 543–548.
SIGMODSIGMOD-2010-SatishKCNLKD #performance
Fast sort on CPUs and GPUs: a case for bandwidth oblivious SIMD sort (NS, CK, JC, ADN, VWL, DK, PD), pp. 351–362.
HPCAHPCA-2010-HuangSWSXM #named #permutation
SIF: Overcoming the limitations of SIMD devices via implicit permutation (LH, LS, ZW, WS, NX, SM), pp. 1–12.
VLDBVLDB-2009-WillhalmPBPZS #in memory #named #performance #using
SIMD-Scan: Ultra Fast in-Memory Table Scan using on-Chip Vector Processing Units (TW, NP, YB, HP, AZ, JS), pp. 385–394.
DATEDATE-2008-BonnotLEGRG #approach #architecture #implementation #multi
Definition and SIMD Implementation of a Multi-Processing Architecture Approach on FPGA (PB, FL, GE, GG, OR, PG), pp. 610–615.
VLDBVLDB-2008-ChhuganiNLMHCBKD #architecture #cpu #implementation #manycore #performance #sorting
Efficient implementation of sorting on multi-core SIMD CPU architecture (JC, ADN, VWL, WM, MH, YKC, AB, SK, PD), pp. 1313–1324.
CCCC-2008-FranchettiP #generative #permutation
Generating SIMD Vectorized Permutations (FF, MP), pp. 116–131.
CCCC-2008-LashariLM #architecture #control flow
Control Flow Emulation on Tiled SIMD Architectures (GL, OL, MM), pp. 100–115.
PPoPPPPoPP-2008-Cameron #case study #parallel
A case study in SIMD text processing with parallel bit streams: UTF-8 to UTF-16 transcoding (RDC), pp. 91–98.
DATEDATE-2007-KraemerLAM #interactive #parallel #program transformation #source code #using
Interactive presentation: SoftSIMD — exploiting subword parallelism using source code transformations (SK, RL, GA, HM), pp. 1349–1354.
AGTIVEAGTIVE-2007-AnandK #assembly #generative #graph transformation
Code Graph Transformations for Verifiable Generation of SIMD-Parallel Assembly Code (CKA, WK), pp. 217–232.
CCCC-2007-FiremanPZ #algorithm
New Algorithms for SIMD Alignment (LF, EP, AZ), pp. 1–15.
HPCAHPCA-2007-ClarkHYMF #hardware #lightweight #using
Liquid SIMD: Abstracting SIMD Hardware using Lightweight Dynamic Mapping (NC, AH, SY, SAM, KF), pp. 216–227.
DATEDATE-DF-2006-DavilaTSSBR #algorithm #architecture #configuration management #design #implementation
Design and implementation of a rendering algorithm in a SIMD reconfigurable architecture (MorphoSys) (JD, AdT, JMS, MSE, NB, FR), pp. 52–57.
PLDIPLDI-2006-NuzmanRZ
Auto-vectorization of interleaved data for SIMD (DN, IR, AZ), pp. 132–143.
PLDIPLDI-2006-RenWP #optimisation #permutation
Optimizing data permutations for SIMD devices (GR, PW, DAP), pp. 118–131.
ASPLOSASPLOS-2006-PatwardhanJDL #architecture #fault #self
A defect tolerant self-organizing nanoscale SIMD architecture (JPP, VJ, CD, ARL), pp. 241–251.
CGOCGO-2006-LiZXH #optimisation
Optimizing Dynamic Binary Translation for SIMD Instructions (JL, QZ, SX, BH), pp. 269–280.
LCTESLCTES-2006-ZhangQWZZ #architecture #compilation #multi #optimisation
Optimizing compiler for shared-memory multiple SIMD architecture (WZ, XQ, YW, BZ, CZ), pp. 199–208.
CCCC-2005-JiangMHLZZZ #multi #performance #using
Boosting the Performance of Multimedia Applications Using SIMD Instructions (WJ, CM, BH, JL, JZ, BZ, CZ), pp. 59–75.
CGOCGO-2005-WuEW #code generation #performance #runtime
Efficient SIMD Code Generation for Runtime Alignment and Length Conversion (PW, AEE, AW), pp. 153–164.
LCTESLCTES-2005-KudriavtsevK #generative #permutation
Generation of permutations for SIMD processors (AK, PMK), pp. 147–156.
PLDIPLDI-2004-EichenbergerWO #architecture #constraints
Vectorization for SIMD architectures with alignment constraints (AEE, PW, KO), pp. 82–93.
DATEDATE-2003-BeeckGBMCD #data transformation #implementation #power management #realtime
Background Data Organisation for the Low-Power Implementation in Real-Time of a Digital Audio Broadcast Receiver on a SIMD Processor (POdB, CG, EB, MM, FC, GD), pp. 11144–11145.
DATEDATE-2003-DuSTBAF #configuration management #interactive
Interactive Ray Tracing on Reconfigurable SIMD MorphoSys (HD, MSE, NT, NB, MLA, MF), pp. 20144–20149.
SIGMODSIGMOD-2002-ZhouR #database #implementation #using
Implementing database operations using SIMD instructions (JZ, KAR), pp. 145–156.
LCTESLCTES-SCOPES-2002-LorenzWD #compilation #energy
Energy aware compilation for DSPs with SIMD instructions (ML, LW, TD), pp. 94–101.
DATEDATE-2000-Leupers
Code Selection for Media Processors with SIMD Instructions (RL), pp. 4–8.
ICPRICPR-1996-ArumugaveluR #algorithm #clustering
SIMD algorithms for single link and complete link pattern clustering (SA, NR), pp. 625–629.
SACSAC-1995-BaudinoCMS #set
Processing sets on a SIMD machine (AB, GC, GM, GS), pp. 593–598.
ICLPILPS-1993-TongL #concurrent #constraints #logic programming #parallel
Concurrent Constraint Logic Programming On Massively Parallel SIMD Computers (BMT, HfL), pp. 388–402.
ESOPESOP-1992-Levaire #case study #programming #using
Using the Centaur System to for Data-Parallel SIMD Programming: A Case Study (JLL), pp. 341–350.
PLDIPLDI-1992-HanxledenK #constraints #control flow #using
Relaxing SIMD Control Flow Constraints using Loop Transformations (RvH, KK), pp. 188–199.
ISMMIWMM-1992-Yuasa #architecture #garbage collection #lisp #memory management #parallel
Memory Management and Garbage Collection of an Extended Common Lisp System for Massively Parallel SIMD Architecture (TY), pp. 490–506.
AdaEuropeAdaEurope-1991-Gudenberg #ada #parallel
Modellin SIMD — Type Parallel Arithmetic Operations in Ada (JWvG), pp. 110–124.
LISPLFP-1988-HudakH
Graphinators and the Duality of SIMD and MIMD (PH, EM), pp. 224–234.

Bibliography of Software Language Engineering in Generated Hypertext (BibSLEIGH) is created and maintained by Dr. Vadim Zaytsev.
Hosted as a part of SLEBOK on GitHub.