BibSLEIGH
Travelled to:
1 × Canada
1 × China
1 × Cyprus
1 × Estonia
1 × Hungary
1 × India
1 × Romania
18 × USA
2 × France
2 × United Kingdom
3 × Germany
Collaborated with:
J.Ramanujam L.Pouchet S.Krishnamoorthy U.Bondhugula A.Rountev F.Rastello M.M.Baskaran N.Fauzia Wenlei Bao A.Cohen S.Tavarageri V.P.Krothapalli V.Visvanathan K.Stock A.Hartono R.Kettimuthu T.M.Kurç J.H.Saltz V.Elango T.Grosser J.Holewinski F.Franchetti T.Henretty X.Yang S.Parthasarathy G.Sabin V.Sahasrabudhe V.Ashok R.L.Costello M.Kong N.Vydyanathan Q.Lu M.Ravishankar G.Khanna Ü.V.Çatalyürek C.Lam G.Baumgartner R.J.Harrison J.Kim K.V.Valkenburgh D.Yan V.Subramani S.Srinivasan D.Ghosh S.K.Nandy K.Parthasarathy L.Domagala D.v.Amstel Changwan Hong S.Verdoolaege E.Park J.Cavazos R.Veras J.Lin X.Ding Z.Zhang X.Zhang A.Ashari S.Tatikonda M.Böhm B.Reinwald K.Campbell J.Keenleyside R.Dathathri J.Shirako K.Sharma V.Sarkar R.Ramamurthi C.Bastoul N.Vasilache I.T.Foster X.Gao S.K.Sahoo S.Rajbhandari D.Cociorva M.Nooijen D.E.Bernholdt A.C.0001 Aravind Sukumaran-Rajam P.S.Rawat V.S.Kumar G.Mehta K.Vahi E.Deelman V.Ratnakar J.Kim Y.Gil M.W.Hall
Talks about:
optim (11) data (8) parallel (7) automat (6) transform (5) schedul (5) program (5) comput (5) model (5) polyhedr (4)

Person: P. Sadayappan

DBLP: Sadayappan:P=

Facilitated 1 volume:

PPoPP 2012 (Ed)

Contributed to:

CGO 2015
POPL 2015
PPoPP 2015
CGO 2014
PLDI 2014
PLDI 2013
CC 2012
PLDI 2012
CC 2011
CGO 2011
POPL 2011
VLDB 2011
CC 2010
CGO 2010
ICSM 2010
HPDC 2009
PPoPP 2009
CC 2008
HPCA 2008
HPDC 2008
PLDI 2008
PPoPP 2008
PLDI 2007
PPoPP 2007
HPDC 2006
HPDC 2005
PPoPP 2005
HPDC 2002
PLDI 2002
DAC 1993
PPoPP 1991
DAC 1989
DAC 1985
CC 2016
PLDI 2016
POPL 2016
PLDI 2018
POPL 2018

Wrote 41 papers:

CGO-2015-FauziaPS #memory management
Characterizing and enhancing global memory data coalescing on GPUs (NF, LNP, PS), pp. 12–22.
POPL-2015-ElangoRPRS #complexity #data access #on the #source code
On Characterizing the Data Access Complexity of Programs (VE, FR, LNP, JR, PS), pp. 567–580.
PPoPP-2015-AshariTBRCKS #kernel #machine learning #on the #optimisation
On optimizing machine learning workloads via kernel fusion (AA, ST, MB, BR, KC, JK, PS), pp. 173–182.
PPoPP-2015-RavishankarDEPRRS #code generation #distributed #memory management
Distributed memory code generation for mixed Irregular/Regular computations (MR, RD, VE, LNP, JR, AR, PS), pp. 65–75.
CGO-2014-GrosserCHSV #hybrid
Hybrid Hexagonal/Classical Tiling for GPUs (TG, AC, JH, PS, SV), p. 66.
PLDI-2014-StockKGPRRS #framework #order #reuse
A framework for enhancing data reuse via associative reordering (KS, MK, TG, LNP, FR, JR, PS), p. 10.
PLDI-2014-TavarageriKS #detection #fault #memory management
Compiler-assisted detection of transient memory errors (ST, SK, PS), p. 24.
PLDI-2013-KongVSFPS #code generation
When polyhedral transformations meet SIMD code generation (MK, RV, KS, FF, LNP, PS), pp. 127–138.
CC-2012-ShirakoSFPRSS #bound
Analytical Bounds for Optimal Tile Size Selection (JS, KS, NF, LNP, JR, PS, VS), pp. 101–121.
PLDI-2012-HolewinskiRRFPRS #analysis
Dynamic trace-based analysis of vectorization potential of applications (JH, RR, MR, NF, LNP, AR, PS), pp. 371–382.
CC-2011-HenrettySPFRS #architecture #layout
Data Layout Transformation for Stencil Computations on Short-Vector SIMD Architectures (TH, KS, LNP, FF, JR, PS), pp. 225–245.
CGO-2011-ParkPCCS #modelling #optimisation #predict
Predictive modeling in a polyhedral optimization space (EP, LNP, JC, AC, PS), pp. 119–129.
POPL-2011-PouchetBBCRSV #optimisation
Loop transformations: convexity, pruning and optimization (LNP, UB, CB, AC, JR, PS, NV), pp. 549–562.
VLDB-2011-YangPS #graph #mining #multi #performance
Fast Sparse Matrix-Vector Multiplication on GPUs: Implications for Graph Mining (XY, SP, PS), pp. 231–242.
CC-2010-BaskaranRS #automation #code generation #source code
Automatic C-to-CUDA Code Generation for Affine Programs (MMB, JR, PS), pp. 244–263.
CGO-2010-BaskaranHTHRS #revisited
Parameterized tiling revisited (MMB, AH, ST, TH, JR, PS), pp. 200–209.
ICSM-2010-RountevVYS #comprehension #dependence #java #source code
Understanding parallelism-inhibiting dependences in sequential Java programs (AR, KVV, DY, PS), pp. 1–9.
HPDC-2009-KumarSMVDRKGHKS #framework #optimisation #workflow
An integrated framework for performance-based optimization of scientific workflows (VSK, PS, GM, KV, ED, VR, JK, YG, MWH, TMK, JHS), pp. 177–186.
PPoPP-2009-BaskaranVBRRS #effectiveness #manycore #parallel #scheduling
Compiler-assisted dynamic scheduling for effective parallelization of loop nests on multicore processors (MMB, NV, UB, JR, AR, PS), pp. 219–228.
CC-2008-BondhugulaBKRRS #automation #locality #optimisation #parallel
Automatic Transformations for Communication-Minimized Parallelization and Locality Optimization in the Polyhedral Model (UB, MMB, SK, JR, AR, PS), pp. 132–146.
HPCA-2008-LinLDZZS #clustering #manycore #simulation
Gaining insights into multicore cache partitioning: Bridging the gap between simulation and real systems (JL, QL, XD, ZZ, XZ, PS), pp. 367–378.
HPDC-2008-KhannaCKSSKF #data transfer #multi #network #optimisation #using
Multi-hop path splitting and multi-pathing optimizations for data transfers over shared wide-area networks using GridFTP (GK, ÜVÇ, TMK, PS, JHS, RK, ITF), pp. 225–226.
PLDI-2008-BondhugulaHRS #automation #locality
A practical automatic polyhedral parallelizer and locality optimizer (UB, AH, JR, PS), pp. 101–113.
PPoPP-2008-BaskaranBKRRS #architecture #automation #data flow #parallel
Automatic data movement and computation mapping for multi-level parallel architectures with explicitly managed memories (MMB, UB, SK, JR, AR, PS), pp. 1–10.
PLDI-2007-KrishnamoorthyBBRRS #automation #effectiveness #parallel
Effective automatic parallelization of stencil computations (SK, MMB, UB, JR, AR, PS), pp. 235–244.
PPoPP-2007-BondhugulaRS #automation
Automatic mapping of nested loops to FPGAs (UB, JR, PS), pp. 101–111.
HPDC-2006-KhannaVCKKSS #data-driven #replication #scheduling
Task Scheduling and File Replication for Data-Intensive Jobs with Batch-shared I/O (GK, NV, ÜVÇ, TMK, SK, PS, JHS), pp. 241–252.
HPDC-2005-SabinSS #assessment #multi
Assessment and enhancement of meta-schedulers for multi-site job sharing (GS, VS, PS), pp. 144–153.
PPoPP-2005-GaoSLRLBS #modelling #optimisation #parallel #performance
Performance modeling and optimization of parallel out-of-core tensor contractions (XG, SKS, CCL, JR, QL, GB, PS), pp. 266–276.
HPDC-2002-SubramaniKSS #distributed #multi #scheduling #using
Distributed Job Scheduling on Computational Grids Using Multiple Simultaneous Requests (VS, RK, SS, PS), p. 359–?.
PLDI-2002-CociorvaBLSRNBH #optimisation #trade-off
Space-Time Trade-Off Optimization for a Class of Electronic Structure Calculations (DC, GB, CCL, PS, JR, MN, DEB, RJH), pp. 177–186.
DAC-1993-GhoshNSP #architecture #multi #synthesis
Architectural Synthesis of Performance-Driven Multipliers with Accumulator Interleaving (DG, SKN, PS, KP), pp. 303–307.
PPoPP-1991-KrothapalliS #constant #dependence
Removal of Redundant Dependences in DOACROSS Loops with Constant Dependences (VPK, PS), pp. 51–60.
DAC-1989-SadayappanV #matrix #performance #simulation
Efficient Sparse Matrix Factorization for Circuit Simulation on Vector Supercomputers (PS, VV), pp. 13–18.
DAC-1985-AshokCS #data flow #modelling #simulation #using
Modeling switch-level simulation using data flow (VA, RLC, PS), pp. 637–644.
CC-2016-DomagalaARS #scheduling
Register allocation and promotion through combined instruction scheduling and loop unrolling (LD, DvA, FR, PS), pp. 143–151.
CC-2016-RajbhandariKKPR #on the #recursion #traversal
On fusing recursive traversals of K-d trees (SR, JK, SK, LNP, FR, RJH, PS), pp. 152–162.
PLDI-2016-HongB0KPRRS #array #effectiveness #multi
Effective padding of multidimensional arrays to avoid cache conflict misses (CH, WB, AC0, SK, LNP, FR, JR, PS), pp. 129–144.
POPL-2016-BaoKPRS #named #source code #verification
PolyCheck: dynamic verification of iteration space transformations on affine programs (WB, SK, LNP, FR, PS), pp. 539–554.
PLDI-2018-HongSKRKPRS #analysis #gpu #kernel #optimisation #using
GPU code optimization using abstract kernel emulation and sensitivity analysis (CH, ASR, JK, PSR, SK, LNP, FR, PS), pp. 736–751.
POPL-2018-BaoKPS #behaviour #modelling #source code
Analytical modeling of cache behavior for affine programs (WB, SK, LNP, PS), p. 26.

Bibliography of Software Language Engineering in Generated Hypertext (BibSLEIGH) is created and maintained by Dr. Vadim Zaytsev.
Hosted as a part of SLEBOK on GitHub.