Travelled to:
1 × Canada
1 × China
1 × Cyprus
1 × Estonia
1 × Hungary
1 × India
1 × Romania
18 × USA
2 × France
2 × United Kingdom
3 × Germany
Collaborated with:
J.Ramanujam L.Pouchet S.Krishnamoorthy U.Bondhugula A.Rountev F.Rastello M.M.Baskaran N.Fauzia Wenlei Bao A.Cohen S.Tavarageri V.P.Krothapalli V.Visvanathan K.Stock A.Hartono R.Kettimuthu T.M.Kurç J.H.Saltz V.Elango T.Grosser J.Holewinski F.Franchetti T.Henretty X.Yang S.Parthasarathy G.Sabin V.Sahasrabudhe V.Ashok R.L.Costello M.Kong N.Vydyanathan Q.Lu M.Ravishankar G.Khanna Ü.V.Çatalyürek C.Lam G.Baumgartner R.J.Harrison J.Kim K.V.Valkenburgh D.Yan V.Subramani S.Srinivasan D.Ghosh S.K.Nandy K.Parthasarathy L.Domagala D.v.Amstel Changwan Hong S.Verdoolaege E.Park J.Cavazos R.Veras J.Lin X.Ding Z.Zhang X.Zhang A.Ashari S.Tatikonda M.Böhm B.Reinwald K.Campbell J.Keenleyside R.Dathathri J.Shirako K.Sharma V.Sarkar R.Ramamurthi C.Bastoul N.Vasilache I.T.Foster X.Gao S.K.Sahoo S.Rajbhandari D.Cociorva M.Nooijen D.E.Bernholdt A.C.0001 Aravind Sukumaran-Rajam P.S.Rawat V.S.Kumar G.Mehta K.Vahi E.Deelman V.Ratnakar J.Kim Y.Gil M.W.Hall
Talks about:
optim (11) data (8) parallel (7) automat (6) transform (5) schedul (5) program (5) comput (5) model (5) polyhedr (4)
Person: P. Sadayappan
DBLP: Sadayappan:P=
Facilitated 1 volumes:
Contributed to:
Wrote 41 papers:
- CGO-2015-FauziaPS #memory management
- Characterizing and enhancing global memory data coalescing on GPUs (NF, LNP, PS), pp. 12–22.
- POPL-2015-ElangoRPRS #complexity #data access #on the #source code
- On Characterizing the Data Access Complexity of Programs (VE, FR, LNP, JR, PS), pp. 567–580.
- PPoPP-2015-AshariTBRCKS #kernel #machine learning #on the #optimisation
- On optimizing machine learning workloads via kernel fusion (AA, ST, MB, BR, KC, JK, PS), pp. 173–182.
- PPoPP-2015-RavishankarDEPRRS #code generation #distributed #memory management
- Distributed memory code generation for mixed Irregular/Regular computations (MR, RD, VE, LNP, JR, AR, PS), pp. 65–75.
- CGO-2014-GrosserCHSV #hybrid
- Hybrid Hexagonal/Classical Tiling for GPUs (TG, AC, JH, PS, SV), p. 66.
- PLDI-2014-StockKGPRRS #framework #order #reuse
- A framework for enhancing data reuse via associative reordering (KS, MK, TG, LNP, FR, JR, PS), p. 10.
- PLDI-2014-TavarageriKS #detection #fault #memory management
- Compiler-assisted detection of transient memory errors (ST, SK, PS), p. 24.
- PLDI-2013-KongVSFPS #code generation
- When polyhedral transformations meet SIMD code generation (MK, RV, KS, FF, LNP, PS), pp. 127–138.
- CC-2012-ShirakoSFPRSS #bound
- Analytical Bounds for Optimal Tile Size Selection (JS, KS, NF, LNP, JR, PS, VS), pp. 101–121.
- PLDI-2012-HolewinskiRRFPRS #analysis
- Dynamic trace-based analysis of vectorization potential of applications (JH, RR, MR, NF, LNP, AR, PS), pp. 371–382.
- CC-2011-HenrettySPFRS #architecture #layout
- Data Layout Transformation for Stencil Computations on Short-Vector SIMD Architectures (TH, KS, LNP, FF, JR, PS), pp. 225–245.
- CGO-2011-ParkPCCS #modelling #optimisation #predict
- Predictive modeling in a polyhedral optimization space (EP, LNP, JC, AC, PS), pp. 119–129.
- POPL-2011-PouchetBBCRSV #optimisation
- Loop transformations: convexity, pruning and optimization (LNP, UB, CB, AC, JR, PS, NV), pp. 549–562.
- VLDB-2011-YangPS #graph #mining #multi #performance
- Fast Sparse Matrix-Vector Multiplication on GPUs: Implications for Graph Mining (XY, SP, PS), pp. 231–242.
- CC-2010-BaskaranRS #automation #code generation #source code
- Automatic C-to-CUDA Code Generation for Affine Programs (MMB, JR, PS), pp. 244–263.
- CGO-2010-BaskaranHTHRS #revisited
- Parameterized tiling revisited (MMB, AH, ST, TH, JR, PS), pp. 200–209.
- ICSM-2010-RountevVYS #comprehension #dependence #java #source code
- Understanding parallelism-inhibiting dependences in sequential Java programs (AR, KVV, DY, PS), pp. 1–9.
- HPDC-2009-KumarSMVDRKGHKS #framework #optimisation #workflow
- An integrated framework for performance-based optimization of scientific workflows (VSK, PS, GM, KV, ED, VR, JK, YG, MWH, TMK, JHS), pp. 177–186.
- PPoPP-2009-BaskaranVBRRS #effectiveness #manycore #parallel #scheduling
- Compiler-assisted dynamic scheduling for effective parallelization of loop nests on multicore processors (MMB, NV, UB, JR, AR, PS), pp. 219–228.
- CC-2008-BondhugulaBKRRS #automation #locality #optimisation #parallel
- Automatic Transformations for Communication-Minimized Parallelization and Locality Optimization in the Polyhedral Model (UB, MMB, SK, JR, AR, PS), pp. 132–146.
- HPCA-2008-LinLDZZS #clustering #manycore #simulation
- Gaining insights into multicore cache partitioning: Bridging the gap between simulation and real systems (JL, QL, XD, ZZ, XZ, PS), pp. 367–378.
- HPDC-2008-KhannaCKSSKF #data transfer #multi #network #optimisation #using
- Multi-hop path splitting and multi-pathing optimizations for data transfers over shared wide-area networks using gridFTP (GK, ÜVÇ, TMK, PS, JHS, RK, ITF), pp. 225–226.
- PLDI-2008-BondhugulaHRS #automation #locality
- A practical automatic polyhedral parallelizer and locality optimizer (UB, AH, JR, PS), pp. 101–113.
- PPoPP-2008-BaskaranBKRRS #architecture #automation #data flow #parallel
- Automatic data movement and computation mapping for multi-level parallel architectures with explicitly managed memories (MMB, UB, SK, JR, AR, PS), pp. 1–10.
- PLDI-2007-KrishnamoorthyBBRRS #automation #effectiveness #parallel
- Effective automatic parallelization of stencil computations (SK, MMB, UB, JR, AR, PS), pp. 235–244.
- PPoPP-2007-BondhugulaRS #automation
- Automatic mapping of nested loops to FPGAS (UB, JR, PS), pp. 101–111.
- HPDC-2006-KhannaVCKKSS #data-driven #replication #scheduling
- Task Scheduling and File Replication for Data-Intensive Jobs with Batch-shared I/O (GK, NV, ÜVÇ, TMK, SK, PS, JHS), pp. 241–252.
- HPDC-2005-SabinSS #assessment #multi
- Assessment and enhancement of meta-schedulers for multi-site job sharing (GS, VS, PS), pp. 144–153.
- PPoPP-2005-GaoSLRLBS #modelling #optimisation #parallel #performance
- Performance modeling and optimization of parallel out-of-core tensor contractions (XG, SKS, CCL, JR, QL, GB, PS), pp. 266–276.
- HPDC-2002-SubramaniKSS #distributed #multi #scheduling #using
- Distributed Job Scheduling on Computational Grids Using Multiple Simultaneous Requests (VS, RK, SS, PS), p. 359–?.
- PLDI-2002-CociorvaBLSRNBH #optimisation #trade-off
- Space-Time Trade-Off Optimization for a Class of Electronic Structure Calculations (DC, GB, CCL, PS, JR, MN, DEB, RJH), pp. 177–186.
- DAC-1993-GhoshNSP #architecture #multi #synthesis
- Architectural Synthesis of Performance-Driven Multipliers with Accumulator Interleaving (DG, SKN, PS, KP), pp. 303–307.
- PPoPP-1991-KrothapalliS #constant #dependence
- Removal of Redundant Dependences in DOACROSS Lops with Constant Dependences (VPK, PS), pp. 51–60.
- DAC-1989-SadayappanV #matrix #performance #simulation
- Efficient Sparse Matrix Factorization for Circuit Simulation on Vector Supercomputers (PS, VV), pp. 13–18.
- DAC-1985-AshokCS #data flow #modelling #simulation #using
- Modeling switch-level simulation using data flow (VA, RLC, PS), pp. 637–644.
- CC-2016-DomagalaARS #scheduling
- Register allocation and promotion through combined instruction scheduling and loop unrolling (LD, DvA, FR, PS), pp. 143–151.
- CC-2016-RajbhandariKKPR #on the #recursion #traversal
- On fusing recursive traversals of K-d trees (SR, JK, SK, LNP, FR, RJH, PS), pp. 152–162.
- PLDI-2016-HongB0KPRRS #array #effectiveness #multi
- Effective padding of multidimensional arrays to avoid cache conflict misses (CH, WB, AC0, SK, LNP, FR, JR, PS), pp. 129–144.
- POPL-2016-BaoKPRS #named #source code #verification
- PolyCheck: dynamic verification of iteration space transformations on affine programs (WB, SK, LNP, FR, PS), pp. 539–554.
- PLDI-2018-HongSKRKPRS #analysis #gpu #kernel #optimisation #using
- GPU code optimization using abstract kernel emulation and sensitivity analysis (CH, ASR, JK, PSR, SK, LNP, FR, PS), pp. 736–751.
- POPL-2018-BaoKPS #behaviour #modelling #source code
- Analytical modeling of cache behavior for affine programs (WB, SK, LNP, PS), p. 26.