Travelled to:
1 × France
1 × Germany
1 × Hungary
1 × The Netherlands
1 × United Kingdom
5 × USA
Collaborated with:
P.Sadayappan J.Lifflander L.Pouchet L.V.Kalé J.Ramanujam F.Rastello Wenlei Bao U.Bondhugula M.M.Baskaran A.Rountev S.Tavarageri W.Ma G.Agrawal J.Kim N.Ali N.Govind B.J.Palmer Changwan Hong B.Ren Y.Jo K.Agrawal M.Kulkarni V.A.Saraswat P.Kambadur S.B.Kodali D.Grove G.Khanna N.Vydyanathan Ü.V.Çatalyürek T.M.Kurç J.H.Saltz S.Rajbhandari R.J.Harrison A.C.0001 Aravind Sukumaran-Rajam P.S.Rawat
Talks about:
program (5) transform (3) parallel (3) automat (3) recurs (3) memori (3) steal (3) optim (3) model (3) cach (3)
Person: Sriram Krishnamoorthy
DBLP: Krishnamoorthy:Sriram
Contributed to:
Wrote 17 papers:
- PLDI-2015-RenJKAK #execution #hardware #performance #recursion #source code
- Efficient execution of recursive programs on commodity vector hardware (BR, YJ, SK, KA, MK), pp. 509–520.
- PLDI-2014-TavarageriKS #detection #fault #memory management
- Compiler-assisted detection of transient memory errors (ST, SK, PS), p. 24.
- PLDI-2013-LifflanderKK
- Steal Tree: low-overhead tracing of work stealing schedulers (JL, SK, LVK), pp. 507–518.
- HPDC-2012-LifflanderKK
- Work stealing and persistence-based load balancers for iterative overdecomposed applications (JL, SK, LVK), pp. 137–148.
- CC-2011-MaKA #memory management #multi
- Practical Loop Transformations for Tensor Contraction Expressions on Multi-level Memory Hierarchies (WM, SK, GA), pp. 266–285.
- PDP-2011-AliKGP #approach #communication #fault tolerance #modelling #programming #scalability
- A Redundant Communication Approach to Scalable Fault Tolerance in PGAS Programming Models (NA, SK, NG, BJP), pp. 24–31.
- PPoPP-2011-SaraswatKKGK
- Lifeline-based global load balancing (VAS, PK, SBK, DG, SK), pp. 201–212.
- CC-2008-BondhugulaBKRRS #automation #locality #optimisation #parallel
- Automatic Transformations for Communication-Minimized Parallelization and Locality Optimization in the Polyhedral Model (UB, MMB, SK, JR, AR, PS), pp. 132–146.
- PPoPP-2008-BaskaranBKRRS #architecture #automation #data flow #parallel
- Automatic data movement and computation mapping for multi-level parallel architectures with explicitly managed memories (MMB, UB, SK, JR, AR, PS), pp. 1–10.
- PLDI-2007-KrishnamoorthyBBRRS #automation #effectiveness #parallel
- Effective automatic parallelization of stencil computations (SK, MMB, UB, JR, AR, PS), pp. 235–244.
- HPDC-2006-KhannaVCKKSS #data-driven #replication #scheduling
- Task Scheduling and File Replication for Data-Intensive Jobs with Batch-shared I/O (GK, NV, ÜVÇ, TMK, SK, PS, JHS), pp. 241–252.
- CC-2016-RajbhandariKKPR #on the #recursion #traversal
- On fusing recursive traversals of K-d trees (SR, JK, SK, LNP, FR, RJH, PS), pp. 152–162.
- PLDI-2016-HongB0KPRRS #array #effectiveness #multi
- Effective padding of multidimensional arrays to avoid cache conflict misses (CH, WB, AC0, SK, LNP, FR, JR, PS), pp. 129–144.
- POPL-2016-BaoKPRS #named #source code #verification
- PolyCheck: dynamic verification of iteration space transformations on affine programs (WB, SK, LNP, FR, PS), pp. 539–554.
- PLDI-2017-LifflanderK #locality #optimisation #recursion #source code
- Cache locality optimization for recursive programs (JL, SK), pp. 1–16.
- PLDI-2018-HongSKRKPRS #analysis #gpu #kernel #optimisation #using
- GPU code optimization using abstract kernel emulation and sensitivity analysis (CH, ASR, JK, PSR, SK, LNP, FR, PS), pp. 736–751.
- POPL-2018-BaoKPS #behaviour #modelling #source code
- Analytical modeling of cache behavior for affine programs (WB, SK, LNP, PS), p. 26.