Travelled to:
1 × Australia
1 × China
1 × France
1 × Germany
1 × Mexico
1 × Turkey
1 × United Kingdom
2 × India
20 × USA
Collaborated with:
W.Ahn Z.Zhang A.Ansari N.Honarmand Y.Duan T.Shull L.Rauchwerger L.Yang C.Xia P.Montesinos U.R.Karpuzcu Y.Zhang L.Ceze A.Agrawal X.Qian B.Sahelices J.Choi ∅ J.F.Martínez M.H.Cintra S.Basu B.Greskamp S.Qi A.Muzahid S.Abadal S.T.King B.Gropp V.Sarkar J.H.Moreno K.Olukotun J.Tuck M.J.Garzarán J.Renau J.Lee Y.Solihin A.Nguyen Y.Kang T.S.Huang A.Raynaud R.L.Daigle A.Gupta J.L.Hennessy Jian Huang 0006 Tanmay Gangwani Adam Morrison 0001 A.K.Mishra J.Xu D.A.Padua D.Qian P.Jain A.A.Sinkar N.S.Kim X.Zhou M.Hicks C.v.Praun J.Nakano K.Gharachorloo P.Trancoso J.Larriba-Pey A.Cabellos-Aparicio E.Alarcón Dimitrios Skarlatos Apostolos Kokolis T.Xu S.Feng S.Gupta S.A.Mahlke N.Otsuki L.O.Nogueira B.B.Fraguela P.Feautrier Vimuth Fernando Antonio Franques S.Misailovic N.Dautenhahn G.Pokam C.Pereira M.Prvulovic J.M.Llabería V.Viñals L.Wan J.J.Cook D.Chen C.B.Zilles W.Liu K.Strauss Zhangxiaowen Gong Zhi Chen 0001 Justin Josef Szaday David C. Wong 0001 Z.Sura Neftali Watkinson S.Maleki A.V.Veidenbaum A.Nicolau N.P.Carter S.Borkar R.Cledat H.David D.Dunning J.B.Fryman I.Ganev R.A.Golliver R.C.Knauerhase R.Lethin B.Meister W.R.Pinfold J.Teller N.Vasilache G.Venkatesh
Talks about:
multiprocessor (12) memori (12) perform (9) specul (9) parallel (8) architectur (6) cach (6) synchron (5) hardwar (5) system (5)
Person: Josep Torrellas
DBLP: Torrellas:Josep
Facilitated 1 volumes:
Contributed to:
Wrote 50 papers:
- ASPLOS-2015-DuanHT #memory management #optimisation #performance #symmetry
- Asymmetric Memory Fences: Optimizing Both Performance and Implementability (YD, NH, JT), pp. 531–543.
- ASPLOS-2014-HonarmandT #multi #named
- RelaxReplay: record and replay for relaxed-consistency multiprocessors (NH, JT), pp. 223–238.
- DATE-2014-Torrellas #architecture #energy #performance
- Extreme-scale computer architecture: Energy efficiency from the ground up‡ (JT), pp. 1–5.
- HPCA-2014-AgrawalAT #energy #locality #named #process
- Mosaic: Exploiting the spatial locality of process variation to reduce refresh energy in on-chip eDRAM modules (AA, AA, JT), pp. 84–95.
- HPCA-2014-AnsariMXT #energy #named #network
- Tangle: Route-oriented dynamic voltage minimization for variation-afflicted, energy-efficient on-chip networks (AA, AKM, JX, JT), pp. 440–451.
- HPCA-2014-QiMAT #detection
- Dynamically detecting and tolerating IF-Condition Data Races (SQ, AM, WA, JT), pp. 120–131.
- PLDI-2014-AhnCSGT #javascript #performance #type system
- Improving JavaScript performance by deconstructing the type system (WA, JC, TS, MJG, JT), p. 51.
- ASPLOS-2013-AhnDT #alias #named #using
- DeAliaser: alias speculation using atomic region support (WA, YD, JT), pp. 167–180.
- ASPLOS-2013-HonarmandDTKPP #named #parallel
- Cyrus: unintrusive application-level record-replay for replay parallelism (NH, ND, JT, STK, GP, CP), pp. 193–206.
- ASPLOS-2013-QianTSQ #consistency #detection #named #precise #scalability
- Volition: scalable and precise sequential consistency violation detection (XQ, JT, BS, DQ), pp. 535–548.
- HPCA-2013-AgrawalJAT #multi #named
- Refrint: Intelligent refresh to minimize power in on-chip multiprocessor cache hierarchies (AA, PJ, AA, JT), pp. 400–411.
- HPCA-2013-AnsariFGTM #lightweight #named
- Illusionist: Transforming lightweight cores into aggressive cores on demand (AA, SF, SG, JT, SAM), pp. 436–447.
- HPCA-2013-CarterABCDDFGGKLMMPTTVVX #architecture #named #ubiquitous
- Runnemede: An architecture for Ubiquitous High-Performance Computing (NPC, AA, SB, RC, HD, DD, JBF, IG, RAG, RCK, RL, BM, AKM, WRP, JT, JT, NV, GV, JX), pp. 198–209.
- HPCA-2013-KarpuzcuSKT #energy #named #towards
- EnergySmart: Toward energy-efficient manycores for Near-Threshold Computing (URK, AAS, NSK, JT), pp. 542–553.
- HPCA-2012-DuanZAT #commit #execution #named
- BulkCompactor: Optimized deterministic execution via Conflict-Aware commit of atomic blocks (YD, XZ, WA, JT), pp. 361–372.
- HPCA-2012-QiONMT #hardware #named #symmetry
- Pacman: Tolerating asymmetric data races with unintrusive hardware (SQ, NO, LON, AM, JT), pp. 349–360.
- HPCA-2012-QianST #design #execution #named #smt
- BulkSMT: Designing SMT processors for atomic-block execution (XQ, BS, JT), pp. 153–164.
- HPCA-2010-GreskampKT #configuration management #multi #named #performance #thread
- LeadOut: Composing low-overhead frequency-enhancing techniques for single-thread performance in configurable multicores (BG, URK, JT), pp. 1–12.
- HPCA-2010-TorrellasGSMO #challenge
- Extreme scale computing: Challenges and opportunities (JT, BG, VS, JHM, KO), p. 1.
- PPoPP-2010-TorrellasGMOS #challenge
- Extreme scale computing: challenges and opportunities (JT, BG, JHM, KO, VS), pp. 101–102.
- ASPLOS-2009-MontesinosHKT #interface #multi #named
- Capo: a software-hardware interface for practical deterministic multiprocessor replay (PM, MH, STK, JT), pp. 73–84.
- HPCA-2009-GreskampWKCTCZ #design #named
- Blueshift: Designing processors for timing speculation from the ground up (BG, LW, URK, JJC, JT, DC, CBZ), pp. 213–224.
- ASPLOS-2008-TuckACT #analysis #hardware #named #optimisation
- SoftSig: software-exposed hardware signatures for code analysis and optimization (JT, WA, LC, JT), pp. 145–156.
- HPCA-2007-CezeMPT #architecture #named
- Colorama: Architectural Support for Data-Centric Synchronization (LC, PM, CvP, JT), pp. 133–144.
- HPCA-2006-NakanoMGT #named #performance
- ReViveI/O: efficient handling of I/O in highly-available rollback-recovery servers (JN, PM, KG, JT), pp. 200–211.
- PPoPP-2006-LiuTCASRT #compilation #named
- POSH: a TLS compiler that exploits program structure (WL, JT, LC, WA, KS, JR, JT), pp. 158–167.
- HPCA-2003-GarzaranPLVRT #concurrent #memory management #multi #thread #trade-off
- Tradeoffs in Buffering Memory State for Thread-Level Speculation in Multiprocessors (MJG, MP, JML, VV, LR, JT), pp. 191–202.
- PPoPP-2003-FraguelaRFPT #memory management #parallel #programming
- Programming the FlexRAM parallel intelligent memory system (BBF, JR, PF, DAP, JT), pp. 49–60.
- ASPLOS-2002-MartinezT #concurrent #parallel #thread
- Speculative synchronization: applying thread-level speculation to explicitly parallel applications (JFM, JT), pp. 18–29.
- HPCA-2002-CintraT #learning #parallel #thread
- Speculative Multithreading Eliminating Squashes through Learning Cross-Thread Violations in Speculative Parallelization for Multiprocessors (MHC, JT), pp. 43–54.
- HPCA-2001-LeeST #architecture #automation #memory management
- Automatically Mapping Code on an Intelligent Memory Architecture (JL, YS, JT), pp. 121–132.
- HPCA-2000-TorrellasYN #effectiveness #integration #towards
- Toward a Cost-Effective DSM Organization That Exploits Processor-Memory Integration (JT, LY, ATN), pp. 15–25.
- HPCA-1999-ZhangRT #hardware #parallel
- Hardware for Speculative Parallelization of Partially-Parallel Loops in DSM Multiprocessors (YZ, LR, JT), pp. 135–139.
- HPCA-1998-BasuT #memory management #multi
- Enhancing Memory Use in Simple Coma: Multiplexed Simple Coma (SB, JT), pp. 152–161.
- HPCA-1998-ZhangRT #distributed #hardware #parallel #runtime
- Hardware for Speculative Run-Time Parallelization in Distributed Shared-Memory Multiprocessors (YZ, LR, JT), pp. 162–173.
- ICPR-1998-KangTH #analysis #architecture #image #pattern matching #pattern recognition #recognition
- An IRAM architecture for image analysis and pattern recognition (YK, JT, TSH), pp. 1561–1564.
- HPCA-1997-TrancosoLZT #memory management #multi #performance
- The Memory Performance of DSS Commercial Workloads in Shared-Memory Multiprocessors (PT, JLLP, ZZ, JT), pp. 250–260.
- HPCA-1997-YangT #memory management #multi
- Speeding up the Memory Hierarchy in Flat COMA Multiprocessors (LY, JT), pp. 4–13.
- HPCA-1997-ZhangT
- Reducing Remote Conflict Misses: NUMA with Remote Cache versus COMA (ZZ, JT), pp. 272–281.
- HPCA-1996-RaynaudZT #adaptation #multi #protocol #scalability
- Distance-Adaptive Update Protocols for Scalable Shared-Memory Multiprocessors (AR, ZZ, JT), pp. 323–334.
- HPCA-1996-XiaT #multi #operating system #performance
- Improving the Data Cache Performance of Multiprocessor Operating Systems (CX, JT), pp. 85–94.
- HPCA-1995-TorrellasXD #operating system #optimisation #performance
- Optimizing Instruction Cache Performance for Operating System Intensive Workloads (JT, CX, RLD), pp. 360–369.
- ASPLOS-1992-TorrellasGH #multi #operating system #performance
- Characterizing the Caching and Synchronization Performance of a Multiprocessor Operating System (JT, AG, JLH), pp. 162–174.
- OOPSLA-2018-GongCS0SWMPVNT #compilation #empirical
- An empirical study of the effect of source-level loop transformations on compiler stability (ZG, ZC0, JJS, DCW0, ZS, NW, SM, DAP, AVV, AN, JT), p. 29.
- PLDI-2019-ChoiST #javascript #performance #reuse
- Reusable inline caching for JavaScript performance (JC, TS, JT), pp. 889–901.
- PLDI-2019-ShullHT #framework #java #named #reachability
- AutoPersist: an easy-to-use Java NVM framework based on reachability (TS, JH0, JT), pp. 316–332.
- ASPLOS-2016-AbadalCAT #architecture #communication #named #performance
- WiSync: An Architecture for Fast Synchronization through On-Chip Wireless Communication (SA, ACA, EA, JT), pp. 3–17.
- ASPLOS-2016-GangwaniMT #manycore #named
- CASPAR: Breaking Serialization in Lock-Free Multicore Synchronization (TG, AM0, JT), pp. 789–804.
- ASPLOS-2019-FernandoFAMT #approximate #manycore #named
- Replica: A Wireless Manycore for Communication-Intensive and Approximate Data (VF, AF, SA, SM, JT), pp. 849–863.
- ASPLOS-2020-SkarlatosKXT #memory management #parallel
- Elastic Cuckoo Page Tables: Rethinking Virtual Memory Translation for Parallelism (DS, AK, TX, JT), pp. 1093–1108.