Devesh Tiwari, Saurabh Gupta, James H. Rogers, Don Maxwell, Paolo Rech, Sudharshan S. Vazhkudai, Daniel A. G. de Oliveira, Dave Londo, Nathan DeBardeleben, Philippe Olivier Alexandre Navaux, Luigi Carro, Arthur S. Bland
Understanding GPU errors on large-scale HPC systems and the implications for system design and operation
HPCA, 2015.
@inproceedings{HPCA-2015-TiwariGRMRVOLDN,
	author        = "Devesh Tiwari and Saurabh Gupta and James H. Rogers and Don Maxwell and Paolo Rech and Sudharshan S. Vazhkudai and Daniel A. G. de Oliveira and Dave Londo and Nathan DeBardeleben and Philippe Olivier Alexandre Navaux and Luigi Carro and Arthur S. Bland",
	booktitle     = "{Proceedings of the 21st International Symposium on High-Performance Computer Architecture}",
	doi           = "10.1109/HPCA.2015.7056044",
	isbn          = "978-1-4799-8930-0",
	pages         = "331--342",
	publisher     = "{IEEE}",
	title         = "{Understanding GPU errors on large-scale HPC systems and the implications for system design and operation}",
	year          = 2015,
}