Travelled to:
1 × China
1 × Finland
1 × France
1 × United Kingdom
2 × Canada
2 × USA
Collaborated with:
R.S.Sutton K.Ciosek G.Tesauro S.Gelly ∅ J.Heinrich M.Lanctot M.Müller A.Koop T.Schaul D.Horgan K.Gregor J.Vines P.C.Wright M.Winchcombe P.Olivier L.Newnham D.Barker S.Weller J.McFall G.Lever N.Heess T.Degris D.Wierstra M.A.Riedmiller H.R.Maei D.Precup S.Bhatnagar C.Szepesvári E.Wiewiora
Talks about:
learn (3) knowledg (2) gradient (2) function (2) approxim (2) determinist (1) stationari (1) transient (1) technolog (1) algorithm (1)
Person: David Silver
DBLP: Silver:David
Contributed to:
Wrote 12 papers:
- CSCW-2015-VinesWSWO #authentication #collaboration #information management
- Authenticity, Relatability and Collaborative Approaches to Sharing Knowledge about Assistive Living Technology (JV, PCW, DS, MW, PO), pp. 82–94.
- ICML-2015-HeinrichLS #game studies #self
- Fictitious Self-Play in Extensive-Form Games (JH, ML, DS), pp. 805–813.
- ICML-2015-SchaulHGS #approximate
- Universal Value Function Approximators (TS, DH, KG, DS), pp. 1312–1320.
- ICML-c1-2014-SilverLHDWR #algorithm #policy
- Deterministic Policy Gradient Algorithms (DS, GL, NH, TD, DW, MAR), pp. 387–395.
- ICML-c3-2013-SilverNBWM #concurrent #interactive #learning
- Concurrent Reinforcement Learning from Customer Interactions (DS, LN, DB, SW, JM), pp. 924–932.
- ICML-2012-SilverC #composition #modelling #using
- Compositional Planning Using Optimal Option Models (DS, KC), p. 165.
- ICML-2009-SilverT #monte carlo #simulation
- Monte-Carlo simulation balancing (DS, GT), pp. 945–952.
- ICML-2009-SuttonMPBSSW #approximate #learning #linear #performance
- Fast gradient-descent methods for temporal-difference learning with linear function approximation (RSS, HRM, DP, SB, DS, CS, EW), pp. 993–1000.
- ICML-2008-SilverSM #learning
- Sample-based learning and search with permanent and transient memories (DS, RSS, MM), pp. 968–975.
- ICML-2007-GellyS #online
- Combining online and offline knowledge in UCT (SG, DS), pp. 273–280.
- ICML-2007-SuttonKS #on the
- On the role of tracking in stationary environments (RSS, AK, DS), pp. 871–878.
- AIIDE-2005-Silver
- Cooperative Pathfinding (DS), pp. 117–122.