BibSLEIGH corpus
BibSLEIGH tags
BibSLEIGH bundles
BibSLEIGH people
Open Knowledge
XHTML 1.0 W3C Rec
CSS 2.1 W3C CanRec
email twitter
Travelled to:
1 × China
1 × Finland
1 × France
1 × United Kingdom
2 × Canada
2 × USA
Collaborated with:
R.S.Sutton K.Ciosek G.Tesauro S.Gelly J.Heinrich M.Lanctot M.Müller A.Koop T.Schaul D.Horgan K.Gregor J.Vines P.C.Wright M.Winchcombe P.Olivier L.Newnham D.Barker S.Weller J.McFall G.Lever N.Heess T.Degris D.Wierstra M.A.Riedmiller H.R.Maei D.Precup S.Bhatnagar C.Szepesvári E.Wiewiora
Talks about:
learn (3) knowledg (2) gradient (2) function (2) approxim (2) determinist (1) stationari (1) transient (1) technolog (1) algorithm (1)

Person: David Silver

DBLP DBLP: Silver:David

Contributed to:

CSCW 20152015
ICML 20152015
ICML c1 20142014
ICML c3 20132013
ICML 20122012
ICML 20092009
ICML 20082008
ICML 20072007
AIIDE 20052005

Wrote 12 papers:

CSCW-2015-VinesWSWO #authentication #collaboration #information management
Authenticity, Relatability and Collaborative Approaches to Sharing Knowledge about Assistive Living Technology (JV, PCW, DS, MW, PO), pp. 82–94.
ICML-2015-HeinrichLS #game studies #self
Fictitious Self-Play in Extensive-Form Games (JH, ML, DS), pp. 805–813.
ICML-2015-SchaulHGS #approximate
Universal Value Function Approximators (TS, DH, KG, DS), pp. 1312–1320.
ICML-c1-2014-SilverLHDWR #algorithm #policy
Deterministic Policy Gradient Algorithms (DS, GL, NH, TD, DW, MAR), pp. 387–395.
ICML-c3-2013-SilverNBWM #concurrent #interactive #learning
Concurrent Reinforcement Learning from Customer Interactions (DS, LN, DB, SW, JM), pp. 924–932.
ICML-2012-SilverC #composition #modelling #using
Compositional Planning Using Optimal Option Models (DS, KC), p. 165.
ICML-2009-SilverT #monte carlo #simulation
Monte-Carlo simulation balancing (DS, GT), pp. 945–952.
ICML-2009-SuttonMPBSSW #approximate #learning #linear #performance
Fast gradient-descent methods for temporal-difference learning with linear function approximation (RSS, HRM, DP, SB, DS, CS, EW), pp. 993–1000.
ICML-2008-SilverSM #learning
Sample-based learning and search with permanent and transient memories (DS, RSS, MM), pp. 968–975.
ICML-2007-GellyS #online
Combining online and offline knowledge in UCT (SG, DS), pp. 273–280.
ICML-2007-SuttonKS #on the
On the role of tracking in stationary environments (RSS, AK, DS), pp. 871–878.
Cooperative Pathfinding (DS), pp. 117–122.

Bibliography of Software Language Engineering in Generated Hypertext (BibSLEIGH) is created and maintained by Dr. Vadim Zaytsev.
Hosted as a part of SLEBOK on GitHub.