BibSLEIGH
BibSLEIGH corpus
BibSLEIGH tags
BibSLEIGH bundles
BibSLEIGH people
EDIT!
CC-BY
Open Knowledge
XHTML 1.0 W3C Rec
CSS 2.1 W3C CanRec
email twitter
Travelled to:
1 × Australia
1 × Finland
1 × Israel
1 × United Kingdom
5 × USA
Collaborated with:
D.Chakraborty S.Kalyanakrishnan D.Pardoe M.E.Taylor R.S.Sutton M.L.Littman J.Reisinger R.Miikkulainen A.Tewari P.Auer S.P.Singh N.K.Jong R.E.Schapire D.A.McAllester J.A.Csirik
Talks about:
learn (6) reinforc (3) select (3) transfer (2) bandit (2) boost (2) arm (2) uncertainti (1) represent (1) structur (1)

Person: Peter Stone

DBLP DBLP: Stone:Peter

Contributed to:

ICML 20122012
ICML 20112011
ICML 20102010
ICML 20082008
ICML 20072007
ICML 20032003
ICML 20022002
ICML 20012001
ICML 20002000

Wrote 11 papers:

ICML-2012-KalyanakrishnanTAS #multi #probability #set
PAC Subset Selection in Stochastic Multi-armed Bandits (SK, AT, PA, PS), p. 34.
ICML-2011-ChakrabortyS #learning
Structure Learning in Ergodic Factored MDPs without Knowledge of the Transition Function’s In-Degree (DC, PS), pp. 737–744.
ICML-2010-ChakrabortyS #convergence #learning #multi #safety
Convergence, Targeted Optimality, and Safety in Multiagent Learning (DC, PS), pp. 191–198.
ICML-2010-KalyanakrishnanS #multi #performance #theory and practice
Efficient Selection of Multiple Bandit Arms: Theory and Practice (SK, PS), pp. 511–518.
ICML-2010-PardoeS
Boosting for Regression Transfer (DP, PS), pp. 863–870.
ICML-2008-ReisingerSM #kernel #learning #online
Online kernel selection for Bayesian reinforcement learning (JR, PS, RM), pp. 816–823.
ICML-2007-TaylorS #learning
Cross-domain transfer for reinforcement learning (MET, PS), pp. 879–886.
ICML-2003-SinghLJPS #learning #predict
Learning Predictive State Representations (SPS, MLL, NKJ, DP, PS), pp. 712–719.
ICML-2002-SchapireSMLC #estimation #modelling #nondeterminism #using
Modeling Auction Price Uncertainty Using Boosting-based Conditional Density Estimation (RES, PS, DAM, MLL, JAC), pp. 546–553.
ICML-2001-StoneS #learning #scalability #towards
Scaling Reinforcement Learning toward RoboCup Soccer (PS, RSS), pp. 537–544.
ICML-2000-Stone #network
TPOT-RL Applied to Network Routing (PS), pp. 935–942.

Bibliography of Software Language Engineering in Generated Hypertext (BibSLEIGH) is created and maintained by Dr. Vadim Zaytsev.
Hosted as a part of SLEBOK on GitHub.