Travelled to:
1 × Australia
1 × Finland
1 × Israel
1 × United Kingdom
5 × USA
Collaborated with:
D.Chakraborty S.Kalyanakrishnan D.Pardoe M.E.Taylor R.S.Sutton ∅ M.L.Littman J.Reisinger R.Miikkulainen A.Tewari P.Auer S.P.Singh N.K.Jong R.E.Schapire D.A.McAllester J.A.Csirik
Talks about:
learn (6) reinforc (3) select (3) transfer (2) bandit (2) boost (2) arm (2) uncertainti (1) represent (1) structur (1)
Person: Peter Stone
DBLP: Stone:Peter
Contributed to:
Wrote 11 papers:
- ICML-2012-KalyanakrishnanTAS #multi #probability #set
- PAC Subset Selection in Stochastic Multi-armed Bandits (SK, AT, PA, PS), p. 34.
- ICML-2011-ChakrabortyS #learning
- Structure Learning in Ergodic Factored MDPs without Knowledge of the Transition Function’s In-Degree (DC, PS), pp. 737–744.
- ICML-2010-ChakrabortyS #convergence #learning #multi #safety
- Convergence, Targeted Optimality, and Safety in Multiagent Learning (DC, PS), pp. 191–198.
- ICML-2010-KalyanakrishnanS #multi #performance #theory and practice
- Efficient Selection of Multiple Bandit Arms: Theory and Practice (SK, PS), pp. 511–518.
- ICML-2010-PardoeS
- Boosting for Regression Transfer (DP, PS), pp. 863–870.
- ICML-2008-ReisingerSM #kernel #learning #online
- Online kernel selection for Bayesian reinforcement learning (JR, PS, RM), pp. 816–823.
- ICML-2007-TaylorS #learning
- Cross-domain transfer for reinforcement learning (MET, PS), pp. 879–886.
- ICML-2003-SinghLJPS #learning #predict
- Learning Predictive State Representations (SPS, MLL, NKJ, DP, PS), pp. 712–719.
- ICML-2002-SchapireSMLC #estimation #modelling #nondeterminism #using
- Modeling Auction Price Uncertainty Using Boosting-based Conditional Density Estimation (RES, PS, DAM, MLL, JAC), pp. 546–553.
- ICML-2001-StoneS #learning #scalability #towards
- Scaling Reinforcement Learning toward RoboCup Soccer (PS, RSS), pp. 537–544.
- ICML-2000-Stone #network
- TPOT-RL Applied to Network Routing (PS), pp. 935–942.