Travelled to:
1 × China
1 × United Kingdom
Collaborated with:
A. Lazaric, E. Brunskill, R. Munos, B. Kappen
Talks about:
stochast (1) reinforc (1) feedback (1) generat (1) complex (1) correl (1) bandit (1) under (1) sampl (1) optim (1)
Person: Mohammad Gheshlaghi Azar
DBLP: Azar:Mohammad_Gheshlaghi
Wrote 2 papers:
- ICML-c2-2014-AzarLB #correlation #feedback #online #optimisation #probability
- Online Stochastic Optimization under Correlated Bandit Feedback (MGA, AL, EB), pp. 1557–1565.
- ICML-2012-AzarMK #complexity #generative #learning #on the
- On the Sample Complexity of Reinforcement Learning with a Generative Model (MGA, RM, BK), p. 222.