Travelled to:
1 × United Kingdom
3 × USA
Collaborated with:
A.Tamar S.Mannor A.Hallak R.Meir
Talks about:
model (3) varianc (2) markovian (1) algorithm (1) knowledg (1) gradient (1) criteria (1) process (1) partial (1) tempor (1)
Person: Dotan Di Castro
DBLP: Castro:Dotan_Di
Contributed to:
Wrote 4 papers:
- ICML-c3-2013-TamarCM #difference
- Temporal Difference Methods for the Variance of the Reward To Go (AT, DDC, SM), pp. 495–503.
- KDD-2013-HallakCM #markov #process
- Model selection in markovian processes (AH, DDC, SM), pp. 374–382.
- ICML-2012-CastroTM #policy
- Policy Gradients with Variance Related Risk Criteria (DDC, AT, SM), p. 215.
- ICML-2011-TamarCM #algorithm
- Integrating Partial Model Knowledge in Model Free RL Algorithms (AT, DDC, RM), pp. 305–312.