Collaborated with:
Bilal Kartal M.E.Taylor Chao Gao
Talks about:
reinforc (4) learn (4) deep (3) auxiliari (2) task (2) pommerman (1) predict (1) guidanc (1) termin (1) explor (1)
Person: Pablo Hernandez-Leal
DBLP: Hernandez-Leal:Pablo
Contributed to:
Wrote 4 papers:
- AIIDE-2019-GaoKHT #case study #learning #on the
- On Hard Exploration for Reinforcement Learning: A Case Study in Pommerman (CG, BK, PHL, MET), pp. 24–30.
- AIIDE-2019-Hernandez-LealK #learning #modelling
- Agent Modeling as Auxiliary Task for Deep Reinforcement Learning (PHL, BK, MET), pp. 31–37.
- AIIDE-2019-KartalHT #learning #predict
- Terminal Prediction as an Auxiliary Task for Deep Reinforcement Learning (BK, PHL, MET), pp. 38–44.
- AIIDE-2019-KartalHT19a #learning
- Action Guidance with MCTS for Deep Reinforcement Learning (BK, PHL, MET), pp. 153–159.