BibSLEIGH

Collaborated with:
Bilal Kartal, M.E.Taylor, Chao Gao
Talks about:
reinforc (4), learn (4), deep (3), auxiliari (2), task (2), pommerman (1), predict (1), guidanc (1), termin (1), explor (1)

Person: Pablo Hernandez-Leal

DBLP: Hernandez-Leal:Pablo

Contributed to:

AIIDE 2019

Wrote 4 papers:

AIIDE-2019-GaoKHT #case study #learning #on the
On Hard Exploration for Reinforcement Learning: A Case Study in Pommerman (CG, BK, PHL, MET), pp. 24–30.
AIIDE-2019-Hernandez-LealK #learning #modelling
Agent Modeling as Auxiliary Task for Deep Reinforcement Learning (PHL, BK, MET), pp. 31–37.
AIIDE-2019-KartalHT #learning #predict
Terminal Prediction as an Auxiliary Task for Deep Reinforcement Learning (BK, PHL, MET), pp. 38–44.
AIIDE-2019-KartalHT19a #learning
Action Guidance with MCTS for Deep Reinforcement Learning (BK, PHL, MET), pp. 153–159.

Bibliography of Software Language Engineering in Generated Hypertext (BibSLEIGH) is created and maintained by Dr. Vadim Zaytsev.
Hosted as a part of SLEBOK on GitHub.