Learning Policies from Self-Play with Policy Gradients and MCTS Value Estimates

Dennis J. N. J. Soemers, Éric Piette, Matthew Stephenson, Cameron Browne
Learning Policies from Self-Play with Policy Gradients and MCTS Value Estimates
CoG, 2019.

@inproceedings{CoG-2019-SoemersPSB,
	author        = "Dennis J. N. J. Soemers and Éric Piette and Matthew Stephenson and Cameron Browne",
	booktitle     = "{Proceedings of the IEEE Conference on Games}",
	doi           = "10.1109/CIG.2019.8848037",
	isbn          = "978-1-7281-1884-0",
	pages         = "1--8",
	publisher     = "{IEEE}",
	title         = "{Learning Policies from Self-Play with Policy Gradients and MCTS Value Estimates}",
	year          = 2019,
}
