Travelled to:
1 × Canada
1 × United Kingdom
2 × USA
Collaborated with:
D.Silver J.O.Kephart M.Sridharan
Talks about:
learn (4) backgammon (2) connectionist (1) strategi (1) pricebot (1) competit (1) regress (1) converg (1) tempor (1) pseudo (1)

Person: Gerald Tesauro

DBLP: Tesauro:Gerald

Contributed to:

ICML 2009
ICML 2000
ML 1992
ML 1988

Wrote 5 papers:

ICML-2009-SilverT #monte carlo #simulation
Monte-Carlo simulation balancing (DS, GT), pp. 945–952.
ICML-2000-KephartT #pseudo
Pseudo-convergent Q-Learning by Competitive Pricebots (JOK, GT), pp. 463–470.
ICML-2000-SridharanT #automation #multi
Multi-agent Q-learning and Regression Trees for Automated Pricing Decisions (MS, GT), pp. 927–934.
ML-1992-Tesauro #difference #learning
Temporal Difference Learning of Backgammon Strategy (GT), pp. 451–457.
ML-1988-Tesauro #learning
Connectionist Learning of Expert Backgammon Evaluations (GT), pp. 200–206.

Bibliography of Software Language Engineering in Generated Hypertext (BibSLEIGH) is created and maintained by Dr. Vadim Zaytsev.
Hosted as a part of SLEBOK on GitHub.