BibSLEIGH — Gerald_Tesauro

BibSLEIGH

BibSLEIGH corpus

BibSLEIGH tags

BibSLEIGH bundles

BibSLEIGH people

EDIT!

CC-BY

Open Knowledge

XHTML 1.0 W3C Rec

CSS 2.1 W3C CanRec

email

Travelled to:

1 × Canada
1 × United Kingdom
2 × USA

Collaborated with:

∅ D.Silver J.O.Kephart M.Sridharan

Talks about:

learn (4) backgammon (2) connectionist (1) strategi (1) pricebot (1) competit (1) regress (1) converg (1) tempor (1) pseudo (1)

Person: Gerald Tesauro

DBLP: Tesauro:Gerald

Contributed to:

ICML 2009

2009

ICML 2000

2000

ML 1992

1992

ML 1988

1988

Wrote 5 papers:

ICML-2009-SilverT #monte carlo #simulation: Monte-Carlo simulation balancing (DS, GT), pp. 945–952.
ICML-2000-KephartT #pseudo: Pseudo-convergent Q-Learning by Competitive Pricebots (JOK, GT), pp. 463–470.
ICML-2000-SridharanT #automation #multi: Multi-agent Q-learning and Regression Trees for Automated Pricing Decisions (MS, GT), pp. 927–934.
ML-1992-Tesauro #difference #learning: Temporal Difference Learning of Backgammon Strategy (GT), pp. 451–457.
ML-1988-Tesauro #learning: Connectionist Learning of Expert Backgammon Evaluations (GT), pp. 200–206.

Bibliography of Software Language Engineering in Generated Hypertext (BibSLEIGH) is created and maintained by Dr. Vadim Zaytsev.
Hosted as a part of SLEBOK on GitHub.