BibSLEIGH
BibSLEIGH corpus
BibSLEIGH tags
BibSLEIGH bundles
BibSLEIGH people
EDIT!
CC-BY
Open Knowledge
XHTML 1.0 W3C Rec
CSS 2.1 W3C CanRec
email twitter
Travelled to:
1 × Slovenia
4 × USA
Collaborated with:
S.Kobayashi S.Katayama K.Miyazaki M.Yamamura
Talks about:
learn (5) reinforc (3) function (3) control (2) linear (2) use (2) imperfect (1) algorithm (1) stochast (1) discount (1)

Person: Hajime Kimura

DBLP DBLP: Kimura:Hajime

Contributed to:

ICML 20002000
ICML 19991999
ICML 19981998
ICML 19971997
ICML 19951995

Wrote 5 papers:

ICML-2000-KatayamaKK #learning #using
A Universal Generalization for Temporal-Difference Learning Using Haar Basis Functions (SK, HK, SK), pp. 447–454.
ICML-1999-KimuraK #linear #performance
Efficient Non-Linear Control by Combining Q-learning with Local Linear Controllers (HK, SK), pp. 210–219.
ICML-1998-KimuraK #algorithm #analysis #learning #using
An Analysis of Actor/Critic Algorithms Using Eligibility Traces: Reinforcement Learning with Imperfect Value Function (HK, SK), pp. 278–286.
ICML-1997-KimuraMK #approximate #learning
Reinforcement Learning in POMDPs with Function Approximation (HK, KM, SK), pp. 152–160.
ICML-1995-KimuraYK #learning #probability
Reinforcement Learning by Stochastic Hill Climbing on Discounted Reward (HK, MY, SK), pp. 295–303.

Bibliography of Software Language Engineering in Generated Hypertext (BibSLEIGH) is created and maintained by Dr. Vadim Zaytsev.
Hosted as a part of SLEBOK on GitHub.