BibSLEIGH — Lihong_Li

BibSLEIGH

BibSLEIGH corpus

BibSLEIGH tags

BibSLEIGH bundles

BibSLEIGH people

EDIT!

CC-BY

Open Knowledge

XHTML 1.0 W3C Rec

CSS 2.1 W3C CanRec

email

Travelled to:

1 × China
1 × Finland
2 × Canada
4 × USA

Collaborated with:

M.L.Littman J.Langford E.Brunskill ∅ R.Parr C.Painter-Wakefield W.Chu M.Dudík C.Diuk B.R.Leffler T.J.Walsh M.Zinkevich A.Thomas B.L.Tseng G.Taylor A.L.Strehl E.Wiewiora A.Agarwal D.Hsu S.Kale R.E.Schapire T.Moon C.Liao Z.Zheng Y.Chang

Talks about:

learn (9) reinforc (4) function (3) approxim (3) linear (3) featur (3) select (2) onlin (2) model (2) valu (2)

Person: Lihong Li

DBLP: Li:Lihong

Contributed to:

ICML c2 2014

2014

ICML 2011

2011

KDD 2011

2011

CIKM 2010

2010

ICML 2009

2009

ICML 2008

2008

ICML 2007

2007

ICML 2006

2006

Wrote 11 papers:

ICML-c2-2014-AgarwalHKLLS #algorithm #performance: Taming the Monster: A Fast and Simple Algorithm for Contextual Bandits (AA, DH, SK, JL, LL, RES), pp. 1638–1646.
ICML-c2-2014-BrunskillL #learning: PAC-inspired Option Discovery in Lifelong Reinforcement Learning (EB, LL), pp. 316–324.
ICML-2011-DudikLL #evaluation #learning #policy #robust: Doubly Robust Policy Evaluation and Learning (MD, JL, LL), pp. 1097–1104.
KDD-2011-ChuZLTT #data type #learning #online: Unbiased online active learning in data streams (WC, MZ, LL, AT, BLT), pp. 195–203.
CIKM-2010-MoonLCLZC #feedback #learning #online #ranking #realtime #using: Online learning for recency search ranking using real-time user feedback (TM, LL, WC, CL, ZZ, YC), pp. 1501–1504.
ICML-2009-DiukLL #adaptation #feature model #learning #problem: The adaptive k-meteorologists problem and its application to structure learning and feature selection in reinforcement learning (CD, LL, BRL), pp. 249–256.
ICML-2008-Li #approximate #comparison #difference #linear #worst-case: A worst-case comparison between temporal difference and residual gradient with linear function approximation (LL), pp. 560–567.
ICML-2008-LiLW #framework #learning #self #what: Knows what it knows: a framework for self-aware learning (LL, MLL, TJW), pp. 568–575.
ICML-2008-ParrLTPL #analysis #approximate #feature model #learning #linear #modelling: An analysis of linear models, linear value-function approximation, and feature selection for reinforcement learning (RP, LL, GT, CPW, MLL), pp. 752–759.
ICML-2007-ParrPLL #approximate #generative: Analyzing feature generation for value-function approximation (RP, CPW, LL, MLL), pp. 737–744.
ICML-2006-StrehlLWLL #learning: PAC model-free reinforcement learning (ALS, LL, EW, JL, MLL), pp. 881–888.

Bibliography of Software Language Engineering in Generated Hypertext (BibSLEIGH) is created and maintained by Dr. Vadim Zaytsev.
Hosted as a part of SLEBOK on GitHub.