BibSLEIGH — mdps stem

Used together with:

factor (5)
learn (4)
function (3)
model (3)
uncertainti (2)

Stem mdps$

14 papers:

TACAS-2015-BrazdilCFK #multi #named #synthesis: MultiGain: A Controller Synthesis Tool for MDPs with Multiple Mean-Payoff Objectives (TB, KC, VF, AK), pp. 181–187.
ICML-c2-2014-ScholzLIW #object-oriented: A Physics-Based Model Prior for Object-Oriented MDPs (JS, ML, CLIJ, DW), pp. 1089–1097.
ICML-c2-2014-TamarMX #approximate #robust #scalability #using: Scaling Up Robust MDPs using Function Approximation (AT, SM, HX), pp. 181–189.
RecSys-2014-TavakolB #detection #topic: Factored MDPs for detecting topics of user sessions (MT, UB), pp. 33–40.
CAV-2013-PuggelliLSS #nondeterminism #polynomial #verification: Polynomial-Time Verification of PCTL Properties of MDPs with Convex Uncertainties (AP, WL, ALSV, SAS), pp. 527–542.
ICML-2012-GrunewalderLBPG #modelling: Modelling transition dynamics in MDPs with RKHS embeddings (SG, GL, LB, MP, AG), p. 208.
ICML-2012-MannorMX #nondeterminism #robust: Lightning Does Not Strike Twice: Robust MDPs with Coupled Uncertainty (SM, OM, HX), p. 62.
ICALP-v2-2011-BrazdilBEK #approximate #game studies #probability #termination: Approximating the Termination Value of One-Counter MDPs and Stochastic Games (TB, VB, KE, AK), pp. 332–343.
ICML-2011-ChakrabortyS #learning: Structure Learning in Ergodic Factored MDPs without Knowledge of the Transition Function’s In-Degree (DC, PS), pp. 737–744.
ICML-2010-KrishnamurthyT: Inverse Optimal Control with Linearly-Solvable MDPs (KD, ET), pp. 335–342.
ICML-2009-SzitaL #learning #polynomial: Optimistic initialization and greediness lead to polynomial time learning in factored MDPs (IS, AL), pp. 1001–1008.
ICML-2007-OsentoskiM #learning: Learning state-action basis functions for hierarchical MDPs (SO, SM), pp. 705–712.
ICML-2005-JonssonB #approach #composition: A causal approach to hierarchical decomposition of factored MDPs (AJ, AGB), pp. 401–408.
ICML-2002-GuestrinPS #learning #modelling: Algorithm-Directed Exploration for Model-Based Reinforcement Learning in Factored MDPs (CG, RP, DS), pp. 235–242.