14 papers:
TACAS-2015-BrazdilCFK #multi #named #synthesis- MultiGain: A Controller Synthesis Tool for MDPs with Multiple Mean-Payoff Objectives (TB, KC, VF, AK), pp. 181–187.
ICML-c2-2014-ScholzLIW #object-oriented- A Physics-Based Model Prior for Object-Oriented MDPs (JS, ML, CLIJ, DW), pp. 1089–1097.
ICML-c2-2014-TamarMX #approximate #robust #scalability #using- Scaling Up Robust MDPs using Function Approximation (AT, SM, HX), pp. 181–189.
RecSys-2014-TavakolB #detection #topic- Factored MDPs for detecting topics of user sessions (MT, UB), pp. 33–40.
CAV-2013-PuggelliLSS #nondeterminism #polynomial #verification- Polynomial-Time Verification of PCTL Properties of MDPs with Convex Uncertainties (AP, WL, ALSV, SAS), pp. 527–542.
ICML-2012-GrunewalderLBPG #modelling- Modelling transition dynamics in MDPs with RKHS embeddings (SG, GL, LB, MP, AG), p. 208.
ICML-2012-MannorMX #nondeterminism #robust- Lightning Does Not Strike Twice: Robust MDPs with Coupled Uncertainty (SM, OM, HX), p. 62.
ICALP-v2-2011-BrazdilBEK #approximate #game studies #probability #termination- Approximating the Termination Value of One-Counter MDPs and Stochastic Games (TB, VB, KE, AK), pp. 332–343.
ICML-2011-ChakrabortyS #learning- Structure Learning in Ergodic Factored MDPs without Knowledge of the Transition Function’s In-Degree (DC, PS), pp. 737–744.
ICML-2010-KrishnamurthyT- Inverse Optimal Control with Linearly-Solvable MDPs (KD, ET), pp. 335–342.
ICML-2009-SzitaL #learning #polynomial- Optimistic initialization and greediness lead to polynomial time learning in factored MDPs (IS, AL), pp. 1001–1008.
ICML-2007-OsentoskiM #learning- Learning state-action basis functions for hierarchical MDPs (SO, SM), pp. 705–712.
ICML-2005-JonssonB #approach #composition- A causal approach to hierarchical decomposition of factored MDPs (AJ, AGB), pp. 401–408.
ICML-2002-GuestrinPS #learning #modelling- Algorithm-Directed Exploration for Model-Based Reinforcement Learning in Factored MDPs (CG, RP, DS), pp. 235–242.