Now showing items 1-1 of 1
Information theoretic learning methods for Markov decision processes with parametric uncertainty
Markov decision processes (MDPs) model a class of stochastic sequential decision problems with applications in engineering, medicine, and business analytics. There is considerable interest in the literature in MDPs with imperfect information, where the search for well-performing policies faces many challenges. There is no ...