Subject: continuous time Markov decision processes - LINDAT/CLARIAH-CZ Catalog Search Results

1. First passage risk probability optimality for continuous time Markov decision processes

Creator:: Huo, Haifeng and Wen, Xian
Format:: unmediated and volume
Type:: model:article and TEXT
Subject:: optimal policy, first passage time, continuous time Markov decision processes, and risk probability criterion
Language:: English
Description:: In this paper, we study continuous time Markov decision processes (CTMDPs) with a denumerable state space, a Borel action space, unbounded transition rates and a nonnegative reward function. The optimality criterion considered is the first passage risk probability criterion. To ensure the non-explosion of the state processes, we first introduce a so-called drift condition, which is weaker than the well-known regular condition for semi-Markov decision processes (SMDPs). Furthermore, under some suitable conditions, using a value iteration recursive approximation technique, we establish the optimality equation and obtain the uniqueness of the value function and the existence of optimal policies. Finally, two examples are used to illustrate our results.
Rights:: http://creativecommons.org/publicdomain/mark/1.0/ and policy:public

2. Strong average optimality criterion for continuous-time Markov decision processes

Creator:: Wei, Qingda and Chen, Xian
Format:: unmediated and volume
Type:: model:article and TEXT
Subject:: continuous time Markov decision processes, strong average optimality criterion, finite-horizon expected total cost criterion, unbounded transition rates, optimal policy, and optimal value function
Language:: English
Description:: This paper deals with continuous-time Markov decision processes with unbounded transition rates under the strong average cost criterion. The state and action spaces are Borel spaces, and the costs are allowed to be unbounded from above and from below. Under mild conditions, we first prove that the finite-horizon optimal value function is a solution to the optimality equation for the case of uncountable state spaces and unbounded transition rates, and that there exists an optimal deterministic Markov policy. Then, using the two average optimality inequalities, we show that the set of all strong average optimal policies coincides with the set of all average optimal policies, and thus obtain the existence of strong average optimal policies. Furthermore, employing the technique of skeleton chains of controlled continuous-time Markov chains and the Chapman-Kolmogorov equation, we give a new set of sufficient conditions imposed on the primitive data of the model for the verification of the uniform exponential ergodicity of continuous-time Markov chains governed by stationary policies. Finally, we illustrate our main results with an example.
Rights:: http://creativecommons.org/publicdomain/mark/1.0/ and policy:public
