Faculty Publications

A learning algorithm for risk-sensitive cost

Authors

Arnab Basu
Tirthankar Bhattacharyya
Vivek S Borkar

Document Type

Article

Publication Title

Mathematics of Operations Research

Abstract

A linear function approximation-based reinforcement learning algorithm is proposed for Markov decision processes with infinite horizon risk-sensitive cost. Its convergence is proved using the "o.d.e. method" for stochastic approximation. The scheme is also extended to continuous state space processes.

DOI Link

https://doi.org/10.1287/moor.1080.0324

Publication Date

1-4-2008

Publisher

INFORMS

Volume

Vol. 33

Issue

Iss. 4

Recommended Citation

Basu, Arnab; Bhattacharyya, Tirthankar; and Borkar, Vivek S, "A learning algorithm for risk-sensitive cost" (2008). Faculty Publications. 943.
https://research.iimb.ac.in/fac_pubs/943
