
Simulation-Based Algorithms for Markov Decision Processes: Communications and Control Engineering

Authors: Hyeong Soo Chang, Jiaqiao Hu, Michael C. Fu, Steven I. Marcus
Language: English | Paperback – 7 Mar 2015
Markov decision process (MDP) models are widely used for modeling sequential decision-making problems that arise in engineering, economics, computer science, and the social sciences. Many real-world problems modeled by MDPs have huge state and/or action spaces, giving rise to the curse of dimensionality and making practical solution of the resulting models intractable. In other cases, the system of interest is too complex to allow explicit specification of some of the MDP model parameters, but simulation samples are readily available (e.g., for random transitions and costs). For these settings, various sampling and population-based algorithms have been developed to overcome the difficulties of computing an optimal solution in terms of a policy and/or value function. Specific approaches include adaptive sampling, evolutionary policy iteration, evolutionary random policy search, and model reference adaptive search.
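As a toy illustration of this setting (not an algorithm from the book), the Python sketch below estimates the optimal finite-horizon cost-to-go of a state by recursive Monte Carlo sampling, assuming only a black-box simulator of transitions and costs. The simulator and all parameters are invented for the example, and the uniform sample allocation is a crude stand-in for the adaptive (e.g., upper-confidence-bound) allocations the book develops.

import random

def simulate(state, action):
    # Hypothetical black-box simulator standing in for unknown MDP model
    # parameters: returns one sampled (next_state, cost) pair.
    next_state = random.choice([0, 1])
    cost = abs(state - action) + random.random()
    return next_state, cost

def estimate_value(state, horizon, num_samples=16, actions=(0, 1)):
    # Estimate the optimal cost-to-go V*(state) over `horizon` stages by
    # sampling each action several times and recursing on sampled next states.
    if horizon == 0:
        return 0.0
    best = float("inf")
    for action in actions:
        total = 0.0
        for _ in range(num_samples):
            next_state, cost = simulate(state, action)
            # Use fewer samples deeper in the tree to bound total work.
            total += cost + estimate_value(next_state, horizon - 1, num_samples=2)
        best = min(best, total / num_samples)
    return best

print(estimate_value(state=0, horizon=3))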
This substantially enlarged new edition reflects the latest developments in novel algorithms and their underpinning theories, and presents an updated account of the topics that have emerged since the publication of the first edition. It includes:
innovative material on MDPs, both in constrained settings and with uncertain transition properties;
game-theoretic methods for solving MDPs;
theories for developing rollout-based algorithms; and
details of approximate stochastic annealing, a population-based, on-line, simulation-based algorithm.
The self-contained approach of this book will appeal not only to researchers in MDPs, stochastic modeling and control, and simulation, but will also be a valuable source of instruction and reference for students of control and operations research.
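To make the population-based idea above concrete, here is a similarly hypothetical Python sketch (much simpler than the book's evolutionary policy iteration or model reference adaptive search): a population of candidate policies is sampled and ranked purely by simulated rollout costs, with the toy ring-world simulator and every parameter invented for illustration.

import random

STATES = range(4)
ACTIONS = (0, 1)

def simulate(state, action):
    # Hypothetical simulator: a noisy walk on a 4-state ring in which states
    # near 0 are cheap; the transition model is never written down explicitly.
    step = 1 if action == 1 else -1
    if random.random() < 0.2:
        step = -step  # random disturbance
    next_state = (state + step) % 4
    cost = min(next_state, 4 - next_state)  # ring distance from state 0
    return next_state, cost

def rollout_cost(policy, start=2, horizon=15):
    # Evaluate a deterministic policy by the cost of one simulated trajectory.
    state, total = start, 0.0
    for _ in range(horizon):
        state, cost = simulate(state, policy[state])
        total += cost
    return total

def random_policy_search(population=30, rollouts=20):
    # Sample a population of deterministic policies and keep the one with
    # the best average simulated cost.
    best_policy, best_cost = None, float("inf")
    for _ in range(population):
        policy = {s: random.choice(ACTIONS) for s in STATES}
        avg = sum(rollout_cost(policy) for _ in range(rollouts)) / rollouts
        if avg < best_cost:
            best_policy, best_cost = policy, avg
    return best_policy, best_cost

print(random_policy_search())

The book's evolutionary and model-reference methods iteratively refine such a population rather than sampling it once, but the principle of evaluating policies purely by simulation is the same.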

All formats and editions

Format | Price | Delivery
Paperback (2) | 619.84 lei | 6-8 weeks
  SPRINGER LONDON – 7 Mar 2015 | 619.84 lei | 6-8 weeks
  SPRINGER LONDON – 19 Oct 2010 | 802.41 lei | 6-8 weeks
Hardback (1) | 629.67 lei | 6-8 weeks
  SPRINGER LONDON – 20 Mar 2013 | 629.67 lei | 6-8 weeks

From the series Communications and Control Engineering

Price: 619.84 lei

Old price: 729.21 lei
-15% New

Express points: 930

Estimated price in foreign currency:
€118.64 | $123.65 | £98.76

Print-on-demand title

Economy delivery: 07-21 January 25

Phone orders: 021 569.72.76

Specifications

ISBN-13: 9781447159902
ISBN-10: 144715990X
Pages: 248
Illustrations: XVII, 229 p. 49 illus., 1 illus. in color.
Dimensions: 155 x 235 x 13 mm
Weight: 0.35 kg
Edition: 2nd ed. 2013
Publisher: SPRINGER LONDON
Collection: Springer
Series: Communications and Control Engineering

Place of publication: London, United Kingdom

Contents

Markov Decision Processes.- Multi-stage Adaptive Sampling Algorithms.- Population-based Evolutionary Approaches.- Model Reference Adaptive Search.- On-line Control Methods via Simulation.- Game-theoretic Methods via Simulation.

Reviews

From the book reviews:
“The book consists of five chapters. … This well-written book is addressed to researchers in MDPs and applied modeling with an interest in numerical computation, but the book is also accessible to graduate students in operations research, computer science, and economics. The authors give many pseudocodes of algorithms, numerical examples, convergence analyses of algorithms, and bibliographical notes that can be very helpful for readers to understand the ideas presented in the book and to perform experiments on their own.” (Wiesław Kotarski, zbMATH, Vol. 1293, 2014)

Biographical note

Hyeong Soo Chang (SM’07 of the IEEE, Member of INFORMS) received the B.S. and M.S. degrees in electrical engineering and the Ph.D. degree in electrical and computer engineering, all from Purdue University, West Lafayette, IN, in 1994, 1996, and 2001, respectively. Since 2003, he has been with the Department of Computer Science and Engineering, Sogang University, Seoul, Korea, where he is now an Associate Professor. He has published about 30 journal papers in the area of MDPs and related areas. His main research interests include Markov decision processes, Markov games, computational learning theory, computational intelligence, and stochastic optimization. He currently serves as an Associate Editor for the IEEE Transactions on Automatic Control.
Jiaqiao Hu (M’11 of the IEEE, Member of INFORMS) received the B.S. degree in automation from Shanghai Jiao Tong University, Shanghai, China, in 1997, the M.S. degree in applied mathematics from the University of Maryland, Baltimore County, in 2001, and the Ph.D. degree in electrical engineering from the University of Maryland, College Park, in 2006. Since 2006, he has been with the Department of Applied Mathematics and Statistics, State University of New York, Stony Brook, where he is currently an Assistant Professor. His research interests include Markov decision processes, simulation-based optimization, global optimization, applied probability, and stochastic modeling and analysis.
Michael Fu (Fellow of the IEEE, Member of INFORMS) received his Ph.D. and M.S. degrees in applied mathematics from Harvard University in 1989 and 1986, respectively. He received S.B. and S.M. degrees in electrical engineering and an S.B. degree in mathematics from the Massachusetts Institute of Technology in 1985. Since 1989, he has been at the University of Maryland, College Park, in the College of Business and Management. He was the Simulation Area Editor for Operations Research and is an Associate Editor for Management Science, and has served on the Editorial Boards of the INFORMS Journal on Computing, Production and Operations Management, and IIE Transactions. He was on the program committee for the Spring 1996 INFORMS National Meeting, in charge of contributed papers. In 1995, he received the Maryland Business School's annual Allen J. Krowe Award for Teaching Excellence. He is the co-author (with Jian-Qiang Hu) of the book Conditional Monte Carlo: Gradient Estimation and Optimization Applications (0-7923-9873-4, 1997), which received the 1998 INFORMS College on Simulation Outstanding Publication Award. Other awards include the 1999 IIE Operations Research Division Award and a 1998 IIE Transactions Best Paper Award. In 2002, he received ISR's Outstanding Systems Engineering Faculty Award. He currently serves as director of the National Science Foundation's Operations Research Program. Dr. Fu's research interests lie in the areas of stochastic derivative estimation and simulation optimization of discrete-event systems, particularly with applications to manufacturing systems, inventory control, and the pricing of financial derivatives.
Steven I. Marcus (Fellow of the IEEE, Fellow of SIAM, Member of INFORMS) received his Ph.D. and S.M. from the Massachusetts Institute of Technology in 1975 and 1972, respectively. He received a B.A. from Rice University in 1971. From 1975 to 1991, he was with the Department of Electrical and Computer Engineering at the University of Texas at Austin, where he was the L.B. (Preach) Meaders Professor in Engineering. He was Associate Chairman of the Department during the period 1984-89. In 1991, he joined the University of Maryland, College Park, where he was Director of the Institute for Systems Research until 1996. He is currently a Professor in the Electrical Engineering Department and the Institute for Systems Research. He has served as an Editor of the SIAM Journal on Control and Optimization, and Associate Editor of Mathematics of Control, Signals, and Systems, Journal on Discrete Event Dynamic Systems, and Acta Applicandae Mathematicae. He has authored or co-authored more than 100 articles, conference proceedings, and book chapters. Dr. Marcus's research interests lie in the areas of control and systems engineering, analysis and control of stochastic systems, Markov decision processes, stochastic and adaptive control, learning, fault detection, and discrete event systems, with applications in manufacturing, acoustics, and communication networks.

Back cover text

The Communications and Control Engineering series reports major technological advances which have potential for great impact in the fields of communication and control. It reflects research in industrial and academic institutions around the world so that the readership can exploit new possibilities as they become available.

Features

Rigorous theoretical derivation of sampling and population-based algorithms enables the reader to expand on the work presented in the certainty that new results will have a sound foundation
New chapter on game-theoretic methods for solving Markov decision processes gives the researcher up-to-date information
Presents recently developed on-line methods in constrained and uncertain model settings for the reader to use and adapt in their own research
Includes supplementary material: sn.pub/extras