Cantitate/Preț
Produs

Reinforcement Learning from Scarce Experience viaPolicy Search

Autor Leonid Peshkin
en Limba Engleză Paperback – 26 oct 2013
Today we live in the world which is very much a man-made or artificial. In such a world there are many systems and environments, both real and virtual, which can be very well described by formal models. This creates an opportunity for developing a "synthetic intelligence" - artificial systems which cohabit these environments with human beings and carry out some useful function. In this book we address some aspects of this development in the framework of reinforcement learning, learning how to map sensations to actions, by trial and error from feedback. In some challenging cases, actions may affect not only the immediate reward, but also the next sensation and all subsequent rewards. The general task of reinforcement learning stated in a traditional way is unreasonably ambitious for these two characteristics: search by trial-and-error and delayed reward. We investigate general ways of breaking the task of designing a controller down to more feasible sub-tasks which are solved independently. We propose to consider both taking advantage of past experience by reusing parts of other systems, and facilitating the learning phase by employing a bias in initial configuration.
Citește tot Restrânge

Preț: 38967 lei

Nou

Puncte Express: 585

Preț estimativ în valută:
7457 7795$ 6301£

Carte tipărită la comandă

Livrare economică 06-20 martie

Preluare comenzi: 021 569.72.76

Specificații

ISBN-13: 9783639088038
ISBN-10: 3639088034
Pagini: 140
Dimensiuni: 150 x 220 x 8 mm
Greutate: 0.2 kg
Editura: VDM Verlag Dr. Müller e.K.

Notă biografică

Dr. Peshkin currently works at the Department of Systems Biology, Harvard Medical School. Previously Dr. Peshkin was a visiting researcher at AI Lab of MIT and research fellow at Harvard Faculty of Applied Sciences. Leonid earned his PhD in Computer Science from Brown University in 2002. He grew up and went through college in Moscow, Russia.