48,99 €
inkl. MwSt.
Versandkostenfrei*
Versandfertig in 1-2 Wochen
payback
24 °P sammeln
  • Broschiertes Buch

This monograph introduces various value-based approaches for solving the policy evaluation problem in the online reinforcement learning (RL) scenario, which aims to learn the value function associated with a specific policy under a single Markov decision process (MDP). Approaches vary depending on whether they are implemented in an on-policy or off-policy manner. In on-policy settings, where the evaluation of the policy is conducted using data generated from the same policy that is being assessed, classical techniques such as TD(0), TD(¿), and their extensions with function approximation or…mehr

Produktbeschreibung
This monograph introduces various value-based approaches for solving the policy evaluation problem in the online reinforcement learning (RL) scenario, which aims to learn the value function associated with a specific policy under a single Markov decision process (MDP). Approaches vary depending on whether they are implemented in an on-policy or off-policy manner. In on-policy settings, where the evaluation of the policy is conducted using data generated from the same policy that is being assessed, classical techniques such as TD(0), TD(¿), and their extensions with function approximation or variance reduction are employed in this setting. For off-policy evaluation, where samples are collected under a different behavior policy, this monograph introduces gradient-based two-timescale algorithms like GTD2, TDC, and variance-reduced TDC. These algorithms are designed to minimize the mean-squared projected Bellman error (MSPBE) as the objective function. This monograph also discusses their finite-sample convergence upper bounds and sample complexity.
Hinweis: Dieser Artikel kann nur an eine deutsche Lieferadresse ausgeliefert werden.
Autorenporträt
Yi Zhou is a thought leader in the fusion of healthcare and artificial intelligence (AI), gaining global recognition for his expertise in Medical AI. As an accomplished Chief Technology Officer (CTO) and Chief Information Officer (CIO), Yi has played transformative roles at leading organizations like GE Healthcare and Quest Diagnostics. Notably, his trailblazing work in formulating the GE Healthcare AI Standard and Playbook set groundbreaking industry standards. Additionally, Yi was instrumental in launching the world's first AI-driven, FDA-approved X-ray and MRI devices. His achievements earned him a finalist for the "CIO of the Year" 2023 Seattle ORBIE Award. Yi holds dual master's degrees and has authored numerous influential publications including AI book and articles, software architecture book, and life sciences papers. His journey epitomizes visionary leadership and deep expertise in the AI field.