24,99 €
inkl. MwSt.
Versandkostenfrei*
Versandfertig in 1-2 Wochen
payback
12 °P sammeln
  • Broschiertes Buch

This book explores the capabilities of convolutional neural networks to deal with a task that is easily manageable for humans: perceiving 3D pose of a human body from varying angles. However, in our approach, we are restricted to using a monocular vision system. For this purpose, a convolutional neural network approach is applied on RGB videos and is extended to three dimensional convolutions. This is done via encoding the time dimension in videos as the third dimension in convolutional space, and directly regressing to human body joint positions in 3D coordinate space. This research shows the…mehr

Produktbeschreibung
This book explores the capabilities of convolutional neural networks to deal with a task that is easily manageable for humans: perceiving 3D pose of a human body from varying angles. However, in our approach, we are restricted to using a monocular vision system. For this purpose, a convolutional neural network approach is applied on RGB videos and is extended to three dimensional convolutions. This is done via encoding the time dimension in videos as the third dimension in convolutional space, and directly regressing to human body joint positions in 3D coordinate space. This research shows the ability of such a network to achieve state-of-the-art performance on the selected Human3.6M dataset, thus demonstrating the possibility of successfully representing temporal data with an additional dimension in the convolutional operation.
Autorenporträt
Agne Grinciunaite hat einen B.S. in Statistik von der Vilniaus Universitetas und einen M.Sc. in IT von der Vilniaus Gedimino Technikos Universitetas und der Universitat Politècnica de Catalunya. Agne verfügt über vielfältige Erfahrungen in den Bereichen IT-Beratung und Software-Engineering mit Schwerpunkt auf Anwendungen des maschinellen Lernens für Big Data.