Video Grounding and Its Generalization

From I.D. and Task-specific Models to O.O.D. and Large Foundation Models

Fotogalerie

Xin Wang, Xiaohan Lan, Wenwu Zhu

Video Grounding and Its Generalization

From I.D. and Task-specific Models to O.O.D. and Large Foundation Models

Gebundenes Buch

Jetzt bewerten Jetzt bewerten

Weitere Ausgabe:
eBook, PDF

Inhaltsangabe

Andere Kunden interessierten sich auch für

Mitosis Domain Generalization and Diabetic Retinopathy Analysis

49,99 €
Advances in Mobile Computing and Multimedia Intelligence

45,99 €
Advances in Mobile Computing and Multimedia Intelligence

42,99 €
Computer Vision - ACCV 2024 Workshops

50,99 €
Hari Kalva
Delivering MPEG-4 Based Audio-Visual Services

77,99 €
Computer Vision - ACCV 2024 Workshops

50,99 €
Hua Harry Li / Shan Sun / Haluk Derin (Hgg.)
Video Data Compression for Multimedia Computing

153,99 €

Produktbeschreibung

This book consists of two parts: Part I Methodologies for Video Grounding and Part II Generalized Video Grounding and Trending Directions. To make this book self-contained and cutting edge, Part I will cover basic and advanced methodologies for Video Grounding, discussing key comparisons with several representative Vision-Language learning tasks including multimodal understanding and generation. Part II will cover our insights for Generalized Video Grounding and the development of Video Grounding in the era of large foundation models, discussing future directions such as Out-of-Distribution settings which deserve further investigations.

Discussions on Video Grounding will cover both the task of Video Grounding and other Vision-Language Task, as well as their relations. The basics and advances will touch Video Grounding from model to benchmark, from supervised learning to unsupervised pre-training, from single video grounding to video corpus grounding, and from in-distribution setting to out-of-distribution setting. As for Generalized Video Grounding, we discuss cross-modal grounding, event grounding for multi-modal tasks, various distribution shifts in out-of-distribution setting, explainable Video Grounding, and large foundation model for Video Grounding.

We deeply hope this book can benefit interested readers from both academy and industry, covering needs from junior starters in research to senior practitioners in IT companies.

Produktdetails

Produktdetails
Verlag: Springer / Springer Nature Switzerland / Springer, Berlin
Artikelnr. des Verlages: 978-3-031-94836-7
Seitenzahl: 180
Erscheinungstermin: August 2025
Englisch
Abmessung: 235mm x 155mm
ISBN-13: 9783031948367
Artikelnr.: 74212011

Herstellerkennzeichnung

Produktdetails

Verlag: Springer / Springer Nature Switzerland / Springer, Berlin
Artikelnr. des Verlages: 978-3-031-94836-7
Seitenzahl: 180
Erscheinungstermin: August 2025
Englisch
Abmessung: 235mm x 155mm
ISBN-13: 9783031948367
Artikelnr.: 74212011

Herstellerkennzeichnung

Inhaltsangabe

Preface.- Introduction.- Traditional Temporal Sentence Grounding in Videos.- Generalized Video Grounding.- Future Research Directions.- References.

Inhaltsangabe

Preface.- Introduction.- Traditional Temporal Sentence Grounding in Videos.- Generalized Video Grounding.- Future Research Directions.- References.