Pedro Cano

CONTENT-BASED AUDIO SEARCH

FROM AUDIO FINGERPRINTING TO SEMANTIC AUDIO RETRIEVAL

Fotogalerie

Pedro Cano

CONTENT-BASED AUDIO SEARCH

FROM AUDIO FINGERPRINTING TO SEMANTIC AUDIO RETRIEVAL

Broschiertes Buch

Jetzt bewerten Jetzt bewerten

Autorenporträt

Andere Kunden interessierten sich auch für

Natividad Martínez Madrid / Ralf E.D. Seepold (ed.)
Intelligent Technical Systems

116,99 €
Yu-Li You
Audio Coding

154,99 €
Speech and Audio Coding for Wireless and Network Applications

161,99 €
Sudeep D. Thepade
Content Based Image Retrieval

53,99 €
Audio compression (data)

27,99 €
Raghavendra Chilamakur
AUDIO/ TEXT TO SIGN LANGUAGE CONVERSION: PYTHON APPROACH

30,99 €
Meenakshi Garg
Design of a Feature Descriptor for Content Based Image Retrieval

57,99 €

Produktbeschreibung

This book is for audio information retrieval
practitioners.
It is about audio content-based search.
Specifically, it is on exploring promising paths for
bridging the semantic gap that currently prevents
wide deployment of audio content-based search
engines. Music search sound engines rely on metadata,
mostly human generated, to manage collections of
audio assets. Even though time-consuming and
error-prone, human labeling is a common practice.
Audio content-based methods, algorithms that
automatically extract description from audio files,
are generally not mature enough to provide the user
friendly representation that users demand when
interacting with audio content. This dissertation has two
parts. In a first part we explore the strengths and
limitation of a pure low-level audio description
technique: audio
fingerprinting.
In the second part, we hypothesize that one of the
problems that hinders the closing the semantic gap is
the lack of intelligence that encodes common sense
knowledge and that such a knowledge base is a primary
step toward bridging
the semantic gap. We present a sound effects
retrieval system which leverages both low-level and
semantic technologies.

Produktdetails

Produktdetails
Verlag: VDM Verlag Dr. Müller
Seitenzahl: 224
Englisch
Abmessung: 220mm x 150mm x 13mm
Gewicht: 315g
ISBN-13: 9783639134186
ISBN-10: 3639134184
Artikelnr.: 26105490

Herstellerkennzeichnung

Produktdetails

Verlag: VDM Verlag Dr. Müller
Seitenzahl: 224
Englisch
Abmessung: 220mm x 150mm x 13mm
Gewicht: 315g
ISBN-13: 9783639134186
ISBN-10: 3639134184
Artikelnr.: 26105490

Herstellerkennzeichnung

Autorenporträt

Scientist, Entrepreneur and Music Technologist. PhD in Computer
Science and Communication, Telecommunication Engineer. Author
more than 45 publications as well as 5 patents in the audio and
music computing area. Assistant professor at UPF. Recipient of an
ICREA Junior scholarship. Pedro is a founder of the firm BMAT,
and currently the CTO.