Search and access to information contained in the speech of multimedia resources

Amparo Varona Fernández; Luis Javier Rodríguez Fuentes; Mikel Peñagarikano Badiola; Silvia Nieto Nieto; Mireia Díez Sánchez; Germán Bordel García

Search and access to information contained in the speech of multimedia resources

Revista:

Procesamiento del lenguaje natural

ISSN: 1135-5948

Año de publicación: 2010

Número: 45

Páginas: 317-318

Tipo: Artículo

DIALNET GOOGLE SCHOLAR RUA editor

Otras publicaciones en: Procesamiento del lenguaje natural

Resumen

The main goal of this project is to make scientific contributions and technological improvements related to the spoken document retrieval system (Hearch) developed by the Working Group on Software Technologies of the University of the Basque Country. Hearch looks like a conventional search tool (such as Google, Bing, etc.) but it is designed to retrieve audio/video segments based on the automatic transcription of speech contents. The system consists of a back-end that captures, processes and indexes audio/video resources, and a front-end that allows to search contents, configure various modules and display performance statistics through a web interface. An early version of this tool is available (http://gtts.ehu.es/Hearch/), which searches and retrieves segments on broadcast news repositories in Spanish and Basque, through it can also deal with resources in English.

Fuente de los datos: Dialnet