Ahots sintetiko pertsonalizatuakesperientzia baten deskribapena

  1. Hernáez Rioja, Inmaculada 1
  2. Navas Cordón, Eva 2
  3. Saratxaga Couceiro, Ibon 2
  4. Sanchez de la Fuente, Jon 2
  1. 1 Universidad del País Vasco/Euskal Herriko Unibertsitatea
    info

    Universidad del País Vasco/Euskal Herriko Unibertsitatea

    Lejona, España

    ROR https://ror.org/000xsnr85

  2. 2 University of the Basque Country (UPV/EHU)
Journal:
Ekaia: Euskal Herriko Unibertsitateko zientzi eta teknologi aldizkaria

ISSN: 0214-9001

Year of publication: 2021

Issue Title: Garapen Iraunkorra

Issue: 1

Pages: 173-194

Type: Article

DOI: 10.1387/EKAIA.22077 DIALNET GOOGLE SCHOLAR lock_openOpen access editor

More publications in: Ekaia: Euskal Herriko Unibertsitateko zientzi eta teknologi aldizkaria

Abstract

The voice is so essential for human communication that its loss drastically affects the integration of people in society. Text-to-speech can provide a synthetic voice for people with oral disabilities. The most common solutions usually provide a standard voice, and users have difficulties to identify themselves with it. For this reason, we need to create personalized synthetic voices and offer a catalogue of voices to people with oral disabilities so that they can choose one that suits their needs. The objective of the ZureTTS project is to provide these personalized voices, both in Spanish and in Basque. Through the AhoMyTTS web portal, people who are going to lose their voice or altruistic people who want to provide voices to those who do not have it, record 100 carefully se-lected sentences. A synthetic voice with similar characteristics to the voice of the recording is generated by applying an adaptation process. The user is provided with a synthesis engine along with that personalized voice, so that they can use it in applications that require oral message generation. In addition, we offer a catalogue of voices to choose from if one is no longer able to record. More than 1,200 people have used the system to obtain a personalized voice and 58 of them have been selected to be included in the cata-logue. User surveys show user satisfaction with various aspects of the synthetic voice: most think that the synthetic voice is similar to the original, pleasant and clear, although a bit robotic. This work contributes mainly to goal 10 for sustainable development by re-ducing inequality within and among countries. It also contributes to goal 4 for sustainable development, providing tools that facilitate access for all to an inclusive, equitable and quality education.