TTS4ALL: TTS in low resource scenarios: data management, methodology, models, evaluation

The project aims to effectively train and evaluate TTS systems in a situation of scarce training data and complex linguistic contexts. We aim to set up an effective data collection, preparation and evaluation protocols that are adapted to the situation above-mentioned. We will also explore effective strategies for training TTS models for spoken languages without written form or dialects without standardized writing systems. Besides that, we will also address the use of Self-Supervised Learning (SSL) for building TTS and investigate SSL layers in order to find where linguistic content and emotions are encoded. Furthermore, we will benefit from our multidisciplinary and highly-skilled team to build TTS for additional applications that include speech pseudonymization and streaming TTS. Speech pseudonymization is an area lacking existing resources and previous studies. It involves altering the linguistic content of recorded natural speech to protect the speaker’s identity while maintaining the intelligibility of the utterance. This could be particularly useful in scenarios where privacy is a concern, such as in legal or child protection contexts. Streaming TTS is also an emerging topic, which allows for speech generation as symbolic inputs (text or discrete tokens) are provided. This could be particularly useful for integrating TTS with the output of a textual Large Language Model (LLM) or for simultaneous speech translation. Streaming TTS could enable real-time applications where immediate feedback is required, such as in conversational agents or live broadcasting.

Read detailed description of the topic

Group members:

Team Leaders:
Yannick Estève
Fethi Bougares

Senior Members:
Salima Mdhaffar
Meysam Shamsi
Tien Ping Tan
Aghilas Sini
Ana Rosa Montalvo Bereau
Sarah Samson Juan
Gabriel Hernández Sierra
Mickaël Rouvier
Mohammad Mohammadamini
Sankar Mukherjee

Junior Members:
Lilian Rodrigues
Vadrianey Asas
María Lara Gauder Altabas
Martin Bernando Meza
Bengono Obiang Saint-Germes
Lucas Maison
Imen Laouirine
Natacha Miniconi
Kélian Estève
Santiago Rubio

Affiliates:
Marie Tahon