Vocal double | Algorithmes, Intelligence Artificielle, Interactions et Décision

Par Safia Kedad-Sidhoum , 24 novembre, 2014

The ability to capture and reproduce the voice timber and the speech prosody of a given person has
made impressive progress, e.g., with recent Ircam projects: based on several hours of recording of an
actor reading texts, entirely new synthetic speech of incredible resemblance can be uttered in the
same language; also, using much less training data, voice timber can also be transferred to some
extent to the natural speech of another speaker. In this work, we would like to investigate ways to
push further such systems to create polyglot voice doubles, with various applications. The intern will
have to get acquainted with existing voice analysis and reproduction (either by synthesis or by
transfer) techniques and will try and alleviate the current mono-lingual limitations.

Keywords:
Voice analysis, Prosody, Speech Synthesis.

Lieu

Technicolor - Rennes

Encadrant

Patrick Pérez

Co-encadrant

Alexey Ozerov

Référent universitaire

Safia Kedad-Sidhoum

Fichier descriptif

Internship_proposal_2015_vocal_double.pdf (297.45 Ko)