How to generate speech signal from spectral envelope, aperiodicity, fundamental frequency, V/UV signal

I have implemented the network shown in the figure, which takes two inputs: a video input and an MFCC (audio) input. The video input consists of lip images, and the audio input is the MFCC of the corresponding video frame. The video and MFCC frames are passed through several layers and then added to generate speech parameters. From these I have obtained the fundamental frequency, spectral envelope, V/UV (voiced/unvoiced) signal, and aperiodicity. I took the IFFT of the spectral envelope to generate sound, but the result is a random signal.
Please advise how to generate a speech signal from these speech parameters.
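
For context on why the IFFT alone does not work: a spectral envelope is only a per-frame magnitude shape, with no excitation and no phase, so inverting it directly cannot give speech. Vocoders that use this parameter set (the WORLD vocoder is one well-known example) drive the envelope with an excitation built from F0, the V/UV decision, and aperiodicity. Below is a minimal source-filter sketch in Python/NumPy, not a reference implementation. Everything in it is an assumption for illustration: the function name `synthesize`, the 16 kHz sample rate, the 80-sample hop, a power spectral envelope `sp` of shape (frames, n_fft/2 + 1), an aperiodicity `ap` in [0, 1] of the same shape, zero-phase filtering, and the rule for mixing pulses and noise.

```python
import numpy as np

def synthesize(f0, vuv, sp, ap, fs=16000, hop=80, seed=0):
    """Hypothetical source-filter sketch (not WORLD's actual algorithm).

    f0  : (T,) fundamental frequency in Hz
    vuv : (T,) 1 for voiced frames, 0 for unvoiced
    sp  : (T, K) power spectral envelope, K = n_fft // 2 + 1 (assumed layout)
    ap  : (T, K) aperiodicity in [0, 1] (assumed amplitude ratio)
    """
    f0, vuv = np.asarray(f0, float), np.asarray(vuv)
    sp, ap = np.asarray(sp, float), np.asarray(ap, float)
    n_frames, n_bins = sp.shape
    n_fft = 2 * (n_bins - 1)
    n_samples = n_frames * hop

    # Hold each frame's F0 for `hop` samples; unvoiced frames get F0 = 0.
    f0_s = np.repeat(np.where(vuv > 0, f0, 0.0), hop)

    # Periodic excitation: one pulse each time the accumulated phase wraps.
    phase = np.cumsum(f0_s / fs)
    wraps = np.flatnonzero(np.diff(np.floor(phase)) > 0) + 1
    pulses = np.zeros(n_samples)
    pulses[wraps] = np.sqrt(fs / f0_s[wraps])  # roughly unit power

    # Aperiodic excitation: white noise.
    noise = np.random.default_rng(seed).standard_normal(n_samples)

    # Zero-pad so the last frames can still read n_fft samples.
    pulses = np.pad(pulses, (0, n_fft))
    noise = np.pad(noise, (0, n_fft))

    win = np.hanning(n_fft)
    out = np.zeros(n_samples + n_fft)
    for t in range(n_frames):
        s = t * hop
        # Unvoiced frames are treated as fully aperiodic.
        a = np.clip(ap[t], 0.0, 1.0) if vuv[t] > 0 else np.ones(n_bins)
        amp = np.sqrt(sp[t])  # power envelope -> amplitude
        P = np.fft.rfft(pulses[s:s + n_fft] * win) * amp * np.sqrt(1.0 - a**2)
        N = np.fft.rfft(noise[s:s + n_fft] * win) * amp * a
        out[s:s + n_fft] += np.fft.irfft(P + N, n_fft)  # overlap-add

    out *= hop / win.sum()  # compensate for the summed window overlap
    return out[:n_samples]
```

If the network's outputs use a different layout (for example a log or mel-compressed envelope, or band aperiodicity), they would need to be converted to linear-frequency bins first. The same steps map onto MATLAB's `fft` and `ifft` with the conjugate-symmetric half of the spectrum rebuilt by hand.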

Answers (0)
